How to Build Future-Oriented AI Agents Using Edge Large Models?

In recent years, general-purpose artificial intelligence technology has developed rapidly and has become an important driving force behind the digital upgrade and growing intelligence of the automotive industry. Many companies have built cloud-based large models to support iterative updates and new applications of vehicle-side functions. However, …

How Mianbi Intelligent Surpasses Large Models with MiniCPM

Cost is the invisible competitive advantage of large models.
Author: Liu Yangnan | Editor: Zhao Jian
Today, the Tsinghua University-affiliated large model company “Mianbi Intelligent” released its first flagship large model, “Mianbi MiniCPM”, nicknamed “Little Cannon”. According to Mianbi Intelligent’s co-founder and CEO Li Dahai, Mianbi MiniCPM has 2B parameters, using …

EdgeBERT: Extreme Compression, 13 Times Lighter Than ALBERT!

Reprinted by Machine Heart
Source: Xixiaoyao’s Cute Selling House | Author: Sheryc_Wang Su
There are two kinds of highly challenging engineering projects in this world. The first is to maximize something very ordinary, such as scaling up a language model until it can write poetry, prose, and code like GPT-3. The other is exactly the opposite: to minimize something very …

Goodbye Large Models: MiniRAG for Efficient Knowledge Retrieval

Today, I will share a retrieval-augmented generation method designed for resource-constrained scenarios: MiniRAG.
Paper link: https://arxiv.org/pdf/2501.06713
Code link: https://github.com/HKUDS/MiniRAG
Introduction
With the rapid development of retrieval-augmented generation (RAG) technology, the performance of language models on knowledge retrieval and generation tasks has improved significantly. However, existing methods rely heavily on large language models (LLMs), leading to …

Generative AI Inference Technology, Market, and Future

The successive releases of OpenAI o1, QwQ-32B-Preview, and DeepSeek R1-Lite-Preview signal that generative AI research is shifting its focus from pre-training to inference in order to strengthen logical reasoning capabilities. This shift will greatly promote the development of upper-layer applications. Sequoia Capital recently pointed out that, in the foreseeable future, logical reasoning and inference-time computation will be an important theme, ushering in …

A Brief Introduction to AI Agents

1. Definition
An AI Agent is a software or hardware entity capable of perceiving its environment through sensors and acting on it through actuators. It possesses autonomy, reactivity, proactiveness, and the ability to learn.
2. Core Features
Autonomy: Able to operate and make decisions without human intervention.
Reactivity: Capable of perceiving environmental changes and responding in real time.
Proactiveness: Not …