Retrieval-Augmented Generation Archives

Latest Overview of Agentic RAG Full-Stack Technology

2025-07-06 by AI Agent

The RAG technology will not disappear in 2025; instead, it will be more widely and deeply applied with the new paradigm Agentic RAG. Here is the fresh overview of the full-stack technology of Agentic RAG, just released in 2025: A comprehensive review of the development history of RAG, from the initial Naïve RAG to Advanced … Read more

Advanced RAG: Enhancing RAG Performance

2025-04-23 by AI Agent

Author: Luv Bansal Translation: wwl Proofreading: Zhang Yiran This article is approximately 4400 words long and is recommended for a reading time of over 10 minutes. This article discusses various techniques for optimizing different parts of the RAG pipeline and enhancing the overall RAG workflow. Image generated by the author using Dalle-3 provided by Bing … Read more

Latest Overview of RAG: Review of 15 Classic RAG Frameworks (Part 2)

2025-04-22 by AI Agent

Source: Deep Graph Learning and Large Model LLM This article is approximately 3500 words long and is recommended for a 9-minute read. It delves into the development of Retrieval-Augmented Generation (RAG), from basic concepts to the latest technologies. 4. Overview of Existing RAG Frameworks Agent-Based RAG A new agent-based Retrieval-Augmented Generation (RAG) framework employs a … Read more

Comprehensive Introduction to Large Models and RAG

2025-04-22 by AI Agent

This article is about 11,000 words long and is recommended to be read in 6 minutes. This article introduces large models + RAG. 1 Introduction Large Language Models (LLMs) have limitations when handling domain-specific or highly specialized queries, such as generating inaccurate information or “hallucinations.” A promising approach to mitigate these limitations is Retrieval-Augmented Generation … Read more

Building a Minimal RAG Model Step by Step

2025-03-29 by AI Agent

Datawhale Insights Author: Song Zhixue, Datawhale Member Hello everyone, I am not a fan of garlic and ginger. Next, I will guide you step by step to implement a simple RAG model, which is a simplified version of RAG, called Tiny-RAG. Tiny-RAG is a simplified version of RAG that only includes the core functions of … Read more

Impact of Irrelevant Inputs on LLMs in RAG Systems

2025-03-29 by AI Agent

Introduction Hello everyone, I am Liu Cong from NLP. RAG (Retrieval-Augmented Generation) finds information fragments relevant to user questions through a retrieval system, utilizing large models to synthesize an answer. This greatly addresses issues such as hallucination and outdated information in large models, and has become an important means for the practical application of large … Read more

Combining RAG and LLMs: A Review of Retrieval-Augmented Large Language Models

2025-03-29 by AI Agent

As one of the most advanced technologies in artificial intelligence, Retrieval-Augmented Generation (RAG) technology can provide reliable and up-to-date external knowledge, offering great convenience for numerous tasks. Especially in the era of AI-Generated Content (AIGC), RAG’s powerful retrieval capabilities in providing additional knowledge enable it to assist existing generative AI in producing high-quality outputs. Recently, … Read more

Analysis of Key Modules in RAG Full Link

2025-03-29 by AI Agent

Original: https://zhuanlan.zhihu.com/p/682253496 Compiled by: Qingke AI Leave a message in the backend ‘ Exchange ‘, Join the NewBee discussion group 1. Background Introduction RAG (Retrieval Augmented Generation) method refers to a combination of retrieval-based models and generative models to improve the quality and relevance of generated text. This method was proposed by Meta in the … Read more

Key Module Analysis of RAG Full Link

2025-03-29 by AI Agent

Original: https://zhuanlan.zhihu.com/p/682253496 Organizer: Qingke AI 1. Background Introduction The RAG (Retrieval Augmented Generation) method refers to a combination of retrieval-based models and generative models to improve the quality and relevance of generated text. This method was proposed by Meta in the 2020 paper “Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks”[1], allowing language models (LM) to acquire … Read more

Overview of LlamaIndex Components

2025-03-26 by AI Agent

Introduction This article provides an overall introduction to the LlamaIndex framework, including its functions, components, and explanations. LlamaIndex is an open-source LLM application development framework built on large models (including Agents and Workflows) to create context-enhanced generative AI applications. Components of LlamaIndex The following diagram illustrates the overall functional structure of LlamaIndex, abstracted from the … Read more