Detailed Explanation of RAG 2.0 Architecture: Building End-to-End Retrieval-Augmented Generation Systems

There have been many articles about Retrieval-Augmented Generation (RAG). If we could create a trainable retriever, or if the entire RAG could be customized like fine-tuning a large language model (LLM), we would definitely achieve better results. However, the current … Read more

RAG Demo in One Week, But Takes Six Months to Launch – Solutions

Many practitioners have found that although a RAG demo can be built quickly, RAG faces numerous challenges in actual production environments. This article analyzes the core issue of RAG’s industrial implementation, problem grading, from the perspective of entrepreneurs in the AI large model field, and discusses the challenges and solutions of four types of … Read more

Summary of Baichuan Intelligent RAG Approach: The Journey of the Baichuan Intelligent Model RAG

Happy New Year, everyone! Today, I will interpret Baichuan’s RAG approach. Baichuan Intelligent has a profound background in search; let’s see how they navigated the pitfalls of RAG! In general, Baichuan combines a long-context model (192k) with search enhancement methods to address knowledge updates and reduce model hallucinations, achieving 95% accuracy on a dataset … Read more

Injecting Knowledge Graphs at Different RAG Stages

Reprinted from WeChat Official Account | Blue’s Little Firefly. In this article, I would like to outline precisely where Knowledge Graphs (KG) can be applied in the RAG pipeline. We will explore the different types of problems that arise in the RAG pipeline and how to address these issues by applying knowledge graphs at various … Read more

Chunk Segmentation Based on Semantics in RAG

In RAG, after reading the files, the main task is to split the data into smaller chunks and then embed these features to express their semantics. The location of this process in RAG is shown in the figure below. The most common chunking method is rule-based, using techniques such as fixed chunk sizes or overlapping … Read more

Analysis of Key Modules in RAG Full Link

Original: https://zhuanlan.zhihu.com/p/682253496 (compiled by Qingke AI). 1. Background Introduction: The RAG (Retrieval-Augmented Generation) method combines retrieval-based models with generative models to improve the quality and relevance of generated text. This method was proposed by Meta in the … Read more

From Traditional RAG to Graph RAG – When Large Models Meet Knowledge Graphs

Abstract: The transition from traditional RAG to Graph RAG enhances large language models by integrating knowledge graphs, enabling them to provide more detailed and accurate responses to complex queries. The effectiveness of Graph RAG also depends on the quality and breadth of the underlying knowledge graph and the engineering aspects of RAG. Main Points: – … Read more

Recommendation Systems From RAG Perspective: Opportunities and Challenges

Wang Haofen, Tongji University, “Hundred Talents Program” Distinguished Researcher, PhD Supervisor. Personal Introduction: Wang Haofen is a distinguished researcher and PhD supervisor in the “Hundred Talents Program” at Tongji University. He has long served as CTO at frontline artificial intelligence companies. He is one of the initiators of OpenKG, the world’s largest Chinese open … Read more

RAT: Retrieval Augmented Thoughts for Context-Aware Reasoning

This article primarily explains the paper “RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation”[1]. Currently, there are some relevant introductions available online, but most only discuss the ideas and mainly rely on GPT translations, which can be quite awkward and do not provide a detailed understanding of all principles. Therefore, a detailed description … Read more

17 Essential Tips for Understanding RAG

Recently, while writing articles, I wanted to fill in some gaps left by last year’s RAG (Retrieval-Augmented Generation) and hope to share some tips to help everyone with RAG. As the old saying goes: Building a prototype of a large model is easy, but turning it into a product that can actually be put into … Read more