Overview of Deep Learning: Chapter One

Overview of Deep Learning: Chapter One

Deep learning has become very popular, but many people still have different understandings of it. The first chapter analyzes and compares the characteristics of deep learning with other concepts such as machine learning and AI, making it worth a read. Read the original text to download the latest version 0.2 PDF. Join the learning discussion: … Read more

Youdao Cloud Note Supports OCR Recognition: Convert Images to Text, PDF to Word

Youdao Cloud Note Supports OCR Recognition: Convert Images to Text, PDF to Word

If you are not familiar with what the new OCR feature can do for you, you must have encountered the following situations, and the OCR feature is the “magic remedy” for these “issues”. In work or research studies, dealing with a large number of paper documents, magazines, and PDF electronic materials makes it difficult to … Read more

RAG Series 07: Building Indexes and Using Large Models for QA with PDF Tables

RAG Series 07: Building Indexes and Using Large Models for QA with PDF Tables

Effectively parsing and understanding tables in unstructured documents remains a significant challenge when designing RAG solutions. This is especially difficult in cases where tables exist in image formats, such as scanned documents. These challenges include several aspects: The complexity of scanned or image documents, such as diverse structures, the presence of non-text elements, and the … Read more

Advanced LlamaIndex Documentation: Using LlamaParse for Complex PDFs

Advanced LlamaIndex Documentation: Using LlamaParse for Complex PDFs

In the previous article, we primarily used SimpleDirectoryReader to process documents. Today, we will explore how to use LlamaParse to handle more complex PDF documents, achieving more accurate document parsing and information extraction. 1. Introduction to LlamaParse LlamaParse is a tool specifically designed for handling complex documents, with the following features: Accurate layout recognition Table … Read more

Building a PDF Q&A Application Using OLLama and LangChain

Building a PDF Q&A Application Using OLLama and LangChain

Yesterday, I wrote about How OLLama Builds a Knowledge Base for Q&A Using Existing Documents? OLLama + LangChain to Assist You. There was a streamlit related to the graphical interface for the OLLama knowledge base Q&A that I didn’t write about because the code didn’t run, so today I will supplement that part. Finally, there … Read more

Using LlamaIndex to Create Custom Agent Functions

Using LlamaIndex to Create Custom Agent Functions

Overview This article introduces how to use LlamaIndex to write your own Agent handling functions. Note that this article uses a locally deployed LLM supported by Ollama for practical implementation, rather than remotely calling the OpenAI API. The goal of this article is to save the output content to a PDF file and then stop … Read more

Using LlamaIndex Agent to Call Multiple Tool Functions

Using LlamaIndex Agent to Call Multiple Tool Functions

Overview This article introduces how to use LlamaIndex’s Agent to call multiple custom Agent tool functions. As with the previous articles in this series, this article does not use the OpenAI API and relies entirely on a local large model to complete the entire functionality. The goal of this article is simple: to save the … Read more

How to Handle Table Data in RAG Knowledge Base Documents?

How to Handle Table Data in RAG Knowledge Base Documents?

In developing the RAG system, the data formats in the knowledge base can be diverse, and most of them are unstructured data content. For example, PDF documents in the knowledge base are likely to contain table data, and our approach to handling this needs special attention to ensure that the table information can be correctly … Read more

Using Claude API to Translate English PDF Books

Using Claude API to Translate English PDF Books

Reporting on the previous use of the Cline and openrouter APIs to translate English PDF books. Completed last Christmas (last month). However, I was unable to publish the article due to a severe cold. Below are screenshots of the program running. According to the previous article, it would roughly cost $4, but the result on … Read more