PDF Translation Archives

Mathematical Formula OCR Recognition

2025-05-02 by AI Agent

Introduction Many researchers often need to input a bunch of theorem formulas when writing papers. Some formulas are exactly as they appear in books or literature, and typing them one by one into Word or LaTeX can be very tiring. If there were software that could recognize formulas, it would definitely improve efficiency. Today, I … Read more

Overview of Deep Learning: Chapter One

2025-04-11 by AI Agent

Deep learning has become very popular, but many people still have different understandings of it. The first chapter analyzes and compares the characteristics of deep learning with other concepts such as machine learning and AI, making it worth a read. Read the original text to download the latest version 0.2 PDF. Join the learning discussion: … Read more

Youdao Cloud Note Supports OCR Recognition: Convert Images to Text, PDF to Word

2025-04-02 by AI Agent

If you are not familiar with what the new OCR feature can do for you, you must have encountered the following situations, and the OCR feature is the “magic remedy” for these “issues”. In work or research studies, dealing with a large number of paper documents, magazines, and PDF electronic materials makes it difficult to … Read more

RAG Series 07: Building Indexes and Using Large Models for QA with PDF Tables

2025-03-29 by AI Agent

Effectively parsing and understanding tables in unstructured documents remains a significant challenge when designing RAG solutions. This is especially difficult in cases where tables exist in image formats, such as scanned documents. These challenges include several aspects: The complexity of scanned or image documents, such as diverse structures, the presence of non-text elements, and the … Read more

Advanced LlamaIndex Documentation: Using LlamaParse for Complex PDFs

2025-03-25 by AI Agent

In the previous article, we primarily used SimpleDirectoryReader to process documents. Today, we will explore how to use LlamaParse to handle more complex PDF documents, achieving more accurate document parsing and information extraction. 1. Introduction to LlamaParse LlamaParse is a tool specifically designed for handling complex documents, with the following features: Accurate layout recognition Table … Read more

Building a PDF Q&A Application Using OLLama and LangChain

2025-03-22 by AI Agent

Yesterday, I wrote about How OLLama Builds a Knowledge Base for Q&A Using Existing Documents? OLLama + LangChain to Assist You. There was a streamlit related to the graphical interface for the OLLama knowledge base Q&A that I didn’t write about because the code didn’t run, so today I will supplement that part. Finally, there … Read more

Using LlamaIndex to Create Custom Agent Functions

2025-03-05 by AI Agent

Overview This article introduces how to use LlamaIndex to write your own Agent handling functions. Note that this article uses a locally deployed LLM supported by Ollama for practical implementation, rather than remotely calling the OpenAI API. The goal of this article is to save the output content to a PDF file and then stop … Read more

Using LlamaIndex Agent to Call Multiple Tool Functions

2025-03-05 by AI Agent

Overview This article introduces how to use LlamaIndex’s Agent to call multiple custom Agent tool functions. As with the previous articles in this series, this article does not use the OpenAI API and relies entirely on a local large model to complete the entire functionality. The goal of this article is simple: to save the … Read more

How to Handle Table Data in RAG Knowledge Base Documents?

2025-03-02 by AI Agent

In developing the RAG system, the data formats in the knowledge base can be diverse, and most of them are unstructured data content. For example, PDF documents in the knowledge base are likely to contain table data, and our approach to handling this needs special attention to ensure that the table information can be correctly … Read more

Using Claude API to Translate English PDF Books

2025-01-22 by AI Agent

Reporting on the previous use of the Cline and openrouter APIs to translate English PDF books. Completed last Christmas (last month). However, I was unable to publish the article due to a severe cold. Below are screenshots of the program running. According to the previous article, it would roughly cost $4, but the result on … Read more