Practical Guide: Using OpenCV and PyTesseract for Document OCR

As organizations around the world seek to digitize their operations, converting physical documents into digital formats has become very common. This is usually done through Optical Character Recognition (OCR), where text images (scanned physical documents) are converted into machine … Read more

Super Simple Implementation of OCR Text Recognition Application

1. Introduction: Recently, I ran into a small problem where I needed to extract text printed on images. Since there were quite a few images, several with hundreds of characters each, manually typing them out seemed … Read more

Fudan PhD Uses OCR and Regex to Verify Nucleic Acid Reports in 2 Minutes

Source: Big Data Digest. This article is about 2,000 words long, roughly a 5-minute read. Processing 800 images takes only 2 minutes, and the program has been packaged. Recently, according to reports from Fudan University, Li Xiaokang, a PhD student from the School of Information Science and Engineering, used OCR and regular expressions … Read more

Practical Guide | OpenCV + OCR Circular Text Recognition

Introduction: This article walks through the detailed steps and a code demonstration for implementing circular text recognition using OpenCV + OCR. (Source: WeChat Official Account: OpenCV and AI Deep Learning) Background: In optical character recognition (OCR) scenarios, there are many special … Read more

Simple Algorithm Flow for OCR ID Card Recognition

While researching OCR ID card recognition, I organized the algorithm flow from the perspective of basic image processing. Introduction to OCR: The goal of Optical Character Recognition (OCR) is to analyze and recognize characters in images, converting … Read more

Easily Recognize Text With This Python OCR Library Supporting Over 80 Languages

This article is adapted from Python Big Data Analysis. What is OCR? There is a program called “ScanMaster”, which some of you may have heard of. It is an integrated OCR application that can scan image … Read more

Unlocking the World of OCR: Comprehensive Toolkit and Datasets

In daily life, do you often use WeChat to recognize images and extract text information? Besides this, there are other applications like photo-based question searching, photo translation, document information extraction, and logistics information recognition, all thanks to the support of OCR technology. With the continuous development of deep learning technologies, intelligent OCR algorithms and applications … Read more

PyTesseract: A Powerful OCR Tool!

In 2024, OCR is no longer exotic, out-of-reach technology! Today, let’s try something fun: PyTesseract, a Python tool that lets you easily extract text from images. Whether it’s scanned documents, screenshots, or photos taken with a phone, it can help you convert them into editable text with just one click, making … Read more

Principles and Applications of OCR Technology

Introduction: Text is one of the most important sources of information for humans, and natural scenes are filled with all kinds of characters and symbols. OCR (Optical Character Recognition) is a familiar term, referring to the process where electronic devices … Read more

How Google Gemini Achieves Smart Interaction Through Language Models

Introduction: This article explores using Google’s Gemini Flash, rather than the widely known LangChain, to interact intelligently with CSV files. We will build a simple CSV interpreter that generates code and parses data without LangChain, with an intuitive user interface built in Streamlit. (© Deep Blue AI) In today’s rapidly evolving … Read more