Efficient Open Source OCR Tool: Introduction and Usage of Surya-OCR

Efficient Open Source OCR Tool: Introduction and Usage of Surya-OCR

Click the card below to follow “Machine Vision and Deep Learning” Visual/image heavy content delivered first! Background In many enterprise applications, Optical Character Recognition (OCR) is a fundamental technology. In this article, we will delve into Surya-OCR, a recently popular solution. Text detection and extraction are crucial in various business use cases. For example: In … Read more

Next-Gen RAG Engine Based on OCR and Document Parsing

Next-Gen RAG Engine Based on OCR and Document Parsing

Introduction It is an open-source RAG (Retrieval-Augmented Generation) engine built on deep document understanding. It mainly provides a streamlined RAG workflow for enterprises and individuals of various sizes, leveraging large language models (LLMs) to handle users’ diverse complex format data, offering reliable Q&A and well-founded citations. Its main features include: 1. Deep Document Understanding: Capable … Read more

How Far Is AI from Practical Automation?

How Far Is AI from Practical Automation?

Artificial intelligence, from its name, suggests two characteristics: automation and intelligence. From these two perspectives, the level of intelligence is insufficient, leading to an inability to truly achieve automation in practice. For example, a private enterprise has bank reconciliation statements, each bank’s statement has different formats and a large quantity, resulting in a significant workload! … Read more

Implementing OCR Recognition Using Halcon

Implementing OCR Recognition Using Halcon

Previously, I worked with OpenCV, but now the company has a project for OCR, and I’ve implemented it using Halcon. There is a lot of information online about OCR teaching, but it can be overwhelming. Below is the practical implementation based on the materials and the current project. First, we need to create a sample … Read more

Why Image Recognition Needs to Convert Color Images to Grayscale

Why Image Recognition Needs to Convert Color Images to Grayscale

Click the above “Beginner Learning Vision” and choose to add Star or “Pin” Important Content Delivered First Hand Previously, when introducing OCR recognition technology, we mentioned grayscale conversion in the image preprocessing section. You might wonder: Why do we need to convert color images to grayscale for image recognition? Before explaining this question, we need … Read more

OPT Smart Code Reader with Deep Learning OCR Technology

OPT Smart Code Reader with Deep Learning OCR Technology

The domestic code reader market has a wide variety of products, but few have ventured into the application of OCR (Optical Character Recognition) technology. OPT has launched a code reader product equipped with a deep learning OCR algorithm, leveraging years of profound technological accumulation, pushing the application of code readers into more refined and complex … Read more

Practical Insights on Intelligent Construction of OCR Platforms in Commercial Banks

Practical Insights on Intelligent Construction of OCR Platforms in Commercial Banks

Written by / China Bank Enterprise Architecture Construction Office, Song Shouwen In recent years, with the rapid development of artificial intelligence, OCR technology has been continuously updated and iterated, gradually enhancing its processing capabilities in recognition, and its application in the financial industry has matured and become widespread. Most commercial banks in China have introduced … Read more

Bing Translate: A Comprehensive Guide to Microsoft’s Translation Tool

Bing Translate: A Comprehensive Guide to Microsoft's Translation Tool

1. Tool Introduction Bing Translate is a translation website launched by Microsoft that provides full-text translation for paragraphs and web pages. It can translate both text and whole web pages. Similar to Google Translate, Bing Translate also employs statistical machine translation technology. The web version supports translation among over 40 languages. With Microsoft’s robust corpus … Read more

Understanding the Knowledge System of Computer Vision

Understanding the Knowledge System of Computer Vision

Click on the top "Xiaobai Learns Vision" to choose to add "Starred" or "Pinned" Heavyweight content delivered first time Introduction Computer vision is an important field of artificial intelligence technology. To put it metaphorically (not necessarily accurate), I believe computer vision is the eyes of the AI era, which shows its importance. Computer vision is … Read more

China Telecom Implements AI Using TensorFlow

China Telecom Implements AI Using TensorFlow

The Telecom Business Hall APP, as the entry-level application for China Telecom’s online services, has allowed its development team to have close contact with TensorFlow and artificial intelligence (AI) technology. AI is an area that the Telecom Business Hall APP had never explored before, and even the engineers involved in the project transitioned from Android … Read more