Click the above “Visual Learning for Beginners“, select to add “Star” or “Top“
Essential content delivered promptly
Conducting research on OCR ID card recognition, I will organize the algorithm flow from the perspective of basic image processing.
-
Optical Character Recognition (OCR)
-
The goal is to analyze and recognize characters in images, converting them into a sequence of text characters.
-
Utilizes pattern recognition and digital image processing techniques to solve text input problems.
Classification by Input Method
-
Printed Text
-
Handwritten Text (input via scanner/input via graphics tablet)
Classification by Recognizable Character Set
-
English, Chinese, Japanese, Korean, etc.
-
Chinese, commonly used 4000 characters, various fonts, similar characters are difficult to distinguish
Application-Oriented OCR
-
Licenses, receipts, documents, business cards, ID cards, driver’s licenses, automotive manufacturing.
A typical algorithm flowchart for ID card recognition is as follows:
-
Denoising filtering, illumination processing
-
Enhancement (optional) gray stretching
-
Binarization converting grayscale images to binary images
-
Skew correction Hough transform, projection method
-
Line segmentation
The character information distribution rules in ID card images, with a certain gap between each line; using horizontal projection method for image segmentation
-
Character segmentation
Vertical projection
-
Template matching method
Establish a standard template for each character, performing graphic matching, stroke matching, and geometric feature matching. Characteristics: simple implementation, high image quality requirements, slow computation speed, low recognition rate for similar characters
-
Artificial Neural Network Character Recognition Algorithm
Artificial neural network, referred to as neural network, is a mathematical model or computational model that mimics the structure and function of biological neural networks.
– ID card number verification
– Validity verification
ID Card Recognition Software
-
Yunmai
-
WenTong
-
Abbyy
-
……
The recognition effect is significantly correlated with photo clarity, tilt, background, lighting, contrast, etc.
The author is CSDN blogger maowenbei
Link: https://blog.csdn.net/maowenbei/article/details/72765647
Community Group
Welcome to join the public account reader group to communicate with peers. Currently, there are WeChat groups for SLAM, three-dimensional vision, sensors, autonomous driving, computational photography, detection, segmentation, recognition, medical imaging, GAN, algorithm competitions (which will gradually be subdivided in the future), please scan the WeChat number below to join the group, noting: “nickname + school/company + research direction”, for example: “Zhang San + Shanghai Jiao Tong University + Visual SLAM”. Please follow the format, otherwise, you will not be accepted. After successful addition, invitations will be sent to relevant WeChat groups based on research direction. Please do not send advertisements in the group, otherwise, you will be removed, thank you for your understanding~