当前位置:网站首页>OCR knowledge summary
OCR knowledge summary
2022-07-28 10:40:00 【Upper back left love】
Basic knowledge of image processing
OCR
Character recognition is also CV One of the main research directions , The direction of character recognition is mainly :
1. Separate character recognition
2. Combined with text detection
3. Text end-to-end recognition
Character recognition technology : Locate the text in the image area through text detection , Feature on the extracted region , On this basis, do special character recognition , There are many end-to-end ENd2End OCR
- Text detection — Locate the text area in the picture ( The accuracy of positioning directly affects the subsequent Recongnition)
Text detection in Concept :- ground truth(GT): In supervised learning Data is a marker (X,t)
x It's input data , Correct t The label is ground truth
In image recognition : Input the alpha chart , The original drawing uses Alpha The big brother label is GT (Aplha Channels indicate the transparency and opacity of an image ) - detecting box: Window moved box
- IOU: Standard performance metrics for image segmentation , The similarity between the predicted area and the live area
- Text detection algorithm :
- EAST/CTPN/SegLink/PixelLink/TextBoxes/TextBoxes++/TextSnake/MSR/…
- ground truth(GT): In supervised learning Data is a marker (X,t)
- Character recognition :
For text recognition without bending
* CNN + RNN + CTC
* CNN + seq2deq+Attention
* CNN + LSTM + CTC Verification code recognition
For curved text recognition :
In the traditional way There are a lot of invalid areas ,STN Network learning transformation parameters
Use Deformable Convolution Deformable convolution Different shape features of the text area can be extracted
Reference article : https://zhuanlan.zhihu.com/p/657075435
边栏推荐
猜你喜欢

8、Yarn系统架构与原理详解

SQL Server 2016 learning records - data update

Typora tutorial

SQL Server 2016 学习记录 --- 数据定义

生成对抗网络在DeepFake中的未来

Typora使用教程

Yarn报错:Exception message: /bin/bash: line 0: fg: no job control

Chapter 1: cross end development of small programs of uniapp ----- create a uniapp project

粒子群解决tsp的技术问题

8. Detailed explanation of yarn system architecture and principle
随机推荐
Codeforces Round #614 (Div. 2) B. JOE is on TV!
The IPO of SMIC International Technology Innovation Board passed smoothly, and its market value is expected to exceed 200billion!
ACM寒假集训#7
Small knowledge in Oracle
读写分离备机备份报错
GKCoherentNoiseSource
GKCircleObstacle
数据库安全 --- 创建登录名 用户+配置权限【笔记】
字符串匹配
20200229训练赛 L2 - 2 树种统计 (25分)
20200217 training match L1 - 7 2019 is coming (20 points)
Get to know SuperMap idesktop for the first time
Machine learning -- handwritten English alphabet 3 -- engineering features
6、MapReduce自定义分区实现
13. Hash table - the first common node of two linked lists
Codeforces Round #614 (Div. 2) A. ConneR and the A.R.C. Markland-N
最短路专题
7. MapReduce custom sorting implementation
2019年9月PAT甲级题目
ICML 2022 | 图表示学习的结构感知Transformer模型