当前位置:网站首页>OCR 知识 概括
OCR 知识 概括
2022-07-28 10:22:00 【上后左爱】
图像处理基础知识
OCR
文字识别也是CV主要研究方向之一,文字识别方向主要:
1.单独文字识别
2.结合文字进行检测
3. 文字端到端识别
文字识别技术: 通过文字检测定位文字在图像区域,在提取区域上特征,在此基础上做专门的字符识别,出现许多端到端ENd2End OCR
- 文字检测— 定位图片中文本区域(定位的精度直接影响后续的Recongnition)
文字检测 中 概念:- ground truth(GT): 在有监督学习中 数据是标记(X,t)
x 是输入数据,正确的t 的标注是 ground truth
在图像识别中: 输入图像的alpha图,原始图使用Alpha大哥标签就是GT (Aplha 通道表示一个图片透明和不透明程度) - detecting box: 窗口移动的 box
- IOU: 图像分割问题标准性能度量,预测区域与实况区域之间的相似性
- 文字检测算法:
- EAST/CTPN/SegLink/PixelLink/TextBoxes/TextBoxes++/TextSnake/MSR/…
- ground truth(GT): 在有监督学习中 数据是标记(X,t)
- 文字识别:
对于不弯曲的文本识别
* CNN + RNN + CTC
* CNN + seq2deq+Attention
* CNN + LSTM + CTC 验证码识别
对于弯曲文本识别:
按照传统方式 出现大量无效的区域,STN 网络学习变换参数
使用Deformable Convolution 可变形卷积 可以提取文字区域的不同形状特征
参考文章: https://zhuanlan.zhihu.com/p/657075435
边栏推荐
- Codeforces Round #614 (Div. 2) B. JOE is on TV!
- RoboCup (2D) experiment 50 questions and the meaning of main functions
- Troubleshooting of tool failure caused by Chinese characters in PT kill query
- Excel word simple skills sorting (continuous update ~)
- SQL Server 2016 learning record - nested query
- Machine learning -- handwritten English alphabet 2 -- importing and processing data
- SQL Server 2016 learning records - set query
- Add new startup logo and startup / shutdown animation in mt6735
- GKLinearCongruentialRandomSource
- 7、MapReduce自定义排序实现
猜你喜欢

Sword finger offer

SQL Server 2016 learning records - data update

Idea create my first project

Idea packages jar packages and runs jar package commands

Uni app project directory, file function introduction and development specification

C language secondary pointer explanation and example code

6. Double pointer -- the sum of the two numbers of the incremental array is equal to the target number

最短路专题

7. Dichotomy -- find a set of repeated or ordered but rotating arrays

Get to know SuperMap idesktop for the first time
随机推荐
逆元&组合数&快速幂
GKRandom
SQL Server 2016 学习记录 --- 数据定义
爱可可AI前沿推介(7.28)
GKConstantNoiseSource
读写分离备机备份报错
ACM寒假集训#7
8. Numbers that appear more than half of the time in the array
【栈的应用】--- 中缀表达式转后缀表达式
Get to know SuperMap idesktop for the first time
Hurun released the 2020 top 10 Chinese chip design private enterprises: Huawei Hisilicon did not appear on the list!
2020 second intelligence cup preliminaries
GKCheckerboardNoiseSource
GKBillowNoiseSource
ACM winter vacation training 5
SDUT Round #9 2020-新春大作战
Why does the cluster need root permission
ACM寒假集训#6
20200229 training race L2 - 2 tree species Statistics (25 points)
GKNoise