当前位置:网站首页>Python image recognition OCR
Python image recognition OCR
2020-11-07 20:56:00 【Coxhuang】
List of articles
- Python Image recognition OCR
- #1 demand
- #2 Environmental Science
- #3 install
- #3.1 macOS
- #3.2 Linux(CentOS)
- #4 Use
- #4.1 python install pytesseract library
- #4.2 Python Code
- #5 Online case
Python Image recognition OCR
#1 demand
- Identify the information in the picture , Such as QR code
#2 Environmental Science
macOS / Linux Python3.7.6
#3 install
#3.1 macOS
- install tesseract
// Install only tesseract, Don't install training tools brew install tesseract // install tesseract At the same time install training tools brew install --with-training-tools tesseract // install tesseract Install all languages at the same time , The language pack is bigger , If installed, it will take a long time , It is not recommended to install , Select on demand brew install --all-languages tesseract // install tesseract, And install training tools and language brew install --all-languages --with-training-tools tesseract
2. Download the language pack
Address : https://github.com/tesseract-ocr/tessdata
I have installed a Chinese language pack here
Chinese language pack : https://github.com/tesseract-ocr/tessdata/blob/master/chi_sim.traineddata
Then copy the downloaded Chinese language pack to the following path :
/usr/local/Cellar/tesseract/4.0.0_1/share/tessdata
3. Check out the local language pack
tesseract --list-langs
#3.2 Linux(CentOS)
- Installation dependency
yum install autoconf automake libtool libjpeg-devel libpng-devel libtiff-devel zlib-devel
2. install leptonica
download : wget https://github.com/tesseract-ocr/tesseract/archive/4.1.0.tar.gz
Unpack the installation
tar -xzvf leptonica-1.74.4.tar.gz cd leptonica-1.74.4.tar.gz ./configure --profix=/usr/local/leptonica make sudo make install
3. install tesseract-ocr
wget https://github.com/tesseract-ocr/tesseract/archive/3.04.zip unzip 3.04.zip cd tesseract-3.04/ ./configure make && make install sudo ldconfig
I have installed a Chinese language pack here
Chinese language pack : https://github.com/tesseract-ocr/tessdata/blob/master/chi_sim.traineddata
Then copy the downloaded Chinese language pack to the following path :
/usr/local/share/tessdata
#4 Use
#4.1 python install pytesseract library
pip install pytesseract pip install Pillow
#4.2 Python Code
from PIL import Image
import pytesseract
# Specify the image path and identify the language
data = pytesseract.image_to_string(Image.open('/Users/Documents/1.png'), lang='chi_sim')
print(data)
#5 Online case
Address :
Participation of this paper Tencent cloud media sharing plan , You are welcome to join us , share .
版权声明
本文为[Coxhuang]所创,转载请带上原文链接,感谢
边栏推荐
- AFO
- Annual salary of 900000 programmers is not as good as 3800 civil servants a month? How to choose between stability and high income?
- Deep into web workers (1)
- 关于update操作并发问题
- 在pandas中使用pipe()提升代码可读性
- 一万四千字分布式事务原理解析,全部掌握你还怕面试被问?
- Cpp(三) 什么是CMake
- 是时候结束 BERTology了
- Git代码提交操作,以及git push提示failed to push some refs'XXX'
- 模型预测准确率高达94%!利用机器学习完美解决2000亿美元库存难题
猜你喜欢

Git代码提交操作,以及git push提示failed to push some refs'XXX'

Got timeout reading communication packets解决方法

Design pattern of facade and mediator

C language I blog assignment 03

盘点那些争议最大的编程观点,你是什么看法呢?

Animation techniques and details you may not know

How to learn technology efficiently

android基础-RadioButton(单选按钮)

The CPU does this without the memory

14000 word distributed transaction principle analysis, master all of them, are you afraid of being asked in the interview?
随机推荐
Why do we need software engineering -- looking at a simple project
Implementation of Caesar cipher
Share several vs Code plug-ins I use everyday
使用jsDelivr加速你的网站
How Facebook open source framework simplifies pytorch experiment
laravel8更新之维护模式改进
Improvement of maintenance mode of laravel8 update
Do not understand the underlying principle of database index? That's because you don't have a B tree in your heart
WPF 关于绘图个人总结
建议患者自杀,OpenAI警告:GPT-3用于医疗目的风险太高
快速上手Git
[original] the influence of arm platform memory and cache on the real-time performance of xenomai
计组-总线通信控制之异步串行通信的数据传输
分享几个我日常使用的VS Code插件
C language I blog assignment 03
Python 图片识别 OCR
How to deal with data leakage and deletion related to business life and death?
Don't treat exceptions as business logic, which you can't afford
使用 Xunit.DependencyInjection 改造测试项目
Insight -- the application of sanet in arbitrary style transfer