An Optical Character Recognition system using Pytesseract/Extracting data from Blood Pressure Reports.




As an IoT/Computer Vision intern in the Graduate Rotational Internship Program (GRIP) at The Sparks Foundation (TSF), my first task was to implement a character detector that extracts printed or handwritten text from an image or video.

To learn more, I also applied this detector to extracting and cleaning useful information from blood pressure reports supplied as images.
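As a rough illustration of the extraction step, the sketch below pulls systolic/diastolic pairs out of text that has already been recognized. The `120/80` layout, the `READING_RE` pattern, and the `extract_readings` name are assumptions made for this example; the actual report layout is not described here, so the pattern would likely need adjusting.

```python
import re
from typing import Dict, List

# Assumed layout: two 2-3 digit numbers separated by a slash, e.g. "120/80"
# or "120 / 80". This is a naive pattern: a date written as "01/02" would
# also match, so real reports may need a stricter rule.
READING_RE = re.compile(r"\b(\d{2,3})\s*/\s*(\d{2,3})\b")


def extract_readings(ocr_text: str) -> List[Dict[str, int]]:
    """Return every systolic/diastolic pair found in raw OCR text."""
    readings = []
    for line in ocr_text.splitlines():
        for systolic, diastolic in READING_RE.findall(line):
            readings.append(
                {"systolic": int(systolic), "diastolic": int(diastolic)}
            )
    return readings


if __name__ == "__main__":
    # Made-up sample text standing in for OCR output.
    sample = "Date 01-02  BP 120/80 mmHg\nnoise ~~\nDate 02-02  BP 135 / 85 mmHg"
    print(extract_readings(sample))
```

Keeping the parsing separate from the OCR call means the cleaning rules can be tried on plain strings before any image is involved.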



  • tesseract-ocr package
  • pytesseract 0.3.8
  • OpenCV
  • pandas

    pytesseract, an open-source library, is used to detect text in an image or video.

    OpenCV is used for image processing.

    pandas is used for data manipulation.
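One way the three libraries could fit together is sketched below: OpenCV binarizes the image, pytesseract runs Tesseract on it, and pandas holds the recognized lines. This is a minimal sketch, not the repository's actual code. The function names, the Otsu thresholding step, and the `--oem 3 --psm 6` options are assumptions chosen for illustration, and the Tesseract binary must be installed separately.

```python
from typing import List


def tesseract_config(psm: int = 6, oem: int = 3) -> str:
    """Build the Tesseract command-line options passed through pytesseract.

    psm 6 treats the image as a single uniform block of text; oem 3 lets
    Tesseract pick its default engine. Both are assumed defaults here.
    """
    return f"--oem {oem} --psm {psm}"


def image_to_lines(image_path: str) -> List[str]:
    """Read an image, binarize it with OpenCV, and return non-empty OCR lines."""
    # Imported lazily so tesseract_config() works without the OCR stack.
    import cv2
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:  # cv2.imread returns None instead of raising
        raise FileNotFoundError(f"Could not read image: {image_path}")

    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    # Otsu's method chooses the black/white threshold automatically.
    _, binary = cv2.threshold(
        gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU
    )

    text = pytesseract.image_to_string(binary, config=tesseract_config())
    return [line.strip() for line in text.splitlines() if line.strip()]


def lines_to_frame(lines: List[str]):
    """Store the OCR lines in a one-column pandas DataFrame for cleaning."""
    import pandas as pd

    return pd.DataFrame({"text": lines})
```

A call such as `lines_to_frame(image_to_lines("report.png"))` would then give a DataFrame with one row per recognized line, where `report.png` is a placeholder path rather than a file from this project.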

  • Owner
    Ramsis Hammadi
