Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation

Last update: Dec 06, 2022

Overview

This is the official implementation of "Multi-Oriented Scene Text Detection via Corner Localization and Region Segmentation".

For more details, please refer to our paper.

Citing the paper

Please cite the paper in your publications if it helps your research:

@inproceedings{lyu2018multi,
      title={Multi-oriented scene text detection via corner localization and region segmentation},
      author={Lyu, Pengyuan and Yao, Cong and Wu, Wenhao and Yan, Shuicheng and Bai, Xiang},
      booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
      pages={7553--7563},
      year={2018}
}

Contents

Requirements
Installation
Models
Test
Train
License

Requirements

NVIDIA GPU, Ubuntu 14.04, Python 2.7, CUDA 8 or 9
PyTorch 0.2.0_3

Installation

git clone https://github.com/lvpengyuan/corner.git
sh ./make.sh

or, as an alternative to make.sh:

cd rpsroi_pooling && python build.py

Models

Download the model and place it in weights/

Our trained model: Google Drive
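Before running the evaluation scripts, it can help to confirm that the download actually landed in weights/. The following is a minimal sketch, not part of the repository: the helper name list_weight_files is hypothetical, and it only assumes the weights/ directory mentioned above.

```python
import os


def list_weight_files(weights_dir="weights"):
    """Return the sorted file names in weights_dir, or [] if the directory is missing.

    Hypothetical helper for illustration; it is not part of the repository.
    """
    if not os.path.isdir(weights_dir):
        return []
    return sorted(
        name
        for name in os.listdir(weights_dir)
        if os.path.isfile(os.path.join(weights_dir, name))
    )


if __name__ == "__main__":
    found = list_weight_files()
    if found:
        print("Found in weights/: " + ", ".join(found))
    else:
        print("weights/ is missing or empty; download the model first.")
```

Run it from the repository root so that the relative path weights/ resolves correctly.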

Test

You can test a model at a single scale:

python eval_all.py

or at multiple scales:

python eval_multiscale.py

Note that you should modify the model path and the test dataset before testing.
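For readers unfamiliar with multi-scale testing, the general idea is to run the detector on several resized copies of an image and map each detection back to the original coordinates before merging. The sketch below illustrates only that coordinate bookkeeping. It is not taken from eval_multiscale.py; the function names and the example scales are assumptions made for illustration.

```python
def scale_factor(width, height, long_side):
    """Factor that resizes an image so its longer side equals long_side."""
    return float(long_side) / max(width, height)


def to_original_coords(quad, factor):
    """Map a quadrilateral found on the resized image back to the original image."""
    return [(x / factor, y / factor) for (x, y) in quad]


if __name__ == "__main__":
    width, height = 1280, 720
    # Hypothetical test scales; the repository's actual values may differ.
    for long_side in (512, 768, 1024):
        factor = scale_factor(width, height, long_side)
        resized = (int(round(width * factor)), int(round(height * factor)))
        print(long_side, resized)
```

Detections from every scale, once mapped back this way, share one coordinate system and can then be merged, for example with non-maximum suppression.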

Train

python train.py

To train a new model, modify the training settings before running the script.

License

This code is for academic purposes only.

Owner

Pengyuan Lyu
