EAST Detector for Text Detection

OpenCV’s EAST(Efficient and Accurate Scene Text Detection ) text detector is a deep learning model, based on a novel architecture and training pattern. It is capable of

running at near real-time at 13 FPS on 720p images and
obtains state-of-the-art text detection accuracy.

Link to paper

OpenCV’s text detector implementation of EAST is quite robust, capable of localizing text even when it’s blurred, reflective, or partially obscured.

There are many natural scene text detection challenges that have been described by Celine Mancas-Thillou and Bernard Gosselin in their excellent 2017 paper, Natural Scene Text Understanding below:

Image/sensor noise: Sensor noise from a handheld camera is typically higher than that of a traditional scanner. Additionally, low-priced cameras will typically interpolate the pixels of raw sensors to produce real colors.
Viewing angles: Natural scene text can naturally have viewing angles that are not parallel to the text, making the text harder to recognize. Blurring: Uncontrolled environments tend to have blur, especially if the end user is utilizing a smartphone that does not have some form of stabilization.
Lighting conditions: We cannot make any assumptions regarding our lighting conditions in natural scene images. It may be near dark, the flash on the camera may be on, or the sun may be shining brightly, saturating the entire image.
Resolution: Not all cameras are created equal — we may be dealing with cameras with sub-par resolution.
Non-paper objects: Most, but not all, paper is not reflective (at least in context of paper you are trying to scan). Text in natural scenes may be reflective, including logos, signs, etc.
Non-planar objects: Consider what happens when you wrap text around a bottle — the text on the surface becomes distorted and deformed. While humans may still be able to easily “detect” and read the text, our algorithms will struggle. We need to be able to handle such use cases.
Unknown layout: We cannot use any a priori information to give our algorithms “clues” as to where the text resides.

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Thanks to Adrian's Blog for a comprehensive blog on EAST Detector.

License

MIT

Text Detection from images using OpenCV

Related tags

Overview

EAST Detector for Text Detection

Contributing

Thanks to Adrian's Blog for a comprehensive blog on EAST Detector.

License

Owner

Abhishek Singh

Um simples projeto para fazer o reconhecimento do captcha usado pelo jogo bombcrypto

Sort By Face

Opencv face recognition desktop application

PSENet - Shape Robust Text Detection with Progressive Scale Expansion Network.

1st place solution for SIIM-FISABIO-RSNA COVID-19 Detection Challenge

BoxToolBox is a simple python application built around the openCV library

Papers, Datasets, Algorithms, SOTA for STR. Long-time Maintaining

Morphological edge detection or object's boundary detection using erosion and dialation in OpenCV python

This is the code for our paper DAAIN: Detection of Anomalous and AdversarialInput using Normalizing Flows

A python programusing Tkinter graphics library to randomize questions and answers contained in text files

Code for the "Sensing leg movement enhances wearable monitoring of energy expenditure" paper.

Pre-Recognize Library - library with algorithms for improving OCR quality.

A little but useful tool to explore OCR data extracted with `pytesseract` and `opencv`

Omdena-abuja-anpd - Automatic Number Plate Detection for the security of lives and properties using Computer Vision.

EAST for ICPR MTWI 2018 Challenge II (Text detection of network images)

This is the open source implementation of the ICLR2022 paper "StyleNeRF: A Style-based 3D-Aware Generator for High-resolution Image Synthesis"

Responsive Doc. scanner using U^2-Net, Textcleaner and Tesseract

A facial recognition device is a device that takes an image or a video of a human face and compares it to another image faces in a database.

Toolbox for OCR post-correction

The virtual calculator will be above the live streaming from your camera