pytextractor

Python OCR using Tesseract with the EAST OpenCV text detector.

Uses the EAST OpenCV text detector together with pytesseract to extract text (the default) or numbers from images.

Usage main

usage: text_detection.py [-h] [--east EAST] [-c CONFIDENCE] [-w WIDTH]
                         [-e HEIGHT] [-d] [-n] [-p PERCENTAGE] [-b MIN_BOXES]
                         [-i MAX_ITERATIONS]
                         images [images ...]

Text/Number extractor from image

positional arguments:
  images                path(s) to input image(s)

optional arguments:
  -h, --help            show this help message and exit
  --east EAST           path to input EAST text detector
  -c CONFIDENCE, --confidence CONFIDENCE
                        minimum probability required to inspect a region
  -w WIDTH, --width WIDTH
                        resized image width (should be multiple of 32)
  -e HEIGHT, --height HEIGHT
                        resized image height (should be multiple of 32)
  -d, --display         Display bounding boxes
  -n, --numbers         Detect only numbers
  -p PERCENTAGE, --percentage PERCENTAGE
                        Expand/shrink detected bound box
  -b MIN_BOXES, --min-boxes MIN_BOXES
                        minimum number of detected boxes to return
  -i MAX_ITERATIONS, --max-iterations MAX_ITERATIONS
                        max number of iterations finding min_boxes
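
The `-w`/`-e` options expect sizes that are multiples of 32, and `-p` grows or shrinks each detected box. As a rough illustration only, the sketch below shows one way such values could be prepared. The helper names `round_to_multiple_of_32` and `expand_box` are hypothetical and not part of pytextractor, and reading the percentage as a fraction of the box's own width and height applied to each side is an assumption, not the tool's documented behaviour.

```python
from typing import Tuple

Box = Tuple[int, int, int, int]  # (start_x, start_y, end_x, end_y)


def round_to_multiple_of_32(size: int) -> int:
    """Round a pixel dimension to the nearest multiple of 32 (at least 32)."""
    return max(32, (size + 16) // 32 * 32)


def expand_box(box: Box, percentage: float, image_w: int, image_h: int) -> Box:
    """Grow (positive) or shrink (negative) a box on every side.

    Assumption: the percentage is taken relative to the box's own width
    and height. The result is clamped to the image bounds.
    """
    start_x, start_y, end_x, end_y = box
    dx = int((end_x - start_x) * percentage / 100)
    dy = int((end_y - start_y) * percentage / 100)
    return (
        max(0, start_x - dx),
        max(0, start_y - dy),
        min(image_w, end_x + dx),
        min(image_h, end_y + dy),
    )


if __name__ == "__main__":
    print(round_to_multiple_of_32(300))
    print(expand_box((10, 10, 110, 60), 10, 200, 100))
```

Clamping to the image bounds keeps an expanded box from producing an out-of-range crop when the region is later handed to the OCR step.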

Usage lib

from pytextractor import pytextractor

extractor = pytextractor.PyTextractor()

Running tests

python setup.py test

Make sure Tesseract is installed first:

brew install tesseract              (macOS)
apt-get install tesseract-ocr       (Debian/Ubuntu)


Owner

Danny Crasto
