A curated list of papers and resources for scene text detection and recognition

Awesome Scene Text

Papers are listed under the year they were first published, including ArXiv preprints. As a result, a paper accepted to, for example, CVPR 2019 may be listed under 2018 if it first appeared on ArXiv in 2018.

Table of contents
1. Scene Text Detection
2. Weakly Supervised Scene Text Detection & Recognition
3. Scene Text Recognition
4. Other Scene Text Related Papers
5. Scene Text Survey

Scene Text Detection (including methods for end-to-end detection and recognition)

2010

  • Detecting text in natural scenes with stroke width transform [CVPR 2010] [paper]
  • A Method for Text Localization and Recognition in Real-World Images [ACCV 2010] [paper]

2011

2012

  • Real-time scene text localization and recognition [CVPR 2012] [paper]

2013

2014

  • Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees [ECCV 2014] [paper]

2015

  • Symmetry-based text line detection in natural scenes [CVPR 2015] [paper]
  • Object proposals for text extraction in the wild [ICDAR 2015] [paper]
  • Text-Attentional Convolutional Neural Network for Scene Text Detection [TIP 2016] [paper]
  • Text Flow: A Unified Text Detection System in Natural Scene Images [ICCV 2015] [paper]

2016

2017

2018

2019

  • MSR: Multi-Scale Shape Regression for Scene Text Detection [IJCAI 2019] [paper]
  • Scene Text Detection with Inception Text Proposal Generation Module [ICMLC 2019] [paper]
  • Towards Robust Curve Text Detection with Conditional Spatial Expansion [CVPR 2019] [paper]
  • Curve Text Detection with Local Segmentation Network and Curve Connection [ArXiv] [paper]
  • Pyramid Mask Text Detector [ArXiv] [paper]
  • Tightness-aware Evaluation Protocol for Scene Text Detection [CVPR 2019] [paper]
  • Character Region Awareness for Text Detection [CVPR 2019] [paper]
  • Look More Than Once: An Accurate Detector for Text of Arbitrary Shapes [CVPR 2019] [paper]
  • TextCohesion: Detecting Text for Arbitrary Shapes [ArXiv] [paper]
  • Arbitrary Shape Scene Text Detection With Adaptive Text Region Representation [CVPR 2019] [paper]
  • Learning Shape-Aware Embedding for Scene Text Detection [CVPR 2019] [paper]
  • A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning [ACMMM 2019] [paper]
  • Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network [ICCV 2019] [paper]
  • Towards Unconstrained End-to-End Text Spotting [ICCV 2019] [paper]
  • TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting [ICCV 2019] [paper]
  • Convolutional Character Networks [ICCV 2019] [paper]

Weakly Supervised Scene Text Detection & Recognition

2017

  • Attention-Based Extraction of Structured Information from Street View Imagery [ICDAR 2017] [paper]
  • WeText: Scene Text Detection under Weak Supervision [ICCV 2017] [paper]
  • SEE: Towards Semi-Supervised End-to-End Scene Text Recognition [AAAI 2018] [paper]

Scene Text Recognition

2014

2015

2016

2017

2018

  • Char-Net: A Character-Aware Neural Network for Distorted Scene Text Recognition [AAAI 2018] [paper]
  • SqueezedText: A Real-time Scene Text Recognition by Binary Convolutional Encoder-decoder Network [AAAI 2018] [paper]
  • Edit Probability for Scene Text Recognition [CVPR 2018] [paper]
  • ASTER: An Attentional Scene Text Recognizer with Flexible Rectification [TPAMI 2018] [paper]
  • Synthetically Supervised Feature Learning for Scene Text Recognition [ECCV 2018] [paper]
  • Scene Text Recognition from Two-Dimensional Perspective [AAAI 2019] [paper]
  • ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification [CVPR 2019] [paper]

2019

Script Identification

Other Scene Text Related Papers

2016

2019

  • Scene Text Synthesis for Efficient and Effective Deep Network Training [ArXiv] [paper]

Scene Text Survey

2018

  • Scene Text Detection and Recognition: The Deep Learning Era [ArXiv] [paper]

2019

  • Scene text detection and recognition with advances in deep learning: a survey [IJDAR 2019] [paper]

Owner: Jan Zdenek