Tool which allow you to detect and translate text.

Last update: Nov 28, 2022

Overview

Text detection and recognition

This repository contains tool which allow to detect region with text and translate it one by one.

Description

Two pretrained neural networks are used. One of them is responsible for detecting places in which text appear and return its coordinates. Structure use for this operation is based on CRAFT architecture.

Craft Paper

Second network take detected words and recognize words included inside it. Convolutional Recurrential neural networks (CRNN) are used for this operation.

CRNN Paper

Example

Under construction

Deployment

I decided to deploy it on heroku (temporarily solution), but the amount of memory available on this platform is not enough. You can check it on heroku app. I decided to add bootstrap template because whole solution become more intuitive.

Windows Installation

To install it locally, you can run from your virtual env

python -m pip install requirements.txt

Linux installation

to install it properly on Linux OS you have to install additionaly


apt-get update
apt-get install -y libsm6 libxext6 libxrender-dev
pip install opencv-python

If problems with cv2 imports are still appearing then you should install

pip install opencv-contrib-python

Then you can run

```python
python -m pip install requirements.txt

Run

To run it locally, please activate your environment

> win
venv\Scripts\activate.bat

>linux
source venv\Scripts\activate

and run straight from project origin

python  app.py

If everything goes properly, you'll see on localhost:8000, screen just like one below.

Updates

I decided to remove argparse, because as I mention earlier, it was less intuitive. Solution is not fast, is more like an toy example which shows how to use Pytorch model on deployment environment.

Version which I use here contain torch-cpu which make preprocessing and detecting slightly slower. I test it on cuda and it was much faster.

If you have more information, drop me a line If you like it, give a star

Draft: Show how does it work on complex .tif example document.

Contact Info

Tool which allow you to detect and translate text.

Related tags

Overview

Text detection and recognition

Description

Example

Deployment

Windows Installation

Linux installation

Run

Updates

Owner

Damian Panek

A solution to the 2D Ising model of ferromagnetism, implemented using the Metropolis algorithm

Решения, подсказки, тесты и утилиты для тренировки по алгоритмам от Яндекса.

Self-describing JSON-RPC services made easy

List of awesome things around semantic segmentation 🎉

USAD - UnSupervised Anomaly Detection on multivariate time series

functorch is a prototype of JAX-like composable function transforms for PyTorch.

Our implementation used for the MICCAI 2021 FLARE Challenge titled 'Efficient Multi-Organ Segmentation Using SpatialConfiguartion-Net with Low GPU Memory Requirements'.

Official implementation of Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models at NeurIPS 2021

Computer Vision and Pattern Recognition, NUS CS4243, 2022

Vision-Language Pre-training for Image Captioning and Question Answering

PyTorch implementation for Stochastic Fine-grained Labeling of Multi-state Sign Glosses for Continuous Sign Language Recognition.

Virtual Dance Reality Stage is a feature that offers you to share a stage with another user virtually.

TEDSummary is a speech summary corpus. It includes TED talks subtitle (Document), Title-Detail (Summary), speaker name (Meta info), MP4 URL, and utterance id

It's final year project of Diploma Engineering. This project is based on Computer Vision.

Specificity-preserving RGB-D Saliency Detection

Breast Cancer Classification Model is applied on a different dataset

Python Blood Vessel Topology Analysis

The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.

Source code for deep symbolic optimization.

To SMOTE, or not to SMOTE?