An interactive document scanner built in Python using OpenCV

Last update: Feb 12, 2022

Document Scanner

The scanner takes a poorly scanned image, finds the corners of the document, applies a perspective transformation to get a top-down view, sharpens the result, and applies an adaptive color threshold to clean it up.
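
To make the corner and perspective steps more concrete, here is a minimal sketch in plain Python of how four detected corners might be put into a consistent order and how the size of the top-down output could be derived from them. The function names (order_corners, output_size, destination_corners) are hypothetical and are not taken from this project or from the pyimagesearch modules; the sketch also assumes the corners have already been found. In a real pipeline the ordered corners and destination corners would typically be passed to OpenCV's cv2.getPerspectiveTransform and cv2.warpPerspective.

```python
import math


def order_corners(pts):
    """Order four (x, y) points as top-left, top-right, bottom-right, bottom-left.

    Assumes image coordinates (y grows downward) and a roughly upright document.
    """
    # Top-left has the smallest x + y, bottom-right the largest.
    tl = min(pts, key=lambda p: p[0] + p[1])
    br = max(pts, key=lambda p: p[0] + p[1])
    # Top-right has the smallest y - x, bottom-left the largest.
    tr = min(pts, key=lambda p: p[1] - p[0])
    bl = max(pts, key=lambda p: p[1] - p[0])
    return tl, tr, br, bl


def _distance(a, b):
    """Euclidean distance between two (x, y) points."""
    return math.hypot(a[0] - b[0], a[1] - b[1])


def output_size(corners):
    """Width and height of the top-down view, taken from the longest opposite edges."""
    tl, tr, br, bl = corners
    width = max(_distance(tl, tr), _distance(bl, br))
    height = max(_distance(tl, bl), _distance(tr, br))
    return int(round(width)), int(round(height))


def destination_corners(width, height):
    """Corners of the output rectangle, in the same order as order_corners."""
    return [(0, 0), (width - 1, 0), (width - 1, height - 1), (0, height - 1)]


# Hypothetical corner detections, deliberately given out of order.
detected = [(310, 40), (20, 300), (30, 50), (300, 320)]
ordered = order_corners(detected)
size = output_size(ordered)
```

Using the longer of each pair of opposite edges keeps the warped document from being squeezed when the photo was taken at an angle.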

On my test dataset of 280 images, the program correctly detected the corners of the document 92.8% of the time.

This project uses the transform and imutils modules from pyimagesearch. The UI code for the interactive mode is adapted from poly_editor.py.

In interactive mode, you can click and drag the corners of the document before the perspective transform is applied.

The scanner can also process an entire directory of images automatically and save the results to an output directory.

Usage

python scan.py (--images <IMG_DIR> | --image <IMG_PATH>) [-i]

The -i flag enables interactive mode, where you will be prompted to click and drag the corners of the document. For example, to scan a single image with interactive mode enabled:

python scan.py --image sample_images/desk.JPG -i

Alternatively, to scan all images in a directory without any input:

python scan.py --images sample_images
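
The usage pattern above (exactly one of --images or --image, plus an optional -i flag) maps naturally onto a mutually exclusive argument group in Python's standard argparse module. The following is a sketch of how such a command line could be parsed, not the project's actual scan.py; the build_parser function and the interactive attribute name are assumptions made for illustration.

```python
import argparse


def build_parser():
    """Build a parser for: scan.py (--images DIR | --image PATH) [-i]."""
    parser = argparse.ArgumentParser(prog="scan.py")

    # Exactly one of --images / --image must be supplied.
    source = parser.add_mutually_exclusive_group(required=True)
    source.add_argument("--images", help="directory of images to scan")
    source.add_argument("--image", help="path to a single image to scan")

    # -i is a plain on/off switch; "interactive" is an assumed attribute name.
    parser.add_argument(
        "-i",
        dest="interactive",
        action="store_true",
        help="click and drag the document corners before scanning",
    )
    return parser


# Parse the single-image example from the usage section.
args = build_parser().parse_args(["--image", "sample_images/desk.JPG", "-i"])
```

With this setup, passing both --images and --image (or neither) makes argparse print an error and exit, which matches the either/or form of the usage line.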

Owner

Kushal Shingote
