A post-processing tool for scanned sheets of paper.

Last update: Dec 07, 2022


unpaper

Originally written by Jens Gulden — see AUTHORS for more information. Licensed under GNU GPL v2 — see COPYING for more information.

Overview

unpaper is a post-processing tool for scanned sheets of paper, especially for book pages that have been scanned from previously created photocopies. Its main purpose is to make scanned book pages more readable on screen after conversion to PDF. unpaper can also be useful for improving the quality of scanned pages before performing optical character recognition (OCR).

unpaper tries to clean scanned images by removing dark edges that scanning or copying introduced in areas outside the actual page content (e.g. dark areas between the left-hand side and the right-hand side of a double-sided book-page scan).

The program also tries to detect misaligned centering and rotation of pages and will automatically straighten each page by rotating it to the correct angle. This process is called "deskewing".

Note that the automatic processing will sometimes fail. It is always a good idea to check the results of unpaper manually and to adjust the parameter settings according to the requirements of the input. Each processing step can also be disabled individually for each sheet.

See the further documentation for notes on the supported file formats.

Dependencies

The only hard dependency of unpaper is ffmpeg, which is used for file input and output.
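Before running configure, it can help to confirm that the ffmpeg development files are visible to the build. The following is a minimal sketch, assuming that pkg-config is installed and that the relevant modules are libavformat, libavcodec and libavutil; these module names and the helper missing_ffmpeg_libs are assumptions for illustration, not something stated in this README.

```shell
#!/bin/sh
# Sketch: report which ffmpeg libraries pkg-config cannot find.
# The module names are an assumption about what the build links against;
# missing_ffmpeg_libs is a hypothetical helper, not part of unpaper.
missing_ffmpeg_libs() {
    missing=""
    for lib in libavformat libavcodec libavutil; do
        # pkg-config --exists returns non-zero when the module is not found.
        if ! pkg-config --exists "$lib" 2>/dev/null; then
            missing="$missing $lib"
        fi
    done
    # Strip the leading space before printing the list.
    printf '%s\n' "${missing# }"
}

missing=$(missing_ffmpeg_libs)
if [ -n "$missing" ]; then
    echo "missing ffmpeg development files: $missing"
else
    echo "ffmpeg development files found"
fi
```

If the script reports missing modules, installing the ffmpeg development package of your distribution is the usual remedy; configure remains the authoritative check.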

Building instructions

unpaper uses GNU Autotools for its build system, so you should be able to execute the same commands used for other software packages:

./configure
make
sudo make install

There are, though, some recommendations about the way you build the code. Since the tasks are calculation-intensive, it is important to build with optimizations turned on:

./configure CFLAGS="-O2 -march=native -pipe"

Even better, if your compiler supports it, use Link-Time Optimization (LTO), which has been shown to improve execution time noticeably:

./configure CFLAGS="-O2 -march=native -pipe -flto"
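If you are unsure whether your compiler accepts -flto, the choice between the two configure lines can be automated. The sketch below is one possible approach, not part of the unpaper build system: the helper name pick_cflags is hypothetical, and it assumes the compiler is reachable as $CC or cc and that mktemp is available.

```shell
#!/bin/sh
# Sketch: choose CFLAGS for ./configure, adding -flto only when the
# compiler can build a trivial program with it.
# pick_cflags is a hypothetical helper, not part of unpaper.
pick_cflags() {
    cflags="-O2 -march=native -pipe"
    # Build into a private temporary directory so nothing is left behind.
    if tmp=$(mktemp -d 2>/dev/null); then
        if printf 'int main(void) { return 0; }\n' |
            "${CC:-cc}" -flto -x c - -o "$tmp/a.out" 2>/dev/null; then
            cflags="$cflags -flto"
        fi
        rm -rf "$tmp"
    fi
    printf '%s\n' "$cflags"
}

# Print the configure invocation instead of running it.
printf './configure CFLAGS="%s"\n' "$(pick_cflags)"
```

The script only prints the command so that you can review the flags first; if the test compile fails for any reason, it falls back to the plain optimization flags.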

Further optimizations such as -ftracer and -ftree-vectorize are thought to work, but their effect has not been evaluated so your mileage may vary.

Further Information

You can find more information on the basic concepts and the image processing in the available documentation.
