A criticism of a recent paper on buggy image downsampling methods in popular image processing and deep learning libraries.

Last update: Jul 12, 2022

Related tags

Deep Learning buggy-resizing-critique

Overview

A Criticism of the Paper On Buggy Resizing Libraries

This repository contains:

a Jupyter notebook for reproducing the aliased image downsampling fenomenon, as demonstrated in the On Buggy Resizing Libraries paper, which argues that the image downsampling methods of the OpenCV, Tensorflow and PyTorch libraries are "buggy", with only PIL being correct.
simple solutions for antialiasing in every framework, which solves the issue in all cases using the same functions, simply by setting parameters appropriately:
- OpenCV: change the interpolation from bilinear to area (from cv2.INTER_LINEAR to cv2.INTER_AREA)
- Tensorflow: set the antialias flag to True
- PyTorch: change the interpolation mode from bilinear to area, or simply use torchvision.transforms.Resize() instead of torch.nn.functional.interpolate()

Try it out in a Colab Notebook:

My opinion:

neither of the used image downsampling methods is "buggy", not applying antialiasing by default is an understandable design decision for both image and tensor operations.
the main figure of the paper is misleading, and it only illustrates the issues of aliasing for image resizing.
the aliasing issue with downsampling can be solved in all frameworks by simply setting a few parameters correctly. My criticism is that this is not mentioned in the paper.
torchvision.transforms.Resize() is claimed to only be a "a wrapper around the PIL library" in a note in Section 3.2 of the paper. This is true for PIL image inputs, but is incorrect for torch.Tensors, which are resized using torchvision interpolation operations.
the remaining parts of the paper provide valuable insights into the effects of interpolation methods, quantization and compression on the FID score of generative models.

Update: Just found out that there is another, very thorough investigation of the same issue. Highly recommend checking the blogpost out. They also implement an OpenCV-compatible Pillow-equivalent resizing that provides proper antialiasing for all interpolations.

Bilinear downsampling results with and without aliasing:

The main figure (Figure 1) of the paper:

A criticism of a recent paper on buggy image downsampling methods in popular image processing and deep learning libraries.

Related tags

Overview

A Criticism of the Paper On Buggy Resizing Libraries

Owner

A pytorch-version implementation codes of paper: "BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation"

End-To-End Crowdsourcing

Codes for TIM2021 paper "Anchor-Based Spatio-Temporal Attention 3-D Convolutional Networks for Dynamic 3-D Point Cloud Sequences"

An inofficial PyTorch implementation of PREDATOR based on KPConv.

Fast Style Transfer in TensorFlow

PyTorch code for the ICCV'21 paper: "Always Be Dreaming: A New Approach for Class-Incremental Learning"

DeconvNet : Learning Deconvolution Network for Semantic Segmentation

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Build fully-functioning computer vision models with PyTorch

High-quality implementations of standard and SOTA methods on a variety of tasks.

The Self-Supervised Learner can be used to train a classifier with fewer labeled examples needed using self-supervised learning.

Repository for the Bias Benchmark for QA dataset.

This is the code repository for the paper "Identification of the Generalized Condorcet Winner in Multi-dueling Bandits" (NeurIPS 2021).

GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion

Digan - Official PyTorch implementation of Generating Videos with Dynamics-aware Implicit Generative Adversarial Networks

Paddle pit - Rethinking Spatial Dimensions of Vision Transformers

Official pytorch implementation of Rainbow Memory (CVPR 2021)

Code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of Cross-Task Neural Architecture Search.

The official code for PRIMER: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization

A PyTorch implementation of a Factorization Machine module in cython.