📖 Deep Attentional Guided Image Filtering

Last update: Dec 23, 2022

Related tags

Overview

📖 Deep Attentional Guided Image Filtering

[Paper] Zhiwei Zhong, Xianming Liu, Junjun Jiang, Debin Zhao ,Xiangyang Ji
Harbin Institute of Technology, Tsinghua University

Abstract

Guided filter is a fundamental tool in computer vision and computer graphics which aims to transfer structure information from guidance image to target image. Most existing methods construct filter kernels from the guidance itself without considering the mutual dependency between the guidance and the target. However, since there typically exist significantly different edges in the two images, simply transferring all structural information of the guidance to the target would result in various artifacts. To cope with this problem, we propose an effective framework named deep attentional guided image filtering, the filtering process of which can fully integrate the complementary information contained in both images. Specifically, we propose an attentional kernel learning module to generate dual sets of filter kernels from the guidance and the target, respectively, and then adaptively combine them by modeling the pixel-wise dependency between the two images. Meanwhile, we propose a multi-scale guided image filtering module to progressively generate the filtering result with the constructed kernels in a coarse-to-fine manner. Correspondingly, a multi-scale fusion strategy is introduced to reuse the intermediate results in the coarse-to-fine process. Extensive experiments show that the proposed framework compares favorably with the state-of-the-art methods in a wide range of guided image filtering applications, such as guided super-resolution, cross-modality restoration, texture removal, and semantic segmentation.

This repository is an official PyTorch implementation of the paper "Deep Attentional Guided Filtering"

🔧 Dependencies and Installation

Python >= 3.5 (Recommend to use Anaconda or Miniconda)
[PyTorch >= 1.2(https://pytorch.org/
NVIDIA GPU + CUDA

Installation

Clone repo

git https://github.com/zhwzhong/DAGF.git
cd DAGF

Install dependent packages
```
pip install -r requirements.txt
```

Dataset

Trained Models

You can directly download the trained model and put it in checkpoints:

DAGF (Nearest):4, 8, 16
DAGF (Bicubic): 4, 8, 16

Train

You can also train by yourself:

 python main.py  --scale=16  --save_real --dataset_name='NYU' --model_name='DAGF'

Pay attention to the settings in the option (e.g. gpu id, model_name).

Test

We provide the processed test data in 'test_data' and pre-trained models in 'pre_trained' With the trained model, you can test and save depth images.

python quick_test.py

Acknowledgments

Thank for NYU, Lu, Middlebury, Sintel and DUT-OMRON datasets. % - Thank authors of GF, DJFR, DKN, PacNet, DSRN, JBU, Yang, DGDIE, DMSG, TGV, SDF and FBS for sharing their codes.

TO DO

Release the trained models for compared models:
- DGF: 4, 8, 16
- DJF: 4, 8, 16
- DMSG: 4, 8, 16
- DJFR: 4, 8, 16
- DSRN: 4, 8, 16
- PAC: 4, 8, 16
- DKN: 4, 8, 16
Release the experimental resutls of the compared models.

🏅 Our method won the Real DSR Challenge in ICMR 2021.

The detail information can be fond here.

📧 Contact

If you have any question, please email [email protected]

📖 Deep Attentional Guided Image Filtering

Related tags

Overview

📖 Deep Attentional Guided Image Filtering

Abstract

🔧 Dependencies and Installation

Installation

Dataset

Trained Models

Train

Test

Acknowledgments

TO DO

🏅 Our method won the Real DSR Challenge in ICMR 2021.

Owner

This is a JAX implementation of Neural Radiance Fields for learning purposes.

This repository contains the code for the CVPR 2020 paper "Differentiable Volumetric Rendering: Learning Implicit 3D Representations without 3D Supervision"

Code for the paper: On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations

This repository builds a basic vision transformer from scratch so that one beginner can understand the theory of vision transformer.

Chess reinforcement learning by AlphaGo Zero methods.

Additional code for Stable-baselines3 to load and upload models from the Hub.

[arXiv22] Disentangled Representation Learning for Text-Video Retrieval

Efficient and intelligent interactive segmentation annotation software

Detail-Preserving Transformer for Light Field Image Super-Resolution

Reimplementation of the paper "Attention, Learn to Solve Routing Problems!" in jax/flax.

Kinetics-Data-Preprocessing

Audio Visual Emotion Recognition using TDA

Framework for estimating the structures and parameters of Bayesian networks (DAGs) at per-sample resolution

This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"

Weakly- and Semi-Supervised Panoptic Segmentation (ECCV18)

PaddleViT: State-of-the-art Visual Transformer and MLP Models for PaddlePaddle 2.0+

[SIGGRAPH'22] StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets

RITA is a family of autoregressive protein models, developed by LightOn in collaboration with the OATML group at Oxford and the Debora Marks Lab at Harvard.

Final project code: Implementing MAE with downscaled encoders and datasets, for ESE546 FA21 at University of Pennsylvania

Genetic Programming in Python, with a scikit-learn inspired API