Pytorch implementation for ACMMM2021 paper "I2V-GAN: Unpaired Infrared-to-Visible Video Translation".

Last update: Dec 31, 2022

Overview

I2V-GAN

This repository is the official Pytorch implementation for ACMMM2021 paper
"I2V-GAN: Unpaired Infrared-to-Visible Video Translation".

Traffic I2V Example:

Monitoring I2V Example:

Flower Translation Example:

Introduction

Abstract

Human vision is often adversely affected by complex environmental factors, especially in night vision scenarios. Thus, infrared cameras are often leveraged to help enhance the visual effects via detecting infrared radiation in the surrounding environment, but the infrared videos are undesirable due to the lack of detailed semantic information. In such a case, an effective video-to-video translation method from the infrared domain to the visible counterpart is strongly needed by overcoming the intrinsic huge gap between infrared and visible fields.
Our work propose an infrared-to-visible (I2V) video translation method I2V-GAN to generate fine-grained and spatial-temporal consistent visible light video by given an unpaired infrared video.
The backbone network follows Cycle-GAN and Recycle-GAN.

Technically, our model capitalizes on three types of constraints: adversarial constraint to generate synthetic frame that is similar to the real one, cyclic consistency with the introduced perceptual loss for effective content conversion as well as style preservation, and similarity constraint across and within domains to enhance the content and motion consistency in both spatial and temporal spaces at a fine-grained level.

IRVI Dataset

Click here to download IRVI dataset from Baidu Netdisk. Access code: IRVI.

Data Structure

SUBSET		TRAIN	TEST	TOTAL FRAME
Traffic		17000	1000	18000
Mornitoring	sub-1	1384	347	1731	6352
	sub-2	1040	260	1300
	sub-3	1232	308	1540
	sub-4	672	169	841
	sub-5	752	188	940

Installation

The code is implemented with Python(3.6) and Pytorch(1.9.0) for CUDA Version 11.2

Install dependencies:
pip install -r requirements.txt

Usage

Train

python train.py --dataroot /path/to/dataset \
--display_env visdom_env_name --name exp_name \
--model i2vgan --which_model_netG resnet_6blocks \
--no_dropout --pool_size 0 \
--which_model_netP unet_128 --npf 8 --dataset_mode unaligned_triplet

Test

python test.py --dataroot /path/to/dataset \
--which_epoch latest --name exp_name --model cycle_gan \
--which_model_netG resnet_6blocks --which_model_netP unet_128 \
--dataset_mode unaligned --no_dropout --loadSize 256 --resize_or_crop crop

Citation

If you find our work useful in your research or publication, please cite our work:

@inproceedings{I2V-GAN2021,
  title     = {I2V-GAN: Unpaired Infrared-to-Visible Video Translation},
  author    = {Shuang Li and Bingfeng Han and Zhenjie Yu and Chi Harold Liu and Kai Chen and Shuigen Wang},
  booktitle = {ACMMM},
  year      = {2021}
}

Acknowledgements

This code borrows heavily from the PyTorch implementation of Cycle-GAN and Pix2Pix and RecycleGAN.
A huge thanks to them!

@inproceedings{CycleGAN2017,
  title     = {Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networkss},
  author    = {Zhu, Jun-Yan and Park, Taesung and Isola, Phillip and Efros, Alexei A},
  booktitle = {ICCV},
  year      = {2017}
}

@inproceedings{Recycle-GAN2018,
  title     = {Recycle-GAN: Unsupervised Video Retargeting},
  author    = {Aayush Bansal and Shugao Ma and Deva Ramanan and Yaser Sheikh},
  booktitle = {ECCV},
  year      = {2018}
}

Pytorch implementation for ACMMM2021 paper "I2V-GAN: Unpaired Infrared-to-Visible Video Translation".

Related tags

Overview

I2V-GAN

Traffic I2V Example:

Monitoring I2V Example:

Flower Translation Example:

Introduction

Abstract

IRVI Dataset

Data Structure

Installation

Usage

Train

Test

Citation

Acknowledgements

Owner

Gans-in-action - Companion repository to GANs in Action: Deep learning with Generative Adversarial Networks

Machine learning algorithms for many-body quantum systems

OBBDetection: an oriented object detection toolbox modified from MMdetection

Code for Mining the Benefits of Two-stage and One-stage HOI Detection

El-Gamal on Elliptic Curve (Python)

GoodNews Everyone! Context driven entity aware captioning for news images

Multiview 3D object detection on MultiviewC dataset through moft3d.

Voice Conversion by CycleGAN (语音克隆/语音转换)：CycleGAN-VC3

Tools to create pixel-wise object masks, bounding box labels (2D and 3D) and 3D object model (PLY triangle mesh) for object sequences filmed with an RGB-D camera.

Open source code for Paper "A Co-Interactive Transformer for Joint Slot Filling and Intent Detection"

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.

TextBPN Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection

MakeItTalk: Speaker-Aware Talking-Head Animation

A simple PyTorch Implementation of Generative Adversarial Networks, focusing on anime face drawing.

ICON: Implicit Clothed humans Obtained from Normals (CVPR 2022)

基于PaddleClas实现垃圾分类，并转换为inference格式用PaddleHub服务端部署

This package proposes simplified exporting pytorch models to ONNX and TensorRT, and also gives some base interface for model inference.

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

Implementation of "Bidirectional Projection Network for Cross Dimension Scene Understanding" CVPR 2021 (Oral)

A code implementation of AC-GC: Activation Compression with Guaranteed Convergence, in NeurIPS 2021.