MDMM - Learning multi-domain multi-modality I2I translation

Last update: Nov 04, 2022

Multi-Domain Multi-Modality I2I translation

PyTorch implementation of multi-modality image-to-image (I2I) translation across multiple domains. The project extends "Diverse Image-to-Image Translation via Disentangled Representations" (https://arxiv.org/abs/1808.00948), ECCV 2018 [DRIT]. With the disentangled representation framework, we can learn diverse image-to-image translation among multiple domains.

Contact: Hsin-Ying Lee ([email protected]) and Hung-Yu Tseng ([email protected])

Example Results

Prerequisites

Python 3.5 or Python 3.6
PyTorch 0.4.0 and torchvision (https://pytorch.org/)
tensorboardX
TensorFlow (for TensorBoard usage)
A Dockerfile based on CUDA 9.0, cuDNN 7.1, and Ubuntu 16.04 is provided on the [DRIT] GitHub page.

Usage

Training

python train.py --dataroot DATAROOT --name NAME --num_domains NUM_DOMAINS --display_dir DISPLAY_DIR --result_dir RESULT_DIR --isDcontent

Testing

python test.py --dataroot DATAROOT --name NAME --num_domains NUM_DOMAINS --out_dir OUT_DIR --resume MODEL_DIR --num NUM_PER_IMG
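
The two commands above can also be assembled programmatically, for example when launching several runs from a script. The following is a minimal sketch that uses only the flags shown above; the helper names (build_train_command, build_test_command, to_shell) and all argument values are hypothetical and not part of the repository.

```python
import shlex


def build_train_command(dataroot, name, num_domains, display_dir, result_dir,
                        use_content_discriminator=True):
    """Return the train.py invocation as an argument list (hypothetical helper)."""
    cmd = [
        "python", "train.py",
        "--dataroot", dataroot,
        "--name", name,
        "--num_domains", str(num_domains),
        "--display_dir", display_dir,
        "--result_dir", result_dir,
    ]
    if use_content_discriminator:
        # The training command in this README passes --isDcontent.
        cmd.append("--isDcontent")
    return cmd


def build_test_command(dataroot, name, num_domains, out_dir, resume, num):
    """Return the test.py invocation as an argument list (hypothetical helper)."""
    return [
        "python", "test.py",
        "--dataroot", dataroot,
        "--name", name,
        "--num_domains", str(num_domains),
        "--out_dir", out_dir,
        "--resume", resume,
        "--num", str(num),
    ]


def to_shell(cmd):
    """Quote each argument so the list can be pasted into a shell."""
    return " ".join(shlex.quote(part) for part in cmd)


if __name__ == "__main__":
    # Placeholder values; substitute your own paths and names.
    print(to_shell(build_train_command("DATAROOT", "NAME", 4,
                                       "DISPLAY_DIR", "RESULT_DIR")))
    print(to_shell(build_test_command("DATAROOT", "NAME", 4,
                                      "OUT_DIR", "MODEL_DIR", 5)))
```

The argument lists can be passed directly to subprocess.run from the repository root, which avoids shell quoting issues when paths contain spaces.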

Datasets

We validate our model on two datasets:

art: three domains (real images, Monet images, and ukiyo-e images). Data can be downloaded from the CycleGAN website.
weather: four domains (sunny, cloudy, snowy, and foggy). Data is randomly selected from the Image2Weather dataset website.

Each domain in a dataset should be placed in its own folder, named "trainA", "trainB", and so on in alphabetical order.
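
Before training, it can help to confirm that the data root contains one training folder per domain. The sketch below assumes only the naming rule stated above (trainA, trainB, ...); the function names are hypothetical, and the check says nothing about which domain is assigned to which letter.

```python
import os
import string


def expected_train_folders(num_domains):
    """List the folder names trainA, trainB, ... for the given domain count."""
    if not 1 <= num_domains <= len(string.ascii_uppercase):
        raise ValueError("num_domains must be between 1 and 26")
    return ["train" + letter for letter in string.ascii_uppercase[:num_domains]]


def missing_train_folders(dataroot, num_domains):
    """Return the expected training folders that do not exist under dataroot."""
    return [
        folder
        for folder in expected_train_folders(num_domains)
        if not os.path.isdir(os.path.join(dataroot, folder))
    ]


if __name__ == "__main__":
    # "DATAROOT" is a placeholder; the weather dataset has four domains.
    missing = missing_train_folders("DATAROOT", 4)
    if missing:
        print("Missing folders: " + ", ".join(missing))
    else:
        print("All training folders found.")
```

The domain count passed here should match the value given to --num_domains.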

Models

The pretrained model on the art dataset

bash ./models/download_model.sh art

The pretrained model on the weather dataset

bash ./models/download_model.sh weather

Note

The feature transformation (i.e. concat 0) is not fully tested, since neither the art nor the weather dataset requires shape variation.
The hyper-parameters matter and are task-dependent. They have not been carefully tuned yet.
Feel free to contact the authors about any potential improvement to the code.

Paper

Diverse Image-to-Image Translation via Disentangled Representations
Hsin-Ying Lee*, Hung-Yu Tseng*, Jia-Bin Huang, Maneesh Kumar Singh, and Ming-Hsuan Yang
European Conference on Computer Vision (ECCV), 2018 (oral) (* equal contribution)

Please cite our paper if you find the code or dataset useful for your research.

@inproceedings{DRIT,
  author = {Lee, Hsin-Ying and Tseng, Hung-Yu and Huang, Jia-Bin and Singh, Maneesh Kumar and Yang, Ming-Hsuan},
  booktitle = {European Conference on Computer Vision},
  title = {Diverse Image-to-Image Translation via Disentangled Representations},
  year = {2018}
}
