Imaginaire - NVIDIA's Deep Imagination Team's PyTorch Library

Last update: Dec 29, 2022

Overview

Imaginaire

Docs | License | Installation | Model Zoo

Imaginaire is a PyTorch library that contains optimized implementations of several image and video synthesis methods developed at NVIDIA.

License

Imaginaire is released under the NVIDIA Software License. For commercial use, please consult NVIDIA Research Inquiries.

What's inside?

We have a tutorial for each model. Click on the model name, and your browser should take you to the tutorial page for the project.

Supervised Image-to-Image Translation

Algorithm Name	Feature	Publication
pix2pixHD	Learn a mapping that converts a semantic image to a high-resolution photorealistic image.	Wang et al. CVPR 2018
SPADE	Improve on pix2pixHD in handling diverse input labels and delivering better output quality.	Park et al. CVPR 2019

Unsupervised Image-to-Image Translation

Algorithm Name	Feature	Publication
UNIT	Learn a one-to-one mapping between two visual domains.	Liu et al. NeurIPS 2017
MUNIT	Learn a many-to-many mapping between two visual domains.	Huang et al. ECCV 2018
FUNIT	Learn a style-guided image translation model that can generate translations in unseen domains.	Liu et al. ICCV 2019
COCO-FUNIT	Improve FUNIT with a content-conditioned style encoding scheme for style code computation.	Saito et al. ECCV 2020

Video-to-video Translation

Algorithm Name	Feature	Publication
vid2vid	Learn a mapping that converts a semantic video to a photorealistic video.	Wang et al. NeurIPS 2018
fs-vid2vid	Learn a subject-agnostic mapping that converts a semantic video and an example image to a photorealistic video.	Wang et al. NeurIPS 2019

World-to-world Translation

Algorithm Name	Feature	Publication
wc-vid2vid	Improve on vid2vid in view consistency and long-term consistency.	Mallya et al. ECCV 2020
GANcraft	Convert semantic block worlds to realistic-looking worlds.	Hao et al. ICCV 2021

Owner

NVIDIA Research Projects
