Collection of generative models in Tensorflow

Last update: Dec 30, 2022

Overview

tensorflow-generative-model-collections

Tensorflow implementation of various GANs and VAEs.

Related Repositories

Pytorch version

Pytorch version of this repository is availabel at https://github.com/znxlwm/pytorch-generative-model-collections

"Are GANs Created Equal? A Large-Scale Study" Paper

https://github.com/google/compare_gan is the code that was used in the paper.
It provides IS/FID and rich experimental results for all gan-variants.

Generative Adversarial Networks (GANs)

Lists

Name	Paper Link	Value Function
GAN	Arxiv
LSGAN	Arxiv
WGAN	Arxiv
WGAN_GP	Arxiv
DRAGAN	Arxiv
CGAN	Arxiv
infoGAN	Arxiv
ACGAN	Arxiv
EBGAN	Arxiv
BEGAN	Arxiv

Variants of GAN structure

Results for mnist

Network architecture of generator and discriminator is the exaclty sames as in infoGAN paper.
For fair comparison of core ideas in all gan variants, all implementations for network architecture are kept same except EBGAN and BEGAN. Small modification is made for EBGAN/BEGAN, since those adopt auto-encoder strucutre for discriminator. But I tried to keep the capacity of discirminator.

The following results can be reproduced with command:

python main.py --dataset mnist --gan_type 
   
     --epoch 25 --batch_size 64

Random generation

All results are randomly sampled.

Name	Epoch 2	Epoch 10	Epoch 25
GAN
LSGAN
WGAN
WGAN_GP
DRAGAN
EBGAN
BEGAN

Conditional generation

Each row has the same noise vector and each column has the same label condition.

Name	Epoch 1	Epoch 10	Epoch 25
CGAN
ACGAN
infoGAN

InfoGAN : Manipulating two continous codes

Results for fashion-mnist

Comments on network architecture in mnist are also applied to here.
Fashion-mnist is a recently proposed dataset consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a label from 10 classes. (T-shirt/top, Trouser, Pullover, Dress, Coat, Sandal, Shirt, Sneaker, Bag, Ankle boot)

The following results can be reproduced with command:

python main.py --dataset fashion-mnist --gan_type 
   
     --epoch 40 --batch_size 64

Random generation

All results are randomly sampled.

Name	Epoch 1	Epoch 20	Epoch 40
GAN
LSGAN
WGAN
WGAN_GP
DRAGAN
EBGAN
BEGAN

Conditional generation

Each row has the same noise vector and each column has the same label condition.

Name	Epoch 1	Epoch 20	Epoch 40
CGAN
ACGAN
infoGAN

Without hyper-parameter tuning from mnist-version, ACGAN/infoGAN does not work well as compared with CGAN.
ACGAN tends to fall into mode-collapse.
infoGAN tends to ignore noise-vector. It results in that various style within the same class can not be represented.

InfoGAN : Manipulating two continous codes

Some results for celebA

(to be added)

Variational Auto-Encoders (VAEs)

Lists

Name	Paper Link	Loss Function
VAE	Arxiv
CVAE	Arxiv
DVAE	Arxiv	(to be added)
AAE	Arxiv	(to be added)

Variants of VAE structure

Results for mnist

Network architecture of decoder(generator) and encoder(discriminator) is the exaclty sames as in infoGAN paper. The number of output nodes in encoder is different. (2x z_dim for VAE, 1 for GAN)

The following results can be reproduced with command:

python main.py --dataset mnist --gan_type 
   
     --epoch 25 --batch_size 64

Random generation

All results are randomly sampled.

Name	Epoch 1	Epoch 10	Epoch 25
VAE
GAN

Results of GAN is also given to compare images generated from VAE and GAN. The main difference (VAE generates smooth and blurry images, otherwise GAN generates sharp and artifact images) is cleary observed from the results.

Conditional generation

Each row has the same noise vector and each column has the same label condition.

Name	Epoch 1	Epoch 10	Epoch 25
CVAE
CGAN

Results of CGAN is also given to compare images generated from CVAE and CGAN.

Learned manifold

The following results can be reproduced with command:

python main.py --dataset mnist --gan_type VAE --epoch 25 --batch_size 64 --dim_z 2

Please notice that dimension of noise-vector z is 2.

Name	Epoch 1	Epoch 10	Epoch 25
VAE

Results for fashion-mnist

Comments on network architecture in mnist are also applied to here.

The following results can be reproduced with command:

python main.py --dataset fashion-mnist --gan_type 
   
     --epoch 40 --batch_size 64

Random generation

All results are randomly sampled.

Name	Epoch 1	Epoch 20	Epoch 40
VAE
GAN

Results of GAN is also given to compare images generated from VAE and GAN.

Conditional generation

Each row has the same noise vector and each column has the same label condition.

Name	Epoch 1	Epoch 20	Epoch 40
CVAE
CGAN

Results of CGAN is also given to compare images generated from CVAE and CGAN.

Learned manifold

The following results can be reproduced with command:

python main.py --dataset fashion-mnist --gan_type VAE --epoch 25 --batch_size 64 --dim_z 2

Please notice that dimension of noise-vector z is 2.

Name	Epoch 1	Epoch 10	Epoch 25
VAE

Results for celebA

(to be added)

Folder structure

The following shows basic folder structure.

├── main.py # gateway
├── data
│   ├── mnist # mnist data (not included in this repo)
│   |   ├── t10k-images-idx3-ubyte.gz
│   |   ├── t10k-labels-idx1-ubyte.gz
│   |   ├── train-images-idx3-ubyte.gz
│   |   └── train-labels-idx1-ubyte.gz
│   └── fashion-mnist # fashion-mnist data (not included in this repo)
│       ├── t10k-images-idx3-ubyte.gz
│       ├── t10k-labels-idx1-ubyte.gz
│       ├── train-images-idx3-ubyte.gz
│       └── train-labels-idx1-ubyte.gz
├── GAN.py # vanilla GAN
├── ops.py # some operations on layer
├── utils.py # utils
├── logs # log files for tensorboard to be saved here
└── checkpoint # model files to be saved here

Acknowledgements

This implementation has been based on this repository and tested with Tensorflow over ver1.0 on Windows 10 and Ubuntu 14.04.

Collection of generative models in Tensorflow

Related tags

Overview

tensorflow-generative-model-collections

Related Repositories

Pytorch version

"Are GANs Created Equal? A Large-Scale Study" Paper

Generative Adversarial Networks (GANs)

Lists

Variants of GAN structure

Results for mnist

Random generation

Conditional generation

InfoGAN : Manipulating two continous codes

Results for fashion-mnist

Random generation

Conditional generation

InfoGAN : Manipulating two continous codes

Some results for celebA

Variational Auto-Encoders (VAEs)

Lists

Variants of VAE structure

Results for mnist

Random generation

Conditional generation

Learned manifold

Results for fashion-mnist

Random generation

Conditional generation

Learned manifold

Results for celebA

Folder structure

Acknowledgements

Owner

A CROSS-MODAL FUSION NETWORK BASED ON SELF-ATTENTION AND RESIDUAL STRUCTURE FOR MULTIMODAL EMOTION RECOGNITION

Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness

SimplEx - Explaining Latent Representations with a Corpus of Examples

An official source code for "Augmentation-Free Self-Supervised Learning on Graphs"

Housing Price Prediction

A implemetation of the LRCN in mxnet

For IBM Quantum Challenge 2021 (May 20 - 26)

Fast algorithms to compute an approximation of the minimal volume oriented bounding box of a point cloud in 3D.

Split your patch similarly to `git add -p` but supporting multiple buckets

Source code for the GPT-2 story generation models in the EMNLP 2020 paper "STORIUM: A Dataset and Evaluation Platform for Human-in-the-Loop Story Generation"

Python package provinding tools for artistic interactive applications using AI

This repository provides code for "On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness".

1st place solution to the Satellite Image Change Detection Challenge hosted by SenseTime

Source code for paper "ATP: AMRize Than Parse! Enhancing AMR Parsing with PseudoAMRs" @NAACL-2022

Keyword spotting on Arm Cortex-M Microcontrollers

Simple helper library to convert a collection of numpy data to tfrecord, and build a tensorflow dataset from the tfrecord.

Code for paper Decoupled Dynamic Spatial-Temporal Graph Neural Network for Traffic Forecasting

A cross-document event and entity coreference resolution system, trained and evaluated on the ECB+ corpus.

This example implements the end-to-end MLOps process using Vertex AI platform and Smart Analytics technology capabilities

This YoloV5 based model is fit to detect people and different types of land vehicles, and displaying their density on a fitted map, according to their coordinates and detected labels.