Simple Tensorflow implementation of Toward Spatially Unbiased Generative Models (ICCV 2021)

Last update: Apr 15, 2022

Overview

Spatial unbiased GANs — Simple TensorFlow Implementation [Paper]

: Toward Spatially Unbiased Generative Models (ICCV 2021)

Abstract Recent image generation models show remarkable generation performance. However, they mirror strong location preference in datasets, which we call spatial bias. Therefore, generators render poor samples at unseen locations and scales. We argue that the generators rely on their implicit positional encoding to render spatial content. From our observations, the generator’s implicit positional encoding is translation-variant, making the generator spatially biased. To address this issue, we propose injecting explicit positional encoding at each scale of the generator. By learning the spatially unbiased generator, we facilitate the robust use of generators in multiple tasks, such as GAN inversion, multi-scale generation, generation of arbitrary sizes and aspect ratios. Furthermore, we show that our method can also be applied to denoising diffusion probabilistic models.

Requirements

Tensorflow >= 2.x

Usage

├── dataset
   └── YOUR_DATASET_NAME
       ├── 000001.jpg 
       ├── 000002.png
       └── ...

Train

> python main.py --dataset FFHQ --phase train --img_size 256 --batch_size 4 --n_total_image 6400

Generate Video

> python generate_video.py

Results

FID: 3.81 (6.4M images(200k iterations), 8GPU, each 4 batch size)

Video

Uncuratd

Style mixing

It's worse than stylegan2.

Truncation trick

Reference

Author

Junho Kim

Simple Tensorflow implementation of Toward Spatially Unbiased Generative Models (ICCV 2021)

Related tags

Overview

Spatial unbiased GANs — Simple TensorFlow Implementation [Paper]

: Toward Spatially Unbiased Generative Models (ICCV 2021)

Requirements

Usage

Train

Generate Video

Results

Video

Uncuratd

Style mixing

Truncation trick

Reference

Author

Owner

Junho Kim

ARKitScenes - A Diverse Real-World Dataset for 3D Indoor Scene Understanding Using Mobile RGB-D Data

Official repo for BMVC2021 paper ASFormer: Transformer for Action Segmentation

Retina blood vessel segmentation with a convolutional neural network

End-to-end Temporal Action Detection with Transformer. [Under review]

DeepCO3: Deep Instance Co-segmentation by Co-peak Search and Co-saliency

A python library for face detection and features extraction based on mediapipe library

PyTorch implementation of Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

ARAE-Tensorflow for Discrete Sequences (Adversarially Regularized Autoencoder)

NFNets and Adaptive Gradient Clipping for SGD implemented in PyTorch

Funnels: Exact maximum likelihood with dimensionality reduction.

Generalized hybrid model for mode-locked laser diodes with an extended passive cavity

null

Neural Network Libraries

An open source python library for automated feature engineering

MODNet: Trimap-Free Portrait Matting in Real Time

Multi-scale discriminator feature-wise loss function

When in Doubt: Improving Classification Performance with Alternating Normalization

Compute FID scores with PyTorch.

PyTorch common framework to accelerate network implementation, training and validation

Styled Augmented Translation