A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.

Last update: Jul 26, 2022

Overview

PokeGAN

A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.

Dataset

The model has been trained on dataset that includes 819 pokémon.
You can download dataset from this kaggle link.

Dependencies

I have used the following versions for code work:

python==3.8.8
tensorflow==2.4.1
tensorflow-gpu==2.4.1
numpy==1.19.1
h5py==2.10.0

Note

There are several difficulties in pokemon generation using GAN :

The difficulty of GAN training is well known; changing a hyperparameter can greatly change the results.
The dataset size is too small! 819 different pokemon images are not enough. For this reason, I applied data augmentation on the data; these are the transformations applied :

img_transf = tf.keras.Sequential([
            	tf.keras.layers.experimental.preprocessing.RandomContrast(factor=(0.05, 0.15)),
                image_aug.RandomBrightness(brightness_delta=(-0.15, 0.15)),
                image_aug.PowerLawTransform(gamma=(0.8,1.2)),
                image_aug.RandomSaturation(sat=(0, 2)),
                image_aug.RandomHue(hue=(0, 0.15)),
                tf.keras.layers.experimental.preprocessing.RandomFlip("horizontal"),
	    	tf.keras.layers.experimental.preprocessing.RandomTranslation(height_factor=(-0.10, 0.10), width_factor=(-0.10, 0.10)),
		tf.keras.layers.experimental.preprocessing.RandomZoom(height_factor=(-0.10, 0.10), width_factor=(-0.10, 0.10)),
		tf.keras.layers.experimental.preprocessing.RandomRotation(factor=(-0.10, 0.10))])

StyleGAN training is very expensive! I trained the model starting from a 4x4 resolution up to the final resolution of 256x256. The model was trained for 8 days using a Tesla V100 32GB SXM2.
To get better results you need to use higher resolutions and train for longer time.

Results

These are some examples of new pokémon generated by the model :

New Generated Pokémon

More results

You can see hundreds of new pokemon here.
I repeat again it : to get better results (better details in pokemon) is necessary to train for more time.

References

This code implementation is inspired by the unofficial keras implementation of styleGAN.

A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.

Related tags

Overview

PokeGAN

Dataset

Dependencies

Note

Results

More results

References

Owner

Code for all the Advent of Code'21 challenges mostly written in python

Evaluating saliency methods on artificial data with different background types

Supporting code for "Autoregressive neural-network wavefunctions for ab initio quantum chemistry".

Pose estimation with MoveNet Lightning

Image augmentation library in Python for machine learning.

Deep Reinforcement Learning for mobile robot navigation in ROS Gazebo simulator

Hso-groupie - A pwnable challenge in Real World CTF 4th

This is an official implementation of the High-Resolution Transformer for Dense Prediction.

Selecting Parallel In-domain Sentences for Neural Machine Translation Using Monolingual Texts

Some methods for comparing network representations in deep learning and neuroscience.

Deep Inertial Prediction (DIPr)

A small demonstration of using WebDataset with ImageNet and PyTorch Lightning

Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorch

A embed able annotation tool for end to end cross document co-reference

Code for our paper "MG-GAN: A Multi-Generator Model Preventing Out-of-Distribution Samples in Pedestrian Trajectory Prediction" published at ICCV 2021.

Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle、PyTorch、Caffe2、MxNet、Keras、TensorFlow and Advbox can benchmark the robustness of machine learning models.

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation (ACM MM 2020)

OpenPCDet Toolbox for LiDAR-based 3D Object Detection.

Implementation of CVPR 2020 Dual Super-Resolution Learning for Semantic Segmentation

Learning to Segment Instances in Videos with Spatial Propagation Network