Learning What and Where to Draw

Last update: Nov 18, 2022

Related tags

Deep Learning nips2016

Overview

###Learning What and Where to Draw Scott Reed, Zeynep Akata, Santosh Mohan, Samuel Tenka, Bernt Schiele, Honglak Lee

This is the code for our NIPS 2016 paper on text- and location-controllable image synthesis using conditional GANs. Much of the code is adapted from reedscot/icml2016 and dcgan.torch.

####Setup Instructions

You will need to install Torch, CuDNN, stnbhwd and the display package.

####How to train a text to image model:

Download the data including captions, location annotations and pretrained models.
Download the birds and humans image data.
Modify the CONFIG file to point to your data.
Run one of the training scripts, e.g. ./scripts/train_cub_keypoints.sh

####How to generate samples:

./scripts/run_all_demos.sh.
html files will be generated with results like the following:

Moving the bird's position via bounding box:

Moving the bird's position via keypoints:

Birds text to image with ground-truth keypoints:

Birds text to image with generated keypoints:

Humans text to image with ground-truth keypoints:

Humans text to image with generated keypoints:

####Citation

If you find this useful, please cite our work as follows:

@inproceedings{reed2016learning,
  title={Learning What and Where to Draw},
  author={Scott Reed and Zeynep Akata and Santosh Mohan and Samuel Tenka and Bernt Schiele and Honglak Lee},
  booktitle={Advances in Neural Information Processing Systems},
  year={2016}
}

Learning What and Where to Draw

Related tags

Overview

Owner

Scott Ellison Reed

BABEL: Bodies, Action and Behavior with English Labels [CVPR 2021]

RMTD: Robust Moving Target Defence Against False Data Injection Attacks in Power Grids

A script written in Python that returns a consensus string and profile matrix of a given DNA string(s) in FASTA format.

Temporally Coherent GAN SIGGRAPH project.

gACSON software for visualization, processing and analysis of three-dimensional electron microscopy images

Only a Matter of Style: Age Transformation Using a Style-Based Regression Model

Tensorboard for pytorch (and chainer, mxnet, numpy, ...)

Simulated garment dataset for virtual try-on

CS_Final_Metal_surface_detection - This is a final project for CoderSchool Machine Learning bootcamp on 29/12/2021.

⚓ Eurybia monitor model drift over time and securize model deployment with data validation

Python library for science observations from the James Webb Space Telescope

Dogs classification with Deep Metric Learning using some popular losses

Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code

StellarGraph - Machine Learning on Graphs

[CVPRW 2021] Code for Region-Adaptive Deformable Network for Image Quality Assessment

Manipulation OpenAI Gym environments to simulate robots at the STARS lab

This is a model made out of Neural Network specifically a Convolutional Neural Network model

TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-Captured Scenarios

We provided a matlab implementation for an evolutionary multitasking AUC optimization framework (EMTAUC).

Welcome to The Eigensolver Quantum School, a quantum computing crash course designed by students for students.