Fewshot-face-translation-GAN - Generative adversarial networks integrating modules from FUNIT and SPADE for face-swapping.

Last update: Dec 24, 2022

Overview

Few-shot face translation

A GAN based approach for one model to swap them all.

The table below shows our priliminary face-swapping results requiring one source face and <=5 target face photos. Notice that almost all of the identities, except Stephen Curry, are not in our training data (which is a subset of VGGFace2). More translation results can be found here.

Also, our model is capable of producing faces that has its gaze direction, glasses, and hiar occlusions being consistent with given source face. However, our model has suboptimal performance in terms of translating to asian faces. This is possibly due to limited representability of the feature extractor.

Src.\Tar.	Andrej Karpathy	Andrew Y. Ng	Du Fu	Elon Musk	Emilia Clarke	Geoffrey Hinton	Stephen Curry	Yann Lecun	Yoshua Benjio
Andrej Karpathy	N/A
Andrew Y. Ng		N/A
Du Fu			N/A
Elon Musk				N/A
Emilia Clarke					N/A
Geoffrey Hinton						N/A
Stephen Curry							N/A
Yann Lecun								N/A
Yoshua Benjio									N/A

I really like the Du Fu translation: such an interesting demostration how the GAN imagine the appearance of the prominent Chinese poet from just a painting.

Try in Google Colab

master branch (Jun. 2019)
dev branch (Oct. 2019)

We only provide pre-trained weights and inference script for now. Training script will be released after code cleanup.

Architecture

The above image illustrates our generator, which is a encoder-decoder based network, at test phase. Our swap-them-all approach is basically a GAN conditioned on the latent embeddings extracted from a pre-trained face recognition model. SPADE and AdaIN modules are incorporated in order to inject semantic priors to the networks.

During training phase, the input face A is heavily blurred and we train the model with resonctruction loss. Other objectives that aimed to improve translation performance while keeping semantic consistency, such as perceptual loss on rgb output and cosine similarity loss on latent embeddings, are also introduced.

Things that didn't work

We tried to distort (spline warp, downsample) the input image as in faceswap-GAN instead of masking it. However, the model did not learn proper identity translation but output face that is similar to its input.

Requirements

Python 3.6
Keras 2.2.4
TensorFlow 1.12.0 or 1.13.1

Fewshot-face-translation-GAN - Generative adversarial networks integrating modules from FUNIT and SPADE for face-swapping.

Related tags

Overview

Few-shot face translation

I really like the Du Fu translation: such an interesting demostration how the GAN imagine the appearance of the prominent Chinese poet from just a painting.

Try in Google Colab

We only provide pre-trained weights and inference script for now. Training script will be released after code cleanup.

Architecture

Things that didn't work

Requirements

References

Owner

a pytorch implementation of auto-punctuation learned character by character

The official implementation of ICCV paper "Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds".

Use .csv files to record, play and evaluate motion capture data.

Kaggle: Cell Instance Segmentation

Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)

The VeriNet toolkit for verification of neural networks

PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility

Official code repository for A Simple Long-Tailed Rocognition Baseline via Vision-Language Model.

Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务

A PyTorch implementation of EfficientDet.

You Only 👀 One Sequence

Face-Recognition-based-Attendance-System - An implementation of Attendance System in python.

A python library to build Model Trees with Linear Models at the leaves.

catch-22: CAnonical Time-series CHaracteristics

BOOKSUM: A Collection of Datasets for Long-form Narrative Summarization

Final project code: Implementing MAE with downscaled encoders and datasets, for ESE546 FA21 at University of Pennsylvania

CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped

tmm_fast is a lightweight package to speed up optical planar multilayer thin-film device computation.

Automatically download the cwru data set, and then divide it into training data set and test data set

Fewshot-face-translation-GAN - Generative adversarial networks integrating modules from FUNIT and SPADE for face-swapping.

Related tags

Overview

Few-shot face translation

I really like the Du Fu translation: such an interesting demostration how the GAN imagine the appearance of the prominent Chinese poet from just a painting.

Try in Google Colab

We only provide pre-trained weights and inference script for now. Training script will be released after code cleanup.

Architecture

Things that didn't work

Requirements

References

Owner

a pytorch implementation of auto-punctuation learned character by character

The official implementation of ICCV paper "Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds".

Use .csv files to record, play and evaluate motion capture data.

Kaggle: Cell Instance Segmentation

Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)

The VeriNet toolkit for verification of neural networks

PatchMatch-RL: Deep MVS with Pixelwise Depth, Normal, and Visibility

Official code repository for A Simple Long-Tailed Rocognition Baseline via Vision-Language Model.

Video Corpus Moment Retrieval with Contrastive Learning (SIGIR 2021)

“英特尔创新大师杯”深度学习挑战赛 赛道3：CCKS2021中文NLP地址相关性任务

A PyTorch implementation of EfficientDet.

You Only 👀 One Sequence

Face-Recognition-based-Attendance-System - An implementation of Attendance System in python.

A python library to build Model Trees with Linear Models at the leaves.

catch-22: CAnonical Time-series CHaracteristics

BOOKSUM: A Collection of Datasets for Long-form Narrative Summarization

Final project code: Implementing MAE with downscaled encoders and datasets, for ESE546 FA21 at University of Pennsylvania

CSWin Transformer: A General Vision Transformer Backbone with Cross-Shaped

tmm_fast is a lightweight package to speed up optical planar multilayer thin-film device computation.

Automatically download the cwru data set, and then divide it into training data set and test data set

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务