Multiple style transfer via variational autoencoder

Last update: Oct 29, 2022

ST-VAE

By Zhi-Song Liu, Vicky Kalogeiton and Marie-Paule Cani

This repo provides only simple testing code, pretrained models, and a demo of the network strategy.

We propose ST-VAE, a variational autoencoder for multiple style transfer.

Please see our paper or the arXiv version.

BibTeX

    @InProceedings{Liu2021stvae,
        author = {Zhi-Song Liu and Vicky Kalogeiton and Marie-Paule Cani},
        title = {Multiple Style Transfer via Variational AutoEncoder},
        booktitle = {2021 IEEE International Conference on Image Processing (ICIP)},
        month = {Oct},
        year = {2021}
    }

For the proposed ST-VAE model, we claim the following contributions:

• The first work to use a Variational AutoEncoder for image style transfer.

• Multiple style transfer via the proposed VAE-based Linear Transformation.
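
One plausible reading of the second point is that styles encoded into a VAE latent space can be combined linearly. As a rough illustration only (this is not the authors' implementation), the sketch below samples a latent code with the standard VAE reparameterization trick and blends several style codes with normalized weights. The names `reparameterize` and `blend_styles`, the plain-list vectors, and the weight normalization are all assumptions made for this example.

```python
import math
import random


def reparameterize(mu, log_var, rng=random):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1) (standard VAE trick)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def blend_styles(latents, weights):
    """Return the weighted average of several latent vectors (hypothetical helper)."""
    if len(latents) != len(weights) or not latents:
        raise ValueError("need one weight per latent vector")
    total = float(sum(weights))
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    dim = len(latents[0])
    return [sum(w * z[i] for z, w in zip(latents, weights)) / total
            for i in range(dim)]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the sketch is repeatable
    # Toy 2-D "style" codes; a real model would use encoder outputs instead.
    style_a = reparameterize([0.0, 1.0], [-2.0, -2.0], rng)
    style_b = reparameterize([2.0, -1.0], [-2.0, -2.0], rng)
    mixed = blend_styles([style_a, style_b], [0.7, 0.3])
    print(mixed)
```

In a real pipeline the blended code would be passed to the decoder together with the content features; that step depends on the model code and is left out here.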

Dependencies

Python > 3.0
PyTorch > 1.0
NVIDIA GPU + CUDA
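
The requirements above can be checked before running anything. The following is a minimal sketch, assuming only that PyTorch is importable as `torch`; the function name `check_environment` and the report keys are made up for this example.

```python
import importlib.util
import sys


def check_environment():
    """Report whether the listed dependencies appear to be satisfied."""
    report = {
        "python_ok": sys.version_info[0] >= 3,
        "torch_installed": importlib.util.find_spec("torch") is not None,
        "torch_version": None,
        "cuda_available": False,
    }
    if report["torch_installed"]:
        try:
            import torch  # imported lazily so the check also runs without PyTorch
        except ImportError:
            report["torch_installed"] = False
        else:
            report["torch_version"] = torch.__version__
            report["cuda_available"] = bool(torch.cuda.is_available())
    return report


if __name__ == "__main__":
    for key, value in check_environment().items():
        print("{}: {}".format(key, value))
```

If `cuda_available` is reported as False, the scripts may still need changes to run on CPU; the repository does not state whether CPU-only execution is supported.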

Complete Architecture

The complete architecture is shown below.

Visualization

1. Single style transfer

2. Multiple style transfer

Implementation

1. Quick testing

Download the pre-trained models from

https://drive.google.com/file/d/1WZrvjCGBO1mpggkdJiaw8jp-6ywbXn4J/view?usp=sharing

and copy them to the folder "models".

Put your content images under "Test/content" and your style images under "Test/style".

For single style transfer, run

$ python eval.py

The stylized images will be saved in the folder "Test/result".

For multiple style transfer, run

$ python eval_multiple_style.py

For the real-time demo, run

$ python real-time-demo.py --style_image Test/style/picasso_self_portrait.jpg
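
To try the real-time demo with every image in "Test/style" rather than a single one, the command above can be generated per file. This is a sketch under stated assumptions: the helper `build_demo_commands` and the list of image suffixes are hypothetical, and only the `--style_image` flag shown above is used.

```python
import sys
from pathlib import Path

IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}  # assumed image formats


def build_demo_commands(style_dir="Test/style", script="real-time-demo.py"):
    """Return one argv list per style image found directly in style_dir."""
    style_paths = sorted(
        p for p in Path(style_dir).iterdir()
        if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES
    )
    return [[sys.executable, script, "--style_image", p.as_posix()]
            for p in style_paths]


if __name__ == "__main__":
    import subprocess

    for argv in build_demo_commands():
        print("running:", " ".join(argv))
        # Each call blocks until that demo process exits.
        subprocess.run(argv, check=True)
```

Run it from the repository root so that the relative paths resolve; a missing "Test/style" folder raises `FileNotFoundError`.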

For training, put the training images under the folder "train_data": download the MS-COCO dataset from https://cocodataset.org/#home and put it under "train_data/content", then download WikiArt from https://www.wikiart.org/ and put it under "train_data/style". Then run

$ python train.py
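
Before starting a long training run, it may be worth confirming that both data folders are populated. The sketch below counts image files under "train_data/content" and "train_data/style"; the helper names and the accepted suffixes are assumptions, and the check says nothing about what `train.py` itself expects beyond the folder names given above.

```python
from pathlib import Path

IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}  # assumed image formats


def count_images(folder):
    """Count image files anywhere below folder; return 0 if it does not exist."""
    folder = Path(folder)
    if not folder.is_dir():
        return 0
    return sum(1 for p in folder.rglob("*")
               if p.is_file() and p.suffix.lower() in IMAGE_SUFFIXES)


def check_training_layout(root="train_data"):
    """Return image counts for the content and style folders under root."""
    return {name: count_images(Path(root) / name) for name in ("content", "style")}


if __name__ == "__main__":
    counts = check_training_layout()
    for name, n in counts.items():
        print("train_data/{}: {} images".format(name, n))
    if not all(counts.values()):
        raise SystemExit("At least one folder is empty or missing.")
```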

Special thanks to Jakub M. Tomczak for their contributions to the LT computation.
