Deep Sketch-guided Cartoon Video Inbetweening

Last update: Dec 22, 2022

Related tags

Overview

Cartoon Video Inbetweening

Paper | DOI | Video

The source code of Deep Sketch-guided Cartoon Video Inbetweening by Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander, IEEE Transactions on Visualization and Computer Graphics, 2021.

Prerequisites

Linux or Windows
Python 3
CPU or NVIDIA GPU + CUDA CuDNN

Use the Pre-trained Models

You can download the pre-trained model here.

Run the following commands for evaluating the frame synthesis model and full model:

python eval_synthesis.py
python eval_full.py

The frame synthesis model takes img_0, img_1, ske_t as inputs and synthesizes img_t. The full model takes img_0, img_1, ske_t as inputs and interpolates five frames between img_0 and img_1.

Datasets

A dataset is a directory with the following structure:

dataset
    ├── frame
    │   └── ${clip_id}
    │       └──${image_id}.png
    ├── sketch
    │   └── ${clip_id}
    │       └──${image_id}.png
    └── dismap
        └── ${clip_id}
            └──${image_id}.npy

The sketch images can be generated by the script "sketch.py" and the distance maps can be generated by "dismap.py". Due to the copyright issue of the movie Spirited Away, we can not release our training dataset. You can generate your own dataset if you interest.

Training

Run the following command for training the frame synthesis model and full model:

python train_synthesis.py
python train_full.py

Before you train the full model, you must train the frame synthesis model first and use its parameters to initialize the full model.

Citing

If you find our work useful, please consider citing:

@article{li2021deep,
  author    = {Li, Xiaoyu and Zhang, Bo and Liao, Jing and Sander, Pedro},
  journal   = {IEEE Transactions on Visualization and Computer Graphics},
  year      = {2021},
  publisher = {IEEE}
}

Deep Sketch-guided Cartoon Video Inbetweening

Related tags

Overview

Cartoon Video Inbetweening

Paper | DOI | Video

Prerequisites

Use the Pre-trained Models

Datasets

Training

Citing

Owner

Xiaoyu Li

League of Legends Reinforcement Learning Environment (LoLRLE) multiple training scenarios using PPO.

The official implementation code of "PlantStereo: A Stereo Matching Benchmark for Plant Surface Dense Reconstruction."

Simple Linear 2nd ODE Solver GUI - A 2nd constant coefficient linear ODE solver with simple GUI using euler's method

PyTorch implementation of the method described in the paper VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop.

Keras implementation of the GNM model in paper ’Graph-Based Semi-Supervised Learning with Nonignorable Nonresponses‘

mlpack: a scalable C++ machine learning library --

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

Exploring Relational Context for Multi-Task Dense Prediction [ICCV 2021]

SPEAR: Semi suPErvised dAta progRamming

Distilling Motion Planner Augmented Policies into Visual Control Policies for Robot Manipulation (CoRL 2021)

AITom is an open-source platform for AI driven cellular electron cryo-tomography analysis.

A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.

Code for STFT Transformer used in BirdCLEF 2021 competition.

Mesh Graphormer is a new transformer-based method for human pose and mesh reconsruction from an input image

Implementation for paper "STAR: A Structure-aware Lightweight Transformer for Real-time Image Enhancement" (ICCV 2021).

Attack classification models with transferability, black-box attack; unrestricted adversarial attacks on imagenet

Federated_learning codes used for the the paper "Evaluation of Federated Learning Aggregation Algorithms" and "A Federated Learning Aggregation Algorithm for Pervasive Computing: Evaluation and Comparison"

Code for our CVPR2021 paper coordinate attention

This code is an implementation for Singing TTS.

OCRA (Object-Centric Recurrent Attention) source code