GitHub repository for "Improving Video Generation for Multi-functional Applications"

Last update: Dec 07, 2022

Improving Video Generation for Multi-functional Applications

Paper Link

For more information please refer to our homepage.

Requirements

TensorFlow 1.2.1
Python 2.7
ffmpeg

Data Format

Videos are stored as JPEGs of vertically stacked frames. Every frame must be at least 64x64 pixels, and each video must contain between 16 and 32 frames. For an example dataset, see: http://carlvondrick.com/tinyvideo/#data
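
The layout constraints above can be checked before training. The following is a minimal sketch, not part of this repository: the helper name describe_stacked_clip is hypothetical, and it assumes square frames by default (frame height equal to image width), which the text above does not state. It only needs the pixel dimensions of the JPEG, which any image library can report.

```python
def describe_stacked_clip(image_width, image_height, frame_height=None):
    """Return (num_frames, row_offsets) for a vertically stacked clip image.

    Hypothetical helper for illustration only.
    """
    if frame_height is None:
        # Assumption: frames are square, so one frame is as tall as the image is wide.
        frame_height = image_width
    if image_width < 64 or frame_height < 64:
        raise ValueError("frames must be at least 64x64 pixels")
    if image_height % frame_height != 0:
        raise ValueError("image height is not a whole number of frames")
    num_frames = image_height // frame_height
    if not 16 <= num_frames <= 32:
        raise ValueError("clips must contain between 16 and 32 frames")
    # Row at which each frame starts inside the stacked image.
    row_offsets = [i * frame_height for i in range(num_frames)]
    return num_frames, row_offsets
```

For example, a 128-pixel-wide, 4096-pixel-tall image would be read as 32 frames of 128x128 pixels, with frame i starting at row i * 128.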

Training

python main_train.py

Important Parameters:

mode: one of 'generate', 'predict', 'bw2rgb', 'inpaint', depending on whether you want to generate videos, predict future frames, colorize videos, or do inpainting.
batch_size: 64 is recommended; for colorization, use 32 to reduce memory usage.
root_dir: root directory of dataset
index_file: must be in root_dir, containing a list of all training data clips; path relative to root_dir.
experiment_name: name of experiment
output_every: output loss to stdout and write to tensorboard summary every xx steps.
sample_every: generate a visual sample every xx steps.
save_model_every: save the model every xx steps.
recover_model: if true, recover the saved model and continue training.
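
An index file for index_file can be generated with a short script. This is a sketch under stated assumptions, not the authors' tooling: the function name write_index_file and the default file name index.txt are hypothetical, and the format (one clip path per line, relative to root_dir) is one reading of the description above that should be checked against what main_train.py expects.

```python
import os


def write_index_file(root_dir, index_name="index.txt", extensions=(".jpg", ".jpeg")):
    """Write the relative paths of all JPEG clips under root_dir to an index file.

    Hypothetical helper; assumes one relative path per line.
    """
    paths = []
    for dirpath, _dirnames, filenames in os.walk(root_dir):
        for name in filenames:
            if name.lower().endswith(extensions):
                full_path = os.path.join(dirpath, name)
                # Store each path relative to root_dir.
                paths.append(os.path.relpath(full_path, root_dir))
    paths.sort()
    # The index file itself is placed directly in root_dir.
    index_path = os.path.join(root_dir, index_name)
    with open(index_path, "w") as handle:
        for rel_path in paths:
            handle.write(rel_path + "\n")
    return paths
```

Calling write_index_file("/path/to/dataset") writes /path/to/dataset/index.txt and returns the sorted list of relative clip paths; the directory shown here is a placeholder.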

Owner

Bernhard Kratzwald