Contrastively Disentangled Sequential Variational Audoencoder

Last update: Dec 24, 2022

Related tags

Overview

Contrastively Disentangled Sequential Variational Audoencoder (C-DSVAE)

Overview

This is the implementation for our C-DSVAE, a novel self-supervised disentangled sequential representation learning method.

Requirements

Python 3
PyTorch 1.7
Numpy 1.18.5

Dataset

Sprites

We provide the raw Sprites .npy files. One can also find the dataset on a third-party repo.

For each split (train/test), we expect the following components for each sequence sample

x: raw sample of shape [8, 3, 64, 64]
c_aug: content augmentation of shape [8, 3, 64, 64]
m_aug: motion augmentation of shape [8, 3, 64, 64]
motion factors: action (3 classes), direction (3 classes)
content factors: skin, tops, pants, hair (each with 6 classes)

Running

Train

./run_cdsvae.sh

Test

./run_test_sprite.sh

Classification Judge

The judge classifiers are pretrained with full supervision separately.

Sprites judge

C-DSVAE Checkpoints

We provide a sample Sprites checkpoint. Checkpoint parameters can be found in ./run_test_sprite.sh.

Paper

If you are inspired by our work, please cite the following paper:

@inproceedings{bai2021contrastively,
  title={Contrastively Disentangled Sequential Variational Autoencoder},
  author={Bai, Junwen and Wang, Weiran and Gomes, Carla},
  booktitle={Advances in Neural Information Processing Systems},
  volume={},
  year={2021}
}

Contrastively Disentangled Sequential Variational Audoencoder

Related tags

Overview

Contrastively Disentangled Sequential Variational Audoencoder (C-DSVAE)

Overview

Requirements

Dataset

Sprites

Running

Train

Test

Classification Judge

C-DSVAE Checkpoints

Paper

Owner

Junwen Bai

The mini-MusicNet dataset

[ICLR 2021] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vikas Chandra, Yingyan Lin

Deep Residual Networks with 1K Layers

Repository aimed at compiling code, papers, demos etc.. related to my PhD on 3D vision and machine learning for fruit detection and shape estimation at the university of Lincoln

Cross-platform CLI tool to generate your Github profile's stats and summary.

A gesture recognition system powered by OpenPose, k-nearest neighbours, and local outlier factor.

A Keras implementation of YOLOv4 (Tensorflow backend)

[ICLR 2022] DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

Task-related Saliency Network For Few-shot learning

This is the accompanying toolbox for the paper "A Survey on GANs for Anomaly Detection"

Implements MLP-Mixer: An all-MLP Architecture for Vision.

A PyTorch implementation of Mugs proposed by our paper "Mugs: A Multi-Granular Self-Supervised Learning Framework".

Progressive Image Deraining Networks: A Better and Simpler Baseline

Rational Activation Functions - Replacing Padé Activation Units

AgML is a comprehensive library for agricultural machine learning

CLIP (Contrastive Language–Image Pre-training) for Italian

UniLM AI - Large-scale Self-supervised Pre-training across Tasks, Languages, and Modalities

YouRefIt: Embodied Reference Understanding with Language and Gesture

An open framework for Federated Learning.

Sdf sparse conv - Deep Learning on SDF for Classifying Brain Biomarkers