Code for Paper: Self-supervised Learning of Motion Capture

Last update: Jul 25, 2022

Related tags

Overview

Self-supervised Learning of Motion Capture

This is code for the paper: Hsiao-Yu Fish Tung, Hsiao-Wei Tung, Ersin Yumer, Katerina Fragkiadaki, Self-supervised Learning of Motion Capture, NIPS2017 (Spotlight)

Check the project page for more results.

Content

Environment setup and Dataset
Data preprocessing
Pretrained model and small tfrecords
Training
Citation
License

1. Environment setup and Dataset

python We use python2.7.13 from Anaconda and Tensorflow 1.1
SMPL model: We need rest body template from SMPL model.

You can download it from here.

SURREAL Dataset: If you plan to pretrain or test on surreal dataset.

Please download surreal from here

H36M Dataset: If you plan to test on real video with some groundtruth (to evaluate).

Please download H3.6M Dataset from here

2. Data preprocessing

Parse Surreal Dataset into binary files

In order to speed up the read write for tfrecords, we parse surreal dataset into binary files. Open file

data/preparsed/main_parse_surreal

and change the data path and output path.

Build up tfrecords

change the data path to the path you built in the previous step in

pack_data/pack_data_bin.py

and run it. You can specify how many examples you want to have in each tfrecords by changing value for num_samples. If "is_test" is False, we use sequences generated from actor 1, 5, 6, 7, 8 as training samples. If "is_test" is True, we use only sequence "" from actor 9 as validation. You can change this split by modifying the "get_file_list" function in tfrecords_utils.py

3. Pretrained model and small tfrecords

You can downdload a pretrained model using supervision from here surreal_quo0.tfrecords is a small training data and surreal2_100_test_quo1.tfrecords

Note: To make this code pack, I calculate 2d flow directly from 3d groundtruth during testing. But you should replace this with your own predicted flow and keypoints.

4. Train model

open up pretrained.sh, there is one commend for pretraining using supervision, and one commend for finetuning with testing data. Commend out the line that you need

Citation

If you use this code, please cite:

@incollection{NIPS2017_7108, title = {Self-supervised Learning of Motion Capture}, author = {Tung, Hsiao-Yu and Tung, Hsiao-Wei and Yumer, Ersin and Fragkiadaki, Katerina}, booktitle = {Advances in Neural Information Processing Systems 30}, editor = {I. Guyon and U. V. Luxburg and S. Bengio and H. Wallach and R. Fergus and S. Vishwanathan and R. Garnett}, pages = {5236--5246}, year = {2017}, publisher = {Curran Associates, Inc.}, url = {http://papers.nips.cc/paper/7108-self-supervised-learning-of-motion-capture.pdf} }

Code for Paper: Self-supervised Learning of Motion Capture

Related tags

Overview

Self-supervised Learning of Motion Capture

Content

1. Environment setup and Dataset

2. Data preprocessing

3. Pretrained model and small tfrecords

4. Train model

Citation

Owner

Hsiao-Yu Fish Tung

(CVPR 2022 - oral) Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

Source code for "OmniPhotos: Casual 360° VR Photography"

Our implementation used for the MICCAI 2021 FLARE Challenge titled 'Efficient Multi-Organ Segmentation Using SpatialConfiguartion-Net with Low GPU Memory Requirements'.

Applying curriculum to meta-learning for few shot classification

Pyramid Grafting Network for One-Stage High Resolution Saliency Detection. CVPR 2022

Accompanying code for the paper "A Kernel Test for Causal Association via Noise Contrastive Backdoor Adjustment".

PyTorch code for the NAACL 2021 paper "Improving Generation and Evaluation of Visual Stories via Semantic Consistency"

Exploit Camera Raw Data for Video Super-Resolution via Hidden Markov Model Inference

Official Pytorch implementation of 'RoI Tanh-polar Transformer Network for Face Parsing in the Wild.'

Official implementation of Unfolded Deep Kernel Estimation for Blind Image Super-resolution.

Official repo for QHack—the quantum machine learning hackathon

[LREC] MMChat: Multi-Modal Chat Dataset on Social Media

Pytorch implementation of the paper "Optimization as a Model for Few-Shot Learning"

This repo implements a 3D segmentation task for an airport baggage dataset.

An implementation for the ICCV 2021 paper Deep Permutation Equivariant Structure from Motion.

The official implementation of our CVPR 2021 paper - Hybrid Rotation Averaging: A Fast and Robust Rotation Averaging Approach

This repository contains all data used for writing a research paper Multiple Object Trackers in OpenCV: A Benchmark, presented in ISIE 2021 conference in Kyoto, Japan.

Scalable Graph Neural Networks for Heterogeneous Graphs

DROPO: Sim-to-Real Transfer with Offline Domain Randomization

[ICCV 2021 Oral] Deep Evidential Action Recognition