Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes (CVPR 2021)

Last update: Dec 13, 2022

Overview

Long-term-Motion-in-3D-Scenes

This is an implementation of the CVPR'21 paper "Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes".

Please check our paper and the project webpage for more details.

Citation

If you use our code or paper, please consider citing:

@article{wang2020synthesizing,
  title={Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes},
  author={Wang, Jiashun and Xu, Huazhe and Xu, Jingwei and Liu, Sifei and Wang, Xiaolong},
  journal={arXiv preprint arXiv:2012.05522},
  year={2020}
}

Dependencies

Requirements:

python3.6
pytorch==1.1.0
trimesh
open3d
Chamfer Pytorch
Human Body Prior
SMPL-X

Datasets

We use the PROX and PROXE datasets as our training data. After downloading them, please put them in './data/'. Please put the body_segments data in './data/' as well. We provide generate_routepose_data.ipynb and generate_sub_data.ipynb for data generation.

Note that in PROX, the human meshes and the scene meshes are not in the same region of the world coordinate system. Unlike PROX and PROXE, we apply the inverse of the camera extrinsics to the scene mesh, because the scene is the input and needs to be aligned with the human bodies. This is done in the data generation code. As a result, you do not need to apply any transformation when calculating contact. When calculating collision, however, you still need to transform the human bodies, as in PROXE, so that they are aligned with the SDF. Please be careful with this during training and testing, especially if you want to test on other scenes such as Matterport3D.
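The alignment described above comes down to applying a 4x4 rigid transform, or its inverse, to mesh vertices. The following is a minimal NumPy sketch of that operation, not the code used in the notebooks. It assumes the extrinsics are available as a 4x4 homogeneous matrix; the function names, the `cam2world` variable, and the example values are hypothetical, and which direction the stored matrix represents should be checked against the data generation code.

```python
import numpy as np


def apply_rigid_transform(vertices, transform):
    """Apply a 4x4 homogeneous transform to an (N, 3) array of vertices."""
    vertices = np.asarray(vertices, dtype=np.float64)
    transform = np.asarray(transform, dtype=np.float64)
    rotation = transform[:3, :3]
    translation = transform[:3, 3]
    return vertices @ rotation.T + translation


def invert_rigid_transform(transform):
    """Invert a rigid transform (rotation plus translation) in closed form."""
    transform = np.asarray(transform, dtype=np.float64)
    rotation = transform[:3, :3]
    translation = transform[:3, 3]
    inverse = np.eye(4)
    inverse[:3, :3] = rotation.T
    inverse[:3, 3] = -rotation.T @ translation
    return inverse


# Hypothetical extrinsics: 90-degree rotation about z plus a translation.
cam2world = np.array([
    [0.0, -1.0, 0.0, 1.0],
    [1.0, 0.0, 0.0, 2.0],
    [0.0, 0.0, 1.0, 3.0],
    [0.0, 0.0, 0.0, 1.0],
])

# Toy stand-in for scene mesh vertices.
scene_vertices = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])

# Inverse extrinsics applied to the scene (the contact setting).
scene_aligned = apply_rigid_transform(
    scene_vertices, invert_rigid_transform(cam2world)
)

# Forward extrinsics applied to points in the aligned frame
# (the direction assumed here for the collision setting).
back_in_original_frame = apply_rigid_transform(scene_aligned, cam2world)
```

Applying the transform and then its inverse returns the original points, which is a quick sanity check when adapting the pipeline to a new scene.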

Demo

We provide demo.ipynb to help you play with our method. Before running it, please put a downsampled MPH16.ply mesh and the SDF data of this scene in './demo_data/'. You can download them from PROX and PROXE. Again, please be careful with the camera extrinsics when you test other scenes, and make sure the human body is inside the scene. The notebook also shows how to optimize the whole motion.
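One rough way to check that a body is inside a scene is to compare the body vertices against the axis-aligned bounding box of the scene vertices. This is only a sketch and is not part of demo.ipynb; the function name `fraction_inside_bounds` and the `margin` parameter are hypothetical, and a bounding-box test is much weaker than an SDF-based collision check.

```python
import numpy as np


def fraction_inside_bounds(body_vertices, scene_vertices, margin=0.0):
    """Return the fraction of body vertices inside the scene's bounding box."""
    body = np.asarray(body_vertices, dtype=np.float64)
    scene = np.asarray(scene_vertices, dtype=np.float64)
    # Axis-aligned bounding box of the scene, optionally enlarged by a margin.
    lower = scene.min(axis=0) - margin
    upper = scene.max(axis=0) + margin
    inside = np.all((body >= lower) & (body <= upper), axis=1)
    return float(inside.mean())


# Toy data: a scene spanning the unit cube, one body point inside, one outside.
toy_scene = np.array([[0.0, 0.0, 0.0], [1.0, 1.0, 1.0]])
toy_body = np.array([[0.5, 0.5, 0.5], [2.0, 2.0, 2.0]])
inside_fraction = fraction_inside_bounds(toy_body, toy_scene)
```

A value well below 1.0 for a real body and scene suggests that the two are not in the same coordinate frame, which usually points back to the camera extrinsics.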

Models

We use SMPL-X to represent human bodies. Please download the SMPL-X models and put them in './models/', so that the path looks like './models/smplx/SMPLX_NEUTRAL.npz'. Please also download the VPoser model and put it in './' (as './vposer_v1_0/').
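Before running the notebooks, it can help to confirm that the files are where this README expects them. The sketch below only checks the paths named in this document; the helper `missing_paths` and the `EXPECTED_PATHS` list are hypothetical and not part of the repository, and the real code may need additional files.

```python
from pathlib import Path

# Paths mentioned in this README, relative to the repository root.
EXPECTED_PATHS = [
    "models/smplx/SMPLX_NEUTRAL.npz",
    "vposer_v1_0",
    "data",
    "demo_data",
]


def missing_paths(root=".", expected=EXPECTED_PATHS):
    """Return the expected paths that do not exist under root."""
    root = Path(root)
    return [path for path in expected if not (root / path).exists()]
```

Calling `missing_paths()` from the repository root returns an empty list when everything listed above is in place.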

We also provide our pretrained model here

Training

After generating the data, you can train the networks directly:

python train_subgoal.py

python train_route.py

Please train the posenet after you have finished training the routenet, using your own pretrained routenet model:

python train_pose.py
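The ordering above can be captured in a small driver script. This is a sketch only: `training_commands` and `run_training` are hypothetical helpers, the scripts are assumed to run without extra arguments, and how train_pose.py locates the pretrained routenet model is not covered here and should be checked in that script.

```python
import subprocess
import sys

# Order used here: subgoal and route networks first, pose network last,
# since the posenet is trained after the routenet.
TRAINING_SCRIPTS = ["train_subgoal.py", "train_route.py", "train_pose.py"]


def training_commands(python=sys.executable):
    """Build one command per training script, in the order listed above."""
    return [[python, script] for script in TRAINING_SCRIPTS]


def run_training(commands=None):
    """Run each command in sequence and stop at the first failure."""
    for command in commands or training_commands():
        subprocess.run(command, check=True)
```

Calling `run_training()` from the repository root runs the three stages back to back; `check=True` stops the sequence if a stage exits with an error, so the posenet is never trained on top of a failed routenet run.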

Acknowledgement

This work was supported, in part, by grants from DARPA LwLL, NSF 1730158 CI-New: Cognitive Hardware and Software Ecosystem Community Infrastructure (CHASE-CI), NSF ACI-1541349 CC*DNI Pacific Research Platform, and gifts from Qualcomm and TuSimple. Part of our code is based on PROXE and it may help you with the dependencies and dataset parts as well. Many thanks!

License

Apache-2.0 License


Owner

Jiashun Wang
