FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

Last update: Jan 07, 2023

Related tags

Overview

FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

FrankMocap pursues an easy-to-use single view 3D motion capture system developed by Facebook AI Research (FAIR). FrankMocap provides state-of-the-art 3D pose estimation outputs for body, hand, and body+hands in a single system. The core objective of FrankMocap is to democratize the 3D human pose estimation technology, enabling anyone (researchers, engineers, developers, artists, and others) can easily obtain 3D motion capture outputs from videos and images.

Btw, why the name FrankMocap? Our pipeline to integrate body and hand modules reminds us of Frankenstein's monster!

News:

[2020/10/09] We have improved openGL rendering speed. It's about 40% faster. (e.g., body module: 6fps -> 11fps)

Key Features

Body Motion Capture:

Hand Motion Capture

Egocentric Hand Motion Capture

Whole body Motion Capture (body + hands)

Installation

See INSTALL.md

A Quick Start

Run body motion capture

# using a machine with a monitor to show output on screen
python -m demo.demo_bodymocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

# screenless mode (e.g., a remote server)
xvfb-run -a python -m demo.demo_bodymocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

Run hand motion capture

# using a machine with a monitor to show outputs on screen
python -m demo.demo_handmocap --input_path ./sample_data/han_hand_short.mp4 --out_dir ./mocap_output

# screenless mode  (e.g., a remote server)
xvfb-run -a python -m demo.demo_handmocap --input_path ./sample_data/han_hand_short.mp4 --out_dir ./mocap_output

Run whole body motion capture

# using a machine with a monitor to show outputs on screen
python -m demo.demo_frankmocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

# screenless mode  (e.g., a remote server)
xvfb-run -a python -m demo.demo_frankmocap --input_path ./sample_data/han_short.mp4 --out_dir ./mocap_output

Note:
- Above commands use openGL by default. If it does not work, you may try alternative renderers (pytorch3d or openDR).
- See the readme of each module for details

Joint Order

See joint_order

Body Motion Capture Module

See run_bodymocap

Hand Motion Capture Module

See run_handmocap

Whole Body Motion Capture Module (Body + Hand)

See run_totalmocap

License

CC-BY-NC 4.0. See the LICENSE file.

References

FrankMocap is based on the following research outputs:

@article{rong2020frankmocap,
  title={FrankMocap: Fast Monocular 3D Hand and Body Motion Capture by Regression and Integration},
  author={Rong, Yu and Shiratori, Takaaki and Joo, Hanbyul},
  journal={arXiv preprint arXiv:2008.08324},
  year={2020}
}

@article{joo2020eft,
  title={Exemplar Fine-Tuning for 3D Human Pose Fitting Towards In-the-Wild 3D Human Pose Estimation},
  author={Joo, Hanbyul and Neverova, Natalia and Vedaldi, Andrea},
  journal={arXiv preprint arXiv:2004.03686},
  year={2020}
}

FrankMocap leverages many amazing open-sources shared in research community.
- SMPL, SMPLX
- Detectron2
- Pytorch3D (for rendering)
- OpenDR (for rendering)
- SPIN (for body module)
- 100DOH (for hand detection)
- lightweight-human-pose-estimation (for body detection)

FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

Related tags

Overview

FrankMocap: A Strong and Easy-to-use Single View 3D Hand+Body Pose Estimator

News:

Key Features

Installation

A Quick Start

Joint Order

Body Motion Capture Module

Hand Motion Capture Module

Whole Body Motion Capture Module (Body + Hand)

License

References

Owner

Facebook Research

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation, CVPR 2018

Equipped customers with insights about their EVs Hourly energy consumption and helped predict future charging behavior using LSTM model

WarpRNNT loss ported in Numba CPU/CUDA for Pytorch

Rewrite ultralytics/yolov5 v6.0 opencv inference code based on numpy, no need to rely on pytorch

Cancer Drug Response Prediction via a Hybrid Graph Convolutional Network

A very impractical 3D rendering engine that runs in the python terminal.

Implementation for HFGI: High-Fidelity GAN Inversion for Image Attribute Editing

Categorical Depth Distribution Network for Monocular 3D Object Detection

social humanoid robots with GPGPU and IoT

Similarity-based Gray-box Adversarial Attack Against Deep Face Recognition

Code to generate datasets used in "How Useful is Self-Supervised Pretraining for Visual Tasks?"

Official implementation for the paper "Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection"

Code base for NeurIPS 2021 publication titled Kernel Functional Optimisation (KFO)

Official implementation of the MM'21 paper Constrained Graphic Layout Generation via Latent Optimization

PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset

Nicely is a real-time Feedback and Intervention Program Depression is a prevalent issue across all age groups, socioeconomic classes, and cultural identities.

A collection of 100 Deep Learning images and visualizations

TorchFlare is a simple, beginner-friendly, and easy-to-use PyTorch Framework train your models effortlessly.

Fit Fast, Explain Fast

Codebase for ECCV18 "The Sound of Pixels"