This repository contains a pytorch implementation of "StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision".

Last update: Dec 09, 2022

Related tags

Deep Learning StereoPIFu_Code

Overview

StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision

| Project Page | Paper |

This repository contains a pytorch implementation of "StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision (CVPR 2021)".
Authors: Yang Hong, Juyong Zhang, Boyi Jiang, Yudong Guo, Ligang Liu and Hujun Bao.

Requirements

Python 3
Pytorch (<=1.4.0, some compatibility issues may occur in higher versions of pytorch)
tqdm
opencv-python
scikit-image
openmesh

for building evaluation data

pybind11,we recommend "pip install pybind11[global]" for installation.
gcc
cmake

Run the following code to install all pip packages:

pip install -r requirements.txt

Building Evaluation Data

Preliminary

Run the following script to compile & generate the relevant python module, which is used to render left/right color/depth/mask images from the textured/colored mesh.

cd GenEvalData
bash build.sh
cd ..

Usage

#demo, for textured mesh
python GenEvalData.py \
--tex_mesh_path="TempData/SampleData/rp_dennis_posed_004_100k.obj" \
--tex_img_path="TempData/SampleData/rp_dennis_posed_004_dif_2k.jpg" \
--save_dir="./TempData/TexMesh" \
--save_postfix="tex"

#demo, for colored mesh
python GenEvalData.py \
--color_mesh_path="TempData/SampleData/normalized_mesh_0089.off" \
--save_dir="./TempData/ColorMesh" \
--save_postfix="color"

These samples are from renderpeople and BUFF dataset.
Note: the mesh used for rendering needs to be located in a specific bounding box.

Inference

Preliminary

Run the following script to compile & generate deformable convolution from AANet.
```
cd AANetPlusFeature/deform_conv
bash build.sh
cd ../..
```
Download the trained model and mv to the "Models" folder.
Generate evalution data with aboved "Building Evaluation Data", or capture real data by ZED Camera (we test on ZED camera v1).
Note: rectifying left/right images is required before using ZED camera.

Demo

bash eval.sh

The reconsturction result will be saved to "Results" folder.
Note: At least 10GB GPU memory is recommended to run StereoPIFu model.

Citation

@inproceedings{yang2021stereopifu,
  author    = {Yang Hong and Juyong Zhang and Boyi Jiang and Yudong Guo and Ligang Liu and Hujun Bao},
  title     = {StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision},
  booktitle = {{IEEE/CVF} Conference on Computer Vision and Pattern Recognition (CVPR)},
  year      = {2021}
}

Contact

If you have questions, please contact [email protected].

This repository contains a pytorch implementation of "StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision".

Related tags

Overview

StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision

| Project Page | Paper |

Requirements

Building Evaluation Data

Preliminary

Usage

Inference

Preliminary

Demo

Citation

Contact

Owner

Molecular AutoEncoder in PyTorch

The Pytorch implementation for "Video-Text Pre-training with Learned Regions"

Lightweight mmm - Lightweight (Bayesian) Media Mix Model

Mosaic of Object-centric Images as Scene-centric Images (MosaicOS) for long-tailed object detection and instance segmentation.

Official page of Struct-MDC (RA-L'22 with IROS'22 option); Depth completion from Visual-SLAM using point & line features

[ ICCV 2021 Oral ] Our method can estimate camera poses and neural radiance fields jointly when the cameras are initialized at random poses in complex scenarios (outside-in scenes, even with less texture or intense noise )

Semi-supervised Domain Adaptation via Minimax Entropy

Optimizing Deeper Transformers on Small Datasets

Baseline model for "GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping" (CVPR 2020)

TensorFlow Ranking is a library for Learning-to-Rank (LTR) techniques on the TensorFlow platform

Code for weakly supervised segmentation of a single class

SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs

Real-Time Seizure Detection using EEG: A Comprehensive Comparison of Recent Approaches under a Realistic Setting

Code for AutoNL on ImageNet (CVPR2020)

DeepDiffusion: Unsupervised Learning of Retrieval-adapted Representations via Diffusion-based Ranking on Latent Feature Manifold

Code and dataset for AAAI 2021 paper FixMyPose: Pose Correctional Describing and Retrieval Hyounghun Kim, Abhay Zala, Graham Burri, Mohit Bansal.

An efficient and easy-to-use deep learning model compression framework

Simple and Robust Loss Design for Multi-Label Learning with Missing Labels

This project uses ViT to perform image classification tasks on DATA set CIFAR10.