Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Last update: Jan 02, 2023

Overview

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

An efficient PyTorch library for Point Cloud Completion.

Project page | Paper | Video

Chulin Xie*, Chuxin Wang*, Bo Zhang, Hao Yang, Dong Chen, and Fang Wen. (*Equal contribution)

Abstract

We proposed a novel Style-based Point Generator with Adversarial Rendering (SpareNet) for point cloud completion. Firstly, we present the channel-attentive EdgeConv to fully exploit the local structures as well as the global shape in point features. Secondly, we observe that the concatenation manner used by vanilla foldings limits its potential of generating a complex and faithful shape. Enlightened by the success of StyleGAN, we regard the shape feature as style code that modulates the normalization layers during the folding, which considerably enhances its capability. Thirdly, we realize that existing point supervisions, e.g., Chamfer Distance or Earth Mover’s Distance, cannot faithfully reﬂect the perceptual quality of the reconstructed points. To address this, we propose to project the completed points to depth maps with a differentiable renderer and apply adversarial training to advocate the perceptual realism under different viewpoints. Comprehensive experiments on ShapeNet and KITTI prove the effectiveness of our method, which achieves state-of-the-art quantitative performance while offering superior visual quality.

Installation

Create a virtual environment via conda.

conda create -n sparenet python=3.7
conda activate sparenet

Install torch and torchvision.

conda install pytorch cudatoolkit=10.1 torchvision -c pytorch

Install requirements.
```
pip install -r requirements.txt
```
Install cuda
```
sh setup_env.sh
```

Dataset

Download the processed ShapeNet dataset generated by GRNet, and the KITTI dataset.

Update the file path of the datasets in configs/base_config.py:

__C.DATASETS.shapenet.partial_points_path = "/path/to/datasets/ShapeNetCompletion/%s/partial/%s/%s/%02d.pcd"
__C.DATASETS.shapenet.complete_points_path = "/path/to/datasets/ShapeNetCompletion/%s/complete/%s/%s.pcd"
__C.DATASETS.kitti.partial_points_path = "/path/to/datasets/KITTI/cars/%s.pcd"
__C.DATASETS.kitti.bounding_box_file_path = "/path/to/datasets/KITTI/bboxes/%s.txt"

# Dataset Options: ShapeNet, ShapeNetCars, KITTI
__C.DATASET.train_dataset = "ShapeNet"
__C.DATASET.test_dataset = "ShapeNet"

Get Started

Inference Using Pretrained Model

The pretrained models:

SpareNet for ShapeNet (316 MB)
PCN for ShapeNet
GRNet for ShapeNet (307 MB)
GRNet for KITTI (307 MB)
MSN for ShapeNet (8192 points)

run

python   --gpu ${GPUS}\
         --work_dir ${WORK_DIR} \
         --model ${network} \
         --weights ${path to checkpoint} \
         --test_mode ${mode}

example

python  test.py --gpu 0 --work_dir /path/to/logfiles --model sparenet --weights /path/to/cheakpoint --test_mode default

Train

All log files in the training process, such as log message, checkpoints, etc, will be saved to the work directory.

run

python   --gpu ${GPUS}\
         --work_dir ${WORK_DIR} \
         --model ${network} \
         --weights ${path to checkpoint}

example

python  train.py --gpu 0,1,2,3 --work_dir /path/to/logfiles --model sparenet --weights /path/to/cheakpoint

Differentiable Renderer

A fully differentiable point renderer that enables end-to-end rendering from 3D point cloud to 2D depth maps. See the paper for details.

Usage of Renderer

The inputs of renderer are pcd, views and radius, and the outputs of renderer are depth_maps.

example

# `projection_mode`: a str with value "perspective" or "orthorgonal"
# `eyepos_scale`: a float that defines the distance of eyes to (0, 0, 0)
# `image_size`: an int defining the output image size
renderer = ComputeDepthMaps(projection_mode, eyepos_scale, image_size)

# `data`: a tensor with shape [batch_size, num_points, 3]
# `view_id`: the index of selected view satisfying 0 <= view_id < 8
# `radius_list`: a list of floats, defining the kernel radius to render each point
depthmaps = renderer(data, view_id, radius_list)

License

The codes and the pretrained model in this repository are under the MIT license as specified by the LICENSE file.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

BibTex

If you like our work and use the codebase or models for your research, please cite our work as follows.

@inproceedings{xie2021stylebased,
      title={Style-based Point Generator with Adversarial Rendering for Point Cloud Completion}, 
      author={Chulin Xie and Chuxin Wang and Bo Zhang and Hao Yang and Dong Chen and Fang Wen},
      booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
      year={2021},
}

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Related tags

Overview

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

Project page | Paper | Video

Abstract

Installation

Dataset

Get Started

Inference Using Pretrained Model

Train

Differentiable Renderer

Usage of Renderer

License

BibTex

Owner

Microsoft

BaseCls BaseCls 是一个基于 MegEngine 的预训练模型库，帮助大家挑选或训练出更适合自己科研或者业务的模型结构

Gym environments used in the paper: "Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors"

A generalized framework for prototyping full-stack cooperative driving automation applications under CARLA+SUMO.

A rule learning algorithm for the deduction of syndrome definitions from time series data.

Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"

An educational resource to help anyone learn deep reinforcement learning.

On-device speech-to-index engine powered by deep learning.

DL & CV-based indicator toolset for the vehicle drivers via live dash-cam footage.

Learning Lightweight Low-Light Enhancement Network using Pseudo Well-Exposed Images

Deep learning-based approach to discovering Granger causality networks in multivariate time series

A large dataset of 100k Google Satellite and matching Map images, resembling pix2pix's Google Maps dataset.

Several simple examples for popular neural network toolkits calling custom CUDA operators.

TrackTech: Real-time tracking of subjects and objects on multiple cameras

DeepLM: Large-scale Nonlinear Least Squares on Deep Learning Frameworks using Stochastic Domain Decomposition (CVPR 2021)

A curated list of awesome Deep Learning tutorials, projects and communities.

PyTorch code of "SLAPS: Self-Supervision Improves Structure Learning for Graph Neural Networks"

[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang

Improving adversarial robustness by a coupling rejection strategy

Reproduce partial features of DeePMD-kit using PyTorch.

An expansion for RDKit to read all types of files in one line