[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

Last update: Jan 05, 2023

Related tags

Overview

Planar Surface Reconstruction From Sparse Views

Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey
University of Michigan
ICCV 2021 (Oral)

This repo contains code for our paper. Our model is implemented in Detectron2.

Given two RGB images with an unknown relationship, our system produces a single, coherent planar surface reconstruction of the scene in terms of 3D planes and relative camera poses.

We use a ResNet50-FPN to detect planes and predict probabilities of relative camera poses, and use a two-step optimization to generate a coherent planar reconstruction. (a) For each plane, we predict a segmentation mask, plane parameters, and an appearance feature. (b) Concurrently, we pass image features from the detection backbone through the attention layer and predict the camera transformation between views. (c) Our discrete optimization fuses the prediction of the separate heads to select the best camera pose and plane correspondence. (d) Finally, we use continuous optimization to update the camera and plane parameters.

Usage Instructions

Citation

If you find this code useful, please consider citing:

@inproceedings{jin2021planar,
      title={Planar Surface Reconstruction from Sparse Views}, 
      author={Linyi Jin and Shengyi Qian and Andrew Owens and David F. Fouhey},
      booktitle = {ICCV},
      year={2021}
}

Acknowledgment

We thank Dandan Shan, Mohamed El Banani, Nilesh Kulkarni, Richard Higgins for helpful discussions. Toyota Research Institute ("TRI") provided funds to assist the authors with their research but this article solely reflects the opinions and conclusions of its authors and not TRI or any other Toyota entity.

[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

Related tags

Overview

Planar Surface Reconstruction From Sparse Views

Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey
University of Michigan
ICCV 2021 (Oral)

Usage Instructions

Citation

Acknowledgment

Owner

Linyi Jin

DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)

Like ThreeJS but for Python and based on wgpu

Tensorflow Implementation of SMU: SMOOTH ACTIVATION FUNCTION FOR DEEP NETWORKS USING SMOOTHING MAXIMUM TECHNIQUE

Official implementation of VaxNeRF (Voxel-Accelearated NeRF).

SeqAttack: a framework for adversarial attacks on token classification models

Kaggle: Cell Instance Segmentation

Rethinking Transformer-based Set Prediction for Object Detection

Source code for CAST - Crisis Domain Adaptation Using Sequence-to-sequence Transformers (Accepted to ISCRAM 2021, CorePaper).

Neural Turing Machines (NTM) - PyTorch Implementation

Election Exit Poll Prediction and U.S.A Presidential Speech Analysis using Machine Learning

Implements an infinite sum of poisson-weighted convolutions

CAST: Character labeling in Animation using Self-supervision by Tracking

Pytorch-diffusion - A basic PyTorch implementation of 'Denoising Diffusion Probabilistic Models'

A Model for Natural Language Attack on Text Classification and Inference

Official implementation of "Synthetic Temporal Anomaly Guided End-to-End Video Anomaly Detection" (ICCV Workshops 2021: RSL-CV).

prior-based-losses-for-medical-image-segmentation

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Learning Synthetic Environments and Reward Networks for Reinforcement Learning

How to train a CNN to 99% accuracy on MNIST in less than a second on a laptop

[CVPR 2021] Few-shot 3D Point Cloud Semantic Segmentation

[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

Related tags

Overview

Planar Surface Reconstruction From Sparse Views

Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey University of Michigan ICCV 2021 (Oral)

Usage Instructions

Citation

Acknowledgment

Owner

Linyi Jin

DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)

Like ThreeJS but for Python and based on wgpu

Tensorflow Implementation of SMU: SMOOTH ACTIVATION FUNCTION FOR DEEP NETWORKS USING SMOOTHING MAXIMUM TECHNIQUE

Official implementation of VaxNeRF (Voxel-Accelearated NeRF).

SeqAttack: a framework for adversarial attacks on token classification models

Kaggle: Cell Instance Segmentation

Rethinking Transformer-based Set Prediction for Object Detection

Source code for CAST - Crisis Domain Adaptation Using Sequence-to-sequence Transformers (Accepted to ISCRAM 2021, CorePaper).

Neural Turing Machines (NTM) - PyTorch Implementation

Election Exit Poll Prediction and U.S.A Presidential Speech Analysis using Machine Learning

Implements an infinite sum of poisson-weighted convolutions

CAST: Character labeling in Animation using Self-supervision by Tracking

Pytorch-diffusion - A basic PyTorch implementation of 'Denoising Diffusion Probabilistic Models'

A Model for Natural Language Attack on Text Classification and Inference

Official implementation of "Synthetic Temporal Anomaly Guided End-to-End Video Anomaly Detection" (ICCV Workshops 2021: RSL-CV).

prior-based-losses-for-medical-image-segmentation

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Learning Synthetic Environments and Reward Networks for Reinforcement Learning

How to train a CNN to 99% accuracy on MNIST in less than a second on a laptop

[CVPR 2021] Few-shot 3D Point Cloud Semantic Segmentation

Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey
University of Michigan
ICCV 2021 (Oral)