3D-Reconstruction: Monocular Multi-View 3D Reconstruction Based on Deep Learning

Last update: Dec 26, 2022

Related tags

Deep Learning 3D-Reconstruction

Overview

Monocular Multi-View 3D Reconstruction Based on Deep Learning

Part I: 3D Reconstruction

Code: Part1

Technical documentation: [Markdown] [PDF]

Source images: Original Images

Point clouds: Point Cloud Results-1

Rendered results:

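Part II: Point-Cloud-to-Point-Cloud Window Recognition Based on Computer Vision Methods

Code: Part2

Technical documentation: [Markdown] [PDF]

Point clouds: Point Cloud Results-2

Algorithm flowchart: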

Part III: Image-to-Point-Cloud Semantic Segmentation Based on ResNeSt

Code: Part3

Technical documentation: [Markdown] [PDF]

Segmentation output: Semantic Segmentation Results

Point clouds: Point Cloud Results-3

Rendered results:

References

AA-RMVSNet [arXiv] [CVF] [PDF]

Wei Z, Zhu Q, Min C, et al. AA-RMVSNet: Adaptive aggregation recurrent multi-view stereo network[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021: 6187-6196.

Cascade-MVSNet [arXiv] [CVF] [PDF]

Gu X, Fan Z, Zhu S, et al. Cascade cost volume for high-resolution multi-view stereo and stereo matching[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 2495-2504.

TransMVSNet [arXiv] [PDF]

Ding Y, Yuan W, Zhu Q, et al. TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers[J]. arXiv preprint arXiv:2111.14600, 2021.

LoFTR [arXiv] [CVF] [PDF]

Sun J, Shen Z, Wang Y, et al. LoFTR: Detector-free local feature matching with transformers[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 8922-8931.

PatchmatchNet [arXiv] [CVF] [PDF]

Wang F, Galliani S, Vogel C, et al. PatchmatchNet: Learned Multi-View Patchmatch Stereo[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 14194-14203.

ResNeSt [arXiv] [PDF]

Zhang H, Wu C, Zhang Z, et al. ResNeSt: Split-attention networks[J]. arXiv preprint arXiv:2004.08955, 2020.

Acknowledgments

Sparse reconstruction uses COLMAP to obtain the camera parameters.
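As a rough illustration of how camera parameters from a COLMAP sparse reconstruction might be consumed downstream, the sketch below parses the text-format `cameras.txt` export and builds an intrinsic matrix. It assumes the model was exported in text format and uses the `PINHOLE` or `SIMPLE_PINHOLE` camera model. The `Camera` class, the helper names, and the sample values are hypothetical and are not taken from this repository.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Camera:
    camera_id: int
    model: str
    width: int
    height: int
    params: List[float]


def parse_cameras_txt(text: str) -> Dict[int, Camera]:
    """Parse the contents of a COLMAP text-format cameras.txt file."""
    cameras = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and header comments
        fields = line.split()
        cam = Camera(
            camera_id=int(fields[0]),
            model=fields[1],
            width=int(fields[2]),
            height=int(fields[3]),
            params=[float(v) for v in fields[4:]],
        )
        cameras[cam.camera_id] = cam
    return cameras


def intrinsic_matrix(cam: Camera) -> List[List[float]]:
    """Build a 3x3 K matrix for the two pinhole models handled here."""
    if cam.model == "PINHOLE":            # params: fx, fy, cx, cy
        fx, fy, cx, cy = cam.params
    elif cam.model == "SIMPLE_PINHOLE":   # params: f, cx, cy
        f, cx, cy = cam.params
        fx = fy = f
    else:
        raise ValueError(f"unsupported camera model: {cam.model}")
    return [[fx, 0.0, cx], [0.0, fy, cy], [0.0, 0.0, 1.0]]


# Made-up sample values, for illustration only.
SAMPLE = """# Camera list with one line of data per camera:
#   CAMERA_ID, MODEL, WIDTH, HEIGHT, PARAMS[]
1 PINHOLE 1920 1080 1500.0 1510.0 960.0 540.0
"""
cameras = parse_cameras_txt(SAMPLE)
K = intrinsic_matrix(cameras[1])
```

Camera models with distortion terms are deliberately rejected here rather than silently truncated, since ignoring distortion would change the projection.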

The dense reconstruction code is mainly derived from AA-RMVSNet.
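Learning-based multi-view stereo networks in the MVSNet family predict a depth map per view, which is then lifted into 3D and fused into a point cloud. The sketch below shows only the lifting step under a pinhole camera model. It is a simplified illustration, not code from AA-RMVSNet or this repository: the function name and the toy values are hypothetical, and the camera-to-world transform and the multi-view consistency filtering used in real fusion are omitted.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]


def backproject_depth(depth: List[List[float]], fx: float, fy: float,
                      cx: float, cy: float) -> List[Point]:
    """Lift every pixel with a positive depth into camera-frame 3D coordinates."""
    points = []
    for v, row in enumerate(depth):      # v: pixel row (y)
        for u, d in enumerate(row):      # u: pixel column (x)
            if d <= 0.0:
                continue                 # treat non-positive depth as invalid
            x = (u - cx) * d / fx
            y = (v - cy) * d / fy
            points.append((x, y, d))
    return points


# Toy 2x2 depth map with one invalid pixel (values are made up).
depth_map = [[2.0, 0.0],
             [4.0, 2.0]]
cloud = backproject_depth(depth_map, fx=1.0, fy=1.0, cx=0.5, cy=0.5)
```

In practice the loop would be vectorized over the whole image; the explicit form is used here to keep the geometry visible.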

Point cloud cropping and visualization were done with CloudCompare and MeshLab.

Surface reconstruction calls Open3D.

The Cascade+Transformer code is mainly based on kwea123's pytorch-lightning implementation of Cascade-MVSNet and on LoFTR.

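Some ideas in the window recognition algorithm draw on Color Space's rectangle detection algorithm; the image processing techniques are mainly based on Gonzalez's Digital Image Processing (3rd edition).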
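To give a feel for what rectangle detection can look like, the sketch below finds rectangle-like blobs in a binary mask by labeling connected components and keeping those that nearly fill their bounding box. This is a generic stand-in, not the algorithm used in Part II and not Color Space's method. The function name, the `min_fill` and `min_area` thresholds, and the toy mask are all assumptions made for illustration.

```python
from collections import deque
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (row_min, col_min, row_max, col_max), inclusive


def find_rectangles(mask: List[List[int]], min_fill: float = 0.9,
                    min_area: int = 4) -> List[Box]:
    """Return bounding boxes of 4-connected blobs that fill their box well."""
    rows, cols = len(mask), len(mask[0])
    seen = [[False] * cols for _ in range(rows)]
    boxes = []
    for r0 in range(rows):
        for c0 in range(cols):
            if not mask[r0][c0] or seen[r0][c0]:
                continue
            # Breadth-first flood fill over one connected component.
            queue = deque([(r0, c0)])
            seen[r0][c0] = True
            count = 0
            rmin = rmax = r0
            cmin = cmax = c0
            while queue:
                r, c = queue.popleft()
                count += 1
                rmin, rmax = min(rmin, r), max(rmax, r)
                cmin, cmax = min(cmin, c), max(cmax, c)
                for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
                    inside = 0 <= nr < rows and 0 <= nc < cols
                    if inside and mask[nr][nc] and not seen[nr][nc]:
                        seen[nr][nc] = True
                        queue.append((nr, nc))
            # A blob is rectangle-like if it covers most of its bounding box.
            box_area = (rmax - rmin + 1) * (cmax - cmin + 1)
            if box_area >= min_area and count / box_area >= min_fill:
                boxes.append((rmin, cmin, rmax, cmax))
    return boxes


# Toy mask: a solid 2x3 block (kept) and an L-shaped blob (rejected).
mask = [
    [1, 1, 1, 0, 0, 0],
    [1, 1, 1, 0, 1, 0],
    [0, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 1, 1],
]
windows = find_rectangles(mask)
```

The fill-ratio test only handles axis-aligned rectangles; rotated windows would need a different criterion, such as a minimum-area bounding rectangle.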

The semantic segmentation part calls PyTorch-Encoding.
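One common way to carry 2D segmentation labels over to a point cloud is to project each 3D point into a labeled image and read off the class at that pixel. The sketch below illustrates that idea under a pinhole model with points already expressed in the camera frame. It is an assumption about how such a transfer could work, not a description of the Part III implementation: the function name and class ids are hypothetical, and occlusion handling and multi-view voting are left out.

```python
from typing import List, Optional, Sequence, Tuple

Point = Tuple[float, float, float]


def label_points(points: Sequence[Point], label_map: List[List[int]],
                 fx: float, fy: float, cx: float, cy: float) -> List[Optional[int]]:
    """Assign each camera-frame point the label of the pixel it projects to."""
    height, width = len(label_map), len(label_map[0])
    labels: List[Optional[int]] = []
    for x, y, z in points:
        if z <= 0.0:
            labels.append(None)              # behind the camera
            continue
        u = int(round(fx * x / z + cx))      # pixel column
        v = int(round(fy * y / z + cy))      # pixel row
        if 0 <= u < width and 0 <= v < height:
            labels.append(label_map[v][u])
        else:
            labels.append(None)              # projects outside the image
    return labels


# Toy 2x2 label image: 0 = wall, 1 = window (class ids are made up).
label_image = [[0, 1],
               [1, 1]]
pts = [(-1.0, -1.0, 2.0), (1.0, 1.0, 2.0), (0.0, 0.0, -1.0), (10.0, 0.0, 1.0)]
point_labels = label_points(pts, label_image, fx=1.0, fy=1.0, cx=0.5, cy=0.5)
```

Points that receive `None` from one view can still be labeled from another view in which they are visible.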

Releases(7)

7(Feb 16, 2022)

White mesh generated by NeuS
Source code(tar.gz)
Source code(zip)
dongbeiya_neus.ply(11.21 MB)
gym_north_neus.ply(21.28 MB)
gym_south_neus.ply(16.59 MB)
6(Feb 16, 2022)

White mesh generated by COLMAP and MeshLab
Source code(tar.gz)
Source code(zip)
dongbeiya.ply(19.11 MB)
dongbeiya.png(8.45 MB)
gym_north.ply(31.93 MB)
gym_north.png(8.73 MB)
gym_south.ply(26.97 MB)
gym_south.png(9.32 MB)
5(Dec 29, 2021)

Original images for reconstruction
Source code(tar.gz)
Source code(zip)
PIC2.zip(755.68 MB)
PIC2.z01(900.00 MB)
PIC2.z02(900.00 MB)
dby.zip(735.16 MB)
dby.z02(900.00 MB)
dby.z01(900.00 MB)
4(Dec 19, 2021)

Semantic Segmentation Results of Problem 3
Source code(tar.gz)
Source code(zip)
filtered_segmentation_result_dongbeiya.zip(661.17 MB)
filtered_segmentation_result_gym.zip(786.65 MB)
segmentation_result_dongbeiya.zip(64.31 MB)
segmentation_result_dongbeiya_block.zip(53.27 MB)
segmentation_result_gym.zip(4.72 MB)
3(Dec 19, 2021)

Point Cloud Results of Problem 3
Source code(tar.gz)
Source code(zip)
2(Dec 19, 2021)

Point Cloud Results of Problem 2
Source code(tar.gz)
Source code(zip)
gym_south_window.ply(627.30 MB)
gym_north_window.ply(808.62 MB)
dongbeiya_window.ply(1800.53 MB)
gym_window.ply(1603.31 MB)
1(Dec 19, 2021)

Point Cloud Results of Problem 1
Source code(tar.gz)
Source code(zip)
dongbeiya.ply(731.13 MB)
gym_south.ply(696.19 MB)
gym_north.ply(707.89 MB)
gym.ply(1404.08 MB)

Owner

HMT_Curo

GitHub Repository
