[CoRL 21'] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Last update: Jan 04, 2023

Overview

TANDEM: Tracking and Dense Mapping
in Real-time using Deep Multi-view Stereo

Lukas Koestler^1* Nan Yang^1,2*,† Niclas Zeller^2,3 Daniel Cremers^1,2

^*equal contribution ^†corresponding author

¹Technical University of Munich ²Artisense
³Karlsruhe University of Applied Sciences

Conference on Robot Learning (CoRL) 2021, London, UK

3DV 2021 Best Demo Award

arXiv | Video | OpenReview | Project Page

Code and Data

📣 CVA-MVSNet released! Please check cva_mvsnet/.
📣 Replica training data released! Please check replica/.
C++ code realse before Christmas. Thank you for your patience!

Abstract

In this paper, we present TANDEM a real-time monocular tracking and dense mapping framework. For pose estimation, TANDEM performs photometric bundle adjustment based on a sliding window of keyframes. To increase the robustness, we propose a novel tracking front-end that performs dense direct image alignment using depth maps rendered from a global model that is built incrementally from dense depth predictions. To predict the dense depth maps, we propose Cascade View-Aggregation MVSNet (CVA-MVSNet) that utilizes the entire active keyframe window by hierarchically constructing 3D cost volumes with adaptive view aggregation to balance the different stereo baselines between the keyframes. Finally, the predicted depth maps are fused into a consistent global map represented as a truncated signed distance function (TSDF) voxel grid. Our experimental results show that TANDEM outperforms other state-of-the-art traditional and learning-based monocular visual odometry (VO) methods in terms of camera tracking. Moreover, TANDEM shows state-of-the-art real-time 3D reconstruction performance.

[CoRL 21'] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Related tags

Overview

TANDEM: Tracking and Dense Mapping
in Real-time using Deep Multi-view Stereo

Code and Data

Abstract

Poster

Owner

TUM Computer Vision Group

Graph Representation Learning via Graphical Mutual Information Maximization

DiscoNet: Learning Distilled Collaboration Graph for Multi-Agent Perception [NeurIPS 2021]

QAT(quantize aware training) for classification with MQBench

Libraries, tools and tasks created and used at DeepMind Robotics.

PyTorch-based framework for Deep Hedging

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

Official Implementation of Neural Splines

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Magisk module to enable hidden features on Android 12 Developer Preview 1.

Modified fork of Xuebin Qin's U-2-Net Repository. Used for demonstration purposes.

Learning to Predict Gradients for Semi-Supervised Continual Learning

DANet for Tabular data classification/ regression.

An educational AI robot based on NVIDIA Jetson Nano.

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it

The code release of paper Low-Light Image Enhancement with Normalizing Flow

Probabilistic Cross-Modal Embedding (PCME) CVPR 2021

From Perceptron model to Deep Neural Network from scratch in Python.

Multi-task yolov5 with detection and segmentation based on yolov5

Implementation of paper "Decision-based Black-box Attack Against Vision Transformers via Patch-wise Adversarial Removal"

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

[CoRL 21'] TANDEM: Tracking and Dense Mapping in Real-time using Deep Multi-view Stereo

Related tags

Overview

TANDEM: Tracking and Dense Mappingin Real-time using Deep Multi-view Stereo

Code and Data

Abstract

Poster

Owner

TUM Computer Vision Group

Graph Representation Learning via Graphical Mutual Information Maximization

DiscoNet: Learning Distilled Collaboration Graph for Multi-Agent Perception [NeurIPS 2021]

QAT(quantize aware training) for classification with MQBench

Libraries, tools and tasks created and used at DeepMind Robotics.

PyTorch-based framework for Deep Hedging

Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition

Official Implementation of Neural Splines

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows" on Object Detection and Instance Segmentation.

Magisk module to enable hidden features on Android 12 Developer Preview 1.

Modified fork of Xuebin Qin's U-2-Net Repository. Used for demonstration purposes.

Learning to Predict Gradients for Semi-Supervised Continual Learning

DANet for Tabular data classification/ regression.

An educational AI robot based on NVIDIA Jetson Nano.

Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it

The code release of paper Low-Light Image Enhancement with Normalizing Flow

Probabilistic Cross-Modal Embedding (PCME) CVPR 2021

From Perceptron model to Deep Neural Network from scratch in Python.

Multi-task yolov5 with detection and segmentation based on yolov5

Implementation of paper "Decision-based Black-box Attack Against Vision Transformers via Patch-wise Adversarial Removal"

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

TANDEM: Tracking and Dense Mapping
in Real-time using Deep Multi-view Stereo