the official code for ICRA 2021 Paper: "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation"

Last update: Jul 27, 2022

Related tags

Deep Learning G2S

Overview

G2S

This is the official code for ICRA 2021 Paper: Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation by Hemang Chawla, Arnav Varma, Elahe Arani and Bahram Zonooz.

G2S (GPS-to-Scale) Loss is a dynamically-weighted loss that can be added to the appearance-based losses to train any monocular self-supervised depth estimation architecture to get scale-consistant and scale-aware depth estimates at inference.

Here, we provide helper GPS dataloader and the G2S loss classes for using this loss with any model.

For details, please see the Paper and Presentation.

KITTI GPS

The GPS files containing geodesic gps information of raw kitti dataset in local coordinates for training with the g2s loss can be found in the assets folder as kitti_gps_raw.zip.
Unzip the file at /path/to/KITTI/raw_data/sync to merge the GPS files in the expected directory tree structure.

Usage

You can use the G2S class in lossG2S.py within your project for scale-consistent and -aware predictions. This requires using the copresent GPS modality along with images. To load the GPS, please adopt the GPSDataloader class within dataloaderGPS.py into your images dataloader.

Cite Our Work

If you find the code useful in your research, please consider citing our paper:

@inproceedings{chawlavarma2021multimodal,
	author={H. {Chawla} and A. {Varma} and E. {Arani} and B. {Zonooz}},
	booktitle={2021 IEEE International Conference on Robotics and Automation (ICRA)},
	title={Multimodal Scale Consistency and Awareness for Monocular Self-Supervised
	Depth Estimation},
	location={Xi’an, China},
	publisher={IEEE (in press)},
	year={2021}
}

License

This project is licensed under the terms of the MIT license.

the official code for ICRA 2021 Paper: "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation"

Related tags

Overview

G2S

KITTI GPS

Usage

Cite Our Work

License

Owner

NeurAI

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021 Accepted

Self-Supervised Learning with Kernel Dependence Maximization

This repository provides the official implementation of 'Learning to ignore: rethinking attention in CNNs' accepted in BMVC 2021.

Diverse Image Generation via Self-Conditioned GANs

Decoding the Protein-ligand Interactions Using Parallel Graph Neural Networks

Deep Implicit Moving Least-Squares Functions for 3D Reconstruction

Yolo object detection - Yolo object detection with python

Relative Uncertainty Learning for Facial Expression Recognition

Lepard: Learning Partial point cloud matching in Rigid and Deformable scenes

Using Hotel Data to predict High Value And Potential VIP Guests

This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL 2021.

Artificial intelligence technology inferring issues and logically supporting facts from raw text

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Multiple-Object Tracking with Transformer

Tensorflow implementation of Fully Convolutional Networks for Semantic Segmentation

Illuminated3D This project participates in the Nasa Space Apps Challenge 2021.

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Reporting and Visualization for Hazardous Events

EEGEyeNet is benchmark to evaluate ET prediction based on EEG measurements with an increasing level of difficulty

S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)