Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation

This is the official repository for our paper Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation, to appear in CVPR 2021. Code will be released prior to the conference.

Abstract

Absolute camera pose estimation is usually addressed by sequentially solving two distinct subproblems: first, a feature matching problem that seeks to establish putative 2D-3D correspondences, and then a Perspective-n-Point problem that minimizes, with respect to the camera pose, the sum of so-called Reprojection Errors (RE). We argue that generating putative 2D-3D correspondences 1) leads to an important loss of information that needs to be compensated as far as possible, within RE, through the choice of a robust loss and the tuning of its hyperparameters and 2) may lead to an RE that conveys erroneous data to the pose estimator. In this paper, we introduce the Neural Reprojection Error (NRE) as a substitute for RE. NRE allows us to rethink the camera pose estimation problem by merging it with the feature learning problem, hence leveraging richer information than 2D-3D correspondences and eliminating the need for choosing a robust loss and its hyperparameters. Thus, NRE can be used as a training loss to learn image descriptors tailored for pose estimation. We also propose a coarse-to-fine optimization method able to very efficiently minimize a sum of NRE terms with respect to the camera pose. We experimentally demonstrate that NRE is a good substitute for RE as it significantly improves both the robustness and the accuracy of the camera pose estimate while being highly efficient in both computation and memory. From a broader point of view, we believe this new way of merging deep learning and 3D geometry may be useful in other computer vision applications.
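
For readers unfamiliar with the classical baseline mentioned in the abstract, here is a minimal sketch of a sum of Reprojection Errors (RE) over putative 2D-3D correspondences. It is not the NRE and not code from this repository: the pinhole camera model, the Huber robust loss, and all function and parameter names (`project`, `huber`, `sum_reprojection_errors`, `delta`) are assumptions chosen for illustration.

```python
import numpy as np

def project(points_3d, R, t, K):
    """Project Nx3 world points to pixels with a pinhole camera (assumed model)."""
    cam = points_3d @ R.T + t        # world frame -> camera frame
    uvw = cam @ K.T                  # apply the 3x3 intrinsics matrix
    return uvw[:, :2] / uvw[:, 2:3]  # perspective division

def huber(r, delta=1.0):
    """Huber loss on non-negative residuals r; delta is a hand-tuned hyperparameter."""
    return np.where(r <= delta, 0.5 * r**2, delta * (r - 0.5 * delta))

def sum_reprojection_errors(points_2d, points_3d, R, t, K, delta=1.0):
    """Sum of robustified reprojection errors for a candidate pose (R, t)."""
    residuals = np.linalg.norm(project(points_3d, R, t, K) - points_2d, axis=1)
    return float(huber(residuals, delta).sum())

# Toy usage: identity pose, two 3D points, one 2D observation off by 3 pixels.
K = np.array([[500.0, 0.0, 320.0], [0.0, 500.0, 240.0], [0.0, 0.0, 1.0]])
R, t = np.eye(3), np.zeros(3)
pts_3d = np.array([[0.0, 0.0, 2.0], [1.0, 0.0, 2.0]])
pts_2d = project(pts_3d, R, t, K) + np.array([[0.0, 0.0], [3.0, 0.0]])
cost = sum_reprojection_errors(pts_2d, pts_3d, R, t, K)
```

A Perspective-n-Point solver would minimize this cost over `R` and `t`. The choice of `huber` and of `delta` is exactly the kind of robust loss and hyperparameter tuning that the abstract says NRE makes unnecessary.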

BibTeX

Please consider citing our work:

@inproceedings{germain2021NRE,
  author    = {Hugo Germain and
               Vincent Lepetit and
               Guillaume Bourmaud},
  title     = {Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation},
  booktitle = {CVPR},
  year      = {2021},
  url       = {https://arxiv.org/abs/2103.07153}
}

