Pixel-Perfect Structure-from-Motion with Featuremetric Refinement (ICCV 2021, Oral)

Last update: Dec 29, 2022

Overview

Pixel-Perfect Structure-from-Motion (ICCV 2021 Oral)

We introduce a framework that improves the accuracy of Structure-from-Motion by refining keypoints, camera poses, and 3D points using the direct alignment of deep features. It is presented in our paper:

Pixel-Perfect Structure-from-Motion with Featuremetric Refinement
to appear at ICCV 2021
Authors: Philipp Lindenberger*, Paul-Edouard Sarlin*, Viktor Larsson, and Marc Pollefeys
Website: psarlin.com/pixsfm (videos, slides, poster)

This repository will host the code to run and evaluate our refinement. Please subscribe to this issue if you wish to be notified of the code release.

Abstract

Finding local features that are repeatable across multiple views is a cornerstone of sparse 3D reconstruction. The classical image matching paradigm detects keypoints per-image once and for all, which can yield poorly-localized features and propagate large errors to the final geometry. In this paper, we refine two key steps of structure-from-motion by a direct alignment of low-level image information from multiple views: we first adjust the initial keypoint locations prior to any geometric estimation, and subsequently refine points and camera poses as a post-processing. This refinement is robust to large detection noise and appearance changes, as it optimizes a featuremetric error based on dense features predicted by a neural network. This significantly improves the accuracy of camera poses and scene geometry for a wide range of keypoint detectors, challenging viewing conditions, and off-the-shelf deep features. Our system easily scales to large image collections, enabling pixel-perfect crowd-sourced localization at scale. Our code will be publicly available as an add-on to the popular SfM software COLMAP.
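
To make the idea of a featuremetric error more concrete, here is a minimal toy sketch of keypoint adjustment between two views. It is not the code of this repository and does not reflect its API: the function names (make_feature_map, interpolate, refine_keypoint) are hypothetical, the dense features are synthetic smooth maps standing in for the output of a neural network, and the optimizer is a plain Gauss-Newton loop with finite-difference Jacobians, chosen here only for brevity. The actual method aligns features across many views at once; this sketch only aligns one noisy keypoint to a single reference.

```python
import numpy as np


def make_feature_map(height, width, shift_xy=(0.0, 0.0)):
    """Synthetic smooth dense features of shape (H, W, 4).

    Assumption: these stand in for CNN features; shift_xy simulates a
    second view that is a pure translation of the first.
    """
    ys, xs = np.mgrid[0:height, 0:width].astype(float)
    xs = xs - shift_xy[0]
    ys = ys - shift_xy[1]
    return np.stack(
        [0.05 * xs, 0.05 * ys, np.sin(0.1 * xs), np.cos(0.1 * ys)], axis=-1
    )


def interpolate(features, xy):
    """Bilinearly sample a (H, W, D) feature map at a subpixel (x, y)."""
    height, width, _ = features.shape
    x = float(np.clip(xy[0], 0.0, width - 1.001))
    y = float(np.clip(xy[1], 0.0, height - 1.001))
    x0, y0 = int(x), int(y)
    ax, ay = x - x0, y - y0
    top = (1 - ax) * features[y0, x0] + ax * features[y0, x0 + 1]
    bottom = (1 - ax) * features[y0 + 1, x0] + ax * features[y0 + 1, x0 + 1]
    return (1 - ay) * top + ay * bottom


def refine_keypoint(features, reference, xy_init, num_iters=10, step=0.5):
    """Gauss-Newton on the featuremetric error ||F(xy) - reference||^2."""
    xy = np.asarray(xy_init, dtype=float).copy()
    offsets = (np.array([step, 0.0]), np.array([0.0, step]))
    for _ in range(num_iters):
        residual = interpolate(features, xy) - reference
        # Central finite differences approximate the (D, 2) Jacobian dF/dxy.
        jac = np.stack(
            [
                (interpolate(features, xy + d) - interpolate(features, xy - d))
                / (2 * step)
                for d in offsets
            ],
            axis=1,
        )
        # Solve the (slightly damped) normal equations for the update.
        delta = np.linalg.solve(jac.T @ jac + 1e-9 * np.eye(2), -jac.T @ residual)
        xy += delta
    return xy


# View B is view A translated by (5, 3) pixels, so the true match is known.
feats_a = make_feature_map(48, 64)
feats_b = make_feature_map(48, 64, shift_xy=(5.0, 3.0))
kp_a = np.array([20.0, 30.0])
true_kp_b = kp_a + np.array([5.0, 3.0])
noisy_kp_b = true_kp_b + np.array([1.6, -1.2])  # simulated detection noise

reference = interpolate(feats_a, kp_a)
refined_kp_b = refine_keypoint(feats_b, reference, noisy_kp_b)

print("error before:", np.linalg.norm(noisy_kp_b - true_kp_b))
print("error after: ", np.linalg.norm(refined_kp_b - true_kp_b))
```

In this toy setting the detection is moved by comparing feature descriptors rather than by re-detecting or by minimizing a reprojection error, which is the distinction the abstract draws. Real features are far less smooth than these synthetic maps, so a practical implementation would need a more careful optimizer and a robust cost.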

BibTeX Citation

Please consider citing our work if you use any code from this repo or ideas presented in the paper:

@inproceedings{lindenberger2021pixsfm,
  author    = {Philipp Lindenberger and
               Paul-Edouard Sarlin and
               Viktor Larsson and
               Marc Pollefeys},
  title     = {{Pixel-Perfect Structure-from-Motion with Featuremetric Refinement}},
  booktitle = {ICCV},
  year      = {2021},
}

Owner

Computer Vision and Geometry Lab
