Rotation Robust Descriptors

Last update: Nov 15, 2022

Overview

RoRD

Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching

Project Page | Paper link

Evaluation and Datasets

MMA : Training on PhotoTourism and testing on HPatches and proposed Rotated HPatches
Pose Estimation : Training on same PhotoTourism datasets as used for MMA and testing on proposed DiverseView
Visual Place Recognition : Oxford RobotCar training sequence and testing sequence

Pretrained Models

Download models from Google Drive (73.9 MB) in the base directory.

Evaluating RoRD

You can evaluate RoRD on demo images or replace it with your custom images.

Dependencies can be installed in a conda of virtualenv by running:
1. pip install -r requirements.txt
python extractMatch.py <rgb_image1> <rgb_image2> --model_file <path to the model file RoRD>
Example:
python extractMatch.py demo/rgb/rgb1_1.jpg demo/rgb/rgb1_2.jpg --model_file models/rord.pth
This should give you output like this:

RoRD

SIFT

DiverseView Dataset

Download dataset from Google Drive (97.8 MB) in the base directory (only needed if you want to evaluate on DiverseView Dataset).

Evaluation on DiverseView Dataset

The DiverseView Dataset is a custom dataset consisting of 4 scenes with images having high-angle camera rotations and viewpoint changes.

Pose estimation on single image pair of DiverseView dataset:
1. cd demo
2. python register.py --rgb1 <path to rgb image 1> --rgb2 <path to rgb image 2> --depth1 <path to depth image 1> --depth2 <path to depth image 2> --model_rord <path to the model file RoRD>
3. Example:
  python register.py --rgb1 rgb/rgb2_1.jpg --rgb2 rgb/rgb2_2.jpg --depth1 depth/depth2_1.png --depth2 depth/depth2_2.png --model_rord ../models/rord.pth
4. This should give you output like this:

RoRD matches in perspective view

RoRD matches in orthographic view

To visualize the registered point cloud, use --viz3d command:
1. python register.py --rgb1 rgb/rgb2_1.jpg --rgb2 rgb/rgb2_2.jpg --depth1 depth/depth2_1.png --depth2 depth/depth2_2.png --model_rord ../models/rord.pth --viz3d

PointCloud registration using correspondences

Pose estimation on a sequence of DiverseView dataset:
1. cd evaluation/DiverseView/
2. python evalRT.py --dataset <path to DiverseView dataset> --sequence <sequence name> --model_rord <path to RoRD model> --output_dir <name of output dir>
3. Example:
  1. python evalRT.py --dataset /path/to/preprocessed/ --sequence data1 --model_rord ../../models/rord.pth --output_dir out
4. This would generate out folder containing predicted transformations and matching results in out/vis folder, containing images like below:

RoRD

Training RoRD on PhotoTourism Images

Training using rotation homographies with initialization from D2Net weights (Download base models as mentioned in Pretrained Models).
Download branderburg_gate dataset that is used in the configs/train_scenes_small.txt from here(5.3 Gb) in phototourism folder.

Folder stucture should be:

phototourism/  
___ brandenburg_gate  
___ ___ dense  
___ ___	___ images  
___ ___	___ stereo  
___ ___	___ sparse

python trainPT_ipr.py --dataset_path <path_to_phototourism_folder> --init_model models/d2net.pth --plot

TO-DO

Provide VPR code
Provide combine training of RoRD + D2Net
Provide code for calculating error in Diverseview Dataset

Credits

Our base model is borrowed from D2-Net.

BibTex

If you use this code in your project, please cite the following paper:

@misc{rord2021,
      title={RoRD: Rotation-Robust Descriptors and Orthographic Views for Local Feature Matching}, 
      author={Udit Singh Parihar and Aniket Gujarathi and Kinal Mehta and Satyajit Tourani and Sourav Garg and Michael Milford and K. Madhava Krishna},
      year={2021},
      eprint={2103.08573},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Rotation Robust Descriptors

Related tags

Overview

RoRD

Evaluation and Datasets

Pretrained Models

Evaluating RoRD

RoRD

SIFT

DiverseView Dataset

Evaluation on DiverseView Dataset

RoRD matches in perspective view

RoRD matches in orthographic view

PointCloud registration using correspondences

RoRD

Training RoRD on PhotoTourism Images

TO-DO

Credits

BibTex

Owner

Udit Singh Parihar

neural image generation

The datasets and code of ACL 2021 paper "Aspect-Category-Opinion-Sentiment Quadruple Extraction with Implicit Aspects and Opinions".

Reducing Information Bottleneck for Weakly Supervised Semantic Segmentation (NeurIPS 2021)

Keras implementation of Normalizer-Free Networks and SGD - Adaptive Gradient Clipping

Multi-Object Tracking in Satellite Videos with Graph-Based Multi-Task Modeling

NeuTex: Neural Texture Mapping for Volumetric Neural Rendering

(CVPR 2022 - oral) Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

CS583: Deep Learning

Spectrum Surveying: Active Radio Map Estimation with Autonomous UAVs

A simple configurable bot for sending arXiv article alert by mail

This repository contains implementations and illustrative code to accompany DeepMind publications

A Real-World Benchmark for Reinforcement Learning based Recommender System

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

STBP is a way to train SNN with datasets by Backward propagation.

[CVPR 2021] MiVOS - Mask Propagation module. Reproduced STM (and better) with training code :star2:. Semi-supervised video object segmentation evaluation.

Codes and pretrained weights for winning submission of 2021 Brain Tumor Segmentation (BraTS) Challenge

NOD: Taking a Closer Look at Detection under Extreme Low-Light Conditions with Night Object Detection Dataset

Harmonious Textual Layout Generation over Natural Images via Deep Aesthetics Learning

[CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong Chen, Zhenyu Zhang, Yu Cheng, Ahmed Awadallah, Zhangyang Wang

Using pretrained GROVER to extract the atomic fingerprints from molecule