[BMVC 2021] Official PyTorch Implementation of Self-supervised learning of Image Scale and Orientation Estimation

Last update: Nov 10, 2022

Related tags

Overview

Self-Supervised Learning of Image Scale and Orientation Estimation (BMVC 2021)

This is the official implementation of the paper "Self-Supervised Learning of Image Scale and Orientation Estimation" by Jongmin Lee [Google Scholar], Yoonwoo Jeong [Google Scholar], and Minsh Cho [Google Scholar]. We introduce a self-supervised framework for learning patch pose. Given a rescaled/rotated pair of image patches, we feed them to the patch pose estimation networks that output scale/orientation histograms for each. We compare the output histogram vectors by the histogram alignment technique and compute the loss.

Requirements

Ubuntu 18.04
python 3.8
pytorch 1.8.1
torchvision 0.9.1
wandb 0.10.28

Environment

Clone the Git repository

git clone https://github.com/bluedream1121/SelfScaOri.git

Install dependency

Run the script to install all the dependencies. You need to provide the conda install path (e.g. ~/anaconda3) and the name for the created conda environment.

bash install.sh conda_install_path self-sca-ori

Dataset preparation

You can download the training/test dataset using the following scripts:

cd datasets
bash download.sh

If you want to regenerate the patchPose datasets, please run the following script:

cd datasets/patchpose_dataset_generation
bash generation_script.sh

Trained models

cd trained_models
bash download_ori_model.sh
bash download_sca_model.sh

Test on the patchPose and the HPatches

After download the datasets and the pre-trained models, you can evaluate the patch pose estimation results using the following scripts:

python test.py --load trained_models/_*branchori/best_model.pt  --dataset_type ppa_ppb
python test.py --load trained_models/_*branchsca/best_model.pt  --dataset_type ppa_ppb

python test.py --load trained_models/_*branchori/best_model.pt  --dataset_type hpa
python test.py --load trained_models/_*branchsca/best_model.pt  --dataset_type hpa

Training

You can train the networks for patch scale estimation and orientation estimation using the proposed histogram alignment loss as follows:

python train.py --branch ori --output_ori 36

python train.py --branch sca --output_sca 13

Citation

If you find our code or paper useful to your research work, please consider citing our work using the following bibtex:

@inproceedings{lee2021self,
    author   = {},
    title    = {},
    booktitle= {},
    year     = {2021}
}

Contact

Jongmin Lee ([email protected])

Questions can also be left as issues in the repository.

[BMVC 2021] Official PyTorch Implementation of Self-supervised learning of Image Scale and Orientation Estimation

Related tags

Overview

Self-Supervised Learning of Image Scale and Orientation Estimation (BMVC 2021)

Requirements

Environment

Clone the Git repository

Install dependency

Dataset preparation

Trained models

Test on the patchPose and the HPatches

Training

Citation

Contact

Owner

Jongmin Lee

Towards Understanding Quality Challenges of the Federated Learning: A First Look from the Lens of Robustness

An implementation of "Learning human behaviors from motion capture by adversarial imitation"

Deep Learning Head Pose Estimation using PyTorch.

Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation"

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

The repo contains the code of the ACL2020 paper `Dice Loss for Data-imbalanced NLP Tasks`

DatasetGAN: Efficient Labeled Data Factory with Minimal Human Effort

UV matrix decompostion using movielens dataset

SWA Object Detection

A collection of metrics for evaluating timbre dissimilarity using the TorchMetrics API

Official implementation of "Can You Spot the Chameleon? Adversarially Camouflaging Images from Co-Salient Object Detection" in CVPR 2022.

World Models with TensorFlow 2

🛠️ Tools for Transformers compression using Lightning ⚡

High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

TransNet V2: Shot Boundary Detection Neural Network

Based on the given clinical dataset, Predict whether the patient having Heart Disease or Not having Heart Disease

[NeurIPS 2019] Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss

Image Segmentation using U-Net, U-Net with skip connections and M-Net architectures

Code associated with the paper "Towards Understanding the Data Dependency of Mixup-style Training".

Implementation of SwinTransformerV2 in TensorFlow.