Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Last update: Apr 04, 2022

Related tags

Deep Learning FSAC

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

torch >= 1.0

torchvision >= 0.2.0

Python 3

Environmental settings

This repository is developed using python 3.6.12 on Ubuntu 16.04.5 LTS. The CUDA and pytorch version is 11.2 and 1.7.1. We use one NVIDIA 3090 GPU card for training and testing.

Dataset

PASCAL VOC, Watercolor, Cityscapes, Foggycityscapes -> Please follow the instructions in [Link] to prepare the datasets.

Daytime-Sunny, Dusk-Rainy, and Night-Rainy -> Dataset preparation instruction link [Link].

Code

Faster R-CNN -> Thanks for jwyang [Link]; Fourier Domain Adaptation -> Thanks for Yanchao Yang [Link].

Our Augmentation (Mix+Replace+Extend+Disorder).

Train

To train a faster R-CNN model with vgg16 on pascal_voc:

CUDA_VISIBLE_DEVICES=$GPU_ID python trainval_net.py --dataset pascal_voc --net vgg16 --bs 1 --cuda

And you need to add augmentated data in the loadpath by creating a new dataset_name variable.

Test

To test:

python test_net.py --dataset pascal_voc --net vgg16 --modelpath your modelpath --cuda

Augmentation

Daytime-Sunny -> Dusk-Rainy

Daytime-Sunny -> Night-Rainy

Result

Results on adaptation from Cityscapes to FoggyCityscapes. ‘prsn’, ‘mcycl’, and ‘bcycl’ separately denote ‘person’, ‘motorcycle’, and ‘bicycle’ category.

Results on adaptation from Daytime-sunny to Duskrainy. Here, we directly run the released codes of the compared methods to obtain the results.

Results on Daytime-sunny → Night-rainy.

Results on the compound target domain.

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Related tags

Overview

Frequency Spectrum Augmentation Consistency for Domain Adaptive Object Detection

Main requirements

Environmental settings

Dataset

Code

Train

Test

Augmentation

Result

Owner

A framework for Quantification written in Python

Logsig-RNN: a novel network for robust and efficient skeleton-based action recognition

PyTorch implementation for our paper "Deep Facial Synthesis: A New Challenge"

Gender Classification Machine Learning Model using Sk-learn in Python with 97%+ accuracy and deployment

TakeInfoatNistforICS - Take Information in NIST NVD for ICS

STMTrack: Template-free Visual Tracking with Space-time Memory Networks

Distributing reference energies for SMIRNOFF implementations

An end-to-end project on customer segmentation

Multi-agent reinforcement learning algorithm and environment

Fast and simple implementation of RL algorithms, designed to run fully on GPU.

Data reduction pipeline for KOALA on the AAT.

DFM: A Performance Baseline for Deep Feature Matching

A PyTorch Implementation of FaceBoxes

The implementation of the algorithm in the paper "Safe Deep Semi-Supervised Learning for Unseen-Class Unlabeled Data" published in ICML 2020.

Semantic segmentation task for ADE20k & cityscapse dataset, based on several models.

nfelo: a power ranking, prediction, and betting model for the NFL

Repo for my Tensorflow/Keras CV experiments. Mostly revolving around the Danbooru20xx dataset

ManipulaTHOR, a framework that facilitates visual manipulation of objects using a robotic arm

Beyond Image to Depth: Improving Depth Prediction using Echoes (CVPR 2021)

Heterogeneous Temporal Graph Neural Network