4th place solution for the PBVS 2022 Multi-modal Aerial View Object Classification Challenge - Track 1 (SAR) at PBVS2022

Last update: Nov 09, 2022

A Two-Stage Shake-Shake Network for Long-tailed Recognition of SAR Aerial View Objects

Challenge Site

Overview

Synthetic Aperture Radar (SAR) has received increasing attention in remote sensing because of its complementary strengths in capturing significant information. However, in the Aerial View Object Classification (AVOC) task, SAR images still suffer from a long-tailed distribution of aerial view objects. This disparity degrades the performance of classification methods, especially data-sensitive deep learning models. In this paper, we propose a two-stage shake-shake network to tackle the long-tailed learning problem. Specifically, it decouples the learning procedure into a representation learning stage and a classification learning stage. Moreover, we apply test-time augmentation (TTA) and a post-processing approach (CAN) to improve accuracy. In the PBVS 2022 Multi-modal Aerial View Object Classification Challenge Track 1, our method achieves 21.82% and 27.97% accuracy in the development phase and testing phase, respectively, placing it in the top tier among all participants.
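To make the long-tailed imbalance concrete: in decoupled training, the classifier stage is often trained with re-balanced classes. The sketch below computes inverse-frequency sampling weights in plain Python. It is an illustration only; whether this repository uses this exact scheme is not stated here, and `class_balanced_weights` is a hypothetical helper name.

```python
from collections import Counter


def class_balanced_weights(labels):
    """Return one sampling weight per sample (hypothetical helper).

    Each class receives the same total weight, so rare classes are
    drawn as often as frequent ones in expectation.
    """
    counts = Counter(labels)
    num_classes = len(counts)
    return [1.0 / (num_classes * counts[y]) for y in labels]


# Toy long-tailed label list: 8 samples of class 0, 2 samples of class 1.
labels = [0] * 8 + [1] * 2
weights = class_balanced_weights(labels)

# Total weight per class is equal (0.5 each in this toy case).
per_class = {c: sum(w for w, y in zip(weights, labels) if y == c) for c in set(labels)}
print(per_class)
```

Weights of this form could, for example, be passed to `torch.utils.data.WeightedRandomSampler` when building the data loader for the classifier stage.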

Requirements

Ubuntu (It's only tested on Ubuntu, so it may not work on Windows.)
Python >= 3.7
PyTorch >= 1.4.0
torchvision
```
pip install -r requirements.txt
```

Usage

The first stage training

python train.py --config ./configs/sar10/shake_shake.yaml

You need to change the values of “dataset_dir” and “dataset_dir_val” under the “dataset” field, and “output_dir” under the “train” field, in the file “./configs/sar10/shake_shake.yaml”.

The second stage training

python train.py --config ./configs/sar10/shake_shake_fc.yaml

You need to change the values of “dataset_dir” and “dataset_dir_val” under the “dataset” field, and “output_dir” and “checkpoint” under the “train” field, in the file “./configs/sar10/shake_shake_fc.yaml”.
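Before launching a run, it can help to confirm that the fields listed above are actually filled in. The following is a minimal sketch that checks a config already loaded into a nested dictionary. The helper `missing_fields` and the example paths are hypothetical and not part of this repository; only the field names come from the instructions above.

```python
def missing_fields(config, required):
    """Return the "section.key" names that are absent or empty (hypothetical helper)."""
    missing = []
    for section, key in required:
        value = config.get(section, {}).get(key)
        if not value:
            missing.append(f"{section}.{key}")
    return missing


# Fields the second-stage instructions ask you to set.
STAGE2_REQUIRED = [
    ("dataset", "dataset_dir"),
    ("dataset", "dataset_dir_val"),
    ("train", "output_dir"),
    ("train", "checkpoint"),
]

# Placeholder values for illustration; the checkpoint is deliberately left empty.
example_config = {
    "dataset": {"dataset_dir": "/data/sar/train", "dataset_dir_val": "/data/sar/val"},
    "train": {"output_dir": "experiments/stage2", "checkpoint": ""},
}

print(missing_fields(example_config, STAGE2_REQUIRED))  # ['train.checkpoint']
```

For the first stage, the same check applies without the `("train", "checkpoint")` entry.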

Test

python predict_TTA.py

You need to change the values of “dataset_dir” and “checkpoint” under the “test” field in the file “./configs/sar10/shake_shake.yaml”; the results are then written to the file “.result/results.csv”.
You can download the trained model here.
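For readers unfamiliar with test-time augmentation, the general idea is to run the model on several transformed views of the same image and average the predicted probabilities. The sketch below shows that idea in plain Python with a toy stand-in model. It does not reproduce `predict_TTA.py`: the choice of a horizontal flip, and the names `predict_with_tta`, `hflip` and `toy_model`, are assumptions made for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def hflip(image):
    """Horizontally flip an image stored as a list of rows."""
    return [row[::-1] for row in image]


def identity(image):
    return image


def predict_with_tta(model, image, transforms):
    """Average softmax probabilities over all transformed views.

    Returns (predicted class index, averaged probabilities).
    """
    summed = None
    for transform in transforms:
        probs = softmax(model(transform(image)))
        summed = probs if summed is None else [a + b for a, b in zip(summed, probs)]
    averaged = [p / len(transforms) for p in summed]
    predicted = max(range(len(averaged)), key=averaged.__getitem__)
    return predicted, averaged


def toy_model(image):
    """Stand-in for a trained network: returns two logits computed from pixel sums."""
    total = float(sum(sum(row) for row in image))
    left_column = float(sum(row[0] for row in image))
    return [total, left_column]


image = [[1, 0], [3, 0]]
predicted, averaged = predict_with_tta(toy_model, image, [identity, hflip])
print(predicted, averaged)
```

Averaging probabilities rather than raw logits keeps each view's contribution on the same scale; other aggregation rules are possible.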

Acknowledgement

The code borrows heavily from hysts/pytorch_image_classification.

Owner

LinpengPan
