Improving Object Detection by Label Assignment Distillation

Last update: Dec 08, 2022

Related tags

Overview

Improving Object Detection by Label Assignment Distillation

This is the official implementation of the WACV 2022 paper Improving Object Detection by Label Assignment Distillation. We provide the code for Label Assignement Distillation (LAD), training logs and several model checkpoints.

Introduction
Installation
Usage
Experiments
Citation

Introduction

This is the official repository for the paper Improving Object Detection by Label Assignment Distillation.

Soft Label Distillation concept (a)	Label Assignment Distillation concept (b)

Distillation in Object Detection is typically achived by mimicking the teacher's output directly, such soft-label distillation or feature mimicking (Fig 1.a).
We propose the concept of Label Assignment Distillation (LAD), which solves the label assignment problems from distillation perspective, thus allowing the student learns from the teacher's knowledge without direct mimicking (Fig 1.b). LAD is very general, and applied to many dynamic label assignment methods. Following figure shows a concrete example of how to adopt Probabilistic Anchor Assignment (PAA) to LAD.

Probabilistic Anchor Assignment (PAA)	Label Assignment Distillation (LAD) based on PAA

We demonstrate a number of advantages of LAD, notably that it is very simple and effective, flexible to use with most of detectors, and complementary to other distillation techniques.
Later, we introduced the Co-learning dynamic Label Assignment Distillation (CoLAD) to allow two networks to be trained mutually based on a dynamic switching criterion. We show that two networks trained with CoLAD are significantly better than if each was trained individually, given the same initialization.

Installation

Create environment:

conda create -n lad python=3.7 -y
conda activate lad

Install dependencies:

conda install pytorch=1.7.0 torchvision cudatoolkit=10.2 -c pytorch -y
pip install openmim future tensorboard sklearn timm==0.3.4
mim install mmcv-full==1.2.5
mim install mmdet==2.10.0
pip install -e ./

Usage

Train the model

#!/usr/bin/env bash
set -e
export GPUS=2
export CUDA_VISIBLE_DEVICES=0,2

CFG="configs/lad/paa_lad_r50_r101p1x_1x_coco.py"
WORKDIR="/checkpoints/lad/paa_lad_r50_r101p1x_1x_coco"

mim train mmdet $CFG --work-dir $WORKDIR \
    --gpus=$GPUS --launcher pytorch --seed 0 --deterministic

Test the model

#!/usr/bin/env bash
set -e
export GPUS=2
export CUDA_VISIBLE_DEVICES=0,2

CFG="configs/paa/paa_lad_r50_r101p1x_1x_coco.py"
CKPT="/checkpoints/lad/paa_lad_r50_r101p1x_1x_coco/epoch_12.pth"

mim test mmdet $CFG --checkpoint $CKPT --gpus $GPUS --launcher pytorch --eval bbox

Experiments

1. A Pilot Study - Preliminary Comparison.

Table 2: Compare the performance of the student PAA-R50 using Soft-Label, Label Assignment Distillation (LAD) and their combination (SoLAD) on COCO validation set.

Method	Teacher	Student	gamma	mAP	Improve	Config	Download
Baseline	None	PAA-R50 (baseline)	2	40.4	-	config	model \| log
Soft-Label-KL loss	PAA-R101	PAA-R50	0.5	41.3	+0.9	config	model \| log
LAD (ours)	PAA-R101	PAA-R50	2	41.6	+1.2	config	model \| log
SoLAD(ours)	PAA-R101	PAA-R50	0.5	42.4	+2.0	config	model \| log

2. A Pilot Study - Does LAD need a bigger teacher network?

Table 3: Compare Soft-Label and Label Assignment Distillation (LAD) on COCO validation set. Teacher and student use ResNet50 and ResNet101 backbone, respectively. 2× denotes the 2× training schedule.

Method	Teacher	Student	mAP	Improve	Config	Download
Baseline	None	PAA-R50	40.4	-	config	model \| log
Baseline (1x)	None	PAA-R101	42.6	-	config	model \| log
Baseline (2x)	None	PAA-R101	43.5	+0.9	config	model \| log
Soft-Label	PAA-R50	PAA-R101	40.4	-2.2	config	model \| log
LAD (our)	PAA-R50	PAA-R101	43.3	+0.7	config	model \| log

3. Compare with State-ot-the-art Label Assignment methods

We use the PAA-R50 3x pretrained with multi-scale on COCO as the initial teacher. The teacher was evaluated with 43:3AP on the minval set. We train our network with the COP branch, and the post-processing steps are similar to PAA.

Table 7.1 Backbone ResNet-101, train 2x schedule, Multi-scale training. Results are evaluated on the COCO testset.

Method	AP	AP50	AP75	APs	APm	APl	Download
FCOS	41.5	60.7	45.0	24.4	44.8	51.6
NoisyAnchor	41.8	61.1	44.9	23.4	44.9	52.9
FreeAnchor	43.1	62.2	46.4	24.5	46.1	54.8
SAPD	43.5	63.6	46.5	24.9	46.8	54.6
MAL	43.6	61.8	47.1	25.0	46.9	55.8
ATSS	43.6	62.1	47.4	26.1	47.0	53.6
AutoAssign	44.5	64.3	48.4	25.9	47.4	55.0
PAA	44.8	63.3	48.7	26.5	48.8	56.3
OTA	45.3	63.5	49.3	26.9	48.8	56.1
IQDet	45.1	63.4	49.3	26.7	48.5	56.6
CoLAD (ours)	`46.0`	`64.4`	`50.6`	`27.9`	`49.9`	`57.3`	config \| model \| log

4. Appendix - Ablation Study of Conditional Objectness Prediction (COP)

Table 1 - Appendix: Compare different auxiliary predictions: IoU, Implicit Object prediction (IOP), and Conditional Objectness prediction (COP), with ResNet-18 and ResNet-50 backbones. Experiments on COCO validate set.

IoU	IOP	COP	ResNet-18	ResNet-50
✔️			35.8 (config \| model \| log)	40.4 (config \| model \| log)
✔️	✔️		36.7 (config \| model \| log)	41.6 (config \| model \| log)
✔️		✔️	36.9 (config \| model \| log)	41.6 (config \| model \| log)
	✔️		36.6 (config \| model \| log)	41.1 (config \| model \| log)
		✔️	36.9 (config \| model \| log)	41.2 (config \| model \| log)

Citation

Please cite the paper in your publications if it helps your research:

@misc{nguyen2021improving,
      title={Improving Object Detection by Label Assignment Distillation}, 
      author={Chuong H. Nguyen and Thuy C. Nguyen and Tuan N. Tang and Nam L. H. Phan},
      year={2021},
      eprint={2108.10520},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

On Sep 25 2021, we found that there is a concurrent (unpublished) work from Jianfend Wang, that shares the key idea about Label Assignment Distillation. However, both of our works are independent and original. We would like to acknowledge his work and thank for his help to clarify the issue.

Improving Object Detection by Label Assignment Distillation

Related tags

Overview

Improving Object Detection by Label Assignment Distillation

Table of Contents

Introduction

Installation

Usage

Train the model

Test the model

Experiments

1. A Pilot Study - Preliminary Comparison.

2. A Pilot Study - Does LAD need a bigger teacher network?

3. Compare with State-ot-the-art Label Assignment methods

4. Appendix - Ablation Study of Conditional Objectness Prediction (COP)

Citation

Owner

Cybercore Co. Ltd

Suite of 500 procedurally-generated NLP tasks to study language model adaptability

An example project demonstrating how the Autonomous Learning Library can be used to build new reinforcement learning agents.

This repository contains the implementations related to the experiments of a set of publicly available datasets that are used in the time series forecasting research space.

JUSTICE: A Benchmark Dataset for Supreme Court’s Judgment Prediction

Warning: This project does not have any current developer. See bellow.

Python Tensorflow 2 scripts for detecting objects of any class in an image without knowing their label.

AFLFast (extends AFL with Power Schedules)

Educational API for 3D Vision using pose to control carton.

This is the paddle code for SeBoW(Self-Born wiring for neural trees), a kind of neural tree born form a large search space

Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)

Python package to add text to images, textures and different backgrounds

Code for: Gradient-based Hierarchical Clustering using Continuous Representations of Trees in Hyperbolic Space. Nicholas Monath, Manzil Zaheer, Daniel Silva, Andrew McCallum, Amr Ahmed. KDD 2019.

Music library streaming app written in Flask & VueJS

Providing the solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data.

The official code of "SCROLLS: Standardized CompaRison Over Long Language Sequences".

The official implementation of Equalization Loss for Long-Tailed Object Recognition (CVPR 2020) based on Detectron2

for taichi voxel-challange event

RID-Noise: Towards Robust Inverse Design under Noisy Environments

AAAI-22 paper: SimSR: Simple Distance-based State Representationfor Deep Reinforcement Learning