Official PyTorch implementation of Active Learning for Deep Object Detection via Probabilistic Modeling (ICCV 2021)

Last update: Jan 06, 2023

Overview

Active Learning for Deep Object Detection via Probabilistic Modeling

This repository is the official PyTorch implementation of Active Learning for Deep Object Detection via Probabilistic Modeling, ICCV 2021.

The proposed method is implemented on top of SSD pytorch.

Our approach relies on mixture density networks to estimate, in a single forward pass of a single model, both localization and classification uncertainties, and leverages them in the scoring function for active learning.
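As a rough illustration of how a mixture model can expose more than one kind of uncertainty from a single set of outputs, the sketch below applies the law of total variance to a one-dimensional Gaussian mixture (for example, one box coordinate). The weighted average of the component variances is commonly read as aleatoric (data) uncertainty, and the weighted spread of the component means as epistemic (model) uncertainty. The function name, the 1-D simplification, and these labels are assumptions made for illustration; this is not the repository's code.

```python
from typing import Sequence, Tuple


def mixture_uncertainty(
    weights: Sequence[float],
    means: Sequence[float],
    variances: Sequence[float],
) -> Tuple[float, float, float]:
    """Return (mean, aleatoric, epistemic) for a 1-D Gaussian mixture.

    Hypothetical helper: decomposes the mixture variance into
    the expected component variance and the variance of the component means.
    """
    if not (len(weights) == len(means) == len(variances)):
        raise ValueError("weights, means and variances must have equal length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("mixture weights must sum to a positive value")
    # Renormalize defensively so the weights form a probability distribution.
    pis = [w / total for w in weights]

    mean = sum(p * m for p, m in zip(pis, means))
    # Expected variance of the components (noise the model attributes to the data).
    aleatoric = sum(p * v for p, v in zip(pis, variances))
    # Variance of the component means (disagreement between components).
    epistemic = sum(p * (m - mean) ** 2 for p, m in zip(pis, means))
    return mean, aleatoric, epistemic


if __name__ == "__main__":
    # Components agree on the coordinate: no spread between the means.
    print(mixture_uncertainty([0.5, 0.5], [10.0, 10.0], [1.0, 1.0]))
    # Components disagree: the spread between the means grows.
    print(mixture_uncertainty([0.5, 0.5], [8.0, 12.0], [1.0, 1.0]))
```

The two quantities sum to the total variance of the mixture, which is why a single forward pass producing mixture parameters is enough to obtain both.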

Our method performs on par with methods that need multiple models or multiple forward passes (e.g., ensembles and MC-Dropout) while requiring only a single model and a single forward pass. It therefore provides the best trade-off between accuracy and computational cost.

License

To view the NVIDIA Source Code License for this work, visit https://github.com/NVlabs/AL-MDN/blob/main/LICENSE

Requirements

For setup and data preparation, please refer to the README in SSD pytorch.

The code was tested in a virtual environment with Python 3+ and PyTorch 1.1.

Training

Create a weights directory with mkdir weights, then cd weights.
Download the FC-reduced VGG-16 backbone weights into the weights directory, then run cd .. to return to the repository root.
If necessary, change the VOC_ROOT in data/voc0712.py or COCO_ROOT in data/coco.py.
Please refer to data/config.py for configuration.
Run the training code:

# Supervised learning
CUDA_VISIBLE_DEVICES=<GPU_ID> python train_ssd_gmm_supervised_learning.py

# Active learning
CUDA_VISIBLE_DEVICES=<GPU_ID> python train_ssd_gmm_active_learining.py
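For readers new to pool-based active learning, one cycle generally consists of scoring the unlabeled images with the current model, sending the highest-scoring ones for annotation, and retraining on the enlarged labeled set. The sketch below shows only the selection step under a fixed labeling budget. All names, the budget, and the example scores are hypothetical, and the sketch does not reproduce the logic of the training script above.

```python
from typing import Dict, List, Set, Tuple


def select_for_labeling(scores: Dict[str, float], budget: int) -> List[str]:
    """Pick the `budget` highest-scoring image ids (ties broken by id)."""
    if budget < 0:
        raise ValueError("budget must be non-negative")
    # Sort by descending score, then by id so the result is deterministic.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [image_id for image_id, _ in ranked[:budget]]


def active_learning_cycle(
    labeled: Set[str],
    unlabeled: Set[str],
    scores: Dict[str, float],
    budget: int,
) -> Tuple[Set[str], Set[str]]:
    """Move the most informative unlabeled images into the labeled set.

    `scores` maps image id to an informativeness score (assumed here to be
    an uncertainty-based value where higher means more worth labeling).
    """
    pool_scores = {image_id: scores[image_id] for image_id in unlabeled}
    chosen = set(select_for_labeling(pool_scores, budget))
    return labeled | chosen, unlabeled - chosen


if __name__ == "__main__":
    labeled = {"img_000"}
    unlabeled = {"img_001", "img_002", "img_003"}
    scores = {"img_001": 0.2, "img_002": 0.9, "img_003": 0.5}
    labeled, unlabeled = active_learning_cycle(labeled, unlabeled, scores, budget=2)
    print(sorted(labeled))
    print(sorted(unlabeled))
```

Returning new sets instead of mutating the inputs keeps each cycle easy to log and replay, which can help when comparing scoring functions across runs.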

Evaluation

To evaluate on MS-COCO, change the COCO_ROOT_EVAL in data/coco_eval.py.
Run the evaluation code:

# Evaluation on PASCAL VOC
python eval_voc.py --trained_model <trained weight path>

# Evaluation on MS-COCO
python eval_coco.py --trained_model <trained weight path>

Visualization

Run the visualization code:

python demo.py --trained_model <trained weight path>

Citation

@InProceedings{Choi_2021_ICCV,
    author    = {Choi, Jiwoong and Elezi, Ismail and Lee, Hyuk-Jae and Farabet, Clement and Alvarez, Jose M.},
    title     = {Active Learning for Deep Object Detection via Probabilistic Modeling},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {10264-10273}
}

Owner

NVIDIA Research Projects
