Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Last update: Jun 27, 2022

Related tags

Overview

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Abstract

Analyzing complex scenes with DNN is a challenging task, particularly when images contain multiple objects that partially occlude each other. Existing approaches to image analysis mostly process objects independently and do not take into account the relative occlusion of nearby objects. We propose a deep network for multi-object instance segmentation that is robust to occlusion and can be trained from bounding box supervision only.

We also introduce an Occlusion Challenge dataset generated from real-world segmented objects with accurate annotations and propose a taxonomy of occlusion scenarios that pose a particular challenge for computer vision.

NOTICE

dataset links and model will be released in a few days. Update: 18 June

Requirments

The code uses Python 3.6 and it is tested on PyTorch GPU version 1.2, with CUDA-10.0 and cuDNN-7.5.

Installation

Clone the repository with:

git clone https://github.com/XD7479/Multi-Object-Occlusion.git
cd Multi-Object-Occlusion

Install requirments:

pip install -r requirements.txt

Datasets

Download the KINS dataset here and the Occlusion Challenge dataset here.
Enter the project folder and make links for the datasets:

ln -s  kins
ln -s  occ_challenge

Download the pre-trained model here.
Make links for the pre-trained model:

ln -s  models

Check the configuration file configs.py for the dataset and backbone you're using:

dataset_eval = 'occ_challenge'      # kins, occ_challenge
nn_type = 'resnext'             # vgg, resnext

Run the evaluation code with:

python3 eval_meanIoU.py

Segmentation Demo

Citation

@misc{yuan2021robust,
      title={Robust Instance Segmentation through Reasoning about Multi-Object Occlusion}, 
      author={Xiaoding Yuan and Adam Kortylewski and Yihong Sun and Alan Yuille},
      booktitle = {Proceedings IEEE Conf. on Computer Vision and Pattern Recognition (CVPR)},
      month = jun,
      year = {2021},
      month_numeric = {6}
}

Contact

If you have any questions you can contact Xiaoding Yuan by [email protected].

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Related tags

Overview

Robust Instance Segmentation through Reasoning about Multi-Object Occlusion [CVPR 2021]

Abstract

NOTICE

Requirments

Installation

Datasets

Segmentation Demo

Citation

Contact

Owner

Irene Yuan

Python implementation of ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images, AAAI2022.

An easy way to build PyTorch datasets. Modularly build datasets and automatically cache processed results

This project is for a Twitter bot that monitors a bird feeder in my backyard. Any detected birds are identified and posted to Twitter.

The modify PyTorch version of Siam-trackers which are speed-up by TensorRT.

Code examples and benchmarks from the paper "Understanding Entropy Coding With Asymmetric Numeral Systems (ANS): a Statistician's Perspective"

Curated list of awesome GAN applications and demo

Code for "Reconstructing 3D Human Pose by Watching Humans in the Mirror", CVPR 2021 oral

UT-Sarulab MOS prediction system using SSL models

level1-image-classification-level1-recsys-09 created by GitHub Classroom

This program generates a random 12 digit/character password (upper and lowercase) and stores it in a file along with your username and app/website.

RGBD-Net - This repository contains a pytorch lightning implementation for the 3DV 2021 RGBD-Net paper.

A simple and useful implementation of LPIPS.

tree-math: mathematical operations for JAX pytrees

Meta-learning for NLP

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

LaBERT - A length-controllable and non-autoregressive image captioning model.

Gems & Holiday Package Prediction

Benchmarking the robustness of Spatial-Temporal Models

Efficient neural networks for analog audio effect modeling

TensorFlow Implementation of Unsupervised Cross-Domain Image Generation