FreeSOLO for unsupervised instance segmentation, CVPR 2022

Last update: Jan 02, 2023

Overview

FreeSOLO: Learning to Segment Objects without Annotations

This project hosts the code for implementing the FreeSOLO algorithm for unsupervised instance segmentation.

FreeSOLO: Learning to Segment Objects without Annotations,
Xinlong Wang, Zhiding Yu, Shalini De Mello, Jan Kautz, Anima Anandkumar, Chunhua Shen, Jose M. Alvarez
In: Proc. IEEE Conf. Computer Vision and Pattern Recognition (CVPR), 2022
arXiv preprint (arXiv 2202.12181)

Visual Results

Installation

Prerequisites

Linux or macOS with Python >= 3.6
PyTorch >= 1.5 and torchvision that matches the PyTorch installation.
scikit-image

Install PyTorch in Conda env

# create conda env
conda create -n detectron2 python=3.6
# activate the enviorment
conda activate detectron2
# install PyTorch >=1.5 with GPU
conda install pytorch torchvision -c pytorch

Build Detectron2 from Source

Follow the INSTALL.md to install Detectron2 (commit id 11528ce has been tested).

Datasets

Follow the datasets/README.md to set up the MS COCO dataset.

Pre-trained model

Download the DenseCL pre-trained model from here. Convert it to detectron2's format and put the converted model under "training_dir/pre-trained/DenseCL" directory.

python tools/convert-pretrain-to-detectron2.py {WEIGHT_FILE}.pth {WEIGHT_FILE}.pkl

Usage

Free Mask

Download the prepared free masks in json format from here. Put it under "datasets/coco/annotations" directory. Or, generate it by yourself:

bash inference_freemask.sh

Training

# train with free masks
bash train.sh

# generate pseudo labels
bash gen_pseudo_labels.sh

# self-train
bash train_pl.sh

Testing

Download the trained model from here.

bash test.sh {MODEL_PATH}

Citations

Please consider citing our paper in your publications if the project helps your research. BibTeX reference is as follow.

@article{wang2022freesolo,
  title={{FreeSOLO}: Learning to Segment Objects without Annotations},
  author={Wang, Xinlong and Yu, Zhiding and De Mello, Shalini and Kautz, Jan and Anandkumar, Anima and Shen, Chunhua and Alvarez, Jose M},
  journal={arXiv preprint arXiv:2202.12181},
  year={2022}
}

FreeSOLO for unsupervised instance segmentation, CVPR 2022

Related tags

Overview

FreeSOLO: Learning to Segment Objects without Annotations

Visual Results

Installation

Prerequisites

Install PyTorch in Conda env

Build Detectron2 from Source

Datasets

Pre-trained model

Usage

Free Mask

Training

Testing

Citations

Owner

NVIDIA Research Projects

The code for our paper Semi-Supervised Learning with Multi-Head Co-Training

f-BRS: Rethinking Backpropagating Refinement for Interactive Segmentation

An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates neural fields, predictive coding, top-down-bottom-up, and attention (consensus between columns)

A-SDF: Learning Disentangled Signed Distance Functions for Articulated Shape Representation (ICCV 2021)

Cancer metastasis detection with neural conditional random field (NCRF)

A library for augmentation of a YOLO-formated dataset

Entity-Based Knowledge Conflicts in Question Answering.

A PyTorch implementation of ViTGAN based on paper ViTGAN: Training GANs with Vision Transformers.

Codes for AAAI22 paper "Learning to Solve Travelling Salesman Problem with Hardness-Adaptive Curriculum"

git《USD-Seg:Learning Universal Shape Dictionary for Realtime Instance Segmentation》(2020) GitHub: [fig2]

Discovering Explanatory Sentences in Legal Case Decisions Using Pre-trained Language Models.

GBK-GNN: Gated Bi-Kernel Graph Neural Networks for Modeling Both Homophily and Heterophily

Unsupervised Domain Adaptation for Nighttime Aerial Tracking (CVPR2022)

[ICLR'19] Trellis Networks for Sequence Modeling

This repo provides a demo for the CVPR 2021 paper "A Fourier-based Framework for Domain Generalization" on the PACS dataset.

Code for the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks"

Datasets, tools, and benchmarks for representation learning of code.

[AI6101] Introduction to AI & AI Ethics is a core course of MSAI, SCSE, NTU, Singapore

Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization

Code for Contrastive-Geometry Networks for Generalized 3D Pose Transfer