Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth

Last update: Dec 07, 2022

Overview

This codebase implements the loss function described in:

Instance Segmentation by Jointly Optimizing Spatial Embeddings and Clustering Bandwidth. Davy Neven, Bert De Brabandere, Marc Proesmans, and Luc Van Gool. Conference on Computer Vision and Pattern Recognition (CVPR), June 2019.

Our network architecture is a multi-branched version of ERFNet and uses the Lovasz-hinge loss for maximizing the IoU of each instance.
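As a rough illustration of the clustering idea behind the loss, the following is a minimal sketch assuming the Gaussian formulation from the paper: each pixel's spatial embedding is scored against an instance center with a bandwidth sigma, and pixels scoring above 0.5 are assigned to that instance. The function names (`instance_probability`, `assign_pixels`) are hypothetical and are not taken from this codebase, which may parameterize the bandwidth differently.

```python
import math


def instance_probability(embedding, center, sigma):
    """Gaussian score in (0, 1] that a 2D pixel embedding belongs to an instance.

    Assumed form: exp(-||e - c||^2 / (2 * sigma^2)).
    """
    dx = embedding[0] - center[0]
    dy = embedding[1] - center[1]
    return math.exp(-(dx * dx + dy * dy) / (2.0 * sigma ** 2))


def assign_pixels(embeddings, center, sigma, threshold=0.5):
    """Return one boolean per embedding: True if it falls inside the instance."""
    return [instance_probability(e, center, sigma) > threshold for e in embeddings]


# A larger sigma widens the margin around the center, so more pixels are accepted.
narrow = assign_pixels([(0.0, 0.0), (3.0, 0.0)], (0.0, 0.0), sigma=1.0)
wide = assign_pixels([(0.0, 0.0), (3.0, 0.0)], (0.0, 0.0), sigma=5.0)
```

In this sketch a learned sigma plays the role of the clustering bandwidth: large objects can tolerate a loose margin, while small objects need a tight one.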

License

This software is released under a Creative Commons license which allows for personal and research use only. For a commercial license, please contact the authors. You can view a license summary here.

Getting started

This codebase showcases the proposed loss function on car instance segmentation using the Cityscapes dataset.

Prerequisites

Dependencies:

PyTorch 1.1
Python 3.6.8 (or higher)
Cityscapes + scripts (if you want to evaluate the model)

Training

Training consists of two steps. We first train on 512x512 crops around each object, to avoid computation on background patches. Afterwards, we finetune on larger patches (1024x1024) to account for bigger objects and background features which are not present in the smaller crops.
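To make the first step concrete, here is a minimal sketch of how a fixed-size crop window around an object could be computed so that it stays inside the image. It is an illustration under stated assumptions, not the logic of utils/generate_crops.py: the helper name `crop_box` and the choice to shift the window at image borders (rather than pad) are hypothetical.

```python
def crop_box(center_y, center_x, image_h, image_w, crop_size=512):
    """Return (top, left, bottom, right) of a crop window around an object.

    The window is centered on the object where possible and shifted
    inwards when it would otherwise cross the image border.
    """
    half = crop_size // 2
    # Clamp the top-left corner so the full window fits inside the image.
    top = min(max(center_y - half, 0), max(image_h - crop_size, 0))
    left = min(max(center_x - half, 0), max(image_w - crop_size, 0))
    bottom = min(top + crop_size, image_h)
    right = min(left + crop_size, image_w)
    return top, left, bottom, right


# Cityscapes images are 2048 pixels wide and 1024 pixels high.
centered = crop_box(512, 1024, image_h=1024, image_w=2048)
near_corner = crop_box(10, 10, image_h=1024, image_w=2048)
```

Cropping around objects this way keeps every training patch focused on at least one instance, which is the motivation given above for avoiding background-only patches.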

To generate these crops do the following:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python utils/generate_crops.py

Afterwards start training:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python train.py

Options can be changed in train_config.py; for example, set display=True to visualize training.
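The commands above pass the dataset location through the CITYSCAPES_DIR environment variable. As a hedged sketch of how a script might pick that value up, the helper below reads the variable and fails early with a clear message when it is missing. The function name `resolve_cityscapes_dir` is hypothetical and is not part of this codebase.

```python
import os


def resolve_cityscapes_dir(env=None):
    """Return the dataset root from CITYSCAPES_DIR, or raise if it is unset."""
    env = os.environ if env is None else env
    root = env.get("CITYSCAPES_DIR", "").strip()
    if not root:
        raise RuntimeError(
            "CITYSCAPES_DIR is not set; run e.g. "
            "CITYSCAPES_DIR=/path/to/cityscapes/ python train.py"
        )
    # Normalize so that a trailing slash does not matter downstream.
    return os.path.normpath(root)


example_root = resolve_cityscapes_dir({"CITYSCAPES_DIR": "/path/to/cityscapes/"})
```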

Testing

You can download a pretrained model here. Save this file in src/pretrained_models/ or adapt test_config.py accordingly.

To test the model on the Cityscapes validation set, run:

$ CITYSCAPES_DIR=/path/to/cityscapes/ python test.py

The pretrained model achieves 56.4 AP for the car class on the validation set.

Acknowledgement

This work was supported by Toyota and was carried out at the TRACE Lab at KU Leuven (Toyota Research on Automated Cars in Europe - Leuven).
