Beta R-CNN: Looking into Pedestrian Detection from Another Perspective (NeurIPS 2020)

Last update: Sep 08, 2021

Overview

Beta R-CNN: Looking into Pedestrian Detection from Another Perspective

This is the PyTorch implementation of our paper "Beta R-CNN: Looking into Pedestrian Detection from Another Perspective", published at NeurIPS 2020.

Our method aims to detect highly occluded and heavily overlapped instances in crowded scenes, especially for pedestrian detection.

The code will be released here. Because the experiments were conducted with an internal framework, we need some time to rewrite and clean the code. We will release the complete code soon.

Abstract

Recently significant progress has been made in pedestrian detection, but it remains challenging to achieve high performance in occluded and crowded scenes. It could be mostly attributed to the widely used representation of pedestrians, i.e., 2D axis-aligned bounding box, which just describes the approximate location and size of the object. Bounding box models the object as a uniform distribution within the boundary, making pedestrians indistinguishable in occluded and crowded scenes due to much noise. To eliminate the problem, we propose a novel representation based on 2D beta distribution, named Beta Representation. It pictures a pedestrian by explicitly constructing the relationship between full-body and visible boxes, and emphasizes the center of visual mass by assigning different probability values to pixels. As a result, Beta Representation is much better for distinguishing highly-overlapped instances in crowded scenes with a new NMS strategy named BetaNMS. What’s more, to fully exploit Beta Representation, a novel pipeline Beta R-CNN equipped with BetaHead and BetaMask is proposed, leading to high detection performance in occluded and crowded scenes.
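
The idea of weighting pixels inside a box by a 2D beta distribution, instead of treating them uniformly, can be illustrated with a small sketch. This is not the released implementation. It assumes the 2D density factorizes into two independent 1D beta densities over normalized box coordinates, and the names `beta_pdf` and `beta_map` as well as the shape parameters are hypothetical, chosen only for illustration.

```python
import math

import numpy as np


def beta_pdf(t, a, b):
    """Density of Beta(a, b) evaluated at points t strictly inside (0, 1)."""
    log_norm = math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
    return np.exp(log_norm + (a - 1) * np.log(t) + (b - 1) * np.log1p(-t))


def beta_map(box_w, box_h, ax, bx, ay, by):
    """Per-pixel weights over a full-body box of box_h rows by box_w columns.

    Assumption (not taken from the paper): the 2D density is the product of
    one beta density per axis, evaluated at pixel centers in normalized
    box coordinates.
    """
    xs = (np.arange(box_w) + 0.5) / box_w  # pixel centers along x in (0, 1)
    ys = (np.arange(box_h) + 0.5) / box_h  # pixel centers along y in (0, 1)
    return np.outer(beta_pdf(ys, ay, by), beta_pdf(xs, ax, bx))


# Hypothetical shape parameters: mass centered horizontally and shifted
# toward the top of the box, as if the lower body were occluded.
weights = beta_map(box_w=40, box_h=100, ax=4.0, bx=4.0, ay=2.0, by=5.0)
peak_row, peak_col = np.unravel_index(np.argmax(weights), weights.shape)
print("peak (normalized x, y):", (peak_col + 0.5) / 40, (peak_row + 0.5) / 100)
```

Unlike a uniform box, the resulting map concentrates weight around a single peak, so two heavily overlapping boxes with different visible regions yield clearly different maps.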

Method

The network structure and some visualization results are shown here:

Citation

@article{BetaRCNN,
  title={Beta R-CNN: Looking into Pedestrian Detection from Another Perspective},
  author={Xu, Zixuan and Li, Banghuai and Yuan, Ye and Dang, Anhong},
  journal={Advances in Neural Information Processing Systems},
  volume={33},
  year={2020}
}

Contact

If you have any questions, please do not hesitate to contact Zixuan Xu ([email protected]).
