Deep Learning for Human Part Discovery in Images - Chainer implementation

Last update: Sep 25, 2022

Overview

Deep Learning for Human Part Discovery in Images - Chainer implementation

NOTE: This is not official implementation. Original paper is Deep Learning for Human Part Discovery in Images.

We are now reproducing the experiments in the original paper. Any contribution will be welcomed!

Requirements

Python 2.7.11+
- Chainer 1.10+
- numpy 1.9+
- scipy 0.16+
- six
- matplotlib
- tqdm
- cv2 (opencv)

Preparation

Data

bash prepare.sh

This script downloads VOC 2010 dataset (http://host.robots.ox.ac.uk/pascal/VOC/voc2010/VOCtrainval_03-May-2010.tar) and the authors' original dataset (http://www2.informatik.uni-freiburg.de/~oliveira/datasets/Sitting.tar.gz).

Model

You can download pre-trained FCN model from here.

We will use weights of this model and train new model on VOC dataset.

Start training

python train.py -g 0 -b 3 -e 3000 -l on -s on

Possible options

python train.py --help

GPU memory requirement

Citation from the original paper:

Each minibatch consists of just one image. The learning rate and momentum are fixed to 1e 10 and 0.99, respectively. We train the refinement layer by layer, which takes two days per refinement layer. Thus, the overall training starting from the pre-trained VGG network took 10 days on a single GPU.

Current maximum batchsize is 3 for 12 GB memory GPU.

Also it was confirmed that MBP (Late 2016, memory 16 GiB) can run with batchsize 1.

Result

Now in prep.

Visualize Prediction

python visualize.py -f PATH_TO_IMAGE_FILE

LICENSE

MIT LICENSE.

Author

shiba24, August 2016.

Contributors

bobye

Deep Learning for Human Part Discovery in Images - Chainer implementation

Related tags

Overview

Deep Learning for Human Part Discovery in Images - Chainer implementation

Requirements

Preparation

Data

Model

Start training

Possible options

GPU memory requirement

Result

Visualize Prediction

LICENSE

Author

Contributors

Owner

Shintaro Shiba

Experiments for distributed optimization algorithms

Modeling CNN layers activity with Gaussian mixture model

Semantic Segmentation Suite in TensorFlow

Python interface for SmartRF Sniffer 2 Firmware

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

A standard framework for modelling Deep Learning Models for tabular data

PlenOctrees: NeRF-SH Training & Conversion

Instance Semantic Segmentation List

This is project is the implementation of the DeepShift: Towards Multiplication-Less Neural Networks paper

Pytorch Implementation of Continual Learning With Filter Atom Swapping (ICLR'22 Spolight) Paper

Unbalanced Feature Transport for Exemplar-based Image Translation (CVPR 2021)

[CVPR'21] Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-view Transformation

Tensor-Based Quantum Machine Learning

Reference implementation of code generation projects from Facebook AI Research. General toolkit to apply machine learning to code, from dataset creation to model training and evaluation. Comes with pretrained models.

WaveFake: A Data Set to Facilitate Audio DeepFake Detection

Ultra-Data-Efficient GAN Training: Drawing A Lottery Ticket First, Then Training It Toughly

Official implementation for ICDAR 2021 paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer"

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

BisQue is a web-based platform designed to provide researchers with organizational and quantitative analysis tools for 5D image data. Users can extend BisQue by implementing containerized ML workflows.

Open source repository for the code accompanying the paper 'PatchNets: Patch-Based Generalizable Deep Implicit 3D Shape Representations'.