(AAAI2020)Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing

Last update: Aug 04, 2022

Overview

Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing

This repository contains pytorch source code for AAAI2020 oral paper: Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing by Haoyu He, Jing Zhang, Qiming Zhang and Dacheng Tao.

Grapy-ML:

Getting Started:

Environment:

Pytorch = 1.1.0
torchvision
scipy
tensorboardX
numpy
opencv-python
matplotlib

Data Preparation:

You need to download the three datasets. The CIHP dataset and ATR dataset can be found in this repository and our code is heavily borrowed from it as well.

Then, the datasets should be arranged in the following folder, and images should be rearranged with the provided file structure.

/data/dataset/

Testing:

The pretrain models and some trained models are provided here for testing and training.

Model Name	Description	Derived from
deeplab_v3plus_v3.pth	The Deeplab v3+'s pretrain weights
CIHP_pretrain.pth	The reproduced Deeplab v3+ model trained on CIHP dataset	deeplab_v3plus_v3.pth
CIHP_trained.pth	GPM model trained on CIHP dataset	CIHP_pretrain.pth
deeplab_multi-dataset.pth	The reproduced multi-task learning Deeplab v3+ model trained on CIHP, PASCAL-Person-Part and ATR dataset	deeplab_v3plus_v3.pth
GPM-ML_multi-dataset.pth	Grapy-ML model trained on CIHP, PASCAL-Person-Part and ATR dataset	deeplab_multi-dataset.pth
GPM-ML_finetune_PASCAL.pth	Grapy-ML model finetuned on PASCAL-Person-Part dataset	GPM-ML_multi-dataset.pth

To test, run the following two scripts:

bash eval_gpm.sh
bash eval_gpm_ml.sh

Training:

GPM:

During training, you first need to get the Deeplab pretrain model(e.g. CIHP_dlab.pth) on each dataset. Such act aims to provide a trustworthy initial raw result for the GSA operation in GPM.

bash train_dlab.sh

The imageNet pretrain model is provided in the following table, and you should swith the dataset name and target classes to the dataset you want in the script. (CIHP: 20 classes, PASCAL: 7 classes and ATR: 18 classes)

In the next step, you should utilize the Deeplab pretrain model to further train the GPM model.

bash train_gpm.sh

It is recommended to follow the training settings in our paper to reproduce the results.

GPM-ML:

Firstly, you can conduct the deeplab pretrain process by the following script:

bash train_dlab_ml.sh

The multi-dataset Deeplab V3+ is transformed as a simple multi-task task.

Then, you can train the GPM-ML model with the training set from all three datasets by:

bash train_gpm_ml_all.sh

After this phase, the first two levels of the GPM-ML model would be more robust and generalized.

Finally, you can try to finetune on each dataset by the unified pretrain model.

bash train_gpm_ml_pascal.sh

Citation:

@inproceedings{he2020grapy,
title={Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing},
author={He, Haoyu and Zhang, Jing and Zhang, Qiming and Tao, Dacheng},
booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
year={2020}
}

Maintainer:

[email protected]

(AAAI2020)Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing

Related tags

Overview

Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing

Grapy-ML:

Getting Started:

Environment:

Data Preparation:

Testing:

Training:

GPM:

GPM-ML:

Citation:

Maintainer:

Owner

Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.

Codebase for "ProtoAttend: Attention-Based Prototypical Learning."

Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)

This is an example implementation of the paper "Cross Domain Robot Imitation with Invariant Representation".

[MICCAI'20] AlignShift: Bridging the Gap of Imaging Thickness in 3D Anisotropic Volumes

This is the code of NeurIPS'21 paper "Towards Enabling Meta-Learning from Target Models".

Towards Open-World Feature Extrapolation: An Inductive Graph Learning Approach

CCCL: Contrastive Cascade Graph Learning.

[CVPR2021 Oral] FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation.

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021)

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

📖 Deep Attentional Guided Image Filtering

3rd Place Solution of the Traffic4Cast Core Challenge @ NeurIPS 2021

Selecting Parallel In-domain Sentences for Neural Machine Translation Using Monolingual Texts

Reimplement of SimSwap training code

constructing maps of intellectual influence from publication data

Vehicle speed detection with python

Repository relating to the CVPR21 paper TimeLens: Event-based Video Frame Interpolation

Mortgage-loan-prediction - Show how to perform advanced Analytics and Machine Learning in Python using a full complement of PyData utilities

The official implementation of Equalization Loss v1 & v2 (CVPR 2020, 2021) based on MMDetection.