Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

Last update: Dec 30, 2022

Related tags

Overview

Neuron Merging: Compensating for Pruned Neurons

Pytorch implementation of Neuron Merging: Compensating for Pruned Neurons, accepted at 34th Conference on Neural Information Processing Systems (NeurIPS 2020).

Requirements

To install requirements:

conda env create -f ./environment.yml

Python environment & main libraries:

python 3.8
pytorch 1.5.0
scikit-learn 0.22.1
torchvision 0.6.0

LeNet-300-100

To test LeNet-300-100 model on FashionMNIST, run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script:

model type: original | prune | merge
pruning criterion : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

For example, to test the model after pruning 50% of the neurons with $l_1$-norm criterion, run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t prune -c l1-norm -r 0.5

To test the model after merging , run:

bash scripts/LeNet_300_100_FashionMNIST.sh -t merge -c l1-norm -r 0.5

VGG-16

To test VGG-16 model on CIFAR-10, run:

bash scripts/VGG16_CIFAR10.sh -t [model type] -c [criterion]

You can use two arguments for this script

model type: original | prune | merge
pruning criterion: l1-norm | l2-norm | l2-GM

As a pretrained model on CIFAR-100 is not included, you must train it first. To train VGG-16 on CIFAR-100, run:

bash scripts/VGG16_CIFAR100_train.sh

All the hyperparameters are as described in the supplementary material.

After training, to test VGG-16 model on CIFAR-100, run:

bash scripts/VGG16_CIFAR100.sh -t [model type] -c [criterion]

You can use two arguments for this script

model type: original | prune | merge
pruning criterion: l1-norm | l2-norm | l2-GM

ResNet

To test ResNet-56 model on CIFAR-10, run:

bash scripts/ResNet56_CIFAR10.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script

model type: original | prune | merge
pruning method : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

To test WideResNet-40-4 model on CIFAR-10, run:

bash scripts/WideResNet_40_4_CIFAR10.sh -t [model type] -c [criterion] -r [pruning ratio]

You can use three arguments for this script

model type: original | prune | merge
pruning method : l1-norm | l2-norm | l2-GM
pruning ratio : 0.0 ~ 1.0

Results

Our model achieves the following performance on (without fine-tuning) :

Image classification of LeNet-300-100 on FashionMNIST

Baseline Accuracy : 89.80%

Pruning Ratio	Prune ($l_1$-norm)	Merge
50%	88.40%	88.69%
60%	85.17%	86.92%
70%	71.26%	82.75%
80%	66.76	80.02%

Image classification of VGG-16 on CIFAR-10

Baseline Accuracy : 93.70%

Criterion	Prune	Merge
$l_1$-norm	88.70%	93.16%
$l_2$-norm	89.14%	93.16%
$l_2$-GM	87.85%	93.10%

Citation

@inproceedings{kim2020merging,
  title     = {Neuron Merging: Compensating for Pruned Neurons},
  author    = {Kim, Woojeong and Kim, Suhyun and Park, Mincheol and Jeon, Geonseok},
  booktitle = {Advances in Neural Information Processing Systems 33},
  year      = {2020}
}

Neuron Merging: Compensating for Pruned Neurons (NeurIPS 2020)

Related tags

Overview

Neuron Merging: Compensating for Pruned Neurons

Requirements

LeNet-300-100

VGG-16

ResNet

Results

Image classification of LeNet-300-100 on FashionMNIST

Image classification of VGG-16 on CIFAR-10

Citation

Owner

Woojeong Kim

Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"

A new benchmark for Icon Question Answering (IconQA) and a large-scale icon dataset Icon645.

Official Code for AdvRush: Searching for Adversarially Robust Neural Architectures (ICCV '21)

The first machine learning framework that encourages learning ML concepts instead of memorizing class functions.

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

A high-performance anchor-free YOLO. Exceeding yolov3~v5 with ONNX, TensorRT, NCNN, and Openvino supported.

Attempt at implementation of a simple GAN using Keras

Simulate genealogical trees and genomic sequence data using population genetic models

Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"

Implementation of "Efficient Regional Memory Network for Video Object Segmentation" (Xie et al., CVPR 2021).

LONG-TERM SERIES FORECASTING WITH QUERYSELECTOR – EFFICIENT MODEL OF SPARSEATTENTION

Spatial Attentive Single-Image Deraining with a High Quality Real Rain Dataset (CVPR'19)

It is an open dataset for object detection in remote sensing images.

Using Machine Learning to Test Causal Hypotheses in Conjoint Analysis

Python wrappers to the C++ library SymEngine, a fast C++ symbolic manipulation library.

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)

Code for ACL'2021 paper WARP 🌀 Word-level Adversarial ReProgramming

ppo_pytorch_cpp - an implementation of the proximal policy optimization algorithm for the C++ API of Pytorch

Explainability of the Implications of Supervised and Unsupervised Face Image Quality Estimations Through Activation Map Variation Analyses in Face Recognition Models

Bagua is a flexible and performant distributed training algorithm development framework.