BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.

Last update: Dec 02, 2022

Overview

BitPack

BitPack is a practical tool that can efficiently save quantized neural network models with mixed bitwidth.

Installation

PyTorch version >= 1.4.0
Python version >= 3.5
To install Bitpack simply run:

git clone https://github.com/Zhen-Dong/BitPack.git
cd BitPack

Usage

We can use BitPack pack.py to save integer checkpoints with various bitwidth, and use BitPack unpack.py to load the packed checkpoint, as shown in the demo.
To pack integer values that are saved in floating point format, add --force-pack-fp in the command.
To directly save packed checkpoint in PyTorch, please use save_quantized_state_dict() and load_quantized_state_dict() in pytorch_interface.py. If you don't want to operate jointly on state_dict, then codes inside the for loop of those two functions can be applied on every quantized tensor (ultra low-precision integer tensors) in various quantization frameworks.

Quick Start

BitPack is handy to use on various quantization frameworks. Here we show a demo that applying BitPack to save mixed-precision model generated by HAWQ.

export CUDA_VISIBLE_DEVICES=0
python pack.py --input-int-file quantized_checkpoint.pth.tar --force-pack-fp
python unpack.py --input-packed-file packed_quantized_checkpoint.pth.tar --original-int-file quantized_checkpoint.pth.tar

To get a better sense of how BitPack works, we provide a simple test that compares the original tensor, the packed tensor, and the unpacked tensor in details.

cd bitpack
python bitpack_utils.py

Results of BitPack on ResNet50

Original Precision	Quantization	Original Size(MB)	Packed Size(MB)	Compression Ratio
Floating Point	Mixed-Precision(4bit/8bit)	102	13.8	7.4x
8-bit	Mixed-Precision(2bit/8bit)	26	7.9	3.3x

Special Notes

unpack.py can be used for checking correctness. It loads and unpacks the packed model, and then compares it with the original model.

License

BitPack is released under the MIT license.

BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.

Related tags

Overview

BitPack

Installation

Usage

Quick Start

Results of BitPack on ResNet50

Special Notes

License

Owner

Zhen Dong

Continuous Security Group Rule Change Detection & Response at scale

A PyTorch Implementation of Gated Graph Sequence Neural Networks (GGNN)

Code for KDD'20 "An Efficient Neighborhood-based Interaction Model for Recommendation on Heterogeneous Graph"

[ICCV'21] Pri3D: Can 3D Priors Help 2D Representation Learning?

An open source object detection toolbox based on PyTorch

Denoising Diffusion Probabilistic Models

Implement slightly different caffe-segnet in tensorflow

pip install python-office

Codes for NeurIPS 2021 paper "Adversarial Neuron Pruning Purifies Backdoored Deep Models"

[CVPR 2021] MiVOS - Scribble to Mask module

High-Resolution Image Synthesis with Latent Diffusion Models

Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.

Mosaic of Object-centric Images as Scene-centric Images (MosaicOS) for long-tailed object detection and instance segmentation.

Generating Fractals on Starknet with Cairo

VR Viewport Pose Model for Quantifying and Exploiting Frame Correlations

Pytorch codes for Feature Transfer Learning for Face Recognition with Under-Represented Data

Accelerating BERT Inference for Sequence Labeling via Early-Exit

Code for the ICML 2021 paper "Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation", Haoxiang Wang, Han Zhao, Bo Li.

NOD: Taking a Closer Look at Detection under Extreme Low-Light Conditions with Night Object Detection Dataset

AI grand challenge 2020 Repo (Speech Recognition Track)