Data Free Quantization
13 papers with code • 2 benchmarks • 1 dataset
Data Free Quantization is a technique to achieve a highly accurate quantized model without accessing any training data.
Source: Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples
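All of the methods below build on uniform quantization of a pretrained model's tensors; what varies is how they recover accuracy without training data. As background, a minimal sketch of symmetric per-tensor int8 weight quantization (function names are illustrative, not from any of the papers):

```python
import numpy as np

def quantize_int8(w):
    """Symmetric per-tensor int8 quantization: needs only the
    weight tensor itself, no training data."""
    scale = np.abs(w).max() / 127.0          # map the largest |w| to the int8 range
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    return q.astype(np.float32) * scale

w = np.random.default_rng(0).normal(size=(64, 32)).astype(np.float32)
q, s = quantize_int8(w)
err = np.abs(w - dequantize(q, s)).max()     # rounding error is bounded by scale / 2
```

Weights can always be quantized this way offline; the hard part, which the papers below address, is choosing activation ranges and fine-tuning without data.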
Most implemented papers
Data-Free Quantization Through Weight Equalization and Bias Correction
This improves quantized-model accuracy and can be applied to many common computer vision architectures with a straightforward API call.
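The core idea, cross-layer weight equalization, can be sketched for two fully connected layers joined by a ReLU. The per-channel scale below follows the paper's √(r₁·r₂)/r₂ rule; convolutional layers and the bias-correction step are omitted:

```python
import numpy as np

def equalize(W1, b1, W2):
    """Rescale per-channel so y = W2 @ relu(W1 @ x + b1) is unchanged
    (ReLU is positively homogeneous) while the two layers end up with
    matched per-channel weight ranges, which quantizes better."""
    r1 = np.abs(W1).max(axis=1)              # output-channel ranges of layer 1
    r2 = np.abs(W2).max(axis=0)              # input-channel ranges of layer 2
    s = np.sqrt(r1 * r2) / r2                # both ranges become sqrt(r1 * r2)
    return W1 / s[:, None], b1 / s, W2 * s[None, :]

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(16, 8)), rng.normal(size=16)
W2 = rng.normal(size=(4, 16))
W1e, b1e, W2e = equalize(W1, b1, W2)

x = rng.normal(size=8)
y  = W2  @ np.maximum(W1  @ x + b1 , 0)
ye = W2e @ np.maximum(W1e @ x + b1e, 0)     # identical up to float error
```

Because the network function is preserved exactly, the transform can be applied to any ReLU network before quantization without any data.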
ZeroQ: A Novel Zero Shot Quantization Framework
Importantly, ZeroQ has very low computational overhead and can finish the entire quantization process in under 30 s (0.5% of one epoch of ResNet50 training time on ImageNet).
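ZeroQ synthesizes "distilled data" by optimizing input noise so that each BatchNorm layer's batch statistics match the running statistics stored in the pretrained model. The objective being minimized looks roughly like the following sketch (a single layer, no gradient loop shown):

```python
import numpy as np

def bn_stat_loss(acts, running_mean, running_var):
    """ZeroQ-style objective (sketch): distance between a synthetic
    batch's activation statistics and the BatchNorm running
    statistics stored in the pretrained model."""
    mu, var = acts.mean(axis=0), acts.var(axis=0)
    return float(((mu - running_mean) ** 2).sum() + ((var - running_var) ** 2).sum())

# Toy check: a batch drawn from the stored statistics scores lower
# than generic standard-normal noise.
rng = np.random.default_rng(1)
mean, var = np.full(8, 2.0), np.full(8, 0.25)
loss_matched = bn_stat_loss(rng.normal(mean, np.sqrt(var), size=(4096, 8)), mean, var)
loss_noise = bn_stat_loss(rng.normal(size=(4096, 8)), mean, var)
```

In the actual method this loss is summed over all BN layers and minimized by gradient descent on the input images themselves.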
Generative Low-bitwidth Data Free Quantization
More critically, our method achieves much higher accuracy on 4-bit quantization than the existing data-free quantization method.
Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples
We find that this is often insufficient to capture the distribution of the original data, especially around the decision boundaries.
Diverse Sample Generation: Pushing the Limit of Generative Data-free Quantization
We first give a theoretical analysis showing that the diversity of synthetic samples is crucial for data-free quantization, whereas in existing approaches the synthetic data, fully constrained by BN statistics, empirically exhibits severe homogenization at both the distribution and sample levels.
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation
This paper proposes an on-the-fly DFQ framework with sub-second quantization time, called SQuant, which can quantize networks on inference-only devices with low computation and memory requirements.
Patch Similarity Aware Data-Free Quantization for Vision Transformers
The above insights guide us to design a relative value metric to optimize the Gaussian noise to approximate the real images, which are then utilized to calibrate the quantization parameters.
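Once calibration images have been synthesized from the optimized noise, deriving the quantization parameters is the standard range-calibration step. A minimal sketch using asymmetric uint8 min/max calibration (one of several possible calibrators, not necessarily the paper's exact choice):

```python
import numpy as np

def calibrate_uint8(acts):
    """Derive asymmetric uint8 parameters (scale, zero point) from the
    observed min/max of calibration activations -- here, activations
    produced by the synthesized calibration images."""
    lo, hi = float(acts.min()), float(acts.max())
    scale = (hi - lo) / 255.0
    zero_point = int(round(-lo / scale))
    return scale, zero_point

def fake_quantize(acts, scale, zero_point):
    """Quantize to the uint8 grid and dequantize, exposing the error."""
    q = np.clip(np.round(acts / scale) + zero_point, 0, 255)
    return (q - zero_point) * scale

calib = np.random.default_rng(2).normal(size=1000)
s, zp = calibrate_uint8(calib)
deq = fake_quantize(calib, s, zp)            # error bounded by scale / 2
```

The quality of the synthetic calibration set matters precisely because these ranges are all the quantizer ever sees of the "data".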
It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher
To deal with the performance drop induced by quantization errors, a popular method is to use training data to fine-tune quantized networks.
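In the zero-shot setting, the full-precision teacher stands in for the missing labels: the quantized student is fine-tuned against the teacher's soft predictions. A minimal sketch of the usual distillation loss (the temperature value is illustrative):

```python
import numpy as np

def softmax(logits, T=1.0):
    z = logits / T
    z = z - z.max(axis=-1, keepdims=True)    # numerically stable softmax
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distill_loss(student_logits, teacher_logits, T=2.0):
    """KL divergence from the teacher's softened predictions to the
    student's, scaled by T^2 as is conventional in distillation."""
    p_t = softmax(teacher_logits, T)
    p_s = softmax(student_logits, T)
    return float((p_t * (np.log(p_t) - np.log(p_s))).sum(axis=-1).mean() * T * T)

rng = np.random.default_rng(3)
teacher = rng.normal(size=(16, 10))
perfect = distill_loss(teacher, teacher)     # student matching teacher scores 0
drifted = distill_loss(teacher + rng.normal(size=(16, 10)), teacher)
```

Minimizing this loss on synthetic inputs pulls the quantized student back toward the teacher's decision function without touching any real training data.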
PSAQ-ViT V2: Towards Accurate and General Data-Free Quantization for Vision Transformers
In this paper, we propose PSAQ-ViT V2, a more accurate and general data-free quantization framework for ViTs, built on top of PSAQ-ViT.
Genie: Show Me the Data for Quantization
We also propose a post-training quantization algorithm to enhance the performance of quantized models.