AdaQAT: Adaptive Bit-Width Quantization-Aware Training

Gernigon, Cédric; Filip, Silviu-Ioan; Sentieys, Olivier; Coggiola, Clément; Bruno, Mickael

Computer Science > Machine Learning

arXiv:2404.16876 (cs)

[Submitted on 22 Apr 2024]

Title:AdaQAT: Adaptive Bit-Width Quantization-Aware Training

Authors:Cédric Gernigon (TARAN), Silviu-Ioan Filip (TARAN), Olivier Sentieys (TARAN), Clément Coggiola (CNES), Mickael Bruno (CNES)

View PDF HTML (experimental)

Abstract:Large-scale deep neural networks (DNNs) have achieved remarkable success in many application scenarios. However, high computational complexity and energy costs of modern DNNs make their deployment on edge devices challenging. Model quantization is a common approach to deal with deployment constraints, but searching for optimized bit-widths can be challenging. In this work, we present Adaptive Bit-Width Quantization Aware Training (AdaQAT), a learning-based method that automatically optimizes weight and activation signal bit-widths during training for more efficient DNN inference. We use relaxed real-valued bit-widths that are updated using a gradient descent rule, but are otherwise discretized for all quantization operations. The result is a simple and flexible QAT approach for mixed-precision uniform quantization problems. Compared to other methods that are generally designed to be run on a pretrained network, AdaQAT works well in both training from scratch and fine-tuning scenarios.Initial results on the CIFAR-10 and ImageNet datasets using ResNet20 and ResNet18 models, respectively, indicate that our method is competitive with other state-of-the-art mixed-precision quantization approaches.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2404.16876 [cs.LG]
	(or arXiv:2404.16876v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2404.16876

Submission history

From: Cedric Gernigon [view email] [via CCSD proxy]
[v1] Mon, 22 Apr 2024 09:23:56 UTC (41 KB)

Computer Science > Machine Learning

Title:AdaQAT: Adaptive Bit-Width Quantization-Aware Training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:AdaQAT: Adaptive Bit-Width Quantization-Aware Training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators