Training Multi-Bit Quantized and Binarized Networks with a Learnable Symmetric Quantizer

Autor:	Phuoc Pham, Jacob A. Abraham, Jaeyong Chung
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	Learnable quantizer quantization binarization model compression machine learning neuromorphic computing Electrical engineering. Electronics. Nuclear engineering TK1-9971
Zdroj:	IEEE Access, Vol 9, Pp 47194-47203 (2021)
Druh dokumentu:	article
ISSN:	2169-3536
DOI:	10.1109/ACCESS.2021.3067889
Popis:	Quantizing weights and activations of deep neural networks is essential for deploying them in resource-constrained devices, or cloud platforms for at-scale services. While binarization is a special case of quantization, this extreme case often leads to several training difficulties, and necessitates specialized models and training methods. As a result, recent quantization methods do not provide binarization, thus losing the most resource-efficient option, and quantized and binarized networks have been distinct research areas. We examine binarization difficulties in a quantization framework and find that all we need to enable the binary training are a symmetric quantizer, good initialization, and careful hyperparameter selection. These techniques also lead to substantial improvements in multi-bit quantization. We demonstrate our unified quantization framework, denoted as UniQ, on the ImageNet dataset with various architectures such as ResNet-18,-34 and MobileNetV2. For multi-bit quantization, UniQ outperforms existing methods to achieve the state-of-the-art accuracy. In binarization, the achieved accuracy is comparable to existing state-of-the-art methods even without modifying the original architectures.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/db79439d6bb74b6dbe299621c56e0b0e Zobrazit plný text záznamu View record in DOAJ