AI hardware acceleration with analog memory: Microarchitectures for low energy at high speed

Autor:	Scott C. Lewis, Hung-Yang Chang, Hsinyu Tsai, Geoffrey W. Burr, Nathan C. P. Farinha, An Chen, Pritish Narayanan, Stefano Ambrogio, Kohji Hosokawa, Charles Mackin
Rok vydání:	2019
Předmět:	010302 applied physics General Computer Science Computer science business.industry 02 engineering and technology Converters 01 natural sciences CMOS 020204 information systems 0103 physical sciences 0202 electrical engineering electronic engineering information engineering Hardware acceleration Node (circuits) Crossbar switch business Throughput (business) Energy (signal processing) Computer hardware Efficient energy use
Zdroj:	IBM Journal of Research and Development. 63:8:1-8:14
ISSN:	0018-8646
Popis:	In this article, we present innovative microarchitectural designs for multilayer deep neural networks (DNNs) implemented in crossbar arrays of analog memories. Data is transferred in a fully parallel manner between arrays without explicit analog-to-digital converters. Design ideas including source follower-based readout, array segmentation, and transmit-by-duration are adopted to improve the circuit efficiency. The execution energy and throughput, for both DNN training and inference, are analyzed quantitatively using circuit simulations of a full CMOS design in the 90-nm technology node. We find that our current design could achieve up to 12–14 TOPs/s/W energy efficiency for training, while a projected scaled design could achieve up to 250 TOPs/s/W. Key challenges in realizing analog AI systems are discussed.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::ccc7aa63daa2ead0f7dc98f13afc0fee https://doi.org/10.1147/jrd.2019.2934050 Zobrazit plný text záznamu