Fast and robust analog in-memory deep neural network training.

Authors: Rasch MJ; IBM Research, TJ Watson Research Center, Yorktown Heights, NY, USA. malte.rasch@gmail.com.; Sony AI, Zürich, Switzerland. malte.rasch@gmail.com., Carta F; IBM Research, TJ Watson Research Center, Yorktown Heights, NY, USA., Fagbohungbe O; IBM Research, TJ Watson Research Center, Yorktown Heights, NY, USA., Gokmen T; IBM Research, TJ Watson Research Center, Yorktown Heights, NY, USA. tgokmen@us.ibm.com.
Language: English
Source: Nature Communications [Nat Commun] 2024 Aug 20; Vol. 15 (1), pp. 7133. Date of Electronic Publication: 2024 Aug 20.
DOI: 10.1038/s41467-024-51221-z
Abstract: Analog in-memory computing is a promising future technology for efficiently accelerating deep learning networks. While using in-memory computing to accelerate the inference phase has been studied extensively, accelerating the training phase has received less attention, despite its arguably much larger compute demand. While some analog in-memory training algorithms have been suggested, they either invoke a significant amount of auxiliary digital compute (accumulating the gradient in digital floating-point precision, which limits the potential speed-up) or suffer from the need to program reference conductance values nearly perfectly to establish an algorithmic zero point. Here, we propose two improved algorithms for in-memory training that retain the same fast runtime complexity while removing the requirement of a precise zero point. We further investigate the limits of the algorithms in terms of conductance noise, symmetry, retention, and endurance, which narrows down the device material choices adequate for fast and robust in-memory deep neural network training.
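The following is a minimal, hypothetical sketch (not the authors' proposed algorithms) illustrating the contrast the abstract draws: accumulating the gradient of a single linear layer in a digital floating-point buffer versus applying the rank-1 update directly on the analog weight array. All names (W_analog, G_digital, transfer_every) are illustrative assumptions only.

    import numpy as np

    rng = np.random.default_rng(0)
    n_out, n_in = 4, 8
    W_analog = rng.standard_normal((n_out, n_in)) * 0.1  # conductance-backed weights
    G_digital = np.zeros_like(W_analog)                   # digital FP gradient accumulator
    lr, transfer_every = 0.01, 10

    for step in range(100):
        x = rng.standard_normal(n_in)    # input activation (placeholder data)
        d = rng.standard_normal(n_out)   # backpropagated error (placeholder data)

        # Digital-accumulation style: build up the rank-1 gradient in floating
        # point and transfer it to the analog array only every few steps.
        G_digital += np.outer(d, x)
        if (step + 1) % transfer_every == 0:
            W_analog -= lr * G_digital
            G_digital[:] = 0.0

        # A fully in-memory update (the regime targeted by the paper) would
        # instead apply the rank-1 outer-product update directly on the analog
        # crossbar at every step, subject to device noise and asymmetry, e.g.:
        # W_analog -= lr * np.outer(d, x)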
(© 2024. The Author(s).)
Database: MEDLINE