FATE
Autor: | Jeff Jun Zhang, Siddharth Garg |
---|---|
Rok vydání: | 2018 |
Předmět: |
FOS: Computer and information sciences
010302 applied physics Computer Science - Machine Learning Computer science Computer Science - Neural and Evolutionary Computing Machine Learning (stat.ML) 02 engineering and technology 01 natural sciences Machine Learning (cs.LG) 020202 computer hardware & architecture Complement (complexity) Power (physics) Computer engineering Statistics - Machine Learning Logic gate Hardware Architecture (cs.AR) 0103 physical sciences 0202 electrical engineering electronic engineering information engineering Neural and Evolutionary Computing (cs.NE) Computer Science - Hardware Architecture Representation (mathematics) Energy (signal processing) Efficient energy use |
Zdroj: | ICCAD |
DOI: | 10.1145/3240765.3240809 |
Popis: | Deep neural networks (DNN) are increasingly being accelerated on application-specific hardware such as the Google TPU designed especially for deep learning. Timing speculation is a promising approach to further increase the energy efficiency of DNN accelerators. Architectural exploration for timing speculation requires detailed gate-level timing simulations that can be time-consuming for large DNNs that execute millions of multiply-and-accumulate (MAC) operations. In this paper we propose FATE, a new methodology for fast and accurate timing simulations of DNN accelerators like the Google TPU. FATE proposes two novel ideas: (i) DelayNet, a DNN based timing model for MAC units; and (ii) a statistical sampling methodology that reduces the number of MAC operations for which timing simulations are performed. We show that FATE results in between 8 times-58 times speed-up in timing simulations, while introducing less than 2% error in classification accuracy estimates. We demonstrate the use of FATE by comparing to conventional DNN accelerator that uses 2's complement (2C) arithmetic with an alternative implementation that uses signed magnitude representations (SMR). We show that that the SMR implementation provides 18% more energy savings for the same classification accuracy than 2C, a result that might be of independent interest. To appear at IEEE/ACM International Conference On Computer Aided Design 2018 |
Databáze: | OpenAIRE |
Externí odkaz: |