A Dual-Split 6T SRAM-Based Computing-in-Memory Unit-Macro With Fully Parallel Product-Sum Operation for Binarized DNN Edge Processors
Autor: | Meng-Fan Chang, Qiang Li, Xiaoyu Sun, Xin Si, Jiafang Li, Jia-Jing Chen, Hiroyuki Yamauchi, Rui Liu, Shimeng Yu, Win-San Khwa |
---|---|
Rok vydání: | 2019 |
Předmět: |
Offset (computer science)
Artificial neural network Computer science business.industry 020208 electrical & electronic engineering Transistor Binary number 02 engineering and technology law.invention XNOR gate Hardware and Architecture law 0202 electrical engineering electronic engineering information engineering Static random-access memory Electrical and Electronic Engineering business Computer hardware Access time Efficient energy use |
Zdroj: | IEEE Transactions on Circuits and Systems I: Regular Papers. 66:4172-4185 |
ISSN: | 1558-0806 1549-8328 |
DOI: | 10.1109/tcsi.2019.2928043 |
Popis: | Computing-in-memory (CIM) is a promising approach to reduce the latency and improve the energy efficiency of deep neural network (DNN) artificial intelligence (AI) edge processors. However, SRAM-based CIM (SRAM-CIM) faces practical challenges in terms of area overhead, performance, energy efficiency, and yield against variations in data patterns and transistor performance. This paper employed a circuit-system co-design methodology to develop a SRAM-CIM unit-macro for a binary-based fully connected neural network (FCNN) layer of the DNN AI edge processors. The proposed SRAM-CIM unit-macro supports two binarized neural network models: an XNOR neural network (XNORNN) and a modified binary neural network (MBNN). To achieve compact area, fast access time, robust operations, and high energy-efficiency, our proposed SRAM-CIM uses a split-wordline compact-rule 6T SRAM and circuit techniques, including a dynamic input-aware reference generation (DIARG) scheme, an algorithm-dependent asymmetric control (ADAC) scheme, a write disturb-free (WDF) scheme, and a common-mode-insensitive small offset voltage-mode sensing amplifier (CMI-VSA). A fabricated 65-nm 4-Kb SRAM-CIM unit-macro achieved 2.4- and 2.3-ns product-sum access times for a FCNN layer using XNORNN and MBNN, respectively. The measured maximum energy efficiency reached 30.49 TOPS/W for XNORNN and 55.8 TOPS/W for the MBNN modes. |
Databáze: | OpenAIRE |
Externí odkaz: |