Always-On Speech Recognition Using TrueNorth, a Reconfigurable, Neurosynaptic Processor
Authors: | Bryan L. Jackson, Andrew S. Cassidy, John V. Arthur, Myron D. Flickner, Jack Sampson, R Davis, Dharmendra S. Modha, Alexander Andreopoulos, Wei-Yu Tsai, Michael DeBole, Vijaykrishnan Narayanan |
---|---|
Year of publication: | 2017 |
Subject: | 010302 applied physics; Computer science; Speech recognition; Feature extraction; Reconfigurability; 02 engineering and technology; Network topology; computer.software_genre; 01 natural sciences; TrueNorth; 020202 computer hardware & architecture; Theoretical Computer Science; Computational Theory and Mathematics; Neuromorphic engineering; Hardware and Architecture; 0103 physical sciences; 0202 electrical engineering, electronic engineering, information engineering; Mel-frequency cepstrum; Audio signal processing; computer; Software |
Source: | IEEE Transactions on Computers. 66:996-1007 |
ISSN: | 0018-9340 |
Description: | Deep neural networks (DNN) have been shown to be very effective at solving challenging problems in several areas of computing, including vision, speech, and natural language processing. However, traditional platforms for implementing these DNNs are often very power hungry, which has led to significant efforts in the development of configurable platforms capable of implementing these DNNs efficiently. One of these platforms, the IBM TrueNorth processor, has demonstrated very low operating power in performing visual computing and neural network classification tasks in real time. The neuron computation, synaptic memory, and communication fabrics are all configurable, so that a wide range of network types and topologies can be mapped to TrueNorth. This reconfigurability translates into the capability to support a wide range of low-power functions in addition to feed-forward DNN classifiers, including, for example, the audio processing functions presented here. In this work, we propose an end-to-end audio processing pipeline that is implemented entirely on a TrueNorth processor and designed specifically to leverage the highly parallel, low-precision computing primitives TrueNorth offers. As part of this pipeline, we develop an audio feature extractor (LATTE) designed for implementation on TrueNorth, and explore the tradeoffs among several design variants in terms of accuracy, power, and performance. We customize the energy-efficient deep neuromorphic network structures that our design uses as the classifier and show how classifier parameters can trade between power and accuracy. In addition to enabling a wide range of diverse functions, the reconfigurability of TrueNorth enables re-training and re-programming the system to satisfy varying energy, speed, area, and accuracy requirements.
The resulting system's end-to-end power consumption can be as low as $14.43\;\text{mW}$, which would give up to 100 hours of continuous usage with button cell batteries (CR3023, $1.5\;\text{Whr}$) or 450 hours with cellphone batteries (iPhone 6s, $6.55\;\text{Whr}$). |
Database: | OpenAIRE |
External link: |
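
The battery-life figures quoted in the description follow from dividing battery capacity by average power draw. The sketch below reproduces that arithmetic under idealized assumptions that are not stated in the record: the full rated capacity is usable, the draw is constant at the quoted 14.43 mW, and there are no conversion losses. The function name `battery_hours` is hypothetical and used only for illustration.

```python
def battery_hours(capacity_wh: float, power_mw: float) -> float:
    """Ideal runtime in hours: capacity (Wh) divided by average power (W).

    Assumes constant power draw and a fully usable rated capacity.
    """
    power_w = power_mw / 1000.0  # convert milliwatts to watts
    return capacity_wh / power_w


POWER_MW = 14.43  # end-to-end power quoted in the description

button_cell_hours = battery_hours(1.5, POWER_MW)   # button cell, 1.5 Wh
cellphone_hours = battery_hours(6.55, POWER_MW)    # cellphone battery, 6.55 Wh

print(f"button cell: {button_cell_hours:.1f} h")
print(f"cellphone:   {cellphone_hours:.1f} h")
```

Under these assumptions the results come out at roughly 104 and 454 hours, consistent with the rounded figures of 100 and 450 hours given in the description.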