Digital Multiplier-Less Spiking Neural Network Architecture of Reinforcement Learning in a Context-Dependent Task
Autor: | Yulia Sandamirskaya, Hajar Asgari, Raphaela Kreiser, Babak Mazloom-Nezhad Maybodi |
---|---|
Přispěvatelé: | University of Zurich, Asgari, Hajar |
Jazyk: | angličtina |
Rok vydání: | 2020 |
Předmět: |
Spiking neural network
Artificial neural network Computer science business.industry 2208 Electrical and Electronic Engineering 020208 electrical & electronic engineering Autonomous agent Supervised learning 02 engineering and technology 020202 computer hardware & architecture Neuromorphic engineering Asynchronous communication 0202 electrical engineering electronic engineering information engineering Reinforcement learning 570 Life sciences biology Artificial intelligence Electrical and Electronic Engineering Field-programmable gate array business 10194 Institute of Neuroinformatics |
Popis: | Neuromorphic engineers develop event-based spiking neural networks (SNNs) in hardware. These SNNs closer resemble the dynamics of biological neurons than conventional artificial neural networks and achieve higher efficiency thanks to the event-based, asynchronous nature of the processing. Learning in the hardware SNNs is a more challenging task, however. The conventional supervised learning methods cannot be directly applied to SNNs due to the non-differentiable event-based nature of their activation. For this reason, learning in SNNs is currently an active research topic. Reinforcement learning (RL) is a particularly promising learning method for neuromorphic implementation, especially in the field of autonomous agents’ control. An SNN realization of a bio-inspired RL model is in the focus of this work. In particular, in this article, we propose a new digital multiplier-less hardware implementation of an SNN with RL capability. We show how the proposed network can learn stimulus-response associations in a context-dependent task. The task is inspired by biological experiments that study RL in animals. The architecture is described using the standard digital design flow and uses power- and space-efficient cores. The proposed hardware SNN model is compared both to data from animal experiments and to a computational model. We perform a comparison to the behavioral experiments using a robot, to show the learning capability in hardware in a closed sensory-motor loop. |
Databáze: | OpenAIRE |
Externí odkaz: |