Digital Multiplier-Less Spiking Neural Network Architecture of Reinforcement Learning in a Context-Dependent Task

Autor:	Yulia Sandamirskaya, Hajar Asgari, Raphaela Kreiser, Babak Mazloom-Nezhad Maybodi
Přispěvatelé:	University of Zurich, Asgari, Hajar
Jazyk:	angličtina
Rok vydání:	2020
Předmět:	Spiking neural network Artificial neural network Computer science business.industry 2208 Electrical and Electronic Engineering 020208 electrical & electronic engineering Autonomous agent Supervised learning 02 engineering and technology 020202 computer hardware & architecture Neuromorphic engineering Asynchronous communication 0202 electrical engineering electronic engineering information engineering Reinforcement learning 570 Life sciences biology Artificial intelligence Electrical and Electronic Engineering Field-programmable gate array business 10194 Institute of Neuroinformatics
Popis:	Neuromorphic engineers develop event-based spiking neural networks (SNNs) in hardware. These SNNs closer resemble the dynamics of biological neurons than conventional artificial neural networks and achieve higher efficiency thanks to the event-based, asynchronous nature of the processing. Learning in the hardware SNNs is a more challenging task, however. The conventional supervised learning methods cannot be directly applied to SNNs due to the non-differentiable event-based nature of their activation. For this reason, learning in SNNs is currently an active research topic. Reinforcement learning (RL) is a particularly promising learning method for neuromorphic implementation, especially in the field of autonomous agents’ control. An SNN realization of a bio-inspired RL model is in the focus of this work. In particular, in this article, we propose a new digital multiplier-less hardware implementation of an SNN with RL capability. We show how the proposed network can learn stimulus-response associations in a context-dependent task. The task is inspired by biological experiments that study RL in animals. The architecture is described using the standard digital design flow and uses power- and space-efficient cores. The proposed hardware SNN model is compared both to data from animal experiments and to a computational model. We perform a comparison to the behavioral experiments using a robot, to show the learning capability in hardware in a closed sensory-motor loop.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ee5b17413009ab0f0372334f3996ac73 https://www.zora.uzh.ch/id/eprint/200376/ Zobrazit plný text záznamu