Embodied Synaptic Plasticity With Online Reinforcement Learning.
Autor: | Kaiser J; FZI Research Center for Information Technology, Karlsruhe, Germany., Hoff M; FZI Research Center for Information Technology, Karlsruhe, Germany.; Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria., Konle A; FZI Research Center for Information Technology, Karlsruhe, Germany., Vasquez Tieck JC; FZI Research Center for Information Technology, Karlsruhe, Germany., Kappel D; Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria.; Bernstein Center for Computational Neuroscience, III Physikalisches Institut-Biophysik, Georg-August Universität, Göttingen, Germany.; Technische Universität Dresden, Chair of Highly Parallel VLSI Systems and Neuromorphic Circuits, Dresden, Germany., Reichard D; FZI Research Center for Information Technology, Karlsruhe, Germany., Subramoney A; Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria., Legenstein R; Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria., Roennau A; FZI Research Center for Information Technology, Karlsruhe, Germany., Maass W; Institute for Theoretical Computer Science, Graz University of Technology, Graz, Austria., Dillmann R; FZI Research Center for Information Technology, Karlsruhe, Germany. |
---|---|
Jazyk: | angličtina |
Zdroj: | Frontiers in neurorobotics [Front Neurorobot] 2019 Oct 03; Vol. 13, pp. 81. Date of Electronic Publication: 2019 Oct 03 (Print Publication: 2019). |
DOI: | 10.3389/fnbot.2019.00081 |
Abstrakt: | The endeavor to understand the brain involves multiple collaborating research fields. Classically, synaptic plasticity rules derived by theoretical neuroscientists are evaluated in isolation on pattern classification tasks. This contrasts with the biological brain which purpose is to control a body in closed-loop. This paper contributes to bringing the fields of computational neuroscience and robotics closer together by integrating open-source software components from these two fields. The resulting framework allows to evaluate the validity of biologically-plausibe plasticity models in closed-loop robotics environments. We demonstrate this framework to evaluate Synaptic Plasticity with Online REinforcement learning (SPORE), a reward-learning rule based on synaptic sampling, on two visuomotor tasks: reaching and lane following. We show that SPORE is capable of learning to perform policies within the course of simulated hours for both tasks. Provisional parameter explorations indicate that the learning rate and the temperature driving the stochastic processes that govern synaptic learning dynamics need to be regulated for performance improvements to be retained. We conclude by discussing the recent deep reinforcement learning techniques which would be beneficial to increase the functionality of SPORE on visuomotor tasks. (Copyright © 2019 Kaiser, Hoff, Konle, Vasquez Tieck, Kappel, Reichard, Subramoney, Legenstein, Roennau, Maass and Dillmann.) |
Databáze: | MEDLINE |
Externí odkaz: |