A Deep Reinforcement Learning Algorithm Based on Tetanic Stimulation and Amnesic Mechanisms for Continuous Control of Multi-DOF Manipulator

Autor:	Yangyang Hou, Huajie Hong, Dasheng Xu, Zhe Zeng, Yaping Chen, Zhaoyang Liu
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	multi-DOF manipulator tetanic stimulation amnesia mechanism deep reinforcement learning Materials of engineering and construction. Mechanics of materials TA401-492 Production of electric energy or power. Powerplants. Central stations TK1001-1841
Zdroj:	Actuators, Vol 10, Iss 10, p 254 (2021)
Druh dokumentu:	article
ISSN:	2076-0825
DOI:	10.3390/act10100254
Popis:	Deep Reinforcement Learning (DRL) has been an active research area in view of its capability in solving large-scale control problems. Until presently, many algorithms have been developed, such as Deep Deterministic Policy Gradient (DDPG), Twin-Delayed Deep Deterministic Policy Gradient (TD3), and so on. However, the converging achievement of DRL often requires extensive collected data sets and training episodes, which is data inefficient and computing resource consuming. Motivated by the above problem, in this paper, we propose a Twin-Delayed Deep Deterministic Policy Gradient algorithm with a Rebirth Mechanism, Tetanic Stimulation and Amnesic Mechanisms (ATRTD3), for continuous control of a multi-DOF manipulator. In the training process of the proposed algorithm, the weighting parameters of the neural network are learned using Tetanic stimulation and Amnesia mechanism. The main contribution of this paper is that we show a biomimetic view to speed up the converging process by biochemical reactions generated by neurons in the biological brain during memory and forgetting. The effectiveness of the proposed algorithm is validated by a simulation example including the comparisons with previously developed DRL algorithms. The results indicate that our approach shows performance improvement in terms of convergence speed and precision.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/fbbffaa4ad1a48e8b1417fa49837791d Zobrazit plný text záznamu View record in DOAJ