Autor: |
Alessandro Tasora, Simone Benatti, Dario Mangoni |
Rok vydání: |
2019 |
Předmět: |
|
Zdroj: |
Multibody Dynamics 2019 ISBN: 9783030231316 |
DOI: |
10.1007/978-3-030-23132-3_47 |
Popis: |
In this paper we use the Proximal Policy Optimization (PPO) deep reinforcement learning algorithm to train a Neural Network to control a four-legged robot in simulation. Reinforcement learning in general can learn complex behavior policies from simple state-reward tuples datasets and PPO in particular has proved its effectiveness in solving complex tasks with continuous states and actions. Moreover, since it is model-free, it is general and can adapt to changes in the environment or in the robot itself. |
Databáze: |
OpenAIRE |
Externí odkaz: |
|