Autor: |
Tomohiro Yoshida, Yukinori Kakazu, Yuko Ishiwaka, Hiroshi Yokoi |
Rok vydání: |
2004 |
Předmět: |
|
Zdroj: |
IEEE International Conference on Systems, Man and Cybernetics. |
DOI: |
10.1109/icsmc.2002.1173252 |
Popis: |
Our approach is to achieve the cooperative behavior of autonomous decentralized agents constructed with Q-Learning, which is a type of reinforcement learning. The piano mover's problem is employed. We propose the multi agent architecture that has an external agent and internal agents. Internal agents are heterogeneous and they can communicate with each other. The movement of the external agent depends on the composition of the actions of internal agents. According to learning its own shape by internal agents, it is expected that the agents avoid obstacles. We simulate our method on a two-dimensional continuous world. The results show the effect of our method. |
Databáze: |
OpenAIRE |
Externí odkaz: |
|