Impact of Heterogeneity and Risk Aversion on Task Allocation in Multi-Agent Teams
Autor: | Alparslan Emrah Bayrak, Haochen Wu, Jonathon M. Smereka, Amin Ghadami, Bogdan I. Epureanu |
---|---|
Rok vydání: | 2021 |
Předmět: |
Control and Optimization
Computer science Risk aversion Mechanical Engineering Distributed computing Biomedical Engineering Markov process Computer Science Applications Task (project management) Human-Computer Interaction symbols.namesake Artificial Intelligence Control and Systems Engineering Task analysis Benchmark (computing) symbols Reinforcement learning Resource management Computer Vision and Pattern Recognition Markov decision process |
Zdroj: | IEEE Robotics and Automation Letters. 6:7065-7072 |
ISSN: | 2377-3774 |
DOI: | 10.1109/lra.2021.3097259 |
Popis: | Cooperative multi-agent decision-making is a ubiquitous problem with many real-world applications. In many practical applications, it is desirable to design a multi-agent team with a heterogeneous composition where the agents can have different capabilities and levels of risk tolerance to address diverse requirements. While heterogeneity in multi-agent teams offers benefits, new challenges arise including how to find optimal heterogeneous team compositions and how to dynamically distribute tasks among agents in complex operations. In this work, we develop an artificial intelligence framework for multi-agent heterogeneous teams to dynamically learn task distributions among agents through reinforcement learning. The framework extends Decentralized Partially Observable Markov Decision Processes (Dec-POMDP) to be compatible to model various types of heterogeneity. We demonstrate our approach with a benchmark problem on a disaster relief scenario. The effect of heterogeneity and risk aversion in agent capabilities and decision-making strategies on the performance of multi-agent teams in uncertain environments is analyzed. Results show that a well-designed heterogeneous team outperforms its homogeneous counterpart and possesses higher adaptivity in uncertain environments. |
Databáze: | OpenAIRE |
Externí odkaz: |