Showing 1 - 7 of 7 for search: '"Bloembergen, Daniel"'
We present RoM-Q, a new Q-learning-like algorithm for finding policies robust to attacks in multi-agent systems (MAS). We consider a novel type of attack, where a team of adversaries, aware of the optimal multi-agent Q-value function, performs a wo
External link:
https://explore.openaire.eu/search/publication?articleId=narcis______::243f3c3ffeffb0366c67cb6635db34ac
https://ir.cwi.nl/pub/30869
We introduce and study a multiplayer version of the classical Ultimatum Game in which a group of N Proposers jointly offers a division of resources to a group of M Responders. In general, the proposal is rejected if the (average) proposed offer is lo
External link:
https://explore.openaire.eu/search/publication?articleId=narcis______::022ab640ea29b913171cb1deed29b124
https://ir.cwi.nl/pub/28687
Negotiation is a complex problem, in which the variety of settings and opponents that may be encountered prohibits the use of a single predefined negotiation strategy. Hence the agent should be able to learn such a strategy autonomously. To this end
External link:
https://explore.openaire.eu/search/publication?articleId=narcis______::ca406b86d9b3510783235ff1d6fb9940
https://ir.cwi.nl/pub/28690
Models of emotion, particularly those based on the Ortony, Clore, and Collins (OCC) account of emotions, have been used as part of agents' decision making processes to explore their effects on cooperation within social dilemmas [7, 19, 22]. We analys
External link:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::b5497c15f0068806447872c6dd7dd7c7
https://ir.cwi.nl/pub/30043
Author:
Serrano, Jonathan; Morales, Eduardo; Hernandez-Leal, Pablo; Bloembergen, Daniel; Kaisers, Michael
Agents acting in real-world scenarios often have constraints such as finite budgets or daily job performance targets. While repeated (episodic) tasks can be solved with existing RL algorithms, methods need to be extended if the repetition depends on
External link:
https://explore.openaire.eu/search/publication?articleId=narcis______::724adc7a9ed29be62a0a4755f96a9b08
https://ir.cwi.nl/pub/28329