Search results - "Christoffersen, Phillip"

Dissertation/Thesis

Mitigating Social Dilemmas in Multi-Agent Reinforcement Learning with Formal Contracting

Author: Christoffersen, Phillip Johannes Kerr

As society deploys more and more sophisticated artificial intelligence (AI) agents, it will be increasingly necessary for such agents, while pursuing their own objectives, to coexist in common environments in the physical or digital worlds. This may

External link: https://hdl.handle.net/1721.1/153795

Report

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Reinforcement learning from human feedback (RLHF) is a technique for training AI systems to align with human goals. RLHF has emerged as the central method used to finetune state-of-the-art large language models (LLMs). Despite this popularity, there

External link: http://arxiv.org/abs/2307.15217

Report

Learning Symbolic Representations for Reinforcement Learning of Non-Markovian Behavior

Authors: Christoffersen, Phillip J. K.; Li, Andrew C.; Icarte, Rodrigo Toro; McIlraith, Sheila A.

Many real-world reinforcement learning (RL) problems necessitate learning complex, temporally extended behavior that may only receive reward signal when the behavior is completed. If the reward-worthy behavior is known, it can be specified in terms o

External link: http://arxiv.org/abs/2301.02952

Report

Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL

Authors: Haupt, Andreas A.; Christoffersen, Phillip J. K.; Damani, Mehul; Hadfield-Menell, Dylan

Multi-agent Reinforcement Learning (MARL) is a powerful tool for training autonomous agents acting independently in a common environment. However, it can lead to sub-optimal behavior when individual incentives and group incentives diverge. Humans are

External link: http://arxiv.org/abs/2208.10469

Report

The act of remembering: a study in partially observable reinforcement learning

Authors: Icarte, Rodrigo Toro; Valenzano, Richard; Klassen, Toryn Q.; Christoffersen, Phillip; Farahmand, Amir-massoud; McIlraith, Sheila A.

Reinforcement Learning (RL) agents typically learn memoryless policies, that is, policies that only consider the last observation when selecting actions. Learning memoryless policies is efficient and optimal in fully observable environments. However, some fo

External link: http://arxiv.org/abs/2010.01753

Academic article

Formal contracts mitigate social dilemmas in multi-agent reinforcement learning.

Authors: Haupt, Andreas; Christoffersen, Phillip; Damani, Mehul; Hadfield-Menell, Dylan

Published in: Autonomous Agents & Multi-Agent Systems; Dec 2024, Vol. 38, Issue 2, pp. 1-38 (38 pages)

Get It in Writing: Formal Contracts Mitigate Social Dilemmas in Multi-Agent RL

Authors: Christoffersen, Phillip J. K.; Haupt, Andreas A.; Hadfield-Menell, Dylan

Multi-agent reinforcement learning (MARL) is a powerful tool for training automated systems acting independently in a common environment. However, it can lead to sub-optimal behavior when individual incentives and group incentives diverge. Humans are

External links: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0f76d7046a4c3181bb52437897948148
http://arxiv.org/abs/2208.10469
