Výsledky vyhledávání

Report

Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems

Autor: Holder, Joshua, Jaques, Natasha, Mesbahi, Mehran

Assignment problems are a classic combinatorial optimization problem in which a group of agents must be assigned to a group of tasks such that maximum utility is achieved while satisfying assignment constraints. Given the utility of each agent comple

Externí odkaz: http://arxiv.org/abs/2412.15573

Zobrazit plný text záznamu

Report

Learning to Cooperate with Humans using Generative Agents

Autor: Liang, Yancheng, Chen, Daphne, Gupta, Abhishek, Du, Simon S., Jaques, Natasha

Training agents that can coordinate zero-shot with humans is a key mission in multi-agent reinforcement learning (MARL). Current algorithms focus on training simulated human partner policies which are then used to train a Cooperator agent. The simula

Externí odkaz: http://arxiv.org/abs/2411.13934

Zobrazit plný text záznamu

Report

InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma

Autor: Hou, Xiaoxuan, Yuan, Jiayi, Leibo, Joel Z., Jaques, Natasha

InvestESG is a novel multi-agent reinforcement learning (MARL) benchmark designed to study the impact of Environmental, Social, and Governance (ESG) disclosure mandates on corporate climate investments. Supported by both PyTorch and JAX implementatio

Externí odkaz: http://arxiv.org/abs/2411.09856

Zobrazit plný text záznamu

Report

PadChest-GR: A Bilingual Chest X-ray Dataset for Grounded Radiology Report Generation

Autor: Castro, Daniel C., Bustos, Aurelia, Bannur, Shruthi, Hyland, Stephanie L., Bouzid, Kenza, Wetscherek, Maria Teodora, Sánchez-Valverde, Maria Dolores, Jaques-Pérez, Lara, Pérez-Rodríguez, Lourdes, Takeda, Kenji, Salinas, José María, Alvarez-Valle, Javier, Herrero, Joaquín Galant, Pertusa, Antonio

Radiology report generation (RRG) aims to create free-text radiology reports from clinical imaging. Grounded radiology report generation (GRRG) extends RRG by including the localisation of individual findings on the image. Currently, there are no man

Externí odkaz: http://arxiv.org/abs/2411.05085

Zobrazit plný text záznamu

Report

Infer Human's Intentions Before Following Natural Language Instructions

Autor: Wan, Yanming, Wu, Yue, Wang, Yiping, Mao, Jiayuan, Jaques, Natasha

For AI agents to be helpful to humans, they should be able to follow natural language instructions to complete everyday cooperative tasks in human environments. However, real human instructions inherently possess ambiguity, because the human speakers

Externí odkaz: http://arxiv.org/abs/2409.18073

Zobrazit plný text záznamu

Report

Personalizing Reinforcement Learning from Human Feedback with Variational Preference Learning

Autor: Poddar, Sriyash, Wan, Yanming, Ivison, Hamish, Gupta, Abhishek, Jaques, Natasha

Reinforcement Learning from Human Feedback (RLHF) is a powerful paradigm for aligning foundation models to human values and preferences. However, current RLHF techniques cannot account for the naturally occurring differences in individual human prefe

Externí odkaz: http://arxiv.org/abs/2408.10075

Zobrazit plný text záznamu

Report

Achieving Human Level Competitive Robot Table Tennis

Achieving human-level speed and performance on real world tasks is a north star for the robotics research community. This work takes a step towards that goal and presents the first learned robot agent that reaches amateur human-level performance in c

Externí odkaz: http://arxiv.org/abs/2408.03906

Zobrazit plný text záznamu

Report

Exploring the Plausibility of Hate and Counter Speech Detectors with Explainable AI

Autor: Böck, Adrian Jaques, Slijepčević, Djordje, Zeppelzauer, Matthias

In this paper we investigate the explainability of transformer models and their plausibility for hate speech and counter speech detection. We compare representatives of four different explainability approaches, i.e., gradient-based, perturbation-base

Externí odkaz: http://arxiv.org/abs/2407.20274

Zobrazit plný text záznamu

Report

A comprehensive and easy-to-use multi-domain multi-task medical imaging meta-dataset (MedIMeta)

Autor: Woerner, Stefano, Jaques, Arthur, Baumgartner, Christian F.

While the field of medical image analysis has undergone a transformative shift with the integration of machine learning techniques, the main challenge of these techniques is often the scarcity of large, diverse, and well-annotated datasets. Medical i

Externí odkaz: http://arxiv.org/abs/2404.16000

Zobrazit plný text záznamu

Report

Moral Foundations of Large Language Models

Autor: Abdulhai, Marwa, Serapio-Garcia, Gregory, Crepy, Clément, Valter, Daria, Canny, John, Jaques, Natasha

Moral foundations theory (MFT) is a psychological assessment tool that decomposes human moral reasoning into five factors, including care/harm, liberty/oppression, and sanctity/degradation (Graham et al., 2009). People vary in the weight they place o

Externí odkaz: http://arxiv.org/abs/2310.15337

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání