Výsledky vyhledávání

Report

MobileSafetyBench: Evaluating Safety of Autonomous Agents in Mobile Device Control

Autor: Lee, Juyong, Hahm, Dongyoon, Choi, June Suk, Knox, W. Bradley, Lee, Kimin

Autonomous agents powered by large language models (LLMs) show promising potential in assistive tasks across various domains, including mobile device control. As these agents interact directly with personal information and device settings, ensuring t

Externí odkaz: http://arxiv.org/abs/2410.17520

Zobrazit plný text záznamu

Report

Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions

Autor: Zhang, Michael J. Q., Knox, W. Bradley, Choi, Eunsol

Large language models (LLMs) must often respond to highly ambiguous user requests. In such cases, the LLM's best response may be to ask a clarifying question to elicit more information. We observe existing LLMs often respond by presupposing a single

Externí odkaz: http://arxiv.org/abs/2410.13788

Zobrazit plný text záznamu

Kniha

Women and Scottish Society, 1700-2000. [elektronicky zdroj]

Autor: Knox, W. W. J.

Externí odkaz: Kolekce e-knih KNAV Registrovani uzivatele: plny text online 5 minut, dalsi pristup na vyzadani. Registered users: full text online 5 minutes, further access on requests.

Report

Contrastive Preference Learning: Learning from Human Feedback without RL

Autor: Hejna, Joey, Rafailov, Rafael, Sikchi, Harshit, Finn, Chelsea, Niekum, Scott, Knox, W. Bradley, Sadigh, Dorsa

Reinforcement Learning from Human Feedback (RLHF) has emerged as a popular paradigm for aligning models with human intent. Typically RLHF algorithms operate in two phases: first, use human preferences to learn a reward function and second, align the

Externí odkaz: http://arxiv.org/abs/2310.13639

Zobrazit plný text záznamu

Report

Learning Optimal Advantage from Preferences and Mistaking it for Reward

Autor: Knox, W. Bradley, Hatgis-Kessell, Stephane, Adalgeirsson, Sigurdur Orn, Booth, Serena, Dragan, Anca, Stone, Peter, Niekum, Scott

We consider algorithms for learning reward functions from human preferences over pairs of trajectory segments, as used in reinforcement learning from human feedback (RLHF). Most recent work assumes that human preferences are generated based only upon

Externí odkaz: http://arxiv.org/abs/2310.02456

Zobrazit plný text záznamu

Kniha

Jimmy Reid : a clyde-built man / W. W. J. Knox and A. McKinlay. [elektronicky zdroj]

Autor: Knox, W. W. J., author

Externí odkaz: Kolekce e-knih KNAV

Kniha

Jimmy Reid : A Clyde-Built Man. Elektronicky zdroj

Autor: Knox, W. W. J.

Externí odkaz: Kolekce e-knih KNAV Kolekce e-knih KNAV Registrovani uzivatele: plny text online 5 minut, dalsi pristup na vyzadani. Registered users: full text online 5 minutes, further access on request.

Report

Models of human preference for learning reward functions

Autor: Knox, W. Bradley, Hatgis-Kessell, Stephane, Booth, Serena, Niekum, Scott, Stone, Peter, Allievi, Alessandro

The utility of reinforcement learning is limited by the alignment of reward functions with the interests of human stakeholders. One promising method for alignment is to learn the reward function from human-generated preferences between pairs of traje

Externí odkaz: http://arxiv.org/abs/2206.02231

Zobrazit plný text záznamu

Report

Reward (Mis)design for Autonomous Driving

Autor: Knox, W. Bradley, Allievi, Alessandro, Banzhaf, Holger, Schmitt, Felix, Stone, Peter

This article considers the problem of diagnosing certain common errors in reward design. Its insights are also applicable to the design of cost functions and performance metrics more generally. To diagnose common errors, we develop 8 simple sanity ch

Externí odkaz: http://arxiv.org/abs/2104.13906

Zobrazit plný text záznamu

Report

The EMPATHIC Framework for Task Learning from Implicit Human Feedback

Autor: Cui, Yuchen, Zhang, Qiping, Allievi, Alessandro, Stone, Peter, Niekum, Scott, Knox, W. Bradley

Reactions such as gestures, facial expressions, and vocalizations are an abundant, naturally occurring channel of information that humans provide during interactions. A robot or other agent could leverage an understanding of such implicit human feedb

Externí odkaz: http://arxiv.org/abs/2009.13649

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání