Search results - "Schulhoff, Sander"

Report

GPT Deciphering Fedspeak: Quantifying Dissent Among Hawks and Doves

Authors: Peskoff, Denis, Visokay, Adam, Schulhoff, Sander, Wachspress, Benjamin, Blinder, Alan, Stewart, Brandon M.

Markets and policymakers around the world hang on the consequential monetary policy decisions made by the Federal Open Market Committee (FOMC). Publicly available textual documentation of their meetings provides insight into members' attitudes about

External link: http://arxiv.org/abs/2407.19110

Report

Gymnasium: A Standard Interface for Reinforcement Learning Environments

Authors: Towers, Mark, Kwiatkowski, Ariel, Terry, Jordan, Balis, John U., De Cola, Gianluca, Deleu, Tristan, Goulão, Manuel, Kallinteris, Andreas, Krimmel, Markus, KG, Arjun, Perez-Vicente, Rodrigo, Pierré, Andrea, Schulhoff, Sander, Tai, Jun Jet, Tan, Hannah, Younis, Omar G.

Gymnasium is an open-source library providing an API for reinforcement learning environments. Its main contribution is a central abstraction for wide interoperability between benchmark environments and training algorithms. Gymnasium comes with variou

External link: http://arxiv.org/abs/2407.17032

Report

The Prompt Report: A Systematic Survey of Prompting Techniques

Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of prompting or prompt engineering. While prom

External link: http://arxiv.org/abs/2406.06608

Report

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks

Authors: Milani, Stephanie, Kanervisto, Anssi, Ramanauskas, Karolis, Schulhoff, Sander, Houghton, Brandon, Shah, Rohin

The MineRL BASALT competition has served to catalyze advances in learning from human feedback through four hard-to-specify tasks in Minecraft, such as create and photograph a waterfall. Given the completion of two years of BASALT competitions, we off

External link: http://arxiv.org/abs/2312.02405

Report

Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition

Authors: Schulhoff, Sander, Pinto, Jeremy, Khan, Anaum, Bouchard, Louis-François, Si, Chenglei, Anati, Svetlina, Tagliabue, Valen, Kost, Anson Liu, Carnahan, Christopher, Boyd-Graber, Jordan

Large Language Models (LLMs) are deployed in interactive contexts with direct user engagement, such as chatbots and writing assistants. These deployments are vulnerable to prompt injection and jailbreaking (collectively, prompt hacking), in which mod

External link: http://arxiv.org/abs/2311.16119

Report

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

To facilitate research in the direction of fine-tuning foundation models from human feedback, we held the MineRL BASALT Competition on Fine-Tuning from Human Feedback at NeurIPS 2022. The BASALT challenge asks teams to compete to develop algorithms t

External link: http://arxiv.org/abs/2303.13512
