Showing 1 - 6 of 6 for search: '"Schulhoff, Sander"'
Author:
Peskoff, Denis, Visokay, Adam, Schulhoff, Sander, Wachspress, Benjamin, Blinder, Alan, Stewart, Brandon M.
Markets and policymakers around the world hang on the consequential monetary policy decisions made by the Federal Open Market Committee (FOMC). Publicly available textual documentation of their meetings provides insight into members' attitudes about
External link:
http://arxiv.org/abs/2407.19110
Author:
Towers, Mark, Kwiatkowski, Ariel, Terry, Jordan, Balis, John U., De Cola, Gianluca, Deleu, Tristan, Goulão, Manuel, Kallinteris, Andreas, Krimmel, Markus, KG, Arjun, Perez-Vicente, Rodrigo, Pierré, Andrea, Schulhoff, Sander, Tai, Jun Jet, Tan, Hannah, Younis, Omar G.
Gymnasium is an open-source library providing an API for reinforcement learning environments. Its main contribution is a central abstraction for wide interoperability between benchmark environments and training algorithms. Gymnasium comes with variou
External link:
http://arxiv.org/abs/2407.17032
Author:
Schulhoff, Sander, Ilie, Michael, Balepur, Nishant, Kahadze, Konstantine, Liu, Amanda, Si, Chenglei, Li, Yinheng, Gupta, Aayush, Han, HyoJung, Schulhoff, Sevien, Dulepet, Pranav Sandeep, Vidyadhara, Saurav, Ki, Dayeon, Agrawal, Sweta, Pham, Chau, Kroiz, Gerson, Li, Feileen, Tao, Hudson, Srivastava, Ashay, Da Costa, Hevander, Gupta, Saloni, Rogers, Megan L., Goncearenco, Inna, Sarli, Giuseppe, Galynker, Igor, Peskoff, Denis, Carpuat, Marine, White, Jules, Anadkat, Shyamal, Hoyle, Alexander, Resnik, Philip
Generative Artificial Intelligence (GenAI) systems are being increasingly deployed across all parts of industry and research settings. Developers and end users interact with these systems through the use of prompting or prompt engineering. While prom
External link:
http://arxiv.org/abs/2406.06608
Author:
Milani, Stephanie, Kanervisto, Anssi, Ramanauskas, Karolis, Schulhoff, Sander, Houghton, Brandon, Shah, Rohin
The MineRL BASALT competition has served to catalyze advances in learning from human feedback through four hard-to-specify tasks in Minecraft, such as create and photograph a waterfall. Given the completion of two years of BASALT competitions, we off
External link:
http://arxiv.org/abs/2312.02405
Author:
Schulhoff, Sander, Pinto, Jeremy, Khan, Anaum, Bouchard, Louis-François, Si, Chenglei, Anati, Svetlina, Tagliabue, Valen, Kost, Anson Liu, Carnahan, Christopher, Boyd-Graber, Jordan
Large Language Models (LLMs) are deployed in interactive contexts with direct user engagement, such as chatbots and writing assistants. These deployments are vulnerable to prompt injection and jailbreaking (collectively, prompt hacking), in which mod
External link:
http://arxiv.org/abs/2311.16119
Author:
Milani, Stephanie, Kanervisto, Anssi, Ramanauskas, Karolis, Schulhoff, Sander, Houghton, Brandon, Mohanty, Sharada, Galbraith, Byron, Chen, Ke, Song, Yan, Zhou, Tianze, Yu, Bingquan, Liu, He, Guan, Kai, Hu, Yujing, Lv, Tangjie, Malato, Federico, Leopold, Florian, Raut, Amogh, Hautamäki, Ville, Melnik, Andrew, Ishida, Shu, Henriques, João F., Klassert, Robert, Laurito, Walter, Novoseller, Ellen, Goecks, Vinicius G., Waytowich, Nicholas, Watkins, David, Miller, Josh, Shah, Rohin
To facilitate research in the direction of fine-tuning foundation models from human feedback, we held the MineRL BASALT Competition on Fine-Tuning from Human Feedback at NeurIPS 2022. The BASALT challenge asks teams to compete to develop algorithms t
External link:
http://arxiv.org/abs/2303.13512