Showing 1 - 10 of 488 for search: '"Park, Ryan"'
Author:
Hamilton, Christopher W., McEwen, Alfred S., Keszthelyi, Laszlo, Carter, Lynn M., Davies, Ashley G., de Kleer, Katherine, Jessup, Kandis Lea, Jia, Xianzhe, Keane, James T., Mandt, Kathleen, Nimmo, Francis, Paranicas, Chris, Park, Ryan S., Perry, Jason E., Pommier, Anne, Radebaugh, Jani, Sutton, Sarah S., Vorburger, Audrey, Wurz, Peter, Borlina, Cauê, Haapala, Amanda F., DellaGiustina, Daniella N., Denevi, Brett W., Hörst, Sarah M., Kempf, Sascha, Khurana, Krishan K., Likar, Justin J., Masters, Adam, Mousis, Olivier, Polit, Anjani T., Bhushan, Aditya, Bland, Michael, Matsuyama, Isamu, Spencer, John
Jupiter's moon Io is a highly compelling target for future exploration that offers critical insight into tidal dissipation processes and the geology of high heat flux worlds, including primitive planetary bodies, such as the early Earth, that are sha
External link:
http://arxiv.org/abs/2408.08334
The transformation of time between the surface of the Earth, the solar system barycenter, and the surface of the Moon involves relativistic corrections. For solar system Barycentric Dynamical Time (TDB), we also require that there be no rate differen
External link:
http://arxiv.org/abs/2406.16147
Author:
Rafailov, Rafael, Chittepu, Yaswanth, Park, Ryan, Sikchi, Harshit, Hejna, Joey, Knox, Bradley, Finn, Chelsea, Niekum, Scott
Reinforcement Learning from Human Feedback (RLHF) has been crucial to the recent success of Large Language Models (LLMs); however, it is often a complex and brittle process. In the classical RLHF framework, a reward model is first trained to represen
External link:
http://arxiv.org/abs/2406.02900
Reinforcement Learning From Human Feedback (RLHF) has been critical to the success of the latest generation of generative AI models. In response to the complex nature of the classical RLHF pipeline, direct alignment algorithms such as Direct Preferen
External link:
http://arxiv.org/abs/2404.12358
We present a mathematical framework for modeling two-player noncooperative games in which one player (the defender) is uncertain of the costs of the game and the second player's (the attacker's) intention but can preemptively allocate information-gat
External link:
http://arxiv.org/abs/2404.00733
Reinforcement Learning from Human Feedback (RLHF) has been a crucial component in the recent success of Large Language Models. However, RLHF is known to exploit biases in human preferences, such as verbosity. A well-formatted and eloquent answer is of
External link:
http://arxiv.org/abs/2403.19159
Author:
Cao, Hao, Bloxham, Jeremy, Park, Ryan S., Militzer, Burkhard, Yadav, Rakesh K., Kulowski, Laura, Stevenson, David J., Bolton, Scott J.
Published in:
ApJ 959 78 (2023)
Jupiter's atmosphere-interior is a coupled fluid dynamical system strongly influenced by the rapid background rotation. While the visible atmosphere features east-west zonal winds on the order of 100 m/s (Tollefson et al. 2017), zonal flows in the dy
External link:
http://arxiv.org/abs/2311.11494
Molecular language modeling is an effective approach to generating novel chemical structures. However, these models do not a priori encode certain preferences a chemist may desire. We investigate the use of fine-tuning using Direct Preference
External link:
http://arxiv.org/abs/2310.12304
Author:
Gramigna, Edoardo, Manghi, Riccardo Lasagni, Zannoni, Marco, Tortora, Paolo, Park, Ryan S., Tommei, Giacomo, Maistre, Sébastien Le, Michel, Patrick, Castellini, Francesco, Kueppers, Michael
Published in:
Planetary and Space Science, 2024, 105906
Hera represents the European Space Agency's inaugural planetary defense space mission and plays a pivotal role in the Asteroid Impact and Deflection Assessment international collaboration with the NASA DART mission, which performed the first asteroid defle
External link:
http://arxiv.org/abs/2310.11883
Plasma-neutral interactions, including reactive kinetics, are often studied either in 0D using ODE-based descriptions or in multi-dimensional fluid or particle-based plasma codes. The latter case involves a complex assembly of procedures that are no
External link:
http://arxiv.org/abs/2310.07913