Zobrazeno 1 - 10
of 2 388
pro vyhledávání: '"A W, Bradley"'
Large language models (LLMs) must often respond to highly ambiguous user requests. In such cases, the LLM's best response may be to ask a clarifying question to elicit more information. We observe existing LLMs often respond by presupposing a single
Externí odkaz:
http://arxiv.org/abs/2410.13788
Autor:
Hejna, Joey, Rafailov, Rafael, Sikchi, Harshit, Finn, Chelsea, Niekum, Scott, Knox, W. Bradley, Sadigh, Dorsa
Reinforcement Learning from Human Feedback (RLHF) has emerged as a popular paradigm for aligning models with human intent. Typically RLHF algorithms operate in two phases: first, use human preferences to learn a reward function and second, align the
Externí odkaz:
http://arxiv.org/abs/2310.13639
Autor:
Knox, W. Bradley, Hatgis-Kessell, Stephane, Adalgeirsson, Sigurdur Orn, Booth, Serena, Dragan, Anca, Stone, Peter, Niekum, Scott
We consider algorithms for learning reward functions from human preferences over pairs of trajectory segments, as used in reinforcement learning from human feedback (RLHF). Most recent work assumes that human preferences are generated based only upon
Externí odkaz:
http://arxiv.org/abs/2310.02456
Autor:
McKibben, W. Bradley1 (AUTHOR) wmckibb@ju.edu, Lenz, A. Stephen2 (AUTHOR), Alvero, Arianna1 (AUTHOR)
Publikováno v:
Counseling Outcome Research & Evaluation. Oct2024, p1-15. 15p. 1 Illustration.
Publikováno v:
International Journal of Sports Physiology & Performance; Sep2024, Vol. 19 Issue 9, p897-904, 8p
Autor:
Wendel, W. Bradley, author
Publikováno v:
Methodology in Private Law Theory : Between New Private Law and Rechtsdogmatik, 2024.
Externí odkaz:
https://doi.org/10.1093/oso/9780198885306.003.0011
Autor:
Knox, W. Bradley, Hatgis-Kessell, Stephane, Booth, Serena, Niekum, Scott, Stone, Peter, Allievi, Alessandro
The utility of reinforcement learning is limited by the alignment of reward functions with the interests of human stakeholders. One promising method for alignment is to learn the reward function from human-generated preferences between pairs of traje
Externí odkaz:
http://arxiv.org/abs/2206.02231
Autor:
Jacob N. Dowe, B.S., A.T.C., Matthew W. Bradley, M.P.H., Lance E. LeClere, M.D., M.C., U.S.N.R., Jonathan F. Dickens, M.D., M.C., U.S.A.R.
Publikováno v:
Arthroscopy Techniques, Vol 13, Iss 6, Pp 102972- (2024)
Understanding the anatomical structure of a patient’s shoulder joint is essential in surgical decision-making, especially regarding glenohumeral bone loss. The use of various imaging techniques, such as magnetic resonance imaging (MRI) and computed
Externí odkaz:
https://doaj.org/article/6892188808b244d99a41af2940364150
Autor:
Al-Shawwa, Abdul, Ost, Kalum, Anderson, David, Cho, Newton, Evaniew, Nathan, Jacobs, W. Bradley, Martin, Allan R., Gaekwad, Ranjeet, Tripathy, Saswati, Bouchard, Jacques, Casha, Steve, Cho, Roger, duPlessis, Stephen, Lewkonia, Peter, Nicholls, Fred, Salo, Paul T., Soroceanu, Alex, Swamy, Ganesh, Thomas, Kenneth C., Yang, Michael M.H., Cohen-Adad, Julien, Cadotte, David W.
Publikováno v:
In The Spine Journal September 2024 24(9):1605-1614
Autor:
Althagafi, Alwalaa, Dea, Nicolas, Evaniew, Nathan, Rampersaud, Raja Y., Jacobs, W. Bradley, Paquet, Jérome, Wilson, Jefferson R., Hall, Hamilton, Bailey, Christopher S., Weber, Michael H., Nataraj, Andrew, Attabib, Najmedden, Cadotte, David W., Phan, Philippe, Christie, Sean D., Fisher, Charles G., Manson, Neil, Thomas, Kenneth, McIntosh, Greg, Charest-Morin, Raphaële
Publikováno v:
In The Spine Journal September 2024 24(9):1595-1604