Zobrazeno 1 - 10
of 179
pro vyhledávání: '"Lau, Elaine"'
Autor:
Kumar, Priyanshu, Lau, Elaine, Vijayakumar, Saranya, Trinh, Tu, Team, Scale Red, Chang, Elaine, Robinson, Vaughn, Hendryx, Sean, Zhou, Shuyan, Fredrikson, Matt, Yue, Summer, Wang, Zifan
For safety reasons, large language models (LLMs) are trained to refuse harmful user instructions, such as assisting dangerous activities. We study an open question in this work: does the desired safety refusal, typically enforced in chat contexts, ge
Externí odkaz:
http://arxiv.org/abs/2410.13886
Generative Flow Networks (GFlowNets; GFNs) are a family of reward/energy-based generative methods for combinatorial objects, capable of generating diverse and high-utility samples. However, biasing GFNs towards producing high-utility samples is non-t
Externí odkaz:
http://arxiv.org/abs/2402.05234
Deep learning is emerging as an effective tool in drug discovery, with potential applications in both predictive and generative models. Generative Flow Networks (GFlowNets/GFNs) are a recently introduced method recognized for the ability to generate
Externí odkaz:
http://arxiv.org/abs/2310.19685
Reinforcement Learning (RL) algorithms aim to learn an optimal policy by iteratively sampling actions to learn how to maximize the total expected return, $R(x)$. GFlowNets are a special class of algorithms designed to generate diverse candidates, $x$
Externí odkaz:
http://arxiv.org/abs/2307.07674
Autor:
Obadinma, Stephen, Khattak, Faiza Khan, Wang, Shirley, Sidhom, Tania, Lau, Elaine, Robertson, Sean, Niu, Jingcheng, Au, Winnie, Munim, Alif, Bhaskar, Karthik Raja K., Wei, Bencheng, Ren, Iris, Muhammad, Waqar, Li, Erin, Ishola, Bukola, Wang, Michael, Tanner, Griffin, Shiah, Yu-Jia, Zhang, Sean X., Apponsah, Kwesi P., Patel, Kanishk, Narain, Jaswinder, Pandya, Deval, Zhu, Xiaodan, Rudzicz, Frank, Dolatabadi, Elham
Building Agent Assistants that can help improve customer service support requires inputs from industry users and their customers, as well as knowledge about state-of-the-art Natural Language Processing (NLP) technology. We combine expertise from acad
Externí odkaz:
http://arxiv.org/abs/2302.03222
Autor:
Kondrup, Flemming, Jiralerspong, Thomas, Lau, Elaine, de Lara, Nathan, Shkrob, Jacob, Tran, My Duc, Precup, Doina, Basu, Sumana
Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time consuming task. Hence, it would be benefi
Externí odkaz:
http://arxiv.org/abs/2210.02552
Reasoning about the future -- understanding how decisions in the present time affect outcomes in the future -- is one of the central challenges for reinforcement learning (RL), especially in highly-stochastic or partially observable environments. Whi
Externí odkaz:
http://arxiv.org/abs/2108.02096
Publikováno v:
In Teaching and Teacher Education March 2023 123
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Publikováno v:
In Studies in Educational Evaluation September 2022 74