Zobrazeno 1 - 3
of 3
pro vyhledávání: '"Hou, Betty Li"'
Large Language Models (LLMs) are able to provide assistance on a wide range of information-seeking tasks. However, model outputs may be misleading, whether unintentionally or in cases of intentional deception. We investigate the ability of LLMs to be
Externí odkaz:
http://arxiv.org/abs/2407.11789
Autor:
Rein, David, Hou, Betty Li, Stickland, Asa Cooper, Petty, Jackson, Pang, Richard Yuanzhe, Dirani, Julien, Michael, Julian, Bowman, Samuel R.
We present GPQA, a challenging dataset of 448 multiple-choice questions written by domain experts in biology, physics, and chemistry. We ensure that the questions are high-quality and extremely difficult: experts who have or are pursuing PhDs in the
Externí odkaz:
http://arxiv.org/abs/2311.12022
Autor:
Hou, Betty Li, Green, Brian Patrick
AI alignment considers how we can encode AI systems in a way that is compatible with human values. The normative side of this problem asks what moral values or principles, if any, we should encode in AI. To this end, we present a framework to conside
Externí odkaz:
http://arxiv.org/abs/2301.03740