Výsledky vyhledávání - "Hou, Betty Li"

Report

Large Language Models as Misleading Assistants in Conversation

Autor: Hou, Betty Li, Shi, Kejian, Phang, Jason, Aung, James, Adler, Steven, Campbell, Rosie

Large Language Models (LLMs) are able to provide assistance on a wide range of information-seeking tasks. However, model outputs may be misleading, whether unintentionally or in cases of intentional deception. We investigate the ability of LLMs to be

Externí odkaz: http://arxiv.org/abs/2407.11789

Zobrazit plný text záznamu

Report

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Autor: Rein, David, Hou, Betty Li, Stickland, Asa Cooper, Petty, Jackson, Pang, Richard Yuanzhe, Dirani, Julien, Michael, Julian, Bowman, Samuel R.

We present GPQA, a challenging dataset of 448 multiple-choice questions written by domain experts in biology, physics, and chemistry. We ensure that the questions are high-quality and extremely difficult: experts who have or are pursuing PhDs in the

Externí odkaz: http://arxiv.org/abs/2311.12022

Zobrazit plný text záznamu

Report

A Multi-Level Framework for the AI Alignment Problem

Autor: Hou, Betty Li, Green, Brian Patrick

AI alignment considers how we can encode AI systems in a way that is compatible with human values. The normative side of this problem asks what moral values or principles, if any, we should encode in AI. To this end, we present a framework to conside

Externí odkaz: http://arxiv.org/abs/2301.03740

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání