Showing 1 - 10 of 37 for the search: '"ZIEGLER, DANIEL M."'
Author:
Hubinger, Evan, Denison, Carson, Mu, Jesse, Lambert, Mike, Tong, Meg, MacDiarmid, Monte, Lanham, Tamera, Ziegler, Daniel M., Maxwell, Tim, Cheng, Newton, Jermyn, Adam, Askell, Amanda, Radhakrishnan, Ansh, Anil, Cem, Duvenaud, David, Ganguli, Deep, Barez, Fazl, Clark, Jack, Ndousse, Kamal, Sachan, Kshitij, Sellitto, Michael, Sharma, Mrinank, DasSarma, Nova, Grosse, Roger, Kravec, Shauna, Bai, Yuntao, Witten, Zachary, Favaro, Marina, Brauner, Jan, Karnofsky, Holden, Christiano, Paul, Bowman, Samuel R., Graham, Logan, Kaplan, Jared, Mindermann, Sören, Greenblatt, Ryan, Shlegeris, Buck, Schiefer, Nicholas, Perez, Ethan
Humans are capable of strategically deceptive behavior: behaving helpfully in most situations, but then behaving very differently in order to pursue alternative objectives when given the opportunity. If an AI system learned such a deceptive strategy, …
External link:
http://arxiv.org/abs/2401.05566
Author:
Ziegler, Daniel M., Nix, Seraphina, Chan, Lawrence, Bauman, Tim, Schmidt-Nielsen, Peter, Lin, Tao, Scherlis, Adam, Nabeshima, Noa, Weinstein-Raun, Ben, de Haas, Daniel, Shlegeris, Buck, Thomas, Nate
In the future, powerful AI systems may be deployed in high-stakes settings, where a single failure could be catastrophic. One technique for improving AI safety in high-stakes settings is adversarial training, which uses an adversary to generate examples …
External link:
http://arxiv.org/abs/2205.01663
Author:
Wu, Jeff, Ouyang, Long, Ziegler, Daniel M., Stiennon, Nisan, Lowe, Ryan, Leike, Jan, Christiano, Paul
A major challenge for scaling machine learning is training models to perform tasks that are very difficult or time-consuming for humans to evaluate. We present progress on this problem on the task of abstractive summarization of entire fiction novels …
External link:
http://arxiv.org/abs/2109.10862
Author:
Henighan, Tom, Kaplan, Jared, Katz, Mor, Chen, Mark, Hesse, Christopher, Jackson, Jacob, Jun, Heewoo, Brown, Tom B., Dhariwal, Prafulla, Gray, Scott, Hallacy, Chris, Mann, Benjamin, Radford, Alec, Ramesh, Aditya, Ryder, Nick, Ziegler, Daniel M., Schulman, John, Amodei, Dario, McCandlish, Sam
We identify empirical scaling laws for the cross-entropy loss in four domains: generative image modeling, video modeling, multimodal image↔text models, and mathematical problem solving. In all cases autoregressive Transformers smoothly …
External link:
http://arxiv.org/abs/2010.14701
Author:
Stiennon, Nisan, Ouyang, Long, Wu, Jeff, Ziegler, Daniel M., Lowe, Ryan, Voss, Chelsea, Radford, Alec, Amodei, Dario, Christiano, Paul
As language models become more powerful, training and evaluation are increasingly bottlenecked by the data and metrics used for a particular task. For example, summarization models are often trained to predict human reference summaries and evaluated …
External link:
http://arxiv.org/abs/2009.01325
Author:
Brown, Tom B., Mann, Benjamin, Ryder, Nick, Subbiah, Melanie, Kaplan, Jared, Dhariwal, Prafulla, Neelakantan, Arvind, Shyam, Pranav, Sastry, Girish, Askell, Amanda, Agarwal, Sandhini, Herbert-Voss, Ariel, Krueger, Gretchen, Henighan, Tom, Child, Rewon, Ramesh, Aditya, Ziegler, Daniel M., Wu, Jeffrey, Winter, Clemens, Hesse, Christopher, Chen, Mark, Sigler, Eric, Litwin, Mateusz, Gray, Scott, Chess, Benjamin, Clark, Jack, Berner, Christopher, McCandlish, Sam, Radford, Alec, Sutskever, Ilya, Amodei, Dario
Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific …
External link:
http://arxiv.org/abs/2005.14165
Author:
Ziegler, Daniel M., Stiennon, Nisan, Wu, Jeffrey, Brown, Tom B., Radford, Alec, Amodei, Dario, Christiano, Paul, Irving, Geoffrey
Reward learning enables the application of reinforcement learning (RL) to tasks where reward is defined by human judgment, building a model of reward by asking humans questions. Most work on reward learning has used simulated environments, but complex …
External link:
http://arxiv.org/abs/1909.08593
Published in:
Proceedings of the National Academy of Sciences of the United States of America, 1999 Mar, 96(6), 2687-2691.
External link:
https://www.jstor.org/stable/47424
Author:
Ziegler, Daniel M., Ansher, Sherry S., Nagata, Toshiyuki, Kadlubar, Fred F., Jakoby, William B.
Published in:
Proceedings of the National Academy of Sciences of the United States of America, 1988 Apr, 85(8), 2514-2517.
External link:
https://www.jstor.org/stable/31464
Author:
Ziegler, Daniel M.
Published in:
Drug Metabolism Reviews, Aug 2002, Vol. 34, Issue 3, p. 503-511. 9 p.