Showing 1 - 10 of 30 for search: '"hÉigeartaigh, Seán Ó"'
Authors:
Anwar, Usman, Saparov, Abulhair, Rando, Javier, Paleka, Daniel, Turpin, Miles, Hase, Peter, Lubana, Ekdeep Singh, Jenner, Erik, Casper, Stephen, Sourbut, Oliver, Edelman, Benjamin L., Zhang, Zhaowei, Günther, Mario, Korinek, Anton, Hernandez-Orallo, Jose, Hammond, Lewis, Bigelow, Eric, Pan, Alexander, Langosco, Lauro, Korbak, Tomasz, Zhang, Heidi, Zhong, Ruiqi, hÉigeartaigh, Seán Ó, Recchia, Gabriel, Corsi, Giulio, Chan, Alan, Anderljung, Markus, Edwards, Lilian, Petrov, Aleksandar, de Witt, Christian Schroeder, Motwan, Sumeet Ramesh, Bengio, Yoshua, Chen, Danqi, Torr, Philip H. S., Albanie, Samuel, Maharaj, Tegan, Foerster, Jakob, Tramer, Florian, He, He, Kasirzadeh, Atoosa, Choi, Yejin, Krueger, David
This work identifies 18 foundational challenges in assuring the alignment and safety of large language models (LLMs). These challenges are organized into three different categories: scientific understanding of LLMs, development and deployment methods
External link:
http://arxiv.org/abs/2404.09932
AI alignment work is important from both a commercial and a safety lens. With this paper, we aim to help actors who support alignment efforts to make these efforts as effective as possible, and to avoid potential adverse effects. We begin by suggesti
External link:
http://arxiv.org/abs/2312.08039
Authors:
Zhou, Lexin, Moreno-Casares, Pablo A., Martínez-Plumed, Fernando, Burden, John, Burnell, Ryan, Cheke, Lucy, Ferri, Cèsar, Marcoci, Alexandru, Mehrbakhsh, Behzad, Moros-Daval, Yael, hÉigeartaigh, Seán Ó, Rutar, Danaja, Schellaert, Wout, Voudouris, Konstantinos, Hernández-Orallo, José
We introduce the fundamental ideas and challenges of Predictable AI, a nascent research area that explores the ways in which we can anticipate key indicators of present and future AI ecosystems. We argue that achieving predictability is crucial for f
External link:
http://arxiv.org/abs/2310.06167
Concerns around future dangers from advanced AI often centre on systems hypothesised to have intrinsic characteristics such as agent-like behaviour, strategic awareness, and long-range planning. We label this cluster of characteristics as "Property X
External link:
http://arxiv.org/abs/2310.05876
Authors:
Seger, Elizabeth, Dreksler, Noemi, Moulange, Richard, Dardaman, Emily, Schuett, Jonas, Wei, K., Winter, Christoph, Arnold, Mackenzie, hÉigeartaigh, Seán Ó, Korinek, Anton, Anderljung, Markus, Bucknall, Ben, Chan, Alan, Stafford, Eoghan, Koessler, Leonie, Ovadya, Aviv, Garfinkel, Ben, Bluemke, Emma, Aird, Michael, Levermore, Patrick, Hazell, Julian, Gupta, Abhishek
Recent decisions by leading AI labs to either open-source their models or to restrict access to their models have sparked debate about whether, and how, increasingly capable AI models should be shared. Open-sourcing in AI typically refers to making mo
External link:
http://arxiv.org/abs/2311.09227
Authors:
Trager, Robert, Harack, Ben, Reuel, Anka, Carnegie, Allison, Heim, Lennart, Ho, Lewis, Kreps, Sarah, Lall, Ranjit, Larter, Owen, hÉigeartaigh, Seán Ó, Staffell, Simon, Villalobos, José Jaime
This report describes trade-offs in the design of international governance arrangements for civilian artificial intelligence (AI) and presents one approach in detail. This approach represents the extension of a standards, licensing, and liability reg
External link:
http://arxiv.org/abs/2308.15514
Authors:
Brundage, Miles, Avin, Shahar, Wang, Jasmine, Belfield, Haydn, Krueger, Gretchen, Hadfield, Gillian, Khlaaf, Heidy, Yang, Jingying, Toner, Helen, Fong, Ruth, Maharaj, Tegan, Koh, Pang Wei, Hooker, Sara, Leung, Jade, Trask, Andrew, Bluemke, Emma, Lebensold, Jonathan, O'Keefe, Cullen, Koren, Mark, Ryffel, Théo, Rubinovitz, JB, Besiroglu, Tamay, Carugati, Federica, Clark, Jack, Eckersley, Peter, de Haas, Sarah, Johnson, Maritza, Laurie, Ben, Ingerman, Alex, Krawczuk, Igor, Askell, Amanda, Cammarota, Rosario, Lohn, Andrew, Krueger, David, Stix, Charlotte, Henderson, Peter, Graham, Logan, Prunkl, Carina, Martin, Bianca, Seger, Elizabeth, Zilberman, Noa, hÉigeartaigh, Seán Ó, Kroeger, Frens, Sastry, Girish, Kagan, Rebecca, Weller, Adrian, Tse, Brian, Barnes, Elizabeth, Dafoe, Allan, Scharre, Paul, Herbert-Voss, Ariel, Rasser, Martijn, Sodhani, Shagun, Flynn, Carrick, Gilbert, Thomas Krendl, Dyer, Lisa, Khan, Saif, Bengio, Yoshua, Anderljung, Markus
With the recent wave of progress in artificial intelligence (AI) has come a growing awareness of the large-scale impacts of AI systems, and recognition that existing regulations and norms in industry and academia are insufficient to ensure responsibl
External link:
http://arxiv.org/abs/2004.07213
Authors:
Martínez-Plumed, Fernando, Avin, Shahar, Brundage, Miles, Dafoe, Allan, hÉigeartaigh, Sean Ó, Hernández-Orallo, José
We reframe the analysis of progress in AI by incorporating into an overall framework both the task performance of a system, and the time and resource costs incurred in the development and deployment of the system. These costs include: data, expert kn
External link:
http://arxiv.org/abs/1806.00610
Authors:
Brundage, Miles, Avin, Shahar, Clark, Jack, Toner, Helen, Eckersley, Peter, Garfinkel, Ben, Dafoe, Allan, Scharre, Paul, Zeitzoff, Thomas, Filar, Bobby, Anderson, Hyrum, Roff, Heather, Allen, Gregory C., Steinhardt, Jacob, Flynn, Carrick, hÉigeartaigh, Seán Ó, Beard, Simon, Belfield, Haydn, Farquhar, Sebastian, Lyle, Clare, Crootof, Rebecca, Evans, Owain, Page, Michael, Bryson, Joanna, Yampolskiy, Roman, Amodei, Dario
This report surveys the landscape of potential security threats from malicious uses of AI, and proposes ways to better forecast, prevent, and mitigate these threats. After analyzing the ways in which AI may influence the threat landscape in the digit
External link:
http://arxiv.org/abs/1802.07228