Zobrazeno 1 - 10
of 1 276
pro vyhledávání: '"Sellitto, P"'
Autor:
Voria, Gianmario, Sellitto, Giulia, Ferrara, Carmine, Abate, Francesco, De Lucia, Andrea, Ferrucci, Filomena, Catolino, Gemma, Palomba, Fabio
Machine learning's widespread adoption in decision-making processes raises concerns about fairness, particularly regarding the treatment of sensitive features and potential discrimination against minorities. The software engineering community has res
Externí odkaz:
http://arxiv.org/abs/2408.16683
Autor:
Hubinger, Evan, Denison, Carson, Mu, Jesse, Lambert, Mike, Tong, Meg, MacDiarmid, Monte, Lanham, Tamera, Ziegler, Daniel M., Maxwell, Tim, Cheng, Newton, Jermyn, Adam, Askell, Amanda, Radhakrishnan, Ansh, Anil, Cem, Duvenaud, David, Ganguli, Deep, Barez, Fazl, Clark, Jack, Ndousse, Kamal, Sachan, Kshitij, Sellitto, Michael, Sharma, Mrinank, DasSarma, Nova, Grosse, Roger, Kravec, Shauna, Bai, Yuntao, Witten, Zachary, Favaro, Marina, Brauner, Jan, Karnofsky, Holden, Christiano, Paul, Bowman, Samuel R., Graham, Logan, Kaplan, Jared, Mindermann, Sören, Greenblatt, Ryan, Shlegeris, Buck, Schiefer, Nicholas, Perez, Ethan
Humans are capable of strategically deceptive behavior: behaving helpfully in most situations, but then behaving very differently in order to pursue alternative objectives when given the opportunity. If an AI system learned such a deceptive strategy,
Externí odkaz:
http://arxiv.org/abs/2401.05566
Autor:
Gerardo Salvato, Claudio Bertolotti, Manuela Sellitto, Teresa Fazia, Damiano Crivelli, Gabriele De Maio, Francesca Giulia Magnani, Alessandra Leo, Tatiana Bianconi, Maria Chiara Cortesi, Michele Spinelli, Gabriella Bottini
Publikováno v:
Scientific Reports, Vol 14, Iss 1, Pp 1-9 (2024)
Summary Postural balance requires the interplay between several physiological signals. Indirect evidence suggests that the perception of signals arising from the autonomic nervous system might play a role (e.g. cardiac awareness). Here, we tested thi
Externí odkaz:
https://doaj.org/article/6dd5259e8d9f49afba881a68cc31a13a
Autor:
Shoker, Sarah, Reddie, Andrew, Barrington, Sarah, Booth, Ruby, Brundage, Miles, Chahal, Husanjot, Depp, Michael, Drexel, Bill, Gupta, Ritwik, Favaro, Marina, Hecla, Jake, Hickey, Alan, Konaev, Margarita, Kumar, Kirthi, Lambert, Nathan, Lohn, Andrew, O'Keefe, Cullen, Rajani, Nazneen, Sellitto, Michael, Trager, Robert, Walker, Leah, Wehsener, Alexa, Young, Jessica
Foundation models could eventually introduce several pathways for undermining state security: accidents, inadvertent escalation, unintentional conflict, the proliferation of weapons, and the interference with human diplomacy are just a few on a long
Externí odkaz:
http://arxiv.org/abs/2308.00862
Direct and indirect regulation of β-glucocerebrosidase by the transcription factors USF2 and ONECUT2
Autor:
Kathi Ging, Lukas Frick, Johannes Schlachetzki, Andrea Armani, Yanping Zhu, Pierre-André Gilormini, Ashutosh Dhingra, Desirée Böck, Ana Marques, Matthew Deen, Xi Chen, Tetiana Serdiuk, Chiara Trevisan, Stefano Sellitto, Claudio Pisano, Christopher K. Glass, Peter Heutink, Jiang-An Yin, David J. Vocadlo, Adriano Aguzzi
Publikováno v:
npj Parkinson's Disease, Vol 10, Iss 1, Pp 1-18 (2024)
Abstract Mutations in GBA1 encoding the lysosomal enzyme β-glucocerebrosidase (GCase) are among the most prevalent genetic susceptibility factors for Parkinson’s disease (PD), with 10–30% of carriers developing the disease. To identify genetic m
Externí odkaz:
https://doaj.org/article/4c0ae1d7d594437b8f6bf9199fd5b7f6
Autor:
Daniele Gennuso, Angela Baldelli, Loredana Gigli, Ilaria Ruotolo, Giovanni Galeoto, Daniela Gaburri, Giovanni Sellitto
Publikováno v:
BMC Cancer, Vol 24, Iss 1, Pp 1-23 (2024)
Abstract Background Patients with cancer (PwC) who undergo specific treatments reported greater fatigue and reduced functional capacity as predominant outcomes, compromising their QoL during and following the treatment. Prehabilitation intervention,
Externí odkaz:
https://doaj.org/article/62fae39039eb42fea617b2d314328134
Autor:
Ganguli, Deep, Askell, Amanda, Schiefer, Nicholas, Liao, Thomas I., Lukošiūtė, Kamilė, Chen, Anna, Goldie, Anna, Mirhoseini, Azalia, Olsson, Catherine, Hernandez, Danny, Drain, Dawn, Li, Dustin, Tran-Johnson, Eli, Perez, Ethan, Kernion, Jackson, Kerr, Jamie, Mueller, Jared, Landau, Joshua, Ndousse, Kamal, Nguyen, Karina, Lovitt, Liane, Sellitto, Michael, Elhage, Nelson, Mercado, Noemi, DasSarma, Nova, Rausch, Oliver, Lasenby, Robert, Larson, Robin, Ringer, Sam, Kundu, Sandipan, Kadavath, Saurav, Johnston, Scott, Kravec, Shauna, Showk, Sheer El, Lanham, Tamera, Telleen-Lawton, Timothy, Henighan, Tom, Hume, Tristan, Bai, Yuntao, Hatfield-Dodds, Zac, Mann, Ben, Amodei, Dario, Joseph, Nicholas, McCandlish, Sam, Brown, Tom, Olah, Christopher, Clark, Jack, Bowman, Samuel R., Kaplan, Jared
We test the hypothesis that language models trained with reinforcement learning from human feedback (RLHF) have the capability to "morally self-correct" -- to avoid producing harmful outputs -- if instructed to do so. We find strong evidence in suppo
Externí odkaz:
http://arxiv.org/abs/2302.07459
Autor:
Perez, Ethan, Ringer, Sam, Lukošiūtė, Kamilė, Nguyen, Karina, Chen, Edwin, Heiner, Scott, Pettit, Craig, Olsson, Catherine, Kundu, Sandipan, Kadavath, Saurav, Jones, Andy, Chen, Anna, Mann, Ben, Israel, Brian, Seethor, Bryan, McKinnon, Cameron, Olah, Christopher, Yan, Da, Amodei, Daniela, Amodei, Dario, Drain, Dawn, Li, Dustin, Tran-Johnson, Eli, Khundadze, Guro, Kernion, Jackson, Landis, James, Kerr, Jamie, Mueller, Jared, Hyun, Jeeyoon, Landau, Joshua, Ndousse, Kamal, Goldberg, Landon, Lovitt, Liane, Lucas, Martin, Sellitto, Michael, Zhang, Miranda, Kingsland, Neerav, Elhage, Nelson, Joseph, Nicholas, Mercado, Noemí, DasSarma, Nova, Rausch, Oliver, Larson, Robin, McCandlish, Sam, Johnston, Scott, Kravec, Shauna, Showk, Sheer El, Lanham, Tamera, Telleen-Lawton, Timothy, Brown, Tom, Henighan, Tom, Hume, Tristan, Bai, Yuntao, Hatfield-Dodds, Zac, Clark, Jack, Bowman, Samuel R., Askell, Amanda, Grosse, Roger, Hernandez, Danny, Ganguli, Deep, Hubinger, Evan, Schiefer, Nicholas, Kaplan, Jared
As language models (LMs) scale, they develop many novel behaviors, good and bad, exacerbating the need to evaluate how they behave. Prior work creates evaluations with crowdwork (which is time-consuming and expensive) or existing data sources (which
Externí odkaz:
http://arxiv.org/abs/2212.09251
Autor:
Bai, Yuntao, Kadavath, Saurav, Kundu, Sandipan, Askell, Amanda, Kernion, Jackson, Jones, Andy, Chen, Anna, Goldie, Anna, Mirhoseini, Azalia, McKinnon, Cameron, Chen, Carol, Olsson, Catherine, Olah, Christopher, Hernandez, Danny, Drain, Dawn, Ganguli, Deep, Li, Dustin, Tran-Johnson, Eli, Perez, Ethan, Kerr, Jamie, Mueller, Jared, Ladish, Jeffrey, Landau, Joshua, Ndousse, Kamal, Lukosuite, Kamile, Lovitt, Liane, Sellitto, Michael, Elhage, Nelson, Schiefer, Nicholas, Mercado, Noemi, DasSarma, Nova, Lasenby, Robert, Larson, Robin, Ringer, Sam, Johnston, Scott, Kravec, Shauna, Showk, Sheer El, Fort, Stanislav, Lanham, Tamera, Telleen-Lawton, Timothy, Conerly, Tom, Henighan, Tom, Hume, Tristan, Bowman, Samuel R., Hatfield-Dodds, Zac, Mann, Ben, Amodei, Dario, Joseph, Nicholas, McCandlish, Sam, Brown, Tom, Kaplan, Jared
As AI systems become more capable, we would like to enlist their help to supervise other AIs. We experiment with methods for training a harmless AI assistant through self-improvement, without any human labels identifying harmful outputs. The only hum
Externí odkaz:
http://arxiv.org/abs/2212.08073
Autor:
Vivian Lee, Nisha Vashi, Flora Roudbarani, Paula Tablon Modica, Ava Pouyandeh, Teresa Sellitto, Alaa Ibrahim, Stephanie H. Ameis, Alex Elkader, Kylie M. Gray, Connor M. Kerns, Meng-Chuan Lai, Johanna Lake, Kendra Thomson, Jonathan A. Weiss
Publikováno v:
BMC Health Services Research, Vol 24, Iss 1, Pp 1-11 (2024)
Abstract Background Autistic children often experience socioemotional difficulties relating to emotion regulation and mental health problems. Supports for autistic children involve the use of adapted interventions that target emotion regulation and s
Externí odkaz:
https://doaj.org/article/564eb4bee5ab419e9056850e7f626105