Authors:
Pacchiardi, Lorenzo; Chan, Alex J.; Mindermann, Sören; Moscovitz, Ilan; Pan, Alexa Y.; Gal, Yarin; Evans, Owain; Brauner, Jan
Large language models (LLMs) can "lie", which we define as outputting false statements despite "knowing" the truth in a demonstrable sense. LLMs might "lie", for example, when instructed to output misinformation. Here, we develop a simple lie detecto
External link:
http://arxiv.org/abs/2309.15840