Zobrazeno 1 - 3
of 3
pro vyhledávání: '"Ghilardi, Davide"'
Autor:
Doumbouya, Moussa Koulako Bala, Nandi, Ananjan, Poesia, Gabriel, Ghilardi, Davide, Goldie, Anna, Bianchi, Federico, Jurafsky, Dan, Manning, Christopher D.
The safety of Large Language Models (LLMs) remains a critical concern due to a lack of adequate benchmarks for systematically evaluating their ability to resist generating harmful content. Previous efforts towards automated red teaming involve static
Externí odkaz:
http://arxiv.org/abs/2408.04811
Publikováno v:
In Applied Mathematical Modelling May 2024 129:191-206
Publikováno v:
In Applied Mathematical Modelling January 2020 77 Part 2:1881-1893