Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Iranmanesh, Reihaneh"'
Autor:
Gibbs, Tom, Kosak-Hine, Ethan, Ingebretsen, George, Zhang, Jason, Broomfield, Julius, Pieri, Sara, Iranmanesh, Reihaneh, Rabbany, Reihaneh, Pelrine, Kellin
Large language models (LLMs) are improving at an exceptional rate. However, these models are still susceptible to jailbreak attacks, which are becoming increasingly dangerous as models become increasingly powerful. In this work, we introduce a datase
Externí odkaz:
http://arxiv.org/abs/2409.00137