Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Flores, Juan Arturo Nolazco"'
Autor:
Bethany, Emet, Bethany, Mazal, Flores, Juan Arturo Nolazco, Jha, Sumit Kumar, Najafirad, Peyman
Recent advancements in AI safety have led to increased efforts in training and red-teaming large language models (LLMs) to mitigate unsafe content generation. However, these safety mechanisms may not be comprehensive, leaving potential vulnerabilitie
Externí odkaz:
http://arxiv.org/abs/2409.11445