Showing 1 - 1 of 1 for search: '"Lee, Bingze"'
Aligned Large Language Models (LLMs) have demonstrated remarkable performance across various tasks. However, LLMs remain susceptible to jailbreak adversarial attacks, where adversaries manipulate prompts to elicit malicious responses that aligned LLMs would otherwise refuse to generate.
External link:
http://arxiv.org/abs/2410.15362