Showing 1 - 1 of 1 for search: '"Blum, Carter Wood"'
Authors:
Gupta, Ashim; Blum, Carter Wood; Choji, Temma; Fei, Yingjie; Shah, Shalin; Vempala, Alakananda; Srikumar, Vivek
Can language models transform inputs to protect text classifiers against adversarial attacks? In this work, we present ATINTER, a model that intercepts and learns to rewrite adversarial inputs to make them non-adversarial for a downstream text classi
External link:
http://arxiv.org/abs/2305.16444