Russian Texts Detoxification with Levenshtein Editing

Autor: Gusev, Ilya
Rok vydání: 2022
Předmět:
Druh dokumentu: Working Paper
Popis: Text detoxification is a style transfer task of creating neutral versions of toxic texts. In this paper, we use the concept of text editing to build a two-step tagging-based detoxification model using a parallel corpus of Russian texts. With this model, we achieved the best style transfer accuracy among all models in the RUSSE Detox shared task, surpassing larger sequence-to-sequence models.
Comment: Accepted to Dialogue 2022
Databáze: arXiv