Automatic Correction of Arabic Dyslexic Text

Autor: Maha M. Alamri, William J. Teahan
Jazyk: angličtina
Rok vydání: 2019
Předmět:
Zdroj: Computers, Vol 8, Iss 1, p 19 (2019)
Druh dokumentu: article
ISSN: 2073-431X
DOI: 10.3390/computers8010019
Popis: This paper proposes an automatic correction system that detects and corrects dyslexic errors in Arabic text. The system uses a language model based on the Prediction by Partial Matching (PPM) text compression scheme that generates possible alternatives for each misspelled word. Furthermore, the generated candidate list is based on edit operations (insertion, deletion, substitution and transposition), and the correct alternative for each misspelled word is chosen on the basis of the compression codelength of the trigram. The system is compared with widely-used Arabic word processing software and the Farasa tool. The system provided good results compared with the other tools, with a recall of 43%, precision 89%, F1 58% and accuracy 81%.
Databáze: Directory of Open Access Journals