Automated Spelling Correction for Dutch Internet Users with Intellectual Disabilities

Authors: Sevens, Leen; Vanallemeersch, Tom; Schuurman, Ineke; Vandeghinste, Vincent; Van Eynde, Frank
Contributors: Schuurman, Ineke; Vandeghinste, Vincent; Saggion, Horacio
Language: English
Year of publication: 2016
Subject:
Description: We present the first version of an automated spelling correction system for Dutch Internet users with Intellectual Disabilities (ID). The normalization of ill-formed messages is an important preprocessing step before any conventional Natural Language Processing (NLP) process can be applied. As such, we describe the effects of automated correction of Dutch ID text within the larger framework of a Text-to-Pictograph translation system. The present study consists of two main parts. First, we thoroughly analyze email messages written by users with cognitive disabilities in order to gain insight into how to develop solutions that are specifically tailored to their needs. We then present a new, generally applicable approach to context-sensitive spelling correction, based on character-level fuzzy matching techniques. The resulting system shows significant improvements, although further research is needed.
ISSN: none
Is part of: Proceedings of the 1st Workshop on Improving Social Inclusion using NLP: Tools and Resources (ISI-NLP), pages 11-19
Location: Portorož, Slovenia
Date: 23 May 2016
Status: published
Database: OpenAIRE