Malayalam Spell Checker Using N-Gram Method
Autor: | C. Sunitha, P. H. Hema |
---|---|
Rok vydání: | 2015 |
Předmět: |
Programming language
business.industry Computer science Spell Construct (python library) computer.software_genre Lexicon language.human_language Task (project management) n-gram ComputingMethodologies_DOCUMENTANDTEXTPROCESSING Malayalam language Edit distance Artificial intelligence business computer Word (computer architecture) Natural language processing |
Zdroj: | Advances in Intelligent Systems and Computing ISBN: 9788132227328 |
Popis: | Spell checker is a software tool which can detect incorrectly spelled words in a text document. Developing spell checker for a morphologically rich language like Malayalam is really tedious task. This paper mainly discusses about the construction of spell checker for Malayalam language. Since in Malayalam, many words can be derived from root word, it will be impossible to include all the words in a lexicon. So a hybrid method of different techniques can improve the performance of a spell checker. The method explained is an n-gram based approach and which will be inexpensive to construct without deep linguistic knowledge. Along with n gram, a Minimum edit distance algorithm is also added to detect the errors due to addition, deletion, or interchange of letters in a word. This will improve the efficiency of the spell checker. This approach will be useful when less linguistic resources are available for Malayalam language. And also the performance analysis is done with an existing method of spell checking in Malayalam language. |
Databáze: | OpenAIRE |
Externí odkaz: |