Lexicon Splitting In Lexical Disambiguation For Malay Morphological Analysis And Stemming

Autor: Mohd Yunus Sharum, Zaitul Azma Zainon Hamzah, Nasir Sulaiman, Masrah Azrifah Azmi Murad, Muhamad Taufik Abdullah
Rok vydání: 2013
Předmět:
Zdroj: Journal of Next Generation Information Technology. 4:9-15
ISSN: 2233-9388
2092-8637
DOI: 10.4156/jnit.vol4.issue5.2
Popis: Lexical ambiguity is one of the problems faced by morphological analyser and stemmer. It is caused by ambiguous word form like homonym, which could direct the tools to produce incorrect output. Thus a method that can resolve ambiguity may improve the performance of such tools. Malay word affixation differentiates between monosyllable and multisyllable word. A disambiguation method is proposed for tools that use lexicon for analysis and stemming, by splitting the lexicon into monosyllable and multisyllable words. We found that this feature could help to resolve ambiguity involving monosyllable words, improve language’s exception handling and improve storage lookup.This would be useful for Malay morphological analysis and stemming as this method does not require document-level context analysis of the analysed word.
Databáze: OpenAIRE