Lexicon Splitting In Lexical Disambiguation For Malay Morphological Analysis And Stemming
Autor: | Mohd Yunus Sharum, Zaitul Azma Zainon Hamzah, Nasir Sulaiman, Masrah Azrifah Azmi Murad, Muhamad Taufik Abdullah |
---|---|
Rok vydání: | 2013 |
Předmět: |
General Computer Science
business.industry Computer science media_common.quotation_subject Speech recognition Exception handling Ambiguity Lexicon computer.software_genre language.human_language Homonym Feature (linguistics) Context analysis language Artificial intelligence business computer Word (computer architecture) Natural language processing Malay media_common |
Zdroj: | Journal of Next Generation Information Technology. 4:9-15 |
ISSN: | 2233-9388 2092-8637 |
DOI: | 10.4156/jnit.vol4.issue5.2 |
Popis: | Lexical ambiguity is one of the problems faced by morphological analyser and stemmer. It is caused by ambiguous word form like homonym, which could direct the tools to produce incorrect output. Thus a method that can resolve ambiguity may improve the performance of such tools. Malay word affixation differentiates between monosyllable and multisyllable word. A disambiguation method is proposed for tools that use lexicon for analysis and stemming, by splitting the lexicon into monosyllable and multisyllable words. We found that this feature could help to resolve ambiguity involving monosyllable words, improve language’s exception handling and improve storage lookup.This would be useful for Malay morphological analysis and stemming as this method does not require document-level context analysis of the analysed word. |
Databáze: | OpenAIRE |
Externí odkaz: |