On State-of-the-art of POS Tagger, ‘Sandhi’ Splitter, ‘Alankaar’ Finder and ‘Samaas’ Finder for Indo-Aryan and Dravidian Languages
Autor: | Hema Gaikwad, R Jatinderkumar |
---|---|
Rok vydání: | 2021 |
Předmět: |
General Computer Science
Indo aryan business.industry Computer science Dravidian languages 020206 networking & telecommunications Rule-based system 02 engineering and technology computer.software_genre Part of speech Sandhi Splitter 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing State (computer science) Artificial intelligence business Hidden Markov model computer Natural language processing |
Zdroj: | Web of Science |
ISSN: | 2156-5570 2158-107X |
DOI: | 10.14569/ijacsa.2021.0120455 |
Popis: | Computational Linguistic refers to the development of the computer systems that deal with human languages. In this paper, different Computational Linguistic Techniques such as Parts of Speech (POS) tagger, ‘Sandhi’ Splitter, ‘Alankaar’ Finder and ‘Samaas’ Finder were considered. After a thorough literature review, it was found that fifteen techniques were used for POS tagging, nine techniques were used for ‘Sandhi’ splitting, one work is done for ‘Alankaar’ finder and absolutely no techniques are available for ‘Samaas’ finder for the Indo-Aryan as well as Dravidian languages. Analysis shows that Rule Based Approach (RBA) and Hidden Markov Model (HMM) are frequently used for POS tagging, RBA is most frequently used for ‘Sandhi’ Splitter, the general Human Intelligence (HI) is used for ‘Alankaar’ Finder and no ‘Samaas’ finder technique is available for any Indian language. |
Databáze: | OpenAIRE |
Externí odkaz: |