On State-of-the-art of POS Tagger, ‘Sandhi’ Splitter, ‘Alankaar’ Finder and ‘Samaas’ Finder for Indo-Aryan and Dravidian Languages

Author:	Hema Gaikwad, R Jatinderkumar
Year of publication:	2021
Subject:	General Computer Science; Indo aryan; business.industry; Computer science; Dravidian languages; 020206 networking & telecommunications; Rule-based system; 02 engineering and technology; computer.software_genre; Part of speech; Sandhi; Splitter; 0202 electrical engineering, electronic engineering, information engineering; 020201 artificial intelligence & image processing; State (computer science); Artificial intelligence; business; Hidden Markov model; computer; Natural language processing
Source:	Web of Science
ISSN:	2156-5570 2158-107X
DOI:	10.14569/ijacsa.2021.0120455
Description:	Computational linguistics refers to the development of computer systems that deal with human languages. This paper considers four computational linguistic techniques: the Parts of Speech (POS) tagger, the ‘Sandhi’ Splitter, the ‘Alankaar’ Finder and the ‘Samaas’ Finder. A thorough literature review found that fifteen techniques have been used for POS tagging and nine for ‘Sandhi’ splitting, that only one work exists on an ‘Alankaar’ finder, and that no techniques are available for a ‘Samaas’ finder in either the Indo-Aryan or the Dravidian languages. The analysis shows that the Rule Based Approach (RBA) and the Hidden Markov Model (HMM) are frequently used for POS tagging, that RBA is the approach most frequently used for ‘Sandhi’ splitting, that general Human Intelligence (HI) is used for the ‘Alankaar’ finder, and that no ‘Samaas’ finder technique is available for any Indian language.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::38aa32be944755f852bf70b1cff4af01 https://doi.org/10.14569/ijacsa.2021.0120455