An analysis of sentence level text classification for the Kannada language
Author: | Murthy K Srikanta, R Jayashree |
---|---|
Year of publication: | 2011 |
Subject: | Computer science, Word processing, Automatic summarization, Task (project management), Support vector machine, Naive Bayes classifier, Pattern recognition, Bag-of-words model, Question answering, Artificial intelligence, Natural language processing, Sentence |
Source: | SoCPaR |
DOI: | 10.1109/socpar.2011.6089130 |
Description: | With the rapid growth of the internet, a huge amount of data is available online, and drawing useful information from this digital data is challenging. Exploring and extracting information from native-language content available online is a particularly useful task. The work presented here focuses on sentence-level classification in the Kannada language. Two of the most popular approaches in text categorization, Naive Bayesian and Bag of Words (BOW), are used in this work. The Bag of Words approach is found to perform significantly better than the Naive Bayesian approach. The objective of the work is to find how sentence-level classification works for the Kannada language, as it can be extended further to sentiment classification, question answering, text summarization, and the analysis of customer reviews in Kannada blogs. Because most users' comments, queries, and opinions are expressed as sentences, sentence-level text classification becomes a special task within the text classification problem. Although the work presently focuses on very basic approaches, it can later be extended to other methods such as SVM and KNN. |
Database: | OpenAIRE |
External link: |
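
The description names Naive Bayesian classification over sentences but gives no implementation details. As a rough illustration only, the following is a minimal sketch of a multinomial Naive Bayes sentence classifier over bag-of-words counts. It is not the authors' method: the whitespace tokenizer, the Laplace smoothing parameter `alpha`, the class name `NaiveBayesSentenceClassifier`, and the English placeholder sentences and labels (standing in for Kannada data) are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict


def tokenize(sentence):
    # Assumption: plain whitespace tokenization. Real Kannada text would
    # likely need language-specific preprocessing not described here.
    return sentence.lower().split()


class NaiveBayesSentenceClassifier:
    """Hypothetical multinomial Naive Bayes over bag-of-words counts."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha                      # Laplace smoothing (assumed)
        self.word_counts = defaultdict(Counter)  # label -> word frequencies
        self.class_counts = Counter()            # label -> number of sentences
        self.vocab = set()

    def fit(self, sentences, labels):
        for sentence, label in zip(sentences, labels):
            tokens = tokenize(sentence)
            self.class_counts[label] += 1
            self.word_counts[label].update(tokens)
            self.vocab.update(tokens)
        return self

    def predict(self, sentence):
        # Words never seen in training are ignored.
        tokens = [t for t in tokenize(sentence) if t in self.vocab]
        total_sentences = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label in sorted(self.class_counts):
            total_words = sum(self.word_counts[label].values())
            denom = total_words + self.alpha * len(self.vocab)
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(self.class_counts[label] / total_sentences)
            for t in tokens:
                score += math.log((self.word_counts[label][t] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Placeholder training data (not from the paper).
train_sentences = [
    "cricket match score",
    "team won match",
    "election vote party",
    "party won election",
]
train_labels = ["sports", "sports", "politics", "politics"]

clf = NaiveBayesSentenceClassifier().fit(train_sentences, train_labels)
print(clf.predict("cricket score"))
print(clf.predict("vote party"))
```

Working in log space avoids numeric underflow when many word probabilities are multiplied, and the smoothing term keeps a single unseen word-label pair from zeroing out a class.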