A Heuristic Approach for Text Classification with Ontology: A Review
Autor: | Preeti Rathee, Kamalika Saha, Sanjay Kumar Malik, Rijul Singh Malik |
---|---|
Rok vydání: | 2021 |
Předmět: |
Stop words
Heuristic business.industry Computer science Feature extraction Intelligent decision support system Feature selection Ontology (information science) computer.software_genre Classifier (linguistics) Selection (linguistics) Artificial intelligence business computer Natural language processing |
Zdroj: | 2021 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS). |
DOI: | 10.1109/icccis51004.2021.9397061 |
Popis: | In day to day life, mass information is readily available. So, the motive and main concern of automatic text classifiers is to segregate the relevant documents and irrelevant documents. The automated text classifier organizes the clustered data automatically. In this paper, a method for automatically classifying text has been implemented, by pre-processing the data in hand and selecting the necessary features and then classifying the data with ontology. Here, two techniques for pre-processing have been used that are stop words removal and stemming. In this paper, when the words are stemmed, it’s necessary to remove the noise from the data, which has been implemented here using feature selection technique known as univariate selection and using tools named as PyDotPlus and Graphviz, followed by ontology-based classification of documents which can be implemented in future. |
Databáze: | OpenAIRE |
Externí odkaz: |