Performance Evaluation of n-Grams Ratio Features in Solving Multi-Classes Classification Problems
Autor: | Irvanizam, Ahmad Zuhri Ramadhan, Nur Ratna Sari, Taufik Fuadi Abidin, Razief Perucha Fauzie Afidh |
---|---|
Rok vydání: | 2018 |
Předmět: |
021110 strategic
defence & security studies 010504 meteorology & atmospheric sciences Computer science business.industry 0211 other engineering and technologies 02 engineering and technology Machine learning computer.software_genre 01 natural sciences people.cause_of_death Flooding (computer networking) Support vector machine ComputingMethodologies_PATTERNRECOGNITION Binary classification Aviation accident Artificial intelligence Natural disaster business people computer 0105 earth and related environmental sciences |
Zdroj: | 2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE). |
DOI: | 10.1109/iciteed.2018.8534773 |
Popis: | We present experimental results that compare k-Nearest Neighbors (k-NN) and Support Vector Machines (SVM) algorithms to classifythe natural disasters multi-classes problem when n-grams ratio is used as the numerical features and compare three SVM approaches to classify the transportation accidents multi-classes problem when the same n-grams ratio is used as the features. In the former problem, we would like to investigate which of the two prominent algorithms have a better accuracy, while in thelatter problem, we would like to compare which of the three well-known SVM approaches for solving multi-classes problem performs best.In the natural disasters problem, the class labels are earthquakes, volcanic eruptions, flooding, landslides, and others, while in the transportation accidents problem, the categories are traffic collisions, maritime accidents, aviation accidents, and others. n-grams dictionaries of each category are used as the references in creating numerical features of the news articles. The results show that for the natural disasters problem, k-NN performs better than SVM and for the transportation accidents problem, DAGSVMoutperforms the other two SVM binary classification approaches. |
Databáze: | OpenAIRE |
Externí odkaz: |