Comparison of attribute selection techniques and algorithms in classifying bad behaviors of vocational education students
Autor: | Anongnart Srivihok, Sukontip Wongpun |
---|---|
Rok vydání: | 2008 |
Předmět: |
Computer science
business.industry Bayesian network Feature selection Pattern recognition Machine learning computer.software_genre Multiclass classification Naive Bayes classifier Statistical classification C4.5 algorithm Classification rule One-class classification Artificial intelligence business Algorithm computer |
Zdroj: | 2008 2nd IEEE International Conference on Digital Ecosystems and Technologies. |
DOI: | 10.1109/dest.2008.4635213 |
Popis: | This study presents the comparison of attribute selection techniques which used for classifying the bad behaviors of vocational education students. There are two classification methods: hybrid classification and single classification. Hybrid classification includes two steps, step one is attribute selection by search method using genetic search and results are compared by three evaluators: 1) Correlation-based Feature Selection (CFS) 2) Consistency-based Subset Evaluation and 3) Wrapper Subset Evaluation. Step two is the classification of data set by using selected attributed from step one and four classification algorithms. Next, Simple classification used classification algorithms only without attribute selection. The four classification algorithms that used in this experiment for comparing in two methods are : 1) Naive Bayes classifier 2) Baysian Belief Network 3) C4.5 algorithm and 4) RIPPER algorithm. The measurements of classification efficiency had been obtained by using the k-fold cross validation technique. From the experiment, it was found that hybrid classification technique using genetic search and CFS evaluator with C4.5 algorithm, gives the highest accuracy rate at 82.52%. However, results from F-measure evaluation showed that C4.5 algorithm did not fit for all data types. The hybrid classification technique using genetic search and wrapper subset with Baysian belief network can give a better precision value which can be seen in the F-measure, and it gives the accuracy rate at 82.42%. |
Databáze: | OpenAIRE |
Externí odkaz: |