Feature Selection Based on Wrapper and Information Gain

Autor:	Abdulhameed N. Hameed, Mohammad A. Thanoon, Mohammad J.M Zedan
Rok vydání:	2019
Předmět:	Naive Bayes classifier Computer science Classifier (linguistics) Decision tree Process (computing) Feature selection Selection method Data mining High dimensionality Information gain computer.software_genre computer
Zdroj:	2019 1st AL-Noor International Conference for Science and Technology (NICST).
DOI:	10.1109/nicst49484.2019.9043805
Popis:	The accuracy of the classification process always suffers from the high dimensionality problem due to the independent, irrelevant, redundant and not useful attributes of the dataset. In this research, feature selection techniques (wrapper selection method, and information gain method) are obtained to handle the mentioned problem by removing those features and reducing the dataset dimensions. The techniques include wrapper selection method and information gain method. This research predicates on the diabetes dataset in WEKA application, which contains checking seven models enforce wrapper selection method as an attribute evaluator, forwarding direction, backward, and bi-directional best-first search method and Naive Bayes technique as a classifier method, checking eight models applying information gain method as attribute evaluator, as well as the ranker as a search method. Additional to demonstrate the decision tree and classification figures for the best-obtained models in each one technique. The results proved the ability of wrapper and information gain to choose a minimum number of features in order to classify the data with an accuracy of more than 76% in this work.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::143dbde5616a5c7d34bb4af7fb6eaddd https://doi.org/10.1109/nicst49484.2019.9043805 Zobrazit plný text záznamu