Semi-wrapper feature subset selector for feed-forward neural networks: Applications to binary and multi-class classification problems

Autor:	José C. Riquelme, Antonio J. Tallón-Ballesteros, Roberto Ruiz
Rok vydání:	2019
Předmět:	0209 industrial biotechnology Fitness function Artificial neural network business.industry Computer science Cognitive Neuroscience Pattern recognition Feature selection 02 engineering and technology Perceptron Computer Science Applications Multiclass classification Naive Bayes classifier ComputingMethodologies_PATTERNRECOGNITION 020901 industrial engineering & automation Knowledge extraction Ranking Artificial Intelligence 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Artificial intelligence business Classifier (UML)
Zdroj:	Neurocomputing. 353:28-44
ISSN:	0925-2312
DOI:	10.1016/j.neucom.2018.05.133
Popis:	This paper explores widely the data preparation stage within the process of knowledge discovery and data mining via feature subset selection in the context of two very well-known neural models: radial basis function neural networks and multi-layer perceptron. It is known the best performance of wrapper attribute selection methods based on the evaluation measure provided by a classifier, although the temporal complexity of learning neural networks practically precludes the use of wrapper techniques, especially in complex databases with high dimensionality and a large number of labels. In this paper, we propose the use of the Naive Bayes classifier as a fitness function within a semi-wrapper feature selection approach. The Naive Bayes classifier is a good fast approach to a neural network and utilising it as a measure of goodness in a backward search on a ranking provides a specific attribute selection method for neural networks in complex data. The test-bed consists of 34 binary and multi-class classification problems and 7 feature selectors. Of these, there are 6 data sets with upwards of 5 classes. According to the reported accuracy results that have been supported by non-parametric statistical tests in different scenarios, our method has been shown to be very suitable for both kinds of neural networks. Moreover, the reduced feature-space is around 20% of the full attribute space. The speedup with the aforementioned semi-wrapper is very outstanding and its value fluctuates, on average, from about 1.5 with radial basis function neural networks to around 30 with multi-layer perceptron.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::87d4ca084fdda5e93bac9f513f4e02a8 https://doi.org/10.1016/j.neucom.2018.05.133 Zobrazit plný text záznamu Full Text from ScienceDirect