Semi-wrapper feature subset selector for feed-forward neural networks: Applications to binary and multi-class classification problems
Autor: | José C. Riquelme, Antonio J. Tallón-Ballesteros, Roberto Ruiz |
---|---|
Rok vydání: | 2019 |
Předmět: |
0209 industrial biotechnology
Fitness function Artificial neural network business.industry Computer science Cognitive Neuroscience Pattern recognition Feature selection 02 engineering and technology Perceptron Computer Science Applications Multiclass classification Naive Bayes classifier ComputingMethodologies_PATTERNRECOGNITION 020901 industrial engineering & automation Knowledge extraction Ranking Artificial Intelligence 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Artificial intelligence business Classifier (UML) |
Zdroj: | Neurocomputing. 353:28-44 |
ISSN: | 0925-2312 |
Popis: | This paper explores widely the data preparation stage within the process of knowledge discovery and data mining via feature subset selection in the context of two very well-known neural models: radial basis function neural networks and multi-layer perceptron. It is known the best performance of wrapper attribute selection methods based on the evaluation measure provided by a classifier, although the temporal complexity of learning neural networks practically precludes the use of wrapper techniques, especially in complex databases with high dimensionality and a large number of labels. In this paper, we propose the use of the Naive Bayes classifier as a fitness function within a semi-wrapper feature selection approach. The Naive Bayes classifier is a good fast approach to a neural network and utilising it as a measure of goodness in a backward search on a ranking provides a specific attribute selection method for neural networks in complex data. The test-bed consists of 34 binary and multi-class classification problems and 7 feature selectors. Of these, there are 6 data sets with upwards of 5 classes. According to the reported accuracy results that have been supported by non-parametric statistical tests in different scenarios, our method has been shown to be very suitable for both kinds of neural networks. Moreover, the reduced feature-space is around 20% of the full attribute space. The speedup with the aforementioned semi-wrapper is very outstanding and its value fluctuates, on average, from about 1.5 with radial basis function neural networks to around 30 with multi-layer perceptron. |
Databáze: | OpenAIRE |
Externí odkaz: |