New prior knowledge based extensions for stable feature selection
Autor: | Afef Ben Brahim, Mohamed Limam |
---|---|
Rok vydání: | 2014 |
Předmět: |
Clustering high-dimensional data
business.industry Dimensionality reduction Stability (learning theory) Feature selection Machine learning computer.software_genre Support vector machine Feature (computer vision) Minimum redundancy feature selection Data mining Artificial intelligence business computer Curse of dimensionality Mathematics |
Zdroj: | SoCPaR |
DOI: | 10.1109/socpar.2014.7008024 |
Popis: | In many data sets, there are only hundreds or fewer samples but thousands of features. The relatively small number of samples in high dimensional data results in modest classification performance and feature selection instability. In order to deal with the curse of dimensionality, we propose to investigate research on the effect of integrating background knowledge about some dimensions known to be more relevant, as a means of directing the feature selection process. We propose extensions of three feature selection techniques, two filters and a wrapper, by incorporating prior knowledge in the search procedure of the best features. We study the effect of these extensions on the classification performance and the stability of the feature selection. We experimentally test and compare our proposed approaches with their original versions, which do not integrate prior knowledge, over three high-dimensional datasets. The results show that our proposed techniques outperform other methods in terms of stability of feature selection but also in classification performance in most cases. |
Databáze: | OpenAIRE |
Externí odkaz: |