Feature selection by optimizing a lower bound of conditional mutual information
Autor: | Yong Fan, Hanyang Peng |
---|---|
Rok vydání: | 2017 |
Předmět: |
Information Systems and Management
Feature selection 02 engineering and technology Information theory computer.software_genre Article Theoretical Computer Science Artificial Intelligence Robustness (computer science) Component (UML) 0202 electrical engineering electronic engineering information engineering Minimum redundancy feature selection Mathematics business.industry Conditional mutual information 05 social sciences 050301 education Pattern recognition Computer Science Applications Control and Systems Engineering Feature (computer vision) Metric (mathematics) 020201 artificial intelligence & image processing Data mining Artificial intelligence business 0503 education computer Software |
Zdroj: | Information Sciences. :652-667 |
ISSN: | 0020-0255 |
Popis: | A unified framework is proposed to select features by optimizing computationally feasible approximations of high-dimensional conditional mutual information (CMI) between features and their associated class label under different assumptions. Under this unified framework, state-of-the-art information theory based feature selection algorithms are rederived, and a new algorithm is proposed to select features by optimizing a lower bound of the CMI with a weaker assumption than those adopted by existing methods. The new feature selection method integrates a plug-in component to distinguish redundant features from irrelevant ones for improving the feature selection robustness. Furthermore, a novel metric is proposed to evaluate feature selection methods based on simulated data. The proposed method has been compared with state-of-the-art feature selection methods based on the new evaluation metric and classification performance of classifiers built upon the selected features. The experiment results have demonstrated that the proposed method could achieve promising performance in a variety of feature selection problems. |
Databáze: | OpenAIRE |
Externí odkaz: |