An Ensemble Classification Method for High-Dimensional Data Using Neighborhood Rough Set
Autor: | Jing Zhang, Jia-quan Li, Guang Lu, Chuanwen Li |
---|---|
Rok vydání: | 2021 |
Předmět: |
Clustering high-dimensional data
Multidisciplinary Article Subject General Computer Science Ensemble forecasting Computer science business.industry Intersection (set theory) Pattern recognition Feature selection QA75.5-76.95 ComputingMethodologies_PATTERNRECOGNITION Feature (computer vision) Electronic computers. Computer science Classifier (linguistics) Artificial intelligence Rough set Cluster analysis business |
Zdroj: | Complexity, Vol 2021 (2021) |
ISSN: | 1099-0526 1076-2787 |
DOI: | 10.1155/2021/8358921 |
Popis: | Mining useful knowledge from high-dimensional data is a hot research topic. Efficient and effective sample classification and feature selection are challenging tasks due to high dimensionality and small sample size of microarray data. Feature selection is necessary in the process of constructing the model to reduce time and space consumption. Therefore, a feature selection model based on prior knowledge and rough set is proposed. Pathway knowledge is used to select feature subsets, and rough set based on intersection neighborhood is then used to select important feature in each subset, since it can select features without redundancy and deals with numerical features directly. In order to improve the diversity among base classifiers and the efficiency of classification, it is necessary to select part of base classifiers. Classifiers are grouped into several clusters by k-means clustering using the proposed combination distance of Kappa-based diversity and accuracy. The base classifier with the best classification performance in each cluster will be selected to generate the final ensemble model. Experimental results on three Arabidopsis thaliana stress response datasets showed that the proposed method achieved better classification performance than existing ensemble models. |
Databáze: | OpenAIRE |
Externí odkaz: |