Variable Weighting in PCA-Guided k-Means and its Connection with Information Summarization
Autor: | Akira Notsu, Hidetomo Ichihashi, Katsuhiro Honda |
---|---|
Rok vydání: | 2011 |
Předmět: |
Computer science
business.industry Computer Science::Information Retrieval k-means clustering Pattern recognition computer.software_genre Automatic summarization Weighting Connection (mathematics) Human-Computer Interaction Variable (computer science) Artificial Intelligence Principal component analysis Computer Vision and Pattern Recognition Artificial intelligence Data mining business computer |
Zdroj: | Journal of Advanced Computational Intelligence and Intelligent Informatics. 15:83-89 |
ISSN: | 1883-8014 1343-0130 |
DOI: | 10.20965/jaciii.2011.p0083 |
Popis: | In the present paper, a variable selection model in k-Means is proposed, in which a variable weighting mechanism is introduced to PCA-guided k-Means. Variable weights are estimated in a manner similar to FCM clustering, while the membership indicator is derived using a PCA-guided method, in which the principal component scores are calculated by considering the variable weights. The variable weights emphasize the variables that have meaningful cluster information in the calculation of the membership indicators, and the absolute responsibility of each variable is revealed by soft transition to possibilistic values. It is also shown that the variable weights are derived in a manner similar to variable selection for PCA, with the goal being information summarization. The characteristics of the proposed method are demonstrated in an application to document clustering. |
Databáze: | OpenAIRE |
Externí odkaz: |