Adaptive Dimensionality Adjustment for Online 'Principal Component Analysis'
Autor: | Wolfram Schenck, Nico Migenda, Ralf Möller |
---|---|
Rok vydání: | 2019 |
Předmět: |
Data stream
050101 languages & linguistics Computer science business.industry Dimensionality reduction 05 social sciences Big data Contrast (statistics) 02 engineering and technology computer.software_genre Set (abstract data type) Principal component analysis 0202 electrical engineering electronic engineering information engineering sort 020201 artificial intelligence & image processing 0501 psychology and cognitive sciences Data mining business computer Curse of dimensionality |
Zdroj: | Intelligent Data Engineering and Automated Learning – IDEAL 2019 ISBN: 9783030336066 IDEAL (1) |
Popis: | Many applications in the Industrial Internet of Things and Industry 4.0 rely on large amounts of data which are continuously generated. The exponential growth in available data and the resulting storage requirements are often underestimated bottlenecks. Therefore, efficient dimensionality reduction gets more attention and becomes more relevant. One of the most widely used techniques for dimensionality reduction is “Principal Component Analysis” (PCA). A novel algorithm to determine the optimal number of meaningful principal components on a data stream is proposed. The basic idea of the proposed algorithm is to optimize the dimensionality adjustment process by taking advantage of several “natural” PCA features. In contrast to the commonly used approach to start with a maximal set of principal components and apply some sort of stopping rule, the proposed algorithm starts with a minimal set of principal components and uses a linear regression model in the natural logarithmic scale to approximate the remaining components. An experimental study is presented to demonstrate the successful application of the algorithm to noisy synthetic and real world data sets. |
Databáze: | OpenAIRE |
Externí odkaz: |