High-Performance Data Stream Mining by Means of Embedding Hidden Markov Model into Reproducing Kernel Hibert Spaces

Autor: Galyna Kriukova, Mykola Glybovets
Přispěvatelé: National University of Kyiv-Mohyla Academy
Rok vydání: 2018
Předmět:
Zdroj: 2018 IEEE Second International Conference on Data Stream Mining & Processing (DSMP).
DOI: 10.1109/dsmp.2018.8478571
Popis: Hidden Markov models (HMMs) are a well-known probabilistic graphical model for time series of discrete, partially observable stochastic processes. We consider method to extend the application of HMMs to non-Gaussian continuous distributions by embedding the belief about the state into a reproducing kernel Hilbert space (RKHS). Corresponding regularization techniques are proposed to reduce tendency to overfitting and computational complexity of algorithm, specifically, Nystrom subsampling for feature and kernel matrices and general ¨ regularization family. This method may be applied to various statistical inference and learning problems, including classification, clustering, prediction, identification, and as an online algorithm is may be used for dynamic data mining and data stream mining. We investigate, both theoretically and empirically, regularization and approximation bounds. Furthermore, we discuss applications of the method to real-world problems, comparing the approach to several state-of-the-art algorithms.
Databáze: OpenAIRE