Basis-Based Speaker Adaptation Using Partitioned HMM Mean Parameters of Training Speaker Models

Autor:	Yongwon Jeong
Rok vydání:	2015
Předmět:	Computer science Maximum likelihood Adaptation (eye) 01 natural sciences Theoretical Computer Science 030507 speech-language pathology & audiology 03 medical and health sciences Dimension (vector space) 0103 physical sciences Maximum a posteriori estimation Hidden Markov model 010301 acoustics Basis (linear algebra) business.industry Pattern recognition Sample mean and sample covariance Hardware and Architecture Control and Systems Engineering Modeling and Simulation Signal Processing Principal component analysis Pattern recognition (psychology) Artificial intelligence 0305 other medical science business Information Systems
Zdroj:	Journal of Signal Processing Systems. 82:303-310
ISSN:	1939-8115 1939-8018
Popis:	This paper presents the basis-based speaker adaptation method that includes approaches using principal component analysis (PCA) and two-dimensional PCA (2DPCA). The proposed method partitions the hidden Markov model (HMM) mean vectors of training models into subvectors of smaller dimension. Consequently, the sample covariance matrix computed using the partitioned HMM mean vectors has various dimensions according to the dimension of the subvectors. From the eigen-decomposition of the sample covariance matrix, basis vectors are constructed. Thus, the dimension of basis vectors varies according to the dimension of the sample covariance matrix, and the proposed method includes PCA and 2DPCA-based approaches. We present the adaptation equation in both the maximum likelihood (ML) and maximum a posteriori (MAP) frameworks. We perform continuous speech recognition experiments using the Wall Street Journal (WSJ) corpus. The results show that the model with basis vectors whose dimensions are between those of PCA and 2DPCA-based approaches shows good overall performance. The proposed approach in the MAP framework shows additional performance improvement over the ML counterpart when the number of adaptation parameters is large but the amount of available adaptation data is small. Furthermore, the performance of the approach in the MAP framework approach is less sensitive to the choice of model order than the ML counterpart.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::248492491d8eb1130d3006443c6ba455 https://doi.org/10.1007/s11265-015-0996-2 Zobrazit plný text záznamu Full text from SpringerLink