Applying SPHINX-II to the DARPA Wall Street Journal CSR Task

Autor:	Alleva, F, Hon, H, Huang, M, Rosenfeld, Ronald, Weide, R
Rok vydání:	2023
Předmět:	FOS: Psychology 80602 Computer-Human Interaction
DOI:	10.1184/r1/21709916
Popis:	This paper reports recent efforts to apply the speaker-independent SPHINX-II system to the DARPA Wall Street Journal continuous speech recognition task. In SPHINX-II, we incorporated additional dynamic and speaker-normalized features, replaced discrete models with sex-dependent semi-continuous hidden Markov models, augmented within-word triphones with between-word triphones, and extended generalized triphone models to shared-distribution models. The configuration of SPHINX-II being used for this task includes sex-dependent, semi-continuous, shared-distribution hidden Markov models and left context dependent between-word triphones. In applying our technology to this task we addressed issues that were not previously of concern owing to the (relatively) small size of the Resource Management task.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6eea97a2ebe38a75a8a04aef931d12d1 Zobrazit plný text záznamu