Maximum relative margin estimation of HMMS based on N-best string models for continuous speech recognition

Autor:	L. Rigazio, Chaojun Liu, Hui Jiang
Rok vydání:	2005
Předmět:	Estimation Computer science business.industry Speech recognition String (computer science) Word error rate Pattern recognition Task (project management) Margin (machine learning) Test set Margin classifier Artificial intelligence Hidden Markov model business
Zdroj:	IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..
DOI:	10.1109/asru.2005.1566540
Popis:	Based on the principle of large margin classifier, recently we proposed two novel training methods, namely large margin estimation (LME) [8] and maximum relative margin estimation (MRME) [9] for speech recognition. In LME or MRME, HMM parameters are estimated to maximize the minimum margin among all training utterances. However their original formulation is limited to isolated-word ASR tasks. In this paper, we propose a new training method based on N-best string models to extend the original MRME framework to continuous speech recognition. We also study a new definition of relative margin which is more theoretically sound than the one used in [9]. Experimental results in a connected digit recognition task clearly show that the string-level MRME is very effective in terms of reducing recognition error rates by up to 57% over our best MCE-trained models. A string error rate as low as 0.84% has been achieved on the standard TIDIGITS test set, which is the best result that has ever been reported in this task
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::b1adfba86b1cd05d43bb5fcaeec942a5 https://doi.org/10.1109/asru.2005.1566540 Zobrazit plný text záznamu