Pre-processing speech for speech recognition

Autor:	Antoni Abella, Thomas Kemp, Raquel Tato
Rok vydání:	2010
Předmět:	Acoustics and Ultrasonics Arts and Humanities (miscellaneous) Computer Science::Sound Computer science Speech recognition Acoustic model Computer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing) Speech processing Linear predictive coding Signal
Zdroj:	The Journal of the Acoustical Society of America. 128:964
ISSN:	0001-4966
DOI:	10.1121/1.3481751
Popis:	A method for pre-processing speech, in particular for recognizing speech, including receiving a speech signal, separating a spectrum of said speech signal into a number of predetermined frequency sub-bands, analyzing said speech signal within each of said frequency sub-bands, generating respective band-dependent acoustic feature data for each of said respective frequency sub-bands, deriving band-dependent likelihoods for occurrences of speech elements or within said speech signal based on said band-dependent acoustic feature data, analyzing said speech signal within said spectrum, generating full-band acoustic feature data, which are at least in part representative for said speech signal with respect to said spectrum, deriving a full-band likelihood for occurrences of speech elements or of sequences thereof within said speech signal based on said full-band acoustic feature data, deriving an overall likelihood for occurrences of speech elements within said speech signal based on said band-dependent likelihoods and said full-band likelihood.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::136160474ef09b0c2ea08abbb9587aa8 https://doi.org/10.1121/1.3481751 Zobrazit plný text záznamu