Audio signal classification with temporal envelopes

Autor:	Biing-Hwang Juang, M. Umair Bin Altaf
Rok vydání:	2011
Předmět:	Audio signal Computer science business.industry Speech recognition Speech coding Bandwidth extension Spectral density Pattern recognition Audio signal flow computer.software_genre Signal Mel-frequency cepstrum Artificial intelligence Sound quality Audio signal processing business computer
Zdroj:	ICASSP
DOI:	10.1109/icassp.2011.5946442
Popis:	The conventional approach to audio processing, based on the short-time power spectrum model, is not adequate when it comes to general audio signals. We propose an approach, justified by studies from psycho-acoustics and neuroimaging, which uses the magnitude and frequency envelope of the audio signal in the from of AM-FM modulations to build an ARMA model which is then fed to a GMM to classify into various audio classes. We show that it makes explicit certain aspects of the signal which are overlooked when processing is limited to the spectral domain.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::4ca41c0925f06549d5a915fee37d1290 https://doi.org/10.1109/icassp.2011.5946442 Zobrazit plný text záznamu