Audio emotion recognition by perceptual features

Autor:	Bilge Gunsel, Cenk Sezgin, Canberk Hacioglu
Rok vydání:	2012
Předmět:	Learning vector quantization business.industry Computer science Speech recognition Feature vector media_common.quotation_subject ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION Vector quantization Pattern recognition computer.software_genre Mixture model Support vector machine Discriminative model Perception Artificial intelligence Audio signal processing business computer media_common
Zdroj:	SIU
DOI:	10.1109/siu.2012.6204799
Popis:	A 9-D perceptual feature set has been used for audio emotion recognition. Performance tests have been performed on well known EMO-DB and VAM databases and the results are reported for different classifiers. Support Vector Machines, Gaussian Mixture Models and Learning Vector Quantization have been used in classification. Audio emotion recognition performance achieved by the perceptual visual features are compared to openEar and GerDa which are cited as state of the art audio emotion recognition systems. It is shown that the 9-D perceptual feature vectors are highly discriminative in continuous emotional space. It is concluded that the learning Vector Quantization increases the performance for natural records, while the Support Vector Machines provide the highest recognition rate for the acted records.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::7694daae385dc41f0936d47a469be8c4 https://doi.org/10.1109/siu.2012.6204799 Zobrazit plný text záznamu