Audio emotion recognition by perceptual features
Autor: | Bilge Gunsel, Cenk Sezgin, Canberk Hacioglu |
---|---|
Rok vydání: | 2012 |
Předmět: |
Learning vector quantization
business.industry Computer science Speech recognition Feature vector media_common.quotation_subject ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION Vector quantization Pattern recognition computer.software_genre Mixture model Support vector machine Discriminative model Perception Artificial intelligence Audio signal processing business computer media_common |
Zdroj: | SIU |
DOI: | 10.1109/siu.2012.6204799 |
Popis: | A 9-D perceptual feature set has been used for audio emotion recognition. Performance tests have been performed on well known EMO-DB and VAM databases and the results are reported for different classifiers. Support Vector Machines, Gaussian Mixture Models and Learning Vector Quantization have been used in classification. Audio emotion recognition performance achieved by the perceptual visual features are compared to openEar and GerDa which are cited as state of the art audio emotion recognition systems. It is shown that the 9-D perceptual feature vectors are highly discriminative in continuous emotional space. It is concluded that the learning Vector Quantization increases the performance for natural records, while the Support Vector Machines provide the highest recognition rate for the acted records. |
Databáze: | OpenAIRE |
Externí odkaz: |