On the influence of quantization on the identifiability of emotions from voice coding parameters
Autor: | Roch Lefebvre, Patrick Robitaille, Samuel Trempe, Philippe Gournay |
---|---|
Rok vydání: | 2016 |
Předmět: |
Voice activity detection
Mean squared error Computer science Speech recognition Quantization (signal processing) Speech coding 02 engineering and technology Linear predictive coding 01 natural sciences 0103 physical sciences 0202 electrical engineering electronic engineering information engineering Voice Identifiability 020201 artificial intelligence & image processing 010301 acoustics Harmonic Vector Excitation Coding Coding (social sciences) |
Zdroj: | ICASSP |
DOI: | 10.1109/icassp.2016.7472819 |
Popis: | Although emotions play a major role in voice communication, the quality of their reproduction by low bit rate voice coders has never been investigated so far. This paper shows that the emotional state of a speaker can be identified automatically, with reasonable precision and accuracy, using conventional voice coding parameters (pitch, voicing, energy and LPC coefficients). It also shows that the performance of this identification degrades when these parameters are quantized, especially at lower rates (1200 bits/s). This suggests that quantization of speech parameters could be improved by targeting the faithful reproduction of important higher-level voice communication attributes such as emotions, rather than simply optimizing objective measures such as the signal-to-noise ratio, mean squared error and spectral distortion. |
Databáze: | OpenAIRE |
Externí odkaz: |