Psychoacoustically motivated objective speech quality evaluation procedures, PSQM, and improvements
Autor: | John G. Bereends, Andries P. Hekstra |
---|---|
Rok vydání: | 1999 |
Předmět: |
Dynamic time warping
Acoustics and Ultrasonics Computer science Speech recognition media_common.quotation_subject ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION PSQM Arts and Humanities (miscellaneous) Perception Codec Quality (business) Sone Set (psychology) Representation (mathematics) media_common |
Zdroj: | The Journal of the Acoustical Society of America. 105:975-975 |
ISSN: | 0001-4966 |
DOI: | 10.1121/1.425330 |
Popis: | PSQM (Perceptual Speech Quality Measure), measuring speech quality objectively, has been standardized by ITU‐T as recommendation P.861. PSQM characterizes the perception of the (degraded) output speech signal of the system in comparison to the (ideal) input speech. A perceptual model is used that maps input and output signals onto psychophysical representations using psychophysical equivalents of frequency (Bark) and intensity (compressed Sone). The quality of the device under test is determined with a simple cognitive mapping from the differences in the psychophysical representation to the perceived speech quality in terms of Mean Opinion Scores (MOS). Within an ITU benchmark testing a limited set of unknown codec distortions the PSQM showed high correlations (around 0.97) between subjectively perceived and objectively measured speech quality. When applying PSQM to a wide variety of real world distortions two major limitations show up: First, dynamic time warping effects, as they will be found, e.g., in ... |
Databáze: | OpenAIRE |
Externí odkaz: |