A Quantitative Assessment of Group Delay Methods for Identifying Glottal Closures in Voiced Speech
Autor: | Jon Gudnason, Mike Brookes, Patrick A. Naylor |
---|---|
Rok vydání: | 2006 |
Předmět: |
Reduction (complexity)
Acoustics and Ultrasonics Computational complexity theory Speech recognition Electrical and Electronic Engineering Linear predictive coding Residual GeneralLiterature_REFERENCE(e.g. dictionaries encyclopedias glossaries) Measure (mathematics) Electroglottograph Standard deviation Group delay and phase delay Mathematics |
Zdroj: | IEEE Transactions on Audio, Speech and Language Processing. 14:456-466 |
ISSN: | 1558-7916 |
DOI: | 10.1109/tsa.2005.857810 |
Popis: | Measures based on the group delay of the LPC residual have been used by a number of authors to identify the time instants of glottal closure in voiced speech. In this paper, we discuss the theoretical properties of three such measures and we also present a new measure having useful properties. We give a quantitative assessment of each measure's ability to detect glottal closure instants evaluated using a speech database that includes a direct measurement of glottal activity from a Laryngograph/EGG signal. We find that when using a fixed-length analysis window, the best measures can detect the instant of glottal closure in 97% of larynx cycles with a standard deviation of 0.6 ms and that in 9% of these cycles an additional excitation instant is found that normally corresponds to glottal opening. We show that some improvement in detection rate may be obtained if the analysis window length is adapted to the speech pitch. If the measures are applied to the preemphasized speech instead of to the LPC residual, we find that the timing accuracy worsens but the detection rate improves slightly. We assess the computational cost of evaluating the measures and we present new recursive algorithms that give a substantial reduction in computation in all cases. |
Databáze: | OpenAIRE |
Externí odkaz: |