Formant analysis in dysphonic patients and automatic Arabic digit speech recognition

Autor: Farahat Mohamed, Malki Khalid H, Mesallam Tamer A, Muhammad Ghulam, Alsulaiman Mansour, Bukhari Manal
Jazyk: angličtina
Rok vydání: 2011
Předmět:
Zdroj: BioMedical Engineering OnLine, Vol 10, Iss 1, p 41 (2011)
Druh dokumentu: article
ISSN: 1475-925X
DOI: 10.1186/1475-925X-10-41
Popis: Abstract Background and objective There has been a growing interest in objective assessment of speech in dysphonic patients for the classification of the type and severity of voice pathologies using automatic speech recognition (ASR). The aim of this work was to study the accuracy of the conventional ASR system (with Mel frequency cepstral coefficients (MFCCs) based front end and hidden Markov model (HMM) based back end) in recognizing the speech characteristics of people with pathological voice. Materials and methods The speech samples of 62 dysphonic patients with six different types of voice disorders and 50 normal subjects were analyzed. The Arabic spoken digits were taken as an input. The distribution of the first four formants of the vowel /a/ was extracted to examine deviation of the formants from normal. Results There was 100% recognition accuracy obtained for Arabic digits spoken by normal speakers. However, there was a significant loss of accuracy in the classifications while spoken by voice disordered subjects. Moreover, no significant improvement in ASR performance was achieved after assessing a subset of the individuals with disordered voices who underwent treatment. Conclusion The results of this study revealed that the current ASR technique is not a reliable tool in recognizing the speech of dysphonic patients.
Databáze: Directory of Open Access Journals