Phase-Based Feature Representations for Improving Recognition of Dysarthric Speech
Autor: | Stuart Cunningham, Siddharth Sehgal, Phil D. Green |
---|---|
Rok vydání: | 2018 |
Předmět: |
Computer science
Speech recognition 02 engineering and technology 030507 speech-language pathology & audiology 03 medical and health sciences Dysarthria Cepstrum Motor speech 0202 electrical engineering electronic engineering information engineering medicine Feature (machine learning) 020201 artificial intelligence & image processing Mel-frequency cepstrum medicine.symptom 0305 other medical science Hidden Markov model Vocal tract Group delay and phase delay |
Zdroj: | SLT |
Popis: | Dysarthria is a neurological speech impairment, which usually results in the loss of motor speech control due to muscular atrophy and incoordination of the articulators. As a result the speech becomes less intelligible and difficult to model by machine learning algorithms due to inconsistencies in the acoustic signal and data sparseness. This paper presents phase-based feature representations for dysarthric speech that are exploited in the group delay spectrum. Such representations are found to be better suited to characterising the resonances of the vocal tract, exhibit better phone discrimination capabilities in dysarthric signals and consequently improve ASR performance. All the experiments were conducted using the UASPEECH corpus and significant ASR gains are reported using phase-based cepstral features in comparison to the standard MFCCs irrespective of the severity of the condition. |
Databáze: | OpenAIRE |
Externí odkaz: |