Consideration of Lombard effect for speechreading

Autor: Tsuhan Chen, Fu Jie Huang
Rok vydání: 2002
Předmět:
Zdroj: MMSP
DOI: 10.1109/mmsp.2001.962800
Popis: We propose a method for integrating audio and visual information to enhance speech recognition in adverse environments. We train the audio hidden Markov model and the visual hidden Markov model separately, and then use a Viterbi algorithm to decode both channels in parallel. The decoding process is asynchronous between the two channels to capture the asynchronous nature of audio and visual speech. We test the proposed method using speech corrupted by various types of noise and speech with the Lombard effect.
Databáze: OpenAIRE