Development of a Vietnamese Large Vocabulary Continuous Speech Recognition System under Noisy Conditions

Autor: Ba Quyen Dam, Van Tuan Mai, Quoc Bao Nguyen, Van Hai Do, Quang Trung Le
Rok vydání: 2018
Předmět:
Zdroj: SoICT
DOI: 10.1145/3287921.3287938
Popis: In this paper, we first present our effort to collect a 500-hour corpus for Vietnamese read speech. After that, various techniques such as data augmentation, recurrent neural network language model rescoring, language model adaptation, bottleneck feature, system combination are applied to build the speech recognition system. Our final system achieves a low word error rate at 6.9% on the noisy test set.
Databáze: OpenAIRE