Development of a Vietnamese Large Vocabulary Continuous Speech Recognition System under Noisy Conditions
Author: | Ba Quyen Dam, Van Tuan Mai, Quoc Bao Nguyen, Van Hai Do, Quang Trung Le |
---|---|
Year of publication: | 2018 |
Subject: |
Vocabulary; Computer science; Speech recognition; Vietnamese; Word error rate; Speech corpus; Recurrent neural network; Test set; Feature (machine learning); Language model; 02 engineering and technology; 0202 electrical engineering, electronic engineering, information engineering; 020201 artificial intelligence & image processing; 03 medical and health sciences; 0305 other medical science; 030507 speech-language pathology & audiology |
Source: | SoICT |
DOI: | 10.1145/3287921.3287938 |
Description: | In this paper, we first present our effort to collect a 500-hour corpus of Vietnamese read speech. We then apply several techniques to build the speech recognition system: data augmentation, recurrent neural network language model rescoring, language model adaptation, bottleneck features, and system combination. Our final system achieves a low word error rate of 6.9% on the noisy test set. |
Database: | OpenAIRE |