End-to-end Korean Digits Speech Recognition

Autor: Jong-Hyuk Roh, Sangrae Cho, Youngsam Kim, Kwantae Cho
Rok vydání: 2019
Předmět:
Zdroj: ICTC
Popis: The traditional speech recognition model consisting of an acoustic model and a language model is mainly used. Recently, an end-to-end speech recognition model consisting of a single integrated neural network model is being studied. This model has the advantage that it does not require a lot of training and it is easy to understand the structure of the model. In this paper, we designed the end-to-end model for Korean digit speech recognition and showed the performance results. We tried the digit speech recognition model in two forms: word model and character model.
Databáze: OpenAIRE