End-to-end Korean Digits Speech Recognition
Autor: | Jong-Hyuk Roh, Sangrae Cho, Youngsam Kim, Kwantae Cho |
---|---|
Rok vydání: | 2019 |
Předmět: |
Structure (mathematical logic)
End-to-end principle Artificial neural network Computer science Character (computing) Speech recognition 0202 electrical engineering electronic engineering information engineering Acoustic model 020206 networking & telecommunications 020201 artificial intelligence & image processing 02 engineering and technology Language model Numerical digit |
Zdroj: | ICTC |
Popis: | The traditional speech recognition model consisting of an acoustic model and a language model is mainly used. Recently, an end-to-end speech recognition model consisting of a single integrated neural network model is being studied. This model has the advantage that it does not require a lot of training and it is easy to understand the structure of the model. In this paper, we designed the end-to-end model for Korean digit speech recognition and showed the performance results. We tried the digit speech recognition model in two forms: word model and character model. |
Databáze: | OpenAIRE |
Externí odkaz: |