End-to-end Korean Digits Speech Recognition

Autor:	Jong-Hyuk Roh, Sangrae Cho, Youngsam Kim, Kwantae Cho
Rok vydání:	2019
Předmět:	Structure (mathematical logic) End-to-end principle Artificial neural network Computer science Character (computing) Speech recognition 0202 electrical engineering electronic engineering information engineering Acoustic model 020206 networking & telecommunications 020201 artificial intelligence & image processing 02 engineering and technology Language model Numerical digit
Zdroj:	ICTC
Popis:	The traditional speech recognition model consisting of an acoustic model and a language model is mainly used. Recently, an end-to-end speech recognition model consisting of a single integrated neural network model is being studied. This model has the advantage that it does not require a lot of training and it is easy to understand the structure of the model. In this paper, we designed the end-to-end model for Korean digit speech recognition and showed the performance results. We tried the digit speech recognition model in two forms: word model and character model.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::9f392e246ec7b051be4d8dfd6ea1f3b9 https://doi.org/10.1109/ictc46691.2019.8939697 Zobrazit plný text záznamu