Phoneme Duration Prediction for Kazakh Language
Autor: | Sergey V. Rybin, Yuri Matveev, Arman Kaliyev |
---|---|
Rok vydání: | 2018 |
Předmět: |
Artificial neural network
Computer science Speech recognition Detector 020207 software engineering Speech synthesis 02 engineering and technology Kazakh computer.software_genre language.human_language Set (abstract data type) 030507 speech-language pathology & audiology 03 medical and health sciences ComputingMethodologies_PATTERNRECOGNITION 0202 electrical engineering electronic engineering information engineering language Duration (project management) 0305 other medical science Classifier (UML) computer Test data |
Zdroj: | Speech and Computer ISBN: 9783319995786 SPECOM |
DOI: | 10.1007/978-3-319-99579-3_29 |
Popis: | Our research team set the goal of creating a modern speech synthesis system for the Kazakh language. One of the most important components of such system is the phoneme duration prediction. In this article, we present our work on the creation of such a classifier. We managed to develop a detector based on deep neural network, using for this purpose a minimum number of input linguistic and phonetic parameters. Based on the learning results, the proposed detector predicts the duration of phonemes on test data with a deviation of 20–25 ms on average. |
Databáze: | OpenAIRE |
Externí odkaz: |