Diphone preparation for Bangla text to speech synthesis
Autor: | Md. Akter Hussain, Muhammad Masud Rashid, M. Shahidur Rahman |
---|---|
Rok vydání: | 2009 |
Předmět: |
Computer science
business.industry Speech recognition Concatenation Speech synthesis computer.software_genre Diphone language.human_language Bengali MBROLA language Selection (linguistics) Text normalization Artificial intelligence business computer Natural language processing Digital signal processing |
Zdroj: | 2009 12th International Conference on Computers and Information Technology. |
DOI: | 10.1109/iccit.2009.5407135 |
Popis: | This paper presents methodologies involved in diphone preparation for Bangla text to speech synthesis. A concatenation based synthesis system comprises basically two modules- one is natural language processing and other is digital signal processing (DSP). Natural language processing implies converting text to its pronounceable text, called text normalization and the diphone selection method based on the normalized text is called Graphene to Phoneme (G2P) conversion. We developed a speech synthesizer for Bangla using diphone based concatenative approach. Diphone preparation, labeling and selection techniques are described in this paper. |
Databáze: | OpenAIRE |
Externí odkaz: |