Výsledky vyhledávání

Akademický článek

Punctuation-generation-inspired linguistic features for Mandarin prosody generation

Autor: Chen-Yu Chiang, Yu-Ping Hung, Han-Yun Yeh, I-Bin Liao, Chen-Ming Pan

Publikováno v: EURASIP Journal on Audio, Speech, and Music Processing, Vol 2019, Iss 1, Pp 1-22 (2019)

Abstract This paper proposes two novel linguistic features extracted from text input for prosody generation in a Mandarin text-to-speech system. The first feature is the punctuation confidence (PC), which measures the likelihood that a major punctuat

Externí odkaz: https://doaj.org/article/a82870608c964bcc945312fd7108fbae

Zobrazit plný text záznamu

Dissertation/ Thesis

Implementation of A Network News Reader By Using Speech I/O Interface

Autor: I-Bin Liao, 廖宜斌

87
In this thesis, two main topics are intensively studied. One is the implementation of a network news reader by using speech I/O interface. It is a real-time news reader by accessing the news server through internet. Using voice command, the u

Externí odkaz: http://ndltd.ncl.edu.tw/handle/67137002337016162028

Zobrazit plný text záznamu

Punctuation-generation-inspired linguistic features for Mandarin prosody generation

Autor: Han-Yun Yeh, I-Bin Liao, Chen-Yu Chiang, Chen-Ming Pan, Yu-Ping Hung

Publikováno v: EURASIP Journal on Audio, Speech, and Music Processing, Vol 2019, Iss 1, Pp 1-22 (2019)

This paper proposes two novel linguistic features extracted from text input for prosody generation in a Mandarin text-to-speech system. The first feature is the punctuation confidence (PC), which measures the likelihood that a major punctuation mark

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::caacef3b2e41ab0e32122dc3f14b0c5e
http://link.springer.com/article/10.1186/s13636-019-0147-y

Zobrazit plný text záznamu

Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS

Autor: Chen-Yu Chiang, I-Bin Liao, Yih-Ru Wang, Sin-Horng Chen

Publikováno v: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24:2046-2058

In this paper, a structural maximum a posteriori (SMAP) speaker adaptation approach to adjusting the speaking rate (SR)-dependent hierarchical prosodic model (SR-HPM) of an existing SR-controlled Mandarin text-to-speech system to a new speaker's data

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::5cc8ce2d8f48c04f2c010d88256bec66
https://doi.org/10.1109/taslp.2016.2598307

Zobrazit plný text záznamu

Punctuation Generation Inspired Linguistic Features for Mandarin Prosody Generation

Autor: Yu-Ping Hung, Chen-Yu Chiang, Han-Yun Yeh, I-Bin Liao, Chen-Ming Pan

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5e24e6bbbc7643e45ee0dedc5606b9e7

Zobrazit plný text záznamu

Structural maximum a posteriori speaker adaptation of speaking rate-dependent hierarchical prosodic model for Mandarin TTS

Autor: Chen-Yu Chiang, I-Bin Liao, Sin-Horng Chen

Publikováno v: ICASSP

In this paper, a structural maximum a posterior speaker adaptation method to adjust the existing speaking rate (SR) dependent hierarchical prosodic model (SR-HPM) to a new speaker's data for realizing a new voice of any given SR is discussed. The ada

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::7c59a5c94a6a3a1da37753c9ffdccbf4
https://doi.org/10.1109/icassp.2016.7472754

Zobrazit plný text záznamu

Hierarchical prosody modeling of English speech and its application to TTS

Autor: Chen-Yu Chiang, Yih-Ru Wang, Chin-Kuan Kuo, Chung-Yao Tsai, I-Bin Liao, Sin-Horng Chen

Publikováno v: O-COCOSDA

In this paper, a hierarchical prosody modeling approach for English speech is proposed. It is an extended version of the HPM approach proposed previously for Mandarin speech. It first designs a syllable-based, statistical prosodic model to describe v

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::01bead20800d7f5f086cc775f8c23008
https://doi.org/10.1109/icsda.2014.7051427

Zobrazit plný text záznamu

An investigation on linguistic features for Mandarin prosody generation

Autor: Chen-Yu Chiang, Chen-Ming Pan, I-Bin Liao, Yu-Ping Hung, Han-Yun Yeh

Publikováno v: O-COCOSDA

This paper seeks to investigate the usability of two fully-automatic machine-extracted linguistic features from an unlimited text input, in a prosody generation of Mandarin text-to-speech system (MTTS). One is the base-phrase chunk feature, labeled b

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::8f16e110d94e3d8572957493a4ab228d
https://doi.org/10.1109/icsda.2014.7051426

Zobrazit plný text záznamu

Speaker adaptation of speaking rate-dependent hierarchical prosodic model for Mandarin TTS

Autor: Sin-Horng Chen, Yih-Ru Wang, Po-Chun Wang, Chen-Yu Chiang, I-Bin Liao

Publikováno v: ISCSLP

In this paper, a speaker adaptation method to adapt an existing speaking rate-dependent hierarchical prosodic model (SR-HPM) of an SR-controlled Mandarin TTS system to new speaker's data for realizing a new voice is proposed. Two main problems are ad

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::3561ffe041c551cbfaa7dc694e7044f0
https://doi.org/10.1109/iscslp.2014.6936616

Zobrazit plný text záznamu

A hybrid approach to singing pitch extraction based on trend estimation and hidden Markov models

Autor: I-Bin Liao, Wei-Lun Chang, Jyh-Shing Roger Jang, Ming-Ju Wu, Tzu-Chun Yeh

Publikováno v: ICASSP

In this paper, we propose a hybrid method for singing pitch extraction from polyphonic audio music. We have observed several kinds of pitch errors made by a previously proposed algorithm based on trend estimation. We also noticed that other pitch tra

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::4a3554fecd1023e874a79d996036ff14
https://doi.org/10.1109/icassp.2012.6287915

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání