Zobrazeno 1 - 10
of 19
pro vyhledávání: '"I-Bin Liao"'
Publikováno v:
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2019, Iss 1, Pp 1-22 (2019)
Abstract This paper proposes two novel linguistic features extracted from text input for prosody generation in a Mandarin text-to-speech system. The first feature is the punctuation confidence (PC), which measures the likelihood that a major punctuat
Externí odkaz:
https://doaj.org/article/a82870608c964bcc945312fd7108fbae
Autor:
I-Bin Liao, 廖宜斌
87
In this thesis, two main topics are intensively studied. One is the implementation of a network news reader by using speech I/O interface. It is a real-time news reader by accessing the news server through internet. Using voice command, the u
In this thesis, two main topics are intensively studied. One is the implementation of a network news reader by using speech I/O interface. It is a real-time news reader by accessing the news server through internet. Using voice command, the u
Externí odkaz:
http://ndltd.ncl.edu.tw/handle/67137002337016162028
Publikováno v:
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2019, Iss 1, Pp 1-22 (2019)
This paper proposes two novel linguistic features extracted from text input for prosody generation in a Mandarin text-to-speech system. The first feature is the punctuation confidence (PC), which measures the likelihood that a major punctuation mark
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 24:2046-2058
In this paper, a structural maximum a posteriori (SMAP) speaker adaptation approach to adjusting the speaking rate (SR)-dependent hierarchical prosodic model (SR-HPM) of an existing SR-controlled Mandarin text-to-speech system to a new speaker's data
This paper proposes two novel linguistic features extracted from text input for prosody generation in a Mandarin text-to-speech system. The first feature is the punctuation confidence (PC), which measures the likelihood that a major punctuation mark
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5e24e6bbbc7643e45ee0dedc5606b9e7
Publikováno v:
ICASSP
In this paper, a structural maximum a posterior speaker adaptation method to adjust the existing speaking rate (SR) dependent hierarchical prosodic model (SR-HPM) to a new speaker's data for realizing a new voice of any given SR is discussed. The ada
Publikováno v:
O-COCOSDA
In this paper, a hierarchical prosody modeling approach for English speech is proposed. It is an extended version of the HPM approach proposed previously for Mandarin speech. It first designs a syllable-based, statistical prosodic model to describe v
Publikováno v:
O-COCOSDA
This paper seeks to investigate the usability of two fully-automatic machine-extracted linguistic features from an unlimited text input, in a prosody generation of Mandarin text-to-speech system (MTTS). One is the base-phrase chunk feature, labeled b
Publikováno v:
ISCSLP
In this paper, a speaker adaptation method to adapt an existing speaking rate-dependent hierarchical prosodic model (SR-HPM) of an SR-controlled Mandarin TTS system to new speaker's data for realizing a new voice is proposed. Two main problems are ad
Publikováno v:
ICASSP
In this paper, we propose a hybrid method for singing pitch extraction from polyphonic audio music. We have observed several kinds of pitch errors made by a previously proposed algorithm based on trend estimation. We also noticed that other pitch tra