Zobrazeno 1 - 10
of 11
pro vyhledávání: '"Takaaki Saeki"'
Publikováno v:
IEEE Access, Vol 11, Pp 144831-144843 (2023)
Restoring high-quality speech from degraded historical recordings is crucial for the preservation of cultural and endangered linguistic resources. A key challenge in this task is the scarcity of paired training data that replicate the original acoust
Externí odkaz:
https://doaj.org/article/08b8e27933e7435cafcad313c4bfe927
Pause insertion, also known as phrase break prediction and phrasing, is an essential part of TTS systems because proper pauses with natural duration significantly enhance the rhythm and intelligibility of synthetic speech. However, conventional phras
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2cfcd44e1a66851b87c8700e9d7c7213
Publikováno v:
IEICE Transactions on Information and Systems. :1002-1016
Autor:
Takaaki Saeki, Heiga Zen, Zhehuai Chen, Nobuyuki Morioka, Gary Wang, Yu Zhang, Ankur Bapna, Andrew Rosenberg, Bhuvana Ramabhadran
This paper proposes Virtuoso, a massively multilingual speech-text joint semi-supervised learning framework for text-to-speech synthesis (TTS) models. Existing multilingual TTS typically supports tens of languages, which are a small fraction of the t
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::52500f6059f16130fbb660682cd9fdfb
We present a self-supervised speech restoration method without paired speech corpora. Because the previous general speech restoration method uses artificial paired data created by applying various distortions to high-quality speech corpora, it cannot
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::9caec395491cfd205b85323b6ff968f2
While human evaluation is the most reliable metric for evaluating speech generation systems, it is generally costly and time-consuming. Previous studies on automatic speech quality assessment address the problem by predicting human evaluation scores
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7967b940812b5bdf7db054acfcf93aa1
This letter presents an incremental text-to-speech (TTS) method that performs synthesis in small linguistic units while maintaining the naturalness of output speech. Incremental TTS is generally subject to a trade-off between latency and synthetic sp
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a2993b1c6236e3e82fde500a12c568f0
http://arxiv.org/abs/2012.12612
http://arxiv.org/abs/2012.12612
Publikováno v:
ICASSP
In this paper, we propose computationally efficient and high-quality methods for statistical voice conversion (VC) with direct waveform modification based on spectral differentials. The conventional method with a minimum-phase filter achieves high-qu
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::161ec8443ecc29e3751c5b4d033f60c0
http://arxiv.org/abs/2002.06778
http://arxiv.org/abs/2002.06778
Publikováno v:
IEICE Transactions on Communications. (8):2126-2134
Cooperative relaying (CR) is a promising technique to provide spatial diversity by combining multiple signals from source and relay stations. In the present paper, the impact and use of the asymmetric property in bi-directional CR under asymmetric tr
Publikováno v:
Micro Total Analysis Systems 2002 ISBN: 9789401039536
We have developed the microchips for gene manipulation: DNA extraction and purification microchips based on the alkaline-SDS lysis method, and a microfluidic DNA- transfection device via electroporation. These chips allow contamination free, high thr
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::14a4e9273eb8bcaa9118df15f5680e99
https://doi.org/10.1007/978-94-010-0504-3_74
https://doi.org/10.1007/978-94-010-0504-3_74