Zobrazeno 1 - 10
of 22
pro vyhledávání: '"Mohammed Salah Al-Radhi"'
Publikováno v:
Applied Sciences, Vol 11, Iss 16, p 7489 (2021)
Voice conversion (VC) transforms the speaking style of a source speaker to the speaking style of a target speaker by keeping linguistic information unchanged. Traditional VC techniques rely on parallel recordings of multiple speakers uttering the sam
Externí odkaz:
https://doaj.org/article/0931fd748ab94e599540bb8bf560e1a8
Publikováno v:
Applied Sciences, Vol 9, Iss 12, p 2460 (2019)
Recent studies in text-to-speech synthesis have shown the benefit of using a continuous pitch estimate; one that interpolates fundamental frequency (F0) even when voicing is not present. However, continuous F0 is still sensitive to additive noise in
Externí odkaz:
https://doaj.org/article/d270da1b6eba4e2cacbea4ea292f8dc5
Publikováno v:
Multimedia Tools and Applications. 82:15635-15649
This paper presents an investigation of speaker adaptation using a continuous vocoder for parametric text-to-speech (TTS) synthesis. In purposes that demand low computational complexity, conventional vocoder-based statistical parametric speech synthe
Publikováno v:
Infocommunications journal. 14:55-62
Speech synthesis has the aim of generating humanlike speech from text. Nowadays, with end-to-end systems, highly natural synthesized speech can be achieved if a large enough dataset is available from the target speaker. However, often it would be nec
Publikováno v:
1st Workshop on Intelligent Infocommunication Networks, Systems and Services.
Publikováno v:
Journal of Petroleum Research and Studies. 10:1-18
Degassing station breakdowns can be dangerous to the operator health and the environment. Programmable logic controllers (PLCs) are key modules of manufacturing control systems that are applied in the complex oil and gas units to reduce manpower and
Publikováno v:
Multimedia Tools and Applications. 80:1969-1994
This article focuses on developing a system for high-quality synthesized and converted speech by addressing three fundamental principles. Although the noise-like component in the state-of-the-art parametric vocoders (for example, STRAIGHT) is often n
Publikováno v:
IEICE Transactions on Information and Systems
In this article, we propose a method called “continuous noise masking (cNM)” that allows eliminating residual buzziness in a continuous vocoder, i.e. of which all parameters are continuous and offers a simple and flexible speech analysis and synt
Neural network-based Text-to-Speech has significantly improved the quality of synthesized speech. Prominent methods (e.g., Tacotron2, FastSpeech, FastPitch) usually generate Mel-spectrogram from text and then synthesize speech using vocoder (e.g., Wa
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6e407814e93704c8d44d456dd039e8dd
Publikováno v:
SpeD
This paper shows recent Silent Speech Interface (SSI) progress that translates tongue motions into audible speech. In our previous work and also in the current study, the prediction of fundamental frequency (F0) from Ultra-Sound Tongue Images (UTI) w