Výsledky vyhledávání - "ZAINKÓ, CSABA"

Report

Towards Parametric Speech Synthesis Using Gaussian-Markov Model of Spectral Envelope and Wavelet-Based Decomposition of F0

Autor: Al-Radhi, Mohammed Salah, Csapó, Tamás Gábor, Zainkó, Csaba, Németh, Géza

Neural network-based Text-to-Speech has significantly improved the quality of synthesized speech. Prominent methods (e.g., Tacotron2, FastSpeech, FastPitch) usually generate Mel-spectrogram from text and then synthesize speech using vocoder (e.g., Wa

Externí odkaz: http://arxiv.org/abs/2208.07122

Zobrazit plný text záznamu

Report

Improving Self-Supervised Learning-based MOS Prediction Networks

Autor: Gyires-Tóth, Bálint, Zainkó, Csaba

MOS (Mean Opinion Score) is a subjective method used for the evaluation of a system's quality. Telecommunications (for voice and video), and speech synthesis systems (for generated speech) are a few of the many applications of the method. While MOS t

Externí odkaz: http://arxiv.org/abs/2204.11030

Zobrazit plný text záznamu

Report

Adaptation of Tacotron2-based Text-To-Speech for Articulatory-to-Acoustic Mapping using Ultrasound Tongue Imaging

Autor: Zainkó, Csaba, Tóth, László, Shandiz, Amin Honarmandi, Gosztolya, Gábor, Markó, Alexandra, Németh, Géza, Csapó, Tamás Gábor

For articulatory-to-acoustic mapping, typically only limited parallel training data is available, making it impossible to apply fully end-to-end solutions like Tacotron2. In this paper, we experimented with transfer learning and adaptation of a Tacot

Externí odkaz: http://arxiv.org/abs/2107.12051

Zobrazit plný text záznamu

Report

Continuous Wavelet Vocoder-based Decomposition of Parametric Speech Waveform Synthesis

Autor: Al-Radhi, Mohammed Salah, Csapó, Tamás Gábor, Zainkó, Csaba, Németh, Géza

To date, various speech technology systems have adopted the vocoder approach, a method for synthesizing speech waveform that shows a major role in the performance of statistical parametric speech synthesis. WaveNet one of the best models that nearly

Externí odkaz: http://arxiv.org/abs/2106.06863

Zobrazit plný text záznamu

Report

Ultrasound-based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis

Autor: Csapó, Tamás Gábor, Zainkó, Csaba, Tóth, László, Gosztolya, Gábor, Markó, Alexandra

For articulatory-to-acoustic mapping using deep neural networks, typically spectral and excitation parameters of vocoders have been used as the training targets. However, vocoding often results in buzzy and muffled final speech quality. Therefore, in

Externí odkaz: http://arxiv.org/abs/2008.03152

Zobrazit plný text záznamu

Akademický článek

Speech-centric Multimodal Interaction for Easy-to-access Online Services – A Personal Life Assistant for the Elderly

Autor: Teixeira, António, Hämäläinen, Annika, Avelar, Jairo, Almeida, Nuno, Németh, Géza, Fegyó, Tibor, Zainkó, Csaba, Csapó, Tamás, Tóth, Bálint, Oliveira, André, Dias, Miguel Sales

Publikováno v: In Procedia Computer Science 2014 27:389-397

Zobrazit plný text záznamu

Akademický článek

Multilingual statistical text analysis, Zipf's law and Hungarian speech generation

Autor: Németh, Géza, Zainkó, Csaba

Publikováno v: Acta Linguistica Hungarica (Since 2017 Acta Linguistica Academica). 49(3-4):385-405

Externí odkaz: https://www.ceeol.com/search/article-detail?id=626072

Zobrazit plný text záznamu

Kísérletek a WavNet módszer alkalmazására magyar beszédszintézishez

Autor: Zainkó, Csaba, Gyires-Tóth, Bálint, Németh, Géza, Olaszy, Gábor

The WaveNet architecture is suitable to generate high quality speech, it was demonstrated for English by Google DeepMind. In this paper we de- scribe our experiments of using WaveNet for Hungarian speech generation. We investigated the effects of dif

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::abd40739bf9cad1321f4bc0f60d53ca0

Zobrazit plný text záznamu

Kniha

Special Speech Synthesis for Social Network Websites.

Autor: Zainkó, Csaba, Csapó, Tamás Gábor, Németh, Géza

Publikováno v: Text, Speech & Dialogue (9783642157592); 2010, p455-463, 9p

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání