Showing 1 - 10 of 256 for search: '"Tamas, Gabor"'
The aim of the study is to investigate the complex mechanisms of speech perception and ultimately decode the electrical changes in the brain occurring while listening to speech. We attempt to decode heard speech from intracranial electroencephalograph
External link:
http://arxiv.org/abs/2402.16996
Published in:
Proceedings of Interspeech 2023
Previous initial research has already been carried out to propose speech-based BCI using brain signals (e.g. non-invasive EEG and invasive sEEG / ECoG), but there is a lack of combined methods that investigate non-invasive brain, articulation, and sp
External link:
http://arxiv.org/abs/2306.05374
Author:
Pogliano, Francesco, Larsen, Ann-Cecilie, Garrote, Frank Leonel Bello, Bjørøen, Marianne Møller, Eriksen, Tomas Kvalheim, Gjestvang, Dorthea, Görgen, Andreas, Guttormsen, Magne, Li, Kevin Ching Wei, Markova, Maria, Matthews, Eric Francis, Paulsen, Wanja, Pedersen, Line Gaard, Siem, Sunniva, Storebakken, Tellef, Tornyi, Tamas Gabor, Vevik, Julian Ersland
Published in:
Phys. Rev. C 106, 015804, Published 25 July 2022
Nuclei in the $^{135}$I region have been identified as being a possible bottleneck for the \textit{i} process. Here we present an indirect measurement for the Maxwellian-averaged cross section of $^{126}\text{Sb}(n,\gamma)$. The nuclear level density
External link:
http://arxiv.org/abs/2208.10397
Neural network-based Text-to-Speech has significantly improved the quality of synthesized speech. Prominent methods (e.g., Tacotron2, FastSpeech, FastPitch) usually generate a Mel-spectrogram from text and then synthesize speech using a vocoder (e.g., Wa
External link:
http://arxiv.org/abs/2208.07122
Traditional vocoder-based statistical parametric speech synthesis can be advantageous in applications that require low computational complexity. Recent neural vocoders, which can produce high naturalness, still cannot fulfill the requirement of being
External link:
http://arxiv.org/abs/2108.01154
Author:
Zainkó, Csaba, Tóth, László, Shandiz, Amin Honarmandi, Gosztolya, Gábor, Markó, Alexandra, Németh, Géza, Csapó, Tamás Gábor
For articulatory-to-acoustic mapping, typically only limited parallel training data is available, making it impossible to apply fully end-to-end solutions like Tacotron2. In this paper, we experimented with transfer learning and adaptation of a Tacot
External link:
http://arxiv.org/abs/2107.12051
Author:
Csapó, Tamás Gábor
In this paper, we present our first experiments in text-to-articulation prediction, using ultrasound tongue image targets. We extend a traditional (vocoder-based) DNN-TTS framework with predicting PCA-compressed ultrasound images, of which the contin
External link:
http://arxiv.org/abs/2107.05550
Articulatory information has been shown to be effective in improving the performance of HMM-based and DNN-based text-to-speech synthesis. Speech synthesis research focuses traditionally on text-to-speech conversion, when the input is text or an estim
External link:
http://arxiv.org/abs/2107.02003
Author:
Szegedi, Viktor, Tiszlavicz, Ádám, Furdan, Szabina, Douida, Abdennour, Bakos, Emoke, Barzo, Pal, Tamas, Gabor, Szucs, Attila, Lamsa, Karri
Published in:
In Journal of Biotechnology 20 June 2024 389:1-12
Vocoders have received renewed attention as main components in statistical parametric text-to-speech (TTS) synthesis and speech transformation systems. Even though there are vocoding techniques that give almost acceptable synthesized speech, their high computati
External link:
http://arxiv.org/abs/2106.10481