Showing 1 - 10 of 1,957 for search: '"Markopoulos P"'
Author:
Mitsios, Michael, Vamvoukakis, Georgios, Maniati, Georgia, Ellinas, Nikolaos, Dimitriou, Georgios, Markopoulos, Konstantinos, Kakoulidis, Panos, Vioni, Alexandra, Christidou, Myrsini, Oh, Junkwang, Jho, Gunu, Hwang, Inchul, Vardaxoglou, Georgios, Chalamandaris, Aimilios, Tsiakoulis, Pirros, Raptis, Spyros
Emotion detection in textual data has received growing interest in recent years, as it is pivotal for developing empathetic human-computer interaction systems. This paper introduces a method for categorizing emotions from text, which acknowledges and
External link:
http://arxiv.org/abs/2404.01805
Author:
Christophorou, Christophoros, Ioannou, Iacovos, Vassiliou, Vasos, Christofi, Loizos, Vardakas, John S, Seder, Erin E, Chiasserini, Carla Fabiana, Iordache, Marius, Issaid, Chaouki Ben, Markopoulos, Ioannis, Franzese, Giulio, Järvet, Tanel, Verikoukis, Christos
In the upcoming 6G era, mobile networks must deal with more challenging applications (e.g., holographic telepresence and immersive communication) and meet far more stringent application requirements stemming along the edge-cloud continuum. These new
External link:
http://arxiv.org/abs/2403.05277
Author:
Markopoulos, Konstantinos, Maniati, Georgia, Vamvoukakis, Georgios, Ellinas, Nikolaos, Vardaxoglou, Georgios, Kakoulidis, Panos, Oh, Junkwang, Jho, Gunu, Hwang, Inchul, Chalamandaris, Aimilios, Tsiakoulis, Pirros, Raptis, Spyros
The gender of any voice user interface is a key element of its perceived identity. Recently, there has been increasing interest in interfaces where the gender is ambiguous rather than clearly identifying as female or male. This work addresses the tas
External link:
http://arxiv.org/abs/2211.00375
Author:
Ellinas, Nikolaos, Vamvoukakis, Georgios, Markopoulos, Konstantinos, Maniati, Georgia, Kakoulidis, Panos, Sung, June Sig, Hwang, Inchul, Raptis, Spyros, Chalamandaris, Aimilios, Tsiakoulis, Pirros
This paper presents a method for end-to-end cross-lingual text-to-speech (TTS) which aims to preserve the target language's pronunciation regardless of the original speaker's language. The model used is based on a non-attentive Tacotron architecture,
External link:
http://arxiv.org/abs/2210.17264
Author:
Le, Duc, Markopoulos, Panos P.
Singular-Value Decomposition (SVD) is a ubiquitous data analysis method in engineering, science, and statistics. Singular-value estimation, in particular, is of critical importance in an array of engineering applications, such as channel estimation i
External link:
http://arxiv.org/abs/2210.12097
Author:
Hyder, Rakib, Shao, Ken, Hou, Boyu, Markopoulos, Panos, Prater-Bennette, Ashley, Asif, M. Salman
Published in:
ECCV 2022
Incremental Task learning (ITL) is a category of continual learning that seeks to train a single network for multiple tasks (one after another), where training data for each task is only available during the training of that task. Neural networks ten
External link:
http://arxiv.org/abs/2207.09074
Author:
Nikitaras, Karolos, Vamvoukakis, Georgios, Ellinas, Nikolaos, Klapsas, Konstantinos, Markopoulos, Konstantinos, Raptis, Spyros, Sung, June Sig, Jho, Gunu, Chalamandaris, Aimilios, Tsiakoulis, Pirros
A text-to-speech (TTS) model typically factorizes speech attributes such as content, speaker and prosody into disentangled representations. Recent works aim to additionally model the acoustic conditions explicitly, in order to disentangle the primary
External link:
http://arxiv.org/abs/2204.05070
Author:
Kakoulidis, Panos, Ellinas, Nikolaos, Vamvoukakis, Georgios, Markopoulos, Konstantinos, Sung, June Sig, Jho, Gunu, Tsiakoulis, Pirros, Chalamandaris, Aimilios
Existing singing voice synthesis models (SVS) are usually trained on singing data and depend on either error-prone time-alignment and duration features or explicit music score information. In this paper, we propose Karaoker, a multispeaker Tacotron-b
External link:
http://arxiv.org/abs/2204.04127
Author:
Klapsas, Konstantinos, Ellinas, Nikolaos, Nikitaras, Karolos, Vamvoukakis, Georgios, Kakoulidis, Panos, Markopoulos, Konstantinos, Raptis, Spyros, Sung, June Sig, Jho, Gunu, Chalamandaris, Aimilios, Tsiakoulis, Pirros
Voice cloning is a difficult task which requires robust and informative features incorporated in a high quality TTS system in order to effectively copy an unseen speaker's voice. In our work, we utilize features learned in a self-supervised framework
External link:
http://arxiv.org/abs/2204.03421
Author:
Christidou, Myrsini, Vioni, Alexandra, Ellinas, Nikolaos, Vamvoukakis, Georgios, Markopoulos, Konstantinos, Kakoulidis, Panos, Sung, June Sig, Park, Hyoungmin, Chalamandaris, Aimilios, Tsiakoulis, Pirros
This paper presents a method for phoneme-level prosody control of F0 and duration on a multispeaker text-to-speech setup, which is based on prosodic clustering. An autoregressive attention-based model is used, incorporating multispeaker architecture
External link:
http://arxiv.org/abs/2111.10168