Showing 1 - 10 of 331 for search: '"Kouzelis, A."'
Recent advances in Diffusion Models (DMs) have led to significant progress in visual synthesis and editing tasks, establishing them as a strong competitor to Generative Adversarial Networks (GANs). However, the latent space of DMs is not as well unde
External link:
http://arxiv.org/abs/2408.16845
Author:
Andreas Panagopoulos, MD, PhD, Konstantina Solou, MD, MSc, Marios Nicolaides, MSc, Ioannis K. Triantafyllopoulos, MD, PhD, Antonis Kouzelis, MD, PhD, Zinon T. Kokkalis, MD, PhD
Published in:
JSES Reviews, Reports, and Techniques, Vol 4, Iss 4, Pp 676-683 (2024)
Background: Unstable “extralateral” fractures of the distal clavicle (lateral to the coracoclavicular ligaments) are not distinguished in the Neer classification system and are commonly included with Neer IIb fractures. In the literature, there i
External link:
https://doaj.org/article/6943e42bdb2142c082ee9b651d6173d8
In recent years, datasets of paired audio and captions have enabled remarkable success in automatically generating descriptions for audio clips, namely Automated Audio Captioning (AAC). However, it is labor-intensive and time-consuming to collect a s
External link:
http://arxiv.org/abs/2309.12242
Author:
Plitsis, Manos, Kouzelis, Theodoros, Paraskevopoulos, Georgios, Katsouros, Vassilis, Panagakis, Yannis
In this work, we investigate the personalization of text-to-music diffusion models in a few-shot setting. Motivated by recent advances in the computer vision domain, we are the first to explore the combination of pre-trained text-to-audio diffusers w
External link:
http://arxiv.org/abs/2309.11140
The study of speech disorders can benefit greatly from time-aligned data. However, audio-text mismatches in disfluent speech cause rapid performance degradation for modern speech aligners, hindering the use of automatic approaches. In this work, we p
External link:
http://arxiv.org/abs/2306.00996
Automated audio captioning is a multi-modal translation task that aims to generate textual descriptions for a given audio clip. In this paper we propose a full Transformer architecture that utilizes Patchout as proposed in [1], significantly reducing th
External link:
http://arxiv.org/abs/2304.02916
Author:
Paraskevopoulos, Georgios, Kouzelis, Theodoros, Rouvalis, Georgios, Katsamanis, Athanasios, Katsouros, Vassilis, Potamianos, Alexandros
Modern speech recognition systems exhibit rapid performance degradation under domain shift. This issue is especially prevalent in data-scarce settings, such as low-resource languages, where diversity of training data is limited. In this work we prop
External link:
http://arxiv.org/abs/2301.00304
Author:
Panagopoulos, Andreas, Solou, Konstantina, Nicolaides, Marios, Triantafyllopoulos, Ioannis K., Kouzelis, Antonis, Kokkalis, Zinon T.
Published in:
In JSES Reviews, Reports, and Techniques November 2024 4(4):676-683
Published in:
Applied Sciences, Vol 14, Iss 15, p 6405 (2024)
Extended reality offers unique ways to create mediated spaces that enhance and help popularize experiences across several domains, including entertainment, creativity, and culture. There are still issues that hinder the widespread adoption of the med
External link:
https://doaj.org/article/7f63200bc13742e3b299ee1de7ae8f4d
Published in:
Phys. Rev. A 101, 043847 (2020)
We study the preparation of coherent quantum states in a two-photon micromaser for applications in quantum metrology. While this setting can be in principle realized in a host of physical systems, we consider atoms interacting with the field of a cav
External link:
http://arxiv.org/abs/1906.03933