Zobrazeno 1 - 10
of 1 817
pro vyhledávání: '"Trueba, P."'
Autor:
Martín-Cortinas, Álvaro, Sáez-Trigueros, Daniel, Vallés-Pérez, Iván, Tura-Vecino, Biel, Biliński, Piotr, Lajszczak, Mateusz, Beringer, Grzegorz, Barra-Chicote, Roberto, Lorenzo-Trueba, Jaime
Large Language Models (LLMs) are one of the most promising technologies for the next era of speech generation systems, due to their scalability and in-context learning capabilities. Nevertheless, they suffer from multiple stability issues at inferenc
Externí odkaz:
http://arxiv.org/abs/2402.03407
Phonetic information and linguistic knowledge are an essential component of a Text-to-speech (TTS) front-end. Given a language, a lexicon can be collected offline and Grapheme-to-Phoneme (G2P) relationships are usually modeled in order to predict the
Externí odkaz:
http://arxiv.org/abs/2307.16709
Autor:
Zhang, Guangyan, Merritt, Thomas, Ribeiro, Manuel Sam, Tura-Vecino, Biel, Yanagisawa, Kayoko, Pokora, Kamil, Ezzerg, Abdelhamid, Cygert, Sebastian, Abbas, Ammar, Bilinski, Piotr, Barra-Chicote, Roberto, Korzekwa, Daniel, Lorenzo-Trueba, Jaime
Neural text-to-speech systems are often optimized on L1/L2 losses, which make strong assumptions about the distributions of the target data space. Aiming to improve those assumptions, Normalizing Flows and Diffusion Probabilistic Models were recently
Externí odkaz:
http://arxiv.org/abs/2307.16679
The Grapheme-to-Phoneme (G2P) task aims to convert orthographic input into a discrete phonetic representation. G2P conversion is beneficial to various speech processing applications, such as text-to-speech and speech recognition. However, these tend
Externí odkaz:
http://arxiv.org/abs/2307.16643
$ $With recent advances in CNNs, exceptional improvements have been made in semantic segmentation of high resolution images in terms of accuracy and latency. However, challenges still remain in detecting objects in crowded scenes, large scale variati
Externí odkaz:
http://arxiv.org/abs/2212.08613
Publikováno v:
Humanities & Social Sciences Communications, Vol 11, Iss 1, Pp 1-20 (2024)
Abstract The academic literature on personal experiences of climate-induced wellbeing erosion (often conceptualised as ‘non-economic losses and damages’) is still limited. This represents a serious climate policy gap that hinders support for marg
Externí odkaz:
https://doaj.org/article/7bcff4f3c9ce4a729559ddaeae9c9ba0
Autor:
Arrayás, Manuel, Bettsworth, Francis, Haley, Richard, Schanen, Roch, Trueba, José Luis, Uriarte, Carlos, Zavyalov, Vladislav, Zmeev, Dmitry
We present the working prototype of a levitation system designed for investigation of flows in cryogenic helium fluids. The current device allows the levitation of a superconducting sphere and has several provisions made for allowing precise control
Externí odkaz:
http://arxiv.org/abs/2210.16615
Autor:
Adrian Trueba Espinosa, Jessica Sanchez -Arrazola, Jair Cervantes, Farid Garcia-Lamont, José Sergio Ruiz Castilla
Publikováno v:
ELCVIA Electronic Letters on Computer Vision and Image Analysis, Vol 23, Iss 1 (2024)
In this paper we propose the classification of radiological patterns with the presence of tuberculosis in X-ray images, it was observed that two to six patterns (consolidation, fibrosis, opacity, opacity, pleural, nodules and cavitations) are present
Externí odkaz:
https://doaj.org/article/12ebd0d6617f41cbb669c95ddd719001
Autor:
Edgar Trueba Paz y Puente
Publikováno v:
Iuris Tantum, Vol 38, Iss 39 (2024)
En los últimos años se han desarrollado diversos cuestionamientos al modelo capitalista. Uno de los más recurrentes se centra en el incremento de la desigualdad y la necesidad de redistribución de la riqueza. En este ensayo se cuestionan algunos
Externí odkaz:
https://doaj.org/article/a931c8a390684629a00fd78e2ad80aa5
The availability of data in expressive styles across languages is limited, and recording sessions are costly and time consuming. To overcome these issues, we demonstrate how to build low-resource, neural text-to-speech (TTS) voices with only 1 hour o
Externí odkaz:
http://arxiv.org/abs/2207.14607