Výsledky vyhledávání - "Germán Bordel"

Akademický článek

A Bilingual Basque–Spanish Dataset of Parliamentary Sessions for the Development and Evaluation of Speech Technology

Autor: Amparo Varona, Mikel Penagarikano, Germán Bordel, Luis Javier Rodriguez-Fuentes

Publikováno v: Applied Sciences, Vol 14, Iss 5, p 1951 (2024)

The development of speech technology requires large amounts of data to estimate the underlying models. Even when relying on large multilingual pre-trained models, some amount of task-specific data on the target language is needed to fine-tune those m

Externí odkaz: https://doaj.org/article/b728311dc6f649c7b17ba4a86578c55f

Zobrazit plný text záznamu

Akademický článek

An Overview of the IberSpeech-RTVE 2022 Challenges on Speech Technologies

Autor: Eduardo Lleida, Luis Javier Rodriguez-Fuentes, Javier Tejedor, Alfonso Ortega, Antonio Miguel, Virginia Bazán, Carmen Pérez, Alberto de Prada, Mikel Penagarikano, Amparo Varona, Germán Bordel, Doroteo Torre-Toledano, Aitor Álvarez, Haritz Arzelus

Publikováno v: Applied Sciences, Vol 13, Iss 15, p 8577 (2023)

Evaluation campaigns provide a common framework with which the progress of speech technologies can be effectively measured. The aim of this paper is to present a detailed overview of the IberSpeech-RTVE 2022 Challenges, which were organized as part o

Externí odkaz: https://doaj.org/article/75d032e326114ebca601ea06a1a3cd04

Zobrazit plný text záznamu

Akademický článek

Semisupervised Speech Data Extraction from Basque Parliament Sessions and Validation on Fully Bilingual Basque–Spanish ASR

Autor: Mikel Penagarikano, Amparo Varona, Germán Bordel, Luis Javier Rodriguez-Fuentes

Publikováno v: Applied Sciences, Vol 13, Iss 14, p 8492 (2023)

In this paper, a semisupervised speech data extraction method is presented and applied to create a new dataset designed for the development of fully bilingual Automatic Speech Recognition (ASR) systems for Basque and Spanish. The dataset is drawn fro

Externí odkaz: https://doaj.org/article/e8c099db1d79434f9d1b16b7d1d9094f

Zobrazit plný text záznamu

GTTS Systems for the Albayzin 2022 Speech and Text Alignment Challenge

Autor: Germán Bordel, Luis Javier Rodriguez-Fuentes, Mikel Peñagarikano, Amparo Varona

Publikováno v: IberSPEECH 2022.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::f0e76b30ca610e8f1bf270e34dd3f16b
https://doi.org/10.21437/iberspeech.2022-58

Zobrazit plný text záznamu

GTTS-EHU Systems for the Albayzin 2018 Search on Speech Evaluation

Autor: Mikel Peñagarikano, Luis Javier Rodríguez-Fuentes, Amparo Varona, Germán Bordel

Publikováno v: IberSPEECH

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::a08dabcb38a5cc9abec726998e7796dc
https://doi.org/10.21437/iberspeech.2018-52

Zobrazit plný text záznamu

Probabilistic Kernels for Improved Text-to-Speech Alignment in Long Audio Tracks

Autor: Luis Javier Rodríguez-Fuentes, Amparo Varona, Aitor Alvarez, Germán Bordel, Mikel Peñagarikano

Publikováno v: IEEE Signal Processing Letters. 23:126-129

The synchronization of text transcripts with audio tracks is typically solved by forced alignment at the phonetic level. However, when dealing with either very long audio tracks or acoustically inaccurate text transcripts, more complex methods are ne

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::7614f61e3150d1640e30d09defda8729
https://doi.org/10.1109/lsp.2015.2505140

Zobrazit plný text záznamu

KALAKA-3: a database for the assessment of spoken language recognition technology on YouTube audios

Autor: Germán Bordel, Mireia Diez, Luis Javier Rodríguez-Fuentes, Amparo Varona, Mikel Peñagarikano

Publikováno v: Language Resources and Evaluation. 50:221-243

KALAKA-3 is a speech database specifically designed for the development and evaluation of Spoken Language Recognition (SLR) systems. The database provides TV broadcast speech for training, and audio data extracted from YouTube videos for tuning and t

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::6e61e26a1d6c8668638227a8c1658f7e
https://doi.org/10.1007/s10579-015-9324-5

Zobrazit plný text záznamu

On the Projection of PLLRs for Unbounded Feature Distributions in Spoken Language Recognition

Autor: Germán Bordel, Luis Javier Rodríguez-Fuentes, Amparo Varona, Mireia Diez, Mikel Peñagarikano

Publikováno v: IEEE Signal Processing Letters. 21:1073-1077

The so called Phone Log-Likelihood Ratio (PLLR) features have been recently introduced as a novel and effective way of retrieving acoustic-phonetic information in spoken language and speaker recognition systems. In this letter, an in-depth insight in

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::0af21db193038fc82a9277c6381e24b1
https://doi.org/10.1109/lsp.2014.2324819

Zobrazit plný text záznamu

On the Complementarity of Phone Posterior Probabilities for Improved Speaker Recognition

Autor: Mireia Diez, Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Germán Bordel, Amparo Varona

Publikováno v: IEEE Signal Processing Letters. 21:649-652

In this letter, we apply Phone Log-Likelihood Ratio (PLLR) features to the task of speaker recognition. PLLRs, which are computed on the phone posterior probabilities provided by phone decoders, convey acoustic-phonetic information in a sequence of f

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::5e894be61ef5bbbd7948f69d2e03161b
https://doi.org/10.1109/lsp.2014.2312213

Zobrazit plný text záznamu

Improved Modeling of Cross-Decoder Phone Co-Occurrences in SVM-Based Phonotactic Language Recognition

Autor: Mikel Peñagarikano, Germán Bordel, Luis Javier Rodríguez-Fuentes, Amparo Varona

Publikováno v: IEEE Transactions on Audio, Speech, and Language Processing. 19:2348-2363

Most common approaches to phonotactic language recognition deal with several independent phone decodings. These decodings are processed and scored in a fully uncoupled way, their time alignment (and the information that may be extracted from it) bein

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::10b2b4d56ae20165e06e0722167d8715
https://doi.org/10.1109/tasl.2011.2134088

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání