Výsledky vyhledávání

Report

Data Augmentation for Low-Resource Quechua ASR Improvement

Autor: Zevallos, Rodolfo, Bel, Nuria, Cámbara, Guillermo, Farrús, Mireia, Luque, Jordi

Automatic Speech Recognition (ASR) is a key element in new services that helps users to interact with an automated system. Deep learning methods have made it possible to deploy systems with word error rates below 5% for ASR of English. However, the u

Externí odkaz: http://arxiv.org/abs/2207.06872

Zobrazit plný text záznamu

Report

Voice Quality and Pitch Features in Transformer-Based Speech Recognition

Autor: Cámbara, Guillermo, Luque, Jordi, Farrús, Mireia

Jitter and shimmer measurements have shown to be carriers of voice quality and prosodic information which enhance the performance of tasks like speaker recognition, diarization or automatic speech recognition (ASR). However, such features have been s

Externí odkaz: http://arxiv.org/abs/2112.11391

Zobrazit plný text záznamu

Akademický článek

SCALING UP β-FRUCTOSIDASE PRODUCTION BY PICHIA PASTORIS BfrA4X ON A COST-EFFECTIVE CANE MOLASSES MEDIUM

Autor: Julio Cesar Ortega Cambara, Vivian León Fernández, Duniesky Martínez García, Alina Sobrino Legón, Daisy Dopico Ramírez, Enrique Rosendo Pérez Cruz

Publikováno v: Revista Centro Azúcar, Vol 51, Iss 2, Pp e1065(02/05/2024)-e1065(02/05/2024) (2024)

Introduction: Molasses is an agro-industrial by-product that could be used as inexpensive carbon source to reduce the culture media cost for enzyme-producing microorganisms. Objective: To evaluate four combinations of carbon and nitrogen sources for

Externí odkaz: https://doaj.org/article/eca61346048b4023a4ca677881448ee6

Zobrazit plný text záznamu

Report

Influence of ASR and Language Model on Alzheimer's Disease Detection

Autor: Codina-Filbà, Joan, Cámbara, Guillermo, Luque, Jordi, Farrús, Mireia

Alzheimer's Disease is the most common form of dementia. Automatic detection from speech could help to identify symptoms at early stages, so that preventive actions can be carried out. This research is a contribution to the ADReSSo Challenge, we anal

Externí odkaz: http://arxiv.org/abs/2110.15704

Zobrazit plný text záznamu

Report

English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System

Autor: Cámbara, Guillermo, Peiró-Lilja, Alex, Farrús, Mireia, Luque, Jordi

Nowadays, research in speech technologies has gotten a lot out thanks to recently created public domain corpora that contain thousands of recording hours. These large amounts of data are very helpful for training the new complex models based on deep

Externí odkaz: http://arxiv.org/abs/2105.05041

Zobrazit plný text záznamu

Report

Speech Enhancement for Wake-Up-Word detection in Voice Assistants

Autor: Bonet, David, Cámbara, Guillermo, López, Fernando, Gómez, Pablo, Segura, Carlos, Luque, Jordi

Keyword spotting and in particular Wake-Up-Word (WUW) detection is a very important task for voice assistants. A very common issue of voice assistants is that they get easily activated by background noise like music, TV or background speech that acci

Externí odkaz: http://arxiv.org/abs/2101.12732

Zobrazit plný text záznamu

Report

BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge

Autor: Kocour, Martin, Cámbara, Guillermo, Luque, Jordi, Bonet, David, Farrús, Mireia, Karafiát, Martin, Veselý, Karel, Ĉernocký, Jan ''Honza''

This paper describes joint effort of BUT and Telef\'onica Research on development of Automatic Speech Recognition systems for Albayzin 2020 Challenge. We compare approaches based on either hybrid or end-to-end models. In hybrid modelling, we explore

Externí odkaz: http://arxiv.org/abs/2101.12729

Zobrazit plný text záznamu

Report

Convolutional Speech Recognition with Pitch and Voice Quality Features

Autor: Cámbara, Guillermo, Luque, Jordi, Farrús, Mireia

The effects of adding pitch and voice quality features such as jitter and shimmer to a state-of-the-art CNN model for Automatic Speech Recognition are studied in this work. Pitch features have been previously used for improving classical HMM and DNN

Externí odkaz: http://arxiv.org/abs/2009.01309

Zobrazit plný text záznamu

Report

Detection of speech events and speaker characteristics through photo-plethysmographic signal neural processing

Autor: Cámbara, Guillermo, Luque, Jordi, Farrús, Mireia

The use of photoplethysmogram signal (PPG) for heart and sleep monitoring is commonly found nowadays in smartphones and wrist wearables. Besides common usages, it has been proposed and reported that person information can be extracted from PPG for ot

Externí odkaz: http://arxiv.org/abs/1911.04808

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání