Výsledky vyhledávání - "Nigmatulina, Iuliia"

Report

TokenVerse: Unifying Speech and NLP Tasks via Transducer-based ASR

Autor: Kumar, Shashi, Madikeri, Srikanth, Zuluaga-Gomez, Juan, Nigmatulina, Iuliia, Villatoro-Tello, Esaú, Burdisso, Sergio, Motlicek, Petr, Pandia, Karthik, Ganapathiraju, Aravind

In traditional conversational intelligence from speech, a cascaded pipeline is used, involving tasks such as voice activity detection, diarization, transcription, and subsequent processing with different NLP models for tasks like semantic endpointing

Externí odkaz: http://arxiv.org/abs/2407.04444

Zobrazit plný text záznamu

Report

XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models

Autor: Kumar, Shashi, Madikeri, Srikanth, Zuluaga-Gomez, Juan, Villatoro-Tello, Esaú, Nigmatulina, Iuliia, Motlicek, Petr, E, Manjunath K, Ganapathiraju, Aravind

Self-supervised pretrained models exhibit competitive performance in automatic speech recognition on finetuning, even with limited in-domain supervised data for training. However, popular pretrained models are not suitable for streaming ASR because t

Externí odkaz: http://arxiv.org/abs/2407.04439

Zobrazit plný text záznamu

Report

Implementing contextual biasing in GPU decoder for online ASR

Autor: Nigmatulina, Iuliia, Madikeri, Srikanth, Villatoro-Tello, Esaú, Motliček, Petr, Zuluaga-Gomez, Juan, Pandia, Karthik, Ganapathiraju, Aravind

GPU decoding significantly accelerates the output of ASR predictions. While GPUs are already being used for online ASR decoding, post-processing and rescoring on GPUs have not been properly investigated yet. Rescoring with available contextual inform

Externí odkaz: http://arxiv.org/abs/2306.15685

Zobrazit plný text záznamu

Report

Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding

Autor: Zuluaga-Gomez, Juan, Nigmatulina, Iuliia, Prasad, Amrutha, Motlicek, Petr, Khalil, Driss, Madikeri, Srikanth, Tart, Allan, Szoke, Igor, Lenders, Vincent, Rigault, Mickael, Choukri, Khalid

Voice communication between air traffic controllers (ATCos) and pilots is critical for ensuring safe and efficient air traffic control (ATC). This task requires high levels of awareness from ATCos and can be tedious and error-prone. Recent attempts h

Externí odkaz: http://arxiv.org/abs/2305.01155

Zobrazit plný text záznamu

Report

A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers

Autor: Zuluaga-Gomez, Juan, Prasad, Amrutha, Nigmatulina, Iuliia, Motlicek, Petr, Kleinert, Matthias

In this paper we propose a novel virtual simulation-pilot engine for speeding up air traffic controller (ATCo) training by integrating different state-of-the-art artificial intelligence (AI) based tools. The virtual simulation-pilot engine receives s

Externí odkaz: http://arxiv.org/abs/2304.07842

Zobrazit plný text záznamu

Report

Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks

Autor: Villatoro-Tello, Esaú, Madikeri, Srikanth, Zuluaga-Gomez, Juan, Sharma, Bidisha, Sarfjoo, Seyyed Saeed, Nigmatulina, Iuliia, Motlicek, Petr, Ivanov, Alexei V., Ganapathiraju, Aravind

Publikováno v: ICASSP 2023

In this paper, we perform an exhaustive evaluation of different representations to address the intent classification problem in a Spoken Language Understanding (SLU) setup. We benchmark three types of systems to perform the SLU intent detection task:

Externí odkaz: http://arxiv.org/abs/2212.08489

Zobrazit plný text záznamu

Report

Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator

Autor: Prasad, Amrutha, Zuluaga-Gomez, Juan, Motlicek, Petr, Sarfjoo, Saeed, Nigmatulina, Iuliia, Vesely, Karel

This paper describes a simple yet efficient repetition-based modular system for speeding up air-traffic controllers (ATCos) training. E.g., a human pilot is still required in EUROCONTROL's ESCAPE lite simulator (see https://www.eurocontrol.int/simula

Externí odkaz: http://arxiv.org/abs/2212.07164

Zobrazit plný text záznamu

Report

ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

Autor: Zuluaga-Gomez, Juan, Veselý, Karel, Szöke, Igor, Blatt, Alexander, Motlicek, Petr, Kocour, Martin, Rigault, Mickael, Choukri, Khalid, Prasad, Amrutha, Sarfjoo, Seyyed Saeed, Nigmatulina, Iuliia, Cevenini, Claudia, Kolčárek, Pavel, Tart, Allan, Černocký, Jan, Klakow, Dietrich

Personal assistants, automatic speech recognizers and dialogue understanding systems are becoming more critical in our interconnected digital world. A clear example is air traffic control (ATC) communications. ATC aims at guiding aircraft and control

Externí odkaz: http://arxiv.org/abs/2211.04054

Zobrazit plný text záznamu

Report

How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications

Autor: Zuluaga-Gomez, Juan, Prasad, Amrutha, Nigmatulina, Iuliia, Sarfjoo, Saeed, Motlicek, Petr, Kleinert, Matthias, Helmke, Hartmut, Ohneiser, Oliver, Zhan, Qingran

Recent work on self-supervised pre-training focus on leveraging large-scale unlabeled speech data to build robust end-to-end (E2E) acoustic models (AM) that can be later fine-tuned on downstream tasks e.g., automatic speech recognition (ASR). Yet, fe

Externí odkaz: http://arxiv.org/abs/2203.16822

Zobrazit plný text záznamu

Report

A two-step approach to leverage contextual data: speech recognition in air-traffic communications

Autor: Nigmatulina, Iuliia, Zuluaga-Gomez, Juan, Prasad, Amrutha, Sarfjoo, Seyyed Saeed, Motlicek, Petr

Publikováno v: ICASSP 2022

Automatic Speech Recognition (ASR), as the assistance of speech communication between pilots and air-traffic controllers, can significantly reduce the complexity of the task and increase the reliability of transmitted information. ASR application can

Externí odkaz: http://arxiv.org/abs/2202.03725

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání