Showing 1 - 10 of 33 for search: '"Nigmatulina, Iuliia"'
Authors:
Kumar, Shashi, Madikeri, Srikanth, Zuluaga-Gomez, Juan, Nigmatulina, Iuliia, Villatoro-Tello, Esaú, Burdisso, Sergio, Motlicek, Petr, Pandia, Karthik, Ganapathiraju, Aravind
In traditional conversational intelligence from speech, a cascaded pipeline is used, involving tasks such as voice activity detection, diarization, transcription, and subsequent processing with different NLP models for tasks like semantic endpointing
External link:
http://arxiv.org/abs/2407.04444
Authors:
Kumar, Shashi, Madikeri, Srikanth, Zuluaga-Gomez, Juan, Villatoro-Tello, Esaú, Nigmatulina, Iuliia, Motlicek, Petr, E, Manjunath K, Ganapathiraju, Aravind
Self-supervised pretrained models exhibit competitive performance in automatic speech recognition on finetuning, even with limited in-domain supervised data for training. However, popular pretrained models are not suitable for streaming ASR because t
External link:
http://arxiv.org/abs/2407.04439
Authors:
Nigmatulina, Iuliia, Madikeri, Srikanth, Villatoro-Tello, Esaú, Motliček, Petr, Zuluaga-Gomez, Juan, Pandia, Karthik, Ganapathiraju, Aravind
GPU decoding significantly accelerates the output of ASR predictions. While GPUs are already being used for online ASR decoding, post-processing and rescoring on GPUs have not been properly investigated yet. Rescoring with available contextual inform
External link:
http://arxiv.org/abs/2306.15685
Authors:
Zuluaga-Gomez, Juan, Nigmatulina, Iuliia, Prasad, Amrutha, Motlicek, Petr, Khalil, Driss, Madikeri, Srikanth, Tart, Allan, Szoke, Igor, Lenders, Vincent, Rigault, Mickael, Choukri, Khalid
Voice communication between air traffic controllers (ATCos) and pilots is critical for ensuring safe and efficient air traffic control (ATC). This task requires high levels of awareness from ATCos and can be tedious and error-prone. Recent attempts h
External link:
http://arxiv.org/abs/2305.01155
Authors:
Zuluaga-Gomez, Juan, Prasad, Amrutha, Nigmatulina, Iuliia, Motlicek, Petr, Kleinert, Matthias
In this paper we propose a novel virtual simulation-pilot engine for speeding up air traffic controller (ATCo) training by integrating different state-of-the-art artificial intelligence (AI) based tools. The virtual simulation-pilot engine receives s
External link:
http://arxiv.org/abs/2304.07842
Authors:
Villatoro-Tello, Esaú, Madikeri, Srikanth, Zuluaga-Gomez, Juan, Sharma, Bidisha, Sarfjoo, Seyyed Saeed, Nigmatulina, Iuliia, Motlicek, Petr, Ivanov, Alexei V., Ganapathiraju, Aravind
Published in:
ICASSP 2023
In this paper, we perform an exhaustive evaluation of different representations to address the intent classification problem in a Spoken Language Understanding (SLU) setup. We benchmark three types of systems to perform the SLU intent detection task:
External link:
http://arxiv.org/abs/2212.08489
Authors:
Prasad, Amrutha, Zuluaga-Gomez, Juan, Motlicek, Petr, Sarfjoo, Saeed, Nigmatulina, Iuliia, Vesely, Karel
This paper describes a simple yet efficient repetition-based modular system for speeding up air-traffic controllers (ATCos) training. E.g., a human pilot is still required in EUROCONTROL's ESCAPE lite simulator (see https://www.eurocontrol.int/simula
External link:
http://arxiv.org/abs/2212.07164
Authors:
Zuluaga-Gomez, Juan, Veselý, Karel, Szöke, Igor, Blatt, Alexander, Motlicek, Petr, Kocour, Martin, Rigault, Mickael, Choukri, Khalid, Prasad, Amrutha, Sarfjoo, Seyyed Saeed, Nigmatulina, Iuliia, Cevenini, Claudia, Kolčárek, Pavel, Tart, Allan, Černocký, Jan, Klakow, Dietrich
Personal assistants, automatic speech recognizers and dialogue understanding systems are becoming more critical in our interconnected digital world. A clear example is air traffic control (ATC) communications. ATC aims at guiding aircraft and control
External link:
http://arxiv.org/abs/2211.04054
Authors:
Zuluaga-Gomez, Juan, Prasad, Amrutha, Nigmatulina, Iuliia, Sarfjoo, Saeed, Motlicek, Petr, Kleinert, Matthias, Helmke, Hartmut, Ohneiser, Oliver, Zhan, Qingran
Recent work on self-supervised pre-training focuses on leveraging large-scale unlabeled speech data to build robust end-to-end (E2E) acoustic models (AM) that can be later fine-tuned on downstream tasks, e.g., automatic speech recognition (ASR). Yet, fe
External link:
http://arxiv.org/abs/2203.16822
Authors:
Nigmatulina, Iuliia, Zuluaga-Gomez, Juan, Prasad, Amrutha, Sarfjoo, Seyyed Saeed, Motlicek, Petr
Published in:
ICASSP 2022
Automatic Speech Recognition (ASR), as an aid to speech communication between pilots and air-traffic controllers, can significantly reduce the complexity of the task and increase the reliability of transmitted information. ASR application can
External link:
http://arxiv.org/abs/2202.03725