Výsledky vyhledávání

Report

Personalizing Keyword Spotting with Speaker Information

Autor: Labrador, Beltrán, Zhu, Pai, Zhao, Guanlong, Scarpati, Angelo Scorza, Wang, Quan, Lozano-Diez, Alicia, Park, Alex, Moreno, Ignacio López

Keyword spotting systems often struggle to generalize to a diverse population with various accents and age groups. To address this challenge, we propose a novel approach that integrates speaker information into keyword spotting using Feature-wise Lin

Externí odkaz: http://arxiv.org/abs/2311.03419

Zobrazit plný text záznamu

Report

Locale Encoding For Scalable Multilingual Keyword Spotting Models

Autor: Zhu, Pai, Park, Hyun Jin, Park, Alex, Scarpati, Angelo Scorza, Moreno, Ignacio Lopez

A Multilingual Keyword Spotting (KWS) system detects spokenkeywords over multiple locales. Conventional monolingual KWSapproaches do not scale well to multilingual scenarios because ofhigh development/maintenance costs and lack of resource sharing.To

Externí odkaz: http://arxiv.org/abs/2302.12961

Zobrazit plný text záznamu

Report

A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy

Autor: Panchapagesan, Sankaran, Narayanan, Arun, Shabestary, Turaj Zakizadeh, Shao, Shuai, Howard, Nathan, Park, Alex, Walker, James, Gruenstein, Alexander

Acoustic Echo Cancellation (AEC) is essential for accurate recognition of queries spoken to a smart speaker that is playing out audio. Previous work has shown that a neural AEC model operating on log-mel spectral features (denoted "logmel" hereafter)

Externí odkaz: http://arxiv.org/abs/2205.03481

Zobrazit plný text záznamu

Report

Production federated keyword spotting via distillation, filtering, and joint federated-centralized training

Autor: Hard, Andrew, Partridge, Kurt, Chen, Neng, Augenstein, Sean, Shah, Aishanee, Park, Hyun Jin, Park, Alex, Ng, Sara, Nguyen, Jessica, Moreno, Ignacio Lopez, Mathews, Rajiv, Beaufays, Françoise

We trained a keyword spotting model using federated learning on real user devices and observed significant improvements when the model was deployed for inference on phones. To compensate for data domains that are missing from on-device training cache

Externí odkaz: http://arxiv.org/abs/2204.06322

Zobrazit plný text záznamu

Report

A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation

Autor: O'Malley, Tom, Narayanan, Arun, Wang, Quan, Park, Alex, Walker, James, Howard, Nathan

We present a frontend for improving robustness of automatic speech recognition (ASR), that jointly implements three modules within a single model: acoustic echo cancellation, speech enhancement, and speech separation. This is achieved by using a cont

Externí odkaz: http://arxiv.org/abs/2111.09935

Zobrazit plný text záznamu

Report

A Neural Acoustic Echo Canceller Optimized Using An Automatic Speech Recognizer And Large Scale Synthetic Data

Autor: Howard, Nathan, Park, Alex, Shabestary, Turaj Zakizadeh, Gruenstein, Alexander, Prabhavalkar, Rohit

We consider the problem of recognizing speech utterances spoken to a device which is generating a known sound waveform; for example, recognizing queries issued to a digital assistant which is generating responses to previous user inputs. Previous wor

Externí odkaz: http://arxiv.org/abs/2106.00856

Zobrazit plný text záznamu

Dissertation/ Thesis

Characterization of a novel class of anti-HCV agents targeting protein-protein interactions

Autor: Park, Alex

Le virus de l’hépatite C (VHC) est un agent causateur de maladies du foie important responsable d’une pandémie affectant près de 180 millions d’individus mondialement. L’absence de symptômes dans les premières années d’infection entra

Externí odkaz: http://hdl.handle.net/1866/18893

Zobrazit plný text záznamu

Akademický článek

To prescribe or not to prescribe? A review of the Prescribing Competencies Framework for dentistry

Autor: Teoh, Leanne, Park, Alex, Moses, Geraldine, McCullough, Michael, Page, Amy

Publikováno v: In Journal of Dentistry August 2023

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Dissertation/ Thesis

Unsupervised pattern discovery in speech : applications to word acquisition and speaker segmentation

Autor: Park, Alex S. (Alex Seungryong), 1979

Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2007.
Includes bibliographical references (p. 167-176).
We present a novel approach to speech processing based on the pri

Externí odkaz: http://hdl.handle.net/1721.1/38684

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání