Showing 1 - 10 of 140 for search: '"Park, Alex"'
Author:
Labrador, Beltrán, Zhu, Pai, Zhao, Guanlong, Scarpati, Angelo Scorza, Wang, Quan, Lozano-Diez, Alicia, Park, Alex, Moreno, Ignacio López
Keyword spotting systems often struggle to generalize to a diverse population with various accents and age groups. To address this challenge, we propose a novel approach that integrates speaker information into keyword spotting using Feature-wise Lin
External link:
http://arxiv.org/abs/2311.03419
A Multilingual Keyword Spotting (KWS) system detects spoken keywords over multiple locales. Conventional monolingual KWS approaches do not scale well to multilingual scenarios because of high development/maintenance costs and lack of resource sharing. To
External link:
http://arxiv.org/abs/2302.12961
Author:
Panchapagesan, Sankaran, Narayanan, Arun, Shabestary, Turaj Zakizadeh, Shao, Shuai, Howard, Nathan, Park, Alex, Walker, James, Gruenstein, Alexander
Acoustic Echo Cancellation (AEC) is essential for accurate recognition of queries spoken to a smart speaker that is playing out audio. Previous work has shown that a neural AEC model operating on log-mel spectral features (denoted "logmel" hereafter)
External link:
http://arxiv.org/abs/2205.03481
Author:
Hard, Andrew, Partridge, Kurt, Chen, Neng, Augenstein, Sean, Shah, Aishanee, Park, Hyun Jin, Park, Alex, Ng, Sara, Nguyen, Jessica, Moreno, Ignacio Lopez, Mathews, Rajiv, Beaufays, Françoise
We trained a keyword spotting model using federated learning on real user devices and observed significant improvements when the model was deployed for inference on phones. To compensate for data domains that are missing from on-device training cache
External link:
http://arxiv.org/abs/2204.06322
We present a frontend for improving robustness of automatic speech recognition (ASR), that jointly implements three modules within a single model: acoustic echo cancellation, speech enhancement, and speech separation. This is achieved by using a cont
External link:
http://arxiv.org/abs/2111.09935
Author:
Howard, Nathan, Park, Alex, Shabestary, Turaj Zakizadeh, Gruenstein, Alexander, Prabhavalkar, Rohit
We consider the problem of recognizing speech utterances spoken to a device which is generating a known sound waveform; for example, recognizing queries issued to a digital assistant which is generating responses to previous user inputs. Previous wor
External link:
http://arxiv.org/abs/2106.00856
Author:
Park, Alex
The hepatitis C virus (HCV) is a major causative agent of liver disease, responsible for a pandemic affecting nearly 180 million individuals worldwide. The absence of symptoms in the first years of infection
External link:
http://hdl.handle.net/1866/18893
Published in:
In Journal of Dentistry August 2023
Academic article
Unsupervised pattern discovery in speech : applications to word acquisition and speaker segmentation
Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2007.
Includes bibliographical references (p. 167-176).
We present a novel approach to speech processing based on the pri
External link:
http://hdl.handle.net/1721.1/38684