Zobrazeno 1 - 10
of 476
pro vyhledávání: '"Hautamaki, A."'
Autor:
Singh, Vishwanath Pratap, Malato, Federico, Hautamaki, Ville, Sahidullah, Md., Kinnunen, Tomi
Publikováno v:
Interspeech 2024
While automatic speech recognition (ASR) greatly benefits from data augmentation, the augmentation recipes themselves tend to be heuristic. In this paper, we address one of the heuristic approach associated with balancing the right amount of augmente
Externí odkaz:
http://arxiv.org/abs/2406.09999
Autor:
Malato, Federico, Hautamaki, Ville
Imitation learning enables autonomous agents to learn from human examples, without the need for a reward signal. Still, if the provided dataset does not encapsulate the task correctly, or when the task is too complex to be modeled, such agents fail t
Externí odkaz:
http://arxiv.org/abs/2406.04913
Behavioral cloning uses a dataset of demonstrations to learn a policy. To overcome computationally expensive training procedures and address the policy adaptation problem, we propose to use latent spaces of pre-trained foundation models to index a de
Externí odkaz:
http://arxiv.org/abs/2401.16398
Behavioural cloning uses a dataset of demonstrations to learn a behavioural policy. To overcome various learning and policy adaptation problems, we propose to use latent space to index a demonstration dataset, instantly access similar relevant experi
Externí odkaz:
http://arxiv.org/abs/2306.09082
Speech enhancement aims to improve the perceptual quality of the speech signal by suppression of the background noise. However, excessive suppression may lead to speech distortion and speaker information loss, which degrades the performance of speake
Externí odkaz:
http://arxiv.org/abs/2110.00940
VoxCeleb datasets are widely used in speaker recognition studies. Our work serves two purposes. First, we provide speaker age labels and (an alternative) annotation of speaker gender. Second, we demonstrate the use of this metadata by constructing ag
Externí odkaz:
http://arxiv.org/abs/2109.13510
Autor:
Long T. Nguyen, Nicolas C. Macaluso, Noah R. Rakestraw, Dylan R. Carman, Brianna L.M. Pizzano, Raymond C. Hautamaki, Santosh R. Rananaware, Isabel E. Roberts, Piyush K. Jain
Publikováno v:
Cell Reports, Vol 43, Iss 2, Pp 113777- (2024)
Summary: There is a broad diversity among Cas12a endonucleases that possess nucleic acid detection and gene-editing capabilities, but few are studied extensively. Here, we present an exhaustive investigation of 23 Cas12a orthologs, with a focus on th
Externí odkaz:
https://doaj.org/article/fa7c464d82e7437d8bba5ed5f812778d
Autor:
Nguyen, Long T., Macaluso, Nicolas C., Rakestraw, Noah R., Carman, Dylan R., Pizzano, Brianna L.M., Hautamaki, Raymond C., Rananaware, Santosh R., Roberts, Isabel E., Jain, Piyush K.
Publikováno v:
In Cell Reports 27 February 2024 43(2)
In recent years, transformer models have achieved great success in natural language processing (NLP) tasks. Most of the current state-of-the-art NLP results are achieved by using monolingual transformer models, where the model is pre-trained using a
Externí odkaz:
http://arxiv.org/abs/2006.07698
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.