Showing 1 - 10
of 54
for search: '"Ronan Collobert"'
Published in:
PLoS ONE, Vol 4, Iss 7, p e6393 (2009)
To reduce the increasing amount of time spent on literature search in the life sciences, several methods for automated knowledge extraction have been developed. Co-occurrence based approaches can deal with large text corpora like MEDLINE in an accept…
External link:
https://doaj.org/article/e00ebc70e9c548a597ec3f637edeff70
In this paper, we study training of automatic speech recognition systems in a weakly supervised setting where the order of words in transcript labels of the audio training data is not known. We train a word-level acoustic model which aggregates the di…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::33f3599023480c64f7a523b9cb6b2442
http://arxiv.org/abs/2110.05994
Author:
Vineel Pratap, Michael Auli, Ann B. Lee, Tatiana Likhomanenko, Gabriel Synnaeve, Alexei Baevski, Ronan Collobert, Anuroop Sriram, Qiantong Xu, Jacob Kahn, Wei-Ning Hsu
Published in:
Interspeech 2021.
Self-supervised learning of speech representations has been a very active research area, but most work is focused on a single domain, such as read audio books, for which there exist large quantities of labeled and unlabeled data. In this paper, we explo…
Author:
Michael Auli, Alexis Conneau, Tatiana Likhomanenko, Qiantong Xu, Paden Tomasello, Gabriel Synnaeve, Alexei Baevski, Ronan Collobert
Published in:
ICASSP
Self-training and unsupervised pre-training have emerged as effective approaches to improve speech recognition systems using unlabeled data. However, it is not clear whether they learn similar patterns or if they can be effectively combined. In this…
Published in:
ICASSP
Self-supervised learning (SSL) has shown promise in learning representations of audio that are useful for automatic speech recognition (ASR). But training SSL models like wav2vec 2.0 requires a two-stage pipeline. In this paper we demonstrate a sing…
Published in:
Speech Communication. 108:15-32
In a hidden Markov model (HMM) based automatic speech recognition (ASR) system, modeling the statistical relationship between the acoustic speech signal and the HMM states that represent linguistically motivated subword units such as phonemes is a cruc…
Author:
Jacob Kahn, Gabriel Synnaeve, Tatiana Likhomanenko, Qiantong Xu, Ronan Collobert, Awni Hannun
Published in:
INTERSPEECH
Pseudo-labeling has recently shown promise in end-to-end automatic speech recognition (ASR). We study Iterative Pseudo-Labeling (IPL), a semi-supervised algorithm which efficiently performs multiple iterations of pseudo-labeling on unlabeled data as…
Published in:
INTERSPEECH
This paper introduces the Multilingual LibriSpeech (MLS) dataset, a large multilingual corpus suitable for speech research. The dataset is derived from read audiobooks from LibriVox and consists of 8 languages, including about 44.5K hours of English and…
Recent results in end-to-end automatic speech recognition have demonstrated the efficacy of pseudo-labeling for semi-supervised models trained both with Connectionist Temporal Classification (CTC) and Sequence-to-Sequence (seq2seq) losses. Iterative…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e81380b3c44b677c1551436c9a32d51b
http://arxiv.org/abs/2010.11524
Author:
Ronan Collobert, Gabriel Synnaeve, Paden Tomasello, Vitaliy Liptchinsky, Awni Hannun, Vineel Pratap, Anuroop Sriram
Published in:
INTERSPEECH
We study training a single acoustic model for multiple languages with the aim of improving automatic speech recognition (ASR) performance on low-resource languages, and overall simplifying deployment of ASR systems that support diverse languages. We…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ef82d1005bea91eee550ee2ee3cd9637
http://arxiv.org/abs/2007.03001