Showing 1 - 10 of 16 for search: '"Salima Mdhaffar"'
This paper presents a study on the use of federated learning to train an ASR model based on a wav2vec 2.0 model pre-trained by self supervision. Carried out on the well-known TED-LIUM 3 dataset, our experiments show that such a model can obtain, with
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1e328d3a34dbbb67e0451b9427824bfe
Le benchmark MEDIA revisité : données, outils et évaluation dans un contexte d’apprentissage profond
Author:
Gaëlle Laperrière, Valentin Pelloin, Antoine Caubrière, Salima Mdhaffar, Nathalie Camelin, Sahar Ghannay, Bassam Jabaian, Yannick Estève
Published in:
XXXIVe Journées d'Études sur la Parole -- JEP 2022.
JEP 2022, Jun 2022, île de Noirmoutier, France
National audience; Several services integrated into our daily lives use automatic speech recognition (Apple-Siri, Amazon-Alexa...). These services rely on models trained on a large amount of data
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::488074dda5e21679943deabde2e9e10f
http://hdl.handle.net/20.500.12210/80092
Published in:
XXXIVe Journées d'Études sur la Parole -- JEP 2022.
Published in:
ICASSP 2022
ICASSP 2022, 2022, Singapour, Singapore
This paper investigates methods to effectively retrieve speaker information from the personalized speaker adapted neural network acoustic models (AMs) in automatic speech recognition (ASR). This problem is especially important in the context of feder
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0d307046ec338ddc49eb994ffd835b82
https://hal.science/hal-03539742v2/document
Author:
Hang Le, Sina Alisamir, Marco Dinarelli, Fabien Ringeval, Solène Evain, Ha Nguyen, Marcely Zanon Boito, Salima Mdhaffar, Ziyi Tong, Natalia Tomashenko, Titouan Parcollet, Alexandre Allauzen, Yannick Estève, Benjamin Lecouteux, François Portet, Solange Rossato, Didier Schwab, Laurent Besacier
Published in:
HAL
Self-supervised learning has brought remarkable improvements in many fields, such as computer vision or language and speech processing, by exploiting large amounts of unlabeled data. In
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5b45625171543c36fa01b1db28207288
https://hal.archives-ouvertes.fr/hal-03706952
Published in:
IEEE ICASSP 2022
IEEE ICASSP 2022, 2022, Singapour, Singapore
International audience; The widespread availability of powerful personal devices capable of collecting their users' voices has opened the opportunity to build speaker-adapted automatic speech recognition (ASR) systems or to participate in collaborative learning of ASR. In
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6daf7cfd935e69d12da1144313a84aa4
https://hal.science/hal-03539741/document
Published in:
Speech and Computer 23rd International Conference, SPECOM 2021, St. Petersburg, Russia, September 27–30, 2021, Proceedings
SPECOM 2021-23rd International Conference on Speech and Computer
SPECOM 2021-23rd International Conference on Speech and Computer, Sep 2021, St Petersburg, Russia. pp.426-436, ⟨10.1007/978-3-030-87802-3_39⟩
Speech and Computer ISBN: 9783030878016
SPECOM
International audience; This paper investigates different approaches in order to improve the performance of a speech recognition system for a given speaker by using no more than 5 min of speech from this speaker, and without exchanging data from othe
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::89618d1122a008c604d91793198385d6
https://hal.science/hal-03369206/document
Author:
Gaëlle Laperrière, Salima Mdhaffar, Sahar Ghannay, Bassam Jabaian, Antoine Caubrière, Yannick Estève
Published in:
SPECOM 2021 23rd International Conference on Speech and Computer
SPECOM 2021 23rd International Conference on Speech and Computer, Sep 2021, Saint Petersburg, Russia
Speech and Computer ISBN: 9783030878016
SPECOM
Spoken language understanding (SLU) topic has seen a lot of progress these last three years, with the emergence of end-to-end neural approaches. Spoken language understanding refers to natural language processing tasks related to semantic extraction
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::aafa9337fffa958fa34908d411b1629a
https://hal.science/hal-03372494/document
Author:
Laurent Besacier, Marcely Zanon Boito, Solène Evain, Ziyi Tong, Solange Rossato, Yannick Estève, Titouan Parcollet, Marco Dinarelli, Natalia A. Tomashenko, Benjamin Lecouteux, Hang Le, Sina Alisamir, François Portet, Ha Nguyen, Didier Schwab, Salima Mdhaffar, Alexandre Allauzen, Fabien Ringeval
Published in:
INTERSPEECH 2021
INTERSPEECH 2021: Conference of the International Speech Communication Association
INTERSPEECH 2021: Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic
HAL
Self-Supervised Learning (SSL) using huge unlabeled data has been successfully explored for image and natural language processing. Recent works also investigated SSL from speech. They were notably successful to improve performance on downstream tasks
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::613b79ecb8bb6d24c24c38e95fe1281e
http://arxiv.org/abs/2104.11462