Zobrazeno 1 - 10
of 40
pro vyhledávání: '"Viet-Bac Le"'
Autor:
Bianca Vieru, Abdel Messaoudi, Jean-Luc Gauvain, J. L. Gauvain, Julien Despres, Viet Bac Le, Lori Lamel, Waad Ben Kheder
Publikováno v:
International Conference on Multimedia Modeling
International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece
MultiMedia Modeling ISBN: 9783030057152
MMM (2)
International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece
MultiMedia Modeling ISBN: 9783030057152
MMM (2)
International audience; Much information in multimedia data related to terrorist activity can be extracted from the audio content. Our work in ongoing projects aims to provide a complete description of the audio portion of multimedia documents. The i
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2cbfbb9bc4f5422b483eda0beb2fb8c5
https://hal.archives-ouvertes.fr/hal-02415176
https://hal.archives-ouvertes.fr/hal-02415176
Publikováno v:
Multimedia Tools and Applications
Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377-1396. ⟨10.1007/s11042-014-1940-3⟩
Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377-1396. ⟨10.1007/s11042-014-1940-3⟩
The final publication is available at https://link.springer.com/article/10.1007/s11042-014-1940-3; International audience; It is possible to use lexical information extracted from speech transcripts for speaker identification (SID), either on its own
Publikováno v:
INTERSPEECH
Publikováno v:
Odyssey 2016
Odyssey 2016, Jun 2016, Bilbao, Spain
Odyssey
Odyssey 2016, Jun 2016, Bilbao, Spain
Odyssey
International audience
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4f0f15a3c70d989dc60721c24f780fad
https://hal.archives-ouvertes.fr/hal-01744188
https://hal.archives-ouvertes.fr/hal-01744188
Autor:
Jean-Luc Gauvain, Abdel Messaoudi, Thiago Fraga-Silva, Viet Bac Le, Lori Lamel, Antoine Laurent
Publikováno v:
ASRU
ASRU 2015
ASRU 2015, Dec 2015, Scottsdale, United States
ASRU 2015
ASRU 2015, Dec 2015, Scottsdale, United States
This paper extends recent research on training data selection for speech transcription and keyword spotting system development. Selection techniques were explored in the context of the IARPA-Babel Active Learning (AL) task for 6 languages. Different
Autor:
Jean-Luc Gauvain, Viet Bac Le, Lori Lamel, Antoine Laurent, Abdelkhalek Messaoudi, Thiago Fraga-Silva
Publikováno v:
INTERSPEECH
Interspeech 2015
Interspeech 2015, Sep 2015, Dresden, Germany
Interspeech 2015
Interspeech 2015, Sep 2015, Dresden, Germany
This paper presents first results in using active learning (AL) for training data selection in the context of the IARPABabel program. Given an initial training data set, we aim to automatically select additional data (from an untranscribed pool data
Autor:
Viet Bac Le, Lori Lamel, Cécile Woehrling, Anindya Roy, Julien Despres, Jean-Luc Gauvain, William Hartmann, Abdelkhalek Messaoudi
Publikováno v:
INTERSPEECH
This paper presents recent progress in developing speech-totext (STT) and keyword spotting (KWS) systems for the 2014 IARPA-Babel evaluation. Systems have been developed for the limited language pack condition for four of the five development languag
Publikováno v:
INTERSPEECH
Annual Conference of the International Speech Communication Association
Annual Conference of the International Speech Communication Association, ISCA, Sep 2014, Singapore, Singapore
Annual Conference of the International Speech Communication Association
Annual Conference of the International Speech Communication Association, ISCA, Sep 2014, Singapore, Singapore
International audience; For languages with limited training resources, out-of-vocabulary (OOV) words are a significant problem, both fortranscription and keyword spotting. This paper investigates theuse of subword lexical units for keyword spotting.
Publikováno v:
IEEE Signal Processing Letters
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040-1044. ⟨10.1109/LSP.2014.2323432⟩
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040-1044. ⟨10.1109/LSP.2014.2323432⟩
International audience; Most speaker recognition systems rely on short-term acoustic cepstral features for extracting the speaker-relevant information from the signal. But phonetic discriminant features, extracted by a bottleneck multi-layer perceptr
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::9b6034d4d51091077cd9937201f7988b
https://hal.archives-ouvertes.fr/hal-01690336
https://hal.archives-ouvertes.fr/hal-01690336
Publikováno v:
International Journal of Multimedia Information Retrieval
International Journal of Multimedia Information Retrieval, Springer, 2014, 3 (3), pp.161-175. ⟨10.1007/s13735-014-0055-y⟩
International Journal of Multimedia Information Retrieval, Springer, 2014, 3 (3), pp.161-175. ⟨10.1007/s13735-014-0055-y⟩
The final publication is available at https://link.springer.com/article/10.1007/s13735-014-0055-y; International audience; This work introduces a unified framework for mono-, cross-and multi-modal person recognition in multimedia data. Dubbed Person
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::87aeecccf19eec8f4ed6f27093de96ad
https://hal.archives-ouvertes.fr/hal-01690350/document
https://hal.archives-ouvertes.fr/hal-01690350/document