Výsledky vyhledávání

Challenges in Audio Processing of Terrorist-Related Data

Autor: Bianca Vieru, Abdel Messaoudi, Jean-Luc Gauvain, J. L. Gauvain, Julien Despres, Viet Bac Le, Lori Lamel, Waad Ben Kheder

Publikováno v: International Conference on Multimedia Modeling
International Conference on Multimedia Modeling, Springer, Jan 2019, Thessaloniki, Greece
MultiMedia Modeling ISBN: 9783030057152
MMM (2)

International audience; Much information in multimedia data related to terrorist activity can be extracted from the audio content. Our work in ongoing projects aims to provide a complete description of the audio portion of multimedia documents. The i

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2cbfbb9bc4f5422b483eda0beb2fb8c5
https://hal.archives-ouvertes.fr/hal-02415176

Zobrazit plný text záznamu

Lexical speaker identification in TV shows

Autor: Anindya Roy, Claude Barras, Viet Bac Le, Jean-Luc Gauvain, William Hartmann, Hervé Bredin

Publikováno v: Multimedia Tools and Applications
Multimedia Tools and Applications, Springer Verlag, 2015, 74 (4), pp.1377-1396. ⟨10.1007/s11042-014-1940-3⟩

The final publication is available at https://link.springer.com/article/10.1007/s11042-014-1940-3; International audience; It is possible to use lexical information extracted from speech transcripts for speaker identification (SID), either on its own

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c04531b35677c95eb43184f93862fb0f
https://doi.org/10.1007/s11042-014-1940-3

Zobrazit plný text záznamu

A Divide-and-Conquer Approach for Language Identification Based on Recurrent Neural Networks

Autor: Jean-Luc Gauvain, Abdelkhalek Messaoudi, Viet Bac Le, Gregory Gelly

Publikováno v: INTERSPEECH

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::d85626560b4857e3a023dc995c55f8ed
https://doi.org/10.21437/interspeech.2016-180

Zobrazit plný text záznamu

Language Recognition for Dialects and Closely Related Languages

Autor: Viet Bac Le, Lori Lamel, Abdel Messaoudi, Antoine Laurent, Jean-Luc Gauvain, Gregory Gelly

Publikováno v: Odyssey 2016
Odyssey 2016, Jun 2016, Bilbao, Spain
Odyssey

International audience

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4f0f15a3c70d989dc60721c24f780fad
https://hal.archives-ouvertes.fr/hal-01744188

Zobrazit plný text záznamu

Improving data selection for low-resource STT and KWS

Autor: Jean-Luc Gauvain, Abdel Messaoudi, Thiago Fraga-Silva, Viet Bac Le, Lori Lamel, Antoine Laurent

Publikováno v: ASRU
ASRU 2015
ASRU 2015, Dec 2015, Scottsdale, United States

This paper extends recent research on training data selection for speech transcription and keyword spotting system development. Selection techniques were explored in the context of the IARPA-Babel Active Learning (AL) task for 6 languages. Different

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::52a78cd829ddc984bc56ebc8d1d49476
https://doi.org/10.1109/asru.2015.7404788

Zobrazit plný text záznamu

Active learning based data selection for limited resource STT and KWS

Autor: Jean-Luc Gauvain, Viet Bac Le, Lori Lamel, Antoine Laurent, Abdelkhalek Messaoudi, Thiago Fraga-Silva

Publikováno v: INTERSPEECH
Interspeech 2015
Interspeech 2015, Sep 2015, Dresden, Germany

This paper presents first results in using active learning (AL) for training data selection in the context of the IARPABabel program. Given an initial training data set, we aim to automatically select additional data (from an untranscribed pool data

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::9d3972d881e8bc49697e1f60e5f9ca8f
https://doi.org/10.21437/interspeech.2015-636

Zobrazit plný text záznamu

Developing STT and KWS systems using limited language resources

Autor: Viet Bac Le, Lori Lamel, Cécile Woehrling, Anindya Roy, Julien Despres, Jean-Luc Gauvain, William Hartmann, Abdelkhalek Messaoudi

Publikováno v: INTERSPEECH

This paper presents recent progress in developing speech-totext (STT) and keyword spotting (KWS) systems for the 2014 IARPA-Babel evaluation. Systems have been developed for the limited language pack condition for four of the five development languag

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::8fec6efd76961329839ac93e027bbe78
https://doi.org/10.21437/interspeech.2014-527

Zobrazit plný text záznamu

Comparing decoding strategies for subword-based keyword spotting in low-resourced languages

Autor: Viet Bac Le, Lori Lamel, Jean-Luc Gauvain, Abdelkhalek Messaoudi, William Hartmann

Publikováno v: INTERSPEECH
Annual Conference of the International Speech Communication Association
Annual Conference of the International Speech Communication Association, ISCA, Sep 2014, Singapore, Singapore

International audience; For languages with limited training resources, out-of-vocabulary (OOV) words are a signiﬁcant problem, both fortranscription and keyword spotting. This paper investigates theuse of subword lexical units for keyword spotting.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d3652ee15719193b901d2db15c348daf
https://doi.org/10.21437/interspeech.2014-528

Zobrazit plný text záznamu

Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification

Autor: Cong-Thanh Do, Viet Bac Le, Claude Barras, Achintya Kumar Sarkar

Publikováno v: IEEE Signal Processing Letters
IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2014, 21 (9), pp.1040-1044. ⟨10.1109/LSP.2014.2323432⟩

International audience; Most speaker recognition systems rely on short-term acoustic cepstral features for extracting the speaker-relevant information from the signal. But phonetic discriminant features, extracted by a bottleneck multi-layer perceptr

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::9b6034d4d51091077cd9937201f7988b
https://hal.archives-ouvertes.fr/hal-01690336

Zobrazit plný text záznamu

Person instance graphs for mono-, cross- and multi-modal person recognition in multimedia data: application to speaker identification in TV broadcast

Autor: Viet Bac Le, Anindya Roy, Hervé Bredin, Claude Barras

Publikováno v: International Journal of Multimedia Information Retrieval
International Journal of Multimedia Information Retrieval, Springer, 2014, 3 (3), pp.161-175. ⟨10.1007/s13735-014-0055-y⟩

The final publication is available at https://link.springer.com/article/10.1007/s13735-014-0055-y; International audience; This work introduces a unified framework for mono-, cross-and multi-modal person recognition in multimedia data. Dubbed Person

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::87aeecccf19eec8f4ed6f27093de96ad
https://hal.archives-ouvertes.fr/hal-01690350/document

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání