Showing 1 - 10 of 72 for search: '"Xavier Anguera"'
Published in:
SLaTE
Published in:
IEEE/ACM Transactions on Audio, Speech and Language Processing
IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2015, 23 (12), pp.2286-2297. ⟨10.1109/TASLP.2015.2479043⟩
Speaker diarization has become a key process within other speech processing systems which take advantage of single-speaker speech signals. Furthermore, finding recurrent speakers among a set of audio recordings, known as cross
Author:
Julien Karadayi, Xuan Nga Cao, Mathieu Bernard, Juan Benjumea, Ewan Dunbar, Emmanuel Dupoux, Xavier Anguera, Laurent Besacier
Published in:
IEEE Automatic Speech Recognition and Understanding (ASRU)
IEEE Automatic Speech Recognition and Understanding (ASRU), Dec 2017, Okinawa, Japan
ASRU 2017
ASRU 2017, Dec 2017, Okinawa, Japan
HAL
ASRU
We describe a new challenge aimed at discovering subword and word units from raw speech. This challenge is the followup to the Zero Resource Speech Challenge 2015. It aims at constructing systems that generalize across languages and adapt to new spea
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a664ae3878f2a8630847dc5b3863227d
Author:
Oriol Vinyals, Gerald Friedland, Adam Janin, Luke Gottlieb, Marijn Huijbregts, Mary Tai Knox, David Imseng, Xavier Anguera Miro
Published in:
IEEE Transactions on Audio, Speech, and Language Processing, 20, 2, pp. 371-381
IEEE Transactions on Audio Speech and Language Processing
IEEE Transactions on Audio, Speech, and Language Processing, 20, 371-381
The speaker diarization system developed at the International Computer Science Institute (ICSI) has played a prominent role in the speaker diarization community, and many researchers in the rich transcription community have adopted methods and techni
Author:
Oriol Vinyals, Corinne Fredouille, Gerald Friedland, Xavier Anguera Miro, Nicholas Evans, Simon Bozonnet
Published in:
IEEE transactions on acoustics, speech, and signal processing
IEEE transactions on acoustics, speech, and signal processing, Institute of Electrical and Electronics Engineers (IEEE), 2010, pp.1
Speaker diarization is the task of determining "who spoke when?" in an audio or video recording that contains an unknown amount of speech and also an unknown number of speakers. Initially, it was proposed as a research topic r
Published in:
IEEE Transactions on Computers. 56:1212-1224
Human-machine interaction in meetings requires the localization and identification of the speakers interacting with the system as well as the recognition of the words spoken. A seminal step toward this goal is the field of rich transcription research
Published in:
IEEE Transactions on Audio, Speech and Language Processing. 15:2011-2022
When performing speaker diarization on recordings from meetings, multiple microphones of different qualities are usually available and distributed around the meeting room. Although several approaches have been proposed in recent years to take advanta
Author:
Aren Jansen, Thomas Schatz, Xavier Anguera, Maarten Versteegh, Emmanuel Dupoux, Xuan Nga Cao, Roland Thiolliere
Published in:
INTERSPEECH
The Interspeech 2015 Zero Resource Speech Challenge aims at discovering subword and word units from raw speech. The challenge provides the first unified and open source suite of evaluation metrics and data sets to compare and analyse the results of u
Published in:
Scopus-Elsevier
INTERSPEECH
Customer center call data is typically collected by organizations and corporations in order to improve customer experience through the analysis of such call data. In this paper, we report our findings when analysing more than 26 thousand calls to the
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b2ebdaee34086a382033d4d974d54805
Published in:
2015 23rd European Signal Processing Conference (EUSIPCO)
2015 23rd European Signal Processing Conference (EUSIPCO), Aug 2015, Nice, France. pp.2087-2091
EUSIPCO
The recently proposed speaker diarization technique based on binary keys provides a very fast alternative to state-of-the-art systems. However, this speed up has the cost of a little increase in Diarization Error Rate (DER). This paper proposes a ser
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::637a653a112597d1a811a7ba4a346e74
https://hal.archives-ouvertes.fr/hal-02102796