Showing 1 - 10 of 39 for search: '"Mireia Diez"'
Author:
Martin Kocour, Jahnavi Umesh, Martin Karafiat, Ján Švec, Fernando López, Jordi Luque, Karel Beneš, Mireia Diez, Igor Szoke, Karel Veselý, Lukáš Burget, Jan Černocký
Published in:
IberSPEECH 2022.
End-to-end diarization presents an attractive alternative to standard cascaded diarization systems because a single system can handle all aspects of the task at once. Many flavors of end-to-end models have been proposed but all of them require (so fa
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f9909ef392675c87d49e66b250088a02
http://arxiv.org/abs/2211.06750
Published in:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28:355-368
In our previous work, we introduced our Bayesian Hidden Markov Model with eigenvoice priors, which has been recently recognized as the state-of-the-art model for Speaker Diarization. In this article we present a more complete analysis of the Diarizat
Author:
Oldřich Plchot, Mireia Diez, Pavel Matějka, Anna Silnova, Ondřej Glembek, Lukas Burget, Johan Rohdin
Published in:
Computer Speech & Language. 59:22-35
Recently several end-to-end speaker verification systems based on deep neural networks (DNNs) have been proposed. These systems have been proven to be competitive for text-dependent tasks as well as for text-independent tasks with short utterances. H
The recently proposed VBx diarization method uses a Bayesian hidden Markov model to find speaker clusters in a sequence of x-vectors. In this work we perform an extensive comparison of performance of the VBx diarization with other approaches in the l
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::eb987793af3df18b9c97c7fc2eaf971b
http://arxiv.org/abs/2012.14952
Author:
Lukas Burget, Johan Rohdin, Ondrej Glembek, Federico Landini, Mireia Diez, Anna Silnova, Pavel Matejka
Published in:
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
This paper describes the system developed by the BUT team for the fourth track of the VoxCeleb Speaker Recognition Challenge, focusing on diarization on the VoxConverse dataset. The system consists of signal pre-processing, voice activity detection,
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5703904c600ac98f68a352ebe57fd6ce
http://arxiv.org/abs/2010.11718
Optimizing Bayesian HMM Based X-Vector Clustering for the Second DIHARD Speech Diarization Challenge
Published in:
ICASSP
This paper presents an analysis of our diarization system winning the second DIHARD speech diarization challenge, track 1. This system is based on clustering x-vector speaker embeddings extracted every 0.25s from short segments of the input recording
Author:
Lukas Burget, Ondrej Novotny, Johan Rohdin, Oldrich Plchot, Hossein Zeinali, Shuai Wang, Federico Landini, Katerina Zmolikova, Pavel Matejka, Ladislav Mosner, Anna Silnova, Mireia Diez
Published in:
Web of Science
ICASSP
This paper describes the winning systems developed by the BUT team for the four tracks of the Second DIHARD Speech Diarization Challenge. For tracks 1 and 2 the systems were mainly based on performing agglomerative hierarchical clustering (AHC) of x-
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0af6d16427ec61340f4a914616b285a5
http://arxiv.org/abs/2002.11356
Author:
Lukas Burget, Mohamed Dahmane, Gilles Boulianne, Pierre-Luc St-Charles, Oldrich Plchot, Josef Slavícek, Cedric Noiseux, Jahangir Alam, Pavel Matejka, Ondrej Novotný, Marc Lalonde, Themos Stafylakis, Ondrej Glembek, Hossein Zeinali, Johan Rohdin, Mireia Diez Sánchez, Petr Mizera, Anna Silnova, Alicia Lozano-Diez, Shuai Wang, Joao M. Monteiro, Ladislav Mosner
Published in:
Biblos-e Archivo. Repositorio Institucional de la UAM
Universidad Camilo José Cela (UCJC)
Odyssey 2020 The Speaker and Language Recognition Workshop
Odyssey
We present a condensed description and analysis of the joint submission of ABC team for NIST SRE 2019, by BUT, CRIM, Phonexia, Omilia and UAM. We concentrate on challenges that arose during development and we analyze the results obtained on the evalu
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8ab733e54bbc9257b44dee70b4c082f9
http://hdl.handle.net/10486/703093
Published in:
INTERSPEECH