Zobrazeno 1 - 10
of 209
pro vyhledávání: '"Speaker Segmentation"'
Autor:
Beatriz Martínez-González, José M. Pardo, José A. Vallejo-Pinto, Rubén San-Segundo, Javier Ferreiros
Publikováno v:
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2021, Iss 1, Pp 1-24 (2021)
Abstract There has been little work in the literature on the speaker diarization of meetings with multiple distance microphones since the publications in 2012 related to the last National Institute of Standards (NIST) Rich Transcription Evaluation Ca
Externí odkaz:
https://doaj.org/article/d4b1850459994af3a076120942454592
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Restricted Boltzmann Machine Vectors for Speaker Clustering and Tracking Tasks in TV Broadcast Shows
Publikováno v:
Applied Sciences, Vol 9, Iss 13, p 2761 (2019)
Restricted Boltzmann Machines (RBMs) have shown success in both the front-end and backend of speaker verification systems. In this paper, we propose applying RBMs to the front-end for the tasks of speaker clustering and speaker tracking in TV broadca
Externí odkaz:
https://doaj.org/article/df33a2a4d89248db982c4cb2f461c69c
Publikováno v:
ICASSP
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
An essential part of any diarization system is the task of speaker segmentation which is important for many applications including speaker indexing and automatic speech recognition (ASR) in multi-speaker environments. Segmentation of overlapping spee
Autor:
Antoine Laurent, Hervé Bredin
Publikováno v:
Interspeech 2021
Interspeech 2021, Aug 2021, Brno, Czech Republic
Interspeech 2021, Aug 2021, Brno, Czech Republic
Speaker segmentation consists in partitioning a conversation between one or more speakers into speaker turns. Usually addressed as the late combination of three sub-tasks (voice activity detection, speaker change detection, and overlapped speech dete
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2effb03a1a9f8d98483795fa647237ca
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Publikováno v:
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
WASPAA
WASPAA
Speaker segmentation is an essential part of any diarization system. Applications of diarization include tasks such as speaker indexing, improving automatic speech recognition (ASR) performance and making single speaker-based algorithms available for
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::03dcafbdc14326a1508cc0a596ea9c13
http://hdl.handle.net/10044/1/72344
http://hdl.handle.net/10044/1/72344
Restricted Boltzmann Machine Vectors for Speaker Clustering and Tracking Tasks in TV Broadcast Shows
Publikováno v:
Applied Sciences, Vol 9, Iss 13, p 2761 (2019)
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Applied Sciences
Volume 9
Issue 13
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Applied Sciences
Volume 9
Issue 13
Restricted Boltzmann Machines (RBMs) have shown success in both the front-end and backend of speaker verification systems. In this paper, we propose applying RBMs to the front-end for the tasks of speaker clustering and speaker tracking in TV broadca
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Autor:
India Massana, Miquel Àngel, Rodríguez Fonollosa, José Adrián, Hernando Pericás, Francisco Javier
Publikováno v:
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Recercat. Dipósit de la Recerca de Catalunya
instname
Universitat Politècnica de Catalunya (UPC)
Recercat. Dipósit de la Recerca de Catalunya
instname
This paper presents a new speaker change detection system based on Long Short-Term Memory (LSTM) neural networks using acoustic data and linguistic content. Language modelling is combined with two different Joint Factor Analysis (JFA) acoustic approa
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::a1419b1250c6c9c4cf80d9e670247a78
http://hdl.handle.net/2117/112988
http://hdl.handle.net/2117/112988