Výsledky vyhledávání - "Speaker Segmentation"

Akademický článek

Analysis of transition cost and model parameters in speaker diarization for meetings

Autor: Beatriz Martínez-González, José M. Pardo, José A. Vallejo-Pinto, Rubén San-Segundo, Javier Ferreiros

Publikováno v: EURASIP Journal on Audio, Speech, and Music Processing, Vol 2021, Iss 1, Pp 1-24 (2021)

Abstract There has been little work in the literature on the speaker diarization of meetings with multiple distance microphones since the publications in 2012 related to the last National Institute of Standards (NIST) Rich Transcription Evaluation Ca

Externí odkaz: https://doaj.org/article/d4b1850459994af3a076120942454592

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Restricted Boltzmann Machine Vectors for Speaker Clustering and Tracking Tasks in TV Broadcast Shows

Autor: Umair Khan, Pooyan Safari, Javier Hernando

Publikováno v: Applied Sciences, Vol 9, Iss 13, p 2761 (2019)

Restricted Boltzmann Machines (RBMs) have shown success in both the front-end and backend of speaker verification systems. In this paper, we propose applying RBMs to the front-end for the tasks of speaker clustering and speaker tracking in TV broadca

Externí odkaz: https://doaj.org/article/df33a2a4d89248db982c4cb2f461c69c

Zobrazit plný text záznamu

Multichannel Overlapping Speaker Segmentation Using Multiple Hypothesis Tracking Of Acoustic And Spatial Features

Autor: Patrick A. Naylor, Aidan O. T. Hogg, Christine Evers

Publikováno v: ICASSP
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

An essential part of any diarization system is the task of speaker segmentation which is important for many applications including speaker indexing and automatic speech recognition (ASR) in multi-speaker environments. Segmentation of overlapping spee

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7a397d8ff4f605c677188b429fa31d66
https://doi.org/10.1109/icassp39728.2021.9414130

Zobrazit plný text záznamu

End-to-end speaker segmentation for overlap-aware resegmentation

Autor: Antoine Laurent, Hervé Bredin

Publikováno v: Interspeech 2021
Interspeech 2021, Aug 2021, Brno, Czech Republic

Speaker segmentation consists in partitioning a conversation between one or more speakers into speaker turns. Usually addressed as the late combination of three sub-tasks (voice activity detection, speaker change detection, and overlapped speech dete

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2effb03a1a9f8d98483795fa647237ca

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Multiple hypothesis tracking for overlapping speaker segmentation

Autor: Aidan O. T. Hogg, Christine Evers, Patrick A. Naylor

Publikováno v: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)
WASPAA

Speaker segmentation is an essential part of any diarization system. Applications of diarization include tasks such as speaker indexing, improving automatic speech recognition (ASR) performance and making single speaker-based algorithms available for

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::03dcafbdc14326a1508cc0a596ea9c13
http://hdl.handle.net/10044/1/72344

Zobrazit plný text záznamu

Restricted Boltzmann Machine Vectors for Speaker Clustering and Tracking Tasks in TV Broadcast Shows

Autor: Pooyan Safari, Umair Khan, Javier Hernando

Publikováno v: Applied Sciences, Vol 9, Iss 13, p 2761 (2019)
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Applied Sciences
Volume 9
Issue 13

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d0677cf54d6bcb5c449493ba61bfd572
https://doi.org/10.3390/app9132761

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

LSTM neural network-based speaker segmentation using acoustic and language modelling

Autor: India Massana, Miquel Àngel, Rodríguez Fonollosa, José Adrián, Hernando Pericás, Francisco Javier

Publikováno v: UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Recercat. Dipósit de la Recerca de Catalunya
instname

This paper presents a new speaker change detection system based on Long Short-Term Memory (LSTM) neural networks using acoustic data and linguistic content. Language modelling is combined with two different Joint Factor Analysis (JFA) acoustic approa

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::a1419b1250c6c9c4cf80d9e670247a78
http://hdl.handle.net/2117/112988

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání