Applying EEND Diarization to Telephone Recordings from a Call Center

Autor: Marie Kunešová, Luděk Müller, Zbyněk Zajíc
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: Speech and Computer ISBN: 9783030878016
SPECOM
Popis: In this paper, we focus on the issue of speaker diarization of data from a real call center. We have previously proposed a specialized solution to the problem, which employed additional knowledge about the identities of the phone operators (in our case, the language counselors from the Language Consulting Center), thus improving performance over the baseline. But a recent end-to-end diarization method, EEND, has since proven very successful on other data and was shown to surpass the previous state of the art in the field. Thus, we chose to compare this new method with our own previous approach. Using an existing implementation of the EEND method (adapted using a small amount of the target data from the Language Consulting Center), we successfully surpass the performance of our previous approach (17.42% vs. 19.39% DER), without the need for any additional information about speaker identities. The majority of the remaining diarization error of the EEND system is due to incorrect decisions between speech and silence, rather than speaker confusion. For comparison, we also show the results of a more standard diarization approach, represented by the method used in the Kaldi toolkit.
Databáze: OpenAIRE