Výsledky vyhledávání

Report

Speaker-IPL: Unsupervised Learning of Speaker Characteristics with i-Vector based Pseudo-Labels

Autor: Aldeneh, Zakaria, Higuchi, Takuya, Jung, Jee-weon, Chen, Li-Wei, Shum, Stephen, Abdelaziz, Ahmed Hussen, Watanabe, Shinji, Likhomanenko, Tatiana, Theobald, Barry-John

Iterative self-training, or iterative pseudo-labeling (IPL)--using an improved model from the current iteration to provide pseudo-labels for the next iteration--has proven to be a powerful approach to enhance the quality of speaker representations. R

Externí odkaz: http://arxiv.org/abs/2409.10791

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Report

Effective field theory for radiative corrections to charged-current processes I: Vector coupling

Autor: Cirigliano, Vincenzo, Dekens, Wouter, Mereghetti, Emanuele, Tomalak, Oleksandr

Publikováno v: Phys. Rev. D 108 (2023) 053003

We study radiative corrections to low-energy charged-current processes involving nucleons, such as neutron beta decay and (anti)neutrino-nucleon scattering within a top-down effective-field-theory approach. We first match the Standard Model to the lo

Externí odkaz: http://arxiv.org/abs/2306.03138

Zobrazit plný text záznamu

Akademický článek

Comparisons of extreme learning machine and backpropagation-based i-vector approach for speaker identification.

Autor: Al-KALTAKCHI, Musab T. S.¹ musab.tahseen@gmail.com, AL-NIMA, Raid R. O.², ABDULLAH, Mohammed A. M.³

Publikováno v: Turkish Journal of Electrical Engineering & Computer Sciences. 2020, Vol. 28 Issue 3, p1236-1245. 10p.

Zobrazit plný text záznamu

Report

An Environmental Feature Representation in I-vector Space for Room Verification and Metadata Estimation

Autor: Caulley, Desmond

This paper investigates the application of environmental feature representations for room verification tasks and acoustic meta-data estimation. Audio recordings contain both speaker and non-speaker information. We refer to the non-speaker-related inf

Externí odkaz: http://arxiv.org/abs/2203.04880

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Report

I-vector Based Within Speaker Voice Quality Identification on connected speech

Autor: Feng, Chuyao, van Leer, Eva, Curtis, Mackenzie Lee, Anderson, David V.

Voice disorders affect a large portion of the population, especially heavy voice users such as teachers or call-center workers. Most voice disorders can be treated effectively with behavioral voice therapy, which teaches patients to replace problemat

Externí odkaz: http://arxiv.org/abs/2102.07307

Zobrazit plný text záznamu

Report

Target-speaker Voice Activity Detection with Improved I-Vector Estimation for Unknown Number of Speaker

Autor: He, Maokui, Raj, Desh, Huang, Zili, Du, Jun, Chen, Zhuo, Watanabe, Shinji

Target-speaker voice activity detection (TS-VAD) has recently shown promising results for speaker diarization on highly overlapped speech. However, the original model requires a fixed (and known) number of speakers, which limits its application to re

Externí odkaz: http://arxiv.org/abs/2108.03342

Zobrazit plný text záznamu

Report

Estimating Uniqueness of I-Vector Representation of Human Voice

Autor: Tandogan, Erkam Sinan, Sencar, Husrev Taha

We study the individuality of the human voice with respect to a widely used feature representation of speech utterances, namely, the i-vector model. As a first step toward this goal, we compare and contrast uniqueness measures proposed for different

Externí odkaz: http://arxiv.org/abs/2008.11985

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání