Showing 1 - 10 of 20 439 results for search: '"I vector"'
Author:
Aldeneh, Zakaria, Higuchi, Takuya, Jung, Jee-weon, Chen, Li-Wei, Shum, Stephen, Abdelaziz, Ahmed Hussen, Watanabe, Shinji, Likhomanenko, Tatiana, Theobald, Barry-John
Iterative self-training, or iterative pseudo-labeling (IPL)--using an improved model from the current iteration to provide pseudo-labels for the next iteration--has proven to be a powerful approach to enhance the quality of speaker representations. R
External link:
http://arxiv.org/abs/2409.10791
Published in:
Phys. Rev. D 108 (2023) 053003
We study radiative corrections to low-energy charged-current processes involving nucleons, such as neutron beta decay and (anti)neutrino-nucleon scattering within a top-down effective-field-theory approach. We first match the Standard Model to the lo
External link:
http://arxiv.org/abs/2306.03138
Author:
Al-KALTAKCHI, Musab T. S. (musab.tahseen@gmail.com), AL-NIMA, Raid R. O., ABDULLAH, Mohammed A. M.
Published in:
Turkish Journal of Electrical Engineering & Computer Sciences. 2020, Vol. 28 Issue 3, p1236-1245. 10p.
Author:
Caulley, Desmond
This paper investigates the application of environmental feature representations for room verification tasks and acoustic meta-data estimation. Audio recordings contain both speaker and non-speaker information. We refer to the non-speaker-related inf
External link:
http://arxiv.org/abs/2203.04880
Voice disorders affect a large portion of the population, especially heavy voice users such as teachers or call-center workers. Most voice disorders can be treated effectively with behavioral voice therapy, which teaches patients to replace problemat
External link:
http://arxiv.org/abs/2102.07307
Target-speaker voice activity detection (TS-VAD) has recently shown promising results for speaker diarization on highly overlapped speech. However, the original model requires a fixed (and known) number of speakers, which limits its application to re
External link:
http://arxiv.org/abs/2108.03342
We study the individuality of the human voice with respect to a widely used feature representation of speech utterances, namely, the i-vector model. As a first step toward this goal, we compare and contrast uniqueness measures proposed for different
External link:
http://arxiv.org/abs/2008.11985
Published in:
In Expert Systems With Applications, 30 December 2022, 210