Showing 1 - 10 of 1,198 for search: '"Speaker diarization"'
A lightweight approach to real-time speaker diarization: from audio toward audio-visual data streams
Published in:
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2024, Iss 1, Pp 1-16 (2024)
Abstract. This manuscript deals with the task of real-time speaker diarization (SD) for stream-wise data processing. Therefore, in contrast to most of the existing papers, it considers not only the accuracy but also the computational demands of indivi
External link:
https://doaj.org/article/8b5037e2570e41bebdde2413729b9ef1
Published in:
Advances in Simulation, Vol 9, Iss 1, Pp 1-13 (2024)
Abstract. Background: Debriefings are central to effective learning in simulation-based medical education. However, educators often face challenges when conducting debriefings, which are further compounded by the lack of empirically derived knowledge o
External link:
https://doaj.org/article/ecd603ced7e44d2499ebcf789f77f685
Author:
Michael Nigro, Sridhar Krishnan
Published in:
Machine Learning with Applications, Vol 18, p 100593 (2024)
Audio scene analysis involves a variety of tasks to obtain information from an audio environment. Audio source counting is one such task that has implications to many other aspects of audio analysis, yet it is relatively unexplored. This work present
External link:
https://doaj.org/article/ab4d25579a8345c9ae0c15f10fbd76bb
Published in:
IEEE Access, Vol 12, Pp 134702-134713 (2024)
Using automated models to analyze classroom discourse is a valuable tool for educators to improve their teaching methods. In this paper, we focus on exploring alternatives to ensure the generalizability of models for identifying teaching practices ac
External link:
https://doaj.org/article/5a20c25632ce41e39a93de6639a595e3
Published in:
Aerospace, Vol 11, Iss 7, p 599 (2024)
This study addresses the challenges that high-noise environments and complex multi-speaker scenarios present in civil aviation radio communications. A novel radiotelephone communications speaker diffraction network is developed specifically for these
External link:
https://doaj.org/article/54a0d0a222764dc09f2ef51c49f34c26
Published in:
PeerJ Computer Science, Vol 10, p e1973 (2024)
This research presents the development of a cutting-edge real-time multilingual speech recognition and speaker diarization system that leverages OpenAI’s Whisper model. The system specifically addresses the challenges of automatic speech recognitio
External link:
https://doaj.org/article/9d5d05f38a6f43e0884eb1289b532e1e
Author:
Sean Shensheng Xu, Xiaoquan Ke, Man-Wai Mak, Ka Ho Wong, Helen Meng, Timothy C. Y. Kwok, Jason Gu, Jian Zhang, Wei Tao, Chunqi Chang
Published in:
Frontiers in Neuroscience, Vol 17 (2024)
Introduction: Speaker diarization is an essential preprocessing step for diagnosing cognitive impairments from speech-based Montreal cognitive assessments (MoCA). Methods: This paper proposes three enhancements to the conventional speaker diarization meth
External link:
https://doaj.org/article/89297ecfd0b64ef393a6c2384fba0a71
Published in:
Sensors, Vol 24, Iss 13, p 4229 (2024)
Speaker diarization consists of answering the question of “who spoke when” in audio recordings. In meeting scenarios, the task of labeling audio with the corresponding speaker identities can be further assisted by the exploitation of spatial feat
External link:
https://doaj.org/article/ff14739e8a074d0dba4886db4816059b
Published in:
Information, Vol 15, Iss 4, p 217 (2024)
Current methods for assessing individual well-being in team collaboration at the workplace often rely on manually collected surveys. This limits continuous real-world data collection and proactive measures to improve team member workplace satisfactio
External link:
https://doaj.org/article/60d4f4371bef40d281858dc2a7aa08d6