Search results - "Speaker diarization"

Academic article

A lightweight approach to real-time speaker diarization: from audio toward audio-visual data streams

Authors: Frantisek Kynych, Petr Cerva, Jindrich Zdansky, Torbjørn Svendsen, Giampiero Salvi

Published in: EURASIP Journal on Audio, Speech, and Music Processing, Vol 2024, Iss 1, Pp 1-16 (2024)

This manuscript deals with the task of real-time speaker diarization (SD) for stream-wise data processing. Therefore, in contrast to most of the existing papers, it considers not only the accuracy but also the computational demands of indivi

External link: https://doaj.org/article/8b5037e2570e41bebdde2413729b9ef1

Academic article

Speech recognition technology for assessing team debriefing communication and interaction patterns: An algorithmic toolkit for healthcare simulation educators

Authors: Robin Brutschi, Rui Wang, Michaela Kolbe, Kerrin Weiss, Quentin Lohmeyer, Mirko Meboldt

Published in: Advances in Simulation, Vol 9, Iss 1, Pp 1-13 (2024)

Background: Debriefings are central to effective learning in simulation-based medical education. However, educators often face challenges when conducting debriefings, which are further compounded by the lack of empirically derived knowledge o

External link: https://doaj.org/article/ecd603ced7e44d2499ebcf789f77f685

Academic article

Trends in audio scene source counting and analysis

Authors: Michael Nigro, Sridhar Krishnan

Published in: Machine Learning with Applications, Vol 18, p 100593 (2024)

Audio scene analysis involves a variety of tasks to obtain information from an audio environment. Audio source counting is one such task that has implications to many other aspects of audio analysis, yet it is relatively unexplored. This work present

External link: https://doaj.org/article/ab4d25579a8345c9ae0c15f10fbd76bb

Academic article

Exploring AI Techniques for Generalizable Teaching Practice Identification

Authors: Federico Pardo Garcia, Oscar Canovas, Felix J. Garcia Clemente

Published in: IEEE Access, Vol 12, Pp 134702-134713 (2024)

Using automated models to analyze classroom discourse is a valuable tool for educators to improve their teaching methods. In this paper, we focus on exploring alternatives to ensure the generalizability of models for identifying teaching practices ac

External link: https://doaj.org/article/5a20c25632ce41e39a93de6639a595e3

Academic article

ATC-SD Net: Radiotelephone Communications Speaker Diarization Network

Authors: Weijun Pan, Yidi Wang, Yumei Zhang, Boyuan Han

Published in: Aerospace, Vol 11, Iss 7, p 599 (2024)

This study addresses the challenges that high-noise environments and complex multi-speaker scenarios present in civil aviation radio communications. A novel radiotelephone communications speaker diarization network is developed specifically for these

External link: https://doaj.org/article/54a0d0a222764dc09f2ef51c49f34c26

Academic article

Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation

Authors: Ke-Ming Lyu, Ren-yuan Lyu, Hsien-Tsung Chang

Published in: PeerJ Computer Science, Vol 10, p e1973 (2024)

This research presents the development of a cutting-edge real-time multilingual speech recognition and speaker diarization system that leverages OpenAI’s Whisper model. The system specifically addresses the challenges of automatic speech recognitio

External link: https://doaj.org/article/9d5d05f38a6f43e0884eb1289b532e1e

Academic article

Speaker-turn aware diarization for speech-based cognitive assessments

Authors: Sean Shensheng Xu, Xiaoquan Ke, Man-Wai Mak, Ka Ho Wong, Helen Meng, Timothy C. Y. Kwok, Jason Gu, Jian Zhang, Wei Tao, Chunqi Chang

Published in: Frontiers in Neuroscience, Vol 17 (2024)

Introduction: Speaker diarization is an essential preprocessing step for diagnosing cognitive impairments from speech-based Montreal cognitive assessments (MoCA). Methods: This paper proposes three enhancements to the conventional speaker diarization meth

External link: https://doaj.org/article/89297ecfd0b64ef393a6c2384fba0a71

Academic article

Multisensory Fusion for Unsupervised Spatiotemporal Speaker Diarization

Authors: Paris Xylogiannis, Nikolaos Vryzas, Lazaros Vrysis, Charalampos Dimoulas

Published in: Sensors, Vol 24, Iss 13, p 4229 (2024)

Speaker diarization consists of answering the question of “who spoke when” in audio recordings. In meeting scenarios, the task of labeling audio with the corresponding speaker identities can be further assisted by the exploitation of spatial feat

External link: https://doaj.org/article/ff14739e8a074d0dba4886db4816059b

Academic article

Predicting Individual Well-Being in Teamwork Contexts Based on Speech Features

Authors: Tobias Zeulner, Gerhard Johann Hagerer, Moritz Müller, Ignacio Vazquez, Peter A. Gloor

Published in: Information, Vol 15, Iss 4, p 217 (2024)

Current methods for assessing individual well-being in team collaboration at the workplace often rely on manually collected surveys. This limits continuous real-world data collection and proactive measures to improve team member workplace satisfactio

External link: https://doaj.org/article/60d4f4371bef40d281858dc2a7aa08d6
