Showing 1 - 2 of 2 for search: '"Anderer, Katharina"'
Published in:
Proceedings of Interspeech 2024
This paper presents a benchmark dataset for aligning lecture videos with corresponding slides and introduces a novel multimodal algorithm leveraging features from speech, text, and images. It achieves an average accuracy of 0.82 in comparison to SIFT
External link:
http://arxiv.org/abs/2409.16765
Published in:
Big Data & Cognitive Computing; Jan2024, Vol. 8 Issue 1, p2, 20p