Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding

Authors: Arda Senocak, Junsik Kim, Tae-Hyun Oh, Dingzeyu Li, In So Kweon
Publication year: 2023
Source: 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).
DOI: 10.1109/wacv56688.2023.00227
Database: OpenAIRE