Cascade transformers with dynamic attention for video question answering

Autor: Jiang, Yimin, Yan, Tingfei, Yao, Mingze, Wang, Huibing, Liu, Wenzhe
Zdroj: In Computer Vision and Image Understanding May 2024 242
Databáze: ScienceDirect