Cascade transformers with dynamic attention for video question answering
Authors: | Jiang, Yimin; Yan, Tingfei; Yao, Mingze; Wang, Huibing; Liu, Wenzhe |
---|---|
Source: | In Computer Vision and Image Understanding, May 2024, 242 |
Database: | ScienceDirect |