Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation

Autor: Chuang Lin, Yi Jiang, Jianfei Cai, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan
Rok vydání: 2022
Zdroj: Lecture Notes in Computer Science ISBN: 9783031200588
DOI: 10.1007/978-3-031-20059-5_22
Databáze: OpenAIRE