Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation
Author: | Chuang Lin, Yi Jiang, Jianfei Cai, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan |
---|---|
Year of publication: | 2022 |
Source: | Lecture Notes in Computer Science, ISBN: 9783031200588 |
DOI: | 10.1007/978-3-031-20059-5_22 |
Database: | OpenAIRE |