Showing 1 - 1 of 1 for search: '"Hong, Richnag"'
Navigating unseen environments based on natural language instructions remains difficult for egocentric agents in Vision-and-Language Navigation (VLN). While recent advancements have yielded promising outcomes, they primarily rely on RGB images for en
External link:
http://arxiv.org/abs/2412.06465