Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Kurt, Yunus Bilge"'
In recent years, transformer-based architectures become the de facto standard for sequence modeling in deep learning frameworks. Inspired by the successful examples, we propose a causal visual-inertial fusion transformer (VIFT) for pose estimation in
Externí odkaz:
http://arxiv.org/abs/2409.08769