Zobrazeno 1 - 10
of 119
pro vyhledávání: '"Paudel, Danda"'
The advancement of dense visual simultaneous localization and mapping (SLAM) has been greatly facilitated by the emergence of neural implicit representations. Neural implicit encoding SLAM, a typical example of which is NICE-SLAM, has recently demons
Externí odkaz:
http://arxiv.org/abs/2410.03812
Recent progress in large language models and access to large-scale robotic datasets has sparked a paradigm shift in robotics models transforming them into generalists able to adapt to various tasks, scenes, and robot modalities. A large step for the
Externí odkaz:
http://arxiv.org/abs/2409.15250
Current methods to learn controllers for autonomous vehicles (AVs) focus on behavioural cloning. Being trained only on exact historic data, the resulting agents often generalize poorly to novel scenarios. Simulators provide the opportunity to go beyo
Externí odkaz:
http://arxiv.org/abs/2409.07965
World models are increasingly pivotal in interpreting and simulating the rules and actions of complex environments. Genie, a recent model, excels at learning from visually diverse environments but relies on costly human-collected data. We observe tha
Externí odkaz:
http://arxiv.org/abs/2409.06445
CLIP is a powerful and widely used tool for understanding images in the context of natural language descriptions to perform nuanced tasks. However, it does not offer application-specific fine-grained and structured understanding, due to its generic n
Externí odkaz:
http://arxiv.org/abs/2409.01690
Generalist vision models aim for one and the same architecture for a variety of vision tasks. While such shared architecture may seem attractive, generalist models tend to be outperformed by their bespoken counterparts, especially in the case of pano
Externí odkaz:
http://arxiv.org/abs/2408.16504
Autor:
Ma, Qi, Li, Yue, Ren, Bin, Sebe, Nicu, Konukoglu, Ender, Gevers, Theo, Van Gool, Luc, Paudel, Danda Pani
3D Gaussian Splatting (3DGS) has become the de facto method of 3D representation in many vision tasks. This calls for the 3D understanding directly in this representation space. To facilitate the research in this direction, we first build a large-sca
Externí odkaz:
http://arxiv.org/abs/2408.10906
Autor:
Pan, Jiancheng, Liu, Yanxing, Fu, Yuqian, Ma, Muyuan, Li, Jiaohao, Paudel, Danda Pani, Van Gool, Luc, Huang, Xiaomeng
Object detection, particularly open-vocabulary object detection, plays a crucial role in Earth sciences, such as environmental monitoring, natural disaster assessment, and land-use planning. However, existing open-vocabulary detectors, primarily trai
Externí odkaz:
http://arxiv.org/abs/2408.09110
Autor:
Ren, Bin, Zamfir, Eduard, Li, Yawei, Wu, Zongwei, Paudel, Danda Pani, Timofte, Radu, Sebe, Nicu, Van Gool, Luc
With the emergence of mobile devices, there is a growing demand for an efficient model to restore any degraded image for better perceptual quality. However, existing models often require specific learning modules tailored for each degradation, result
Externí odkaz:
http://arxiv.org/abs/2407.13372
Personalized 3D avatars require an animatable representation of digital humans. Doing so instantly from monocular videos offers scalability to broad class of users and wide-scale applications. In this paper, we present a fast, simple, yet effective m
Externí odkaz:
http://arxiv.org/abs/2407.11174