Výsledky vyhledávání - "Paudel, Danda"

Report

EvenNICER-SLAM: Event-based Neural Implicit Encoding SLAM

Autor: Chen, Shi, Paudel, Danda Pani, Van Gool, Luc

The advancement of dense visual simultaneous localization and mapping (SLAM) has been greatly facilitated by the emergence of neural implicit representations. Neural implicit encoding SLAM, a typical example of which is NICE-SLAM, has recently demons

Externí odkaz: http://arxiv.org/abs/2410.03812

Zobrazit plný text záznamu

Report

ReVLA: Reverting Visual Domain Limitation of Robotic Foundation Models

Autor: Dey, Sombit, Zaech, Jan-Nico, Nikolov, Nikolay, Van Gool, Luc, Paudel, Danda Pani

Recent progress in large language models and access to large-scale robotic datasets has sparked a paradigm shift in robotics models transforming them into generalists able to adapt to various tasks, scenes, and robot modalities. A large step for the

Externí odkaz: http://arxiv.org/abs/2409.15250

Zobrazit plný text záznamu

Report

Autonomous Vehicle Controllers From End-to-End Differentiable Simulation

Autor: Nachkov, Asen, Paudel, Danda Pani, Van Gool, Luc

Current methods to learn controllers for autonomous vehicles (AVs) focus on behavioural cloning. Being trained only on exact historic data, the resulting agents often generalize poorly to novel scenarios. Simulators provide the opportunity to go beyo

Externí odkaz: http://arxiv.org/abs/2409.07965

Zobrazit plný text záznamu

Report

Learning Generative Interactive Environments By Trained Agent Exploration

Autor: Kazemi, Naser, Savov, Nedko, Paudel, Danda, Van Gool, Luc

World models are increasingly pivotal in interpreting and simulating the rules and actions of complex environments. Genie, a recent model, excels at learning from visually diverse environments but relies on costly human-collected data. We observe tha

Externí odkaz: http://arxiv.org/abs/2409.06445

Zobrazit plný text záznamu

Report

Taming CLIP for Fine-grained and Structured Visual Understanding of Museum Exhibits

Autor: Balauca, Ada-Astrid, Paudel, Danda Pani, Toutanova, Kristina, Van Gool, Luc

CLIP is a powerful and widely used tool for understanding images in the context of natural language descriptions to perform nuanced tasks. However, it does not offer application-specific fine-grained and structured understanding, due to its generic n

Externí odkaz: http://arxiv.org/abs/2409.01690

Zobrazit plný text záznamu

Report

A Simple and Generalist Approach for Panoptic Segmentation

Autor: Prisadnikov, Nedyalko, Van Gansbeke, Wouter, Paudel, Danda Pani, Van Gool, Luc

Generalist vision models aim for one and the same architecture for a variety of vision tasks. While such shared architecture may seem attractive, generalist models tend to be outperformed by their bespoken counterparts, especially in the case of pano

Externí odkaz: http://arxiv.org/abs/2408.16504

Zobrazit plný text záznamu

Report

ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining

Autor: Ma, Qi, Li, Yue, Ren, Bin, Sebe, Nicu, Konukoglu, Ender, Gevers, Theo, Van Gool, Luc, Paudel, Danda Pani

3D Gaussian Splatting (3DGS) has become the de facto method of 3D representation in many vision tasks. This calls for the 3D understanding directly in this representation space. To facilitate the research in this direction, we first build a large-sca

Externí odkaz: http://arxiv.org/abs/2408.10906

Zobrazit plný text záznamu

Report

Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community

Autor: Pan, Jiancheng, Liu, Yanxing, Fu, Yuqian, Ma, Muyuan, Li, Jiaohao, Paudel, Danda Pani, Van Gool, Luc, Huang, Xiaomeng

Object detection, particularly open-vocabulary object detection, plays a crucial role in Earth sciences, such as environmental monitoring, natural disaster assessment, and land-use planning. However, existing open-vocabulary detectors, primarily trai

Externí odkaz: http://arxiv.org/abs/2408.09110

Zobrazit plný text záznamu

Report

Any Image Restoration with Efficient Automatic Degradation Adaptation

Autor: Ren, Bin, Zamfir, Eduard, Li, Yawei, Wu, Zongwei, Paudel, Danda Pani, Timofte, Radu, Sebe, Nicu, Van Gool, Luc

With the emergence of mobile devices, there is a growing demand for an efficient model to restore any degraded image for better perceptual quality. However, existing models often require specific learning modules tailored for each degradation, result

Externí odkaz: http://arxiv.org/abs/2407.13372

Zobrazit plný text záznamu

Report

iHuman: Instant Animatable Digital Humans From Monocular Videos

Autor: Paudel, Pramish, Khanal, Anubhav, Chhatkuli, Ajad, Paudel, Danda Pani, Tandukar, Jyoti

Personalized 3D avatars require an animatable representation of digital humans. Doing so instantly from monocular videos offers scalability to broad class of users and wide-scale applications. In this paper, we present a fast, simple, yet effective m

Externí odkaz: http://arxiv.org/abs/2407.11174

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání