Zobrazeno 1 - 10
of 826
pro vyhledávání: '"Cremers, Daniel"'
Autor:
Xia, Yan, Ding, Ran, Qin, Ziyuan, Zhan, Guanqi, Zhou, Kaichen, Yang, Long, Dong, Hao, Cremers, Daniel
Recent advances in predicting 6D grasp poses from a single depth image have led to promising performance in robotic grasping. However, previous grasping models face challenges in cluttered environments where nearby objects impact the target object's
Externí odkaz:
http://arxiv.org/abs/2407.06168
Autor:
Zhang, Gengyuan, Fok, Mang Ling Ada, Xia, Yan, Tang, Yansong, Cremers, Daniel, Torr, Philip, Tresp, Volker, Gu, Jindong
Video understanding is a pivotal task in the digital era, yet the dynamic and multievent nature of videos makes them labor-intensive and computationally demanding to process. Thus, localizing a specific event given a semantic query has gained importa
Externí odkaz:
http://arxiv.org/abs/2406.10079
Recent advancements in generative models have highlighted the crucial role of image tokenization in the efficient synthesis of high-resolution images. Tokenization, which transforms images into latent representations, reduces computational demands co
Externí odkaz:
http://arxiv.org/abs/2406.07550
Initialization-free bundle adjustment (BA) remains largely uncharted. While Levenberg-Marquardt algorithm is the golden method to solve the BA problem, it generally relies on a good initialization. In contrast, the under-explored Variable Projection
Externí odkaz:
http://arxiv.org/abs/2405.05079
Resource-constrained hardware, such as edge devices or cell phones, often rely on cloud servers to provide the required computational resources for inference in deep vision models. However, transferring image and video data from an edge or mobile dev
Externí odkaz:
http://arxiv.org/abs/2404.12330
Autor:
Ehm, Viktoria, Gao, Maolin, Roetzer, Paul, Eisenberger, Marvin, Cremers, Daniel, Bernard, Florian
Finding correspondences between 3D shapes is an important and long-standing problem in computer vision, graphics and beyond. A prominent challenge are partial-to-partial shape matching settings, which occur when the shapes to match are only observed
Externí odkaz:
http://arxiv.org/abs/2404.12209
A major barrier towards the practical deployment of large language models (LLMs) is their lack of reliability. Three situations where this is particularly apparent are correctness, hallucinations when given unanswerable questions, and safety. In all
Externí odkaz:
http://arxiv.org/abs/2404.10960
Inferring scene geometry from images via Structure from Motion is a long-standing and fundamental problem in computer vision. While classical approaches and, more recently, depth map predictions only focus on the visible parts of a scene, the task of
Externí odkaz:
http://arxiv.org/abs/2404.07933
The Laplace-Beltrami operator (LBO) emerges from studying manifolds equipped with a Riemannian metric. It is often called the Swiss army knife of geometry processing as it allows to capture intrinsic shape information and gives rise to heat diffusion
Externí odkaz:
http://arxiv.org/abs/2404.03999
Hierarchy is a natural representation of semantic taxonomies, including the ones routinely used in image segmentation. Indeed, recent work on semantic segmentation reports improved accuracy from supervised training leveraging hierarchical label struc
Externí odkaz:
http://arxiv.org/abs/2404.03778