Search results - "Cremers, Daniel"

Report

TARGO: Benchmarking Target-driven Object Grasping under Occlusions

Authors: Xia, Yan, Ding, Ran, Qin, Ziyuan, Zhan, Guanqi, Zhou, Kaichen, Yang, Long, Dong, Hao, Cremers, Daniel

Recent advances in predicting 6D grasp poses from a single depth image have led to promising performance in robotic grasping. However, previous grasping models face challenges in cluttered environments where nearby objects impact the target object's

External link: http://arxiv.org/abs/2407.06168

Report

Localizing Events in Videos with Multimodal Queries

Authors: Zhang, Gengyuan, Fok, Mang Ling Ada, Xia, Yan, Tang, Yansong, Cremers, Daniel, Torr, Philip, Tresp, Volker, Gu, Jindong

Video understanding is a pivotal task in the digital era, yet the dynamic and multievent nature of videos makes them labor-intensive and computationally demanding to process. Thus, localizing a specific event given a semantic query has gained importa

External link: http://arxiv.org/abs/2406.10079

Report

An Image is Worth 32 Tokens for Reconstruction and Generation

Authors: Yu, Qihang, Weber, Mark, Deng, Xueqing, Shen, Xiaohui, Cremers, Daniel, Chen, Liang-Chieh

Recent advancements in generative models have highlighted the crucial role of image tokenization in the efficient synthesis of high-resolution images. Tokenization, which transforms images into latent representations, reduces computational demands co

External link: http://arxiv.org/abs/2406.07550

Report

Power Variable Projection for Initialization-Free Large-Scale Bundle Adjustment

Authors: Weber, Simon, Hong, Je Hyeong, Cremers, Daniel

Initialization-free bundle adjustment (BA) remains largely uncharted. While Levenberg-Marquardt algorithm is the golden method to solve the BA problem, it generally relies on a good initialization. In contrast, the under-explored Variable Projection

External link: http://arxiv.org/abs/2405.05079

Report

A Perspective on Deep Vision Performance with Standard Image and Video Codecs

Authors: Reich, Christoph, Hahn, Oliver, Cremers, Daniel, Roth, Stefan, Debnath, Biplob

Resource-constrained hardware, such as edge devices or cell phones, often rely on cloud servers to provide the required computational resources for inference in deep vision models. However, transferring image and video data from an edge or mobile dev

External link: http://arxiv.org/abs/2404.12330

Report

Partial-to-Partial Shape Matching with Geometric Consistency

Authors: Ehm, Viktoria, Gao, Maolin, Roetzer, Paul, Eisenberger, Marvin, Cremers, Daniel, Bernard, Florian

Finding correspondences between 3D shapes is an important and long-standing problem in computer vision, graphics and beyond. A prominent challenge are partial-to-partial shape matching settings, which occur when the shapes to match are only observed

External link: http://arxiv.org/abs/2404.12209

Report

Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations

Authors: Tomani, Christian, Chaudhuri, Kamalika, Evtimov, Ivan, Cremers, Daniel, Ibrahim, Mark

A major barrier towards the practical deployment of large language models (LLMs) is their lack of reliability. Three situations where this is particularly apparent are correctness, hallucinations when given unanswerable questions, and safety. In all

External link: http://arxiv.org/abs/2404.10960

Report

Boosting Self-Supervision for Single-View Scene Completion via Knowledge Distillation

Authors: Han, Keonhee, Muhle, Dominik, Wimbauer, Felix, Cremers, Daniel

Inferring scene geometry from images via Structure from Motion is a long-standing and fundamental problem in computer vision. While classical approaches and, more recently, depth map predictions only focus on the visible parts of a scene, the task of

External link: http://arxiv.org/abs/2404.07933

Report

Finsler-Laplace-Beltrami Operators with Application to Shape Analysis

Authors: Weber, Simon, Dagès, Thomas, Gao, Maolin, Cremers, Daniel

The Laplace-Beltrami operator (LBO) emerges from studying manifolds equipped with a Riemannian metric. It is often called the Swiss army knife of geometry processing as it allows to capture intrinsic shape information and gives rise to heat diffusion

External link: http://arxiv.org/abs/2404.03999

Report

Flattening the Parent Bias: Hierarchical Semantic Segmentation in the Poincaré Ball

Authors: Weber, Simon, Zöngür, Barış, Araslanov, Nikita, Cremers, Daniel

Hierarchy is a natural representation of semantic taxonomies, including the ones routinely used in image segmentation. Indeed, recent work on semantic segmentation reports improved accuracy from supervised training leveraging hierarchical label struc

External link: http://arxiv.org/abs/2404.03778
