Zobrazeno 1 - 10
of 479 428
pro vyhledávání: '"Camera P"'
Contrastive Language-Image Pre-Training (CLIP) model excels in traditional person re-identification (ReID) tasks due to its inherent advantage in generating textual descriptions for pedestrian images. However, applying CLIP directly to intra-camera s
Externí odkaz:
http://arxiv.org/abs/2409.19563
Autor:
Lu, Jingpei, Liang, Zekai, Xie, Tristin, Ritcher, Florian, Lin, Shan, Liu, Sainan, Yip, Michael C.
Camera-to-robot calibration is crucial for vision-based robot control and requires effort to make it accurate. Recent advancements in markerless pose estimation methods have eliminated the need for time-consuming physical setups for camera-to-robot c
Externí odkaz:
http://arxiv.org/abs/2409.10441
Bird's Eye View (BEV) map prediction is essential for downstream autonomous driving tasks like trajectory prediction. In the past, this was accomplished through the use of a sophisticated sensor configuration that captured a surround view from multip
Externí odkaz:
http://arxiv.org/abs/2409.02676
Accurate calibration of camera intrinsic parameters is crucial to various computer vision-based applications in the fields of intelligent systems, autonomous vehicles, etc. However, existing calibration schemes are incompetent for finding general tre
Externí odkaz:
http://arxiv.org/abs/2409.01171
Currently, there are no learning-free or neural techniques for real-time recalibration of infrared multi-camera systems. In this paper, we address the challenge of real-time, highly-accurate calibration of multi-camera infrared systems, a critical ta
Externí odkaz:
http://arxiv.org/abs/2410.14505
Vision-centric autonomous driving has demonstrated excellent performance with economical sensors. As the fundamental step, 3D perception aims to infer 3D information from 2D images based on 3D-2D projection. This makes driving perception models susce
Externí odkaz:
http://arxiv.org/abs/2410.13864
Multi-camera systems are indispensable in movies, TV shows, and other media. Selecting the appropriate camera at every timestamp has a decisive impact on production quality and audience preferences. Learning-based view recommendation frameworks can a
Externí odkaz:
http://arxiv.org/abs/2410.13585
Autor:
Vyskočil, Jiří, Picek, Lukas
This paper describes the search for an alternative approach to the automatic categorization of camera trap images. First, we benchmark state-of-the-art classifiers using a single model for all images. Next, we evaluate methods combining MegaDetector
Externí odkaz:
http://arxiv.org/abs/2410.12769
We introduce nvTorchCam, an open-source library under the Apache 2.0 license, designed to make deep learning algorithms camera model-independent. nvTorchCam abstracts critical camera operations such as projection and unprojection, allowing developers
Externí odkaz:
http://arxiv.org/abs/2410.12074
As a novel 3D scene representation, semantic occupancy has gained much attention in autonomous driving. However, existing occupancy prediction methods mainly focus on designing better occupancy representations, such as tri-perspective view or neural
Externí odkaz:
http://arxiv.org/abs/2410.11228