Zobrazeno 1 - 10
of 162
pro vyhledávání: '"Huang, Qingqiu"'
Autor:
Nie, Ming, Xue, Yujing, Wang, Chunwei, Ye, Chaoqiang, Xu, Hang, Zhu, Xinge, Huang, Qingqiu, Mi, Michael Bi, Wang, Xinchao, Zhang, Li
Recently, polar-based representation has shown promising properties in perceptual tasks. In addition to Cartesian-based approaches, which separate point clouds unevenly, representing point clouds as polar grids has been recognized as an alternative d
Externí odkaz:
http://arxiv.org/abs/2308.03982
Autor:
Zeng, Yihan, Jiang, Chenhan, Mao, Jiageng, Han, Jianhua, Ye, Chaoqiang, Huang, Qingqiu, Yeung, Dit-Yan, Yang, Zhen, Liang, Xiaodan, Xu, Hang
Contrastive Language-Image Pre-training, benefiting from large-scale unlabeled text-image pairs, has demonstrated great performance in open-world vision understanding tasks. However, due to the limited Text-3D data pairs, adapting the success of 2D V
Externí odkaz:
http://arxiv.org/abs/2303.12417
LiDAR and camera are two important sensors for 3D object detection in autonomous driving. Despite the increasing popularity of sensor fusion in this field, the robustness against inferior image conditions, e.g., bad illumination and sensor misalignme
Externí odkaz:
http://arxiv.org/abs/2203.11496
Adversarial robustness has attracted extensive studies recently by revealing the vulnerability and intrinsic characteristics of deep networks. However, existing works on adversarial robustness mainly focus on balanced datasets, while real-world data
Externí odkaz:
http://arxiv.org/abs/2104.02703
The task of searching certain people in videos has seen increasing potential in real-world applications, such as video organization and editing. Most existing approaches are devised to work in an offline manner, where identities can only be inferred
Externí odkaz:
http://arxiv.org/abs/2008.03546
Shots are key narrative elements of various videos, e.g. movies, TV series, and user-generated videos that are thriving over the Internet. The types of shots greatly influence how the underlying ideas, emotions, and messages are expressed. The techni
Externí odkaz:
http://arxiv.org/abs/2008.03548
Recent years have seen remarkable advances in visual understanding. However, how to understand a story-based long video with artistic styles, e.g. movie, remains challenging. In this paper, we introduce MovieNet -- a holistic dataset for movie unders
Externí odkaz:
http://arxiv.org/abs/2007.10937
Publikováno v:
Proceedings Of The European Conference On Computer Vision (ECCV), 2020
We present a new loss function called Distribution-Balanced Loss for the multi-label recognition problems that exhibit long-tailed class distributions. Compared to conventional single-label classification problem, multi-label recognition problems are
Externí odkaz:
http://arxiv.org/abs/2007.09654
Recent works have shown that exploiting unlabeled data through label propagation can substantially reduce the labeling cost, which has been a critical issue in developing visual recognition models. Yet, how to propagate labels reliably, especially on
Externí odkaz:
http://arxiv.org/abs/2007.08802
Place is an important element in visual understanding. Given a photo of a building, people can often tell its functionality, e.g. a restaurant or a shop, its cultural style, e.g. Asian or European, as well as its economic type, e.g. industry oriented
Externí odkaz:
http://arxiv.org/abs/2007.03777