Zobrazeno 1 - 10
of 63
pro vyhledávání: '"Zhao, Sanyuan"'
Concurrent processing of multiple autonomous driving 3D perception tasks within the same spatiotemporal scene poses a significant challenge, in particular due to the computational inefficiencies and feature competition between tasks when using tradit
Externí odkaz:
http://arxiv.org/abs/2407.10876
We introduce $\textit{InteractiveVideo}$, a user-centric framework for video generation. Different from traditional generative approaches that operate based on user-provided images or text, our framework is designed for dynamic interaction, allowing
Externí odkaz:
http://arxiv.org/abs/2402.03040
End-to-end text spotting is a vital computer vision task that aims to integrate scene text detection and recognition into a unified framework. Typical methods heavily rely on Region-of-Interest (RoI) operations to extract local features and complex p
Externí odkaz:
http://arxiv.org/abs/2306.03377
Recent years have witnessed huge successes in 3D object detection to recognize common objects for autonomous driving (e.g., vehicles and pedestrians). However, most methods rely heavily on a large amount of well-labeled training data. This limits the
Externí odkaz:
http://arxiv.org/abs/2302.03914
Answering semantically-complicated questions according to an image is challenging in Visual Question Answering (VQA) task. Although the image can be well represented by deep learning, the question is always simply embedded and cannot well indicate it
Externí odkaz:
http://arxiv.org/abs/2112.07270
In this paper, we solve the sample shortage problem in the human parsing task. We begin with the self-learning strategy, which generates pseudo-labels for unlabeled data to retrain the model. However, directly using noisy pseudo-labels will cause err
Externí odkaz:
http://arxiv.org/abs/2004.08055
Publikováno v:
In Computer Vision and Image Understanding July 2023 232
Real-world face detection and alignment demand an advanced discriminative model to address challenges by pose, lighting and expression. Illuminated by the deep learning algorithm, some convolutional neural networks based face detection and alignment
Externí odkaz:
http://arxiv.org/abs/1707.09364
Publikováno v:
In Pattern Recognition December 2021 120
Publikováno v:
In Neurocomputing 17 September 2021 453:777-789