Zobrazeno 1 - 10
of 783
pro vyhledávání: '"Rui, Yong"'
Pre-trained models have become the preferred backbone due to the expansion of model parameters, with techniques like Parameter-Efficient Fine-Tuning (PEFTs) typically fixing the parameters of these models. However, pre-trained models may not always b
Externí odkaz:
http://arxiv.org/abs/2408.07337
Existing research on unconstrained in-the-wild head pose estimation suffers from the flaws of its datasets, which consist of either numerous samples by non-realistic synthesis or constrained collection, or small-scale natural images yet with plausibl
Externí odkaz:
http://arxiv.org/abs/2404.02544
Denoising diffusion probabilistic models for image inpainting aim to add the noise to the texture of image during the forward process and recover masked regions with unmasked ones of the texture via the reverse denoising process. Despite the meaningf
Externí odkaz:
http://arxiv.org/abs/2403.19898
Autor:
Hou, Feng, Yuan, Jin, Yang, Ying, Liu, Yang, Zhang, Yang, Zhong, Cheng, Shi, Zhongchao, Fan, Jianping, Rui, Yong, He, Zhiqiang
Traditional cross-domain tasks, including domain adaptation and domain generalization, rely heavily on training model by source domain data. With the recent advance of vision-language models (VLMs), viewed as natural source models, the cross-domain t
Externí odkaz:
http://arxiv.org/abs/2403.02714
Publikováno v:
ACM Comput. Surv. 55, 9, Article 188 (September 2023)
Video moment localization, also known as video moment retrieval, aiming to search a target segment within a video described by a given natural language query. Beyond the task of temporal action localization whereby the target actions are pre-defined,
Externí odkaz:
http://arxiv.org/abs/2306.07515
In recent years, deep models have achieved remarkable success in various vision tasks. However, their performance heavily relies on large training datasets. In contrast, humans exhibit hybrid learning, seamlessly integrating structured knowledge for
Externí odkaz:
http://arxiv.org/abs/2305.18731
Publikováno v:
IEEE TRANSACTIONS ON MULTIMEDIA, VOL.17, NO.11, pp.2000-2007, NOVEMBER 2015
The gap between low-level visual signals and high-level semantics has been progressively bridged by continuous development of deep neural network (DNN). With recent progress of DNN, almost all image classification tasks have achieved new records of a
Externí odkaz:
http://arxiv.org/abs/2302.13275
Knowledge distillation has been widely adopted in a variety of tasks and has achieved remarkable successes. Since its inception, many researchers have been intrigued by the dark knowledge hidden in the outputs of the teacher model. Recently, a study
Externí odkaz:
http://arxiv.org/abs/2302.08155
Image inpainting has achieved remarkable progress and inspired abundant methods, where the critical bottleneck is identified as how to fulfill the high-frequency structure and low-frequency texture information on the masked regions with semantics. To
Externí odkaz:
http://arxiv.org/abs/2209.08217
Publikováno v:
International Journal of Mining Science and Technology, Vol 34, Iss 4, Pp 461-477 (2024)
Accurate measurement of the evolution of rock joint void geometry is essential for comprehending the distribution characteristics of asperities responsible for shear and seepage behaviors. However, existing techniques often require specialized equipm
Externí odkaz:
https://doaj.org/article/31a1f3fe780449899110d664bf45b68f