Zobrazeno 1 - 10
of 501
pro vyhledávání: '"Jiang, Xiaolong"'
Autor:
Sun, Huixin, Wang, Runqi, Li, Yanjing, Cao, Xianbin, Jiang, Xiaolong, Hu, Yao, Zhang, Baochang
Large-scale pre-trained Vision-Language Models (VLMs) have gained prominence in various visual and multimodal tasks, yet the deployment of VLMs on downstream application platforms remains challenging due to their prohibitive requirements of training
Externí odkaz:
http://arxiv.org/abs/2409.17634
With recent generative models facilitating photo-realistic image synthesis, the proliferation of synthetic images has also engendered certain negative impacts on social platforms, thereby raising an urgent imperative to develop effective detectors. C
Externí odkaz:
http://arxiv.org/abs/2408.06741
Autor:
Yan, Cilin, Wang, Haochen, Yan, Shilin, Jiang, Xiaolong, Hu, Yao, Kang, Guoliang, Xie, Weidi, Gavves, Efstratios
Existing Video Object Segmentation (VOS) relies on explicit user instructions, such as categories, masks, or short phrases, restricting their ability to perform complex video segmentation requiring reasoning with world knowledge. In this paper, we in
Externí odkaz:
http://arxiv.org/abs/2407.11325
With the rapid development of generative models, discerning AI-generated content has evoked increasing attention from both industry and academia. In this paper, we conduct a sanity check on "whether the task of AI-generated image detection has been s
Externí odkaz:
http://arxiv.org/abs/2406.19435
Autor:
Yan, Cilin, Wang, Haochen, Jiang, Xiaolong, Hu, Yao, Tang, Xu, Kang, Guoliang, Gavves, Efstratios
Contrastive Vision-Language Pre-training(CLIP) demonstrates impressive zero-shot capability. The key to improve the adaptation of CLIP to downstream task with few exemplars lies in how to effectively model and transfer the useful knowledge embedded i
Externí odkaz:
http://arxiv.org/abs/2406.11252
Autor:
Wang, Mingze, Su, Lili, Yan, Cilin, Xu, Sheng, Yuan, Pengcheng, Jiang, Xiaolong, Zhang, Baochang
The intelligent interpretation of buildings plays a significant role in urban planning and management, macroeconomic analysis, population dynamics, etc. Remote sensing image building interpretation primarily encompasses building extraction and change
Externí odkaz:
http://arxiv.org/abs/2403.07564
Publikováno v:
Management Decision, 2023, Vol. 62, Issue 8, pp. 2532-2557.
Externí odkaz:
http://www.emeraldinsight.com/doi/10.1108/MD-04-2023-0531
Autor:
Zeng, Bohan, Li, Shanglin, Liu, Xuhui, Gao, Sicheng, Jiang, Xiaolong, Tang, Xu, Hu, Yao, Liu, Jianzhuang, Zhang, Baochang
Brain signal visualization has emerged as an active research area, serving as a critical interface between the human visual system and computer vision models. Although diffusion models have shown promise in analyzing functional magnetic resonance ima
Externí odkaz:
http://arxiv.org/abs/2305.10135
Autor:
Yan, Cilin, Wang, Haochen, Liu, Jie, Jiang, Xiaolong, Hu, Yao, Tang, Xu, Kang, Guoliang, Gavves, Efstratios
Click-based interactive segmentation aims to generate target masks via human clicking, which facilitates efficient pixel-level annotation and image editing. In such a task, target ambiguity remains a problem hindering the accuracy and efficiency of s
Externí odkaz:
http://arxiv.org/abs/2304.11609
CLIP (Contrastive Language-Image Pretraining) is well-developed for open-vocabulary zero-shot image-level recognition, while its applications in pixel-level tasks are less investigated, where most efforts directly adopt CLIP features without delibera
Externí odkaz:
http://arxiv.org/abs/2304.06957