Zobrazeno 1 - 10
of 20 770
pro vyhledávání: '"Jiang,Yu"'
Autor:
Zhang, Hui, Hong, Dexiang, Gao, Tingwei, Wang, Yitong, Shao, Jie, Wu, Xinglong, Wu, Zuxuan, Jiang, Yu-Gang
Diffusion models have been recognized for their ability to generate images that are not only visually appealing but also of high artistic quality. As a result, Layout-to-Image (L2I) generation has been proposed to leverage region-specific positions a
Externí odkaz:
http://arxiv.org/abs/2412.03859
Autor:
Peng, Wujian, Meng, Lingchen, Chen, Yitong, Xie, Yiweng, Liu, Yang, Gui, Tao, Xu, Hang, Qiu, Xipeng, Wu, Zuxuan, Jiang, Yu-Gang
Large Multimodal Models (LMMs) have made significant breakthroughs with the advancement of instruction tuning. However, while existing models can understand images and videos at a holistic level, they still struggle with instance-level understanding
Externí odkaz:
http://arxiv.org/abs/2412.03565
Autor:
Wang, Zhixiang, Ye, Guangnan, Wang, Xiaosen, Chen, Siheng, Wang, Zhibo, Ma, Xingjun, Jiang, Yu-Gang
Physical adversarial patches printed on clothing can easily allow individuals to evade person detectors. However, most existing adversarial patch generation methods prioritize attack effectiveness over stealthiness, resulting in patches that are aest
Externí odkaz:
http://arxiv.org/abs/2412.01440
Autor:
Yu, Junqiu, Ren, Xinlin, Gu, Yongchong, Lin, Haitao, Wang, Tianyu, Zhu, Yi, Xu, Hang, Jiang, Yu-Gang, Xue, Xiangyang, Fu, Yanwei
Language-guided robotic grasping is a rapidly advancing field where robots are instructed using human language to grasp specific objects. However, existing methods often depend on dense camera views and struggle to quickly update scenes, limiting the
Externí odkaz:
http://arxiv.org/abs/2412.02140
Autor:
Jiang, Yu
Filaments play a crucial role in providing the necessary environmental conditions for star formation, actively participating in the process. To facilitate the identification and analysis of filaments, we introduce DPConCFil (Directional and Positiona
Externí odkaz:
http://arxiv.org/abs/2412.01238
Autor:
Song, Xue, Cui, Jiequan, Zhang, Hanwang, Shi, Jiaxin, Chen, Jingjing, Zhang, Chi, Jiang, Yu-Gang
In this paper, we propose the LoRA of Change (LoC) framework for image editing with visual instructions, i.e., before-after image pairs. Compared to the ambiguities, insufficient specificity, and diverse interpretations of natural language, visual in
Externí odkaz:
http://arxiv.org/abs/2411.19156
Autor:
Sun, Zhihao, Jiang, Haoran, Chen, Haoran, Cao, Yixin, Qiu, Xipeng, Wu, Zuxuan, Jiang, Yu-Gang
Multimodal large language models have unlocked new possibilities for various multimodal tasks. However, their potential in image manipulation detection remains unexplored. When directly applied to the IMD task, M-LLMs often produce reasoning texts th
Externí odkaz:
http://arxiv.org/abs/2411.19466
Autor:
Xi, Zhiheng, Yang, Dingwen, Huang, Jixuan, Tang, Jiafu, Li, Guanyu, Ding, Yiwen, He, Wei, Hong, Boyang, Do, Shihan, Zhan, Wenyu, Wang, Xiao, Zheng, Rui, Ji, Tao, Shi, Xiaowei, Zhai, Yitao, Weng, Rongxiang, Wang, Jingang, Cai, Xunliang, Gui, Tao, Wu, Zuxuan, Zhang, Qi, Qiu, Xipeng, Huang, Xuanjing, Jiang, Yu-Gang
Training large language models (LLMs) to spend more time thinking and reflection before responding is crucial for effectively solving complex reasoning tasks in fields such as science, coding, and mathematics. However, the effectiveness of mechanisms
Externí odkaz:
http://arxiv.org/abs/2411.16579
Connectionist temporal classification (CTC)-based scene text recognition (STR) methods, e.g., SVTR, are widely employed in OCR applications, mainly due to their simple architecture, which only contains a visual model and a CTC-aligned linear classifi
Externí odkaz:
http://arxiv.org/abs/2411.15858
Autor:
Hossain, Md Shafayat, Zhang, Qi, Choi, Eun Sang, Ratkovski, Danilo, Lüscher, Bernhard, Li, Yongkai, Jiang, Yu-Xiao, Litskevich, Maksim, Cheng, Zi-Jia, Yin, Jia-Xin, Cochran, Tyler A., Casas, Brian, Kim, Byunghoon, Yang, Xian, Liu, Jinjin, Yao, Yugui, Bangura, Ali, Wang, Zhiwei, Fischer, Mark H., Neupert, Titus, Balicas, Luis, Hasan, M. Zahid
Determining the types of superconducting order in quantum materials is a challenge, especially when multiple degrees of freedom, such as bands or orbitals, contribute to the fermiology and when superconductivity competes, intertwines, or coexists wit
Externí odkaz:
http://arxiv.org/abs/2411.15333