Zobrazeno 1 - 10
of 16 214
pro vyhledávání: '"An, Yiqiang"'
Segmentation of ultra-high resolution (UHR) images is a critical task with numerous applications, yet it poses significant challenges due to high spatial resolution and rich fine details. Recent approaches adopt a dual-branch architecture, where a gl
Externí odkaz:
http://arxiv.org/abs/2412.10181
Mean-field limits have been used now as a standard tool in approximations, including for networks with a large number of nodes. Statistical inference on mean-filed models has attracted more attention recently mainly due to the rapid emergence of data
Externí odkaz:
http://arxiv.org/abs/2411.12936
Autor:
Huang, Xuan, Li, Hanhui, Liu, Wanquan, Liang, Xiaodan, Yan, Yiqiang, Cheng, Yuhao, Gao, Chengqiang
In this paper, we propose to create animatable avatars for interacting hands with 3D Gaussian Splatting (GS) and single-image inputs. Existing GS-based methods designed for single subjects often yield unsatisfactory results due to limited input views
Externí odkaz:
http://arxiv.org/abs/2410.08840
Inadequate bounding box modeling in regression tasks constrains the performance of one-stage 3D object detection. Our study reveals that the primary reason lies in two aspects: (1) The limited center-offset prediction seriously impairs the bounding b
Externí odkaz:
http://arxiv.org/abs/2409.00690
Acoustic scene classification (ASC) predominantly relies on supervised approaches. However, acquiring labeled data for training ASC models is often costly and time-consuming. Recently, self-supervised learning (SSL) has emerged as a powerful method f
Externí odkaz:
http://arxiv.org/abs/2408.14862
Autor:
Zhang, Shiyue, Chong, Zheng, Zhang, Xujie, Li, Hanhui, Cheng, Yuhao, Yan, Yiqiang, Liang, Xiaodan
General text-to-image models bring revolutionary innovation to the fields of arts, design, and media. However, when applied to garment generation, even the state-of-the-art text-to-image models suffer from fine-grained semantic misalignment, particul
Externí odkaz:
http://arxiv.org/abs/2408.12352
Autor:
Yang, Chuanpeng, Lu, Wang, Zhu, Yao, Wang, Yidong, Chen, Qian, Gao, Chenlong, Yan, Bingjie, Chen, Yiqiang
Large Language Models (LLMs) have showcased exceptional capabilities in various domains, attracting significant interest from both academia and industry. Despite their impressive performance, the substantial size and computational demands of LLMs pos
Externí odkaz:
http://arxiv.org/abs/2407.01885
Autor:
Chen, Dongping, Huang, Yue, Wu, Siyuan, Tang, Jingyu, Chen, Liuyi, Bai, Yilin, He, Zhigang, Wang, Chenlong, Zhou, Huichi, Li, Yiqiang, Zhou, Tianshuo, Yu, Yue, Gao, Chujie, Zhang, Qihui, Gui, Yi, Li, Zhen, Wan, Yao, Zhou, Pan, Gao, Jianfeng, Sun, Lichao
Recently, Multimodal Large Language Models (MLLMs) have been used as agents to control keyboard and mouse inputs by directly perceiving the Graphical User Interface (GUI) and generating corresponding code. However, current agents primarily exhibit ex
Externí odkaz:
http://arxiv.org/abs/2406.10819
By embedding the Hecke algebra $\check H_q$ of type $D$ into the Hecke algebra $H_{q,1}$ of type $B$ with unequal parameters $(q,1)$, the $q$-Schur algebras $S^\kappa_q(n,r)$ of type $D$ is naturally defined as the endomorphism algebra of the tensor
Externí odkaz:
http://arxiv.org/abs/2406.09057
Autor:
Cheng, Junhao, Lu, Xi, Li, Hanhui, Zai, Khun Loun, Yin, Baiqiao, Cheng, Yuhao, Yan, Yiqiang, Liang, Xiaodan
As cutting-edge Text-to-Image (T2I) generation models already excel at producing remarkable single images, an even more challenging task, i.e., multi-turn interactive image generation begins to attract the attention of related research communities. T
Externí odkaz:
http://arxiv.org/abs/2406.01388