Zobrazeno 1 - 10
of 463
pro vyhledávání: '"GONG Biao"'
Autor:
GAO Ruiqing, WANG Fei, LIU Hongwei, ZHANG Xiaolong, HE Zhihong, LIU Zhenming, WANG Zibang, GONG Biao
Publikováno v:
Meikuang Anquan, Vol 53, Iss 10, Pp 160-167 (2022)
Taking the 8105 working face of Madaotou Mine as the engineering background, the numerical calculation model of the flow field of the mining face in the extra-thick and spontaneous coal seam was established to analyze the oxygen consumption rate in t
Externí odkaz:
https://doaj.org/article/82c53b70d0594220ab14108ef1dda4d3
Autor:
Li, Wen, Fang, Muyuan, Zou, Cheng, Gong, Biao, Zheng, Ruobing, Wang, Meng, Chen, Jingdong, Yang, Ming
Despite the burst of innovative methods for controlling the diffusion process, effectively controlling image styles in text-to-image generation remains a challenging task. Many adapter-based methods impose image representation conditions on the denoi
Externí odkaz:
http://arxiv.org/abs/2409.02543
To transfer knowledge from seen attribute-object compositions to recognize unseen ones, recent compositional zero-shot learning (CZSL) methods mainly discuss the optimal classification branches to identify the elements, leading to the popularity of e
Externí odkaz:
http://arxiv.org/abs/2408.17083
Autor:
Chen, Chaochao, Zhang, Jiaming, Zhang, Yizhao, Zhang, Li, Lyu, Lingjuan, Li, Yuyuan, Gong, Biao, Yan, Chenggang
With increasing privacy concerns in artificial intelligence, regulations have mandated the right to be forgotten, granting individuals the right to withdraw their data from models. Machine unlearning has emerged as a potential solution to enable sele
Externí odkaz:
http://arxiv.org/abs/2408.14393
Autor:
Huang, Ziyuan, Ji, Kaixiang, Gong, Biao, Qing, Zhiwu, Zhang, Qinglong, Zheng, Kecheng, Wang, Jian, Chen, Jingdong, Yang, Ming
This paper introduces Chain-of-Sight, a vision-language bridge module that accelerates the pre-training of Multimodal Large Language Models (MLLMs). Our approach employs a sequence of visual resamplers that capture visual details at various spacial s
Externí odkaz:
http://arxiv.org/abs/2407.15819
Diffusion models excel at producing high-quality images; however, scaling to higher resolutions, such as 4K, often results in over-smoothed content, structural distortions, and repetitive patterns. To this end, we introduce ResMaster, a novel, traini
Externí odkaz:
http://arxiv.org/abs/2406.16476
Publikováno v:
Guangtongxin yanjiu, Vol , Pp 45-49,56 (2021)
In view of the insufficient research on the vibration characteristics of the submarine cable fault signal, A three-dimensional finite element model of the optical fiber composite submarine cable (submarine cable) under the anchoring action is establi
Externí odkaz:
https://doaj.org/article/b6eb0552f4344262b59ea7f285398f00
Autor:
Wang, Xiang, Zhang, Shiwei, Yuan, Hangjie, Qing, Zhiwu, Gong, Biao, Zhang, Yingya, Shen, Yujun, Gao, Changxin, Sang, Nong
Diffusion-based text-to-video generation has witnessed impressive progress in the past year yet still falls behind text-to-image generation. One of the key reasons is the limited scale of publicly available data (e.g., 10M video-text pairs in WebVid1
Externí odkaz:
http://arxiv.org/abs/2312.15770
Existing text-to-image (T2I) diffusion models usually struggle in interpreting complex prompts, especially those with quantity, object-attribute binding, and multi-subject descriptions. In this work, we introduce a semantic panel as the middleware in
Externí odkaz:
http://arxiv.org/abs/2311.17002
This study focuses on a novel task in text-to-image (T2I) generation, namely action customization. The objective of this task is to learn the co-existing action from limited data and generalize it to unseen humans or even animals. Experimental result
Externí odkaz:
http://arxiv.org/abs/2311.15841