Zobrazeno 1 - 10
of 407
pro vyhledávání: '"Shen, Dazhong"'
Autor:
Yu, Xiaoshan, Qin, Chuan, Shen, Dazhong, Yang, Shangshang, Ma, Haiping, Zhu, Hengshu, Zhang, Xingyi
In the realm of education, both independent learning and group learning are esteemed as the most classic paradigms. The former allows learners to self-direct their studies, while the latter is typically characterized by teacher-directed scenarios. Re
Externí odkaz:
http://arxiv.org/abs/2406.12465
Autor:
Wang, Fu-Yun, Huang, Zhaoyang, Bergman, Alexander William, Shen, Dazhong, Gao, Peng, Lingelbach, Michael, Sun, Keqiang, Bian, Weikang, Song, Guanglu, Liu, Yu, Li, Hongsheng, Wang, Xiaogang
The consistency model (CM) has recently made significant progress in accelerating the generation of diffusion models. However, its application to high-resolution, text-conditioned image generation in the latent space (a.k.a., LCM) remains unsatisfact
Externí odkaz:
http://arxiv.org/abs/2405.18407
Autor:
Zong, Zhuofan, Ma, Bingqi, Shen, Dazhong, Song, Guanglu, Shao, Hao, Jiang, Dongzhi, Li, Hongsheng, Liu, Yu
As the key component in multimodal large language models (MLLMs), the ability of the visual encoder greatly affects MLLM's understanding on diverse image content. Although some large-scale pretrained vision encoders such as vision encoders in CLIP an
Externí odkaz:
http://arxiv.org/abs/2404.13046
Autor:
Jiang, Feihu, Qin, Chuan, Zhang, Jingshuai, Yao, Kaichun, Chen, Xi, Shen, Dazhong, Zhu, Chen, Zhu, Hengshu, Xiong, Hui
In the contemporary era of widespread online recruitment, resume understanding has been widely acknowledged as a fundamental and crucial task, which aims to extract structured information from resume documents automatically. Compared to the tradition
Externí odkaz:
http://arxiv.org/abs/2404.13067
Classifier-Free Guidance (CFG) has been widely used in text-to-image diffusion models, where the CFG scale is introduced to control the strength of text guidance on the whole image space. However, we argue that a global CFG scale results in spatial i
Externí odkaz:
http://arxiv.org/abs/2404.05384
Autor:
Jiang, Dongzhi, Song, Guanglu, Wu, Xiaoshi, Zhang, Renrui, Shen, Dazhong, Zong, Zhuofan, Liu, Yu, Li, Hongsheng
Diffusion models have demonstrated great success in the field of text-to-image generation. However, alleviating the misalignment between the text prompts and images is still challenging. The root reason behind the misalignment has not been extensivel
Externí odkaz:
http://arxiv.org/abs/2404.03653
Collaborative filtering methods based on graph neural networks (GNNs) have witnessed significant success in recommender systems (RS), capitalizing on their ability to capture collaborative signals within intricate user-item relationships via message-
Externí odkaz:
http://arxiv.org/abs/2403.17416
Autor:
Wang, Fu-Yun, Wu, Xiaoshi, Huang, Zhaoyang, Shi, Xiaoyu, Shen, Dazhong, Song, Guanglu, Liu, Yu, Li, Hongsheng
Video outpainting is a challenging task, aiming at generating video content outside the viewport of the input video while maintaining inter-frame and intra-frame consistency. Existing methods fall short in either generation quality or flexibility. We
Externí odkaz:
http://arxiv.org/abs/2403.13745
Graph Convolutional Networks (GCNs) have become pivotal in recommendation systems for learning user and item embeddings by leveraging the user-item interaction graph's node information and topology. However, these models often face the famous over-sm
Externí odkaz:
http://arxiv.org/abs/2403.04287
Autor:
Zhang, Yunfei, Qin, Chuan, Shen, Dazhong, Ma, Haiping, Zhang, Le, Zhang, Xingyi, Zhu, Hengshu
During the past few decades, cognitive diagnostics modeling has attracted increasing attention in computational education communities, which is capable of quantifying the learning status and knowledge mastery levels of students. Indeed, the recent ad
Externí odkaz:
http://arxiv.org/abs/2401.10749