Zobrazeno 1 - 10
of 6 525
pro vyhledávání: '"DING Ming"'
In the rapidly evolving domain of Artificial Intelligence (AI), the complex interaction between innovation and regulation has become an emerging focus of our society. Despite tremendous advancements in AI's capabilities to excel in specific tasks and
Externí odkaz:
http://arxiv.org/abs/2412.04683
Autor:
Sun, Tiancheng, Bi, Shaolan, Chen, Xunzhou, Yuxi, Lu, Chen, Yuqin, Ding, Ming-Yi, Shi, Jianrong, Yan, Hongliang, Ge, Zhishuai
This study investigates the temporal and spatial variations in lithium abundance within the Milky Way using a sample of 22,034 main-sequence turn-off (MSTO) stars and subgiants, characterised by precise stellar ages, 3D NLTE (non-local thermodynamic
Externí odkaz:
http://arxiv.org/abs/2411.13011
The widespread use of image acquisition technologies, along with advances in facial recognition, has raised serious privacy concerns. Face de-identification usually refers to the process of concealing or replacing personal identifiers, which is regar
Externí odkaz:
http://arxiv.org/abs/2411.09863
Autor:
Cheng, Yean, Cai, Ziqi, Ding, Ming, Zheng, Wendi, Huang, Shiyu, Dong, Yuxiao, Tang, Jie, Shi, Boxin
We introduce DreamPolish, a text-to-3D generation model that excels in producing refined geometry and high-quality textures. In the geometry construction phase, our approach leverages multiple neural representations to enhance the stability of the sy
Externí odkaz:
http://arxiv.org/abs/2411.01602
Autor:
Yang, Mengmeng, Qu, Youyang, Ranbaduge, Thilina, Thapa, Chandra, Sultan, Nazatul, Ding, Ming, Suzuki, Hajime, Ni, Wei, Abuadbba, Sharif, Smith, David, Tyler, Paul, Pieprzyk, Josef, Rakotoarivelo, Thierry, Guan, Xinlong, M'rabet, Sirine
The vision for 6G aims to enhance network capabilities with faster data rates, near-zero latency, and higher capacity, supporting more connected devices and seamless experiences within an intelligent digital ecosystem where artificial intelligence (A
Externí odkaz:
http://arxiv.org/abs/2410.21986
Pedestrian action prediction is of great significance for many applications such as autonomous driving. However, state-of-the-art methods lack explainability to make trustworthy predictions. In this paper, a novel framework called MulCPred is propose
Externí odkaz:
http://arxiv.org/abs/2409.09446
Autor:
Hong, Wenyi, Wang, Weihan, Ding, Ming, Yu, Wenmeng, Lv, Qingsong, Wang, Yan, Cheng, Yean, Huang, Shiyu, Ji, Junhui, Xue, Zhao, Zhao, Lei, Yang, Zhuoyi, Gu, Xiaotao, Zhang, Xiaohan, Feng, Guanyu, Yin, Da, Wang, Zihan, Qi, Ji, Song, Xixuan, Zhang, Peng, Liu, Debing, Xu, Bin, Li, Juanzi, Dong, Yuxiao, Tang, Jie
Beginning with VisualGLM and CogVLM, we are continuously exploring VLMs in pursuit of enhanced vision-language fusion, efficient higher-resolution architecture, and broader modalities and applications. Here we propose the CogVLM2 family, a new genera
Externí odkaz:
http://arxiv.org/abs/2408.16500
This paper investigates the simultaneous reconstruction of the running cost function and the internal topological structure within the mean-field games (MFG) system utilizing partial boundary data. The inverse problem is notably challenging due to fa
Externí odkaz:
http://arxiv.org/abs/2408.08911
Autor:
Liu, Xiao, Zhang, Tianjie, Gu, Yu, Iong, Iat Long, Xu, Yifan, Song, Xixuan, Zhang, Shudan, Lai, Hanyu, Liu, Xinyi, Zhao, Hanlin, Sun, Jiadai, Yang, Xinyue, Yang, Yu, Qi, Zehan, Yao, Shuntian, Sun, Xueqiao, Cheng, Siyi, Zheng, Qinkai, Yu, Hao, Zhang, Hanchen, Hong, Wenyi, Ding, Ming, Pan, Lihang, Gu, Xiaotao, Zeng, Aohan, Du, Zhengxiao, Song, Chan Hee, Su, Yu, Dong, Yuxiao, Tang, Jie
Large Multimodal Models (LMMs) have ushered in a new era in artificial intelligence, merging capabilities in both language and vision to form highly capable Visual Foundation Agents. These agents are postulated to excel across a myriad of tasks, pote
Externí odkaz:
http://arxiv.org/abs/2408.06327
Autor:
Yang, Zhuoyi, Teng, Jiayan, Zheng, Wendi, Ding, Ming, Huang, Shiyu, Xu, Jiazheng, Yang, Yuanming, Hong, Wenyi, Zhang, Xiaohan, Feng, Guanyu, Yin, Da, Gu, Xiaotao, Zhang, Yuxuan, Wang, Weihan, Cheng, Yean, Liu, Ting, Xu, Bin, Dong, Yuxiao, Tang, Jie
We present CogVideoX, a large-scale text-to-video generation model based on diffusion transformer, which can generate 10-second continuous videos aligned with text prompt, with a frame rate of 16 fps and resolution of 768 * 1360 pixels. Previous vide
Externí odkaz:
http://arxiv.org/abs/2408.06072