Zobrazeno 1 - 10
of 67 749
pro vyhledávání: '"Chen, Yi‐An"'
The advent of Multimodal Large Language Models, leveraging the power of Large Language Models, has recently demonstrated superior multimodal understanding and reasoning abilities, heralding a new era for artificial general intelligence. However, achi
Externí odkaz:
http://arxiv.org/abs/2412.04447
Recent developments in Large Language Models pre-trained on extensive corpora have shown significant success in various natural language processing tasks with minimal fine-tuning. This success offers new promise for robotics, which has long been cons
Externí odkaz:
http://arxiv.org/abs/2412.04445
Autor:
Kong, Weijie, Tian, Qi, Zhang, Zijian, Min, Rox, Dai, Zuozhuo, Zhou, Jin, Xiong, Jiangfeng, Li, Xin, Wu, Bo, Zhang, Jianwei, Wu, Kathrina, Lin, Qin, Yuan, Junkun, Long, Yanxin, Wang, Aladdin, Wang, Andong, Li, Changlin, Huang, Duojun, Yang, Fang, Tan, Hao, Wang, Hongmei, Song, Jacob, Bai, Jiawang, Wu, Jianbing, Xue, Jinbao, Wang, Joey, Wang, Kai, Liu, Mengyang, Li, Pengyu, Li, Shuai, Wang, Weiyan, Yu, Wenqing, Deng, Xinchi, Li, Yang, Chen, Yi, Cui, Yutao, Peng, Yuanbo, Yu, Zhentao, He, Zhiyu, Xu, Zhiyong, Zhou, Zixiang, Xu, Zunnan, Tao, Yangyu, Lu, Qinglin, Liu, Songtao, Zhou, Daquan, Wang, Hongfa, Yang, Yong, Wang, Di, Liu, Yuhong, Jiang, Jie, Zhong, Caesar
Recent advancements in video generation have significantly impacted daily life for both individuals and industries. However, the leading video generation models remain closed-source, resulting in a notable performance gap between industry capabilitie
Externí odkaz:
http://arxiv.org/abs/2412.03603
Large language models (LLMs) have significantly advanced autonomous agents, particularly in zero-shot tool usage, also known as function calling. This research delves into enhancing the function-calling capabilities of LLMs by exploring different app
Externí odkaz:
http://arxiv.org/abs/2412.01130
Understanding the emotions in a dialogue usually requires external knowledge to accurately understand the contents. As the LLMs become more and more powerful, we do not want to settle on the limited ability of the pre-trained language model. However,
Externí odkaz:
http://arxiv.org/abs/2411.17674
Autor:
Ji, Xiaozhong, Hu, Xiaobin, Xu, Zhihong, Zhu, Junwei, Lin, Chuming, He, Qingdong, Zhang, Jiangning, Luo, Donghao, Chen, Yi, Lin, Qin, Lu, Qinglin, Wang, Chengjie
The study of talking face generation mainly explores the intricacies of synchronizing facial movements and crafting visually appealing, temporally-coherent animations. However, due to the limited exploration of global audio perception, current approa
Externí odkaz:
http://arxiv.org/abs/2411.16331
Autor:
Liu, Xin-Yang, Parikh, Meet Hemant, Fan, Xiantao, Du, Pan, Wang, Qing, Chen, Yi-Fan, Wang, Jian-Xun
Eddy-resolving turbulence simulations require stochastic inflow conditions that accurately replicate the complex, multi-scale structures of turbulence. Traditional recycling-based methods rely on computationally expensive precursor simulations, while
Externí odkaz:
http://arxiv.org/abs/2411.14378
Autor:
Cai, Shuhui, Qin, Huafeng, Wang, Huapei, Deng, Chenglong, Yang, Saihong, Xu, Ya, Zhang, Chi, Tang, Xu, Gu, Lixin, Li, Xiaoguang, Shen, Zhongshan, Zhang, Min, He, Kuang, Qi, Kaixian, Fan, Yunchang, Dong, Liang, Hou, Yifei, Shi, Pingyuan, Liu, Shuangchi, Su, Fei, Chen, Yi, Li, Qiuli, Li, Jinhua, Mitchell, Ross N., He, Huaiyu, Li, Chunlai, Pan, Yongxin, Zhu, Rixiang
The evolution of the lunar magnetic field can reveal the Moon's interior structure, thermal history, and surface environment. The mid-to-late stage evolution of the lunar magnetic field is poorly constrained, and thus the existence of a long-lived lu
Externí odkaz:
http://arxiv.org/abs/2411.13719
Autor:
Chen, Yi
Relativistic full weak-neutral axial-vector four-current distributions inside a general spin-$\frac{1}{2}$ system are systematically studied for the first time, where the second-class current contribution associated with the induced (pseudo-)tensor f
Externí odkaz:
http://arxiv.org/abs/2411.12521
Multimedia streaming accounts for the majority of traffic in today's internet. Mechanisms like adaptive bitrate streaming control the bitrate of a stream based on the estimated bandwidth, ideally resulting in smooth playback and a good Quality of Exp
Externí odkaz:
http://arxiv.org/abs/2410.21029