Showing 1 - 10 of 19,926 for search: '"An, Lizhen"'
Author:
Jiang, Haiyang, Chen, Tong, Zhang, Wentao, Hung, Nguyen Quoc Viet, Yuan, Yuan, Li, Yong, Cui, Lizhen
Urban flow prediction is a classic spatial-temporal forecasting task that estimates the amount of future traffic flow for a given location. Though models represented by Spatial-Temporal Graph Neural Networks (STGNNs) have established themselves as ca
External link:
http://arxiv.org/abs/2412.05534
Query-based models are extensively used in 3D object detection tasks, with a wide range of pre-trained checkpoints readily available online. However, despite their popularity, these models often require an excessive number of object queries, far surp
External link:
http://arxiv.org/abs/2412.02054
We propose the first Bayesian methods for detecting change points in high-dimensional mean and covariance structures. These methods are constructed using pairwise Bayes factors, leveraging modularization to identify significant changes in individual
External link:
http://arxiv.org/abs/2411.14864
The rapid spread of rumors on social media has posed significant challenges to maintaining public trust and information integrity. Since an information cascade process is essentially a propagation tree, recent rumor detection models leverage graph ne
External link:
http://arxiv.org/abs/2411.12949
In the future sixth-generation (6G) era, to support accurate localization sensing and efficient communication link establishment for intelligent agents, a comprehensive understanding of the surrounding environment and proper channel modeling are indi
External link:
http://arxiv.org/abs/2411.03711
Large Multimodal Models (LMMs) have demonstrated the ability to interact with humans under real-world conditions by combining Large Language Models (LLMs) and modality encoders to align multimodal information (visual and auditory) with text. However,
External link:
http://arxiv.org/abs/2410.23861
Author:
Deng, Xiang, Pang, Youxin, Zhao, Xiaochen, Xu, Chao, Wang, Lizhen, Xiao, Hongjiang, Yan, Shi, Zhang, Hongwen, Liu, Yebin
This paper introduces Stereo-Talker, a novel one-shot audio-driven human video synthesis system that generates 3D talking videos with precise lip synchronization, expressive body gestures, temporally consistent photo-realistic quality, and continuous
External link:
http://arxiv.org/abs/2410.23836
The performance of large language models (LLMs) in natural language processing (NLP) tasks is significantly influenced by the quality and diversity of data used for supervised fine-tuning (SFT). Current data selection methods often focus solely on qu
External link:
http://arxiv.org/abs/2410.12458
Large language models (LLMs) have exhibited outstanding performance in engaging with humans and addressing complex questions by leveraging their vast implicit knowledge and robust reasoning capabilities. However, such models are vulnerable to jailbre
External link:
http://arxiv.org/abs/2410.11459
Text-to-SQL translates natural language queries into Structured Query Language (SQL) commands, enabling users to interact with databases using natural language. Essentially, the text-to-SQL task is a text generation task, and its development is prima
External link:
http://arxiv.org/abs/2410.06011