Showing 1 - 10
of 23,556
for search: '"Mingxing An"'
Author:
Luo, Xinchen, Cao, Jiangxia, Sun, Tianyu, Yu, Jinkai, Huang, Rui, Yuan, Wei, Lin, Hezheng, Zheng, Yichen, Wang, Shiyao, Hu, Qigen, Qiu, Changqing, Zhang, Jiaqi, Zhang, Xu, Yan, Zhiheng, Zhang, Jingming, Zhang, Simin, Wen, Mingxing, Liu, Zhaojie, Gai, Kun, Zhou, Guorui
In recent years, with the significant evolution of multi-modal large models, many recommender researchers realized the potential of multi-modal information for user interest modeling. In industry, a widely used modeling architecture is a cascading para
External link:
http://arxiv.org/abs/2411.11739
Author:
Hwang, Jyh-Jing, Xu, Runsheng, Lin, Hubert, Hung, Wei-Chih, Ji, Jingwei, Choi, Kristy, Huang, Di, He, Tong, Covington, Paul, Sapp, Benjamin, Zhou, Yin, Guo, James, Anguelov, Dragomir, Tan, Mingxing
We introduce EMMA, an End-to-end Multimodal Model for Autonomous driving. Built on a multi-modal large language model foundation, EMMA directly maps raw camera sensor data into various driving-specific outputs, including planner trajectories, percept
External link:
http://arxiv.org/abs/2410.23262
Author:
Chen, Jiaxin, Chen, Mingxing
We introduce a program named KPROJ that unfolds the electronic and phononic band structure of materials modeled by supercells. The program is based on the $\textit{k}$-projection method, which projects the wavefunction of the supercell onto the ${\te
External link:
http://arxiv.org/abs/2410.10910
Developing efficient traffic models is essential for optimizing transportation systems, yet current approaches remain time-intensive and susceptible to human errors due to their reliance on manual processes. Traditional workflows involve exhaustive l
External link:
http://arxiv.org/abs/2409.16876
Author:
Peng, Mingxing, Chen, Kehua, Guo, Xusen, Zhang, Qiming, Lu, Hongliang, Zhong, Hui, Chen, Di, Zhu, Meixin, Yang, Hai
Intelligent Transportation Systems (ITS) are vital in modern traffic management and optimization, significantly enhancing traffic efficiency and safety. Recently, diffusion models have emerged as transformative tools for addressing complex challenges
External link:
http://arxiv.org/abs/2409.15816
Moiré lattices have served as the ideal quantum simulation platform for exploring novel physics due to the flat electronic bands resulting from the long wavelength moiré potentials. However, the large sizes of this type of system challenge the fi
External link:
http://arxiv.org/abs/2409.07987
Author:
Zhao, Yiyang, Wang, Shuai, Sun, Guangzhi, Chen, Zehua, Zhang, Chao, Xu, Mingxing, Zheng, Thomas Fang
In this paper, Whisper, a large-scale pre-trained model for automatic speech recognition, is proposed to apply to speaker verification. A partial multi-scale feature aggregation (PMFA) approach is proposed based on a subset of Whisper encoder blocks
External link:
http://arxiv.org/abs/2408.15585
Published in:
Interspeech 2024
Automatic Speaker Verification (ASV) suffers from performance degradation in noisy conditions. To address this issue, we propose a novel adversarial learning framework that incorporates noise-disentanglement to establish a noise-independent speaker i
External link:
http://arxiv.org/abs/2408.11562
End-to-end models have shown superior performance for automatic speech recognition (ASR). However, such models are often very large in size and thus challenging to deploy on resource-constrained edge devices. While quantisation can reduce model sizes
External link:
http://arxiv.org/abs/2408.03979
We show that several models of interacting XXZ spin chains subject to boundary driving and dissipation possess a subtle kind of time-reversal symmetry, making their steady states exactly solvable. We focus on a model with a coherent boundary drive, s
External link:
http://arxiv.org/abs/2407.12750