Zobrazeno 1 - 10
of 4 623
pro vyhledávání: '"Pan,Jing"'
Obtaining word timestamp information from end-to-end (E2E) ASR models remains challenging due to the lack of explicit time alignment during training. This issue is further complicated in multilingual models. Existing methods, either rely on lexicons
Externí odkaz:
http://arxiv.org/abs/2409.13913
Autor:
Hu, Kai-Xin, Pan, Jing-Han
Nanofluids are suspensions of nanoscale particles (such as metals and their oxides) in base fluids (such as water, oil, or alcohol), which can significantly enhance the heat transfer performance of the base fluid. However, when nanofluids are applied
Externí odkaz:
http://arxiv.org/abs/2409.09995
Autor:
Deng, Yongxin, Qiu, Xihe, Tan, Xiaoyu, Qu, Chao, Pan, Jing, Cheng, Yuan, Xu, Yinghui, Chu, Wei
Cognitive psychology investigates perception, attention, memory, language, problem-solving, decision-making, and reasoning. Kahneman's dual-system theory elucidates the human decision-making process, distinguishing between the rapid, intuitive System
Externí odkaz:
http://arxiv.org/abs/2409.03381
Autor:
Deng, Yongxin, Qiu, Xihe, Tan, Xiaoyu, Pan, Jing, Jue, Chen, Fang, Zhijun, Xu, Yinghui, Chu, Wei, Qi, Yuan
Large language models (LLMs) are trained on extensive text corpora, which inevitably include biased information. Although techniques such as Affective Alignment can mitigate some negative impacts of these biases, existing prompt-based attack methods
Externí odkaz:
http://arxiv.org/abs/2408.10608
Topological orders emerge in both microscopic quantum dynamics and macroscopic materials as a fundamental principle to characterize intricate properties in nature with vital significance, for instance, the Landau levels of electron systems in magneti
Externí odkaz:
http://arxiv.org/abs/2405.09456
Autor:
Hu, Shujie, Zhou, Long, Liu, Shujie, Chen, Sanyuan, Meng, Lingwei, Hao, Hongkun, Pan, Jing, Liu, Xunying, Li, Jinyu, Sivasankaran, Sunit, Liu, Linquan, Wei, Furu
The recent advancements in large language models (LLMs) have revolutionized the field of natural language processing, progressively broadening their scope to multimodal perception and generation. However, effectively integrating listening capabilitie
Externí odkaz:
http://arxiv.org/abs/2404.00656
The Multi-Reference Alignment (MRA) problem aims at the recovery of an unknown signal from repeated observations under the latent action of a group of cyclic isometries, in the presence of additive noise of high intensity $\sigma$. It is a more tract
Externí odkaz:
http://arxiv.org/abs/2312.07839
We present a cost-effective method to integrate speech into a large language model (LLM), resulting in a Contextual Speech Model with Instruction-following/in-context-learning Capabilities (COSMIC) multi-modal LLM. Using GPT-3.5, we generate Speech C
Externí odkaz:
http://arxiv.org/abs/2311.02248
Simultaneous Speech-to-Text translation serves a critical role in real-time crosslingual communication. Despite the advancements in recent years, challenges remain in achieving stability in the translation process, a concern primarily manifested in t
Externí odkaz:
http://arxiv.org/abs/2310.04399
Nonlinear optics of structured light has recently delivered intriguing fundamental physical phenomena in light-matter interactions and advanced applications from classical imaging to quantum informatics. The mutual interaction between spin, orbital a
Externí odkaz:
http://arxiv.org/abs/2305.01192