Zobrazeno 1 - 10
of 24 453
pro vyhledávání: '"YANG, Chao"'
The Hadamard product of tensor train (TT) tensors is one of the most fundamental nonlinear operations in scientific computing and data analysis. Due to its tendency to significantly increase TT ranks, the Hadamard product presents a major computation
Externí odkaz:
http://arxiv.org/abs/2410.04385
In this study, we aim to explore Multitask Speech Language Model (SpeechLM) efficient inference via token reduction. Unlike other modalities such as vision or text, speech has unique temporal dependencies, making previous efficient inference works on
Externí odkaz:
http://arxiv.org/abs/2410.03007
Autor:
Lu, Ke-Han, Chen, Zhehuai, Fu, Szu-Wei, Yang, Chao-Han Huck, Balam, Jagadeesh, Ginsburg, Boris, Wang, Yu-Chiang Frank, Lee, Hung-yi
Recent end-to-end speech language models (SLMs) have expanded upon the capabilities of large language models (LLMs) by incorporating pre-trained speech models. However, these SLMs often undergo extensive speech instruction-tuning to bridge the gap be
Externí odkaz:
http://arxiv.org/abs/2409.20007
Autor:
Sun, Tian-Rui, Geng, Jin-Jun, Yan, Jing-Zhi, Hu, You-Dong, Wu, Xue-Feng, Castro-Tirado, Alberto J., Yang, Chao, Ping, Yi-Ding, Hu, Chen-Ran, Xu, Fan, Gao, Hao-Xuan, Jiang, Ji-An, Zhu, Yan-Tian, Xue, Yongquan, Pérez-García, Ignacio, Wu, Si-Yu, Fernández-García, Emilio, Caballero-García, María D., Sánchez-Ramírez, Rubén, Guziy, Sergiy, Olivares, Ignacio, del Pulgar, Carlos Jesus Pérez, Castellón, A., Castillo, Sebastián, Xiong, Ding-Rong, Pandey, Shashi B., Hiriart, David, García-Segura, Guillermo, Lee, William H., Carrasco-García, I. M., Park, Il H., Meintjes, Petrus J., van Heerden, Hendrik J., Martín-Carrillo, Antonio, Hanlon, Lorraine, Zhang, Bin-Bin, Maury, Alain, Hernández-García, L., Gritsevich, Maria, Rossi, Andrea, Maiorano, Elisabetta, Cusano, Felice, D'Avanzo, Paolo, Ferro, Matteo, Melandri, Andrea, De Pasquale, Massimiliano, Brivio, Riccardo, Fang, Min, Fan, Lu-Lu, Hu, Wei-Da, Wan, Zhen, Hu, Lei, Zuo, Ying-Xi, Tang, Jin-Long, Zhang, Xiao-Ling, Zheng, Xian-Zhong, Li, Bin, Luo, Wen-Tao, Liu, Wei, Wang, Jian, Zhang, Hong-Fei, Liu, Hao, Gao, Jie, Liang, Ming, Wang, Hai-Ren, Yao, Da-Zhi, Cheng, Jing-Quan, Zhao, Wen, Dai, Zi-Gao
Thanks to the rapidly increasing time-domain facilities, we are entering a golden era of research on gamma-ray bursts (GRBs). In this Letter, we report our observations of GRB 240529A with the Burst Optical Observer and Transient Exploring System, th
Externí odkaz:
http://arxiv.org/abs/2409.17983
Large language models are typically fine-tuned to align with human preferences, but tuning large models is computationally intensive and complex. In this work, we introduce $\textit{Integrated Value Guidance}$ (IVG), a method that uses implicit and e
Externí odkaz:
http://arxiv.org/abs/2409.17819
Ensuring that the outputs of neural networks satisfy specific constraints is crucial for applying neural networks to real-life decision-making problems. In this paper, we consider making a batch of neural network outputs satisfy bounded and general l
Externí odkaz:
http://arxiv.org/abs/2409.17500
Annotating and recognizing speech emotion using prompt engineering has recently emerged with the advancement of Large Language Models (LLMs), yet its efficacy and reliability remain questionable. In this paper, we conduct a systematic study on this t
Externí odkaz:
http://arxiv.org/abs/2409.15551
Autor:
Hu, Ke, Chen, Zhehuai, Yang, Chao-Han Huck, Żelasko, Piotr, Hrinchuk, Oleksii, Lavrukhin, Vitaly, Balam, Jagadeesh, Ginsburg, Boris
Large language models (LLMs) have demonstrated remarkable advancements in language understanding and generation. Building on the success of text-based LLMs, recent research has adapted these models to use speech embeddings for prompting, resulting in
Externí odkaz:
http://arxiv.org/abs/2409.11538
Publikováno v:
Phys. Rev. A 110, 042205 (2024)
The multifractal critical phase (MCP) fundamentally differs from extended and localized phases, exhibiting delocalized distributions in both position and momentum spaces. The investigation on the MCP has largely focused on one-dimensional quasiperiod
Externí odkaz:
http://arxiv.org/abs/2409.10254
Autor:
Yang, Chao-Han Huck, Park, Taejin, Gong, Yuan, Li, Yuanchao, Chen, Zhehuai, Lin, Yen-Ting, Chen, Chen, Hu, Yuchen, Dhawan, Kunal, Żelasko, Piotr, Zhang, Chao, Chen, Yun-Nung, Tsao, Yu, Balam, Jagadeesh, Ginsburg, Boris, Siniscalchi, Sabato Marco, Chng, Eng Siong, Bell, Peter, Lai, Catherine, Watanabe, Shinji, Stolcke, Andreas
Given recent advances in generative AI technology, a key question is how large language models (LLMs) can enhance acoustic modeling tasks using text decoding results from a frozen, pretrained automatic speech recognition (ASR) model. To explore new c
Externí odkaz:
http://arxiv.org/abs/2409.09785