Zobrazeno 1 - 10
of 8 994
pro vyhledávání: '"Huang An-Ping"'
Autor:
Mittal, Vikash, Huang, Yi-Ping
Parrondo's paradox, a counterintuitive phenomenon where two losing strategies combine to produce a winning outcome, has been a subject of interest across various scientific fields, including quantum mechanics. In this study, we investigate the manife
Externí odkaz:
http://arxiv.org/abs/2407.16558
In this work, we introduce Speech-Copilot, a modular framework for instruction-oriented speech-processing tasks that minimizes human effort in toolset construction. Unlike end-to-end methods using large audio-language models, Speech-Copilot builds sp
Externí odkaz:
http://arxiv.org/abs/2407.09886
Deep learning-based end-to-end automatic speech recognition (ASR) has made significant strides but still struggles with performance on out-of-domain (OOD) samples due to domain shifts in real-world scenarios. Test-Time Adaptation (TTA) methods addres
Externí odkaz:
http://arxiv.org/abs/2406.11064
Large audio-language models (LALMs) enhance traditional large language models by integrating audio perception capabilities, allowing them to tackle audio-related tasks. Previous research has primarily focused on assessing the performance of LALMs acr
Externí odkaz:
http://arxiv.org/abs/2406.08402
We demonstrate an invertible all-optical gate on chip, with the roles of control and signal switchable by slightly adjusting their relative arrival time at the gate. It is based on quantum Zeno blockade driven by sum-frequency generation in a periodi
Externí odkaz:
http://arxiv.org/abs/2405.00150
We demonstrate parametric all-optical modulation in a periodically-poled lithium niobate microring resonator on chip. It employs quantum Zeno blockade between two distinct waves, a signal and a pump, through their sum-frequency generation at a large
Externí odkaz:
http://arxiv.org/abs/2402.10367
This paper presents an effective transfer learning framework for language adaptation in text-to-speech systems, with a focus on achieving language adaptation using minimal labeled and unlabeled data. While many works focus on reducing the usage of la
Externí odkaz:
http://arxiv.org/abs/2402.01692
We reveal that the criticality of the chiral phase transition in QCD at the macroscale arises from the microscopic energy levels of its fundamental constituents, the quarks. We establish a novel relation between cumulants of the chiral order paramete
Externí odkaz:
http://arxiv.org/abs/2402.16867
We reveal that the universal scaling properties of the chiral phase transition in quantum chromodynamics (QCD) at the macroscale are, in fact, encoded within the microscopic energy levels of its fundamental constituents, the quarks. We introduce a no
Externí odkaz:
http://arxiv.org/abs/2401.10263
Autor:
Huang, Hsin-Ping, Su, Yu-Chuan, Sun, Deqing, Jiang, Lu, Jia, Xuhui, Zhu, Yukun, Yang, Ming-Hsuan
Text-to-video generation has shown promising results. However, by taking only natural languages as input, users often face difficulties in providing detailed information to precisely control the model's output. In this work, we propose fine-grained c
Externí odkaz:
http://arxiv.org/abs/2312.02919