Zobrazeno 1 - 10
of 1 039
pro vyhledávání: '"Tan, KE"'
Autor:
Xu, Zhongweiyang, Aroudi, Ali, Tan, Ke, Pandey, Ashutosh, Lee, Jung-Suk, Xu, Buye, Nesta, Francesco
This paper presents a novel multi-channel speech enhancement approach, FoVNet, that enables highly efficient speech enhancement within a configurable field of view (FoV) of a smart-glasses user without needing specific target-talker(s) directions. It
Externí odkaz:
http://arxiv.org/abs/2408.06468
Adding visual cues to audio-based speech separation can improve separation performance. This paper introduces AV-CrossNet, an audiovisual (AV) system for speech enhancement, target speaker extraction, and multi-talker speaker separation. AV-CrossNet
Externí odkaz:
http://arxiv.org/abs/2406.11619
Self-supervised learned models have been found to be very effective for certain speech tasks such as automatic speech recognition, speaker identification, keyword spotting and others. While the features are undeniably useful in speech recognition and
Externí odkaz:
http://arxiv.org/abs/2403.01369
Autor:
Huang, Yang, Beers, Timothy C., Yuan, Hai-Bo, Tan, Ke-Feng, Wang, Wei, Zheng, Jie, Li, Chun, Lee, Young Sun, Li, Hai-Ning, Zhao, Jing-Kun, Xue, Xiang-Xiang, Liu, Yu-Juan, Zhang, Hua-Wei, Sun, Xue-Ang, Li, Ji, Gu, Hong-Rui, Wolf, Christian, Onken, Christopher A., Liu, Ji-Feng, Fan, Zhou, Zhao, Gang
We present precise photometric estimates of stellar parameters, including effective temperature, metallicity, luminosity classification, distance, and stellar age, for nearly 26 million stars using the methodology developed in the first paper of this
Externí odkaz:
http://arxiv.org/abs/2307.04469
Autor:
Kumar, Anurag, Tan, Ke, Ni, Zhaoheng, Manocha, Pranay, Zhang, Xiaohui, Henderson, Ethan, Xu, Buye
Measuring quality and intelligibility of a speech signal is usually a critical step in development of speech processing systems. To enable this, a variety of metrics to measure quality and intelligibility under different assumptions have been develop
Externí odkaz:
http://arxiv.org/abs/2304.01448
Publikováno v:
2023 ApJL 945 L5
We report the discovery of a new stream (dubbed as Yangtze) detected in $Gaia$ Data Release 3. The stream is at a heliocentric distance of $\sim$ 9.12 kpc and spans nearly 27$\deg$ by 1.9$\deg$ on sky. The colour-magnitude diagram of Yangtze indicate
Externí odkaz:
http://arxiv.org/abs/2302.05232
Despite multiple efforts made towards adopting complex-valued deep neural networks (DNNs), it remains an open question whether complex-valued DNNs are generally more effective than real-valued DNNs for monaural speech enhancement. This work is devote
Externí odkaz:
http://arxiv.org/abs/2301.04320
Most speech enhancement (SE) models learn a point estimate and do not make use of uncertainty estimation in the learning process. In this paper, we show that modeling heteroscedastic uncertainty by minimizing a multivariate Gaussian negative log-like
Externí odkaz:
http://arxiv.org/abs/2211.08624
Autor:
Tan, Ke
Speech signals are usually distorted by acoustic interference in daily listening environments. Such distortions severely degrade speech intelligibility and quality for human listeners, and make many speech-related tasks, such as automatic speech reco
Publikováno v:
Zhongguo linchuang yanjiu, Vol 37, Iss 4, Pp 560-563 (2024)
Objective To analyze the therapeutic effect of microsurgery on patients with severe aneurysmal subarachnoid hemorrhage (SaSAH). Methods A retrospective analysis was conducted on 14 SaSAH patients admitted to Beijing Chao-Yang Hospital, Capital Medica
Externí odkaz:
https://doaj.org/article/886323c700f247c6bcec84bd21d6e5dd