Zobrazeno 1 - 10
of 101
pro vyhledávání: '"Ding, Yiwei"'
Autor:
Han, Chaeyeon, Seshadri, Pavan, Ding, Yiwei, Posner, Noah, Koo, Bon Woo, Agrawal, Animesh, Lerch, Alexander, Guhathakurta, Subhrajit
While various sensors have been deployed to monitor vehicular flows, sensing pedestrian movement is still nascent. Yet walking is a significant mode of travel in many cities, especially those in Europe, Africa, and Asia. Understanding pedestrian volu
Externí odkaz:
http://arxiv.org/abs/2406.09998
Autor:
Ding, Yiwei, Lerch, Alexander
Common knowledge distillation methods require the teacher model and the student model to be trained on the same task. However, the usage of embeddings as teachers has also been proposed for different source tasks and target tasks. Prior work that use
Externí odkaz:
http://arxiv.org/abs/2402.06761
Autor:
Zhong, Wei, Liu, Yingyu, Yin, Qin, Zhao, Ruocan, Ding, Yiwei, Wang, Chong, Chen, Tindi, Dou, Xiankang, Xue, Xianghui
Open-path dual-comb spectroscopy (DCS) significantly enhances our understanding of regional trace gases. However, due to technical challenges, cost considerations, and eye-safety regulations, its sensing range and flexibility remain limited. The phot
Externí odkaz:
http://arxiv.org/abs/2401.11657
Autor:
Watcharasupat, Karn N., Wu, Chih-Wei, Ding, Yiwei, Orife, Iroro, Hipple, Aaron J., Williams, Phillip A., Kramer, Scott, Lerch, Alexander, Wolcott, William
Publikováno v:
IEEE Open Journal of Signal Processing, vol. 5, pp. 73-81, 2024
Cinematic audio source separation is a relatively new subtask of audio source separation, with the aim of extracting the dialogue, music, and effects stems from their mixture. In this work, we developed a model generalizing the Bandsplit RNN for any
Externí odkaz:
http://arxiv.org/abs/2309.02539
Autor:
Ding, Yiwei, Lerch, Alexander
Music classification has been one of the most popular tasks in the field of music information retrieval. With the development of deep learning models, the last decade has seen impressive improvements in a wide range of classification tasks. However,
Externí odkaz:
http://arxiv.org/abs/2306.17424
Autor:
Liu, Pengfei, Deng, Wenjin, Li, Hengda, Wang, Jintai, Zheng, Yinglin, Ding, Yiwei, Guo, Xiaohu, Zeng, Ming
It is still an interesting and challenging problem to synthesize a vivid and realistic singing face driven by music signal. In this paper, we present a method for this task with natural motions of the lip, facial expression, head pose, and eye states
Externí odkaz:
http://arxiv.org/abs/2303.14044
Publikováno v:
Shipin Kexue, Vol 45, Iss 13, Pp 198-209 (2024)
The effects of adding dried Lyophyllum decastes (DLR) or dried L. decastes by-products (DBY) on physicochemical properties and nutritional functions of extruded barley flour (BF) were analyzed. Results revealed a notable decrease in the degree of sta
Externí odkaz:
https://doaj.org/article/9d05d0319de441fcafe417d583d19a7b
This paper is the system description of the DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC22). In this challenge, we focus on track1 and track3. For track1, multiple backbone networks are adopted to extract frame-level
Externí odkaz:
http://arxiv.org/abs/2210.05092
Autor:
Ding, Yiwei, Deng, Wenjin, Zheng, Yinglin, Liu, Pengfei, Wang, Meihong, Cheng, Xuan, Bao, Jianmin, Chen, Dong, Zeng, Ming
In this paper, we present the Intra- and Inter-Human Relation Networks (I^2R-Net) for Multi-Person Pose Estimation. It involves two basic modules. First, the Intra-Human Relation Module operates on a single person and aims to capture Intra-Human depe
Externí odkaz:
http://arxiv.org/abs/2206.10892
Multi-branch convolutional neural network architecture has raised lots of attention in speaker verification since the aggregation of multiple parallel branches can significantly improve performance. However, this design is not efficient enough during
Externí odkaz:
http://arxiv.org/abs/2110.09720