Výsledky vyhledávání

Report

Understanding Pedestrian Movement Using Urban Sensing Technologies: The Promise of Audio-based Sensors

Autor: Han, Chaeyeon, Seshadri, Pavan, Ding, Yiwei, Posner, Noah, Koo, Bon Woo, Agrawal, Animesh, Lerch, Alexander, Guhathakurta, Subhrajit

While various sensors have been deployed to monitor vehicular flows, sensing pedestrian movement is still nascent. Yet walking is a significant mode of travel in many cities, especially those in Europe, Africa, and Asia. Understanding pedestrian volu

Externí odkaz: http://arxiv.org/abs/2406.09998

Zobrazit plný text záznamu

Report

Embedding Compression for Teacher-to-Student Knowledge Transfer

Autor: Ding, Yiwei, Lerch, Alexander

Common knowledge distillation methods require the teacher model and the student model to be trained on the same task. However, the usage of embeddings as teachers has also been proposed for different source tasks and target tasks. Prior work that use

Externí odkaz: http://arxiv.org/abs/2402.06761

Zobrazit plný text záznamu

Report

A photon-level broadband dual-comb interferometer for turbulent open-air trace gases detection application

Autor: Zhong, Wei, Liu, Yingyu, Yin, Qin, Zhao, Ruocan, Ding, Yiwei, Wang, Chong, Chen, Tindi, Dou, Xiankang, Xue, Xianghui

Open-path dual-comb spectroscopy (DCS) significantly enhances our understanding of regional trace gases. However, due to technical challenges, cost considerations, and eye-safety regulations, its sensing range and flexibility remain limited. The phot

Externí odkaz: http://arxiv.org/abs/2401.11657

Zobrazit plný text záznamu

Report

A Generalized Bandsplit Neural Network for Cinematic Audio Source Separation

Autor: Watcharasupat, Karn N., Wu, Chih-Wei, Ding, Yiwei, Orife, Iroro, Hipple, Aaron J., Williams, Phillip A., Kramer, Scott, Lerch, Alexander, Wolcott, William

Publikováno v: IEEE Open Journal of Signal Processing, vol. 5, pp. 73-81, 2024

Cinematic audio source separation is a relatively new subtask of audio source separation, with the aim of extracting the dialogue, music, and effects stems from their mixture. In this work, we developed a model generalizing the Bandsplit RNN for any

Externí odkaz: http://arxiv.org/abs/2309.02539

Zobrazit plný text záznamu

Report

Audio Embeddings as Teachers for Music Classification

Autor: Ding, Yiwei, Lerch, Alexander

Music classification has been one of the most popular tasks in the field of music information retrieval. With the development of deep learning models, the last decade has seen impressive improvements in a wide range of classification tasks. However,

Externí odkaz: http://arxiv.org/abs/2306.17424

Zobrazit plný text záznamu

Report

MusicFace: Music-driven Expressive Singing Face Synthesis

Autor: Liu, Pengfei, Deng, Wenjin, Li, Hengda, Wang, Jintai, Zheng, Yinglin, Ding, Yiwei, Guo, Xiaohu, Zeng, Ming

It is still an interesting and challenging problem to synthesize a vivid and realistic singing face driven by music signal. In this paper, we present a method for this task with natural motions of the lip, facial expression, head pose, and eye states

Externí odkaz: http://arxiv.org/abs/2303.14044

Zobrazit plný text záznamu

Akademický článek

Physicochemical Properties, in Vitro Gastrointestinal Digestion and Fermentation Characteristics of Extruded Barley Flour Incorporated with Lyophyllum decastes

Autor: DING Yiwei, FAN Songtao, BAI Juan, GU Yaojun, XIAO Xiang

Publikováno v: Shipin Kexue, Vol 45, Iss 13, Pp 198-209 (2024)

The effects of adding dried Lyophyllum decastes (DLR) or dried L. decastes by-products (DBY) on physicochemical properties and nutritional functions of extruded barley flour (BF) were analyzed. Results revealed a notable decrease in the degree of sta

Externí odkaz: https://doaj.org/article/9d05d0319de441fcafe417d583d19a7b

Zobrazit plný text záznamu

Report

The DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge 2022

Autor: Qin, Xiaoyi, Li, Na, Lin, Yuke, Ding, Yiwei, Weng, Chao, Su, Dan, Li, Ming

This paper is the system description of the DKU-Tencent System for the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC22). In this challenge, we focus on track1 and track3. For track1, multiple backbone networks are adopted to extract frame-level

Externí odkaz: http://arxiv.org/abs/2210.05092

Zobrazit plný text záznamu

Report

I^2R-Net: Intra- and Inter-Human Relation Network for Multi-Person Pose Estimation

Autor: Ding, Yiwei, Deng, Wenjin, Zheng, Yinglin, Liu, Pengfei, Wang, Meihong, Cheng, Xuan, Bao, Jianmin, Chen, Dong, Zeng, Ming

In this paper, we present the Intra- and Inter-Human Relation Networks (I^2R-Net) for Multi-Person Pose Estimation. It involves two basic modules. First, the Intra-Human Relation Module operates on a single person and aims to capture Intra-Human depe

Externí odkaz: http://arxiv.org/abs/2206.10892

Zobrazit plný text záznamu

Report

Rep Works in Speaker Verification

Autor: Ma, Yufeng, Zhao, Miao, Ding, Yiwei, Zheng, Yu, Liu, Min, Xu, Minqiang

Multi-branch convolutional neural network architecture has raised lots of attention in speaker verification since the aggregation of multiple parallel branches can significantly improve performance. However, this design is not efficient enough during

Externí odkaz: http://arxiv.org/abs/2110.09720

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání