Zobrazeno 1 - 10
of 277
pro vyhledávání: '"Zhang, YuCong"'
In contrast to human speech, machine-generated sounds of the same type often exhibit consistent frequency characteristics and discernible temporal periodicity. However, leveraging these dual attributes in anomaly detection remains relatively under-ex
Externí odkaz:
http://arxiv.org/abs/2409.03610
This paper presents the Multimodal Laryngoscopic Video Analyzing System (MLVAS), a novel system that leverages both audio and video data to automatically extract key segments and metrics from raw laryngeal videostroboscopic videos for assisted clinic
Externí odkaz:
http://arxiv.org/abs/2409.03597
This paper proposes an approach for anomalous sound detection that incorporates outlier exposure and inlier modeling within a unified framework by multitask learning. While outlier exposure-based methods can extract features efficiently, it is not ro
Externí odkaz:
http://arxiv.org/abs/2309.07500
Publikováno v:
Shipin Kexue, Vol 45, Iss 15, Pp 40-48 (2024)
The effects of curdlan gum (CG), konjac gum (KGM), sodium alginate (SA) and xanthan gum (XG) on physicochemical properties, eating quality and digestibility of extruded reconstituted rice were compared in this study. The results showed that reconstit
Externí odkaz:
https://doaj.org/article/b1dd67f1f99644de87e96282e470ebb3
Autor:
Zou, Peilin1,2,3 (AUTHOR), Zhang, Yucong1,2 (AUTHOR), Chen, Liangkai4,5 (AUTHOR), Liu, Man1,2 (AUTHOR), Nie, Hao1,2 (AUTHOR), Gao, Hongyu1,2 (AUTHOR), Zhang, Cuntai1,2 (AUTHOR), Yan, Jinhua1,2 (AUTHOR) yanjinhua2013@outlook.com
Publikováno v:
BMC Public Health. 10/21/2024, Vol. 24, p1-11. 11p.
Target-speaker voice activity detection is currently a promising approach for speaker diarization in complex acoustic environments. This paper presents a novel Sequence-to-Sequence Target-Speaker Voice Activity Detection (Seq2Seq-TSVAD) method that c
Externí odkaz:
http://arxiv.org/abs/2210.16127
This paper discribes the DKU-DukeECE submission to the 4th track of the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22). Our system contains a fused voice activity detection model, a clustering-based diarization model, and a target-speaker vo
Externí odkaz:
http://arxiv.org/abs/2210.01677
Autor:
Yang, Qian, Ou, Chubin, Li, Kang, Wang, Zhongxiao, Zhang, Yucong, Liao, Xiangyun, Lv, Jianping, Si, Weixin
Publikováno v:
In Expert Systems With Applications 1 December 2024 255 Part B
Autor:
Zhou, Tongtong, Zhang, Yucong, Wang, Yihui, Liu, Qing, Yang, Yueyue, Qiu, Chao, Jiao, Aiquan, Jin, Zhengyu
Publikováno v:
In International Journal of Biological Macromolecules November 2024 279 Part 3
Publikováno v:
In Science of the Total Environment 1 November 2024 949