Zobrazeno 1 - 10
of 866
pro vyhledávání: '"Zhang Shilei"'
Autor:
Zhang, Chenhao, Wu, Yang, Chen, Jingyi, Jin, Haonan, Wang, Jinghui, Fan, Raymond, Steadman, Paul, van der Laan, Gerrit, Hesjedal, Thorsten, Zhang, Shilei
We performed a pump-probe experiment on the chiral magnet Cu$_2$OSeO$_3$ to study the relaxation dynamics of its non-collinear magnetic orders, employing a millisecond magnetic field pulse as the pump and resonant elastic x-ray scattering as the prob
Externí odkaz:
http://arxiv.org/abs/2410.05485
In this paper, we provide a large audio-visual speaker recognition dataset, VoxBlink2, which includes approximately 10M utterances with videos from 110K+ speakers in the wild. This dataset represents a significant expansion over the VoxBlink dataset,
Externí odkaz:
http://arxiv.org/abs/2407.11510
The diverse nature of dialects presents challenges for models trained on specific linguistic patterns, rendering them susceptible to errors when confronted with unseen or out-of-distribution (OOD) data. This study introduces a novel margin-enhanced j
Externí odkaz:
http://arxiv.org/abs/2406.18067
For speech classification tasks, deep learning models often achieve high accuracy but exhibit shortcomings in calibration, manifesting as classifiers exhibiting overconfidence. The significance of calibration lies in its critical role in guaranteeing
Externí odkaz:
http://arxiv.org/abs/2406.18065
Autor:
Shen, Yao, Gao, Yingying, Hao, Yaqian, Hu, Chenguang, Zhang, Fulin, Feng, Junlan, Zhang, Shilei
Noisy labels are inevitable, even in well-annotated datasets. The detection of noisy labels is of significant importance to enhance the robustness of speaker recognition models. In this paper, we propose a novel noisy label detection approach based o
Externí odkaz:
http://arxiv.org/abs/2406.13268
Pre-trained speech language models such as HuBERT and WavLM leverage unlabeled speech data for self-supervised learning and offer powerful representations for numerous downstream tasks. Despite the success of these models, their high requirements for
Externí odkaz:
http://arxiv.org/abs/2406.09444
Autor:
Yang, Runyan, Yang, Huibao, Zhang, Xiqing, Ye, Tiantian, Liu, Ying, Gao, Yingying, Zhang, Shilei, Deng, Chao, Feng, Junlan
Recently, there have been attempts to integrate various speech processing tasks into a unified model. However, few previous works directly demonstrated that joint optimization of diverse tasks in multitask speech models has positive influence on the
Externí odkaz:
http://arxiv.org/abs/2406.07801
The expectation to deploy a universal neural network for speech enhancement, with the aim of improving noise robustness across diverse speech processing tasks, faces challenges due to the existing lack of awareness within static speech enhancement fr
Externí odkaz:
http://arxiv.org/abs/2402.12746
Publikováno v:
International Journal of Distributed Sensor Networks, Vol 16 (2020)
This article aims to provide an efficient fault diagnosis method for gearbox. A self-organizing map–based fault model is developed to provide effective diagnosis of the faults of gearboxes using the gear signals extracted from gearboxes operating w
Externí odkaz:
https://doaj.org/article/2c8a830beb4148d080773ffd44ca0140
Cascading multiple pre-trained models is an effective way to compose an end-to-end system. However, fine-tuning the full cascaded model is parameter and memory inefficient and our observations reveal that only applying adapter modules on cascaded mod
Externí odkaz:
http://arxiv.org/abs/2310.17664