Výsledky vyhledávání - "Liang, Chengdong"

Report

Spatial-temporal Graph Based Multi-channel Speaker Verification With Ad-hoc Microphone Arrays

Autor: Chen, Yijiang, Liang, Chengdong, Zhang, Xiao-Lei

The performance of speaker verification degrades significantly in adverse acoustic environments with strong reverberation and noise. To address this issue, this paper proposes a spatial-temporal graph convolutional network (GCN) method for the multi-

Externí odkaz: http://arxiv.org/abs/2307.01386

Zobrazit plný text záznamu

Report

Wespeaker baselines for VoxSRC2023

Autor: Wang, Shuai, Liang, Chengdong, Xiang, Xu, Han, Bing, Chen, Zhengyang, Wang, Hongji, Ding, Wen

This report showcases the results achieved using the wespeaker toolkit for the VoxSRC2023 Challenge. Our aim is to provide participants, especially those with limited experience, with clear and straightforward guidelines to develop their initial syst

Externí odkaz: http://arxiv.org/abs/2306.15161

Zobrazit plný text záznamu

Report

Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames

Autor: Liang, Chengdong, Zhang, Xiao-Lei, Zhang, BinBin, Wu, Di, Li, Shengqiang, Song, Xingchen, Peng, Zhendong, Pan, Fuping

Recently, the unified streaming and non-streaming two-pass (U2/U2++) end-to-end model for speech recognition has shown great performance in terms of streaming capability, accuracy and latency. In this paper, we present fast-U2++, an enhanced version

Externí odkaz: http://arxiv.org/abs/2211.00941

Zobrazit plný text záznamu

Report

Wespeaker: A Research and Production oriented Speaker Embedding Learning Toolkit

Autor: Wang, Hongji, Liang, Chengdong, Wang, Shuai, Chen, Zhengyang, Zhang, Binbin, Xiang, Xu, Deng, Yanlei, Qian, Yanmin

Speaker modeling is essential for many related tasks, such as speaker recognition and speaker diarization. The dominant modeling approach is fixed-dimensional vector representation, i.e., speaker embedding. This paper introduces a research and produc

Externí odkaz: http://arxiv.org/abs/2210.17016

Zobrazit plný text záznamu

Report

Deep Learning Based Stage-wise Two-dimensional Speaker Localization with Large Ad-hoc Microphone Arrays

Autor: Liu, Shupei, Feng, Linfeng, Gong, Yijun, Liang, Chengdong, Zhang, Chen, Zhang, Xiao-Lei, Li, Xuelong

While deep-learning-based speaker localization has shown advantages in challenging acoustic environments, it often yields only direction-of-arrival (DOA) cues rather than precise two-dimensional (2D) coordinates. To address this, we propose a novel d

Externí odkaz: http://arxiv.org/abs/2210.10265

Zobrazit plný text záznamu

Akademický článek

Control of N/P ratios and cut-off voltage for Silicon-Based Li-ion batteries

Autor: Zhang, Hengtong, Gao, Yike, Zhu, Gaolong, Tan, Tiening, Liang, Chengdong, Hao, Shuai, Zhao, Chang, Chen, Wei, Ren, Dongsheng

Publikováno v: In Chemical Engineering Journal 1 November 2024 499

Zobrazit plný text záznamu

Report

Multi-Channel Far-Field Speaker Verification with Large-Scale Ad-hoc Microphone Arrays

Autor: Liang, Chengdong, Chen, Yijiang, Yao, Jiadi, Zhang, Xiao-Lei

Speaker verification based on ad-hoc microphone arrays has the potential of reducing the error significantly in adverse acoustic environments. However, existing approaches extract utterance-level speaker embeddings from each channel of an ad-hoc micr

Externí odkaz: http://arxiv.org/abs/2110.05975

Zobrazit plný text záznamu

Akademický článek

Advancing speaker embedding learning: Wespeaker toolkit for research and production

Autor: Wang, Shuai, Chen, Zhengyang, Han, Bing, Wang, Hongji, Liang, Chengdong, Zhang, Binbin, Xiang, Xu, Ding, Wen, Rohdin, Johan, Silnova, Anna, Qian, Yanmin, Li, Haizhou

Publikováno v: In Speech Communication July 2024 162

Zobrazit plný text záznamu

Report

AUC Optimization for Robust Small-footprint Keyword Spotting with Limited Training Data

Autor: Xu, Menglong, Li, Shengqiang, Liang, Chengdong, Zhang, Xiao-Lei

Deep neural networks provide effective solutions to small-footprint keyword spotting (KWS). However, if training data is limited, it remains challenging to achieve robust and highly accurate KWS in real-world scenarios where unseen sounds that are ou

Externí odkaz: http://arxiv.org/abs/2107.05859

Zobrazit plný text záznamu

Report

Attention-based multi-channel speaker verification with ad-hoc microphone arrays

Autor: Liang, Chengdong, Chen, Junqi, Guan, Shanzheng, Zhang, Xiao-Lei

Recently, ad-hoc microphone array has been widely studied. Unlike traditional microphone array settings, the spatial arrangement and number of microphones of ad-hoc microphone arrays are not known in advance, which hinders the adaptation of tradition

Externí odkaz: http://arxiv.org/abs/2107.00178

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání