Search results - "Xiulian Peng"

Text Image Super-Resolution Guided by Text Structure and Embedding Priors

Authors: Cong Huang, Xiulian Peng, Dong Liu, Yan Lu

Published in: ACM Transactions on Multimedia Computing, Communications, and Applications.

We aim to super-resolve text images from unrecognizable low-resolution inputs. Existing super-resolution methods mainly learn a direct mapping from low-resolution to high-resolution images by exploring low-level features, which usually generate blurr

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::fad7b9c89fd72c8bdebbb7224a7027a9
https://doi.org/10.1145/3595924

Reference-Based Speech Enhancement via Feature Alignment and Fusion Network

Authors: Huanjing Yue, Wenxin Duo, Xiulian Peng, Jingyu Yang

Published in: Proceedings of the AAAI Conference on Artificial Intelligence. 36:11648-11656

Speech enhancement aims at recovering a clean speech from a noisy input, which can be classified into single speech enhancement and personalized speech enhancement. Personalized speech enhancement usually utilizes the speaker identity extracted from

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::72a2f3f830120eb1ad06142479fd70d8
https://doi.org/10.1609/aaai.v36i10.21419

Time-Variance Aware Dynamic Kernel Generation for Real-Time Acoustic Echo Cancellation

Authors: Chengyu Zheng, Yuan Zhou, Xiulian Peng, Yuan Zhang, Yan Lu

Published in: IEEE Signal Processing Letters. 29:967-971

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::6d2e1d005b362e1db2fec20dca3a2061
https://doi.org/10.1109/lsp.2022.3164359

Improving Speech Enhancement via Event-based Query

Authors: Yifei Xin, Xiulian Peng, Yan Lu

Existing deep learning based speech enhancement (SE) methods either use blind end-to-end training or explicitly incorporate speaker embedding or phonetic information into the SE network to enhance speech quality. In this paper, we perceive speech and

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3450ea650fe274ed5b63151fb61a6c1e
http://arxiv.org/abs/2302.11558

Real-time speech enhancement with dynamic attention span

Authors: Chengyu Zheng, Yuan Zhou, Xiulian Peng, Yuan Zhang, Yan Lu

For real-time speech enhancement (SE) including noise suppression, dereverberation and acoustic echo cancellation, the time-variance of the audio signals becomes a severe challenge. The causality and memory usage limit that only the historical inform

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::71f1a81d15ee632cf50c0fc42700fcde

Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation

Authors: Xiaoyu Wang, Xiangyu Kong, Xiulian Peng, Yan Lu

In this paper we propose a multi-modal multi-correlation learning framework targeting at the task of audio-visual speech separation. Although previous efforts have been extensively put on combining audio and visual modalities, most of them solely ado

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7570676379c0fcb223e9533fa6c98b47
http://arxiv.org/abs/2207.01197

Latent-Domain Predictive Neural Speech Coding

Authors: Xue Jiang, Xiulian Peng, Huaying Xue, Yuan Zhang, Yan Lu

Neural audio/speech coding has recently demonstrated its capability to deliver high quality at much lower bitrates than traditional methods. However, existing neural audio/speech codecs employ either acoustic features or learned blind features with a

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8d685579b284afd959c4dd02fba9e241

Traffic surveillance video coding with libraries of vehicles and background

Authors: Xiulian Peng, Changyue Ma, Li Li, Dong Liu, Feng Wu

Published in: Journal of Visual Communication and Image Representation. 60:426-440

This paper presents a video coding scheme tailored for traffic surveillance videos, which features a pre-built library that is utilized in both encoder and decoder to pursue higher compression efficiency. We are motivated by the observation that, in

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::4ebb3d43a797b78d8c56b28f943a0566
https://doi.org/10.1016/j.jvcir.2019.03.009

Phoneme-based Distribution Regularization for Speech Enhancement

Authors: Xiulian Peng, Yan Lu, Zhiwei Xiong, Yajing Liu

Published in: ICASSP

Existing speech enhancement methods mainly separate speech from noises at the signal level or in the time-frequency domain. They seldom pay attention to the semantic information of a corrupted signal. In this paper, we aim to bridge this gap by extra

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::874eff1db3090f2e3eb3569d413e84cd

Unequal Error Protection for Scalable Video Storage in the Cloud

Authors: Xiaodan Song, Guangming Shi, Jizheng Xu, Feng Wu, Xiulian Peng

Published in: ICME

Redundancy is necessary for a storage system to recover from errors. The frequent errors in large-scale systems, e.g. cloud, make it desired to reduce the recovery cost. Among all kinds of data stored in the cloud, video takes a large portion due to

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d80c34dfadc77574e0df461c6773892c
https://doi.org/10.1109/tmm.2017.2751147
