Search results - "Siong, Chng Eng"

Report

Text-based Talking Video Editing with Cascaded Conditional Diffusion

Authors: Han, Bo; Zou, Heqing; Li, Haoyang; Wang, Guangcong; Siong, Chng Eng

Text-based talking-head video editing aims to efficiently insert, delete, and substitute segments of talking videos through a user-friendly text editing approach. It is challenging because of 1) generalizable talking-face representation, ...

External link: http://arxiv.org/abs/2407.14841

Report

Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures

Authors: Yip, Jia Qi; Ng, Dianwen; Ma, Bin; Siong, Chng Eng

Despite recent strides made in Speech Separation, most models are trained on datasets with neutral emotions. Emotional speech has been known to degrade performance of models in a variety of speech tasks, which reduces the effectiveness of these model

External link: http://arxiv.org/abs/2309.07458

Report

Study of GANs for Noisy Speech Simulation from Clean Speech

Authors: Maben, Leander Melroy; Guo, Zixun; Chen, Chen; Chudiwal, Utkarsh; Siong, Chng Eng

The performance of speech processing models trained on clean speech drops significantly in noisy conditions. Training with noisy datasets alleviates the problem, but procuring such datasets is not always feasible. Noisy speech simulation models that

External link: http://arxiv.org/abs/2305.12460

Report

Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning

Authors: Koh, Andrew; Tiwari, Soham; Siong, Chng Eng

In this paper, we propose an algorithm, Epochal Difficult Captions, to supplement the training of any model for the Automated Audio Captioning task. Epochal Difficult Captions is an elegant evolution to the keyword estimation task that previous work

External link: http://arxiv.org/abs/2206.01918

Academic article

Noise-aware network with shared channel-attention encoder and joint constraint for noisy speech separation

Authors: Sun, Linhui; Zhou, Xiaolong; Gong, Aifei; Ye, Lei; Li, Pingan; Siong Chng, Eng

Published in: Digital Signal Processing, February 2025, 157

Report

Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model

Authors: Gupta, Tarun; Truong, Duc-Tuan; Anh, Tran The; Siong, Chng Eng

The estimation of speaker characteristics such as age and height is a challenging task, having numerous applications in voice forensic analysis. In this work, we propose a bi-encoder transformer mixture model for speaker age and height estimation. Co

External link: http://arxiv.org/abs/2203.11774

Report

Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling

Authors: Rajaa, Shangeth; Van Tung, Pham; Siong, Chng Eng

Speaker profiling, which aims to estimate speaker characteristics such as age and height, has a wide range of applications in forensics, recommendation systems, etc. In this work, we propose a semi-supervised learning approach to mitigate the issue of

External link: http://arxiv.org/abs/2110.13653

Book

Modelling Public Sentiment in Twitter: Using Linguistic Patterns to Enhance Supervised Learning.

Authors: Chikersal, Prerna; Poria, Soujanya; Cambria, Erik; Gelbukh, Alexander; Siong, Chng Eng

Published in: Computational Linguistics & Intelligent Text Processing 16th International Conference, CICLing 2015, Cairo, Egypt, April 14-20, 2015, Proceedings, Part II; 2015, p49-65, 17p

Conference

A robust sound event recognition framework under TV playing conditions.

Authors: Terence, Ng Wen Zheng; Dat, Tran Huy; Dennis, Jonathan; Siong, Chng Eng

Published in: 2013 Asia-Pacific Signal & Information Processing Association Annual Summit & Conference; 2013, p1-5, 5p
