Showing 1 - 10 of 20 for search: '"Siong, Chng Eng"'
Text-based talking-head video editing aims to efficiently insert, delete, and substitute segments of talking videos through a user-friendly text editing approach. It is challenging because of 1) generalizable talking-face representation,
External link:
http://arxiv.org/abs/2407.14841
Despite recent strides made in Speech Separation, most models are trained on datasets with neutral emotions. Emotional speech has been known to degrade performance of models in a variety of speech tasks, which reduces the effectiveness of these model
External link:
http://arxiv.org/abs/2309.07458
The performance of speech processing models trained on clean speech drops significantly in noisy conditions. Training with noisy datasets alleviates the problem, but procuring such datasets is not always feasible. Noisy speech simulation models that
External link:
http://arxiv.org/abs/2305.12460
In this paper, we propose an algorithm, Epochal Difficult Captions, to supplement the training of any model for the Automated Audio Captioning task. Epochal Difficult Captions is an elegant evolution to the keyword estimation task that previous work
External link:
http://arxiv.org/abs/2206.01918
Published in:
Digital Signal Processing, February 2025, 157
The estimation of speaker characteristics such as age and height is a challenging task, having numerous applications in voice forensic analysis. In this work, we propose a bi-encoder transformer mixture model for speaker age and height estimation. Co
External link:
http://arxiv.org/abs/2203.11774
Speaker profiling, which aims to estimate speaker characteristics such as age and height, has a wide range of applications in forensics, recommendation systems, etc. In this work, we propose a semi-supervised learning approach to mitigate the issue of
External link:
http://arxiv.org/abs/2110.13653
Published in:
Computational Linguistics & Intelligent Text Processing 16th International Conference, CICLing 2015, Cairo, Egypt, April 14-20, 2015, Proceedings, Part II; 2015, p49-65, 17p
Academic article
Published in:
2013 Asia-Pacific Signal & Information Processing Association Annual Summit & Conference; 2013, p1-5, 5p