Zobrazeno 1 - 10
of 161
pro vyhledávání: '"Thomas Hain"'
Publikováno v:
Journal of Big Data, Vol 8, Iss 1, Pp 1-16 (2021)
Abstract Speech based human-machine interaction and natural language understanding applications have seen a rapid development and wide adoption over the last few decades. This has led to a proliferation of studies that investigate Error detection and
Externí odkaz:
https://doaj.org/article/2a752c4a8ebc49488211eb508c5b87fc
Publikováno v:
Frontiers in Signal Processing, Vol 2 (2022)
Separation of speech mixtures in noisy and reverberant environments remains a challenging task for state-of-the-art speech separation systems. Time-domain audio speech separation networks (TasNets) are among the most commonly used network architectur
Externí odkaz:
https://doaj.org/article/02f17f0adaa14f3cbd21b78e7075c7b9
Publikováno v:
Interspeech 2022.
Publikováno v:
Interspeech 2022.
Multilingual speech recognition has drawn significant attention as an effective way to compensate data scarcity for low-resource languages. End-to-end (e2e) modelling is preferred over conventional hybrid systems, mainly because of no lexicon require
Publikováno v:
2022 International Workshop on Acoustic Signal Enhancement (IWAENC).
This paper proposes an unsupervised data selection method by using a submodular function based on contrastive loss ratios of target and training data sets. A model using a contrastive loss function is trained on both sets. Then the ratio of frame-lev
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5c469d023f7760bb389ca51cd287eb64
http://arxiv.org/abs/2207.12028
http://arxiv.org/abs/2207.12028
Publikováno v:
Journal of Big Data, Vol 8, Iss 1, Pp 1-16 (2021)
Speech based human-machine interaction and natural language understanding applications have seen a rapid development and wide adoption over the last few decades. This has led to a proliferation of studies that investigate Error detection and classifi
Autor:
Jose Antonio Lopez Saenz, Thomas Hain
Publikováno v:
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Separation of speech mixtures in noisy and reverberant environments remains a challenging task for state-of-the-art speech separation systems. Time-domain audio speech separation networks (TasNets) are among the most commonly used network architectur
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::128c8be9a2e3d04ff8c989b89630da43
https://eprints.whiterose.ac.uk/186716/1/frsip-02-856968.pdf
https://eprints.whiterose.ac.uk/186716/1/frsip-02-856968.pdf
Training of speech enhancement systems often does not incorporate knowledge of human perception and thus can lead to unnatural sounding results. Incorporating psychoacoustically motivated speech perception metrics as part of model training via a pred
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::db3add9d3ca4e0ed48f0223300380ff5
http://arxiv.org/abs/2203.12369
http://arxiv.org/abs/2203.12369