Zobrazeno 1 - 10
of 215
pro vyhledávání: '"Renals P"'
In this paper, we analyse the error patterns of the raw waveform acoustic models in TIMIT's phone recognition task. Our analysis goes beyond the conventional phone error rate (PER) metric. We categorise the phones into three groups: {affricate, dipht
Externí odkaz:
http://arxiv.org/abs/2406.00898
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022
We study the problem of learning robust acoustic models in adverse environments, characterized by a significant mismatch between training and test conditions. This problem is of paramount importance for the deployment of speech recognition systems th
Externí odkaz:
http://arxiv.org/abs/2110.08634
Autor:
Eshky, Aciel, Cleland, Joanne, Ribeiro, Manuel Sam, Sugden, Eleanor, Richmond, Korin, Renals, Steve
Ultrasound tongue imaging is used to visualise the intra-oral articulators during speech production. It is utilised in a range of applications, including speech and language therapy and phonetics research. Ultrasound and speech audio are recorded sim
Externí odkaz:
http://arxiv.org/abs/2105.15162
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Publikováno v:
Speech Communication, Volume 128, April 2021, Pages 24-34
Speech sound disorders are a common communication impairment in childhood. Because speech disorders can negatively affect the lives and the development of children, clinical intervention is often recommended. To help with diagnosis and treatment, cli
Externí odkaz:
http://arxiv.org/abs/2103.00324
We investigate multi-speaker speech recognition from ultrasound images of the tongue and video images of the lips. We train our systems on imaging data from modal speech, and evaluate on matched test sets of two speaking modes: silent and modal speec
Externí odkaz:
http://arxiv.org/abs/2103.00333
Although the lower layers of a deep neural network learn features which are transferable across datasets, these layers are not transferable within the same dataset. That is, in general, freezing the trained feature extractor (the lower layers) and re
Externí odkaz:
http://arxiv.org/abs/2102.04697
Autor:
Mayuri Gogoi, Christopher A. Martin, Paul W. Bird, Martin J. Wiselka, Judi Gardener, Kate Ellis, Valerie Renals, Adam J. Lewszuk, Sally Hargreaves, Manish Pareek
Publikováno v:
Journal of Migration and Health, Vol 9, Iss , Pp 100217- (2024)
Background: Vaccine preventable diseases (VPDs) such as measles and rubella cause significant morbidity and mortality globally every year. The World Health Organization (WHO), reported vaccine coverage for both measles and rubella to be 71 % in 2019,
Externí odkaz:
https://doaj.org/article/a2d3d709423d4c65b44b3ba999588595
Autor:
Ribeiro, Manuel Sam, Sanger, Jennifer, Zhang, Jing-Xuan, Eshky, Aciel, Wrench, Alan, Richmond, Korin, Renals, Steve
We present the Tongue and Lips corpus (TaL), a multi-speaker corpus of audio, ultrasound tongue imaging, and lip videos. TaL consists of two parts: TaL1 is a set of six recording sessions of one professional voice talent, a male native speaker of Eng
Externí odkaz:
http://arxiv.org/abs/2011.09804
Self-attention models such as Transformers, which can capture temporal relationships without being limited by the distance between events, have given competitive speech recognition results. However, we note the range of the learned context increases
Externí odkaz:
http://arxiv.org/abs/2011.04906