Zobrazeno 1 - 9
of 9
pro vyhledávání: '"Aciel Eshky"'
Autor:
Eleanor Sugden, Manuel Sam Ribeiro, Korin Richmond, Aciel Eshky, Joanne Cleland, Steve Renals
Publikováno v:
Eshky, A, Cleland, J, Ribeiro, M S, Sugden, E, Richmond, K & Renals, S 2021, ' Automatic audiovisual synchronisation for ultrasound tongue imaging ', Speech Communication, vol. 132, pp. 83-95 . https://doi.org/10.1016/j.specom.2021.05.008
Ultrasound tongue imaging is used to visualise the intra-oral articulators during speech production. It is utilised in a range of applications, including speech and language therapy and phonetics research. Ultrasound and speech audio are recorded sim
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::36ab08ddc455d9a8e7feae7afb3c9e5f
https://hdl.handle.net/20.500.11820/9e592bfa-43d4-4093-91cc-e0a9d052cf8a
https://hdl.handle.net/20.500.11820/9e592bfa-43d4-4093-91cc-e0a9d052cf8a
Publikováno v:
Ribeiro, M S, Cleland, J, Eshky, A, Richmond, K & Renals, S 2021, ' Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors ', Speech Communication, vol. 128, pp. 24-34 . https://doi.org/10.1016/j.specom.2021.02.001
Speech sound disorders are a common communication impairment in childhood. Because speech disorders can negatively affect the lives and the development of children, clinical intervention is often recommended. To help with diagnosis and treatment, cli
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::af82896b0b787a2b53764699debbdf95
Publikováno v:
Ribeiro, M S, Eshky, A, Richmond, K & Renals, S 2021, Silent versus modal multi-speaker speech recognition from ultrasound and video . in 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021 . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 641-645, Interspeech 2021, Brno, Czech Republic, 30/08/21 . https://doi.org/10.21437/Interspeech.2021-23
We investigate multi-speaker speech recognition from ultrasound images of the tongue and video images of the lips. We train our systems on imaging data from modal speech, and evaluate on matched test sets of two speaking modes: silent and modal speec
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c45cdd0287786ecaa2566a62abfc2422
Autor:
Jing-Xuan Zhang, Aciel Eshky, Manuel Sam Ribeiro, Jennifer Sanger, Korin Richmond, Steve Renals, Alan A Wrench
Publikováno v:
SLT
Ribeiro, M S, Sanger, J, Zhang, J-X, Eshky, A, Wrench, A, Richmond, K & Renals, S 2021, TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos . in 2021 IEEE Spoken Language Technology Workshop (SLT) . Institute of Electrical and Electronics Engineers (IEEE), pp. 1109-1116, IEEE Spoken Language Technology Workshop, 19/01/21 . https://doi.org/10.1109/SLT48900.2021.9383619
Ribeiro, M S, Sanger, J, Zhang, J-X, Eshky, A, Wrench, A, Richmond, K & Renals, S 2021, TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos . in 2021 IEEE Spoken Language Technology Workshop (SLT) . Institute of Electrical and Electronics Engineers (IEEE), pp. 1109-1116, IEEE Spoken Language Technology Workshop, 19/01/21 . https://doi.org/10.1109/SLT48900.2021.9383619
We present the Tongue and Lips corpus (TaL), a multi-speaker corpus of audio, ultrasound tongue imaging, and lip videos. TaL consists of two parts: TaL1 is a set of six recording sessions of one professional voice talent, a male native speaker of Eng
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8727b0a1cf799c0770a78413d95c697a
http://arxiv.org/abs/2011.09804
http://arxiv.org/abs/2011.09804
Publikováno v:
Ribeiro, M, Eshky, A, Richmond, K & Renals, S 2019, Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions . in INTERSPEECH 2019: Proceedings of the 20th Annual Conference of the International Speech Communication Association (ISCA) . Graz, Austria, pp. 16-20, Interspeech 2019, Graz, Austria, 15/09/19 . https://doi.org/10.21437/Interspeech.2019-2612
INTERSPEECH
INTERSPEECH
We investigate the automatic processing of child speech therapy sessions using ultrasound visual biofeedback, with a specific focus on complementing acoustic features with ultrasound images of the tongue for the tasks of speaker diarization and time-
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::00be2c8a431f6596b8282129813aef2c
http://arxiv.org/abs/1907.00818
http://arxiv.org/abs/1907.00818
Publikováno v:
Ribeiro, M S, Eshky, A, Richmond, K & Renals, S 2019, Speaker-Independent Classification of Phonetic Segments from Raw Ultrasound in Child Speech . in ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . Institute of Electrical and Electronics Engineers (IEEE), Brighton, United Kingdom, pp. 1328-1332, 44th International Conference on Acoustics, Speech, and Signal Processing, Brighton, United Kingdom, 12/05/19 . https://doi.org/10.1109/ICASSP.2019.8683564
ICASSP
ICASSP
Ultrasound tongue imaging (UTI) provides a convenient way to visualize the vocal tract during speech production. UTI is increasingly being used for speech therapy, making it important to develop automatic methods to assist various time-consuming manu
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f1cad5428860707b7525ae1afa6658a1
https://hdl.handle.net/20.500.11820/87071137-b539-4896-b73b-550151ee5cd0
https://hdl.handle.net/20.500.11820/87071137-b539-4896-b73b-550151ee5cd0
Autor:
Alan A Wrench, Manuel Sam Ribeiro, James M. Scobbie, Korin Richmond, Aciel Eshky, Zoe Roxburgh, Joanne Cleland
Publikováno v:
Interspeech 2018
Eshky, A, Ribeiro, M S, Cleland, J, Richmond, K, Roxburgh, Z, Scobbie, J & Wrench, A 2018, UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions . in INTERSPEECH 2018: Proceedings of the 19th Annual Conference of the International Speech Communication Association (ISCA) . Hyderabad, India, pp. 1888-1892, Interspeech 2018, Hyderabad, India, 2/09/18 . https://doi.org/10.21437/Interspeech.2018-1736
INTERSPEECH
Eshky, A, Ribeiro, M S, Cleland, J, Richmond, K, Roxburgh, Z, Scobbie, J & Wrench, A 2018, UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions . in INTERSPEECH 2018: Proceedings of the 19th Annual Conference of the International Speech Communication Association (ISCA) . Hyderabad, India, pp. 1888-1892, Interspeech 2018, Hyderabad, India, 2/09/18 . https://doi.org/10.21437/Interspeech.2018-1736
INTERSPEECH
We introduce UltraSuite, a curated repository of ultrasound and acoustic data, collected from recordings of child speech therapy sessions. This release includes three data collections, one from typically developing children and two from children with
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::93e75f1c368fcc149bbc45558f734782
Publikováno v:
Eshky, A, Ribeiro, M, Richmond, K & Renals, S 2019, Synchronising audio and ultrasound by learning cross-modal embeddings . in INTERSPEECH 2019: Proceedings of the 20th Annual Conference of the International Speech Communication Association (ISCA) . Graz, Austria, pp. 4100-4104, Interspeech 2019, Graz, Austria, 15/09/19 . https://doi.org/10.21437/Interspeech.2019-1804
INTERSPEECH
INTERSPEECH
Audiovisual synchronisation is the task of determining the time offset between speech audio and a video recording of the articulators. In child speech therapy, audio and ultrasound videos of the tongue are captured using instruments which rely on har
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b0f31c006ec6c153e8c743893a09aa12
Publikováno v:
Eshky, A, Allison, B, Ramamoorthy, S & Steedman, M 2014, A Generative Model for User Simulation in a Spatial Navigation Domain . in Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics . Gothenburg, Sweden, pp. 626-635 . < http://www.aclweb.org/anthology/E14-1066 >
EACL
EACL
We propose the use of a generative model to simulate user behaviour in a novel task-oriented dialog domain, where user goals are spatial routes across artificial landscapes. We show how to derive an efficient feature-based representation of spatial g
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7ed844ae50af1a1733978af2dbf42640
https://www.pure.ed.ac.uk/ws/files/20099313/E14_1066.pdf
https://www.pure.ed.ac.uk/ws/files/20099313/E14_1066.pdf