Výsledky vyhledávání

Automatic audiovisual synchronisation for ultrasound tongue imaging

Autor: Eleanor Sugden, Manuel Sam Ribeiro, Korin Richmond, Aciel Eshky, Joanne Cleland, Steve Renals

Publikováno v: Eshky, A, Cleland, J, Ribeiro, M S, Sugden, E, Richmond, K & Renals, S 2021, ' Automatic audiovisual synchronisation for ultrasound tongue imaging ', Speech Communication, vol. 132, pp. 83-95 . https://doi.org/10.1016/j.specom.2021.05.008

Ultrasound tongue imaging is used to visualise the intra-oral articulators during speech production. It is utilised in a range of applications, including speech and language therapy and phonetics research. Ultrasound and speech audio are recorded sim

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::36ab08ddc455d9a8e7feae7afb3c9e5f
https://hdl.handle.net/20.500.11820/9e592bfa-43d4-4093-91cc-e0a9d052cf8a

Zobrazit plný text záznamu

Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors

Autor: Aciel Eshky, Joanne Cleland, Manuel Sam Ribeiro, Steve Renals, Korin Richmond

Publikováno v: Ribeiro, M S, Cleland, J, Eshky, A, Richmond, K & Renals, S 2021, ' Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors ', Speech Communication, vol. 128, pp. 24-34 . https://doi.org/10.1016/j.specom.2021.02.001

Speech sound disorders are a common communication impairment in childhood. Because speech disorders can negatively affect the lives and the development of children, clinical intervention is often recommended. To help with diagnosis and treatment, cli

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::af82896b0b787a2b53764699debbdf95

Zobrazit plný text záznamu

Silent versus modal multi-speaker speech recognition from ultrasound and video

Autor: Steve Renals, Korin Richmond, Aciel Eshky, Manuel Sam Ribeiro

Publikováno v: Ribeiro, M S, Eshky, A, Richmond, K & Renals, S 2021, Silent versus modal multi-speaker speech recognition from ultrasound and video . in 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021 . Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 641-645, Interspeech 2021, Brno, Czech Republic, 30/08/21 . https://doi.org/10.21437/Interspeech.2021-23

We investigate multi-speaker speech recognition from ultrasound images of the tongue and video images of the lips. We train our systems on imaging data from modal speech, and evaluate on matched test sets of two speaking modes: silent and modal speec

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c45cdd0287786ecaa2566a62abfc2422

Zobrazit plný text záznamu

TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos

Autor: Jing-Xuan Zhang, Aciel Eshky, Manuel Sam Ribeiro, Jennifer Sanger, Korin Richmond, Steve Renals, Alan A Wrench

Publikováno v: SLT
Ribeiro, M S, Sanger, J, Zhang, J-X, Eshky, A, Wrench, A, Richmond, K & Renals, S 2021, TaL: a synchronised multi-speaker corpus of ultrasound tongue imaging, audio, and lip videos . in 2021 IEEE Spoken Language Technology Workshop (SLT) . Institute of Electrical and Electronics Engineers (IEEE), pp. 1109-1116, IEEE Spoken Language Technology Workshop, 19/01/21 . https://doi.org/10.1109/SLT48900.2021.9383619

We present the Tongue and Lips corpus (TaL), a multi-speaker corpus of audio, ultrasound tongue imaging, and lip videos. TaL consists of two parts: TaL1 is a set of six recording sessions of one professional voice talent, a male native speaker of Eng

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8727b0a1cf799c0770a78413d95c697a
http://arxiv.org/abs/2011.09804

Zobrazit plný text záznamu

Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions

Autor: Manuel Sam Ribeiro, Korin Richmond, Aciel Eshky, Steve Renals

Publikováno v: Ribeiro, M, Eshky, A, Richmond, K & Renals, S 2019, Ultrasound tongue imaging for diarization and alignment of child speech therapy sessions . in INTERSPEECH 2019: Proceedings of the 20th Annual Conference of the International Speech Communication Association (ISCA) . Graz, Austria, pp. 16-20, Interspeech 2019, Graz, Austria, 15/09/19 . https://doi.org/10.21437/Interspeech.2019-2612
INTERSPEECH

We investigate the automatic processing of child speech therapy sessions using ultrasound visual biofeedback, with a specific focus on complementing acoustic features with ultrasound images of the tongue for the tasks of speaker diarization and time-

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::00be2c8a431f6596b8282129813aef2c
http://arxiv.org/abs/1907.00818

Zobrazit plný text záznamu

Speaker-Independent Classification of Phonetic Segments from Raw Ultrasound in Child Speech

Autor: Aciel Eshky, Manuel Sam Ribeiro, Steve Renals, Korin Richmond

Publikováno v: Ribeiro, M S, Eshky, A, Richmond, K & Renals, S 2019, Speaker-Independent Classification of Phonetic Segments from Raw Ultrasound in Child Speech . in ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) . Institute of Electrical and Electronics Engineers (IEEE), Brighton, United Kingdom, pp. 1328-1332, 44th International Conference on Acoustics, Speech, and Signal Processing, Brighton, United Kingdom, 12/05/19 . https://doi.org/10.1109/ICASSP.2019.8683564
ICASSP

Ultrasound tongue imaging (UTI) provides a convenient way to visualize the vocal tract during speech production. UTI is increasingly being used for speech therapy, making it important to develop automatic methods to assist various time-consuming manu

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f1cad5428860707b7525ae1afa6658a1
https://hdl.handle.net/20.500.11820/87071137-b539-4896-b73b-550151ee5cd0

Zobrazit plný text záznamu

UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions

Autor: Alan A Wrench, Manuel Sam Ribeiro, James M. Scobbie, Korin Richmond, Aciel Eshky, Zoe Roxburgh, Joanne Cleland

Publikováno v: Interspeech 2018
Eshky, A, Ribeiro, M S, Cleland, J, Richmond, K, Roxburgh, Z, Scobbie, J & Wrench, A 2018, UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions . in INTERSPEECH 2018: Proceedings of the 19th Annual Conference of the International Speech Communication Association (ISCA) . Hyderabad, India, pp. 1888-1892, Interspeech 2018, Hyderabad, India, 2/09/18 . https://doi.org/10.21437/Interspeech.2018-1736
INTERSPEECH

We introduce UltraSuite, a curated repository of ultrasound and acoustic data, collected from recordings of child speech therapy sessions. This release includes three data collections, one from typically developing children and two from children with

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::93e75f1c368fcc149bbc45558f734782

Zobrazit plný text záznamu

Synchronising audio and ultrasound by learning cross-modal embeddings

Autor: Manuel Sam Ribeiro, Steve Renals, Korin Richmond, Aciel Eshky

Publikováno v: Eshky, A, Ribeiro, M, Richmond, K & Renals, S 2019, Synchronising audio and ultrasound by learning cross-modal embeddings . in INTERSPEECH 2019: Proceedings of the 20th Annual Conference of the International Speech Communication Association (ISCA) . Graz, Austria, pp. 4100-4104, Interspeech 2019, Graz, Austria, 15/09/19 . https://doi.org/10.21437/Interspeech.2019-1804
INTERSPEECH

Audiovisual synchronisation is the task of determining the time offset between speech audio and a video recording of the articulators. In child speech therapy, audio and ultrasound videos of the tongue are captured using instruments which rely on har

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b0f31c006ec6c153e8c743893a09aa12

Zobrazit plný text záznamu

A Generative Model for User Simulation in a Spatial Navigation Domain

Autor: Subramanian Ramamoorthy, Aciel Eshky, Ben Allison, Mark Steedman

Publikováno v: Eshky, A, Allison, B, Ramamoorthy, S & Steedman, M 2014, A Generative Model for User Simulation in a Spatial Navigation Domain . in Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics . Gothenburg, Sweden, pp. 626-635 . < http://www.aclweb.org/anthology/E14-1066 >
EACL

We propose the use of a generative model to simulate user behaviour in a novel task-oriented dialog domain, where user goals are spatial routes across artificial landscapes. We show how to derive an efficient feature-based representation of spatial g

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7ed844ae50af1a1733978af2dbf42640
https://www.pure.ed.ac.uk/ws/files/20099313/E14_1066.pdf

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání