Search results

Report

A Multimodal Framework for the Assessment of the Schizophrenia Spectrum

Authors: Premananth, Gowtham; Siriwardena, Yashish M.; Resnik, Philip; Bansal, Sonia; Kelly, Deanna L.; Espy-Wilson, Carol

This paper presents a novel multimodal framework to distinguish between different symptom classes of subjects in the schizophrenia spectrum and healthy controls using audio, video, and text modalities. We implemented Convolution Neural Network and Lo

External link: http://arxiv.org/abs/2406.09706

Report

Accent Conversion with Articulatory Representations

Authors: Siriwardena, Yashish M.; Swedlow, Nathan; Howard, Audrey; Gitterman, Evan; Darcy, Dan; Espy-Wilson, Carol; Fanelli, Andrea

Conversion of non-native accented speech to native (American) English has a wide range of applications such as improving intelligibility of non-native speech. Previous work on this domain has used phonetic posteriograms as the target speech represent

External link: http://arxiv.org/abs/2406.05947

Report

Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings

Authors: Attia, Ahmed Adel; Demszky, Dorottya; Ogunremi, Tolulope; Liu, Jing; Espy-Wilson, Carol

Creating Automatic Speech Recognition (ASR) systems that are robust and resilient to classroom conditions is paramount to the development of AI tools to aid teachers and students. In this work, we study the efficacy of continued pretraining (CPT) in

External link: http://arxiv.org/abs/2405.13018

Report

Empirical model of SSUSI-derived auroral ionization rates

Authors: Bender, Stefan; Espy, Patrick J.; Paxton, Larry J.

We present an empirical model for auroral (90–150 km) electron–ion pair production rates, ionization rates for short, derived from SSUSI (Special Sensor Ultraviolet Spectrographic Imager) electron energy and flux data. Using the Fang et al., 2010 p

External link: http://arxiv.org/abs/2312.11130

Report

A multi-modal approach for identifying schizophrenia using cross-modal attention

Authors: Premananth, Gowtham; Siriwardena, Yashish M.; Resnik, Philip; Espy-Wilson, Carol

This study focuses on how different modalities of human communication can be used to distinguish between healthy controls and subjects with schizophrenia who exhibit strong positive symptoms. We developed a multi-modal schizophrenia classification sy

External link: http://arxiv.org/abs/2309.15136

Report

Improving Speech Inversion Through Self-Supervised Embeddings and Enhanced Tract Variables

Authors: Attia, Ahmed Adel; Siriwardena, Yashish M.; Espy-Wilson, Carol

The performance of deep learning models depends significantly on their capacity to encode input features efficiently and decode them into meaningful outputs. Better input and output representation has the potential to boost models' performance and ge

External link: http://arxiv.org/abs/2309.09220

Report

Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults

Authors: Attia, Ahmed Adel; Liu, Jing; Ai, Wei; Demszky, Dorottya; Espy-Wilson, Carol

Recent advancements in Automatic Speech Recognition (ASR) systems, exemplified by Whisper, have demonstrated the potential of these systems to approach human-level performance given sufficient data. However, this progress doesn't readily extend to AS

External link: http://arxiv.org/abs/2309.07927

Report

Speaker-independent Speech Inversion for Estimation of Nasalance

Authors: Siriwardena, Yashish M.; Espy-Wilson, Carol; Boyce, Suzanne; Tiede, Mark K.; Oren, Liran

The velopharyngeal (VP) valve regulates the opening between the nasal and oral cavities. This valve opens and closes through a coordinated motion of the velum and pharyngeal walls. Nasalance is an objective measure derived from the oral and nasal aco

External link: http://arxiv.org/abs/2306.00203

Report

Acoustic-to-Articulatory Speech Inversion Features for Mispronunciation Detection of /r/ in Child Speech Sound Disorders

Authors: Benway, Nina R; Siriwardena, Yashish M; Preston, Jonathan L; Hitchcock, Elaine; McAllister, Tara; Espy-Wilson, Carol

Published in: Proc. INTERSPEECH 2023, 4568-4572

Acoustic-to-articulatory speech inversion could enhance automated clinical mispronunciation detection to provide detailed articulatory feedback unattainable by formant-based mispronunciation detection algorithms; however, it is unclear the extent to

External link: http://arxiv.org/abs/2305.16085

Report

Enhancing Speech Articulation Analysis using a Geometric Transformation of the X-ray Microbeam Dataset

Authors: Attia, Ahmed Adel; Tiede, Mark; Espy-Wilson, Carol Y.

Accurate analysis of speech articulation is crucial for speech analysis. However, X-Y coordinates of articulators strongly depend on the anatomy of the speakers and the variability of pellet placements, and existing methods for mapping anatomical lan

External link: http://arxiv.org/abs/2305.10775
