Search results - "Kadiri, Sudarsana Reddy"

Report

Can a Machine Distinguish High and Low Amount of Social Creak in Speech?

Authors: Laukkanen, Anne-Maria; Kadiri, Sudarsana Reddy; Narayanan, Shrikanth; Alku, Paavo

Objectives: Increased prevalence of social creak particularly among female speakers has been reported in several studies. The study of social creak has been previously conducted by combining perceptual evaluation of speech with conventional acoustical

External link: http://arxiv.org/abs/2410.17028

Report

Evaluation of state-of-the-art ASR Models in Child-Adult Interactions

Authors: Ashvin, Aditya; Lahiri, Rimita; Kommineni, Aditya; Bishop, Somer; Lord, Catherine; Kadiri, Sudarsana Reddy; Narayanan, Shrikanth

The ability to reliably transcribe child-adult conversations in a clinical setting is valuable for diagnosis and understanding of numerous developmental disorders such as Autism Spectrum Disorder. Recent advances in deep learning architectures and av

External link: http://arxiv.org/abs/2409.16135

Report

MMSD-Net: Towards Multi-modal Stuttering Detection

Authors: Nie, Liangyu; Kadiri, Sudarsana Reddy; Agrawal, Ruchit

Stuttering is a common speech impediment that is caused by irregular disruptions in speech production, affecting over 70 million people across the world. Standard automatic speech processing tools do not take speech ailments into account and are ther

External link: http://arxiv.org/abs/2407.11492

Report

Wav2vec-based Detection and Severity Level Classification of Dysarthria from Speech

Authors: Javanmardi, Farhad; Tirronen, Saska; Kodali, Manila; Kadiri, Sudarsana Reddy; Alku, Paavo

Published in: Proc. ICASSP, Rhodes Island, Greece, June 4-10, 2023

Automatic detection and severity level classification of dysarthria directly from acoustic speech signals can be used as a tool in medical diagnosis. In this work, the pre-trained wav2vec 2.0 model is studied as a feature extractor to build detection

External link: http://arxiv.org/abs/2309.14107

Report

Analysis and Detection of Pathological Voice using Glottal Source Features

Authors: Kadiri, Sudarsana Reddy; Alku, Paavo

Published in: IEEE Journal of Selected Topics in Signal Processing, Vol. 14, No. 2, pp. 367-379, February 2020

Automatic detection of voice pathology enables objective assessment and earlier intervention for the diagnosis. This study provides a systematic analysis of glottal source features and investigates their effectiveness in voice pathology detection. Gl

External link: http://arxiv.org/abs/2309.14080

Report

Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals

Authors: Gowda, Dhananjaya; Kadiri, Sudarsana Reddy; Story, Brad; Alku, Paavo

Published in: IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 28, pp. 1901-1914, 2020

In this paper, we propose a new method for the accurate estimation and tracking of formants in speech signals using time-varying quasi-closed-phase (TVQCP) analysis. Conventional formant tracking methods typically adopt a two-stage estimate-and-track

External link: http://arxiv.org/abs/2308.16540

Report

Refining a Deep Learning-based Formant Tracker using Linear Prediction Methods

Authors: Alku, Paavo; Kadiri, Sudarsana Reddy; Gowda, Dhananjaya

In this study, formant tracking is investigated by refining the formants tracked by an existing data-driven tracker, DeepFormants, using the formants estimated in a model-driven manner by linear prediction (LP)-based methods. As LP-based formant esti

External link: http://arxiv.org/abs/2308.09051

Report

Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features

Authors: Kadiri, Sudarsana Reddy; Kodali, Manila; Alku, Paavo

Developing objective methods for assessing the severity of Parkinson's disease (PD) is crucial for improving the diagnosis and treatment. This study proposes two sets of novel features derived from the single frequency filtering (SFF) method: (1) SFF

External link: http://arxiv.org/abs/2308.09042

Report

Investigation of Self-supervised Pre-trained Models for Classification of Voice Quality from Speech and Neck Surface Accelerometer Signals

Authors: Kadiri, Sudarsana Reddy; Javanmardi, Farhad; Alku, Paavo

Prior studies in the automatic classification of voice quality have mainly studied the use of the acoustic speech signal as input. Recently, a few studies have been carried out by jointly using both speech and neck surface accelerometer (NSA) signals

External link: http://arxiv.org/abs/2308.03226

Report

End-to-end Ensemble-based Feature Selection for Paralinguistics Tasks

Authors: Grósz, Tamás; Singh, Mittul; Kadiri, Sudarsana Reddy; Kathania, Hemant; Kurimo, Mikko

The events of recent years have highlighted the importance of telemedicine solutions which could potentially allow remote treatment and diagnosis. Relatedly, Computational Paralinguistics, a unique subfield of Speech Processing, aims to extract infor

External link: http://arxiv.org/abs/2210.15978
