Výsledky vyhledávání - "Sudarsana Reddy Kadiri"

Akademický článek

Classification of Phonation Modes in Classical Singing Using Modulation Power Spectral Features

Autor: Manuel Brandner, Paul Armin Bereuter, Sudarsana Reddy Kadiri, Alois Sontacchi

Publikováno v: IEEE Access, Vol 11, Pp 29149-29161 (2023)

In singing, the perceptual term “voice quality” is used to describe expressed emotions and singing styles. In voice physiology research, specific voice qualities are discussed using the term phonation modes and are directly related to the voicing

Externí odkaz: https://doaj.org/article/bf73d8e9de724c2eb3e583e3066af48d

Zobrazit plný text záznamu

Akademický článek

Hierarchical Multi-Class Classification of Voice Disorders Using Self-Supervised Models and Glottal Features

Autor: Saska Tirronen, Sudarsana Reddy Kadiri, Paavo Alku

Publikováno v: IEEE Open Journal of Signal Processing, Vol 4, Pp 80-88 (2023)

Previous studies on the automatic classification of voice disorders have mostly investigated the binary classification task, which aims to distinguish pathological voice from healthy voice. Using multi-class classifiers, however, more fine-grained id

Externí odkaz: https://doaj.org/article/5a8e8f3c45de4706a6275b27483d343b

Zobrazit plný text záznamu

Akademický článek

Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks

Autor: Dhananjaya N. Gowda, Bajibabu Bollepalli, Sudarsana Reddy Kadiri, Paavo Alku

Publikováno v: IEEE Access, Vol 9, Pp 151631-151640 (2021)

Formant tracking is investigated in this study by using trackers based on dynamic programming (DP) and deep neural nets (DNNs). Using the DP approach, six formant estimation methods were first compared. The six methods include linear prediction (LP)

Externí odkaz: https://doaj.org/article/d3e5f843aaf44751a9b53d75a6163f7e

Zobrazit plný text záznamu

Akademický článek

Excitation Features of Speech for Speaker-Specific Emotion Detection

Autor: Sudarsana Reddy Kadiri, Paavo Alku

Publikováno v: IEEE Access, Vol 8, Pp 60382-60391 (2020)

In this article, we study emotion detection from speech in a speaker-specific scenario. By parameterizing the excitation component of voiced speech, the study explores deviations between emotional speech (e.g., speech produced in anger, happiness, sa

Externí odkaz: https://doaj.org/article/bab65234fb174c14bb51b49f788381a6

Zobrazit plný text záznamu

Akademický článek

Mel-Weighted Single Frequency Filtering Spectrogram for Dialect Identification

Autor: Rashmi Kethireddy, Sudarsana Reddy Kadiri, Paavo Alku, Suryakanth V. Gangashetty

Publikováno v: IEEE Access, Vol 8, Pp 174871-174879 (2020)

In this study, we propose Mel-weighted single frequency filtering (SFF) spectrograms for dialect identification. The spectrum derived using SFF has high spectral resolution for harmonics and resonances while simultaneously maintaining good time-resol

Externí odkaz: https://doaj.org/article/2dc2e1826381423daecf4e58784fb8bf

Zobrazit plný text záznamu

Akademický článek

Subjective Evaluation of Basic Emotions from Audio–Visual Data

Autor: Sudarsana Reddy Kadiri, Paavo Alku

Publikováno v: Sensors, Vol 22, Iss 13, p 4931 (2022)

Understanding of the perception of emotions or affective states in humans is important to develop emotion-aware systems that work in realistic scenarios. In this paper, the perception of emotions in naturalistic human interaction (audio–visual data

Externí odkaz: https://doaj.org/article/f7b3d5d0de8444928aba3e7a89b65811

Zobrazit plný text záznamu

Plný text ve formátu HTML

Akademický článek

Using Data Augmentation and Time-Scale Modification to Improve ASR of Children’s Speech in Noisy Environments

Autor: Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Paavo Alku, Mikko Kurimo

Publikováno v: Applied Sciences, Vol 11, Iss 18, p 8420 (2021)

Current ASR systems show poor performance in recognition of children’s speech in noisy environments because recognizers are typically trained with clean adults’ speech and therefore there are two mismatches between training and testing phases (i.

Externí odkaz: https://doaj.org/article/fe822c2210854acd9cb678c765e87a47

Zobrazit plný text záznamu

Data Augmentation Using Spectral Warping for Low Resource Children ASR

Autor: Hemant Kumar Kathania, Viredner Kadyan, Sudarsana Reddy Kadiri, Mikko Kurimo

Publikováno v: Aalto University

In low resource children automatic speech recognition (ASR) the performance is degraded due to limited acoustic and speaker variability available in small datasets. In this paper, we propose a spectral warping based data augmentation method to captur

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::51e2fa01ebb5601ec42d46d5801c008e
https://doi.org/10.1007/s11265-022-01820-0

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání