Showing 1 - 10 of 61 for search: '"Sudarsana Reddy Kadiri"'
Published in:
IEEE Access, Vol 11, Pp 29149-29161 (2023)
In singing, the perceptual term “voice quality” is used to describe expressed emotions and singing styles. In voice physiology research, specific voice qualities are discussed using the term phonation modes and are directly related to the voicing
External link:
https://doaj.org/article/bf73d8e9de724c2eb3e583e3066af48d
Published in:
IEEE Open Journal of Signal Processing, Vol 4, Pp 80-88 (2023)
Previous studies on the automatic classification of voice disorders have mostly investigated the binary classification task, which aims to distinguish pathological voice from healthy voice. Using multi-class classifiers, however, more fine-grained id
External link:
https://doaj.org/article/5a8e8f3c45de4706a6275b27483d343b
Published in:
IEEE Access, Vol 9, Pp 151631-151640 (2021)
Formant tracking is investigated in this study by using trackers based on dynamic programming (DP) and deep neural nets (DNNs). Using the DP approach, six formant estimation methods were first compared. The six methods include linear prediction (LP)
External link:
https://doaj.org/article/d3e5f843aaf44751a9b53d75a6163f7e
Author:
Sudarsana Reddy Kadiri, Paavo Alku
Published in:
IEEE Access, Vol 8, Pp 60382-60391 (2020)
In this article, we study emotion detection from speech in a speaker-specific scenario. By parameterizing the excitation component of voiced speech, the study explores deviations between emotional speech (e.g., speech produced in anger, happiness, sa
External link:
https://doaj.org/article/bab65234fb174c14bb51b49f788381a6
Published in:
IEEE Access, Vol 8, Pp 174871-174879 (2020)
In this study, we propose Mel-weighted single frequency filtering (SFF) spectrograms for dialect identification. The spectrum derived using SFF has high spectral resolution for harmonics and resonances while simultaneously maintaining good time-resol
External link:
https://doaj.org/article/2dc2e1826381423daecf4e58784fb8bf
Author:
Sudarsana Reddy Kadiri, Paavo Alku
Published in:
Sensors, Vol 22, Iss 13, p 4931 (2022)
Understanding of the perception of emotions or affective states in humans is important to develop emotion-aware systems that work in realistic scenarios. In this paper, the perception of emotions in naturalistic human interaction (audio–visual data
External link:
https://doaj.org/article/f7b3d5d0de8444928aba3e7a89b65811
Published in:
Applied Sciences, Vol 11, Iss 18, p 8420 (2021)
Current ASR systems show poor performance in recognition of children’s speech in noisy environments because recognizers are typically trained with clean adults’ speech and therefore there are two mismatches between training and testing phases (i.
External link:
https://doaj.org/article/fe822c2210854acd9cb678c765e87a47
Published in:
Aalto University
In low resource children automatic speech recognition (ASR) the performance is degraded due to limited acoustic and speaker variability available in small datasets. In this paper, we propose a spectral warping based data augmentation method to captur
Published in:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Published in:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).