Zobrazeno 1 - 10
of 828
pro vyhledávání: '"Ghosh, Prasanta"'
Autor:
Veerababu, D., Ghosh, Prasanta K.
Publikováno v:
Journal of Sound and Vibration, 585, 2024:118476
Neural networks have emerged as a tool for solving differential equations in many branches of engineering and science. But their progress in frequency domain acoustics is limited by the vanishing gradient problem that occurs at higher frequencies. Th
Externí odkaz:
http://arxiv.org/abs/2405.04603
In this paper, we present a 170.83 hour Indian English spontaneous speech dataset. Lack of Indian English speech data is one of the major hindrances in developing robust speech systems which are adapted to the Indian speech style. Moreover this scarc
Externí odkaz:
http://arxiv.org/abs/2312.00698
Autor:
Bandekar, Jesuraj, Udupa, Sathvik, Singh, Abhayjeet, Jayakumar, Anjali, G, Deekshitha, Badiger, Sandhya, Kumar, Saurabh, VH, Pooja, Ghosh, Prasanta Kumar
With the advent of high-quality speech synthesis, there is a lot of interest in controlling various prosodic attributes of speech. Speaking rate is an essential attribute towards modelling the expressivity of speech. In this work, we propose a novel
Externí odkaz:
http://arxiv.org/abs/2310.08846
Autor:
Singh, Abhayjeet, Mehta, Arjun Singh, S, Ashish Khuraishi K, G, Deekshitha, Date, Gauri, Nanavati, Jai, Bandekar, Jesuraja, Basumatary, Karnalius, P, Karthika, Badiger, Sandhya, Udupa, Sathvik, Kumar, Saurabh, Savitha, Ghosh, Prasanta Kumar, V, Prashanthi, Pai, Priyanka, Nanavati, Raoul, Saxena, Rohan, Mora, Sai Praneeth Reddy, Raghavan, Srinivasa
Automatic speech recognition (ASR) performance has improved drastically in recent years, mainly enabled by self-supervised learning (SSL) based acoustic models such as wav2vec2 and large-scale multi-lingual training like Whisper. A huge challenge sti
Externí odkaz:
http://arxiv.org/abs/2307.07948
Asthma is one of the chronic inflammatory diseases of the airways, which causes chest tightness, wheezing, breathlessness, and cough. Spirometry is an effort-dependent test used to monitor and diagnose lung conditions like Asthma. Vocal breath sound
Externí odkaz:
http://arxiv.org/abs/2305.00242
Breathing is an essential part of human survival, which carries information about a person's physiological and psychological state. Generally, breath boundaries are marked by experts before using for any task. An unsupervised algorithm for breath bou
Externí odkaz:
http://arxiv.org/abs/2304.03758
Speech systems are sensitive to accent variations. This is especially challenging in the Indian context, with an abundance of languages but a dearth of linguistic studies characterising pronunciation variations. The growing number of L2 English speak
Externí odkaz:
http://arxiv.org/abs/2212.09284
Voiced speech signals such as continuous speech are known to have acoustic features such as pitch(F0), and formant frequencies(F1, F2, F3) which can be used for gender classification. However, gender classification studies using non-speech signals su
Externí odkaz:
http://arxiv.org/abs/2211.06371
Autor:
Udupa, Sathvik, Ghosh, Prasanta Kumar
Real-Time Magnetic resonance imaging (rtMRI) of the midsagittal plane of the mouth is of interest for speech production research. In this work, we focus on estimating utterance level rtMRI video from the spoken phoneme sequence. We obtain time-aligne
Externí odkaz:
http://arxiv.org/abs/2210.16881
In this work, we investigate the effectiveness of pretrained Self-Supervised Learning (SSL) features for learning the mapping for acoustic to articulatory inversion (AAI). Signal processing-based acoustic features such as MFCCs have been predominantl
Externí odkaz:
http://arxiv.org/abs/2210.16871