Výsledky vyhledávání - "Ghosh, Prasanta"

Report

Neural network based approach for solving problems in plane wave duct acoustics

Autor: Veerababu, D., Ghosh, Prasanta K.

Publikováno v: Journal of Sound and Vibration, 585, 2024:118476

Neural networks have emerged as a tool for solving differential equations in many branches of engineering and science. But their progress in frequency domain acoustics is limited by the vanishing gradient problem that occurs at higher frequencies. Th

Externí odkaz: http://arxiv.org/abs/2405.04603

Zobrazit plný text záznamu

Report

SPIRE-SIES: A Spontaneous Indian English Speech Corpus

Autor: Singh, Abhayjeet, Shah, Charu, Varadaraj, Rajashri, Chauhan, Sonakshi, Ghosh, Prasanta Kumar

In this paper, we present a 170.83 hour Indian English spontaneous speech dataset. Lack of Indian English speech data is one of the major hindrances in developing robust speech systems which are adapted to the Indian speech style. Moreover this scarc

Externí odkaz: http://arxiv.org/abs/2312.00698

Zobrazit plný text záznamu

Report

Speaking rate attention-based duration prediction for speed control TTS

Autor: Bandekar, Jesuraj, Udupa, Sathvik, Singh, Abhayjeet, Jayakumar, Anjali, G, Deekshitha, Badiger, Sandhya, Kumar, Saurabh, VH, Pooja, Ghosh, Prasanta Kumar

With the advent of high-quality speech synthesis, there is a lot of interest in controlling various prosodic attributes of speech. Speaking rate is an essential attribute towards modelling the expressivity of speech. In this work, we propose a novel

Externí odkaz: http://arxiv.org/abs/2310.08846

Zobrazit plný text záznamu

Report

Model Adaptation for ASR in low-resource Indian Languages

Automatic speech recognition (ASR) performance has improved drastically in recent years, mainly enabled by self-supervised learning (SSL) based acoustic models such as wav2vec2 and large-scale multi-lingual training like Whisper. A huge challenge sti

Externí odkaz: http://arxiv.org/abs/2307.07948

Zobrazit plný text záznamu

Report

Analysis of vocal breath sounds before and after administering Bronchodilator in Asthmatic patients

Autor: Yadav, Shivani, Gope, Dipanjan, K., Uma Maheswari, Ghosh, Prasanta Kumar

Asthma is one of the chronic inflammatory diseases of the airways, which causes chest tightness, wheezing, breathlessness, and cough. Spirometry is an effort-dependent test used to monitor and diagnose lung conditions like Asthma. Vocal breath sound

Externí odkaz: http://arxiv.org/abs/2305.00242

Zobrazit plný text záznamu

Report

An unsupervised segmentation of vocal breath sounds

Autor: Yadav, Shivani, Gope, Dipanjan, K., Uma Maheswari, Ghosh, Prasanta Kumar

Breathing is an essential part of human survival, which carries information about a person's physiological and psychological state. Generally, breath boundaries are marked by experts before using for any task. An unsupervised algorithm for breath bou

Externí odkaz: http://arxiv.org/abs/2304.03758

Zobrazit plný text záznamu

Report

An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations

Autor: Jain, Shelly, Pal, Priyanshi, Vuppala, Anil, Ghosh, Prasanta, Yarra, Chiranjeevi

Speech systems are sensitive to accent variations. This is especially challenging in the Indian context, with an abundance of languages but a dearth of linguistic studies characterising pronunciation variations. The growing number of L2 English speak

Externí odkaz: http://arxiv.org/abs/2212.09284

Zobrazit plný text záznamu

Report

Vocal Breath Sound Based Gender Classification

Autor: Solanki, Mohammad Shaique, Bharadwaj, Ashutosh M, K, Jeevan, Ghosh, Prasanta Kumar

Voiced speech signals such as continuous speech are known to have acoustic features such as pitch(F0), and formant frequencies(F1, F2, F3) which can be used for gender classification. However, gender classification studies using non-speech signals su

Externí odkaz: http://arxiv.org/abs/2211.06371

Zobrazit plný text záznamu

Report

Real-Time MRI Video synthesis from time aligned phonemes with sequence-to-sequence networks

Autor: Udupa, Sathvik, Ghosh, Prasanta Kumar

Real-Time Magnetic resonance imaging (rtMRI) of the midsagittal plane of the mouth is of interest for speech production research. In this work, we focus on estimating utterance level rtMRI video from the spoken phoneme sequence. We obtain time-aligne

Externí odkaz: http://arxiv.org/abs/2210.16881

Zobrazit plný text záznamu

Report

Improved acoustic-to-articulatory inversion using representations from pretrained self-supervised learning models

Autor: Udupa, Sathvik, C, Siddarth, Ghosh, Prasanta Kumar

In this work, we investigate the effectiveness of pretrained Self-Supervised Learning (SSL) features for learning the mapping for acoustic to articulatory inversion (AAI). Signal processing-based acoustic features such as MFCCs have been predominantl

Externí odkaz: http://arxiv.org/abs/2210.16871

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání