Showing 1 - 10 of 146 for search: '"Ananthapadmanabha, T."'
The recent advances in the field of deep learning have not been fully utilised for decoding imagined speech primarily because of the unavailability of sufficient training samples to train a deep network. In this paper, we present a novel architecture
External link:
http://arxiv.org/abs/2003.09374
Subjective and objective experiments are conducted to understand the extent to which a speaker's gender influences the acoustics of unvoiced (U) sounds. U segments of utterances are replaced by the corresponding segments of a speaker of opposite gend
External link:
http://arxiv.org/abs/1807.05813
A judicious combination of dictionary learning methods, block sparsity and a source recovery algorithm is used in a hierarchical manner to identify the noises and the speakers from a noisy conversation between two people. Conversations are simulated u
External link:
http://arxiv.org/abs/1609.09764
Using a known speaker-intrinsic normalization procedure, formant data are scaled by the reciprocal of the geometric mean of the first three formant frequencies. This reduces the influence of the talker but results in a distorted vowel space. The prop
External link:
http://arxiv.org/abs/1609.05104
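The scaling step described in this abstract (dividing formant data by the geometric mean of the first three formant frequencies) can be sketched as below. This is only an illustration of that one sentence; the function name and the sample formant values are assumptions, and the paper's own proposed method, which is cut off in the listing, is not reproduced here.

```python
def normalize_formants(f1, f2, f3):
    """Scale F1-F3 by the reciprocal of their geometric mean.

    Hypothetical helper: inputs are formant frequencies in Hz for one
    vowel token; outputs are dimensionless scaled values.
    """
    geometric_mean = (f1 * f2 * f3) ** (1.0 / 3.0)
    return (f1 / geometric_mean, f2 / geometric_mean, f3 / geometric_mean)


# Illustrative (made-up) formant values in Hz.
scaled = normalize_formants(500.0, 1500.0, 2500.0)
print(scaled)
```

By construction the product of the three scaled values is 1, which is why the talker-dependent overall scale drops out.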
A dictionary-learning-based audio source classification algorithm is proposed to classify a sample audio signal as one amongst a finite set of different audio sources. A cosine similarity measure is used to select the atoms during dictionary learning.
External link:
http://arxiv.org/abs/1510.07774
An objective critical distance (OCD) has been defined as the spacing between adjacent formants at which the level of the valley between them reaches the mean spectral level. The measured OCD lies in the same range (viz., 3-3.5 bark) as the critical dis
External link:
http://arxiv.org/abs/1506.04828
Enhancing Smart Grid Security with SHA-SARIMAX: Identifying and Restoring Corrupted Files from FDIA.
Published in:
IAENG International Journal of Computer Science; Aug 2024, Vol. 51 Issue 8, p1112-1121, 10p
The linear prediction (LP) technique estimates an optimum all-pole filter of a given order for a frame of a speech signal. The coefficients of the all-pole filter, 1/A(z), are referred to as LP coefficients (LPCs). The gain of the inverse of the all-pole fi
External link:
http://arxiv.org/abs/1411.1267
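As a rough illustration of how LP coefficients for one frame can be estimated, the sketch below uses the textbook autocorrelation method with the Levinson-Durbin recursion. This is a standard technique swapped in for illustration, not necessarily the procedure used in the paper; the function names, the sign convention A(z) = 1 + a1 z^-1 + ... + ap z^-p, and the synthetic test frame are all assumptions.

```python
def autocorrelation(frame, max_lag):
    """Autocorrelation of a frame for lags 0..max_lag."""
    n = len(frame)
    return [sum(frame[i] * frame[i + k] for i in range(n - k))
            for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LP normal equations by the Levinson-Durbin recursion.

    Returns (a, err), where a = [1, a1, ..., ap] are the coefficients of
    A(z) and err is the final prediction-error energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        acc = r[m] + sum(a[i] * r[m - i] for i in range(1, m))
        k = -acc / err                      # reflection coefficient
        updated = a[:]
        for i in range(1, m):
            updated[i] = a[i] + k * a[m - i]
        updated[m] = k
        a = updated
        err *= (1.0 - k * k)
    return a, err


# Synthetic frame (assumption): impulse response of 1 / (1 - 0.9 z^-1).
frame = [0.9 ** n for n in range(200)]
r = autocorrelation(frame, 2)
lpc, residual_energy = levinson_durbin(r, 2)
print(lpc, residual_energy)
```

For this single-pole test frame the recursion should recover a first coefficient close to -0.9 and a second close to zero, since a first-order predictor already fits the signal.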
Detection of transitions between broad phonetic classes in a speech signal is an important problem which has applications such as landmark detection and segmentation. The proposed hierarchical method detects silence to non-silence transitions, high a
External link:
http://arxiv.org/abs/1411.0370
This paper is concerned with modelling and simulation of a VSAT (very small aperture terminal) data messaging network operating in Karnataka, India, in the extended C-band. The VSATs of KPTCL in Karnataka use 6.875-6.9465 GHz uplinks and 4.650-4.7215
External link:
http://arxiv.org/abs/1206.1722