Zobrazeno 1 - 10
of 18
pro vyhledávání: '"Sri Rama Murty Kodukula"'
Publikováno v:
Circuits, Systems & Signal Processing; Jul2024, Vol. 43 Issue 7, p4487-4507, 21p
Publikováno v:
2022 National Conference on Communications (NCC).
Publikováno v:
Circuits, Systems, and Signal Processing. 39:5169-5197
This paper presents a new approach for unsupervised segmentation and labeling of acoustically homogeneous segments from the speech signals. The virtual labels, thus obtained, are used to build unsupervised acoustic models in the absence of manual tra
In this paper, we demonstrate the significance of restoring harmonics of the fundamental frequency (pitch) in deep neural network (DNN) based speech enhancement. We propose a sliding-window attention network to regress the spectral magnitude mask (SM
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::365c954468bc95925b3dc461e0d99417
https://doi.org/10.36227/techrxiv.15051972.v1
https://doi.org/10.36227/techrxiv.15051972.v1
Publikováno v:
IEEE Transactions on Multimedia. 21:1672-1680
A fixed dimensional representation for action clips of varying lengths has been proposed in the literature using aggregation models like bag-of-words and Fisher vector. These representations are high dimensional and require classification techniques
Publikováno v:
NCC
In the recent past, Deep neural networks became the most successful approach to extract the speaker embeddings. Among the existing methods, the x-vector system, that extracts a fixed dimensional representation from varying length speech signal, becam
Publikováno v:
NCC
Most of the speech enhancement algorithms rely on estimating the magnitude spectrum of the clean speech signal from that of the noisy speech signal using either spectral regression or spectral masking. Because of difficulty in processing the phase of
Publikováno v:
ICIP
Actions can be recognized effectively when the various atomic attributes forming the action are identified and combined in the form of a representation. In this paper, a low-dimensional representation is extracted from a pool of attributes learned in
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 23:2371-2383
The phase spectrum of Fourier transform has received lesser prominence than its magnitude counterpart in speech processing. In this paper, we propose a method for parametric modeling of the phase spectrum, and discuss its applications in speech signa
Publikováno v:
Circuits, Systems, and Signal Processing. 35:2584-2609
Epochs are instants of significant excitation of vocal-tract system in speech production process. In this paper, we attempt to extract information about epochs from phase spectra of speech signals. The phase spectrum of speech is modelled as the resp