Výsledky vyhledávání - "Phani Sankar Nidadavolu"

Akademický článek

Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks

Autor: Ruben Zazo, Phani Sankar Nidadavolu, Nanxin Chen, Joaquin Gonzalez-Rodriguez, Najim Dehak

Publikováno v: IEEE Access, Vol 6, Pp 22524-22530 (2018)

Age estimation from speech has recently received increased interest as it is useful for many applications such as user-profiling, targeted marketing, or personalized call-routing. This kind of applications need to quickly estimate the age of the spea

Externí odkaz: https://doaj.org/article/cef807ed01b6401f8b0ae5794dc5f2d6

Zobrazit plný text záznamu

RefTextLAS: Reference Text Biased Listen, Attend, and Spell Model For Accurate Reading Evaluation

Autor: Phani Sankar Nidadavolu, Na Xu, Nick Jutila, Ravi Teja Gadde, Aswarth Abhilash Dara, Joseph Savold, Sapan Patel, Aaron Hoff, Veerdhawal Pande, Kevin Crews, Ankur Gandhe, Ariya Rastrow, Roland Maas

Publikováno v: Interspeech 2022.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::d6b874e926bcc9284dd61970aec06b91
https://doi.org/10.21437/interspeech.2022-11078

Zobrazit plný text záznamu

Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19

Autor: Daniel Garcia-Romero, Pedro Torres-Carrasquiilo, Nanxin Chen, Saurabh Kataria, Jesús Antonio Villalba López, Phani Sankar Nidadavolu, Jonas Borgstrom, Alan V. McCree, Najim Dehak, Gregory Sell, Leibny Paola Garcia-Perera

Publikováno v: Odyssey

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::fc0557a7d85a6e58ed35e5bc383fa079
https://doi.org/10.21437/odyssey.2020-39

Zobrazit plný text záznamu

Analysis of Deep Feature Loss Based Enhancement for Speaker Verification

Autor: Najim Dehak, Saurabh Kataria, Jesús Villalba, Phani Sankar Nidadavolu

Publikováno v: Odyssey

Data augmentation is conventionally used to inject robustness in Speaker Verification systems. Several recently organized challenges focus on handling novel acoustic environments. Deep learning based speech enhancement is a modern solution for this.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ee8a4c665686862fa53f7a650a436f0c
https://doi.org/10.21437/odyssey.2020-66

Zobrazit plný text záznamu

Speaker detection in the wild: Lessons learned from JSALT 2019

Publikováno v: Odyssey 2020 The Speaker and Language Recognition Workshop
Odyssey 2020 The Speaker and Language Recognition Workshop, Nov 2020, Tokyo, Japan
Odyssey
HAL

This paper presents the problems and solutions addressed at the JSALT workshop when using a single microphone for speaker detection in adverse scenarios. The main focus was to tackle a wide range of conditions that go from meetings to wild speech. We

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::149a4b3cb5d20f380116b4922e46b0dc
https://hal.science/hal-02417632

Zobrazit plný text záznamu

NeuroSpeech: An open-source software for Parkinson's speech analysis

Autor: Milos Cernak, Najim Dehak, Jesús Francisco Vargas-Bonilla, Julius Hannink, Phani Sankar Nidadavolu, Heidi Christensen, Maria Yancheva, Alyssa Vann, Nikolai Vogler, Hamidreza Chinaei, Frank Rudzicz, Elmar Nöth, Tobias Bocklet, Juan Camilo Vásquez-Correa, Raman Arora, Juan Rafael Orozco-Arroyave

Publikováno v: Digital Signal Processing. 77:207-221

A new software for modeling pathological speech signals is presented in this paper. The software is called NeuroSpeech. This software enables the analysis of pathological speech signals considering different speech dimensions: phonation, articulation

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::17788370006506cd01424d2a144e87c9
https://doi.org/10.1016/j.dsp.2017.07.004

Zobrazit plný text záznamu

Age Estimation in Short Speech Utterances Based on LSTM Recurrent Neural Networks

Autor: Najim Dehak, Joaquin Gonzalez-Rodriguez, Phani Sankar Nidadavolu, Nanxin Chen, Ruben Zazo

Publikováno v: IEEE Access, Vol 6, Pp 22524-22530 (2018)

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::570c38d67a31d031a8a5004116d6e677
https://ieeexplore.ieee.org/document/8316819/

Zobrazit plný text záznamu

The JHU ASR System for VOiCES from a Distance Challenge 2019

Autor: Hainan Xu, Sanjeev Khudanpur, Daniel Povey, Yiming Wang, David Snyder, Vimal Manohar, Phani Sankar Nidadavolu

Publikováno v: INTERSPEECH

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::ee067bb243c696e834a31b86d85130ab
https://doi.org/10.21437/interspeech.2019-1948

Zobrazit plný text záznamu

Cycle-GANs for Domain Adaptation of Acoustic Features for Speaker Recognition

Autor: Jesús Villalba, Najim Dehak, Phani Sankar Nidadavolu

Publikováno v: ICASSP

It is well known that domain mismatch between the training and evaluation data hinders the performance of any machine learning system. Various factors contribute to domain mismatch. In speaker recognition systems, it mainly occurs due to the mismatch

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::852dbb9e90fd65e7e3d16b69ead120b9
https://doi.org/10.1109/icassp.2019.8683055

Zobrazit plný text záznamu

Investigation on Neural Bandwidth Extension of Telephone Speech for Improved Speaker Recognition

Autor: Jesús Villalba, Phani Sankar Nidadavolu, Vicente A. Iglesias, Najim Dehak

Publikováno v: ICASSP

We extend our previous work on training mixed-bandwidth (BW) speaker recognition system by predicting missing information in upperband (UB) of upsampled telephone speech. Mixed-BW systems combine speech from narrowband (NB) and wideband (WB) speech c

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::d5813ed47e662ae82f71da5c7e17c0a2
https://doi.org/10.1109/icassp.2019.8682992

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání