Showing 1 - 10 of 20 for search: '"Phani Sankar Nidadavolu"'
Published in:
IEEE Access, Vol 6, Pp 22524-22530 (2018)
Age estimation from speech has recently received increased interest as it is useful for many applications such as user-profiling, targeted marketing, or personalized call-routing. These kinds of applications need to quickly estimate the age of the spea
External link:
https://doaj.org/article/cef807ed01b6401f8b0ae5794dc5f2d6
Author:
Phani Sankar Nidadavolu, Na Xu, Nick Jutila, Ravi Teja Gadde, Aswarth Abhilash Dara, Joseph Savold, Sapan Patel, Aaron Hoff, Veerdhawal Pande, Kevin Crews, Ankur Gandhe, Ariya Rastrow, Roland Maas
Published in:
Interspeech 2022.
Author:
Daniel Garcia-Romero, Pedro Torres-Carrasquillo, Nanxin Chen, Saurabh Kataria, Jesús Antonio Villalba López, Phani Sankar Nidadavolu, Jonas Borgstrom, Alan V. McCree, Najim Dehak, Gregory Sell, Leibny Paola Garcia-Perera
Published in:
Odyssey
Published in:
Odyssey
Data augmentation is conventionally used to inject robustness into Speaker Verification systems. Several recently organized challenges focus on handling novel acoustic environments. Deep-learning-based speech enhancement is a modern solution for this.
Author:
Sajjad Abdoli, Latane Bullock, Paola García, Lei Sun, Jesús Villalba, Hervé Bredin, Wassim Bouaziz, Hadrien Titeux, Saurabh Kataria, Diego Castan, Sizhu Chen, Kong Aik Lee, Léo Galmant, Jun Du, Alejandrina Cristia, Marie-Philippe Gill, Ling Guo, Marvin Lavechin, Phani Sankar Nidadavolu, Najim Dehak, Bar Ben-Yair, Koji Okabe, Emmanuel Dupoux, Xin Wang
Published in:
Odyssey 2020 The Speaker and Language Recognition Workshop
Odyssey 2020 The Speaker and Language Recognition Workshop, Nov 2020, Tokyo, Japan
Odyssey
HAL
This paper presents the problems and solutions addressed at the JSALT workshop when using a single microphone for speaker detection in adverse scenarios. The main focus was to tackle a wide range of conditions that go from meetings to wild speech. We
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::149a4b3cb5d20f380116b4922e46b0dc
https://hal.science/hal-02417632
Author:
Milos Cernak, Najim Dehak, Jesús Francisco Vargas-Bonilla, Julius Hannink, Phani Sankar Nidadavolu, Heidi Christensen, Maria Yancheva, Alyssa Vann, Nikolai Vogler, Hamidreza Chinaei, Frank Rudzicz, Elmar Nöth, Tobias Bocklet, Juan Camilo Vásquez-Correa, Raman Arora, Juan Rafael Orozco-Arroyave
Published in:
Digital Signal Processing. 77:207-221
A new software for modeling pathological speech signals is presented in this paper. The software is called NeuroSpeech. This software enables the analysis of pathological speech signals considering different speech dimensions: phonation, articulation
Author:
Hainan Xu, Sanjeev Khudanpur, Daniel Povey, Yiming Wang, David Snyder, Vimal Manohar, Phani Sankar Nidadavolu
Published in:
INTERSPEECH
Published in:
ICASSP
It is well known that domain mismatch between the training and evaluation data hinders the performance of any machine learning system. Various factors contribute to domain mismatch. In speaker recognition systems, it mainly occurs due to the mismatch
Published in:
ICASSP
We extend our previous work on training a mixed-bandwidth (BW) speaker recognition system by predicting missing information in the upper band (UB) of upsampled telephone speech. Mixed-BW systems combine speech from narrowband (NB) and wideband (WB) speech c