Showing 1 - 10 of 14 for search: '"Chaudhuri, Sourish"'
Author:
Roth, Joseph; Chaudhuri, Sourish; Klejch, Ondrej; Marvin, Radhika; Gallagher, Andrew; Kaver, Liat; Ramaswamy, Sharadh; Stopczynski, Arkadiusz; Schmid, Cordelia; Xi, Zhonghua; Pantofaru, Caroline
Active speaker detection is an important component in video analysis algorithms for applications such as speaker diarization, video re-targeting for meetings, speech enhancement, and human-robot interaction. The absence of a large, carefully labeled
External link:
http://arxiv.org/abs/1901.01342
Author:
Chaudhuri, Sourish; Roth, Joseph; Ellis, Daniel P. W.; Gallagher, Andrew; Kaver, Liat; Marvin, Radhika; Pantofaru, Caroline; Reale, Nathan; Reid, Loretta Guarino; Wilson, Kevin; Xi, Zhonghua
Speech activity detection (or endpointing) is an important processing step for applications such as speech recognition, language identification and speaker diarization. Both audio- and vision-based approaches have been used for this task in various s
External link:
http://arxiv.org/abs/1808.00606
In this paper, we present a system that associates faces with voices in a video by fusing information from the audio and visual signals. The thesis underlying our work is that an extremely simple approach to generating (weak) speech clusters can be c
External link:
http://arxiv.org/abs/1706.00079
Author:
Hershey, Shawn; Chaudhuri, Sourish; Ellis, Daniel P. W.; Gemmeke, Jort F.; Jansen, Aren; Moore, R. Channing; Plakal, Manoj; Platt, Devin; Saurous, Rif A.; Seybold, Bryan; Slaney, Malcolm; Weiss, Ron J.; Wilson, Kevin
Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio. We use various CNN architectures to classify the soundtracks of a dataset of 70M training videos (5.24 million hours) with 30,871 vide
External link:
http://arxiv.org/abs/1609.09430
Author:
De, Soham; Roy, Indradyumna; Prabhakar, Tarunima; Suneja, Kriti; Chaudhuri, Sourish; Singh, Rita; Raj, Bhiksha
Published in:
INTERSPEECH-2012, 1744-1747 (2012)
Given the large number of new musical tracks released each year, automated approaches to plagiarism detection are essential to help us track potential violations of copyright. Most current approaches to plagiarism detection are based on musical simil
External link:
http://arxiv.org/abs/1503.00022
Author:
Chaudhuri, Sourish; Raj, Bhiksha
Published in:
2013 IEEE International Conference on Acoustics, Speech & Signal Processing; 2013, p833-837, 5p
Published in:
2012 IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP); 1/1/2012, p489-492, 4p
Author:
Kang, Moonyoung; Chaudhuri, Sourish; Kumar, Rohit; Wang, Yi-Chia; Rosé, Eric R.; Rosé, Carolyn P.; Cui, Yue
Published in:
Intelligent Tutoring Systems (9783540691303); 2008, p793-795, 3p
Author:
Chaudhuri, Sourish; Kumar, Rohit; Joshi, Mahesh; Terrell, Elon; Higgs, Fred; Aleven, Vincent; Penstein Rosé, Carolyn
Published in:
Intelligent Tutoring Systems (9783540691303); 2008, p807-809, 3p
Published in:
2013 IEEE International Conference on Acoustics, Speech & Signal Processing; 2013, pA1-A44, 44p