Zobrazeno 1 - 10
of 19
pro vyhledávání: '"Sourish Chaudhuri"'
Autor:
Caroline Pantofaru, Ondrej Klejch, Cordelia Schmid, Joseph Roth, Arkadiusz Stopczynski, Sharadh Ramaswamy, Zhonghua Xi, Radhika Marvin, Andrew C. Gallagher, Sourish Chaudhuri, Liat Kaver
Publikováno v:
ICASSP
Active speaker detection is an important component in video analysis algorithms for applications such as speaker diarization, video re-targeting for meetings, speech enhancement, and human-robot interaction. The absence of a large, carefully labeled
Publikováno v:
ICASSP
We present a system that associates faces with voices in a video by fusing information from the audio and visual signals. The thesis underlying our work is that an extreme simple approach to generating (weak) speech clusters can be combined with stro
Autor:
Caroline Pantofaru, Radhika Marvin, Joseph Roth, Nathan Reale, Liat Kaver, Sourish Chaudhuri, Kevin W. Wilson, Andrew C. Gallagher, Daniel P. W. Ellis, Zhonghua Xi, Loretta Guarino Reid
Publikováno v:
INTERSPEECH
Speech activity detection (or endpointing) is an important processing step for applications such as speech recognition, language identification and speaker diarization. Both audio- and vision-based approaches have been used for this task in various s
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::17f8453f0b6567f186232f54b7c9b663
Autor:
R. Channing Moore, Aren Jansen, Sourish Chaudhuri, Kevin W. Wilson, Manoj Plakal, Shawn Hershey, Rif A. Saurous, Daniel P. W. Ellis, Devin Platt, Malcolm Slaney, Bryan Seybold, Ron Weiss, Jort F. Gemmeke
Publikováno v:
ICASSP
Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio. We use various CNN architectures to classify the soundtracks of a dataset of 70M training videos (5.24 million hours) with 30,871 vide
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::82f4931fb9e5b087ff1934c8093ff651
Autor:
Sourish Chaudhuri, Bhiksha Raj
Publikováno v:
ICASSP
Current audio analysis techniques rely on fairly shallow analysis of audio content, using symbols or patterns extracted directly from the observed acoustics. We hypothesize that the observed acoustics actually map to semantics in a hierarchical manne
Publikováno v:
INTERSPEECH
Autor:
Bhiksha Raj, Sourish Chaudhuri, Kriti Suneja, Indradyumna Roy, Tarunima Prabhakar, Soham De, Rita Singh
Publikováno v:
INTERSPEECH
Given the large number of new musical tracks released each year, automated approaches to plagiarism detection are essential to help us track potential violations of copyright. Most current approaches to plagiarism detection are based on musical simil
Publikováno v:
ICASSP
In most real-world audio recordings, we encounter several types of audio events. In this paper, we develop a technique for detecting signature audio events, that is based on identifying patterns of occurrences of automatically learned atomic units of
Publikováno v:
INTERSPEECH
Publikováno v:
Interspeech 2011.