Výsledky vyhledávání - "Sourish Chaudhuri"

Ava Active Speaker: An Audio-Visual Dataset for Active Speaker Detection

Autor: Caroline Pantofaru, Ondrej Klejch, Cordelia Schmid, Joseph Roth, Arkadiusz Stopczynski, Sharadh Ramaswamy, Zhonghua Xi, Radhika Marvin, Andrew C. Gallagher, Sourish Chaudhuri, Liat Kaver

Publikováno v: ICASSP

Active speaker detection is an important component in video analysis algorithms for applications such as speaker diarization, video re-targeting for meetings, speech enhancement, and human-robot interaction. The absence of a large, carefully labeled

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::9605bc7082663178dcb3ea66e6467d2d
https://doi.org/10.1109/icassp40776.2020.9053900

Zobrazit plný text záznamu

Using audio-visual information to understand speaker activity: Tracking active speakers on and off screen

Autor: Ian Sturdy, Sourish Chaudhuri, Caroline Pantofaru, Malcolm Slaney, Ken Hoover

Publikováno v: ICASSP

We present a system that associates faces with voices in a video by fusing information from the audio and visual signals. The thesis underlying our work is that an extreme simple approach to generating (weak) speech clusters can be combined with stro

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::9838d68b201d0a3de7cc8dd4da88fcef
https://doi.org/10.1109/icassp.2018.8461891

Zobrazit plný text záznamu

AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies

Autor: Caroline Pantofaru, Radhika Marvin, Joseph Roth, Nathan Reale, Liat Kaver, Sourish Chaudhuri, Kevin W. Wilson, Andrew C. Gallagher, Daniel P. W. Ellis, Zhonghua Xi, Loretta Guarino Reid

Publikováno v: INTERSPEECH

Speech activity detection (or endpointing) is an important processing step for applications such as speech recognition, language identification and speaker diarization. Both audio- and vision-based approaches have been used for this task in various s

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::17f8453f0b6567f186232f54b7c9b663

Zobrazit plný text záznamu

CNN Architectures for Large-Scale Audio Classification

Autor: R. Channing Moore, Aren Jansen, Sourish Chaudhuri, Kevin W. Wilson, Manoj Plakal, Shawn Hershey, Rif A. Saurous, Daniel P. W. Ellis, Devin Platt, Malcolm Slaney, Bryan Seybold, Ron Weiss, Jort F. Gemmeke

Publikováno v: ICASSP

Convolutional Neural Networks (CNNs) have proven very effective in image classification and show promise for audio. We use various CNN architectures to classify the soundtracks of a dataset of 70M training videos (5.24 million hours) with 30,871 vide

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::82f4931fb9e5b087ff1934c8093ff651

Zobrazit plný text záznamu

Unsupervised hierarchical structure induction for deeper semantic analysis of audio

Autor: Sourish Chaudhuri, Bhiksha Raj

Publikováno v: ICASSP

Current audio analysis techniques rely on fairly shallow analysis of audio content, using symbols or patterns extracted directly from the observed acoustics. We hypothesize that the observed acoustics actually map to semantics in a hierarchical manne

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::662eeb84d1d89aac086aeef4cf4cc6f1
https://doi.org/10.1109/icassp.2013.6637765

Zobrazit plný text záznamu

Plagiarism detection in polyphonic music using monaural signal separation

Autor: Bhiksha Raj, Sourish Chaudhuri, Kriti Suneja, Indradyumna Roy, Tarunima Prabhakar, Soham De, Rita Singh

Publikováno v: INTERSPEECH

Given the large number of new musical tracks released each year, automated approaches to plagiarism detection are essential to help us track potential violations of copyright. Most current approaches to plagiarism detection are based on musical simil

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e2a51853e974b0fab619de535a0f313c
https://doi.org/10.21437/interspeech.2012-476

Zobrazit plný text záznamu

Audio event detection from acoustic unit occurrence patterns

Autor: Sourish Chaudhuri, Bhiksha Raj, Pranay Dighe, Rita Singh, Anurag Kumar

Publikováno v: ICASSP

In most real-world audio recordings, we encounter several types of audio events. In this paper, we develop a technique for detecting signature audio events, that is based on identifying patterns of occurrences of automatically learned atomic units of

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::beed0b3f276b124971491443b433ce2f
https://doi.org/10.1109/icassp.2012.6287923

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání