Zobrazeno 1 - 10
of 42
pro vyhledávání: '"Karl Ni"'
Publikováno v:
Applications of Machine Learning 2021.
Visual search and similarity can aid an e-commerce platform by providing appropriate recommendations where semantic labeling and associated metadata does not always exist. In this work, we detail the specifics of our system that powers visually simil
Publikováno v:
Applications of Machine Learning 2021.
The performance of recommendation systems is highly dependent on candidate matching techniques for scoping users’ information needs. Existing candidate matching methods are based on text embedding and collaborative filtering, which base similarity
Publikováno v:
INTERSPEECH
We propose an algorithm to denoise speakers from a single microphone in the presence of non-stationary and dynamic noise. Our approach is inspired by the recent success of neural network models separating speakers from other speakers and singers from
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::537b6a8ba6cd5852998e5f9c1e01a8d9
http://arxiv.org/abs/1804.10669
http://arxiv.org/abs/1804.10669
Autor:
Cory Stephenson, M. A. Barrios, Paul Gamble, Chris Bartels, Mahesh Kumar Nandwana, Julien van Hout, J. Hetherly, Horacio Franco, Martin Graciarena, Colleen Richey, Zeb Armstrong, Karl Ni, Allen R. Stauffer, Aaron Lawson
Publikováno v:
INTERSPEECH
This paper introduces the Voices Obscured In Complex Environmental Settings (VOICES) corpus, a freely available dataset under Creative Commons BY 4.0. This dataset will promote speech and signal processing research of speech recorded by far-field mic
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f37da0c0f8deb8b0d82030ecbc62cf9b
Publikováno v:
ACSSC
Academic work in identifying writers of handwritten documents has previously focused on clean benchmark datasets: plain white documents with uniform writing instruments. Solutions on this type of data have achieved hit-in-top-10 accuracy rates reachi
Publikováno v:
SiPS
We propose an algorithm to separate simultaneously speaking persons from each other, the “cocktail party problem”, using a single microphone. Our approach involves a deep recurrent neural networks regression to a vector space that is descriptive
Publikováno v:
WACV
Identifying the writer of a handwritten document based on visual features is difficult, as evidenced by the limited number of subject matter experts proficient in forensic document analysis. Automating writer identification would be beneficial for su
Publikováno v:
ICIP
Image annotation, or prediction of multiple tags for an image, is a challenging task. Most current algorithms are based on large sets of handcrafted features. Deep convolutional neural networks have recently outperformed humans in image classificatio
Autor:
Zeb Armstrong, Paul Gamble, Colleen Richey, Karl Ni, Aaron Lawson, Cory Stephenson, J. Hetherly, Martin Graciarena, M. A. Barrios, Todd Stavish
Publikováno v:
The Journal of the Acoustical Society of America. 143:1869-1869
The speakers in the room (SITR) corpus is a collaboration between Lab41 and SRI International, designed to be a freely available data set for speech and acoustics research in noisy room conditions. The main focus of the corpus is on distant microphon