Search results - "McDermott, Josh"

Report

Finding Fallen Objects Via Asynchronous Audio-Visual Integration

Authors: Gan, Chuang; Gu, Yi; Zhou, Siyuan; Schwartz, Jeremy; Alter, Seth; Traer, James; Gutfreund, Dan; Tenenbaum, Joshua B.; McDermott, Josh; Torralba, Antonio

The way an object looks and sounds provide complementary reflections of its physical properties. In many settings cues from vision and audition arrive asynchronously but must be integrated, as when we hear an object dropped on the floor and then must

External link: http://arxiv.org/abs/2207.03483

Report

Object-based synthesis of scraping and rolling sounds based on non-linear physical constraints

Authors: Agarwal, Vinayak; Cusimano, Maddie; Traer, James; McDermott, Josh

Published in: Proceedings of the 24th International Conference on Digital Audio Effects (DAFx-20in21), 2021

Sustained contact interactions like scraping and rolling produce a wide variety of sounds. Previous studies have explored ways to synthesize these sounds efficiently and intuitively but could not fully mimic the rich structure of real instances of th

External link: http://arxiv.org/abs/2112.08984

Report

Neural Population Geometry Reveals the Role of Stochasticity in Robust Perception

Authors: Dapello, Joel; Feather, Jenelle; Le, Hang; Marques, Tiago; Cox, David D.; McDermott, Josh H.; DiCarlo, James J.; Chung, SueYeon

Adversarial examples are often cited by neuroscientists and machine learning researchers as an example of how computational models diverge from biological sensory systems. Recent work has proposed adding biologically-inspired components to visual neu

External link: http://arxiv.org/abs/2111.06979

Academic article

Listening with generative models

Authors: Cusimano, Maddie; Hewitt, Luke B.; McDermott, Josh H.

Published in: Cognition, December 2024, 253

Report

The ThreeDWorld Transport Challenge: A Visually Guided Task-and-Motion Planning Benchmark for Physically Realistic Embodied AI

Authors: Gan, Chuang; Zhou, Siyuan; Schwartz, Jeremy; Alter, Seth; Bhandwaldar, Abhishek; Gutfreund, Dan; Yamins, Daniel L. K.; DiCarlo, James J; McDermott, Josh; Torralba, Antonio; Tenenbaum, Joshua B.

We introduce a visually-guided and physics-driven task-and-motion planning benchmark, which we call the ThreeDWorld Transport Challenge. In this challenge, an embodied agent equipped with two 9-DOF articulated arms is spawned randomly in a simulated

External link: http://arxiv.org/abs/2103.14025

Report

Speech Denoising with Auditory Models

Authors: Saddler, Mark R.; Francl, Andrew; Feather, Jenelle; Qian, Kaizhi; Zhang, Yang; McDermott, Josh H.

Contemporary speech enhancement predominantly relies on audio transforms that are trained to reconstruct a clean speech waveform. The development of high-performing neural network sound recognition systems has raised the possibility of using deep fea

External link: http://arxiv.org/abs/2011.10706

Report

ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation

We introduce ThreeDWorld (TDW), a platform for interactive multi-modal physical simulation. TDW enables simulation of high-fidelity sensory data and physical interactions between mobile agents and objects in rich 3D environments. Unique properties in

External link: http://arxiv.org/abs/2007.04954

Report

Untangling in Invariant Speech Recognition

Authors: Stephenson, Cory; Feather, Jenelle; Padhy, Suchismita; Elibol, Oguz; Tang, Hanlin; McDermott, Josh; Chung, SueYeon

Encouraged by the success of deep neural networks on a variety of visual tasks, much theoretical and experimental work has been aimed at understanding and interpreting how vision networks operate. Meanwhile, deep neural networks have also achieved im

External link: http://arxiv.org/abs/2003.01787

Report

Self-Supervised Audio-Visual Co-Segmentation

Authors: Rouditchenko, Andrew; Zhao, Hang; Gan, Chuang; McDermott, Josh; Torralba, Antonio

Segmenting objects in images and separating sound sources in audio are challenging tasks, in part because traditional approaches require large amounts of labeled data. In this paper we develop a neural network model for visual object segmentation and

External link: http://arxiv.org/abs/1904.09013
