Zobrazeno 1 - 10
of 324
pro vyhledávání: '"McDermott, Josh"'
Autor:
Gan, Chuang, Gu, Yi, Zhou, Siyuan, Schwartz, Jeremy, Alter, Seth, Traer, James, Gutfreund, Dan, Tenenbaum, Joshua B., McDermott, Josh, Torralba, Antonio
The way an object looks and sounds provide complementary reflections of its physical properties. In many settings cues from vision and audition arrive asynchronously but must be integrated, as when we hear an object dropped on the floor and then must
Externí odkaz:
http://arxiv.org/abs/2207.03483
Publikováno v:
Proceeding of the 24th International Conference on Digital Audio Effects (DAFx-20in21), 2021
Sustained contact interactions like scraping and rolling produce a wide variety of sounds. Previous studies have explored ways to synthesize these sounds efficiently and intuitively but could not fully mimic the rich structure of real instances of th
Externí odkaz:
http://arxiv.org/abs/2112.08984
Autor:
Dapello, Joel, Feather, Jenelle, Le, Hang, Marques, Tiago, Cox, David D., McDermott, Josh H., DiCarlo, James J., Chung, SueYeon
Adversarial examples are often cited by neuroscientists and machine learning researchers as an example of how computational models diverge from biological sensory systems. Recent work has proposed adding biologically-inspired components to visual neu
Externí odkaz:
http://arxiv.org/abs/2111.06979
Publikováno v:
In Cognition December 2024 253
Autor:
Gan, Chuang, Zhou, Siyuan, Schwartz, Jeremy, Alter, Seth, Bhandwaldar, Abhishek, Gutfreund, Dan, Yamins, Daniel L. K., DiCarlo, James J, McDermott, Josh, Torralba, Antonio, Tenenbaum, Joshua B.
We introduce a visually-guided and physics-driven task-and-motion planning benchmark, which we call the ThreeDWorld Transport Challenge. In this challenge, an embodied agent equipped with two 9-DOF articulated arms is spawned randomly in a simulated
Externí odkaz:
http://arxiv.org/abs/2103.14025
Autor:
Saddler, Mark R., Francl, Andrew, Feather, Jenelle, Qian, Kaizhi, Zhang, Yang, McDermott, Josh H.
Contemporary speech enhancement predominantly relies on audio transforms that are trained to reconstruct a clean speech waveform. The development of high-performing neural network sound recognition systems has raised the possibility of using deep fea
Externí odkaz:
http://arxiv.org/abs/2011.10706
Autor:
Gan, Chuang, Schwartz, Jeremy, Alter, Seth, Mrowca, Damian, Schrimpf, Martin, Traer, James, De Freitas, Julian, Kubilius, Jonas, Bhandwaldar, Abhishek, Haber, Nick, Sano, Megumi, Kim, Kuno, Wang, Elias, Lingelbach, Michael, Curtis, Aidan, Feigelis, Kevin, Bear, Daniel M., Gutfreund, Dan, Cox, David, Torralba, Antonio, DiCarlo, James J., Tenenbaum, Joshua B., McDermott, Josh H., Yamins, Daniel L. K.
We introduce ThreeDWorld (TDW), a platform for interactive multi-modal physical simulation. TDW enables simulation of high-fidelity sensory data and physical interactions between mobile agents and objects in rich 3D environments. Unique properties in
Externí odkaz:
http://arxiv.org/abs/2007.04954
Autor:
Stephenson, Cory, Feather, Jenelle, Padhy, Suchismita, Elibol, Oguz, Tang, Hanlin, McDermott, Josh, Chung, SueYeon
Encouraged by the success of deep neural networks on a variety of visual tasks, much theoretical and experimental work has been aimed at understanding and interpreting how vision networks operate. Meanwhile, deep neural networks have also achieved im
Externí odkaz:
http://arxiv.org/abs/2003.01787
Segmenting objects in images and separating sound sources in audio are challenging tasks, in part because traditional approaches require large amounts of labeled data. In this paper we develop a neural network model for visual object segmentation and
Externí odkaz:
http://arxiv.org/abs/1904.09013
Publikováno v:
In Cognition March 2023 232