Zobrazeno 1 - 10
of 21
pro vyhledávání: '"Vasudevan, Arun Balajee"'
Motion planning is crucial for safe navigation in complex urban environments. Historically, motion planners (MPs) have been evaluated with procedurally-generated simulators like CARLA. However, such synthetic benchmarks do not capture real-world mult
Externí odkaz:
http://arxiv.org/abs/2406.10714
Autor:
Yang, Mengyu, Grady, Patrick, Brahmbhatt, Samarth, Vasudevan, Arun Balajee, Kemp, Charles C., Hays, James
How easy is it to sneak up on a robot? We examine whether we can detect people using only the incidental sounds they produce as they move, even when they try to be quiet. We collect a robotic dataset of high-quality 4-channel audio paired with 360 de
Externí odkaz:
http://arxiv.org/abs/2310.03743
Different self-supervised tasks (SSL) reveal different features from the data. The learned feature representations can exhibit different performance for each downstream task. In this light, this work aims to combine Multiple SSL tasks (Multi-SSL) tha
Externí odkaz:
http://arxiv.org/abs/2201.01046
Humans can robustly recognize and localize objects by using visual and/or auditory cues. While machines are able to do the same with visual data already, less work has been done with sounds. This work develops an approach for scene understanding pure
Externí odkaz:
http://arxiv.org/abs/2109.02763
Humans can robustly recognize and localize objects by integrating visual and auditory cues. While machines are able to do the same now with images, less work has been done with sounds. This work develops an approach for dense semantic labelling of so
Externí odkaz:
http://arxiv.org/abs/2003.04210
The role of robots in society keeps expanding, bringing with it the necessity of interacting and communicating with humans. In order to keep such interaction intuitive, we provide automatic wayfinding based on verbal navigational instructions. Our fi
Externí odkaz:
http://arxiv.org/abs/1910.02029
We investigate the problem of object referring (OR) i.e. to localize a target object in a visual scene coming with a language description. Humans perceive the world more as continued video snippets than as static images, and describe objects not only
Externí odkaz:
http://arxiv.org/abs/1801.01582
Object referring has important applications, especially for human-machine interaction. While having received great attention, the task is mainly attacked with written language (text) as input rather than spoken language (speech), which is more natura
Externí odkaz:
http://arxiv.org/abs/1711.03800
Although the problem of automatic video summarization has recently received a lot of attention, the problem of creating a video summary that also highlights elements relevant to a search query has been less studied. We address this problem by posing
Externí odkaz:
http://arxiv.org/abs/1705.00581
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.