Zobrazeno 1 - 10
of 10 479
pro vyhledávání: '"Topcu A"'
Autonomous agents perceive and interpret their surroundings by integrating multimodal inputs, such as vision, audio, and LiDAR. These perceptual modalities support retrieval tasks, such as place recognition in robotics. However, current multimodal re
Externí odkaz:
http://arxiv.org/abs/2411.10513
Multimodal foundation models offer a promising framework for robotic perception and planning by processing sensory inputs to generate actionable plans. However, addressing uncertainty in both perception (sensory interpretation) and decision-making (p
Externí odkaz:
http://arxiv.org/abs/2411.01639
Strategic coordination between autonomous agents and human partners under incomplete information can be modeled as turn-based cooperative games. We extend a turn-based game under incomplete information, the shared-control game, to allow players to ta
Externí odkaz:
http://arxiv.org/abs/2410.18242
Autor:
Liu, Xinjie, Li, Jingqi, Fotiadis, Filippos, Karabag, Mustafa O., Milzman, Jesse, Fridovich-Keil, David, Topcu, Ufuk
Common feedback strategies in multi-agent dynamic games require all players' state information to compute control strategies. However, in real-world scenarios, sensing and communication limitations between agents make full state feedback expensive or
Externí odkaz:
http://arxiv.org/abs/2410.16441
We develop a method that integrates the tree of thoughts and multi-agent framework to enhance the capability of pre-trained language models in solving complex, unfamiliar games. The method decomposes game-solving into four incremental tasks -- game s
Externí odkaz:
http://arxiv.org/abs/2410.14890
Although pre-trained language models can generate executable plans (e.g., programmatic policies) for solving robot tasks, the generated plans may violate task-relevant logical specifications due to the models' black-box nature. A significant gap rema
Externí odkaz:
http://arxiv.org/abs/2410.14865
Multimodal encoders like CLIP excel in tasks such as zero-shot image classification and cross-modal retrieval. However, they require excessive training data. We propose canonical similarity analysis (CSA), which uses two unimodal encoders to replicat
Externí odkaz:
http://arxiv.org/abs/2410.07610
Modern-world robotics involves complex environments where multiple autonomous agents must interact with each other and other humans. This necessitates advanced interactive multi-agent motion planning techniques. Generalized Nash equilibrium(GNE), a s
Externí odkaz:
http://arxiv.org/abs/2410.05554
Autor:
Yang, Yunhao, Hu, Yuxin, Ye, Mao, Zhang, Zaiwei, Lu, Zhichao, Xu, Yi, Topcu, Ufuk, Snyder, Ben
Multimodal foundation models offer promising advancements for enhancing driving perception systems, but their high computational and financial costs pose challenges. We develop a method that leverages foundation models to refine predictions from exis
Externí odkaz:
http://arxiv.org/abs/2410.01144
We present Basis-to-Basis (B2B) operator learning, a novel approach for learning operators on Hilbert spaces of functions based on the foundational ideas of function encoders. We decompose the task of learning operators into two parts: learning sets
Externí odkaz:
http://arxiv.org/abs/2410.00171