Výsledky vyhledávání

Report

Any2Any: Incomplete Multimodal Retrieval with Conformal Prediction

Autor: Li, Po-han, Yang, Yunhao, Omama, Mohammad, Chinchali, Sandeep, Topcu, Ufuk

Autonomous agents perceive and interpret their surroundings by integrating multimodal inputs, such as vision, audio, and LiDAR. These perceptual modalities support retrieval tasks, such as place recognition in robotics. However, current multimodal re

Externí odkaz: http://arxiv.org/abs/2411.10513

Zobrazit plný text záznamu

Report

Know Where You're Uncertain When Planning with Multimodal Foundation Models: A Formal Framework

Autor: Bhatt, Neel P., Yang, Yunhao, Siva, Rohan, Milan, Daniel, Topcu, Ufuk, Wang, Zhangyang

Multimodal foundation models offer a promising framework for robotic perception and planning by processing sensory inputs to generate actionable plans. However, addressing uncertainty in both perception (sensory interpretation) and decision-making (p

Externí odkaz: http://arxiv.org/abs/2411.01639

Zobrazit plný text záznamu

Report

Human-Agent Coordination in Games under Incomplete Information via Multi-Step Intent

Autor: Chen, Shenghui, Zhao, Ruihan, Chinchali, Sandeep, Topcu, Ufuk

Strategic coordination between autonomous agents and human partners under incomplete information can be modeled as turn-based cooperative games. We extend a turn-based game under incomplete information, the shared-control game, to allow players to ta

Externí odkaz: http://arxiv.org/abs/2410.18242

Zobrazit plný text záznamu

Report

Policies with Sparse Inter-Agent Dependencies in Dynamic Games: A Dynamic Programming Approach

Autor: Liu, Xinjie, Li, Jingqi, Fotiadis, Filippos, Karabag, Mustafa O., Milzman, Jesse, Fridovich-Keil, David, Topcu, Ufuk

Common feedback strategies in multi-agent dynamic games require all players' state information to compute control strategies. However, in real-world scenarios, sensing and communication limitations between agents make full state feedback expensive or

Externí odkaz: http://arxiv.org/abs/2410.16441

Zobrazit plný text záznamu

Report

Reasoning, Memorization, and Fine-Tuning Language Models for Non-Cooperative Games

Autor: Yang, Yunhao, Berthellemy, Leonard, Topcu, Ufuk

We develop a method that integrates the tree of thoughts and multi-agent framework to enhance the capability of pre-trained language models in solving complex, unfamiliar games. The method decomposes game-solving into four incremental tasks -- game s

Externí odkaz: http://arxiv.org/abs/2410.14890

Zobrazit plný text záznamu

Report

Joint Verification and Refinement of Language Models for Safety-Constrained Planning

Autor: Yang, Yunhao, Ward, William, Hu, Zichao, Biswas, Joydeep, Topcu, Ufuk

Although pre-trained language models can generate executable plans (e.g., programmatic policies) for solving robot tasks, the generated plans may violate task-relevant logical specifications due to the models' black-box nature. A significant gap rema

Externí odkaz: http://arxiv.org/abs/2410.14865

Zobrazit plný text záznamu

Report

CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features

Autor: Li, Po-han, Chinchali, Sandeep P., Topcu, Ufuk

Multimodal encoders like CLIP excel in tasks such as zero-shot image classification and cross-modal retrieval. However, they require excessive training data. We propose canonical similarity analysis (CSA), which uses two unimodal encoders to replicat

Externí odkaz: http://arxiv.org/abs/2410.07610

Zobrazit plný text záznamu

Report

MultiNash-PF: A Particle Filtering Approach for Computing Multiple Local Generalized Nash Equilibria in Trajectory Games

Autor: Bhatt, Maulik, Askari, Iman, Yu, Yue, Topcu, Ufuk, Fang, Huazhen, Mehr, Negar

Modern-world robotics involves complex environments where multiple autonomous agents must interact with each other and other humans. This necessitates advanced interactive multi-agent motion planning techniques. Generalized Nash equilibrium(GNE), a s

Externí odkaz: http://arxiv.org/abs/2410.05554

Zobrazit plný text záznamu

Report

Uncertainty-Guided Enhancement on Driving Perception System via Foundation Models

Autor: Yang, Yunhao, Hu, Yuxin, Ye, Mao, Zhang, Zaiwei, Lu, Zhichao, Xu, Yi, Topcu, Ufuk, Snyder, Ben

Multimodal foundation models offer promising advancements for enhancing driving perception systems, but their high computational and financial costs pose challenges. We develop a method that leverages foundation models to refine predictions from exis

Externí odkaz: http://arxiv.org/abs/2410.01144

Zobrazit plný text záznamu

Report

Basis-to-Basis Operator Learning Using Function Encoders

Autor: Ingebrand, Tyler, Thorpe, Adam J., Goswami, Somdatta, Kumar, Krishna, Topcu, Ufuk

We present Basis-to-Basis (B2B) operator learning, a novel approach for learning operators on Hilbert spaces of functions based on the foundational ideas of function encoders. We decompose the task of learning operators into two parts: learning sets

Externí odkaz: http://arxiv.org/abs/2410.00171

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání