Výsledky vyhledávání - "Wang, Tonghan"

Report

On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow

Autor: Wang, Tonghan, Dong, Heng, Jiang, Yanchen, Parkes, David C., Tambe, Milind

Multiagent systems grapple with partial observability (PO), and the decentralized POMDP (Dec-POMDP) model highlights the fundamental nature of this challenge. Whereas recent approaches to address PO have appealed to deep learning models, providing a

Externí odkaz: http://arxiv.org/abs/2410.13953

Zobrazit plný text záznamu

Report

The Bandit Whisperer: Communication Learning for Restless Bandits

Autor: Zhao, Yunfan, Wang, Tonghan, Nagaraj, Dheeraj, Taneja, Aparna, Tambe, Milind

Applying Reinforcement Learning (RL) to Restless Multi-Arm Bandits (RMABs) offers a promising avenue for addressing allocation problems with resource constraints and temporal dynamics. However, classic RMAB models largely overlook the challenges of (

Externí odkaz: http://arxiv.org/abs/2408.05686

Zobrazit plný text záznamu

Report

Principal-Agent Reinforcement Learning: Orchestrating AI Agents with Contracts

Autor: Ivanov, Dima, Dütting, Paul, Talgam-Cohen, Inbal, Wang, Tonghan, Parkes, David C.

The increasing deployment of AI is shaping the future landscape of the internet, which is set to become an integrated ecosystem of AI agents. Orchestrating the interaction among AI agents necessitates decentralized, self-sustaining mechanisms that ha

Externí odkaz: http://arxiv.org/abs/2407.18074

Zobrazit plný text záznamu

Report

GemNet: Menu-Based, Strategy-Proof Multi-Bidder Auctions Through Deep Learning

Autor: Wang, Tonghan, Jiang, Yanchen, Parkes, David C.

Automated mechanism design (AMD) uses computational methods for mechanism design. Differentiable economics is a form of AMD that uses deep learning to learn mechanism designs and has enabled strong progress in AMD in recent years. Nevertheless, a maj

Externí odkaz: http://arxiv.org/abs/2406.07428

Zobrazit plný text záznamu

Report

Social Environment Design

Autor: Zhang, Edwin, Zhao, Sadie, Wang, Tonghan, Hossain, Safwan, Gasztowtt, Henry, Zheng, Stephan, Parkes, David C., Tambe, Milind, Chen, Yiling

Artificial Intelligence (AI) holds promise as a technology that can be used to improve government and economic policy-making. This paper proposes a new research agenda towards this end by introducing Social Environment Design, a general framework for

Externí odkaz: http://arxiv.org/abs/2402.14090

Zobrazit plný text záznamu

Report

Multi-Sender Persuasion: A Computational Perspective

Autor: Hossain, Safwan, Wang, Tonghan, Lin, Tao, Chen, Yiling, Parkes, David C., Xu, Haifeng

We consider the multi-sender persuasion problem: multiple players with informational advantage signal to convince a single self-interested actor to take certain actions. This problem generalizes the seminal Bayesian Persuasion framework and is ubiqui

Externí odkaz: http://arxiv.org/abs/2402.04971

Zobrazit plný text záznamu

Report

Never Explore Repeatedly in Multi-Agent Reinforcement Learning

Autor: Li, Chenghao, Wang, Tonghan, Zhang, Chongjie, Zhao, Qianchuan

In the realm of multi-agent reinforcement learning, intrinsic motivations have emerged as a pivotal tool for exploration. While the computation of many intrinsic rewards relies on estimating variational posteriors using neural network approximators,

Externí odkaz: http://arxiv.org/abs/2308.09909

Zobrazit plný text záznamu

Report

Deep Contract Design via Discontinuous Networks

Autor: Wang, Tonghan, Dütting, Paul, Ivanov, Dmitry, Talgam-Cohen, Inbal, Parkes, David C.

Publikováno v: NeurIPS 2023

Contract design involves a principal who establishes contractual agreements about payments for outcomes that arise from the actions of an agent. In this paper, we initiate the study of deep learning for the automated design of optimal contracts. We i

Externí odkaz: http://arxiv.org/abs/2307.02318

Zobrazit plný text záznamu

Report

Symmetry-Aware Robot Design with Structured Subgroups

Autor: Dong, Heng, Zhang, Junyu, Wang, Tonghan, Zhang, Chongjie

Robot design aims at learning to create robots that can be easily controlled and perform tasks efficiently. Previous works on robot design have proven its ability to generate robots for various tasks. However, these works searched the robots directly

Externí odkaz: http://arxiv.org/abs/2306.00036

Zobrazit plný text záznamu

Report

Non-Linear Coordination Graphs

Autor: Kang, Yipeng, Wang, Tonghan, Wu, Xiaoran, Yang, Qianlan, Zhang, Chongjie

Publikováno v: NeurIPS 2022

Value decomposition multi-agent reinforcement learning methods learn the global value function as a mixing of each agent's individual utility functions. Coordination graphs (CGs) represent a higher-order decomposition by incorporating pairwise payoff

Externí odkaz: http://arxiv.org/abs/2211.08404

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání