Výsledky vyhledávání

Report

Autor: Alamdari, Parand A., Ebadian, Soroush, Procaccia, Ariel D.

We consider the challenge of AI value alignment with multiple individuals that have different reward functions and optimal policies in an underlying Markov decision process. We formalize this problem as one of policy aggregation, where the goal is to

Externí odkaz: http://arxiv.org/abs/2411.03651

Zobrazit plný text záznamu

Report

Honor Among Bandits: No-Regret Learning for Online Fair Division

Autor: Procaccia, Ariel D., Schiffer, Benjamin, Zhang, Shirley

We consider the problem of online fair division of indivisible goods to players when there are a finite number of types of goods and player values are drawn from distributions with unknown means. Our goal is to maximize social welfare subject to allo

Externí odkaz: http://arxiv.org/abs/2407.01795

Zobrazit plný text záznamu

Kniha

Organismic animal biology : an evolutionary approach / Ariel D. Chipman.

Autor: Chipman, Ariel D.

Report

Federated Assemblies

Autor: Halpern, Daniel, Procaccia, Ariel D., Shapiro, Ehud, Talmon, Nimrod

A citizens' assembly is a group of people who are randomly selected to represent a larger population in a deliberation. While this approach has successfully strengthened democracy, it has certain limitations that suggest the need for assemblies to fo

Externí odkaz: http://arxiv.org/abs/2405.19129

Zobrazit plný text záznamu

Report

Learning Social Welfare Functions

Autor: Pardeshi, Kanad Shrikar, Shapira, Itai, Procaccia, Ariel D., Singh, Aarti

Is it possible to understand or imitate a policy maker's rationale by looking at past decisions they made? We formalize this question as the problem of learning social welfare functions belonging to the well-studied family of power mean functions. We

Externí odkaz: http://arxiv.org/abs/2405.17700

Zobrazit plný text záznamu

Report

Bias Detection Via Signaling

Autor: Chen, Yiling, Lin, Tao, Procaccia, Ariel D., Ramdas, Aaditya, Shapira, Itai

We introduce and study the problem of detecting whether an agent is updating their prior beliefs given new evidence in an optimal way that is Bayesian, or whether they are biased towards their own prior. In our model, biased agents form posterior bel

Externí odkaz: http://arxiv.org/abs/2405.17694

Zobrazit plný text záznamu

Report

Axioms for AI Alignment from Human Feedback

Autor: Ge, Luise, Halpern, Daniel, Micha, Evi, Procaccia, Ariel D., Shapira, Itai, Vorobeychik, Yevgeniy, Wu, Junlin

In the context of reinforcement learning from human feedback (RLHF), the reward function is generally derived from maximum likelihood estimation of a random utility model based on pairwise comparisons made by humans. The problem of learning a reward

Externí odkaz: http://arxiv.org/abs/2405.14758

Zobrazit plný text záznamu

Elektronická kniha

Organismic Animal Biology : An Evolutionary Approach

Autor: Chipman, Ariel D., author

Externí odkaz: https://doi.org/10.1093/oso/9780192893581.001.0001

Zobrazit plný text záznamu

Report

Multi-Apartment Rent Division

Autor: Procaccia, Ariel D., Schiffer, Benjamin, Zhang, Shirley

Rent division is the well-studied problem of fairly assigning rooms and dividing rent among a set of roommates within a single apartment. A shortcoming of existing solutions is that renters are assumed to be considering apartments in isolation, where

Externí odkaz: http://arxiv.org/abs/2403.08051

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání