Zobrazeno 1 - 10
of 2 797
pro vyhledávání: '"Ariel, D."'
We consider the challenge of AI value alignment with multiple individuals that have different reward functions and optimal policies in an underlying Markov decision process. We formalize this problem as one of policy aggregation, where the goal is to
Externí odkaz:
http://arxiv.org/abs/2411.03651
We consider the problem of online fair division of indivisible goods to players when there are a finite number of types of goods and player values are drawn from distributions with unknown means. Our goal is to maximize social welfare subject to allo
Externí odkaz:
http://arxiv.org/abs/2407.01795
A citizens' assembly is a group of people who are randomly selected to represent a larger population in a deliberation. While this approach has successfully strengthened democracy, it has certain limitations that suggest the need for assemblies to fo
Externí odkaz:
http://arxiv.org/abs/2405.19129
Is it possible to understand or imitate a policy maker's rationale by looking at past decisions they made? We formalize this question as the problem of learning social welfare functions belonging to the well-studied family of power mean functions. We
Externí odkaz:
http://arxiv.org/abs/2405.17700
We introduce and study the problem of detecting whether an agent is updating their prior beliefs given new evidence in an optimal way that is Bayesian, or whether they are biased towards their own prior. In our model, biased agents form posterior bel
Externí odkaz:
http://arxiv.org/abs/2405.17694
Autor:
Ge, Luise, Halpern, Daniel, Micha, Evi, Procaccia, Ariel D., Shapira, Itai, Vorobeychik, Yevgeniy, Wu, Junlin
In the context of reinforcement learning from human feedback (RLHF), the reward function is generally derived from maximum likelihood estimation of a random utility model based on pairwise comparisons made by humans. The problem of learning a reward
Externí odkaz:
http://arxiv.org/abs/2405.14758
Rent division is the well-studied problem of fairly assigning rooms and dividing rent among a set of roommates within a single apartment. A shortcoming of existing solutions is that renters are assumed to be considering apartments in isolation, where
Externí odkaz:
http://arxiv.org/abs/2403.08051