Výsledky vyhledávání - "Alamdari, Parand A."

Report

Jump Starting Bandits with LLM-Generated Prior Knowledge

Autor: Alamdari, Parand A., Cao, Yanshuai, Wilson, Kevin H.

We present substantial evidence demonstrating the benefits of integrating Large Language Models (LLMs) with a Contextual Multi-Armed Bandit framework. Contextual bandits have been widely used in recommendation systems to generate personalized suggest

Externí odkaz: http://arxiv.org/abs/2406.19317

Zobrazit plný text záznamu

Report

Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making

Autor: Alamdari, Parand A., Klassen, Toryn Q., Creager, Elliot, McIlraith, Sheila A.

Fair decision making has largely been studied with respect to a single decision. Here we investigate the notion of fairness in the context of sequential decision making where multiple stakeholders can be affected by the outcomes of decisions. We obse

Externí odkaz: http://arxiv.org/abs/2312.04772

Zobrazit plný text záznamu

Report

Be Considerate: Objectives, Side Effects, and Deciding How to Act

Autor: Alamdari, Parand Alizadeh, Klassen, Toryn Q., Icarte, Rodrigo Toro, McIlraith, Sheila A.

Recent work in AI safety has highlighted that in sequential decision making, objectives are often underspecified or incomplete. This gives discretion to the acting agent to realize the stated objective in ways that may result in undesirable outcomes.

Externí odkaz: http://arxiv.org/abs/2106.02617

Zobrazit plný text záznamu

Report

Formal Methods with a Touch of Magic

Autor: Alamdari, Parand Alizadeh, Avni, Guy, Henzinger, Thomas A., Lukina, Anna

Machine learning and formal methods have complimentary benefits and drawbacks. In this work, we address the controller-design problem with a combination of techniques from both fields. The use of black-box neural networks in deep reinforcement learni

Externí odkaz: http://arxiv.org/abs/2005.12175

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání