Showing 1 - 4 of 4 for search: '"Alamdari, Parand A."'
We present substantial evidence demonstrating the benefits of integrating Large Language Models (LLMs) with a Contextual Multi-Armed Bandit framework. Contextual bandits have been widely used in recommendation systems to generate personalized suggest
External link:
http://arxiv.org/abs/2406.19317
Fair decision making has largely been studied with respect to a single decision. Here we investigate the notion of fairness in the context of sequential decision making where multiple stakeholders can be affected by the outcomes of decisions. We obse
External link:
http://arxiv.org/abs/2312.04772
Recent work in AI safety has highlighted that in sequential decision making, objectives are often underspecified or incomplete. This gives discretion to the acting agent to realize the stated objective in ways that may result in undesirable outcomes.
External link:
http://arxiv.org/abs/2106.02617
Machine learning and formal methods have complementary benefits and drawbacks. In this work, we address the controller-design problem with a combination of techniques from both fields. The use of black-box neural networks in deep reinforcement learni
External link:
http://arxiv.org/abs/2005.12175