Showing 1 - 1 of 1 for search: '"Ribas, Oswaldo"'
Author:
Agarwal, Alekh, Bird, Sarah, Cozowicz, Markus, Hoang, Luong, Langford, John, Lee, Stephen, Li, Jiaji, Melamed, Dan, Oshri, Gal, Ribas, Oswaldo, Sen, Siddhartha, Slivkins, Alex
Applications and systems are constantly faced with decisions that require picking from a set of actions based on contextual information. Reinforcement-based learning algorithms such as contextual bandits can be very effective in these settings, but a
External link:
http://arxiv.org/abs/1606.03966