Výsledky vyhledávání - "contextual bandits"

Akademický článek

Thompson Sampling for Stochastic Bandits with Noisy Contexts: An Information-Theoretic Regret Analysis

Autor: Sharu Theresa Jose, Shana Moothedath

Publikováno v: Entropy, Vol 26, Iss 7, p 606 (2024)

We study stochastic linear contextual bandits (CB) where the agent observes a noisy version of the true context through a noise channel with unknown channel parameters. Our objective is to design an action policy that can “approximate” that of a

Externí odkaz: https://doaj.org/article/771636c4bb64435781d21c35fd7c6bc3

Zobrazit plný text záznamu

Plný text ve formátu HTML

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

A Deep Contextual Bandit-Based End-to-End Slice Provisioning Approach for Efficient Allocation of 5G Network Resources

Autor: Ralph Voltaire J. Dayot, In-Ho Ra, Hyung-Jin Kim

Publikováno v: Network, Vol 2, Iss 3, Pp 370-388 (2022)

5G networks have been experiencing challenges in handling the heterogeneity and influx of user requests brought upon by the constant emergence of various services. As such, network slicing is considered one of the critical technologies for improving

Externí odkaz: https://doaj.org/article/77417bb41bc348f29aff865d1ea9c4ba

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Bayesian Contextual Bandits for Hyper Parameter Optimization

Autor: Guoxin Sui, Yong Yu

Publikováno v: IEEE Access, Vol 8, Pp 42971-42979 (2020)

Hyper parameter optimization (HPO) is a crucial step in modern machine learning systems. Bayesian optimization (BO) has shown great promise in HPO, where the parameter evaluation is conducted through a black-box optimization procedure. However, the m

Externí odkaz: https://doaj.org/article/9a6640b4d6e848a3bc30f3b81e116c3b

Zobrazit plný text záznamu

Akademický článek

A Systematic Study on Reinforcement Learning Based Applications

Autor: Keerthana Sivamayil, Elakkiya Rajasekar, Belqasem Aljafari, Srete Nikolovski, Subramaniyaswamy Vairavasundaram, Indragandhi Vairavasundaram

Publikováno v: Energies, Vol 16, Iss 3, p 1512 (2023)

We have analyzed 127 publications for this review paper, which discuss applications of Reinforcement Learning (RL) in marketing, robotics, gaming, automated cars, natural language processing (NLP), internet of things security, recommendation systems,

Externí odkaz: https://doaj.org/article/13d97b866c1f43e9ac18e47cab747ea3

Zobrazit plný text záznamu

Plný text ve formátu HTML

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání