Showing 1 - 10
of 3 588
for search: '"Wu A-Shan"'
Off-policy actor-critic algorithms have shown promise in deep reinforcement learning for continuous control tasks. Their success largely stems from leveraging pessimistic state-action value function updates, which effectively address function approxi
External link:
http://arxiv.org/abs/2406.03890
PAC-Bayesian analysis is a frequentist framework for incorporating prior knowledge into learning. It was inspired by Bayesian learning, which allows sequential data processing and naturally turns posteriors from one processing step into priors for th
External link:
http://arxiv.org/abs/2405.14681
Reinforcement learning for continuous control under sparse rewards is an under-explored problem despite its significance in real life. Many complex skills build on intermediate ones as prerequisites. For instance, a humanoid locomotor has to learn ho
External link:
http://arxiv.org/abs/2402.03055
Published in:
Asia Pacific Journal of Marketing and Logistics, 2024, Vol. 36, Issue 10, pp. 2411-2428.
External link:
http://www.emeraldinsight.com/doi/10.1108/APJML-10-2023-1053
The cold posterior effect (CPE) (Wenzel et al., 2020) in Bayesian deep learning shows that, for posteriors with a temperature $T<1$, the resulting posterior predictive could have better performances than the Bayesian posterior ($T=1$). As the Bayesia
External link:
http://arxiv.org/abs/2310.01189
Published in:
In Materials Today Communications January 2025 42
Author:
Wu, Xiang-Yao, Wu, Ben-Shan
According to De Broglie's idea of analogy, the relation between quantum mechanics and classical mechanics is similar to that between wave optics and geometric optics. We have given the quantum equation of the gravitational field intensity $E_g(\vec{r
External link:
http://arxiv.org/abs/2208.13748
Author:
Wu, Yi-Shan, Seldin, Yevgeny
We present a new concentration of measure inequality for sums of independent bounded random variables, which we name a split-kl inequality. The inequality is particularly well-suited for ternary random variables, which naturally show up in a variety
External link:
http://arxiv.org/abs/2206.00706
Author:
Li, Yen-Ching, Lee, Yun-Chieh, Murakami, Megumi, Huang, Yang-Hui, Hung, Tai-Ho, Wu, Yu-Shan, Ambudkar, Suresh.V., Wu, Chung-Pu
Published in:
In Biomedicine & Pharmacotherapy November 2024 180
Author:
Lin, Bing-Huan, Li, Yen-Ching, Murakami, Megumi, Wu, Yu-Shan, Huang, Yang-Hui, Hung, Tai-Ho, Ambudkar, Suresh.V., Wu, Chung-Pu
Published in:
In Biomedicine & Pharmacotherapy November 2024 180