Výsledky vyhledávání - "Tan, Charlie B."

Report

Beyond the Boundaries of Proximal Policy Optimization

Autor: Tan, Charlie B., Toledo, Edan, Ellis, Benjamin, Foerster, Jakob N., Huszár, Ferenc

Proximal policy optimization (PPO) is a widely-used algorithm for on-policy reinforcement learning. This work offers an alternative perspective of PPO, in which it is decomposed into the inner-loop estimation of update vectors, and the outer-loop app

Externí odkaz: http://arxiv.org/abs/2411.00666

Zobrazit plný text záznamu

Report

On the Limitations of Fractal Dimension as a Measure of Generalization

Autor: Tan, Charlie B., García-Redondo, Inés, Wang, Qiquan, Bronstein, Michael M., Monod, Anthea

Bounding and predicting the generalization gap of overparameterized neural networks remains a central open problem in theoretical machine learning. There is a recent and growing body of literature that proposes the framework of fractals to model opti

Externí odkaz: http://arxiv.org/abs/2406.02234

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání