Deep neural networks algorithms for stochastic control problems on finite horizon: numerical applications

Autor:	Huyên Pham, Nicolas Langrené, Côme Huré, Achref Bachouch
Přispěvatelé:	University of Oslo (UiO), Laboratoire de Probabilités, Statistiques et Modélisations (LPSM (UMR_8001)), Université Paris Diderot - Paris 7 (UPD7)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), Data61 [Canberra] (CSIRO), Australian National University (ANU)-Commonwealth Scientific and Industrial Research Organisation [Canberra] (CSIRO), ANR-15-CE05-0024,CAESARS,Contrôle et simulation des systèmes électriques, interaction et robustesse(2015), Laboratoire de Probabilités, Statistique et Modélisation (LPSM (UMR_8001))
Jazyk:	angličtina
Rok vydání:	2020
Předmět:	Statistics and Probability FOS: Computer and information sciences reinforcement learning value iteration General Mathematics 0211 other engineering and technologies Computational Finance (q-fin.CP) Machine Learning (stat.ML) 02 engineering and technology 01 natural sciences [QFIN.CP]Quantitative Finance [q-fin]/Computational Finance [q-fin.CP] FOS: Economics and business 010104 statistics & probability Stochastic differential equation Quantitative Finance - Computational Finance Quadratic equation [STAT.ML]Statistics [stat]/Machine Learning [stat.ML] Statistics - Machine Learning FOS: Mathematics Reinforcement learning 0101 mathematics Mathematics - Optimization and Control Mathematics Valuation (algebra) Stochastic control 021103 operations research business.industry Deep learning Probability (math.PR) [MATH.MATH-PR]Mathematics [math]/Probability [math.PR] Nonlinear system Optimization and Control (math.OC) Artificial intelligence Markov decision process Policy iteration algorithm quantization [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC] business Algorithm Mathematics - Probability
Zdroj:	Methodology and Computing in Applied Probability Methodology and Computing in Applied Probability, Springer Verlag, In press
ISSN:	1387-5841 1573-7713
Popis:	This paper presents several numerical applications of deep learning-based algorithms that have been introduced in [HPBL18]. Numerical and comparative tests using TensorFlow illustrate the performance of our different algorithms, namely control learning by performance iteration (algorithms NNcontPI and ClassifPI), control learning by hybrid iteration (algorithms Hybrid-Now and Hybrid-LaterQ), on the 100-dimensional nonlinear PDEs examples from [EHJ17] and on quadratic backward stochastic differential equations as in [CR16]. We also performed tests on low-dimension control problems such as an option hedging problem in finance, as well as energy storage problems arising in the valuation of gas storage and in microgrid management. Numerical results and comparisons to quantization-type algorithms Qknn, as an efficient algorithm to numerically solve low-dimensional control problems, are also provided; and some corresponding codes are available on https://github.com/comeh/. Comment: 39 pages, 14 figures. Methodology and Computing in Applied Probability, Springer Verlag, In press
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::08cfc183d6d4b31d00b942d3e38a08fd https://hal.archives-ouvertes.fr/hal-01949221 Zobrazit plný text záznamu Full text from SpringerLink