Deep neural networks algorithms for stochastic control problems on finite horizon: numerical applications

Autor: Huyên Pham, Nicolas Langrené, Côme Huré, Achref Bachouch
Přispěvatelé: University of Oslo (UiO), Laboratoire de Probabilités, Statistiques et Modélisations (LPSM (UMR_8001)), Université Paris Diderot - Paris 7 (UPD7)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), Data61 [Canberra] (CSIRO), Australian National University (ANU)-Commonwealth Scientific and Industrial Research Organisation [Canberra] (CSIRO), ANR-15-CE05-0024,CAESARS,Contrôle et simulation des systèmes électriques, interaction et robustesse(2015), Laboratoire de Probabilités, Statistique et Modélisation (LPSM (UMR_8001))
Jazyk: angličtina
Rok vydání: 2020
Předmět:
Statistics and Probability
FOS: Computer and information sciences
reinforcement learning
value iteration
General Mathematics
0211 other engineering and technologies
Computational Finance (q-fin.CP)
Machine Learning (stat.ML)
02 engineering and technology
01 natural sciences
[QFIN.CP]Quantitative Finance [q-fin]/Computational Finance [q-fin.CP]
FOS: Economics and business
010104 statistics & probability
Stochastic differential equation
Quantitative Finance - Computational Finance
Quadratic equation
[STAT.ML]Statistics [stat]/Machine Learning [stat.ML]
Statistics - Machine Learning
FOS: Mathematics
Reinforcement learning
0101 mathematics
Mathematics - Optimization and Control
Mathematics
Valuation (algebra)
Stochastic control
021103 operations research
business.industry
Deep learning
Probability (math.PR)
[MATH.MATH-PR]Mathematics [math]/Probability [math.PR]
Nonlinear system
Optimization and Control (math.OC)
Artificial intelligence
Markov decision process
Policy iteration algorithm
quantization
[MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]
business
Algorithm
Mathematics - Probability
Zdroj: Methodology and Computing in Applied Probability
Methodology and Computing in Applied Probability, Springer Verlag, In press
ISSN: 1387-5841
1573-7713
Popis: This paper presents several numerical applications of deep learning-based algorithms that have been introduced in [HPBL18]. Numerical and comparative tests using TensorFlow illustrate the performance of our different algorithms, namely control learning by performance iteration (algorithms NNcontPI and ClassifPI), control learning by hybrid iteration (algorithms Hybrid-Now and Hybrid-LaterQ), on the 100-dimensional nonlinear PDEs examples from [EHJ17] and on quadratic backward stochastic differential equations as in [CR16]. We also performed tests on low-dimension control problems such as an option hedging problem in finance, as well as energy storage problems arising in the valuation of gas storage and in microgrid management. Numerical results and comparisons to quantization-type algorithms Qknn, as an efficient algorithm to numerically solve low-dimensional control problems, are also provided; and some corresponding codes are available on https://github.com/comeh/.
Comment: 39 pages, 14 figures. Methodology and Computing in Applied Probability, Springer Verlag, In press
Databáze: OpenAIRE