Hamilton-Jacobi Approach for State-Constrained Differential Games and Numerical Learning Methods for Optimal Control Problems

Autor:	Gammoudi, Nidhal
Přispěvatelé:	STAR, ABES, Optimisation et commande (OC), Unité de Mathématiques Appliquées (UMA), École Nationale Supérieure de Techniques Avancées (ENSTA Paris)-École Nationale Supérieure de Techniques Avancées (ENSTA Paris), Institut Polytechnique de Paris, Hasnaa Zidani
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	Apprentissage profond Apprentissage par renforcement Optimal Control Reinforcement learning Deep learning [MATH.MATH-OC] Mathematics [math]/Optimization and Control [math.OC] [MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC] Multi-Objectif Approche Hamilton-Jacobi Jeux coopératifs Hamilton-Jacobi Approach Cooperative Games Commande optimale
Zdroj:	Optimization and Control [math.OC]. Institut Polytechnique de Paris, 2021. English. ⟨NNT : 2021IPPAE005⟩
Popis:	This thesis will focus on the study of a theoretical and numerical approach for the multi-objective control problems with state constraints. Multi-objective optimization is an important approach for modelling complex problems in order to analyse the balance between different criteria to minimize. Here, the approach that will be used is based on the theory of Hamilton-Jacobi equations. The goal is to introduce a new methodology to study the properties and compute the Pareto front for multi-objective problems using the value function of an optimal control problem. La thèse aura pour objectif d'étudier une approche théorique et numérique pour l'optimisation multi-objectif de trajectoires avec contraintes sur l’état. L'optimisation multi-objectif est une approche importante pour modéliser des problèmes complexes dans le but d’analyser le compromis entre différents critères à minimiser. Ici, l’approche qui sera utilisée est basée sur la théorie des équations de Hamilton-Jacobi. Le but est d’introduire une nouvelle méthodologie pour étudier les propriétés et calculer le front de Pareto pour les problèmes multi-objectif en utilisant la fonction valeur d’un problème de commande optimale.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::95b7f0306047869c983a9d1b53fd66a1 https://theses.hal.science/tel-03323613 Zobrazit plný text záznamu