Stochastic optimal controller design for medium access constrained networked control systems with unknown dynamics

Autor:	Luigi Glielmo, B. Subathra, Sreram Balasubramaniyan, Seshadhri Srinivasan, Valentina Emilia Balas, Hamed Kebraei
Přispěvatelé:	Balasubramaniyan, Sreram, Srinivasan, Seshadhri, Kebraei, Hamed, Subathra, B., Balas, Valentina Emilia, Glielmo, Luigi
Rok vydání:	2017
Předmět:	0209 industrial biotechnology Computer science Q-learning 02 engineering and technology Networked control systems (NCSs) 020901 industrial engineering & automation Software Control theory Artificial Intelligence 0202 electrical engineering electronic engineering information engineering Markov Decision Process (MDP) Controller design business.industry Stochastic optimal controller Control engineering Networked control system Human-Computer Interaction Constraint Dynamics (music) Control system 020201 artificial intelligence & image processing Computer Vision and Pattern Recognition Medium acce business
Zdroj:	Intelligent Decision Technologies. 11:253-264
ISSN:	1875-8843 1872-4981
DOI:	10.3233/idt-170293
Popis:	This paper proposes a stochastic optimal controller for networked control systems (NCS) with unknown dynamics and medium access constraints. The medium access constraint of NCS is modelled as a Markov Decision Process (MDP) that switches modes depending the channel access to the actuators. We then show that using the MDP assumption, the NCS with medium access constraint can be modelled as a Markovian jump linear system. Then a stochastic optimal controller is proposed that minimizes the quadratic cost function using Q-learning algorithm. The resulting control algorithm simultaneously optimizes the quadratic cost function and also allocates the network bandwidth judiciously by designing a scheduler. Two compensation strategies transmit zero and zero-order hold for control inputs that fail to get an access to channel are studied. The proposed controller and scheduler are illustrated using experiments on networks and simulations on an industrial four-tank system. The advantage of the proposed approach is that the optimal controller and scheduler can be designed forward-in-time for NCS with unknown dynamics. This is a departure from traditional dynamic programming based approaches that assume complete knowledge of the NCS dynamics and network constraints beforehand to solve the optimal controller problem backward-in-time.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6010fa56c9cb8f7c1d1dccc8eb651d9b https://doi.org/10.3233/idt-170293 Zobrazit plný text záznamu