TY - GEN
T1 - Model Free Deep Deterministic Policy Gradient Controller for Setpoint Tracking of Non-Minimum Phase Systems
AU - Tavakkoli, Fatemeh
AU - Sarhadi, Pouria
AU - Clement, Benoit
AU - Naeem, Wasif
PY - 2024/5/22
Y1 - 2024/5/22
N2 - Deep Reinforcement Learning (DRL) techniques have received significant attention in control and decision-making algorithms. Most applications involve complex decision-making systems, justified by the algorithms' computational power and cost. While model-based versions are emerging, model-free DRL approaches are intriguing for their independence from models, yet they remain relatively less explored in terms of performance, particularly in applied control. This study conducts a thorough performance analysis comparing the data-driven DRL paradigm with a classical state feedback controller, both designed based on the same cost (reward) function of the linear quadratic regulator (LQR) problem. Twelve additional performance criteria are introduced to assess the controllers' performance, independent of the LQR problem for which they are designed. Two Deep Deterministic Policy Gradient (DDPG)-based controllers are developed, leveraging DDPG's widespread reputation. These controllers are aimed at addressing a challenging setpoint tracking problem in a Non-Minimum Phase (NMP) system. The performance and robustness of the controllers are assessed in the presence of operational challenges, including disturbance, noise, initial conditions, and model uncertainties. The findings suggest that the DDPG controller demonstrates promising behavior under rigorous test conditions. Nevertheless, further improvements are necessary for the DDPG controller to outperform classical methods in all criteria. While DRL algorithms may excel in complex environments owing to the flexibility in the reward function definition, this paper offers practical insights and a comparison framework specifically designed to evaluate these algorithms within the context of control engineering.
AB - Deep Reinforcement Learning (DRL) techniques have received significant attention in control and decision-making algorithms. Most applications involve complex decision-making systems, justified by the algorithms' computational power and cost. While model-based versions are emerging, model-free DRL approaches are intriguing for their independence from models, yet they remain relatively less explored in terms of performance, particularly in applied control. This study conducts a thorough performance analysis comparing the data-driven DRL paradigm with a classical state feedback controller, both designed based on the same cost (reward) function of the linear quadratic regulator (LQR) problem. Twelve additional performance criteria are introduced to assess the controllers' performance, independent of the LQR problem for which they are designed. Two Deep Deterministic Policy Gradient (DDPG)-based controllers are developed, leveraging DDPG's widespread reputation. These controllers are aimed at addressing a challenging setpoint tracking problem in a Non-Minimum Phase (NMP) system. The performance and robustness of the controllers are assessed in the presence of operational challenges, including disturbance, noise, initial conditions, and model uncertainties. The findings suggest that the DDPG controller demonstrates promising behavior under rigorous test conditions. Nevertheless, further improvements are necessary for the DDPG controller to outperform classical methods in all criteria. While DRL algorithms may excel in complex environments owing to the flexibility in the reward function definition, this paper offers practical insights and a comparison framework specifically designed to evaluate these algorithms within the context of control engineering.
KW - Training
KW - State feedback
KW - Uncertainty
KW - Costs
KW - Computational modeling
KW - Decision making
KW - Noise
UR - http://www.scopus.com/inward/record.url?scp=85194879026&partnerID=8YFLogxK
U2 - 10.1109/CONTROL60310.2024.10531953
DO - 10.1109/CONTROL60310.2024.10531953
M3 - Conference contribution
AN - SCOPUS:85194879026
T3 - 2024 UKACC 14th International Conference on Control, CONTROL 2024
SP - 163
EP - 168
BT - 2024 UKACC 14th International Conference on Control, CONTROL 2024
PB - Institute of Electrical and Electronics Engineers
T2 - 14th UKACC International Conference on Control, CONTROL 2024
Y2 - 10 April 2024 through 12 April 2024
ER -