International Journal of Control, Automation, and Systems 2024; 22(8): 2444-2454
https://doi.org/10.1007/s12555-023-0460-1
© The International Journal of Control, Automation, and Systems
This paper investigates the problem of input-constrained optimal control for nonaffine nonlinear discretetime systems in the presence of an inaccurate model. To address this problem, a bounded function is used to convert the input-constrained optimal control problem into an unconstrained counterpart. Additionally, an optimal stage cost function is introduced to quantify the discrepancy between the inaccurate model and the true system dynamics. Subsequently, a policy iteration based stage cost learning (PISCL) algorithm is proposed to obtain the optimal stage cost function and the convergence of the algorithm is proved. The proposed approach provides a new framework for addressing input-constrained control problems of nonaffine nonlinear systems with an inaccurate model, bridging the gap between model-based and data-based approximate dynamic programming techniques. Numerical experiments validate the effectiveness of the PISCL algorithm in obtaining the constrained optimal control policies for nonaffine nonlinear discrete-time systems without precise system models.
Keywords Adaptive dynamic programming, input-constrained, optimal control, stage cost function.
International Journal of Control, Automation, and Systems 2024; 22(8): 2444-2454
Published online August 1, 2024 https://doi.org/10.1007/s12555-023-0460-1
Copyright © The International Journal of Control, Automation, and Systems.
Jianfeng Wang, Yan Wang*, and Zhicheng Ji
Jiangnan University
This paper investigates the problem of input-constrained optimal control for nonaffine nonlinear discretetime systems in the presence of an inaccurate model. To address this problem, a bounded function is used to convert the input-constrained optimal control problem into an unconstrained counterpart. Additionally, an optimal stage cost function is introduced to quantify the discrepancy between the inaccurate model and the true system dynamics. Subsequently, a policy iteration based stage cost learning (PISCL) algorithm is proposed to obtain the optimal stage cost function and the convergence of the algorithm is proved. The proposed approach provides a new framework for addressing input-constrained control problems of nonaffine nonlinear systems with an inaccurate model, bridging the gap between model-based and data-based approximate dynamic programming techniques. Numerical experiments validate the effectiveness of the PISCL algorithm in obtaining the constrained optimal control policies for nonaffine nonlinear discrete-time systems without precise system models.
Keywords: Adaptive dynamic programming, input-constrained, optimal control, stage cost function.
Vol. 22, No. 9, pp. 2673~2953
Jiahui Xu, Jingcheng Wang*, Jun Rao, Yanjiu Zhong, and Shangwei Zhao
International Journal of Control, Automation and Systems 2022; 20(9): 3098-3109Dengguo Xu, Qinglin Wang, and Yuan Li*
International Journal of Control, Automation and Systems 2020; 18(5): 1116-1127Zhihui Wu*, Guo-Ping Liu, June Hu, Hui Yu, and Dongyan Chen
International Journal of Control, Automation, and Systems 2024; 22(9): 2699-2710