International Journal of Control, Automation, and Systems 2023; 21(9): 3068-3079
https://doi.org/10.1007/s12555-022-0355-6
© The International Journal of Control, Automation, and Systems
Two-time-scale (TTS) systems were proposed to describe accurately complex systems that include multiple variables running on two-time scales. Different response speeds of variables and incomplete model information affect the tracking performance of TTS systems. For tracking control of an unknown model, the practicability of reinforcement learning (RL) has been subject to criticism, as the method requires a stable initial policy. Based on singular perturbation theory (SPT), a composite sub-optimal tracking policy is investigated combining model information with measured data. Besides, a selection criterion for the initial stabilizing policy is presented by considering the policy as an input constraint. The proposed method integrating RL technique with convex optimization improves the tracking performance and practicability effectively. Finally, an emulation experiment in F-8 aircraft is given to demonstrate the validity of the developed method.
Keywords Convex optimization, input constrained, reinforcement learning, sub-optimal tracking control, twotime-scale system.
International Journal of Control, Automation, and Systems 2023; 21(9): 3068-3079
Published online September 1, 2023 https://doi.org/10.1007/s12555-022-0355-6
Copyright © The International Journal of Control, Automation, and Systems.
Xuejie Que, Zhenlei Wang*, and Xin Wang
East China University of Science and Technology
Two-time-scale (TTS) systems were proposed to describe accurately complex systems that include multiple variables running on two-time scales. Different response speeds of variables and incomplete model information affect the tracking performance of TTS systems. For tracking control of an unknown model, the practicability of reinforcement learning (RL) has been subject to criticism, as the method requires a stable initial policy. Based on singular perturbation theory (SPT), a composite sub-optimal tracking policy is investigated combining model information with measured data. Besides, a selection criterion for the initial stabilizing policy is presented by considering the policy as an input constraint. The proposed method integrating RL technique with convex optimization improves the tracking performance and practicability effectively. Finally, an emulation experiment in F-8 aircraft is given to demonstrate the validity of the developed method.
Keywords: Convex optimization, input constrained, reinforcement learning, sub-optimal tracking control, twotime-scale system.
Vol. 22, No. 9, pp. 2673~2953
Tiantian Hao*, De Xu, and Shaohua Yan
International Journal of Control, Automation, and Systems 2024; 22(5): 1613-1623Haipeng Chen, Wenxing Fu, Junmin Liu, Dengxiu Yu, and Kang Chen*
International Journal of Control, Automation, and Systems 2023; 21(10): 3443-3455Sergey Pankov
International Journal of Control, Automation and Systems 2023; 21(1): 306-317