International Journal of Control, Automation, and Systems 2023; 21(9): 3068-3079
https://doi.org/10.1007/s12555-022-0355-6
© The International Journal of Control, Automation, and Systems
Two-time-scale (TTS) systems were proposed to describe accurately complex systems that include multiple variables running on two-time scales. Different response speeds of variables and incomplete model information affect the tracking performance of TTS systems. For tracking control of an unknown model, the practicability of reinforcement learning (RL) has been subject to criticism, as the method requires a stable initial policy. Based on singular perturbation theory (SPT), a composite sub-optimal tracking policy is investigated combining model information with measured data. Besides, a selection criterion for the initial stabilizing policy is presented by considering the policy as an input constraint. The proposed method integrating RL technique with convex optimization improves the tracking performance and practicability effectively. Finally, an emulation experiment in F-8 aircraft is given to demonstrate the validity of the developed method.
Keywords Convex optimization, input constrained, reinforcement learning, sub-optimal tracking control, twotime-scale system.
International Journal of Control, Automation, and Systems 2023; 21(9): 3068-3079
Published online September 1, 2023 https://doi.org/10.1007/s12555-022-0355-6
Copyright © The International Journal of Control, Automation, and Systems.
Xuejie Que, Zhenlei Wang*, and Xin Wang
East China University of Science and Technology
Two-time-scale (TTS) systems were proposed to describe accurately complex systems that include multiple variables running on two-time scales. Different response speeds of variables and incomplete model information affect the tracking performance of TTS systems. For tracking control of an unknown model, the practicability of reinforcement learning (RL) has been subject to criticism, as the method requires a stable initial policy. Based on singular perturbation theory (SPT), a composite sub-optimal tracking policy is investigated combining model information with measured data. Besides, a selection criterion for the initial stabilizing policy is presented by considering the policy as an input constraint. The proposed method integrating RL technique with convex optimization improves the tracking performance and practicability effectively. Finally, an emulation experiment in F-8 aircraft is given to demonstrate the validity of the developed method.
Keywords: Convex optimization, input constrained, reinforcement learning, sub-optimal tracking control, twotime-scale system.
Vol. 23, No. 3, pp. 683~972
Youngbum Kim and Jinwhan Kim*
International Journal of Control, Automation, and Systems 2025; 23(3): 896-906Sung-Gil Park, Han-Byeol Kim, Yong-Jun Lee, Woo-Jin Ahn*, and Myo Taeg Lim*
International Journal of Control, Automation, and Systems 2025; 23(2): 449-458Wonseok Kang, Jaehyun Kim, Jiseong Chung, Seungwon Choi, and Tae-wan Kim*
International Journal of Control, Automation, and Systems 2025; 23(2): 405-417