Regular Papers

International Journal of Control, Automation, and Systems 2023; 21(9): 3068-3079

https://doi.org/10.1007/s12555-022-0355-6

© The International Journal of Control, Automation, and Systems

Reinforcement Learning for Input Constrained Sub-optimal Tracking Control in Discrete-time Two-time-scale Systems

Xuejie Que, Zhenlei Wang*, and Xin Wang

East China University of Science and Technology

Abstract

Two-time-scale (TTS) systems were proposed to describe accurately complex systems that include multiple variables running on two-time scales. Different response speeds of variables and incomplete model information affect the tracking performance of TTS systems. For tracking control of an unknown model, the practicability of reinforcement learning (RL) has been subject to criticism, as the method requires a stable initial policy. Based on singular perturbation theory (SPT), a composite sub-optimal tracking policy is investigated combining model information with measured data. Besides, a selection criterion for the initial stabilizing policy is presented by considering the policy as an input constraint. The proposed method integrating RL technique with convex optimization improves the tracking performance and practicability effectively. Finally, an emulation experiment in F-8 aircraft is given to demonstrate the validity of the developed method.

Keywords Convex optimization, input constrained, reinforcement learning, sub-optimal tracking control, twotime-scale system.

Article

Regular Papers

International Journal of Control, Automation, and Systems 2023; 21(9): 3068-3079

Published online September 1, 2023 https://doi.org/10.1007/s12555-022-0355-6

Copyright © The International Journal of Control, Automation, and Systems.

Reinforcement Learning for Input Constrained Sub-optimal Tracking Control in Discrete-time Two-time-scale Systems

Xuejie Que, Zhenlei Wang*, and Xin Wang

East China University of Science and Technology

Abstract

Two-time-scale (TTS) systems were proposed to describe accurately complex systems that include multiple variables running on two-time scales. Different response speeds of variables and incomplete model information affect the tracking performance of TTS systems. For tracking control of an unknown model, the practicability of reinforcement learning (RL) has been subject to criticism, as the method requires a stable initial policy. Based on singular perturbation theory (SPT), a composite sub-optimal tracking policy is investigated combining model information with measured data. Besides, a selection criterion for the initial stabilizing policy is presented by considering the policy as an input constraint. The proposed method integrating RL technique with convex optimization improves the tracking performance and practicability effectively. Finally, an emulation experiment in F-8 aircraft is given to demonstrate the validity of the developed method.

Keywords: Convex optimization, input constrained, reinforcement learning, sub-optimal tracking control, twotime-scale system.

IJCAS
September 2024

Vol. 22, No. 9, pp. 2673~2953

Stats or Metrics

Share this article on

  • line

Related articles in IJCAS

IJCAS

eISSN 2005-4092
pISSN 1598-6446