International Journal of Control, Automation, and Systems 2023; 21(9): 3022-3035
https://doi.org/10.1007/s12555-021-0904-4
© The International Journal of Control, Automation, and Systems
In this work, a behavior-based adaptive dynamic programming (BADP) method is proposed for the coordination control of unmanned ground vehicle-manipulator systems (UGVMs). Through a null-space-based behavioral control (NSBC) framework, the multi-objective coordination control is transformed into a single-objective tracking control at the mission layer. Since cost functions and control constraints are simplified at the control layer, the complexity of controller design is reduced. Then, an identifier-actor-critic reinforcement learning algorithm framework is introduced to learn the optimal control policy by balancing the control performance and consumption. Simulation results show that control costs are reduced by around 13.5% per sampling period compared to existing multiple objective control methods. Finally, the BADP method is experimentally validated using four real UGVMs.
Keywords Adaptive dynamic programming, behavioral control, multi-objective mission, UGVMs.
International Journal of Control, Automation, and Systems 2023; 21(9): 3022-3035
Published online September 1, 2023 https://doi.org/10.1007/s12555-021-0904-4
Copyright © The International Journal of Control, Automation, and Systems.
Zhenyi Zhang, Jianfei Chen, Zhibin Mo, Yutao Chen, and Jie Huang*
Fuzhou University
In this work, a behavior-based adaptive dynamic programming (BADP) method is proposed for the coordination control of unmanned ground vehicle-manipulator systems (UGVMs). Through a null-space-based behavioral control (NSBC) framework, the multi-objective coordination control is transformed into a single-objective tracking control at the mission layer. Since cost functions and control constraints are simplified at the control layer, the complexity of controller design is reduced. Then, an identifier-actor-critic reinforcement learning algorithm framework is introduced to learn the optimal control policy by balancing the control performance and consumption. Simulation results show that control costs are reduced by around 13.5% per sampling period compared to existing multiple objective control methods. Finally, the BADP method is experimentally validated using four real UGVMs.
Keywords: Adaptive dynamic programming, behavioral control, multi-objective mission, UGVMs.
Vol. 22, No. 9, pp. 2673~2953
Jianfeng Wang, Yan Wang*, and Zhicheng Ji
International Journal of Control, Automation, and Systems 2024; 22(8): 2444-2454Wei Wang and Changyang Feng*
International Journal of Control, Automation and Systems 2023; 21(7): 2398-2408Jin-Gang Zhao
International Journal of Control, Automation and Systems 2023; 21(4): 1349-1360