Regular Papers

International Journal of Control, Automation, and Systems 2023; 21(9): 3022-3035

https://doi.org/10.1007/s12555-021-0904-4

© The International Journal of Control, Automation, and Systems

A Behavior-based Adaptive Dynamic Programming Method for Multiple Mobile Manipulators Coordination Control

Zhenyi Zhang, Jianfei Chen, Zhibin Mo, Yutao Chen, and Jie Huang*

Fuzhou University

Abstract

In this work, a behavior-based adaptive dynamic programming (BADP) method is proposed for the coordination control of unmanned ground vehicle-manipulator systems (UGVMs). Through a null-space-based behavioral control (NSBC) framework, the multi-objective coordination control is transformed into a single-objective tracking control at the mission layer. Since cost functions and control constraints are simplified at the control layer, the complexity of controller design is reduced. Then, an identifier-actor-critic reinforcement learning algorithm framework is introduced to learn the optimal control policy by balancing the control performance and consumption. Simulation results show that control costs are reduced by around 13.5% per sampling period compared to existing multiple objective control methods. Finally, the BADP method is experimentally validated using four real UGVMs.

Keywords Adaptive dynamic programming, behavioral control, multi-objective mission, UGVMs.

Article

Regular Papers

International Journal of Control, Automation, and Systems 2023; 21(9): 3022-3035

Published online September 1, 2023 https://doi.org/10.1007/s12555-021-0904-4

Copyright © The International Journal of Control, Automation, and Systems.

A Behavior-based Adaptive Dynamic Programming Method for Multiple Mobile Manipulators Coordination Control

Zhenyi Zhang, Jianfei Chen, Zhibin Mo, Yutao Chen, and Jie Huang*

Fuzhou University

Abstract

In this work, a behavior-based adaptive dynamic programming (BADP) method is proposed for the coordination control of unmanned ground vehicle-manipulator systems (UGVMs). Through a null-space-based behavioral control (NSBC) framework, the multi-objective coordination control is transformed into a single-objective tracking control at the mission layer. Since cost functions and control constraints are simplified at the control layer, the complexity of controller design is reduced. Then, an identifier-actor-critic reinforcement learning algorithm framework is introduced to learn the optimal control policy by balancing the control performance and consumption. Simulation results show that control costs are reduced by around 13.5% per sampling period compared to existing multiple objective control methods. Finally, the BADP method is experimentally validated using four real UGVMs.

Keywords: Adaptive dynamic programming, behavioral control, multi-objective mission, UGVMs.

IJCAS
September 2024

Vol. 22, No. 9, pp. 2673~2953

Stats or Metrics

Share this article on

  • line

Related articles in IJCAS

IJCAS

eISSN 2005-4092
pISSN 1598-6446