International Journal of Control, Automation, and Systems 2024; 22(5): 1613-1623
https://doi.org/10.1007/s12555-022-1218-x
© The International Journal of Control, Automation, and Systems
Achieving energy-efficient motion is important for the application of quadruped robots in a wide range. In this paper, we propose a hierarchical control framework that combines reinforcement learning and virtual model control to achieve energy-efficient motion with a planned gait. A reinforcement learning network is designed to learn the policy that maps the state of the robot to the action. The action is the increment of stance ratio, one of the gait parameters. The learned policy network is used as a high-level gait parameter modulator to adjust the gait parameters according to the body’s velocity. The virtual model control method is used to compute the required force of robot’s body. Then this force is decomposed to the feet of the stance legs with quadratic programming optimization. In the lowest level, the proportional-derivative controllers are used to control the joints’ motion. Simulation and experiments are well conducted on the robot A1. The experimental results verify the effectiveness of the proposed method.
Keywords Energy-efficient motion, quadruped robot, reinforcement learning, virtual model control.
International Journal of Control, Automation, and Systems 2024; 22(5): 1613-1623
Published online May 1, 2024 https://doi.org/10.1007/s12555-022-1218-x
Copyright © The International Journal of Control, Automation, and Systems.
Tiantian Hao*, De Xu, and Shaohua Yan
Chinese Academy of Sciences
Achieving energy-efficient motion is important for the application of quadruped robots in a wide range. In this paper, we propose a hierarchical control framework that combines reinforcement learning and virtual model control to achieve energy-efficient motion with a planned gait. A reinforcement learning network is designed to learn the policy that maps the state of the robot to the action. The action is the increment of stance ratio, one of the gait parameters. The learned policy network is used as a high-level gait parameter modulator to adjust the gait parameters according to the body’s velocity. The virtual model control method is used to compute the required force of robot’s body. Then this force is decomposed to the feet of the stance legs with quadratic programming optimization. In the lowest level, the proportional-derivative controllers are used to control the joints’ motion. Simulation and experiments are well conducted on the robot A1. The experimental results verify the effectiveness of the proposed method.
Keywords: Energy-efficient motion, quadruped robot, reinforcement learning, virtual model control.
Vol. 23, No. 3, pp. 683~972
Youngbum Kim and Jinwhan Kim*
International Journal of Control, Automation, and Systems 2025; 23(3): 896-906Sung-Gil Park, Han-Byeol Kim, Yong-Jun Lee, Woo-Jin Ahn*, and Myo Taeg Lim*
International Journal of Control, Automation, and Systems 2025; 23(2): 449-458Yanan Fan*, Zhongcai Pei, and Zhiyong Tang
International Journal of Control, Automation, and Systems 2024; 22(4): 1360-1370