Regular Papers

International Journal of Control, Automation, and Systems 2024; 22(5): 1613-1623

https://doi.org/10.1007/s12555-022-1218-x

© The International Journal of Control, Automation, and Systems

Quadrupedal Locomotion in an Energy-efficient Way Based on Reinforcement Learning

Tiantian Hao*, De Xu, and Shaohua Yan

Chinese Academy of Sciences

Abstract

Achieving energy-efficient motion is important for the application of quadruped robots in a wide range. In this paper, we propose a hierarchical control framework that combines reinforcement learning and virtual model control to achieve energy-efficient motion with a planned gait. A reinforcement learning network is designed to learn the policy that maps the state of the robot to the action. The action is the increment of stance ratio, one of the gait parameters. The learned policy network is used as a high-level gait parameter modulator to adjust the gait parameters according to the body’s velocity. The virtual model control method is used to compute the required force of robot’s body. Then this force is decomposed to the feet of the stance legs with quadratic programming optimization. In the lowest level, the proportional-derivative controllers are used to control the joints’ motion. Simulation and experiments are well conducted on the robot A1. The experimental results verify the effectiveness of the proposed method.

Keywords Energy-efficient motion, quadruped robot, reinforcement learning, virtual model control.

Article

Regular Papers

International Journal of Control, Automation, and Systems 2024; 22(5): 1613-1623

Published online May 1, 2024 https://doi.org/10.1007/s12555-022-1218-x

Copyright © The International Journal of Control, Automation, and Systems.

Quadrupedal Locomotion in an Energy-efficient Way Based on Reinforcement Learning

Tiantian Hao*, De Xu, and Shaohua Yan

Chinese Academy of Sciences

Abstract

Achieving energy-efficient motion is important for the application of quadruped robots in a wide range. In this paper, we propose a hierarchical control framework that combines reinforcement learning and virtual model control to achieve energy-efficient motion with a planned gait. A reinforcement learning network is designed to learn the policy that maps the state of the robot to the action. The action is the increment of stance ratio, one of the gait parameters. The learned policy network is used as a high-level gait parameter modulator to adjust the gait parameters according to the body’s velocity. The virtual model control method is used to compute the required force of robot’s body. Then this force is decomposed to the feet of the stance legs with quadratic programming optimization. In the lowest level, the proportional-derivative controllers are used to control the joints’ motion. Simulation and experiments are well conducted on the robot A1. The experimental results verify the effectiveness of the proposed method.

Keywords: Energy-efficient motion, quadruped robot, reinforcement learning, virtual model control.

IJCAS
March 2025

Vol. 23, No. 3, pp. 683~972

Stats or Metrics

Share this article on

  • line

Related articles in IJCAS

IJCAS

eISSN 2005-4092
pISSN 1598-6446