首页|Gait-Conditioned Reinforcement Learning with Multi-Phase Curriculum for Humanoid Locomotion

Gait-Conditioned Reinforcement Learning with Multi-Phase Curriculum for Humanoid Locomotion

来源：

英文摘要

We present a unified gait-conditioned reinforcement learning framework that enables humanoid robots to perform standing, walking, running, and smooth transitions within a single recurrent policy. A compact reward routing mechanism dynamically activates gait-specific objectives based on a one-hot gait ID, mitigating reward interference and supporting stable multi-gait learning. Human-inspired reward terms promote biomechanically natural motions, such as straight-knee stance and coordinated arm-leg swing, without requiring motion capture data. A structured curriculum progressively introduces gait complexity and expands command space over multiple phases. In simulation, the policy successfully achieves robust standing, walking, running, and gait transitions. On the real Unitree G1 humanoid, we validate standing, walking, and walk-to-stand transitions, demonstrating stable and coordinated locomotion. This work provides a scalable, reference-free solution toward versatile and naturalistic humanoid control across diverse modes and environments.

作者：Tianhu Peng、Lingfan Bao、CHengxu Zhou

作者单位：

学科分类：自动化技术、自动化技术设备计算技术、计算机技术

推荐引用：Tianhu Peng,Lingfan Bao,CHengxu Zhou.Gait-Conditioned Reinforcement Learning with Multi-Phase Curriculum for Humanoid Locomotion[EB/OL].(2025-05-26)[2025-06-07].https://arxiv.org/abs/2505.20619.点此复制

Gait-Conditioned Reinforcement Learning with Multi-Phase Curriculum for Humanoid Locomotion

Gait-Conditioned Reinforcement Learning with Multi-Phase Curriculum for Humanoid Locomotion

评论