Abstract
Quadruped robots operating in unpredictable environments face a high risk of falls, which requires robust self-recovery strategies. In this work, we present a deep reinforcement learning approach utilizing Proximal Policy Optimization (PPO) for autonomous self-recovery behavior on the Unitree A1 quadruped robot. Using only proprioceptive sensor data, our approach achieves effective recovery behaviors. The proposed model, trained within the IsaacLab simulation environment, demonstrates effective generalization through sim-to-sim and sim-to-real transfers, successfully recovering from falls even under unseen joint control parameters and carrying unfamiliar payloads. Validation tests in both IsaacLab and Gazebo highlight the robustness and adaptability of our approach to real-world uncertainties. Throughout the operation, the proposed method showed responsive, reliable, and robust performance, highlighting its potential for practical deployment.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2025 10th International Conference on Robotics and Automation Engineering, ICRAE 2025 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 16-21 |
| Number of pages | 6 |
| ISBN (Electronic) | 9798331550257 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 10th International Conference on Robotics and Automation Engineering, ICRAE 2025 - Haikou, China Duration: 14 Nov 2025 → 16 Nov 2025 |
Publication series
| Name | Proceedings - 2025 10th International Conference on Robotics and Automation Engineering, ICRAE 2025 |
|---|
Conference
| Conference | 10th International Conference on Robotics and Automation Engineering, ICRAE 2025 |
|---|---|
| Country/Territory | China |
| City | Haikou |
| Period | 14/11/25 → 16/11/25 |
Bibliographical note
Publisher Copyright:©2025 IEEE.
Keywords
- Deep Reinforcement Learning
- Proximal Policy Optimization (PPO)
- Quadruped Robot
- Self-Recovery
- Sim-to-Real Transfer
- Simto-Sim Transfer
Fingerprint
Dive into the research topics of 'Efficient and Robust Self-Recovery of Quadruped Robots Using Asymmetric Proximal Policy Optimization'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver