Abstract
Previous studies have demonstrated agile and robust locomotion for quadrupedal robots over challenging terrain. However, the bipedal locomotion mode of quadruped robots remains largely unverified. This paper explores adapting a learning framework originally designed for quadrupedal robots to blind locomotion in bipedal mode. We leverage a framework that combines Adversarial Motion Priors with a teacher-student policy to imitate a reference trajectory while traversing rough terrain. We transfer and evaluate this learning framework on a quadruped robot in bipedal mode, aiming for stable walking on both flat and complex terrain. Simulation results demonstrate that the trained policy enables the quadruped robot to traverse both flat ground and challenging terrain, including stairs and uneven surfaces.
This work was supported by the Royal Society [grant number RG\R2\232409] and the UKRI Future Leaders Fellowship [grant number MR/V025333/1]. For a visual overview of our framework and results, please refer to the video at https://youtu.be/JYD1RlrQRWM.
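For readers unfamiliar with the Adversarial Motion Priors (AMP) formulation the abstract builds on, the sketch below illustrates the standard AMP-style reward: a discriminator scores state transitions against reference-motion data, and its output is mapped to a bounded style reward that is combined with a task reward during reinforcement-learning training. This is a minimal PyTorch sketch of the published AMP formulation, not the authors' implementation; the network sizes and reward weights are illustrative assumptions.

```python
# Minimal sketch (assumption, not the authors' code) of the standard AMP-style
# reward. A discriminator D scores state transitions (s, s') against
# reference-motion transitions; its output is mapped to a bounded style reward
# and mixed with a task reward for policy optimization (e.g. PPO).

import torch
import torch.nn as nn

class AMPDiscriminator(nn.Module):
    """Scores (s, s') transitions; trained to output +1 on reference data, -1 on policy data."""
    def __init__(self, transition_dim: int, hidden: int = 256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(transition_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, s: torch.Tensor, s_next: torch.Tensor) -> torch.Tensor:
        return self.net(torch.cat([s, s_next], dim=-1)).squeeze(-1)

def amp_style_reward(disc: AMPDiscriminator, s, s_next) -> torch.Tensor:
    """r_style = max(0, 1 - 0.25 * (D(s, s') - 1)^2), bounded in [0, 1]."""
    d = disc(s, s_next)
    return torch.clamp(1.0 - 0.25 * (d - 1.0) ** 2, min=0.0)

def total_reward(r_task, r_style, w_task: float = 0.5, w_style: float = 0.5):
    """Weighted sum of task and style terms (the weights here are illustrative)."""
    return w_task * r_task + w_style * r_style
```

In a teacher-student setup such as the one described in the abstract, a reward of this form would typically drive the privileged teacher policy, with the blind student later distilled from it using proprioceptive observations only; that distillation step is not shown here.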
Copyright information
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
Cite this paper
Peng, T., Bao, L., Humphreys, J., Delfaki, A.M., Kanoulas, D., Zhou, C. (2025). Learning Bipedal Walking on a Quadruped Robot via Adversarial Motion Priors. In: Huda, M.N., Wang, M., Kalganova, T. (eds) Towards Autonomous Robotic Systems. TAROS 2024. Lecture Notes in Computer Science, vol 15052. Springer, Cham. https://doi.org/10.1007/978-3-031-72062-8_11
DOI: https://doi.org/10.1007/978-3-031-72062-8_11