Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration

Chenu, Alexandre; Serris, Olivier; Sigaud, Olivier; Perrin-Gilbert, Nicolas

Computer Science > Robotics

arXiv:2211.04786 (cs)

[Submitted on 9 Nov 2022 (v1), last revised 17 Apr 2023 (this version, v2)]

Title:Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration

Authors:Alexandre Chenu, Olivier Serris, Olivier Sigaud, Nicolas Perrin-Gilbert

View PDF

Abstract:Deep Reinforcement Learning has been successfully applied to learn robotic control. However, the corresponding algorithms struggle when applied to problems where the agent is only rewarded after achieving a complex task. In this context, using demonstrations can significantly speed up the learning process, but demonstrations can be costly to acquire. In this paper, we propose to leverage a sequential bias to learn control policies for complex robotic tasks using a single demonstration. To do so, our method learns a goal-conditioned policy to control a system between successive low-dimensional goals. This sequential goal-reaching approach raises a problem of compatibility between successive goals: we need to ensure that the state resulting from reaching a goal is compatible with the achievement of the following goals. To tackle this problem, we present a new algorithm called DCIL-II. We show that DCIL-II can solve with unprecedented sample efficiency some challenging simulated tasks such as humanoid locomotion and stand-up as well as fast running with a simulated Cassie robot. Our method leveraging sequentiality is a step towards the resolution of complex robotic tasks under minimal specification effort, a key feature for the next generation of autonomous robots.

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2211.04786 [cs.RO]
	(or arXiv:2211.04786v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2211.04786

Submission history

From: Nicolas Perrin-Gilbert [view email]
[v1] Wed, 9 Nov 2022 10:28:40 UTC (3,449 KB)
[v2] Mon, 17 Apr 2023 09:18:28 UTC (3,449 KB)

Computer Science > Robotics

Title:Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Robotics

Title:Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.