Shaping in Practice: Training Wheels to Learn Fast Hopping Directly in Hardware

Heim, Steve; Ruppert, Felix; Sarvestani, Alborz A.; Spröwitz, Alexander

doi:10.1109/ICRA.2018.8460984

arXiv:1709.10273 (cs)

[Submitted on 29 Sep 2017 (v1), last revised 7 Mar 2018 (this version, v2)]

Abstract: Learning instead of designing robot controllers can greatly reduce the engineering effort required, while also emphasizing robustness. Despite considerable progress in simulation, applying learning directly in hardware is still challenging, in part due to the necessity to explore potentially unstable parameters. We explore the concept of shaping the reward landscape with training wheels: temporary modifications of the physical hardware that facilitate learning. We demonstrate the concept with a robot leg mounted on a boom learning to hop fast. This proof of concept embodies typical challenges such as instability and contact, while being simple enough to empirically map out and visualize the reward landscape. Based on our results, we propose three criteria for designing effective training wheels for learning in robotics. A video synopsis can be found at this https URL.

Comments:	Accepted to the IEEE International Conference on Robotics and Automation (ICRA) 2018, 6 pages, 6 figures
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:1709.10273 [cs.RO]
	(or arXiv:1709.10273v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.1709.10273
Journal reference:	2018 IEEE International Conference on Robotics and Automation (ICRA)
Related DOI:	https://doi.org/10.1109/ICRA.2018.8460984

Submission history

From: Steve Heim
[v1] Fri, 29 Sep 2017 08:09:17 UTC (3,837 KB)
[v2] Wed, 7 Mar 2018 21:58:53 UTC (1,082 KB)