Efficient reinforcement learning for robots using informative simulated priors

Cutler, Mark; How, Jonathan P.

dc.contributor.author	Cutler, Mark Johnson
dc.contributor.author	How, Jonathan P
dc.date.accessioned	2017-05-24T13:02:07Z
dc.date.available	2017-05-24T13:02:07Z
dc.date.issued	2015-07
dc.date.submitted	2015-05
dc.identifier.issn	1050-4729
dc.identifier.uri	http://hdl.handle.net/1721.1/109303
dc.description.abstract	Autonomous learning through interaction with the physical world is a promising approach to designing controllers and decision-making policies for robots. Unfortunately, learning on robots is often difficult due to the large number of samples needed for many learning algorithms. Simulators are one way to decrease the samples needed from the robot by incorporating prior knowledge of the dynamics into the learning algorithm. In this paper we present a novel method for transferring data from a simulator to a robot, using simulated data as a prior for real-world learning. A Bayesian nonparametric prior is learned from a potentially black-box simulator. The mean of this function is used as a prior for the Probabilistic Inference for Learning Control (PILCO) algorithm. The simulated prior improves the convergence rate and performance of PILCO by directing the policy search in areas of the state-space that have not yet been observed by the robot. Simulated and hardware results show the benefits of using the prior knowledge in the learning framework.	en_US
dc.language.iso	en_US
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1109/ICRA.2015.7139550	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	MIT web domain	en_US
dc.title	Efficient reinforcement learning for robots using informative simulated priors	en_US
dc.type	Article	en_US
dc.identifier.citation	Cutler, Mark and How, Jonathan P. “Efficient Reinforcement Learning for Robots Using Informative Simulated Priors.” 2015 IEEE International Conference on Robotics and Automation (ICRA), May 26-30 2015, Seattle, Washington, Institute of Electrical and Electronics Engineers (IEEE), July 2015 © 2015 Institute of Electrical and Electronics Engineers (IEEE)	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics	en_US
dc.contributor.mitauthor	Cutler, Mark Johnson
dc.contributor.mitauthor	How, Jonathan P
dc.relation.journal	2015 IEEE International Conference on Robotics and Automation (ICRA)	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Cutler, Mark; How, Jonathan P.	en_US
dspace.embargo.terms	N	en_US
dc.identifier.orcid	https://orcid.org/0000-0003-0776-7901
dc.identifier.orcid	https://orcid.org/0000-0001-8576-1930
mit.license	OPEN_ACCESS_POLICY	en_US

Files in this item

Name:: How_Efficient reinforcement.pdf
Size:: 679.4Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record