dc.contributor.author | Cutler, Mark Johnson | |
dc.contributor.author | How, Jonathan P | |
dc.date.accessioned | 2017-05-24T13:02:07Z | |
dc.date.available | 2017-05-24T13:02:07Z | |
dc.date.issued | 2015-07 | |
dc.date.submitted | 2015-05 | |
dc.identifier.issn | 1050-4729 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/109303 | |
dc.description.abstract | Autonomous learning through interaction with the physical world is a promising approach to designing controllers and decision-making policies for robots. Unfortunately, learning on robots is often difficult due to the large number of samples required by many learning algorithms. Simulators are one way to reduce the number of samples needed from the robot by incorporating prior knowledge of the dynamics into the learning algorithm. In this paper, we present a novel method for transferring data from a simulator to a robot, using simulated data as a prior for real-world learning. A Bayesian nonparametric prior is learned from a potentially black-box simulator. The mean of this function is used as a prior for the Probabilistic Inference for Learning Control (PILCO) algorithm. The simulated prior improves the convergence rate and performance of PILCO by directing the policy search in areas of the state-space that have not yet been observed by the robot. Simulated and hardware results show the benefits of using the prior knowledge in the learning framework. | en_US |
dc.language.iso | en_US | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1109/ICRA.2015.7139550 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | MIT web domain | en_US |
dc.title | Efficient reinforcement learning for robots using informative simulated priors | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Cutler, Mark and How, Jonathan P. “Efficient Reinforcement Learning for Robots Using Informative Simulated Priors.” 2015 IEEE International Conference on Robotics and Automation (ICRA), May 26-30 2015, Seattle, Washington, Institute of Electrical and Electronics Engineers (IEEE), July 2015 © 2015 Institute of Electrical and Electronics Engineers (IEEE) | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Aeronautics and Astronautics | en_US |
dc.contributor.mitauthor | Cutler, Mark Johnson | |
dc.contributor.mitauthor | How, Jonathan P | |
dc.relation.journal | 2015 IEEE International Conference on Robotics and Automation (ICRA) | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dspace.orderedauthors | Cutler, Mark; How, Jonathan P. | en_US |
dspace.embargo.terms | N | en_US |
dc.identifier.orcid | https://orcid.org/0000-0003-0776-7901 | |
dc.identifier.orcid | https://orcid.org/0000-0001-8576-1930 | |
mit.license | OPEN_ACCESS_POLICY | en_US |