dc.contributor.author	How, Jonathan P.
dc.contributor.author	Michini, Bernard J.
dc.date.accessioned	2013-10-23T16:56:46Z
dc.date.available	2013-10-23T16:56:46Z
dc.date.issued	2012-05
dc.identifier.isbn	978-1-4673-1405-3
dc.identifier.isbn	978-1-4673-1403-9
dc.identifier.isbn	978-1-4673-1578-4
dc.identifier.isbn	978-1-4673-1404-6
dc.identifier.uri	http://hdl.handle.net/1721.1/81489
dc.description.abstract	Inverse reinforcement learning (IRL) is the task of learning the reward function of a Markov Decision Process (MDP) given knowledge of the transition function and a set of expert demonstrations. While many IRL algorithms exist, Bayesian IRL [1] provides a general and principled method of reward learning by casting the problem in the Bayesian inference framework. However, the algorithm as originally presented suffers from several inefficiencies that prohibit its use for even moderate problem sizes. This paper proposes modifications to the original Bayesian IRL algorithm to improve its efficiency and tractability in situations where the state space is large and the expert demonstrations span only a small portion of it. The key insight is that the inference task should be focused on states that are similar to those encountered by the expert, as opposed to making the naive assumption that the expert demonstrations contain enough information to accurately infer the reward function over the entire state space. A modified algorithm is presented and experimental results show substantially faster convergence while maintaining the solution quality of the original method.	en_US
dc.description.sponsorship	United States. Office of Naval Research (Science of Autonomy Program Contract N000140910625)	en_US
dc.language.iso	en_US
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1109/ICRA.2012.6225241	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike 3.0	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/	en_US
dc.source	MIT web domain	en_US
dc.title	Improving the efficiency of Bayesian inverse reinforcement learning	en_US
dc.type	Article	en_US
dc.identifier.citation	Michini, Bernard, and Jonathan P. How. “Improving the efficiency of Bayesian inverse reinforcement learning.” In 2012 IEEE International Conference on Robotics and Automation, 3651-3656. Institute of Electrical and Electronics Engineers, 2012.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Aerospace Controls Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics	en_US
dc.contributor.department	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems	en_US
dc.contributor.mitauthor	Michini, Bernard J.	en_US
dc.contributor.mitauthor	How, Jonathan P.	en_US
dc.relation.journal	Proceedings of the 2012 IEEE International Conference on Robotics and Automation	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Michini, Bernard; How, Jonathan P.	en_US
dc.identifier.orcid	https://orcid.org/0000-0001-8576-1930
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete
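
The abstract above describes the key algorithmic idea: focus the Bayesian IRL inference on states similar to those encountered by the expert rather than inferring the reward over the entire state space. As a rough, hypothetical illustration of that idea (not the paper's algorithm), the sketch below assumes a small tabular MDP with transition tensor P[s, a, s'], a Boltzmann model of the expert as in Bayesian IRL, and a caller-supplied list, relevant, of state indices considered similar to the demonstrated states; all names and parameters here are illustrative, not taken from the paper.

# Hypothetical sketch: Bayesian IRL with inference restricted to states near
# the expert demonstrations. Illustrative only; the paper's actual algorithm,
# similarity measure, and likelihood may differ.
import numpy as np

def value_iteration(P, R, gamma=0.95, iters=200):
    # Q[s, a] for state reward R[s] under transitions P[s, a, s'].
    n_s, n_a, _ = P.shape
    Q = np.zeros((n_s, n_a))
    for _ in range(iters):
        V = Q.max(axis=1)
        Q = R[:, None] + gamma * (P @ V)
    return Q

def demo_log_likelihood(Q, demos, beta=2.0):
    # Boltzmann (softmax) expert model, as in Bayesian IRL.
    logp = 0.0
    for s, a in demos:
        logits = beta * Q[s]
        logp += logits[a] - np.logaddexp.reduce(logits)
    return logp

def bayesian_irl_mcmc(P, demos, relevant, n_samples=500, step=0.1, rng=None):
    # Metropolis sampling over rewards, but only states in `relevant`
    # (e.g. states judged similar to the demonstrations) are perturbed;
    # all other states keep a fixed reward of zero, shrinking the space
    # the sampler must explore.
    rng = np.random.default_rng(rng)
    R = np.zeros(P.shape[0])
    logp = demo_log_likelihood(value_iteration(P, R), demos)
    samples = []
    for _ in range(n_samples):
        R_new = R.copy()
        s = rng.choice(relevant)              # perturb one relevant state only
        R_new[s] += rng.normal(scale=step)
        logp_new = demo_log_likelihood(value_iteration(P, R_new), demos)
        if np.log(rng.uniform()) < logp_new - logp:   # flat prior assumed
            R, logp = R_new, logp_new
        samples.append(R.copy())
    return np.mean(samples, axis=0)           # posterior-mean reward estimate

# Tiny usage example on a random 3-state, 2-action MDP (purely illustrative).
if __name__ == "__main__":
    gen = np.random.default_rng(0)
    P = gen.dirichlet(np.ones(3), size=(3, 2))   # each P[s, a, :] sums to 1
    demos = [(0, 1), (1, 1), (2, 0)]             # made-up (state, action) pairs
    print(bayesian_irl_mcmc(P, demos, relevant=[0, 1], rng=0))

The only difference from a naive sampler is that the proposal perturbs the reward only at states in relevant; states far from the demonstrations keep a fixed reward, which is the kind of restriction of the inference task that the abstract describes.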

