Provably efficient learning with typed parametric models

Brunskill, Emma; Leffler, Bethany R.; Li, Lihong; Littman, Michael L.; Roy, Nicholas

dc.contributor.author	Brunskill, Emma
dc.contributor.author	Leffler, Bethany R.
dc.contributor.author	Li, Lihong
dc.contributor.author	Littman, Michael L.
dc.contributor.author	Roy, Nicholas
dc.date.accessioned	2010-11-29T17:59:03Z
dc.date.available	2010-11-29T17:59:03Z
dc.date.issued	2009-08
dc.date.submitted	2009-03
dc.identifier.issn	1532-4435
dc.identifier.issn	1533-7928
dc.identifier.uri	http://hdl.handle.net/1721.1/60042
dc.description.abstract	To quickly achieve good performance, reinforcement-learning algorithms for acting in large continuous-valued domains must use a representation that is both sufficiently powerful to capture important domain characteristics, and yet simultaneously allows generalization, or sharing, among experiences. Our algorithm balances this tradeoff by using a stochastic, switching, parametric dynamics representation. We argue that this model characterizes a number of significant, real-world domains, such as robot navigati on across varying terrain. We prove that this representational assumption allows our algorithm to be probably approximately correct with a sample complexity that scales polynomially with all problem-specific quantities including the state-space dimension. We also explicitly incorporate the error introduced by approximate planning in our sample complexity bounds, in contrast to prior Probably Approximately Correct (PAC) Markov Decision Processes (MDP) approaches, which typically assume the estimated MDP can be solved exactly. Our experimental results on constructing plans for driving to work using real car trajectory data, as well as a small robot experiment on navigating varying terrain, demonstrate that our dynamics representation enables us to capture real-world dynamics in a sufficient manner to produce good performance.	en_US
dc.language.iso	en_US
dc.publisher	Journal of Machine Learning Research	en_US
dc.relation.isversionof	http://dx.doi.org/10.1145/1577069.1755851	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	N. Roy via Barbara Williams	en_US
dc.title	Provably efficient learning with typed parametric models	en_US
dc.type	Article	en_US
dc.identifier.citation	Brunskill, Emma et al. "Provably Efficient Learning with Typed Parametric Models." Journal of Machine Learning Research, 10 (December 2009), 1955-1988.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics	en_US
dc.contributor.approver	Roy, Nicholas
dc.contributor.mitauthor	Roy, Nicholas
dc.contributor.mitauthor	Brunskill, Emma
dc.relation.journal	Journal of Machine Learning Research	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dspace.orderedauthors	Brunskill, Emma; Leffler, Bethany R.; Li, Lihong; Littman, Michael L.; Roy, Nicholas
dc.identifier.orcid	https://orcid.org/0000-0002-8293-0492
mit.license	PUBLISHER_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: brunskill09a[1].pdf
Size:: 2.371Mb
Format:: PDF
Description:: Main article

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record