Show simple item record

dc.contributor.authorBrunskill, Emma
dc.contributor.authorLeffler, Bethany R.
dc.contributor.authorLi, Lihong
dc.contributor.authorLittman, Michael L.
dc.contributor.authorRoy, Nicholas
dc.date.accessioned2010-11-29T17:59:03Z
dc.date.available2010-11-29T17:59:03Z
dc.date.issued2009-08
dc.date.submitted2009-03
dc.identifier.issn1532-4435
dc.identifier.issn1533-7928
dc.identifier.urihttp://hdl.handle.net/1721.1/60042
dc.description.abstractTo quickly achieve good performance, reinforcement-learning algorithms for acting in large continuous-valued domains must use a representation that is both sufficiently powerful to capture important domain characteristics, and yet simultaneously allows generalization, or sharing, among experiences. Our algorithm balances this tradeoff by using a stochastic, switching, parametric dynamics representation. We argue that this model characterizes a number of significant, real-world domains, such as robot navigati on across varying terrain. We prove that this representational assumption allows our algorithm to be probably approximately correct with a sample complexity that scales polynomially with all problem-specific quantities including the state-space dimension. We also explicitly incorporate the error introduced by approximate planning in our sample complexity bounds, in contrast to prior Probably Approximately Correct (PAC) Markov Decision Processes (MDP) approaches, which typically assume the estimated MDP can be solved exactly. Our experimental results on constructing plans for driving to work using real car trajectory data, as well as a small robot experiment on navigating varying terrain, demonstrate that our dynamics representation enables us to capture real-world dynamics in a sufficient manner to produce good performance.en_US
dc.language.isoen_US
dc.publisherJournal of Machine Learning Researchen_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/1577069.1755851en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceN. Roy via Barbara Williamsen_US
dc.titleProvably efficient learning with typed parametric modelsen_US
dc.typeArticleen_US
dc.identifier.citationBrunskill, Emma et al. "Provably Efficient Learning with Typed Parametric Models." Journal of Machine Learning Research, 10 (December 2009), 1955-1988.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Aeronautics and Astronauticsen_US
dc.contributor.approverRoy, Nicholas
dc.contributor.mitauthorRoy, Nicholas
dc.contributor.mitauthorBrunskill, Emma
dc.relation.journalJournal of Machine Learning Researchen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsBrunskill, Emma; Leffler, Bethany R.; Li, Lihong; Littman, Michael L.; Roy, Nicholas
dc.identifier.orcidhttps://orcid.org/0000-0002-8293-0492
mit.licensePUBLISHER_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record