Show simple item record

dc.contributor.authorDoshi-Velez, Finale P.
dc.contributor.authorWingate, David
dc.contributor.authorRoy, Nicholas
dc.contributor.authorTenenbaum, Joshua B.
dc.date.accessioned2011-09-28T19:38:37Z
dc.date.available2011-09-28T19:38:37Z
dc.date.issued2010-12
dc.identifier.isbn9781617823800
dc.identifier.urihttp://hdl.handle.net/1721.1/66107
dc.description.abstractWe consider reinforcement learning in partially observable domains where the agent can query an expert for demonstrations. Our nonparametric Bayesian approach combines model knowledge, inferred from expert information and independent exploration, with policy knowledge inferred from expert trajectories. We introduce priors that bias the agent towards models with both simple representations and simple policies, resulting in improved policy and model learning.en_US
dc.language.isoen_US
dc.publisherNeural Information Processing Systems Foundationen_US
dc.relation.isversionofhttp://media.nips.cc/Conferences/2010/2010-NIPS-Conference-Program.pdfen_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike 3.0en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/en_US
dc.sourceMIT web domainen_US
dc.titleNonparametric Bayesian Policy Priors for Reinforcement Learningen_US
dc.typeArticleen_US
dc.identifier.citationDoshi-Velez, Finale, David Wingate, Nicholas Roy, and Joshua Tenenbaum. "Nonparametric Bayesian Policy Priors for Reinforcement Learning." Proceedings of the 24th Annual Conference on Neural Information Processing Systems, NIPS 2010, December 6-9, 2010, Vancouver, British Columbia.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Aeronautics and Astronauticsen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Brain and Cognitive Sciencesen_US
dc.contributor.departmentMassachusetts Institute of Technology. Laboratory for Information and Decision Systemsen_US
dc.contributor.approverRoy, Nicholas
dc.contributor.mitauthorRoy, Nicholas
dc.contributor.mitauthorDoshi-Velez, Finale P.
dc.contributor.mitauthorWingate, David
dc.contributor.mitauthorTenenbaum, Joshua B.
dc.relation.journalProceedings of the 24th Annual Conference on Neural Information Processing Systems, (NIPS 2010)en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
dspace.orderedauthorsDoshi-Velez, Finale; Wingate, David; Roy, Nicholas; Tenenbaum, Joshuaen_US
dc.identifier.orcidhttps://orcid.org/0000-0002-1925-2035
dc.identifier.orcidhttps://orcid.org/0000-0002-8293-0492
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record