Show simple item record

dc.contributor.authorGeramifard, Alborz
dc.contributor.authorWalsh, Thomas J.
dc.contributor.authorRoy, Nicholas
dc.contributor.authorHow, Jonathan P.
dc.date.accessioned2015-05-19T18:41:05Z
dc.date.available2015-05-19T18:41:05Z
dc.date.issued2013-07
dc.identifier.urihttp://hdl.handle.net/1721.1/97035
dc.description.abstractMatching pursuit (MP) methods are a promising class of feature construction algorithms for value function approximation. Yet existing MP methods require creating a pool of potential features, mandating expert knowledge or enumeration of a large feature pool, both of which hinder scalability. This paper introduces batch incremental feature dependency discovery (Batch-iFDD) as an MP method that inherits a provable convergence property. Additionally, Batch-iFDD does not require a large pool of features, leading to lower computational complexity. Empirical policy evaluation results across three domains with up to one million states highlight the scalability of Batch-iFDD over the previous state of the art MP algorithm.en_US
dc.description.sponsorshipUnited States. Office of Naval Research (Grant N00014-07-1-0749)en_US
dc.description.sponsorshipUnited States. Office of Naval Research (Grant N00014-11-1-0688)en_US
dc.language.isoen_US
dc.publisherAssociation for Uncertainty in Artificial Intelligence (AUAI)en_US
dc.relation.isversionofhttp://auai.org/uai2013/prints/papers/139.pdfen_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcearXiven_US
dc.titleBatch-iFDD for representation expansion in large MDPsen_US
dc.typeArticleen_US
dc.identifier.citationGeramifard, Alborz, Thomas J. Walsh, Nicholas Roy, and Jonathan P. How. "Batch-iFDD for representation expansion in large MDPs." 2013 Conference on Uncertainty in Artificial Intelligence (July 2013).en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Aeronautics and Astronauticsen_US
dc.contributor.departmentMassachusetts Institute of Technology. Laboratory for Information and Decision Systemsen_US
dc.contributor.mitauthorGeramifard, Alborzen_US
dc.contributor.mitauthorWalsh, Thomas J.en_US
dc.contributor.mitauthorRoy, Nicholasen_US
dc.contributor.mitauthorHow, Jonathan P.en_US
dc.relation.journalProceedings of the 2013 Conference on Uncertainty in Artificial Intelligenceen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsGeramifard, Alborz; Walsh, Thomas J.; Roy, Nicholas; How, Jonathan P.en_US
dc.identifier.orcidhttps://orcid.org/0000-0001-8576-1930
dc.identifier.orcidhttps://orcid.org/0000-0002-2508-1957
dc.identifier.orcidhttps://orcid.org/0000-0002-8293-0492
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record