Show simple item record

dc.contributor.authorBarry, Jennifer
dc.contributor.authorKaelbling, Leslie P.
dc.contributor.authorLozano-Perez, Tomas
dc.date.accessioned2014-10-10T17:43:26Z
dc.date.available2014-10-10T17:43:26Z
dc.date.issued2011-07
dc.identifier.isbn978-1-57735-512-0
dc.identifier.isbn978-1-57735-516-8
dc.identifier.urihttp://hdl.handle.net/1721.1/90898
dc.description.abstractThis paper presents an algorithm for finding approximately optimal policies in very large Markov decision processes by constructing a hierarchical model and then solving it approximately. It exploits factored representations to achieve compactness and efficiency and to discover connectivity properties of the domain. We provide a bound on the quality of the solutions and give asymptotic analysis of the runtimes; in addition we demonstrate performance on a collection of very large domains. Results show that the quality of resulting policies is very good and the total running times, for both creating and solving the hierarchy, are significantly less than for an optimal factored MDP solver.en_US
dc.description.sponsorshipUnited States. Office of Naval Research (ONR MURI grant N00014-09-1-1051)en_US
dc.description.sponsorshipUnited States. Air Force Office of Scientific Research (AFOSR grant AOARD-104135)en_US
dc.language.isoen_US
dc.publisherAAAI Press/International Joint Conferences on Artificial Intelligenceen_US
dc.relation.isversionofhttp://ijcai.org/papers11/Papers/IJCAI11-323.pdfen_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleDetH*: Approximate Hierarchical Solution of Large Markov Decision Processesen_US
dc.typeArticleen_US
dc.identifier.citationBarry, Jennifer, Leslie Pack Kaelbling, and Tomás Lozano-Pérez. "DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes. In 22nd 2011 International Joint Conference on Artificial Intelligence, IJCAI-11, Barcelona, Catalonia, Spain, 16–22 July 2011. AAAI Press, (2011): p.1928-1935.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorBarry, Jenniferen_US
dc.contributor.mitauthorKaelbling, Leslie P.en_US
dc.contributor.mitauthorLozano-Perez, Tomasen_US
dc.relation.journalProceedings of the 22nd 2011 International Joint Conference on Artificial Intelligenceen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsBarry, Jennifer L.; Kaelbling, Leslie Pack; Lozano-Pérez, Tomásen_US
dc.identifier.orcidhttps://orcid.org/0000-0002-8657-2450
dc.identifier.orcidhttps://orcid.org/0000-0001-6054-7145
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record