DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes

Barry, Jennifer L.; Kaelbling, Leslie Pack; Lozano-Pérez, Tomás

dc.contributor.author	Barry, Jennifer
dc.contributor.author	Kaelbling, Leslie P.
dc.contributor.author	Lozano-Perez, Tomas
dc.date.accessioned	2014-10-10T17:43:26Z
dc.date.available	2014-10-10T17:43:26Z
dc.date.issued	2011-07
dc.identifier.isbn	978-1-57735-512-0
dc.identifier.isbn	978-1-57735-516-8
dc.identifier.uri	http://hdl.handle.net/1721.1/90898
dc.description.abstract	This paper presents an algorithm for finding approximately optimal policies in very large Markov decision processes by constructing a hierarchical model and then solving it approximately. It exploits factored representations to achieve compactness and efficiency and to discover connectivity properties of the domain. We provide a bound on the quality of the solutions and give asymptotic analysis of the runtimes; in addition we demonstrate performance on a collection of very large domains. Results show that the quality of resulting policies is very good and the total running times, for both creating and solving the hierarchy, are significantly less than for an optimal factored MDP solver.	en_US
dc.description.sponsorship	United States. Office of Naval Research (ONR MURI grant N00014-09-1-1051)	en_US
dc.description.sponsorship	United States. Air Force Office of Scientific Research (AFOSR grant AOARD-104135)	en_US
dc.language.iso	en_US
dc.publisher	AAAI Press/International Joint Conferences on Artificial Intelligence	en_US
dc.relation.isversionof	http://ijcai.org/papers11/Papers/IJCAI11-323.pdf	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	MIT web domain	en_US
dc.title	DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes	en_US
dc.type	Article	en_US
dc.identifier.citation	Barry, Jennifer, Leslie Pack Kaelbling, and Tomás Lozano-Pérez. "DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes. In 22nd 2011 International Joint Conference on Artificial Intelligence, IJCAI-11, Barcelona, Catalonia, Spain, 16–22 July 2011. AAAI Press, (2011): p.1928-1935.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.mitauthor	Barry, Jennifer	en_US
dc.contributor.mitauthor	Kaelbling, Leslie P.	en_US
dc.contributor.mitauthor	Lozano-Perez, Tomas	en_US
dc.relation.journal	Proceedings of the 22nd 2011 International Joint Conference on Artificial Intelligence	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Barry, Jennifer L.; Kaelbling, Leslie Pack; Lozano-Pérez, Tomás	en_US
dc.identifier.orcid	https://orcid.org/0000-0002-8657-2450
dc.identifier.orcid	https://orcid.org/0000-0001-6054-7145
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Lozano-Perez_DetH.pdf
Size:: 296.1Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record