| dc.contributor.author | Barry, Jennifer | |
| dc.contributor.author | Kaelbling, Leslie P. | |
| dc.contributor.author | Lozano-Perez, Tomas | |
| dc.date.accessioned | 2014-10-10T17:43:26Z | |
| dc.date.available | 2014-10-10T17:43:26Z | |
| dc.date.issued | 2011-07 | |
| dc.identifier.isbn | 978-1-57735-512-0 | |
| dc.identifier.isbn | 978-1-57735-516-8 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/90898 | |
| dc.description.abstract | This paper presents an algorithm for finding approximately optimal policies in very large Markov decision processes by constructing a hierarchical model and then solving it approximately. It exploits factored representations to achieve compactness and efficiency and to discover connectivity properties of the domain. We provide a bound on the quality of the solutions and give asymptotic analysis of the runtimes; in addition we demonstrate performance on a collection of very large domains. Results show that the quality of resulting policies is very good and the total running times, for both creating and solving the hierarchy, are significantly less than for an optimal factored MDP solver. | en_US |
| dc.description.sponsorship | United States. Office of Naval Research (ONR MURI grant N00014-09-1-1051) | en_US |
| dc.description.sponsorship | United States. Air Force Office of Scientific Research (AFOSR grant AOARD-104135) | en_US |
| dc.language.iso | en_US | |
| dc.publisher | AAAI Press/International Joint Conferences on Artificial Intelligence | en_US |
| dc.relation.isversionof | http://ijcai.org/papers11/Papers/IJCAI11-323.pdf | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | MIT web domain | en_US |
| dc.title | DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Barry, Jennifer, Leslie Pack Kaelbling, and Tomás Lozano-Pérez. "DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes." In 22nd 2011 International Joint Conference on Artificial Intelligence, IJCAI-11, Barcelona, Catalonia, Spain, 16–22 July 2011. AAAI Press, (2011): p.1928-1935. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.contributor.mitauthor | Barry, Jennifer | en_US |
| dc.contributor.mitauthor | Kaelbling, Leslie P. | en_US |
| dc.contributor.mitauthor | Lozano-Perez, Tomas | en_US |
| dc.relation.journal | Proceedings of the 22nd 2011 International Joint Conference on Artificial Intelligence | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dspace.orderedauthors | Barry, Jennifer L.; Kaelbling, Leslie Pack; Lozano-Pérez, Tomás | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0002-8657-2450 | |
| dc.identifier.orcid | https://orcid.org/0000-0001-6054-7145 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |
| mit.metadata.status | Complete | |