dc.contributor.author | Konidaris, George | |
dc.contributor.author | Kaelbling, Leslie P. | |
dc.contributor.author | Lozano-Perez, Tomas | |
dc.date.accessioned | 2014-09-22T19:21:38Z | |
dc.date.available | 2014-09-22T19:21:38Z | |
dc.date.issued | 2013-06 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/90275 | |
dc.description.abstract | We consider the problem of how to plan efficiently in low-level, continuous state spaces with temporally abstract actions (or skills), by constructing abstract representations of the problem suitable for task-level planning.The central question this effort poses is which abstract representations are required to express and evaluate plans composed of sequences of skills. We show that classifiers can be used as a symbolic representation system, and that the ability to represent the preconditions and effects of an agent's skills is both necessary and sufficient for task-level planning.The resulting representations allow a reinforcement learning agent to acquire a symbolic representation appropriate for planning from experience. | en_US |
dc.language.iso | en_US | |
dc.publisher | American Association for the Advancement of Science (AAAS) | en_US |
dc.relation.isversionof | http://www.aaai.org/ocs/index.php/WS/AAAIW13/paper/view/7147 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | MIT web domain | en_US |
dc.title | Symbol acquisition for task-level planning | en_US |
dc.type | Article | en_US |
dc.identifier.citation | KONIDARIS, G.; KAELBLING, L.; LOZANO-PEREZ, T. Symbol Acquisition for Task-Level Planning. AAAI Workshops, North America, jun. 2013. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.contributor.mitauthor | Konidaris, George | en_US |
dc.contributor.mitauthor | Kaelbling, Leslie P. | en_US |
dc.contributor.mitauthor | Lozano-Perez, Tomas | en_US |
dc.relation.journal | Workshops at the Twenty-Seventh AAAI Conference on Artificial Intelligence | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dspace.orderedauthors | Konidaris, George; Kaelbling, Leslie P.; Lozano-Perez, Tomas | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-8657-2450 | |
dc.identifier.orcid | https://orcid.org/0000-0001-6054-7145 | |
mit.license | OPEN_ACCESS_POLICY | en_US |
mit.metadata.status | Complete | |