Adaptive Envelope MDPs for Relational Equivalence-based Planning

Gardiol, Natalia H.; Kaelbling, Leslie Pack

dc.contributor.advisor	Leslie Kaelbling	en_US
dc.contributor.author	Gardiol, Natalia H.	en_US
dc.contributor.author	Kaelbling, Leslie Pack	en_US
dc.contributor.other	Learning and Intelligent Systems	en_US
dc.date.accessioned	2008-08-01T21:30:16Z
dc.date.available	2008-08-01T21:30:16Z
dc.date.issued	2008-07-29	en_US
dc.identifier.other	MIT-CSAIL-TR-2008-050	en_US
dc.identifier.uri	http://hdl.handle.net/1721.1/41920
dc.description.abstract	We describe a method to use structured representations of the environmentâ€™s dynamics to constrain and speed up the planning process. Given a problem domain described in a probabilistic logical description language, we develop an anytime technique that incrementally improves on an initial, partial policy. This partial solution is found by ï¬rst reducing the number of predicates needed to represent a relaxed version of the problem to a minimum, and then dynamically partitioning the action space into a set of equivalence classes with respect to this minimal representation. Our approach uses the envelope MDP framework, which creates a Markov decision process out of a subset of the full state space as de- termined by the initial partial solution. This strategy permits an agent to begin acting within a restricted part of the full state space and to expand its envelope judiciously as resources permit.	en_US
dc.format.extent	17 p.	en_US
dc.relation	Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory	en_US
dc.relation		en_US
dc.title	Adaptive Envelope MDPs for Relational Equivalence-based Planning	en_US

Files in this item

Name:: MIT-CSAIL-TR-2008-050.pdf
Size:: 695.0Kb
Format:: PDF

View/Open

Name:: MIT-CSAIL-TR-2008-050.ps
Size:: 72.13Kb
Format:: Postscript

View/Open

This item appears in the following Collection(s)

CSAIL Technical Reports (July 1, 2003 - present)

Show simple item record