Mdp Optimal Control under Temporal Logic Constraints
Author(s)
Ding, Xu Chu; Smith, Stephen L.; Belta, Calin; Rus, Daniela L.
DownloadRus_MDP optimal control.pdf (446.3Kb)
OPEN_ACCESS_POLICY
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
In this paper, we develop a method to automatically generate a control policy for a dynamical system modeled as a Markov Decision Process (MDP). The control specification is given as a Linear Temporal Logic (LTL) formula over a set of propositions defined on the states of the MDP. We synthesize a control policy such that the MDP satisfies the given specification almost surely, if such a policy exists. In addition, we designate an “optimizing proposition” to be repeatedly satisfied, and we formulate a novel optimization criterion in terms of minimizing the expected cost in between satisfactions of this proposition. We propose a sufficient condition for a policy to be optimal, and develop a dynamic programming algorithm that synthesizes a policy that is optimal under some conditions, and sub-optimal otherwise. This problem is motivated by robotic applications requiring persistent tasks, such as environmental monitoring or data gathering, to be performed.
Date issued
2011-12Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. School of EngineeringJournal
50th IEEE Conference on Decision and Control and European Control Conference 2011 (CDC-ECC)
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Citation
Ding, Xu Chu et al. “MDP Optimal Control Under Temporal Logic Constraints.” 50th IEEE Conference on Decision and Control and European Control Conference 2011 (CDC-ECC). 532–538.
Version: Author's final manuscript
ISBN
978-1-61284-799-3
978-1-61284-800-6
ISSN
0743-1546