Solving uncertain MDPs with objectives that are separable over instantiations of model uncertainty

Adulyasak, Yossiri; Varakantham, Pradeep; Ahmed, Asrar; Jaillet, Patrick

dc.contributor.author	Adulyasak, Yossiri
dc.contributor.author	Varakantham, Pradeep
dc.contributor.author	Ahmed, Asrar
dc.contributor.author	Jaillet, Patrick
dc.date.accessioned	2018-06-12T13:26:47Z
dc.date.available	2018-06-12T13:26:47Z
dc.date.issued	2015-01
dc.identifier.isbn	ISBN:0-262-51129-0
dc.identifier.uri	http://hdl.handle.net/1721.1/116234
dc.description.abstract	Markov Decision Problems (MDPs) offer an effective mechanism for planning under uncertainty. However, due to unavoidable uncertainty over models, it is difficult to obtain an exact specification of an MDP. We are interested in solving MDPs where transition and reward functions are not exactly specified. Existing research has primarily focused on computing infinite-horizon stationary policies when optimizing robustness, regret, and percentile-based objectives. We focus specifically on finite-horizon problems, with a special emphasis on objectives that are separable over individual instantiations of model uncertainty (i.e., objectives that can be expressed as a sum over instantiations of model uncertainty): (a) First, we identify two separable objectives for uncertain MDPs: Average Value Maximization (AVM) and Confidence Probability Maximization (CPM). (b) Second, we provide optimization-based solutions to compute policies for uncertain MDPs with such objectives. In particular, we exploit the separability of the AVM and CPM objectives by employing Lagrangian dual decomposition (LDD). (c) Finally, we demonstrate the utility of the LDD approach on a benchmark problem from the literature.	en_US
dc.description.sponsorship	National Research Foundation of Singapore	en_US
dc.language.iso	en_US
dc.publisher	AAAI Press	en_US
dc.relation.isversionof	http://dl.acm.org/citation.cfm?id=2888196	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	MIT Web Domain	en_US
dc.title	Solving uncertain MDPs with objectives that are separable over instantiations of model uncertainty	en_US
dc.type	Article	en_US
dc.identifier.citation	Adulyasak, Yossiri, Pradeep Varakantham, Asrar Ahmed and Patrick Jaillet. "Solving uncertain MDPs with objectives that are separable over instantiations of model uncertainty." In Proceeding AAAI'15 Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Austin, Texas, January 25-30 2015, AAAI Press, ©2015, pp. 3454-3460.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Civil and Environmental Engineering	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.mitauthor	Adulyasak, Yossiri
dc.contributor.mitauthor	Jaillet, Patrick
dc.relation.journal	Proceeding AAAI'15 Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Adulyasak, Yossiri; Varakantham, Pradeep; Ahmed, Asrar; Jaillet, Patrick	en_US
dspace.embargo.terms	N	en_US
dc.identifier.orcid	https://orcid.org/0000-0002-8585-6566
mit.license	OPEN_ACCESS_POLICY	en_US

Files in this item

Name: Jaillet_Solving uncertain MDPS.pdf
Size: 419.7Kb
Format: PDF

This item appears in the following Collection(s)

MIT Open Access Articles
