dc.contributor.author | Tsitsiklis, John N | |
dc.contributor.author | Mannor, Shie | |
dc.date.accessioned | 2017-04-13T12:59:13Z | |
dc.date.available | 2017-04-13T12:59:13Z | |
dc.date.issued | 2013-06 | |
dc.date.submitted | 2011-06 | |
dc.identifier.issn | 0377-2217 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/108091 | |
dc.description.abstract | We consider finite-horizon Markov decision processes under performance measures that involve both the mean and the variance of the cumulative reward. We show that either randomized or history-based policies can improve performance. We prove that computing a policy that maximizes the mean reward under a variance constraint is NP-hard in some cases and strongly NP-hard in others. Finally, we offer pseudopolynomial exact and approximation algorithms. | en_US |
dc.description.sponsorship | National Science Foundation (U.S.) (CMMI-0856063) | en_US |
dc.language.iso | en_US | |
dc.publisher | Elsevier | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1016/j.ejor.2013.06.019 | en_US |
dc.rights | Creative Commons Attribution-NonCommercial-NoDerivs License | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | en_US |
dc.source | Prof. Tsitsiklis via Chris Sherratt | en_US |
dc.title | Algorithmic aspects of mean–variance optimization in Markov decision processes | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Mannor, Shie and Tsitsiklis, John N. “Algorithmic Aspects of Mean–variance Optimization in Markov Decision Processes.” European Journal of Operational Research 231, no. 3 (December 2013): 645–653. © 2013 Elsevier B.V. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Laboratory for Information and Decision Systems | en_US |
dc.contributor.approver | Tsitsiklis, John | en_US |
dc.contributor.mitauthor | Tsitsiklis, John N | |
dc.contributor.mitauthor | Mannor, Shie | |
dc.relation.journal | European Journal of Operational Research | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/JournalArticle | en_US |
eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
dspace.orderedauthors | Mannor, Shie; Tsitsiklis, John N. | en_US |
dspace.embargo.terms | N | en_US |
dc.identifier.orcid | https://orcid.org/0000-0003-2658-8239 | |
mit.license | PUBLISHER_CC | en_US |
mit.metadata.status | Complete | |