Show simple item record

dc.contributor.authorMastin, Andrew
dc.contributor.authorJaillet, Patrick
dc.date.accessioned2015-12-18T15:07:17Z
dc.date.available2015-12-18T15:07:17Z
dc.date.issued2014-07
dc.date.submitted2013-03
dc.identifier.issn0022-3239
dc.identifier.issn1573-2878
dc.identifier.urihttp://hdl.handle.net/1721.1/100430
dc.description.abstractRollout algorithms have demonstrated excellent performance on a variety of dynamic and discrete optimization problems. Interpreted as an approximate dynamic programming algorithm, a rollout algorithm estimates the value-to-go at each decision stage by simulating future events while following a heuristic policy, referred to as the base policy. While in many cases rollout algorithms are guaranteed to perform as well as their base policies, there have been few theoretical results showing additional improvement in performance. In this paper, we perform a probabilistic analysis of the subset sum problem and 0–1 knapsack problem, giving theoretical evidence that rollout algorithms perform strictly better than their base policies. Using a stochastic model from the existing literature, we analyze two rollout methods that we refer to as the exhaustive rollout and consecutive rollout, both of which employ a simple greedy base policy. We prove that both methods yield a significant improvement in expected performance after a single iteration of the rollout algorithm, relative to the base policy.en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (Grant 1029603)en_US
dc.description.sponsorshipUnited States. Office of Naval Research (Grant N00014-12-1-0033)en_US
dc.description.sponsorshipNational Science Foundation (U.S.). Graduate Research Fellowshipen_US
dc.language.isoen_US
dc.publisherSpringer-Verlagen_US
dc.relation.isversionofhttp://dx.doi.org/10.1007/s10957-014-0603-xen_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleAverage-Case Performance of Rollout Algorithms for Knapsack Problemsen_US
dc.typeArticleen_US
dc.identifier.citationMastin, Andrew, and Patrick Jaillet. “Average-Case Performance of Rollout Algorithms for Knapsack Problems.” Journal of Optimization Theory and Applications 165, no. 3 (July 26, 2014): 964–984.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.departmentMassachusetts Institute of Technology. Laboratory for Information and Decision Systemsen_US
dc.contributor.mitauthorMastin, Andrewen_US
dc.contributor.mitauthorJaillet, Patricken_US
dc.relation.journalJournal of Optimization Theory and Applicationsen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsMastin, Andrew; Jaillet, Patricken_US
dc.identifier.orcidhttps://orcid.org/0000-0002-8585-6566
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record