Average-Case Performance of Rollout Algorithms for Knapsack Problems

Mastin, Andrew; Jaillet, Patrick

dc.contributor.author	Mastin, Andrew
dc.contributor.author	Jaillet, Patrick
dc.date.accessioned	2015-12-18T15:07:17Z
dc.date.available	2015-12-18T15:07:17Z
dc.date.issued	2014-07
dc.date.submitted	2013-03
dc.identifier.issn	0022-3239
dc.identifier.issn	1573-2878
dc.identifier.uri	http://hdl.handle.net/1721.1/100430
dc.description.abstract	Rollout algorithms have demonstrated excellent performance on a variety of dynamic and discrete optimization problems. Interpreted as an approximate dynamic programming algorithm, a rollout algorithm estimates the value-to-go at each decision stage by simulating future events while following a heuristic policy, referred to as the base policy. While in many cases rollout algorithms are guaranteed to perform as well as their base policies, there have been few theoretical results showing additional improvement in performance. In this paper, we perform a probabilistic analysis of the subset sum problem and 0–1 knapsack problem, giving theoretical evidence that rollout algorithms perform strictly better than their base policies. Using a stochastic model from the existing literature, we analyze two rollout methods that we refer to as the exhaustive rollout and consecutive rollout, both of which employ a simple greedy base policy. We prove that both methods yield a significant improvement in expected performance after a single iteration of the rollout algorithm, relative to the base policy.	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (Grant 1029603)	en_US
dc.description.sponsorship	United States. Office of Naval Research (Grant N00014-12-1-0033)	en_US
dc.description.sponsorship	National Science Foundation (U.S.). Graduate Research Fellowship	en_US
dc.language.iso	en_US
dc.publisher	Springer-Verlag	en_US
dc.relation.isversionof	http://dx.doi.org/10.1007/s10957-014-0603-x	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	MIT web domain	en_US
dc.title	Average-Case Performance of Rollout Algorithms for Knapsack Problems	en_US
dc.type	Article	en_US
dc.identifier.citation	Mastin, Andrew, and Patrick Jaillet. “Average-Case Performance of Rollout Algorithms for Knapsack Problems.” Journal of Optimization Theory and Applications 165, no. 3 (July 26, 2014): 964–984.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.department	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems	en_US
dc.contributor.mitauthor	Mastin, Andrew	en_US
dc.contributor.mitauthor	Jaillet, Patrick	en_US
dc.relation.journal	Journal of Optimization Theory and Applications	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dspace.orderedauthors	Mastin, Andrew; Jaillet, Patrick	en_US
dc.identifier.orcid	https://orcid.org/0000-0002-8585-6566
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete
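
The abstract above describes a rollout algorithm as one that evaluates each candidate decision by completing the solution with a base policy. The following is a minimal sketch of that idea for the subset sum problem, not the authors' implementation. It assumes a greedy base policy that adds items in their given order and stops at the first item that does not fit; the abstract does not spell out this definition, and the names `greedy_fill` and `rollout_first_step`, as well as the example data, are hypothetical.

```python
from typing import Sequence


def greedy_fill(weights: Sequence[float], capacity: float) -> float:
    """Assumed base policy: add items in order, stop at the first that does not fit."""
    total = 0
    for w in weights:
        if total + w > capacity:
            break
        total += w
    return total


def rollout_first_step(weights: Sequence[float], capacity: float) -> float:
    """One rollout iteration (illustrative reading, not the paper's exact method):
    try each feasible item as the first choice, complete the solution with the
    base policy on the remaining items, and keep the best total weight."""
    # Starting from the base policy's value guarantees the result is never worse.
    best = greedy_fill(weights, capacity)
    for i, w in enumerate(weights):
        if w > capacity:
            continue  # item cannot be packed at all
        rest = list(weights[:i]) + list(weights[i + 1:])
        best = max(best, w + greedy_fill(rest, capacity - w))
    return best


# Hypothetical instance: integer weights, knapsack capacity 10.
weights = [5, 6, 4, 1]
capacity = 10
base_value = greedy_fill(weights, capacity)
rollout_value = rollout_first_step(weights, capacity)
print(base_value, rollout_value)
```

On this instance the base policy packs only the first item, while one rollout step finds a fuller packing by choosing a different first item. This illustrates, on a single example, the kind of improvement the paper analyzes in expectation.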

Files in this item

Name: Jaillet_Average-case.pdf
Size: 615.9Kb
Format: PDF

This item appears in the following Collection(s)

MIT Open Access Articles
