Distributed Asynchronous Policy Iteration in Dynamic Programming

Bertsekas, Dimitri P.; Yu, Huizhen

dc.contributor.author	Bertsekas, Dimitri P.
dc.contributor.author	Yu, Huizhen
dc.date.accessioned	2011-06-02T16:37:30Z
dc.date.available	2011-06-02T16:37:30Z
dc.date.issued	2010-09
dc.identifier.uri	http://hdl.handle.net/1721.1/63169
dc.description.abstract	We consider the distributed solution of dynamic programming (DP) problems by policy iteration. We envision a network of processors, each updating asynchronously a local policy and a local cost function, defined on a portion of the state space. The computed values are communicated asynchronously between processors and are used to perform the local policy and cost updates. The natural algorithm of this type can fail even under favorable circumstances, as shown by Williams and Baird [WiB93]. We propose an alternative and almost as simple algorithm, which converges to the optimum under the most general conditions, including asynchronous updating by multiple processors using outdated local cost functions of other processors.	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (Grant ECCS-0801549)	en_US
dc.description.sponsorship	United States. Dept. of the Air Force (Air Force Grant FA9550-10-1-0412)	en_US
dc.description.sponsorship	Los Alamos National Laboratory. Information Science and Technology Institute	en_US
dc.description.sponsorship	Academy of Finland (Grant 118653 (ALGODAN))	en_US
dc.description.sponsorship	PASCAL Network of Excellence (IST-2002-506778)	en_US
dc.language.iso	en_US
dc.publisher	University of Illinois at Urbana-Champaign	en_US
dc.relation.isversionof	http://eprints.pascal-network.org/archive/00008065/	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike 3.0	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/	en_US
dc.source	MIT web domain	en_US
dc.title	Distributed Asynchronous Policy Iteration in Dynamic Programming	en_US
dc.type	Article	en_US
dc.identifier.citation	Bertsekas, Dimitri P. and Huizhen Yu. “Distributed asynchronous policy iteration in dynamic programming.” 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton). Monticello, IL, USA, 2010. 1368-1375.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.approver	Bertsekas, Dimitri P.
dc.contributor.mitauthor	Bertsekas, Dimitri P.
dc.relation.journal	Allerton Conference on Communication, Control, and Computing. Proceedings, 48th, 2010	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
dspace.orderedauthors	Bertsekas, Dimitri P.; Yu, Huizhen
dc.identifier.orcid	https://orcid.org/0000-0001-6909-7208
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name: Bertsekas_Distributed asynchro ...
Size: 251.1Kb
Format: PDF

This item appears in the following Collection(s)

MIT Open Access Articles
