| dc.contributor.author | Bertsekas, Dimitri P. | |
| dc.contributor.author | Yu, Huizhen | |
| dc.date.accessioned | 2011-06-02T16:37:30Z | |
| dc.date.available | 2011-06-02T16:37:30Z | |
| dc.date.issued | 2010-09 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/63169 | |
| dc.description.abstract | We consider the distributed solution of dynamic
programming (DP) problems by policy iteration. We envision
a network of processors, each updating asynchronously a local
policy and a local cost function, defined on a portion of the state
space. The computed values are communicated asynchronously
between processors and are used to perform the local policy
and cost updates. The natural algorithm of this type can fail
even under favorable circumstances, as shown by Williams
and Baird [WiB93]. We propose an alternative and almost as
simple algorithm, which converges to the optimum under the
most general conditions, including asynchronous updating by
multiple processors using outdated local cost functions of other
processors. | en_US |
| dc.description.sponsorship | National Science Foundation (U.S.) (Grant ECCS-0801549) | en_US |
| dc.description.sponsorship | United States. Dept. of the Air Force (Air Force Grant FA9550-10-1-0412) | en_US |
| dc.description.sponsorship | Los Alamos National Laboratory. Information Science and Technology Institute | en_US |
| dc.description.sponsorship | Academy of Finland (Grant 118653 (ALGODAN)) | en_US |
| dc.description.sponsorship | PASCAL Network of Excellence (IST-2002-506778) | en_US |
| dc.language.iso | en_US | |
| dc.publisher | University of Illinois at Urbana-Champaign | en_US |
| dc.relation.isversionof | http://eprints.pascal-network.org/archive/00008065/ | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike 3.0 | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ | en_US |
| dc.source | MIT web domain | en_US |
| dc.title | Distributed Asynchronous Policy Iteration in Dynamic Programming | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Bertsekas, Dimitri P. and Huizhen Yu. “Distributed asynchronous policy iteration in dynamic programming.” 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton). Monticello, IL, USA, 2010. 1368-1375. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.contributor.approver | Bertsekas, Dimitri P. | |
| dc.contributor.mitauthor | Bertsekas, Dimitri P. | |
| dc.relation.journal | Allerton Conference on Communication, Control, and Computing. Proceedings, 48th, 2010 | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| dspace.orderedauthors | Bertsekas, Dimitri P.; Yu, Huizhen | |
| dc.identifier.orcid | https://orcid.org/0000-0001-6909-7208 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |
| mit.metadata.status | Complete | |