Show simple item record

dc.contributor.authorBertsekas, Dimitri P.
dc.contributor.authorYu, Huizhen
dc.date.accessioned2011-06-02T16:37:30Z
dc.date.available2011-06-02T16:37:30Z
dc.date.issued2010-09
dc.identifier.urihttp://hdl.handle.net/1721.1/63169
dc.description.abstractWe consider the distributed solution of dynamic programming (DP) problems by policy iteration. We envision a network of processors, each updating asynchronously a local policy and a local cost function, defined on a portion of the state space. The computed values are communicated asynchronously between processors and are used to perform the local policy and cost updates. The natural algorithm of this type can fail even under favorable circumstances, as shown by Williams and Baird [WiB93]. We propose an alternative and almost as simple algorithm, which converges to the optimum under the most general conditions, including asynchronous updating by multiple processors using outdated local cost functions of other processors.en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (Grant ECCS-0801549)en_US
dc.description.sponsorshipUnited States. Dept. of the Air Force (Air Force Grant FA9550-10-1-0412)en_US
dc.description.sponsorshipLos Alamos National Laboratory. Information Science and Technology Instituteen_US
dc.description.sponsorshipAcademy of Finland (Grant 118653 (ALGODAN))en_US
dc.description.sponsorshipPASCAL Network of Excellence (IST-2002-506778)en_US
dc.language.isoen_US
dc.publisherUniversity of Illinois at Urbana-Champaignen_US
dc.relation.isversionofhttp://eprints.pascal-network.org/archive/00008065/en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike 3.0en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/en_US
dc.sourceMIT web domainen_US
dc.titleDistributed Asynchronous Policy Iteration in Dynamic Programmingen_US
dc.typeArticleen_US
dc.identifier.citationBertsekas, Dimitri P. and Huizhen Yu. “Distributed asynchronous policy iteration in dynamic programming.” 2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton). Monticello, IL, USA, 2010. 1368-1375.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.approverBertsekas, Dimitri P.
dc.contributor.mitauthorBertsekas, Dimitri P.
dc.relation.journalAllerton Conference on Communication, Control, and Computing. Proceedings, 48th, 2010en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
dspace.orderedauthorsBertsekas, Dimitri P.; Yu, Huizhen
dc.identifier.orcidhttps://orcid.org/0000-0001-6909-7208
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record