dc.contributor.author | Redding, Joshua | |
dc.contributor.author | Geramifard, Alborz | |
dc.contributor.author | Choi, Han-Lim | |
dc.contributor.author | How, Jonathan P. | |
dc.date.accessioned | 2013-10-23T13:29:12Z | |
dc.date.available | 2013-10-23T13:29:12Z | |
dc.date.issued | 2010-08 | |
dc.identifier.isbn | 978-1-60086-962-4 | |
dc.identifier.issn | 1946-9802 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/81477 | |
dc.description.abstract | In this paper, we introduce a method for learning and adapting cooperative control strategies in real-time stochastic domains. Our framework is an instance of the intelligent cooperative control architecture (iCCA)[superscript 1]. The agent starts by following the "safe" plan calculated by the planning module and incrementally adapting its policy to maximize the cumulative rewards. Actor-critic and consensus-based bundle algorithm (CBBA) were employed as the building blocks of the iCCA framework. We demonstrate the performance of our approach by simulating limited fuel unmanned aerial vehicles aiming for stochastic targets. In one experiment where the optimal solution can be calculated, the integrated framework boosted the optimality of the solution by an average of %10, when compared to running each of the modules individually, while keeping the computational load within the requirements for real-time implementation. | en_US |
dc.description.sponsorship | Boeing Scientific Research Laboratories | en_US |
dc.description.sponsorship | United States. Air Force Office of Scientific Research (Grant FA9550-08-1-0086) | en_US |
dc.language.iso | en_US | |
dc.publisher | American Institute of Aeronautics and Astronautics | en_US |
dc.relation.isversionof | http://dx.doi.org/10.2514/6.2010-7586 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike 3.0 | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ | en_US |
dc.source | MIT web domain | en_US |
dc.title | Actor-Critic Policy Learning in Cooperative Planning | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Redding, Joshua, Alborz Geramifard, Han-Lim Choi, and Jonathan How. “Actor-Critic Policy Learning in Cooperative Planning.” In AIAA Guidance, Navigation, and Control Conference. American Institute of Aeronautics and Astronautics, 2010. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Aerospace Controls Laboratory | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Aeronautics and Astronautics | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Laboratory for Information and Decision Systems | en_US |
dc.contributor.mitauthor | Redding, Joshua | en_US |
dc.contributor.mitauthor | Geramifard, Alborz | en_US |
dc.contributor.mitauthor | Choi, Han-Lim | en_US |
dc.contributor.mitauthor | How, Jonathan P. | en_US |
dc.relation.journal | Proceedings of the AIAA Guidance, Navigation, and Control Conference | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dspace.orderedauthors | Redding, Joshua; Geramifard, Alborz; Choi, Han-Lim; How, Jonathan | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-2508-1957 | |
dc.identifier.orcid | https://orcid.org/0000-0001-8576-1930 | |
mit.license | OPEN_ACCESS_POLICY | en_US |
mit.metadata.status | Complete | |