| dc.contributor.author | Geramifard, Alborz | |
| dc.contributor.author | Redding, Joshua | |
| dc.contributor.author | Roy, Nicholas | |
| dc.contributor.author | How, Jonathan P. | |
| dc.date.accessioned | 2013-10-29T16:58:49Z | |
| dc.date.available | 2013-10-29T16:58:49Z | |
| dc.date.issued | 2011-06 | |
| dc.identifier.isbn | 978-1-4577-0081-1 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/81838 | |
| dc.description.abstract | Risk and reward are fundamental concepts in the cooperative control of unmanned systems. This paper focuses on a constructive relationship between a cooperative planner and a learner in order to mitigate the learning risk while boosting the asymptotic performance and safety of agent behavior. Our framework is an instance of the intelligent cooperative control architecture (iCCA) where a learner (Natural actor-critic, Sarsa) initially follows a “safe” policy generated by a cooperative planner (consensus-based bundle algorithm). The learner incrementally improves this baseline policy through interaction, while avoiding behaviors believed to be “risky”. This paper extends previous work toward the coupling of learning and cooperative control strategies in real-time stochastic domains in two ways: (1) the risk analysis module supports stochastic risk models, and (2) learning schemes that do not store the policy as a separate entity are integrated with the cooperative planner, extending the applicability of the iCCA framework. The performance of the resulting approaches is demonstrated through simulation of limited-fuel UAVs in a stochastic task assignment problem. Results show an 8% reduction in risk, while improving performance by up to 30%. | en_US |
| dc.description.sponsorship | United States. Air Force Office of Scientific Research (Grant FA9550-09-1-0522) | en_US |
| dc.description.sponsorship | Boeing Scientific Research Laboratories | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | en_US |
| dc.relation.isversionof | http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5991309 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike 3.0 | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ | en_US |
| dc.source | MIT web domain | en_US |
| dc.title | UAV Cooperative Control with Stochastic Risk Models | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Geramifard, Alborz et al. "UAV Cooperative Control with Stochastic Risk Models." IEEE American Control Conference, 2011. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Aerospace Controls Laboratory | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Laboratory for Information and Decision Systems | en_US |
| dc.contributor.mitauthor | Geramifard, Alborz | en_US |
| dc.contributor.mitauthor | Redding, Joshua | en_US |
| dc.contributor.mitauthor | Roy, Nicholas | en_US |
| dc.contributor.mitauthor | How, Jonathan P. | en_US |
| dc.relation.journal | Proceedings of the 2011 American Control Conference | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dspace.orderedauthors | Geramifard, Alborz; Redding, Joshua; Roy, Nicholas; How, Jonathan P. | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0002-2508-1957 | |
| dc.identifier.orcid | https://orcid.org/0000-0001-8576-1930 | |
| dc.identifier.orcid | https://orcid.org/0000-0002-8293-0492 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |
| mit.metadata.status | Complete | |