Show simple item record

dc.contributor.authorGeramifard, Alborz
dc.contributor.authorRedding, Joshua
dc.contributor.authorRoy, Nicholas
dc.contributor.authorHow, Jonathan P.
dc.date.accessioned2013-10-29T16:58:49Z
dc.date.available2013-10-29T16:58:49Z
dc.date.issued2011-06
dc.identifier.isbn978-1-4577-0081-1
dc.identifier.urihttp://hdl.handle.net/1721.1/81838
dc.description.abstractRisk and reward are fundamental concepts in the cooperative control of unmanned systems. This paper focuses on a constructive relationship between a cooperative planner and a learner in order to mitigate the learning risk while boosting the asymptotic performance and safety of agent behavior. Our framework is an instance of the intelligent cooperative control architecture (iCCA) where a learner (Natural actor-critic, Sarsa) initially follows a “safe” policy generated by a cooperative planner (consensus-based bundle algorithm). The learner incrementally improves this baseline policy through interaction, while avoiding behaviors believed to be “risky”. This paper extends previous work toward the coupling of learning and cooperative control strategies in real-time stochastic domains in two ways: (1) the risk analysis module supports stochastic risk models, and (2) learning schemes that do not store the policy as a separate entity are integrated with the cooperative planner extending the applicability of iCCA framework. The performance of the resulting approaches are demonstrated through simulation of limited fuel UAVs in a stochastic task assignment problem. Results show an 8% reduction in risk, while improving the performance up to 30%.en_US
dc.description.sponsorshipUnited States. Air Force Office of Scientific Research (Grant FA9550-09-1-0522)en_US
dc.description.sponsorshipBoeing Scientific Research Laboratoriesen_US
dc.language.isoen_US
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)en_US
dc.relation.isversionofhttp://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5991309en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike 3.0en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/en_US
dc.sourceMIT web domainen_US
dc.titleUAV Cooperative Control with Stochastic Risk Modelsen_US
dc.typeArticleen_US
dc.identifier.citationGeramifard, Alborz et al. "UAV Cooperative Control with Stochastic Risk Models." IEEE American Control Conference, 2011.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Aerospace Controls Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Laboratory for Information and Decision Systemsen_US
dc.contributor.mitauthorGeramifard, Alborzen_US
dc.contributor.mitauthorRedding, Joshuaen_US
dc.contributor.mitauthorRoy, Nicholasen_US
dc.contributor.mitauthorHow, Jonathan P.en_US
dc.relation.journalProceedings of the 2011 American Control Conferenceen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsGeramifard, Alborz; Redding, Joshua; Roy, Nicholas; How, Jonathan P.en_US
dc.identifier.orcidhttps://orcid.org/0000-0002-2508-1957
dc.identifier.orcidhttps://orcid.org/0000-0001-8576-1930
dc.identifier.orcidhttps://orcid.org/0000-0002-8293-0492
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record