Show simple item record

dc.contributor.author: Lutjens, Bjorn
dc.contributor.author: Everett, Michael F
dc.contributor.author: How, Jonathan P
dc.date.accessioned: 2020-05-27T13:08:41Z
dc.date.available: 2020-05-27T13:08:41Z
dc.date.issued: 2019-08
dc.identifier.isbn: 9781538660270
dc.identifier.isbn: 978-1-5386-6026-3
dc.identifier.uri: https://hdl.handle.net/1721.1/125488
dc.description.abstract: Many current autonomous systems are being designed with a strong reliance on black box predictions from deep neural networks (DNNs). However, DNNs tend to be overconfident in predictions on unseen data and can give unpredictable results for far-from-distribution test data. The importance of predictions that are robust to this distributional shift is evident for safety-critical applications, such as collision avoidance around pedestrians. Measures of model uncertainty can be used to identify unseen data, but the state-of-the-art extraction methods such as Bayesian neural networks are mostly intractable to compute. This paper uses MC-Dropout and Bootstrapping to give computationally tractable and parallelizable uncertainty estimates. The methods are embedded in a Safe Reinforcement Learning framework to form uncertainty-aware navigation around pedestrians. The result is a collision avoidance policy that knows what it does not know and cautiously avoids pedestrians that exhibit unseen behavior. The policy is demonstrated in simulation to be more robust to novel observations and take safer actions than an uncertainty-unaware baseline. Keywords: Uncertainty; Collision avoidance; Neural networks; Computational modeling; Training; Data models; Reinforcement learning [en_US]
dc.language.iso: en
dc.publisher: IEEE [en_US]
dc.relation.isversionof: https://dx.doi.org/10.1109/icra.2019.8793611 [en_US]
dc.rights: Creative Commons Attribution-Noncommercial-Share Alike [en_US]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/4.0/ [en_US]
dc.source: arXiv [en_US]
dc.title: Safe Reinforcement Learning With Model Uncertainty Estimates [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Lutjens, Bjorn, Everett, Michael, and How, Jonathan P., "Safe Reinforcement Learning With Model Uncertainty Estimates." 2019 International Conference on Robotics and Automation (ICRA), May 2019, Montreal, Canada, IEEE, August 2019. [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Aerospace Controls Laboratory [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Aeronautics and Astronautics [en_US]
dc.relation.journal: 2019 International Conference on Robotics and Automation [en_US]
dc.eprint.version: Original manuscript [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
eprint.status: http://purl.org/eprint/status/NonPeerReviewed [en_US]
dc.date.updated: 2019-10-28T17:45:18Z
dspace.date.submission: 2019-10-28T17:45:26Z
mit.metadata.status: Complete

