Safe Reinforcement Learning With Model Uncertainty Estimates

Lutjens, Bjorn; Everett, Michael F; How, Jonathan P

dc.contributor.author	Lutjens, Bjorn
dc.contributor.author	Everett, Michael F
dc.contributor.author	How, Jonathan P
dc.date.accessioned	2020-05-27T13:08:41Z
dc.date.available	2020-05-27T13:08:41Z
dc.date.issued	2019-08
dc.identifier.isbn	9781538660270
dc.identifier.isbn	978-1-5386-6026-3
dc.identifier.uri	https://hdl.handle.net/1721.1/125488
dc.description.abstract	Many current autonomous systems are being designed with a strong reliance on black box predictions from deep neural networks (DNNs). However, DNNs tend to be overconfident in predictions on unseen data and can give unpredictable results for far-from-distribution test data. The importance of predictions that are robust to this distributional shift is evident for safety-critical applications, such as collision avoidance around pedestrians. Measures of model uncertainty can be used to identify unseen data, but the state-of-the-art extraction methods such as Bayesian neural networks are mostly intractable to compute. This paper uses MC-Dropout and Bootstrapping to give computationally tractable and parallelizable uncertainty estimates. The methods are embedded in a Safe Reinforcement Learning framework to form uncertainty-aware navigation around pedestrians. The result is a collision avoidance policy that knows what it does not know and cautiously avoids pedestrians that exhibit unseen behavior. The policy is demonstrated in simulation to be more robust to novel observations and take safer actions than an uncertainty-unaware baseline. Keywords: Uncertainty; Collision avoidance; Neural networks; Computational modeling; Training; Data models; Reinforcement learning	en_US
dc.language.iso	en
dc.publisher	IEEE	en_US
dc.relation.isversionof	https://dx.doi.org/10.1109/icra.2019.8793611	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	arXiv	en_US
dc.title	Safe Reinforcement Learning With Model Uncertainty Estimates	en_US
dc.type	Article	en_US
dc.identifier.citation	Lutjens, Bjorn, Everett, Michael, and How, Jonathan P., "Safe Reinforcement Learning With Model Uncertainty Estimates." 2019 International Conference on Robotics and Automation (ICRA), May 2019, Montreal, Canada, IEEE, August 2019.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Aerospace Controls Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics	en_US
dc.relation.journal	2019 International Conference on Robotics and Automation	en_US
dc.eprint.version	Original manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2019-10-28T17:45:18Z
dspace.date.submission	2019-10-28T17:45:26Z
mit.metadata.status	Complete

Files in this item

Name:: 1810.08700.pdf
Size:: 2.076Mb
Format:: PDF
Description:: Submitted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record