MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Toward robust deep stereo networks : uncertainty learning, novelty detection, and online adaptation

Author(s)
Knowles, Milo(Milo Henry Lovelace)
Thumbnail
Download1227276364-MIT.pdf (42.50Mb)
Other Contributors
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.
Advisor
Nicholas Roy.
Terms of use
MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. http://dspace.mit.edu/handle/1721.1/7582
Metadata
Show full item record
Abstract
Although deep learning continues to improve the state-of-the-art for stereo depth estimation on benchmark computer vision datasets, there are many remaining challenges in making these algorithms robust enough for deployment in real-world applications where reliability is critical [1, 2]. First, most deep stereo networks do not provide a measure of their own uncertainty, so we do not know when the network's depth estimates are reliable enough to use for other tasks such as planning, mapping, or localization [1]. Second, recent work has demonstrated that the accuracy of deep stereo networks can degrade considerably in novel environments that differ from those of the training set [3, 4, 5]. These failures due to domain shift make it difficult to guarantee performance in real-world applications where the visual environment may change unpredictably. On their own, existing deep stereo networks do not provide a principled way to detect and mitigate these domain shift effects [1]. This thesis makes three contributions towards creating deep stereo networks that are safer to deploy in novel environments. First, we experimentally demonstrate that training sets with predictable geometry encourage the network to learn brittle priors that do not generalize to new environments. We show that the network architecture and training set can be designed to minimize overfitting and improve generalization. Second, we train our deep stereo network to estimate aleatoric uncertainty. We demonstrate that the uncertainty predictions are well-calibrated with empirical error, and allow us to reason about the reliability of our model. Third, we show that the learned model of aleatoric uncertainty is useful for both detecting images from novel environments, and mitigating domain shift through online adaptation
Description
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2020
 
Cataloged from student-submitted PDF of thesis.
 
Includes bibliographical references (pages 87-93).
 
Date issued
2020
URI
https://hdl.handle.net/1721.1/129167
Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Publisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.

Collections
  • Graduate Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.