Toward robust deep stereo networks : uncertainty learning, novelty detection, and online adaptation

Knowles, Milo(Milo Henry Lovelace)

Author(s)

Knowles, Milo(Milo Henry Lovelace)

Download1227276364-MIT.pdf (42.50Mb)

Other Contributors

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.

Advisor

Nicholas Roy.

Terms of use

MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. http://dspace.mit.edu/handle/1721.1/7582

Metadata

Show full item record

Abstract

Although deep learning continues to improve the state-of-the-art for stereo depth estimation on benchmark computer vision datasets, there are many remaining challenges in making these algorithms robust enough for deployment in real-world applications where reliability is critical [1, 2]. First, most deep stereo networks do not provide a measure of their own uncertainty, so we do not know when the network's depth estimates are reliable enough to use for other tasks such as planning, mapping, or localization [1]. Second, recent work has demonstrated that the accuracy of deep stereo networks can degrade considerably in novel environments that differ from those of the training set [3, 4, 5]. These failures due to domain shift make it difficult to guarantee performance in real-world applications where the visual environment may change unpredictably. On their own, existing deep stereo networks do not provide a principled way to detect and mitigate these domain shift effects [1]. This thesis makes three contributions towards creating deep stereo networks that are safer to deploy in novel environments. First, we experimentally demonstrate that training sets with predictable geometry encourage the network to learn brittle priors that do not generalize to new environments. We show that the network architecture and training set can be designed to minimize overfitting and improve generalization. Second, we train our deep stereo network to estimate aleatoric uncertainty. We demonstrate that the uncertainty predictions are well-calibrated with empirical error, and allow us to reason about the reliability of our model. Third, we show that the learned model of aleatoric uncertainty is useful for both detecting images from novel environments, and mitigating domain shift through online adaptation

Description

Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2020

Cataloged from student-submitted PDF of thesis.

Includes bibliographical references (pages 87-93).

Date issued

2020

URI

https://hdl.handle.net/1721.1/129167

Department

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science

Publisher

Massachusetts Institute of Technology

Keywords

Electrical Engineering and Computer Science.

Collections

Graduate Theses