MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Learning-based online monitoring for robust robotic perception

Author(s)
Gupta, Arjun(Arjun R.)
Thumbnail
Download1192555157-MIT.pdf (9.766Mb)
Other Contributors
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.
Advisor
Luca Carlone.
Terms of use
MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. http://dspace.mit.edu/handle/1721.1/7582
Metadata
Show full item record
Abstract
Autonomous agents rely on accurate perception of the surrounding environment for robust operation. In some applications, perception errors due to model misspecifications or incorrect neural network predictions are inconsequential; however, in safety critical applications, like autonomous driving, one misprediction can have dire effects. There are several works that aim to characterize the robustness of perception offline, but there is a lack of tools to monitor the correctness of perception online during operation. In this thesis, we develop several algorithms to monitor the correctness of pedestrian detections and of 3D mesh models of the environment to enable autonomous agents to detect and react to inconsistencies in their world model. We start by developing a method to track humans in the context of a Simultaneous Localization and Mapping (SLAM) pipeline while monitoring the correctness of pedestrian localization via pose-graph optimization. We then move to the more fine-grained task of monitoring the pose and shape of detected humans from a single image. We develop several model-based approaches and a learning-based approach, the Adversarially Trained Online Monitor (ATOM). ATOM outperforms the model-based approaches and can be used to effectively flag perception errors for human shape and pose estimation. Finally, we investigate methods for monitoring 3D mesh models of the environment with face-level precision using several model-based methods and Face Error Network (FEN).
Description
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, May, 2020
 
Cataloged from the official PDF of thesis.
 
Includes bibliographical references (pages 42-50).
 
Date issued
2020
URI
https://hdl.handle.net/1721.1/127404
Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Publisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.

Collections
  • Graduate Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.