dc.contributor.advisor | Luca Carlone. | en_US |
dc.contributor.author | Gupta, Arjun(Arjun R.) | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. | en_US |
dc.date.accessioned | 2020-09-15T21:56:05Z | |
dc.date.available | 2020-09-15T21:56:05Z | |
dc.date.copyright | 2020 | en_US |
dc.date.issued | 2020 | en_US |
dc.identifier.uri | https://hdl.handle.net/1721.1/127404 | |
dc.description | Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, May, 2020 | en_US |
dc.description | Cataloged from the official PDF of thesis. | en_US |
dc.description | Includes bibliographical references (pages 42-50). | en_US |
dc.description.abstract | Autonomous agents rely on accurate perception of the surrounding environment for robust operation. In some applications, perception errors due to model misspecifications or incorrect neural network predictions are inconsequential; however, in safety critical applications, like autonomous driving, one misprediction can have dire effects. There are several works that aim to characterize the robustness of perception offline, but there is a lack of tools to monitor the correctness of perception online during operation. In this thesis, we develop several algorithms to monitor the correctness of pedestrian detections and of 3D mesh models of the environment to enable autonomous agents to detect and react to inconsistencies in their world model. We start by developing a method to track humans in the context of a Simultaneous Localization and Mapping (SLAM) pipeline while monitoring the correctness of pedestrian localization via pose-graph optimization. We then move to the more fine-grained task of monitoring the pose and shape of detected humans from a single image. We develop several model-based approaches and a learning-based approach, the Adversarially Trained Online Monitor (ATOM). ATOM outperforms the model-based approaches and can be used to effectively flag perception errors for human shape and pose estimation. Finally, we investigate methods for monitoring 3D mesh models of the environment with face-level precision using several model-based methods and Face Error Network (FEN). | en_US |
dc.description.statementofresponsibility | by Arjun Gupta. | en_US |
dc.format.extent | 50 pages | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Electrical Engineering and Computer Science. | en_US |
dc.title | Learning-based online monitoring for robust robotic perception | en_US |
dc.type | Thesis | en_US |
dc.description.degree | M. Eng. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.identifier.oclc | 1192555157 | en_US |
dc.description.collection | M.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science | en_US |
dspace.imported | 2020-09-15T21:56:05Z | en_US |
mit.thesis.degree | Master | en_US |
mit.thesis.department | EECS | en_US |