MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Doctoral Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Doctoral Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Geometrically and Temporally Consistent Robot Perception

Author(s)
Lin, Muyuan
Thumbnail
DownloadThesis PDF (37.71Mb)
Advisor
Karaman, Sertac
Terms of use
In Copyright - Educational Use Permitted Copyright MIT http://rightsstatements.org/page/InC-EDU/1.0/
Metadata
Show full item record
Abstract
Perception algorithms rely on the capability to reconcile an ideal mathematical model of an object with imperfect data. Despite having been investigated extensively, robot perception remains challenging with spurious outliers, limited training data, and computational constraints. To this end, we investigate geometric and temporal consistency for modeling and inference in perception problems. In this dissertation, we begin our work with the development of an end-to-end neural network model for 6D object pose estimation. The model applies a 3D fully-convolutional network to extract geometric features and enforces pairwise consistency of features via spectral convolution on a compatibility graph. We then develop a graph-theoretic framework for the hypothesis pruning problem. Specifically, we provide a planted clique perspective which draws the connection between a statistical model and robust estimation. This perspective leads to the design of a learning heuristic that is efficient and generalizable. Finally, we leverage temporal consistency for the 6D pose tracking of unknown objects in a video sequence. Our algorithm estimates the optical flow to produce temporally stable motion propagation, and optimizes scene structures and 6D object pose jointly. Instead of directly propagating pose estimation using frame-to-frame correlation, we register a target object in a new frame with a globally consistent model of the scene. Our method is shown to be efficient and accurate for 6D pose tracking.
Date issued
2022-09
URI
https://hdl.handle.net/1721.1/147325
Department
Massachusetts Institute of Technology. Department of Mechanical Engineering
Publisher
Massachusetts Institute of Technology

Collections
  • Doctoral Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.