Deep learning-based approaches for depth and 6-DoF pose estimation

Lin, Muyuan(Scientist in mechanical engineering)Massachusetts Institute of Technology.

Author(s)

Lin, Muyuan(Scientist in mechanical engineering)Massachusetts Institute of Technology.

Download1200048688-MIT.pdf (7.091Mb)

Other Contributors

Massachusetts Institute of Technology. Department of Mechanical Engineering.

Advisor

Sertac Karaman.

Terms of use

MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. http://dspace.mit.edu/handle/1721.1/7582

Metadata

Show full item record

Abstract

In this thesis, we investigated two important geometric vision problems, namely, depth estimation from a single RGB image, and 6-DoF object pose estimation from a partial point cloud. Geometric vision problems are concerned with extracting information (e.g. depth, agent trajectory, 3D structure, 6-DoF pose of objects) of the scene from noisy sensor data (e.g. RGB images, LiDAR) by exploiting geometric constraints (e.g. epipolar constraint, rigid motion of objects). Deep learning framework has achieved impressive progress in many computer vision tasks such as image recognition and segmentation. However, applying deep learning-based approaches to geometric vision problems, which are particularly important in safety-critical robotics applications, remains an open problem. The main challenge lies in the fact that it is not straightforward to incorporate geometric constraints, arising from image formation process and physical properties, to optimization problems. To this end, we explore possibilities of enforcing such constraints either by decomposing a problem into two sub-problems each respecting desired constraints, or designing an estimator establishing relationship between intermediate representations and predicted outputs. We propose a deep learning-based approach for -each problem. Through extensive experiments, we show that our proposed approaches produce results comparable with state of the art on public datasets.

Description

Thesis: S.M., Massachusetts Institute of Technology, Department of Mechanical Engineering, 2020

Cataloged from PDF of thesis.

Includes bibliographical references (pages 67-79).

Date issued

2020

URI

https://hdl.handle.net/1721.1/128089

Department

Massachusetts Institute of Technology. Department of Mechanical Engineering

Publisher

Massachusetts Institute of Technology

Keywords

Mechanical Engineering.

Collections

Graduate Theses