Segmentation Based Tracking for Aerial Robot Global
Localization in Unstructured Environments with Oblique
Monocular Camera Orientation

Shafferman, Hannah R.

dc.contributor.advisor	Ricard, Michael J.
dc.contributor.advisor	Nino, Jose A.
dc.contributor.advisor	How, Jonathan P.
dc.contributor.author	Shafferman, Hannah R.
dc.date.accessioned	2025-10-06T17:35:26Z
dc.date.available	2025-10-06T17:35:26Z
dc.date.issued	2025-05
dc.date.submitted	2025-06-23T14:45:13.807Z
dc.identifier.uri	https://hdl.handle.net/1721.1/162933
dc.description.abstract	In the field of robotics, there has been growing interest in multi-robot systems and their potential to improve the efficiency, scale, and reliability of tasks beyond what an individual robot can achieve. Global localization is a crucial task for autonomous robot navigation, particularly in the multi-agent scenario where robots need to localize within maps communicated by other agents. The scenario in which vehicles view their environments from the same perspective, or camera viewpoint, is well studied. However, when environments are mapped from different camera viewing angles, traditional methods fail to match visual features and thus fail to localize. The technical gap that this thesis addresses arises when autonomous vehicles within a team map the same environment from different viewpoints, specifically nadir and oblique camera orientations in an unstructured environment. Many existing visual place recognition (VPR) methods fail to match visual features that look different due to appearance, illumination, or viewpoint changes and thus fail to localize. In this thesis, we demonstrate the failure of previous work to generalize to an off-nadir camera angle and explore the benefits and challenges that arise with utilizing oblique imagery for visual feature detection and tracking. We propose a segmentation-based object tracking pipeline to improve tracking and environment mapping performance in this traditionally challenging scenario. Our approach consists of 1) a front-end auto-segmentation tracking pipeline followed by 2) a submap correspondence search, which exploits geometric consistencies between environment maps to align vehicle reference frames. We evaluate our approach on a challenging indoor, cluttered dataset and demonstrate a maximum precision 74% higher than traditional and learning-based baseline methods, with a map size 0.5% the size of the most memory-conservative traditional baseline method.
dc.publisher	Massachusetts Institute of Technology
dc.rights	In Copyright - Educational Use Permitted
dc.rights	Copyright retained by author(s)
dc.rights.uri	https://rightsstatements.org/page/InC-EDU/1.0/
dc.title	Segmentation Based Tracking for Aerial Robot Global Localization in Unstructured Environments with Oblique Monocular Camera Orientation
dc.type	Thesis
dc.description.degree	S.M.
dc.contributor.department	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
mit.thesis.degree	Master
thesis.degree.name	Master of Science in Aeronautics and Astronautics

Files in this item

Name: shafferman-hshaff-ms-aeroastro ...
Size: 9.677Mb
Format: PDF
Description: Thesis PDF

This item appears in the following Collection(s)

Graduate Theses
