Show simple item record

dc.contributor.authorGreene, W. Nicholas (William Nicholas)
dc.contributor.authorRoy, Nicholas
dc.date.accessioned2021-11-03T20:03:07Z
dc.date.available2021-11-03T20:03:07Z
dc.date.issued2020-09
dc.identifier.urihttps://hdl.handle.net/1721.1/137312
dc.description.abstract© 2020 IEEE. We propose an efficient method for monocular simultaneous localization and mapping (SLAM) that is capable of estimating metrically-scaled motion without additional sensors or hardware acceleration by integrating metric depth predictions from a neural network into a geometric SLAM factor graph. Unlike learned end-to-end SLAM systems, ours does not ignore the relative geometry directly observable in the images. Unlike existing learned depth estimation approaches, ours leverages the insight that when used to estimate scale, learned depth predictions need only be coarse in image space. This allows us to shrink our network to the point that performing inference on a standard CPU becomes computationally tractable.We make several improvements to our network architecture and training procedure to address the lack of depth observability when using coarse images, which allows us to estimate spatially coarse, but depth-accurate predictions in only 30 ms per frame without GPU acceleration. At runtime we incorporate the learned metric data as unary scale factors in a Sim(3) pose graph. Our method is able to generate accurate, scaled poses without additional sensors, hardware accelerators, or special maneuvers and does not ignore or corrupt the observable epipolar geometry. We show compelling results on the KITTI benchmark dataset in addition to real-world experiments with a handheld camera.en_US
dc.description.sponsorshipNSF (Grant 1122374)en_US
dc.description.sponsorshipArmy Research Laboratory (Contract W911NF-17-2-0181)en_US
dc.language.isoen
dc.publisherIEEEen_US
dc.relation.isversionofhttp://dx.doi.org/10.1109/ICRA40945.2020.9196900en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleMetrically-Scaled Monocular SLAM using Learned Scale Factorsen_US
dc.typeArticleen_US
dc.identifier.citationGreene, W. Nicholas (William Nicholas) and Roy, Nicholas. 2020. "Metrically-Scaled Monocular SLAM using Learned Scale Factors." Proceedings - IEEE International Conference on Robotics and Automation.
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.relation.journalProceedings - IEEE International Conference on Robotics and Automationen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2021-05-03T18:45:15Z
dspace.orderedauthorsGreene, WN; Roy, Nen_US
dspace.date.submission2021-05-03T18:45:16Z
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusPublication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record