Show simple item record

dc.contributor.authorShi, Jingnan
dc.contributor.authorYang, Heng
dc.contributor.authorCarlone, Luca
dc.date.accessioned2021-12-07T20:29:19Z
dc.date.available2021-12-07T16:02:15Z
dc.date.available2021-12-07T20:29:19Z
dc.date.issued2021
dc.identifier.urihttps://hdl.handle.net/1721.1/138354.2
dc.description.abstractWe consider a category-level perception problem, where one is given 3D sensor data picturing an object of a given category (e.g., a car), and has to reconstruct the pose and shape of the object despite intra-class variability (i.e., different car models have different shapes). We consider an active shape model, where —for an object category— we are given a library of potential CAD models describing objects in that category, and we adopt a standard formulation where pose and shape estimation are formulated as a non-convex optimization. Our first contribution is to provide the first certifiably optimal solver for pose and shape estimation. In particular, we show that rotation estimation can be decoupled from the estimation of the object translation and shape, and we demonstrate that (i) the optimal object rotation can be computed via a tight (small-size) semidefinite relaxation, and (ii) the translation and shape parameters can be computed in closed-form given the rotation. Our second contribution is to add an outlier rejection layer to our solver, hence making it robust to a large number of misdetections. Towards this goal, we wrap our optimal solver in a robust estimation scheme based on graduated non-convexity. To further enhance robustness to outliers, we also develop the first graph-theoretic formulation to prune outliers in category-level perception, which removes outliers via convex hull and maximum clique computations; the resulting approach is robust to 70 − 90% outliers. Our third contribution is an extensive experimental evaluation. Besides providing an ablation study on a simulated dataset and on the PASCAL3D+ dataset, we combine our solver with a deep-learned keypoint detector, and show that the resulting approach improves over the state of the art in vehicle pose estimation in the ApolloScape datasets.en_US
dc.description.sponsorshipARL (Contract W911NF-17-2-0181)en_US
dc.description.sponsorshipONR (Contract N00014-18-1-2828)en_US
dc.language.isoen
dc.publisherRobotics: Science and Systems Foundationen_US
dc.relation.isversionof10.15607/RSS.2021.XVII.025en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceProf. Carloneen_US
dc.titleOptimal Pose and Shape Estimation for Category-level 3D Object Perceptionen_US
dc.typeArticleen_US
dc.identifier.citationShi, Jingnan, Yang, Heng and Carlone, Luca. 2021. "Optimal Pose and Shape Estimation for Category-level 3D Object Perception." Robotics: Science and Systems XVII.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Laboratory for Information and Decision Systemsen_US
dc.relation.journalRobotics: Science and Systems XVIIen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2021-12-07T15:56:18Z
dspace.orderedauthorsShi, J; Yang, H; Carlone, Len_US
dspace.date.submission2021-12-07T15:56:21Z
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusPublication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version