Optimal Pose and Shape Estimation for Category-level 3D Object Perception

Shi, Jingnan; Yang, Heng; Carlone, Luca

dc.contributor.author	Shi, Jingnan
dc.contributor.author	Yang, Heng
dc.contributor.author	Carlone, Luca
dc.date.accessioned	2021-12-07T20:29:19Z
dc.date.available	2021-12-07T16:02:15Z
dc.date.available	2021-12-07T20:29:19Z
dc.date.issued	2021
dc.identifier.uri	https://hdl.handle.net/1721.1/138354.2
dc.description.abstract	We consider a category-level perception problem, where one is given 3D sensor data picturing an object of a given category (e.g., a car), and has to reconstruct the pose and shape of the object despite intra-class variability (i.e., different car models have different shapes). We consider an active shape model, where —for an object category— we are given a library of potential CAD models describing objects in that category, and we adopt a standard formulation where pose and shape estimation are formulated as a non-convex optimization. Our first contribution is to provide the first certifiably optimal solver for pose and shape estimation. In particular, we show that rotation estimation can be decoupled from the estimation of the object translation and shape, and we demonstrate that (i) the optimal object rotation can be computed via a tight (small-size) semidefinite relaxation, and (ii) the translation and shape parameters can be computed in closed-form given the rotation. Our second contribution is to add an outlier rejection layer to our solver, hence making it robust to a large number of misdetections. Towards this goal, we wrap our optimal solver in a robust estimation scheme based on graduated non-convexity. To further enhance robustness to outliers, we also develop the first graph-theoretic formulation to prune outliers in category-level perception, which removes outliers via convex hull and maximum clique computations; the resulting approach is robust to 70 − 90% outliers. Our third contribution is an extensive experimental evaluation. Besides providing an ablation study on a simulated dataset and on the PASCAL3D+ dataset, we combine our solver with a deep-learned keypoint detector, and show that the resulting approach improves over the state of the art in vehicle pose estimation in the ApolloScape datasets.	en_US
dc.description.sponsorship	ARL (Contract W911NF-17-2-0181)	en_US
dc.description.sponsorship	ONR (Contract N00014-18-1-2828)	en_US
dc.language.iso	en
dc.publisher	Robotics: Science and Systems Foundation	en_US
dc.relation.isversionof	10.15607/RSS.2021.XVII.025	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	Prof. Carlone	en_US
dc.title	Optimal Pose and Shape Estimation for Category-level 3D Object Perception	en_US
dc.type	Article	en_US
dc.identifier.citation	Shi, Jingnan, Yang, Heng and Carlone, Luca. 2021. "Optimal Pose and Shape Estimation for Category-level 3D Object Perception." Robotics: Science and Systems XVII.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Laboratory for Information and Decision Systems	en_US
dc.relation.journal	Robotics: Science and Systems XVII	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2021-12-07T15:56:18Z
dspace.orderedauthors	Shi, J; Yang, H; Carlone, L	en_US
dspace.date.submission	2021-12-07T15:56:21Z
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Publication Information Needed	en_US

Files in this item

Name:: 2104.08383.pdf
Size:: 8.132Mb
Format:: Unknown
Description:: Accepted version

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record

Version	Item	Date	Summary
2	1721.1/138354.2*	2021-12-07T20:07:40Z	Authority information verified/added.
1	1721.1/138354	2021-12-07T16:02:15Z

DSpace@MIT

Optimal Pose and Shape Estimation for Category-level 3D Object Perception

Files in this item

This item appears in the following Collection(s)

Version History