Parsing Semantic Parts of Cars Using Graphical Models and Segment Appearance Consistency

Lu, Wenhao; Lian, Xiaochen; Yuille, Alan L.

Author(s)

Lu, Wenhao; Lian, Xiaochen; Yuille, Alan L.

DownloadCBMM-Memo-018.pdf (5.023Mb)

Terms of use

Attribution-NonCommercial 3.0 United States http://creativecommons.org/licenses/by-nc/3.0/us/

Metadata

Show full item record

Abstract

This paper addresses the problem of semantic part parsing (segmentation) of cars, i.e.assigning every pixel within the car to one of the parts (e.g.body, window, lights, license plates and wheels). We formulate this as a landmark identification problem, where a set of landmarks specifies the boundaries of the parts. A novel mixture of graphical models is proposed, which dynamically couples the landmarks to a hierarchy of segments. When modeling pairwise relation between landmarks, this coupling enables our model to exploit the local image contents in addition to spatial deformation, an aspect that most existing graphical models ignore. In particular, our model enforces appearance consistency between segments within the same part. Parsing the car, including finding the optimal coupling between landmarks and segments in the hierarchy, is performed by dynamic programming. We evaluate our method on a subset of PASCAL VOC 2010 car images and on the car subset of 3D Object Category dataset (CAR3D). We show good results and, in particular, quantify the effectiveness of using the segment appearance consistency in terms of accuracy of part localization and segmentation.

Date issued

2014-06-13

URI

http://hdl.handle.net/1721.1/100182

Publisher

Center for Brains, Minds and Machines (CBMM), arXiv

Citation

arXiv:1406.2375v2

Series/Report no.

CBMM Memo Series;018

Keywords

Hierarchy, Vision, Object Recognition

Collections

CBMM Memo Series

The following license files are associated with this item:

Creative Commons