Login

Learning Grammatical Models for Object Recognition

Show simple item record

dc.contributor.advisor Leslie Kaelbling en_US
dc.contributor.author Aycinena, Meg en_US
dc.contributor.author Kaelbling, Leslie Pack en_US
dc.contributor.author Lozano-Perez, Tomas en_US
dc.contributor.other Learning and Intelligent Systems en_US
dc.date.accessioned 2008-02-25T19:46:04Z
dc.date.available 2008-02-25T19:46:04Z
dc.date.issued 2008-02-25 en_US
dc.identifier.other MIT-CSAIL-TR-2008-011 en_US
dc.identifier.uri http://hdl.handle.net/1721.1/40288
dc.description.abstract Many object recognition systems are limited by their inability to share common parts or structure among related object classes. This capability is desirable because it allows information about parts and relationships in one object class to be generalized to other classes for which it is relevant. With this goal in mind, we have designed a representation and recognition framework that captures structural variability and shared part structure within and among object classes. The framework uses probabilistic geometric grammars (PGGs) to represent object classes recursively in terms of their parts, thereby exploiting the hierarchical and substitutive structure inherent to many types of objects. To incorporate geometric and appearance information, we extend traditional probabilistic context-free grammars to represent distributions over the relative geometric characteristics of object parts as well as the appearance of primitive parts. We describe an efficient dynamic programming algorithm for object categorization and localization in images given a PGG model. We also develop an EM algorithm to estimate the parameters of a grammar structure from training data, and a search-based structure learning approach that finds a compact grammar to explain the image data while sharing substructure among classes. Finally, we describe a set of experiments that demonstrate empirically that the system provides a performance benefit. en_US
dc.description.provenance Submitted by CSAIL Importer (publications-dspace@csail.mit.edu) on 2008-02-25T19:46:03Z No. of bitstreams: 2 MIT-CSAIL-TR-2008-011.pdf: 21438250 bytes, checksum: 57959173c4ac5539ac7944e966979916 (MD5) MIT-CSAIL-TR-2008-011.ps: 93497541 bytes, checksum: cc4e59aa4075e73fe6e6d7f4887c4ac4 (MD5) en
dc.description.provenance Made available in DSpace on 2008-02-25T19:46:04Z (GMT). No. of bitstreams: 2 MIT-CSAIL-TR-2008-011.pdf: 21438250 bytes, checksum: 57959173c4ac5539ac7944e966979916 (MD5) MIT-CSAIL-TR-2008-011.ps: 93497541 bytes, checksum: cc4e59aa4075e73fe6e6d7f4887c4ac4 (MD5) Previous issue date: 2008-02-25 en
dc.format.extent 28 p. en_US
dc.relation Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory en_US
dc.relation en_US
dc.title Learning Grammatical Models for Object Recognition en_US

Files in this item

Files Size Format
MIT-CSAIL-TR-2008-011.pdf 21.43Mb application/pdf
MIT-CSAIL-TR-2008-011.ps 93.49Mb application/postscript

This item appears in the following Collection(s)

Show simple item record

Search DSpace@MIT


Advanced Search

Browse

My Account

Links