Using the Forest to See the Trees: Exploiting Context for Visual Object Detection and Localization

Torralba, A.; Murphy, K. P.; Freeman, W. T.

dc.contributor.author	Torralba, Antonio
dc.contributor.author	Murphy, K. P.
dc.contributor.author	Freeman, William T.
dc.date.accessioned	2012-09-20T17:34:06Z
dc.date.available	2012-09-20T17:34:06Z
dc.date.issued	2010-03
dc.identifier.issn	0001-0782
dc.identifier.uri	http://hdl.handle.net/1721.1/73074
dc.description.abstract	Recognizing objects in images is an active area of research in computer vision. In the last two decades, there has been much progress and there are already object recognition systems operating in commercial products. However, most of the algorithms for detecting objects perform an exhaustive search across all locations and scales in the image comparing local image regions with an object model. That approach ignores the semantic structure of scenes and tries to solve the recognition problem by brute force. In the real world, objects tend to covary with other objects, providing a rich collection of contextual associations. These contextual associations can be used to reduce the search space by looking only in places in which the object is expected to be; this also increases performance, by rejecting patterns that look like the target but appear in unlikely places. Most modeling attempts so far have defined the context of an object in terms of other previously recognized objects. The drawback of this approach is that inferring the context becomes as difficult as detecting each object. An alternative view of context relies on using the entire scene information holistically. This approach is algorithmically attractive since it dispenses with the need for a prior step of individual object recognition. In this paper, we use a probabilistic framework for encoding the relationships between context and object properties and we show how an integrated system provides improved performance. We view this as a significant step toward general purpose machine vision systems.	en_US
dc.description.sponsorship	United States. National Geospatial-Intelligence Agency (NEGI-1582-04-0004)	en_US
dc.description.sponsorship	United States. Army Research Office. Multidisciplinary University Research Initiative (Grant Number N00014-06-1-0734)	en_US
dc.description.sponsorship	National Science Foundation (U.S.). (Contract IIS-0413232)	en_US
dc.description.sponsorship	National Defense Science and Engineering Graduate Fellowship	en_US
dc.language.iso	en_US
dc.publisher	Association for Computing Machinery (ACM)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1145/1666420.1666446	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike 3.0	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/3.0/	en_US
dc.source	Other University Web Domain	en_US
dc.title	Using the Forest to See the Trees: Exploiting Context for Visual Object Detection and Localization	en_US
dc.type	Article	en_US
dc.identifier.citation	A. Torralba, K. P. Murphy, and W. T. Freeman. 2010. Using the forest to see the trees: exploiting context for visual object detection and localization. Communications of the ACM 53, 3 (March 2010), 107-114.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.approver	Freeman, William T.
dc.contributor.mitauthor	Torralba, Antonio
dc.relation.journal	Communications of the ACM	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dspace.orderedauthors	Torralba, A.; Murphy, K. P.; Freeman, W. T.	en
dc.identifier.orcid	https://orcid.org/0000-0003-4915-0256
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name: Freeman-Using the forest.pdf
Size: 1.013 MB
Format: PDF

This item appears in the following Collection(s)

MIT Open Access Articles
