Sharing visual features for multiclass and multiview object detection
Author(s)
Torralba, Antonio; Murphy, Kevin P.; Freeman, William T.
Abstract
We consider the problem of detecting a large number of different classes of objects in cluttered scenes. Traditional approaches require applying a battery of different classifiers to the image, at multiple locations and scales. This can be slow and can require a lot of training data, since each classifier requires the computation of many different image features. In particular, for independently trained detectors, the (run-time) computational complexity and the (training-time) sample complexity scale linearly with the number of classes to be detected. It seems unlikely that such an approach will scale up to allow recognition of hundreds or thousands of objects.

We present a multi-class boosting procedure (joint boosting) that reduces the computational and sample complexity by finding common features that can be shared across the classes (and/or views). The detectors for each class are trained jointly, rather than independently. For a given performance level, the total number of features required, and therefore the computational cost, is observed to scale approximately logarithmically with the number of classes. The features selected by joint training tend to be generic, edge-like features typical of many natural structures, rather than specific object parts. These generic features generalize better and considerably reduce the computational cost of multi-class object detection.
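As a rough illustration of the joint boosting idea summarized in the abstract, the sketch below implements a GentleBoost-style loop in which, at each round, a single regression stump (one feature, one threshold, shared outputs a and b) is fitted jointly to a greedily grown subset of classes; the remaining classes receive a per-class constant so that errors are comparable across subset sizes. All function and variable names, the quantile threshold grid, and the decision not to add the per-class constant to the non-sharing classes' scores are simplifying assumptions made for illustration, not the authors' implementation.

```python
import numpy as np

def shared_stump_error(X, Z, W, subset, f, thr):
    """Joint weighted squared error of a regression stump shared by `subset`.

    X: (n, d) features; Z: (n, C) labels in {+1, -1}; W: (n, C) weights.
    Classes in `subset` share outputs a/b; the remaining classes get a
    class-specific constant k_c so errors are comparable across subset sizes."""
    n, C = Z.shape
    mask = X[:, f] > thr
    w, z = W[:, subset], Z[:, subset]            # (n, k) for the k sharing classes
    # Shared outputs a (feature above threshold) and b (below), fitted by
    # weighted least squares pooled over all sharing classes.
    a = (w[mask] * z[mask]).sum() / max(w[mask].sum(), 1e-12)
    b = (w[~mask] * z[~mask]).sum() / max(w[~mask].sum(), 1e-12)
    pred = np.where(mask, a, b)[:, None]         # (n, 1), broadcast over classes
    err = (w * (z - pred) ** 2).sum()
    rest = [c for c in range(C) if c not in subset]
    if rest:
        wr, zr = W[:, rest], Z[:, rest]
        k = (wr * zr).sum(axis=0) / np.maximum(wr.sum(axis=0), 1e-12)
        err += (wr * (zr - k) ** 2).sum()        # error of the per-class constants
    return err, (f, thr, a, b)

def best_shared_stump(X, Z, W, subset, n_thresholds=8):
    """Best shared stump over all features and a coarse quantile grid of thresholds."""
    best = (np.inf, None)
    for f in range(X.shape[1]):
        for thr in np.quantile(X[:, f], np.linspace(0.1, 0.9, n_thresholds)):
            cand = shared_stump_error(X, Z, W, subset, f, thr)
            if cand[0] < best[0]:
                best = cand
    return best

def joint_boost(X, Z, rounds=50):
    """Train C additive one-vs-all classifiers whose weak learners are shared.

    Greedy subset search: grow the sharing subset one class at a time, keeping
    the subset size with the lowest joint error (O(C^2) candidate subsets per
    round instead of the 2^C exhaustive search)."""
    n, C = Z.shape
    W = np.ones((n, C))                          # boosting weights w_i^c
    H = np.zeros((n, C))                         # additive scores H(v, c)
    ensemble = []
    for _ in range(rounds):
        chosen, remaining = [], list(range(C))
        best_err, best_stump, best_subset = np.inf, None, None
        while remaining:
            # add the class whose inclusion gives the lowest joint error
            (err, stump), c = min(
                ((best_shared_stump(X, Z, W, chosen + [c]), c) for c in remaining),
                key=lambda t: t[0][0])
            chosen.append(c)
            remaining.remove(c)
            if err < best_err:
                best_err, best_stump, best_subset = err, stump, list(chosen)
        f, thr, a, b = best_stump
        pred = np.where(X[:, f] > thr, a, b)
        for c in best_subset:                    # only the sharing classes are updated
            H[:, c] += pred
            W[:, c] = np.exp(-Z[:, c] * H[:, c])  # GentleBoost-style reweighting
        # NOTE: the full formulation also adds the constant k_c to non-sharing
        # classes; it is used here only to compare subsets, as a simplification.
        ensemble.append((f, thr, a, b, best_subset))
    return ensemble, H
```

With synthetic data such as X = np.random.randn(200, 30) and a (200, C) matrix Z of +1/-1 one-vs-all labels, joint_boost(X, Z, rounds=10) returns an ensemble recording, for each round, which feature and which subset of classes share it; the number of such shared features needed for a target performance level is the quantity the abstract reports as growing roughly logarithmically with the number of classes.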
Date issued
2004-04-14
Other identifiers
MIT-CSAIL-TR-2004-019
AIM-2004-008
Series/Report no.
Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory
Keywords
AI, Object detection, sharing features, feature selection, multiclass, Boosting