Show simple item record

dc.contributor.authorBarbu, A
dc.contributor.authorMayo, D
dc.contributor.authorAlverio, J
dc.contributor.authorLuo, W
dc.contributor.authorWang, C
dc.contributor.authorGutfreund, D
dc.contributor.authorTenenbaum, J
dc.contributor.authorKatz, B
dc.date.accessioned2021-12-07T14:17:49Z
dc.date.available2021-12-07T14:17:49Z
dc.date.issued2019-01-01
dc.identifier.urihttps://hdl.handle.net/1721.1/138345
dc.description.abstract© 2019 Neural information processing systems foundation. All rights reserved. We collect a large real-world test set, ObjectNet, for object recognition with controls where object backgrounds, rotations, and imaging viewpoints are random. Most scientific experiments have controls, confounds which are removed from the data, to ensure that subjects cannot perform a task by exploiting trivial correlations in the data. Historically, large machine learning and computer vision datasets have lacked such controls. This has resulted in models that must be fine-tuned for new datasets and perform better on datasets than in real-world applications. When tested on ObjectNet, object detectors show a 40-45% drop in performance, with respect to their performance on other benchmarks, due to the controls for biases. Controls make ObjectNet robust to fine-tuning showing only small performance increases. We develop a highly automated platform that enables gathering datasets with controls by crowdsourcing image capturing and annotation. ObjectNet is the same size as the ImageNet test set (50,000 images), and by design does not come paired with a training set in order to encourage generalization. The dataset is both easier than ImageNet - objects are largely centered and unoccluded - and harder, due to the controls. Although we focus on object recognition here, data with controls can be gathered at scale using automated tools throughout machine learning to generate datasets that exercise models in new ways thus providing valuable feedback to researchers. This work opens up new avenues for research in generalizable, robust, and more human-like computer vision and in creating datasets where results are predictive of real-world performance.en_US
dc.language.isoen
dc.relation.isversionofhttps://papers.nips.cc/paper/2019/hash/97af07a14cacba681feacf3012730892-Abstract.htmlen_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceNeural Information Processing Systems (NIPS)en_US
dc.titleObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition modelsen_US
dc.typeArticleen_US
dc.identifier.citationBarbu, A, Mayo, D, Alverio, J, Luo, W, Wang, C et al. 2019. "ObjectNet: A large-scale bias-controlled dataset for pushing the limits of object recognition models." Advances in Neural Information Processing Systems, 32.
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.contributor.departmentCenter for Brains, Minds, and Machines
dc.contributor.departmentMIT-IBM Watson AI Lab
dc.relation.journalAdvances in Neural Information Processing Systemsen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2021-12-07T14:15:26Z
dspace.orderedauthorsBarbu, A; Mayo, D; Alverio, J; Luo, W; Wang, C; Gutfreund, D; Tenenbaum, J; Katz, Ben_US
dspace.date.submission2021-12-07T14:15:31Z
mit.journal.volume32en_US
mit.licensePUBLISHER_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record