Understanding and Predicting Image Memorability at a Large Scale

Khosla, Aditya; Raju, Akhil S.; Torralba, Antonio; Oliva, Aude

dc.contributor.author	Khosla, Aditya
dc.contributor.author	Raju, Akhil S.
dc.contributor.author	Torralba, Antonio
dc.contributor.author	Oliva, Aude
dc.date.accessioned	2017-12-29T20:12:30Z
dc.date.available	2017-12-29T20:12:30Z
dc.date.issued	2016-02
dc.date.submitted	2015-12
dc.identifier.isbn	978-1-4673-8391-2
dc.identifier.uri	http://hdl.handle.net/1721.1/112993
dc.description.abstract	Progress in estimating visual memorability has been limited by the small scale and lack of variety of benchmark data. Here, we introduce a novel experimental procedure to objectively measure human memory, allowing us to build LaMem, the largest annotated image memorability dataset to date (containing 60,000 images from diverse sources). Using Convolutional Neural Networks (CNNs), we show that fine-tuned deep features outperform all other features by a large margin, reaching a rank correlation of 0.64, near human consistency (0.68). Analysis of the responses of the high-level CNN layers shows which objects and regions are positively, and negatively, correlated with memorability, allowing us to create memorability maps for each image and provide a concrete method to perform image memorability manipulation. This work demonstrates that one can now robustly estimate the memorability of images from many different classes, positioning memorability and deep memorability features as prime candidates to estimate the utility of information for cognitive systems. Our model and data are available at: http://memorability.csail.mit.edu.	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (Grant 1532591)	en_US
dc.description.sponsorship	McGovern Institute for Brain Research at MIT. Neurotechnology (MINT) Program	en_US
dc.description.sponsorship	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory. MIT Big Data Initiative	en_US
dc.description.sponsorship	Google (Firm)	en_US
dc.description.sponsorship	Xerox Corporation	en_US
dc.language.iso	en_US
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1109/ICCV.2015.275	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	MIT Web Domain	en_US
dc.title	Understanding and Predicting Image Memorability at a Large Scale	en_US
dc.type	Article	en_US
dc.identifier.citation	Khosla, Aditya, et al. "Understanding and Predicting Image Memorability at a Large Scale." 2015 IEEE International Conference on Computer Vision (ICCV), 7-13 December 2015, Santiago, Chile, IEEE, 2015, pp. 2390–98.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.department	Massachusetts Institute of Technology. Media Laboratory	en_US
dc.contributor.mitauthor	Khosla, Aditya
dc.contributor.mitauthor	Raju, Akhil S.
dc.contributor.mitauthor	Torralba, Antonio
dc.contributor.mitauthor	Oliva, Aude
dc.relation.journal	2015 IEEE International Conference on Computer Vision (ICCV)	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Khosla, Aditya; Raju, Akhil S.; Torralba, Antonio; Oliva, Aude	en_US
dspace.embargo.terms	N	en_US
dc.identifier.orcid	https://orcid.org/0000-0002-0007-3352
dc.identifier.orcid	https://orcid.org/0000-0003-4915-0256
mit.license	OPEN_ACCESS_POLICY	en_US

Files in this item

Name: Torralba_Understanding and ...
Size: 4.135Mb
Format: PDF

This item appears in the following Collection(s)

MIT Open Access Articles
