A Benchmark of Computational Models of Saliency to Predict Human Fixations

Title: A Benchmark of Computational Models of Saliency to Predict Human Fixations
Author: Judd, Tilke; Durand, Frédo; Torralba, Antonio
Other Contributors: Computer Graphics
Advisor: Frédo Durand
Issue Date: 2012-01-13
Abstract: Many computational models of visual attention have been created from a wide variety of approaches to predict where people look in images. Each model is usually introduced by demonstrating its performance on new images, which makes it hard to compare models directly. To alleviate this problem, we propose a benchmark data set containing 300 natural images with eye tracking data from 39 observers to compare model performance. We calculate the performance of 10 models at predicting ground truth fixations using three different metrics. We provide a way for people to submit new models for evaluation online. We find that the Judd et al. and Graph-based visual saliency models perform best. In general, models with blurrier maps and models that include a center bias perform well. We add and optimize a blur and center bias for each model and show improvements. We compare performance to baseline models of chance, center and human performance. We show that human performance increases with the number of humans, up to a limit. We analyze the similarity of different models using multidimensional scaling and explore the relationship between model performance and fixation consistency. Finally, we offer observations about how to improve saliency models in the future.
URI: http://hdl.handle.net/1721.1/68590
Series/Report no.: MIT-CSAIL-TR-2012-001
Keywords: fixation maps, saliency maps, vision

Files in this item

File                         Size       Format
MIT-CSAIL-TR-2012-001.pdf    50.57Mb    PDF
supplementalMaterial.pdf     8.721Mb    PDF

Except where otherwise noted, this item's license is described as Creative Commons Attribution 3.0 Unported.
