GANalyze: Toward Visual Definitions of Cognitive Image Properties

Goetschalckx, Lore; Andonian, Alexander J; Oliva, Aude; Isola, Phillip John

Terms of use

Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/

Abstract

We introduce a framework that uses Generative Adversarial Networks (GANs) to study cognitive properties like memorability, aesthetics, and emotional valence. These attributes are of interest because we do not have a concrete visual definition of what they entail. What does it look like for a dog to be more memorable? GANs allow us to generate a manifold of natural-looking images with fine-grained differences in their visual attributes. By navigating this manifold in directions that increase memorability, we can visualize what it looks like for a particular generated image to become more memorable. The resulting "visual definitions" surface image properties (like "object size") that may underlie memorability. Through behavioral experiments, we verify that our method indeed discovers image manipulations that causally affect human memory performance. We further demonstrate that the same framework can be used to analyze image aesthetics and emotional valence. ganalyze.csail.mit.edu.
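
The idea of navigating a generator's latent space in a direction that raises a predicted property can be illustrated with a small toy. The sketch below is not the authors' implementation: the linear "generator", the linear "assessor", the squared-error objective, and every name and number in it are assumptions made for illustration, since the abstract does not spell out the training objective. It learns a single direction theta such that moving a latent vector by alpha along theta changes the assessor's score by roughly alpha.

```python
import random

# Hypothetical toy model: none of these names or numbers come from the paper.
M = [[1.0, 0.2, 0.0],
     [0.0, 1.0, 0.3],
     [0.1, 0.0, 1.0]]          # fixed "generator" weights
W = [0.6, -0.4, 0.2]           # fixed "assessor" weights
DIM = 3                        # latent dimensionality of the toy


def generator(z):
    """Stand-in for a GAN generator: maps a latent vector to image features."""
    return [sum(m * v for m, v in zip(row, z)) for row in M]


def assessor(x):
    """Stand-in for a property predictor such as a memorability scorer."""
    return sum(w * v for w, v in zip(W, x))


def score(z):
    """Predicted property of the sample generated from latent vector z."""
    return assessor(generator(z))


def shift(z, theta, alpha):
    """Move the latent vector z by alpha steps along direction theta."""
    return [zi + alpha * ti for zi, ti in zip(z, theta)]


def loss(theta, batch):
    """Assumed objective: the shifted sample should score alpha higher."""
    errors = [(score(shift(z, theta, a)) - (score(z) + a)) ** 2
              for z, a in batch]
    return sum(errors) / len(errors)


def learn_direction(steps=300, lr=0.5, eps=1e-4, seed=0):
    """Fit theta by gradient descent, using central differences so the
    generator and assessor are only ever called as black boxes."""
    rng = random.Random(seed)
    theta = [0.0] * DIM
    for _ in range(steps):
        # Each batch pairs a random latent vector with a random step size.
        batch = [([rng.gauss(0.0, 1.0) for _ in range(DIM)],
                  rng.uniform(-1.0, 1.0)) for _ in range(16)]
        grad = []
        for i in range(DIM):
            up, down = theta[:], theta[:]
            up[i] += eps
            down[i] -= eps
            grad.append((loss(up, batch) - loss(down, batch)) / (2 * eps))
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta


if __name__ == "__main__":
    theta = learn_direction()
    z = [0.3, -1.2, 0.5]
    for alpha in (-1.0, 0.0, 1.0):
        print(alpha, round(score(shift(z, theta, alpha)), 3))
```

In a real setting the generator and assessor would be pretrained neural networks and the gradient would come from automatic differentiation rather than finite differences. The structure stays the same: both models are held fixed, only the direction is learned, and sweeping alpha from negative to positive yields a sequence of samples whose predicted property decreases or increases.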

Date issued

2020-10

URI

https://hdl.handle.net/1721.1/129403

Department

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science; Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory

Journal

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

Publisher

IEEE

Citation

Goetschalckx, Lore et al. “GANalyze: Toward Visual Definitions of Cognitive Image Properties.” Paper in the Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), 27 October-2 November 2019, Seoul, Korea, IEEE, © 2019 The Author(s)

Version: Original manuscript

ISBN

9781728148038

Collections

MIT Open Access Articles