MovieQA: Understanding Stories in Movies through Question-Answering

Tapaswi, Makarand; Zhu, Yukun; Stiefelhagen, Rainer; Torralba, Antonio; Urtasun, Raquel; Fidler, Sanja

Author(s)

Tapaswi, Makarand; Zhu, Yukun; Stiefelhagen, Rainer; Torralba, Antonio; Urtasun, Raquel; ... Show more

DownloadTorralba_MovieQA.pdf (2.420Mb)

OPEN_ACCESS_POLICY

Terms of use

Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/

Metadata

Show full item record

Abstract

We introduce the MovieQA dataset which aims to evaluate automatic story comprehension from both video and text. The dataset consists of 14,944 questions about 408 movies with high semantic diversity. The questions range from simpler "Who" did "What" to "Whom", to "Why" and "How" certain events occurred. Each question comes with a set of five possible answers, a correct one and four deceiving answers provided by human annotators. Our dataset is unique in that it contains multiple sources of information - video clips, plots, subtitles, scripts, and DVS. We analyze our data through various statistics and methods. We further extend existing QA techniques to show that question-answering with such open-ended semantics is hard. We make this data set public along with an evaluation benchmark to encourage inspiring work in this challenging domain. Keywords: Motion pictures, Visualization, Semantics, Voltage control, Cognition, Natural languages, Computer vision

Date issued

2016-12

URI

http://hdl.handle.net/1721.1/113894

Department

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science

Journal

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

Citation

Tapaswi, Makarand, et al. "MovieQA: Understanding Stories in Movies through Question-Answering." 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 27-30 June, 2016, Las Vegas, Nevada, IEEE, 2016, pp. 4631–40.

Version: Author's final manuscript

ISBN

978-1-4673-8851-1

Collections

MIT Open Access Articles

DSpace@MIT