Show simple item record

dc.contributor.authorRecasens Continente, Adria
dc.contributor.authorVondrick, Carl Martin
dc.contributor.authorKhosla, Aditya
dc.contributor.authorTorralba, Antonio
dc.date.accessioned2019-11-06T20:19:42Z
dc.date.available2019-11-06T20:19:42Z
dc.date.issued2017-12
dc.identifier.issn2380-7504
dc.identifier.urihttps://hdl.handle.net/1721.1/122778
dc.description.abstractFollowing the gaze of people inside videos is an important signal for understanding people and their actions. In this paper, we present an approach for following gaze in video by predicting where a person (in the video) is looking even when the object is in a different frame. We collect VideoGaze, a new dataset which we use as a benchmark to both train and evaluate models. Given one frame with a person in it, our model estimates a density for gaze location in every frame and the probability that the person is looking in that particular frame. A key aspect of our approach is an end-to-end model that jointly estimates: saliency, gaze pose, and geometric relationships between views while only using gaze as supervision. Visualizations suggest that the model learns to internally solve these intermediate tasks automatically without additional supervision. Experiments show that our approach follows gaze in video better than existing approaches, enabling a richer understanding of human activities in video. Keywords: Motion pictures, Head, Three-dimensional displays, Predictive models, Geometry, Semantics, gaze tracking, learning (artificial intelligence), video signal processingen_US
dc.language.isoen
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.relation.isversionofhttp://dx.doi.org/10.1109/iccv.2017.160en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleFollowing Gaze in Videoen_US
dc.typeArticleen_US
dc.identifier.citationRecasens Continente, Adria et al. "Following Gaze in Video," 2017 IEEE International Conference on Computer Vision (ICCV), October 2017, Venice, Italy, Institute of Electrical and Electronics Engineers, December 2017 ©IEEEen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.relation.journal2017 IEEE International Conference on Computer Vision (ICCV)en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2019-07-11T16:25:39Z
dspace.date.submission2019-07-11T16:25:40Z


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record