Gaze Prediction in First-Person View Videos
Author(s)Zhou, Diane Yue.
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.
Aude Oliva and Mathew Monfort.
MetadataShow full item record
Gaze is an important topic in computer vision as it reveals points of interest that tend to capture a subject's attention in a scene and potential intentions of the subject of gaze. Gaze data is becoming more readily obtainable with technological advances in wearable cameras, enabling the potential for more accurate first-person view gaze prediction models and interesting analyses of gaze. In this research, we use gaze data collected from Pupil Labs glasses to build and compare several gaze prediction models. Our models predict the location of gaze in each frame of a first-person view video by leveraging convolutional neural networks based solely on visual saliency maps. We believe that future work in incorporating more context information about the camera wearer's behavior and the scenes in the videos would further improve the model's performance.
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, September, 2020Cataloged from student-submitted PDF of thesis.Includes bibliographical references (pages ).
DepartmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Massachusetts Institute of Technology
Electrical Engineering and Computer Science.