dc.contributor.advisor | Aude Oliva and Mathew Monfort. | en_US |
dc.contributor.author | Zhou, Diane Yue. | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. | en_US |
dc.date.accessioned | 2021-01-06T18:31:16Z | |
dc.date.available | 2021-01-06T18:31:16Z | |
dc.date.copyright | 2020 | en_US |
dc.date.issued | 2020 | en_US |
dc.identifier.uri | https://hdl.handle.net/1721.1/129144 | |
dc.description | Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, September, 2020 | en_US |
dc.description | Cataloged from student-submitted PDF of thesis. | en_US |
dc.description | Includes bibliographical references (pages ). | en_US |
dc.description.abstract | Gaze is an important topic in computer vision as it reveals points of interest that tend to capture a subject's attention in a scene and potential intentions of the subject of gaze. Gaze data is becoming more readily obtainable with technological advances in wearable cameras, enabling the potential for more accurate first-person view gaze prediction models and interesting analyses of gaze. In this research, we use gaze data collected from Pupil Labs glasses to build and compare several gaze prediction models. Our models predict the location of gaze in each frame of a first-person view video by leveraging convolutional neural networks based solely on visual saliency maps. We believe that future work in incorporating more context information about the camera wearer's behavior and the scenes in the videos would further improve the model's performance. | en_US |
dc.description.statementofresponsibility | by Diane Yue Zhou. | en_US |
dc.format.extent | pages | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | MIT theses may be protected by copyright. Please reuse MIT thesis content according to the MIT Libraries Permissions Policy, which is available through the URL provided. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Electrical Engineering and Computer Science. | en_US |
dc.title | Gaze Prediction in First-Person View Videos | en_US |
dc.type | Thesis | en_US |
dc.description.degree | M. Eng. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.identifier.oclc | 1227276695 | en_US |
dc.description.collection | M.Eng. Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science | en_US |
dspace.imported | 2021-01-06T18:31:15Z | en_US |
mit.thesis.degree | Master | en_US |
mit.thesis.department | EECS | en_US |