MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Doctoral Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Doctoral Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Exploiting visual motion to understand our visual world

Author(s)
Xue, Tianfan
Thumbnail
DownloadFull printable version (21.82Mb)
Other Contributors
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science.
Advisor
William T. Freeman.
Terms of use
MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582
Metadata
Show full item record
Abstract
Motion is important for understanding our visual world. The human visual system relies heavily on motion perception to recognize the movement of objects, to infer the 3D geometry of a scene, and to perceive the emotions of other people. Modern computer vision systems also use motion signals extracted from video sequences to infer high-level visual concepts, including human activities and abnormal events. Both human and computer visual systems try to perceive changes in the 3D physical world through its 2D projection, either on the image plane or on our retinas. The observed 2D pixel movement is the result of several factors. First, the image sensor might move, inducing egocentric motion, even when the scene is static. Second, the medium between objects and a camera might change and affect how light transmits from the objects to the sensor, like the shimmering in a hot-road mirage. Finally, the objects in a scene might move, either actively, like a person walking along a street, or passively, like a tree branch that is vibrating due to wind. All of these movements reveal information about our visual world. In this dissertation, we will discuss how to infer physical properties of our visual world from observed 2D movement. First, we show how to infer the depth of a scene from egocentric motion and use this to remove undesired visual obstructions. Second, we relate the slight wiggling motion due to refraction to the movement of hot air and infer the location and velocity of the airflow. Last, we illustrate how to infer the physical properties of objects, such as their deformation space or internal structure, from their motion.
Description
Thesis: Ph. D., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, 2017.
 
Cataloged from PDF version of thesis.
 
Includes bibliographical references (pages 115-126).
 
Date issued
2017
URI
http://hdl.handle.net/1721.1/113978
Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Publisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.

Collections
  • Doctoral Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.