Towards trainable man-machine interfaces : combining top-down constraints with bottom-up learning in facial analysis

Title: Towards trainable man-machine interfaces : combining top-down constraints with bottom-up learning in facial analysis
Author: Kumar, Vinay P. (Vinay Prasanna), 1972-
Other Contributors: Massachusetts Institute of Technology. Dept. of Brain and Cognitive Sciences.
Advisor: Tomaso Poggio.
Department: Massachusetts Institute of Technology. Dept. of Brain and Cognitive Sciences.
Publisher: Massachusetts Institute of Technology
Issue Date: 2002
Abstract: This thesis proposes a methodology for the design of man-machine interfaces by combining top-down and bottom-up processes in vision. From a computational perspective, we propose that the scientific-cognitive question of combining top-down and bottom-up knowledge is similar to the engineering question of labeling a training set in a supervised learning problem. We investigate these questions in the realm of facial analysis. We propose the use of a linear morphable model (LMM) for representing top-down structure and use it to model various facial variations such as mouth shapes and expression, the pose of faces and visual speech (visemes). We apply a supervised learning method based on support vector machine (SVM) regression for estimating the parameters of LMMs directly from pixel-based representations of faces. We combine these methods for designing new, more self-contained systems for recognizing facial expressions, estimating facial pose and for recognizing visemes.
Description: Thesis (Ph.D. in Computational Cognitive Science)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, 2002. Includes bibliographical references (leaves 72-[77]).
URI: http://hdl.handle.net/1721.1/29243
Keywords: Brain and Cognitive Sciences.

Files in this item

File | Size | Format
Preview, non-printable (open to all) | 3.717Mb | PDF
Full printable version (MIT only) | 3.716Mb | PDF
