DSpace About DSpace Software     MIT Libraries    
 

DSpace at MIT >
Computer Science and Artificial Intelligence Lab (CSAIL) >
CSAIL Digital Archive >
CSAIL Technical Reports (July 1, 2003 - present) >

Please use this identifier to cite or link to this item: http://hdl.handle.net/1721.1/41526

Title: Gesture in Automatic Discourse Processing
Authors: Eisenstein, Jacob
Advisor: Randall Davis
Other contributors: Natural Language Processing
Issue Date: 7-May-2008
Related To: Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory
Abstract: Computers cannot fully understand spoken language without access to the wide range of modalities that accompany speech. This thesis addresses the particularly expressive modality of hand gesture, and focuses on building structured statistical models at the intersection of speech, vision, and meaning.My approach is distinguished in two key respects. First, gestural patterns are leveraged to discover parallel structures in the meaning of the associated speech. This differs from prior work that attempted to interpret individual gestures directly, an approach that was prone to a lack of generality across speakers. Second, I present novel, structured statistical models for multimodal language processing, which enable learning about gesture in its linguistic context, rather than in the abstract.These ideas find successful application in a variety of language processing tasks: resolving ambiguous noun phrases, segmenting speech into topics, and producing keyframe summaries of spoken language. In all three cases, the addition of gestural features -- extracted automatically from video -- yields significantly improved performance over a state-of-the-art text-only alternative. This marks the first demonstration that hand gesture improves automatic discourse processing.
URI: http://hdl.handle.net/1721.1/41526
Appears in Collections:CSAIL Technical Reports (July 1, 2003 - present)

Files in This Item:

File Description SizeFormat
MIT-CSAIL-TR-2008-027.pdf3552KbAdobe PDFView/Open
MIT-CSAIL-TR-2008-027.ps72KbPostScriptView/Open


This item is protected by original copyright

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

invent @ MIT: The HP-MIT Alliance Copyright © 2002 MIT and  Hewlett-Packard - Feedback