Login

Gesture in Automatic Discourse Processing

Show simple item record

dc.contributor.advisor Randall Davis en_US
dc.contributor.author Eisenstein, Jacob en_US
dc.contributor.other Natural Language Processing en_US
dc.date.accessioned 2008-05-08T16:30:14Z
dc.date.available 2008-05-08T16:30:14Z
dc.date.issued 2008-05-07 en_US
dc.identifier.other MIT-CSAIL-TR-2008-027 en_US
dc.identifier.uri http://hdl.handle.net/1721.1/41526
dc.description.abstract Computers cannot fully understand spoken language without access to the wide range of modalities that accompany speech. This thesis addresses the particularly expressive modality of hand gesture, and focuses on building structured statistical models at the intersection of speech, vision, and meaning.My approach is distinguished in two key respects. First, gestural patterns are leveraged to discover parallel structures in the meaning of the associated speech. This differs from prior work that attempted to interpret individual gestures directly, an approach that was prone to a lack of generality across speakers. Second, I present novel, structured statistical models for multimodal language processing, which enable learning about gesture in its linguistic context, rather than in the abstract.These ideas find successful application in a variety of language processing tasks: resolving ambiguous noun phrases, segmenting speech into topics, and producing keyframe summaries of spoken language. In all three cases, the addition of gestural features -- extracted automatically from video -- yields significantly improved performance over a state-of-the-art text-only alternative. This marks the first demonstration that hand gesture improves automatic discourse processing. en_US
dc.description.provenance Submitted by CSAIL Importer (publications-dspace@csail.mit.edu) on 2008-05-08T16:30:13Z No. of bitstreams: 2 MIT-CSAIL-TR-2008-027.pdf: 3637566 bytes, checksum: 534fe91b05ade0bf5d39815cada44fda (MD5) MIT-CSAIL-TR-2008-027.ps: 73870 bytes, checksum: 6a6a02fd166fa97eae8adb784f0b5ee5 (MD5) en
dc.description.provenance Made available in DSpace on 2008-05-08T16:30:14Z (GMT). No. of bitstreams: 2 MIT-CSAIL-TR-2008-027.pdf: 3637566 bytes, checksum: 534fe91b05ade0bf5d39815cada44fda (MD5) MIT-CSAIL-TR-2008-027.ps: 73870 bytes, checksum: 6a6a02fd166fa97eae8adb784f0b5ee5 (MD5) Previous issue date: 2008-05-07 en
dc.format.extent 153 p. en_US
dc.relation Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory en_US
dc.relation en_US
dc.title Gesture in Automatic Discourse Processing en_US

Files in this item

Files Size Format
MIT-CSAIL-TR-2008-027.pdf 3.637Mb application/pdf
MIT-CSAIL-TR-2008-027.ps 73.87Kb application/postscript

This item appears in the following Collection(s)

Show simple item record

Search DSpace@MIT


Advanced Search

Browse

My Account

Links