Latent-Dynamic Discriminative Models for Continuous Gesture Recognition

Morency, Louis-Philippe; Quattoni, Ariadna; Darrell, Trevor

Author(s)

Morency, Louis-Philippe; Quattoni, Ariadna; Darrell, Trevor

DownloadMIT-CSAIL-TR-2007-002.pdf (357.7Kb)

Additional downloads

Other Contributors

Vision

Advisor

Trevor Darrell

Metadata

Show full item record

Abstract

Many problems in vision involve the prediction of a class label for each frame in an unsegmented sequence. In this paper we develop a discriminative framework for simultaneous sequence segmentation and labeling which can capture both intrinsic and extrinsic class dynamics. Our approach incorporates hidden state variables which model the sub-structure of a class sequence and learn the dynamics between class labels. Each class label has a disjoint set of associated hidden states, which enables efficient training and inference in our model. We evaluated our method on the task of recognizing human gestures from unsegmented video streams and performed experiments on three different datasets of head and eye gestures. Our results demonstrate that our model for visual gesture recognition outperform models based on Support Vector Machines, Hidden Markov Models, and Conditional Random Fields.

Date issued

2007-01-07

URI

http://hdl.handle.net/1721.1/35276

Other identifiers

MIT-CSAIL-TR-2007-002

Series/Report no.

Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory

Collections

CSAIL Technical Reports (July 1, 2003 - present)