Multi-view latent variable discriminative models for action recognition

Song, Y.; Morency, L.; Davis, R.

dc.contributor.author	Song, Yale
dc.contributor.author	Davis, Randall
dc.contributor.author	Morency, Louis-Philippe
dc.date.accessioned	2014-04-11T14:32:59Z
dc.date.available	2014-04-11T14:32:59Z
dc.date.issued	2012-06
dc.identifier.isbn	978-1-4673-1228-8
dc.identifier.isbn	978-1-4673-1226-4
dc.identifier.isbn	978-1-4673-1227-1
dc.identifier.uri	http://hdl.handle.net/1721.1/86101
dc.description.abstract	Many human action recognition tasks involve data that can be factorized into multiple views such as body postures and hand shapes. These views often interact with each other over time, providing important cues to understanding the action. We present multi-view latent variable discriminative models that jointly learn both view-shared and view-specific sub-structures to capture the interaction between views. Knowledge about the underlying structure of the data is formulated as a multi-chain structured latent conditional model, explicitly learning the interaction between multiple views using disjoint sets of hidden variables in a discriminative manner. The chains are tied using a predetermined topology that repeats over time. We present three topologies - linked, coupled, and linked-coupled - that differ in the type of interaction between views that they model. We evaluate our approach on both segmented and unsegmented human action recognition tasks, using the ArmGesture, the NATOPS, and the ArmGesture-Continuous data. Experimental results show that our approach outperforms previous state-of-the-art action recognition models.	en_US
dc.description.sponsorship	United States. Office of Naval Research (Science of Autonomy Program Contract N000140910625)	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (IIS-1018055)	en_US
dc.description.sponsorship	United States. Army Research, Development, and Engineering Command	en_US
dc.language.iso	en_US
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1109/CVPR.2012.6247918	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	MIT web domain	en_US
dc.title	Multi-view latent variable discriminative models for action recognition	en_US
dc.type	Article	en_US
dc.identifier.citation	Y. Song, L. Morency, and R. Davis. “Multi-View Latent Variable Discriminative Models for Action Recognition.” 2012 IEEE Conference on Computer Vision and Pattern Recognition (n.d.). doi:10.1109/cvpr.2012.6247918.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.mitauthor	Song, Yale	en_US
dc.contributor.mitauthor	Davis, Randall	en_US
dc.relation.journal	Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition	en_US
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dspace.orderedauthors	Song, Y.; Morency, L.; Davis, R.	en_US
dc.identifier.orcid	https://orcid.org/0000-0001-5232-7281
mit.license	OPEN_ACCESS_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Davis_Multi-view.pdf
Size:: 2.616Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record