Show simple item record

dc.contributor.authorSong, Yale
dc.contributor.authorDavis, Randall
dc.contributor.authorMorency, Louis-Philippe
dc.date.accessioned2014-04-11T14:32:59Z
dc.date.available2014-04-11T14:32:59Z
dc.date.issued2012-06
dc.identifier.isbn978-1-4673-1228-8
dc.identifier.isbn978-1-4673-1226-4
dc.identifier.isbn978-1-4673-1227-1
dc.identifier.urihttp://hdl.handle.net/1721.1/86101
dc.description.abstractMany human action recognition tasks involve data that can be factorized into multiple views such as body postures and hand shapes. These views often interact with each other over time, providing important cues to understanding the action. We present multi-view latent variable discriminative models that jointly learn both view-shared and view-specific sub-structures to capture the interaction between views. Knowledge about the underlying structure of the data is formulated as a multi-chain structured latent conditional model, explicitly learning the interaction between multiple views using disjoint sets of hidden variables in a discriminative manner. The chains are tied using a predetermined topology that repeats over time. We present three topologies - linked, coupled, and linked-coupled - that differ in the type of interaction between views that they model. We evaluate our approach on both segmented and unsegmented human action recognition tasks, using the ArmGesture, the NATOPS, and the ArmGesture-Continuous data. Experimental results show that our approach outperforms previous state-of-the-art action recognition models.en_US
dc.description.sponsorshipUnited States. Office of Naval Research (Science of Autonomy Program Contract N000140910625)en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (IIS-1018055)en_US
dc.description.sponsorshipUnited States. Army Research, Development, and Engineering Commanden_US
dc.language.isoen_US
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)en_US
dc.relation.isversionofhttp://dx.doi.org/10.1109/CVPR.2012.6247918en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleMulti-view latent variable discriminative models for action recognitionen_US
dc.typeArticleen_US
dc.identifier.citationY. Song, L. Morency, and R. Davis. “Multi-View Latent Variable Discriminative Models for Action Recognition.” 2012 IEEE Conference on Computer Vision and Pattern Recognition (n.d.). doi:10.1109/cvpr.2012.6247918.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorSong, Yaleen_US
dc.contributor.mitauthorDavis, Randallen_US
dc.relation.journalProceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognitionen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsSong, Y.; Morency, L.; Davis, R.en_US
dc.identifier.orcidhttps://orcid.org/0000-0001-5232-7281
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record