Show simple item record

dc.contributor.authorSong, Yale
dc.contributor.authorMorency, Louis-Philippe
dc.contributor.authorDavis, Randall
dc.date.accessioned2014-04-11T18:42:47Z
dc.date.available2014-04-11T18:42:47Z
dc.date.issued2013-06
dc.identifier.isbn978-0-7695-4989-7
dc.identifier.urihttp://hdl.handle.net/1721.1/86123
dc.description.abstractRecent progress has shown that learning from hierarchical feature representations leads to improvements in various computer vision tasks. Motivated by the observation that human activity data contains information at various temporal resolutions, we present a hierarchical sequence summarization approach for action recognition that learns multiple layers of discriminative feature representations at different temporal granularities. We build up a hierarchy dynamically and recursively by alternating sequence learning and sequence summarization. For sequence learning we use CRFs with latent variables to learn hidden spatio-temporal dynamics, for sequence summarization we group observations that have similar semantic meaning in the latent space. For each layer we learn an abstract feature representation through non-linear gate functions. This procedure is repeated to obtain a hierarchical sequence summary representation. We develop an efficient learning method to train our model and show that its complexity grows sub linearly with the size of the hierarchy. Experimental results show the effectiveness of our approach, achieving the best published results on the Arm Gesture and Canal9 datasets.en_US
dc.description.sponsorshipUnited States. Office of Naval Research (N000140910625)en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (IIS-1018055)en_US
dc.description.sponsorshipUnited States. Army Research, Development, and Engineering Commanden_US
dc.language.isoen_US
dc.relation.isversionofhttp://dx.doi.org/10.1109/CVPR.2013.457en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceMIT web domainen_US
dc.titleAction Recognition by Hierarchical Sequence Summarizationen_US
dc.typeArticleen_US
dc.identifier.citationSong, Yale, Louis-Philippe Morency, and Randall Davis. “Action Recognition by Hierarchical Sequence Summarization.” 2013 IEEE Conference on Computer Vision and Pattern Recognition (n.d.).en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorSong, Yaleen_US
dc.contributor.mitauthorDavis, Randallen_US
dc.relation.journalProceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognitionen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsSong, Yale; Morency, Louis-Philippe; Davis, Randallen_US
dc.identifier.orcidhttps://orcid.org/0000-0001-5232-7281
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record