Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models

Xiaogang Wang; Xiaoxu Ma; Grimson, W.E.L.

dc.contributor.author	Wang, Xiaogang
dc.contributor.author	Ma, Xiaoxu
dc.contributor.author	Grimson, Eric
dc.date.accessioned	2012-07-11T18:02:56Z
dc.date.available	2012-07-11T18:02:56Z
dc.date.issued	2008-04
dc.date.submitted	2008-01
dc.identifier.issn	0162-8828
dc.identifier.issn	2160-9292
dc.identifier.uri	http://hdl.handle.net/1721.1/71587
dc.description.abstract	We propose a novel unsupervised learning framework to model activities and interactions in crowded and complicated scenes. Hierarchical Bayesian models are used to connect three elements in visual surveillance: low-level visual features, simple "atomic" activities, and interactions. Atomic activities are modeled as distributions over low-level visual features, and multi-agent interactions are modeled as distributions over atomic activities. These models are learnt in an unsupervised way. Given a long video sequence, moving pixels are clustered into different atomic activities and short video clips are clustered into different interactions. In this paper, we propose three hierarchical Bayesian models, Latent Dirichlet Allocation (LDA) mixture model, Hierarchical Dirichlet Process (HDP) mixture model, and Dual Hierarchical Dirichlet Processes (Dual-HDP) model. They advance existing language models, such as LDA [1] and HDP [2]. Our data sets are challenging video sequences from crowded traffic scenes and train station scenes with many kinds of activities co-occurring. Without tracking and human labeling effort, our framework completes many challenging visual surveillance tasks of board interest such as: (1) discovering typical atomic activities and interactions; (2) segmenting long video sequences into different interactions; (3) segmenting motions into different activities; (4) detecting abnormality; and (5) supporting high-level queries on activities and interactions.	en_US
dc.description.sponsorship	United States. Defense Advanced Research Projects Agency	en_US
dc.description.sponsorship	Singapore. DSO National Laboratories	en_US
dc.language.iso	en_US
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)	en_US
dc.relation.isversionof	http://dx.doi.org/10.1109/tpami.2008.87	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	IEEE	en_US
dc.title	Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models	en_US
dc.type	Article	en_US
dc.identifier.citation	Xiaogang Wang, Xiaoxu Ma, and W.E.L. Grimson. “Unsupervised Activity Perception in Crowded and Complicated Scenes Using Hierarchical Bayesian Models.” IEEE Transactions on Pattern Analysis and Machine Intelligence 31.3 (2009): 539–555. © Copyright 2009 IEEE	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.approver	Grimson, Eric
dc.contributor.mitauthor	Wang, Xiaogang
dc.contributor.mitauthor	Ma, Xiaoxu
dc.contributor.mitauthor	Grimson, Eric
dc.relation.journal	IEEE Transactions on Pattern Analysis and Machine Intelligence	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dspace.orderedauthors	Xiaogang Wang; Xiaoxu Ma; Grimson, W.E.L.	en
dc.identifier.orcid	https://orcid.org/0000-0002-6192-2207
dspace.mitauthor.error	true
mit.license	PUBLISHER_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Wang-2009-Unsupervised Activity ...
Size:: 4.602Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record