Login

Automated Audio-visual Activity Analysis

Show simple item record

dc.contributor.author Stauffer, Chris
dc.date.accessioned 2005-12-22T02:36:45Z
dc.date.available 2005-12-22T02:36:45Z
dc.date.issued 2005-09-20
dc.identifier.other MIT-CSAIL-TR-2005-057
dc.identifier.other AIM-2005-026
dc.identifier.uri http://hdl.handle.net/1721.1/30568
dc.description.abstract Current computer vision techniques can effectively monitor gross activities in sparse environments. Unfortunately, visual stimulus is often not sufficient for reliably discriminating between many types of activity. In many cases where the visual information required for a particular task is extremely subtle or non-existent, there is often audio stimulus that is extremely salient for a particular classification or anomaly detection task. Unfortunately unlike visual events, independent sounds are often very ambiguous and not sufficient to define useful events themselves. Without an effective method of learning causally-linked temporal sequences of sound events that are coupled to the visual events, these sound events are generally only useful for independent anomalous sounds detection, e.g., detecting a gunshot or breaking glass. This paper outlines a method for automatically detecting a set of audio events and visual events in a particular environment, for determining statistical anomalies, for automatically clustering these detected events into meaningful clusters, and for learning salient temporal relationships between the audio and visual events. This results in a compact description of the different types of compound audio-visual events in an environment.
dc.description.provenance Made available in DSpace on 2005-12-22T02:36:45Z (GMT). No. of bitstreams: 2 MIT-CSAIL-TR-2005-057.ps: 32903979 bytes, checksum: 2e5e824abfd5be899471d6de3086a80b (MD5) MIT-CSAIL-TR-2005-057.pdf: 1153580 bytes, checksum: 7792d4caf2124209c3a4d8ee1d1fc46e (MD5) en
dc.format.extent 9 p.
dc.format.extent 32903979 bytes
dc.format.extent 1153580 bytes
dc.format.mimetype application/postscript
dc.format.mimetype application/pdf
dc.language.iso en_US
dc.relation.ispartofseries Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory
dc.subject AI
dc.subject Unsupervised
dc.subject activity analysis
dc.subject scene modeling
dc.subject tracking
dc.subject event detection
dc.title Automated Audio-visual Activity Analysis

Files in this item

Files Size Format
MIT-CSAIL-TR-2005-057.ps 32.90Mb application/postscript

Files in this item

Files Size Format
MIT-CSAIL-TR-2005-057.pdf 1.153Mb application/pdf

This item appears in the following Collection(s)

Show simple item record

Search DSpace@MIT


Advanced Search

Browse

My Account

Links