
dc.contributor.author: Yan, Yizhou
dc.contributor.author: Cao, Lei
dc.contributor.author: Madden, Samuel
dc.contributor.author: Rundensteiner, Elke A.
dc.date.accessioned: 2021-11-08T19:52:39Z
dc.date.available: 2021-11-08T19:52:39Z
dc.date.issued: 2018-11
dc.identifier.issn: 2150-8097
dc.identifier.uri: https://hdl.handle.net/1721.1/137804
dc.description.abstract: © 2018 VLDB Endowment 21508097/18/07. Event streams generated by smart devices common in modern Internet of Things applications must be continuously mined to monitor the behavior of the underlying system. In this work, we propose a stream pattern mining system for supporting online IoT applications. First, to solve the pattern explosion problem of existing stream pattern mining strategies, we design pattern semantics that continuously produce a compact set of patterns that maximally compresses the dynamic data streams, called MDL-based Representative Patterns (MRP). We then design a one-pass approach, SWIFT, that continuously mines the up-to-date MRP pattern set for each stream window upon the arrival or expiration of individual events. We show that SWIFT is guaranteed to select, for each individual incoming event, the update operation that leads to the most compact encoding of the sequence in the current window. We further enhance SWIFT to support batch updates, called B-SWIFT. B-SWIFT adopts a lazy update strategy that guarantees that only the minimal number of operations is conducted to process an incoming event batch for MRP pattern mining. Evaluation by our industry lighting lab collaborator demonstrates that SWIFT successfully solves their use cases and finds more representative patterns than alternative approaches that adapt state-of-the-art static representative pattern mining methods. Our experimental study confirms that SWIFT outperforms the best existing method by up to 50% in the compactness of the produced pattern encodings, while providing a speedup of four orders of magnitude. [en_US]
dc.language.iso: en
dc.publisher: VLDB Endowment [en_US]
dc.relation.isversionof: 10.14778/3291264.3291271 [en_US]
dc.rights: Creative Commons Attribution-NonCommercial-NoDerivs License [en_US]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/4.0/ [en_US]
dc.source: The Proceedings of the VLDB Endowment [en_US]
dc.title: SWIFT: Mining Representative Patterns from Large Event Streams [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Yan, Yizhou, Cao, Lei, Madden, Samuel and Rundensteiner, Elke A. 2018. "SWIFT: Mining Representative Patterns from Large Event Streams." Proceedings of the VLDB Endowment, 12 (3).
dc.contributor.department: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.relation.journal: Proceedings of the VLDB Endowment [en_US]
dc.eprint.version: Final published version [en_US]
dc.type.uri: http://purl.org/eprint/type/JournalArticle [en_US]
eprint.status: http://purl.org/eprint/status/PeerReviewed [en_US]
dc.date.updated: 2019-06-18T17:19:41Z
dspace.date.submission: 2019-06-18T17:19:42Z
mit.journal.volume: 12 [en_US]
mit.journal.issue: 3 [en_US]
mit.metadata.status: Authority Work and Publication Information Needed [en_US]

