Show simple item record

dc.contributor.authorSyed, Zeeshan
dc.contributor.authorStultz, Collin M.
dc.contributor.authorKellis, Manolis
dc.contributor.authorIndyk, Piotr
dc.contributor.authorGuttag, John V.
dc.date.accessioned2012-09-18T14:18:27Z
dc.date.available2012-09-18T14:18:27Z
dc.date.issued2010-01
dc.date.submitted2008-09
dc.identifier.issn1556-4681
dc.identifier.issn1556-472X
dc.identifier.urihttp://hdl.handle.net/1721.1/73032
dc.description.abstractIn this article, we propose a methodology for identifying predictive physiological patterns in the absence of prior knowledge. We use the principle of conservation to identify activity that consistently precedes an outcome in patients, and describe a two-stage process that allows us to efficiently search for such patterns in large datasets. This involves first transforming continuous physiological signals from patients into symbolic sequences, and then searching for patterns in these reduced representations that are strongly associated with an outcome. Our strategy of identifying conserved activity that is unlikely to have occurred purely by chance in symbolic data is analogous to the discovery of regulatory motifs in genomic datasets. We build upon existing work in this area, generalizing the notion of a regulatory motif and enhancing current techniques to operate robustly on non-genomic data. We also address two significant considerations associated with motif discovery in general: computational efficiency and robustness in the presence of degeneracy and noise. To deal with these issues, we introduce the concept of active regions and new subset-based techniques such as a two-layer Gibbs sampling algorithm. These extensions allow for a framework for information inference, where precursors are identified as approximately conserved activity of arbitrary complexity preceding multiple occurrences of an event. We evaluated our solution on a population of patients who experienced sudden cardiac death and attempted to discover electrocardiographic activity that may be associated with the endpoint of death. To assess the predictive patterns discovered, we compared likelihood scores for motifs in the sudden death population against control populations of normal individuals and those with non-fatal supraventricular arrhythmias. Our results suggest that predictive motif discovery may be able to identify clinically relevant information even in the absence of significant prior knowledge.en_US
dc.description.sponsorshipCIMIT: Center for Integration of Medicine and Innovative Technologyen_US
dc.description.sponsorshipHarvard University--MIT Division of Health Sciences and Technologyen_US
dc.language.isoen_US
dc.publisherAssociation for Computing Machinery (ACM)en_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/1644873.1644875en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alike 3.0en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/en_US
dc.sourcePubMed Centralen_US
dc.titleMotif Discovery in Physiological Datasets: A Methodology for Inferring Predictive Elementsen_US
dc.typeArticleen_US
dc.identifier.citationZeeshan Syed, Collin Stultz, Manolis Kellis, Piotr Indyk, and John Guttag. 2010. Motif discovery in physiological datasets: A methodology for inferring predictive elements. ACM Trans. Knowl. Discov. Data 4, 1, Article 2 (January 2010), 23 pages.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.approverGuttag, John V.
dc.contributor.mitauthorStultz, Collin M.
dc.contributor.mitauthorKellis, Manolis
dc.contributor.mitauthorIndyk, Piotr
dc.contributor.mitauthorGuttag, John V.
dc.relation.journalACM Transactions on Knowledge Discovery from Data (TKDD)en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsSyed, Zeeshan; Stultz, Collin; Kellis, Manolis; Indyk, Piotr; Guttag, Johnen
dc.identifier.orcidhttps://orcid.org/0000-0002-3415-242X
dc.identifier.orcidhttps://orcid.org/0000-0003-0992-0906
dc.identifier.orcidhttps://orcid.org/0000-0002-7983-9524
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record