Show simple item record

dc.contributor.authorGhassemi, Marzyeh
dc.contributor.authorDoshi-Velez, Finale
dc.contributor.authorJoshi, Rohit
dc.contributor.authorRumshisky, Anna
dc.contributor.authorSzolovits, Peter
dc.contributor.authorNaumann, Tristan
dc.contributor.authorBrimmer, Nicole J.
dc.date.accessioned2016-02-02T01:25:38Z
dc.date.available2016-02-02T01:25:38Z
dc.date.issued2014-08
dc.identifier.isbn9781450329569
dc.identifier.urihttp://hdl.handle.net/1721.1/101049
dc.description.abstractAccurate knowledge of a patient's disease state and trajectory is critical in a clinical setting. Modern electronic healthcare records contain an increasingly large amount of data, and the ability to automatically identify the factors that influence patient outcomes stand to greatly improve the efficiency and quality of care. We examined the use of latent variable models (viz. Latent Dirichlet Allocation) to decompose free-text hospital notes into meaningful features, and the predictive power of these features for patient mortality. We considered three prediction regimes: (1) baseline prediction, (2) dynamic (time-varying) outcome prediction, and (3) retrospective outcome prediction. In each, our prediction task differs from the familiar time-varying situation whereby data accumulates; since fewer patients have long ICU stays, as we move forward in time fewer patients are available and the prediction task becomes increasingly difficult. We found that latent topic-derived features were effective in determining patient mortality under three timelines: in-hospital, 30 day post-discharge, and 1 year post-discharge mortality. Our results demonstrated that the latent topic features important in predicting hospital mortality are very different from those that are important in post-discharge mortality. In general, latent topic features were more predictive than structured features, and a combination of the two performed best. The time-varying models that combined latent topic features and baseline features had AUCs that reached 0.85, 0.80, and 0.77 for in-hospital, 30 day post-discharge and 1 year post-discharge mortality respectively. Our results agreed with other work suggesting that the first 24 hours of patient information are often the most predictive of hospital mortality. Retrospective models that used a combination of latent topic features and structured features achieved AUCs of 0.96, 0.82, and 0.81 for in-hospital, 30 day, and 1-year mortality prediction. Our work focuses on the dynamic (time-varying) setting because models from this regime could facilitate an on-going severity stratification system that helps direct care-staff resources and inform treatment strategies.en_US
dc.description.sponsorshipIntel Corporation. Science and Technology Center for Big Dataen_US
dc.description.sponsorshipNational Library of Medicine (U.S.) (Biomedical Informatics Research Training Grant NIH/NLM 2T15 LM007092-22)en_US
dc.language.isoen_US
dc.publisherAssociation for Computing Machinery (ACM)en_US
dc.relation.isversionofhttp://dx.doi.org/10.1145/2623330.2623742en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcePMCen_US
dc.titleUnfolding physiological state: mortality modelling in intensive care unitsen_US
dc.typeArticleen_US
dc.identifier.citationMarzyeh Ghassemi, Tristan Naumann, Finale Doshi-Velez, Nicole Brimmer, Rohit Joshi, Anna Rumshisky, and Peter Szolovits. 2014. Unfolding physiological state: mortality modelling in intensive care units. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '14). ACM, New York, NY, USA, 75-84.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorGhassemi, Marzyehen_US
dc.contributor.mitauthorNaumann, Tristanen_US
dc.contributor.mitauthorBrimmer, Nicole J.en_US
dc.contributor.mitauthorJoshi, Rohiten_US
dc.contributor.mitauthorSzolovits, Peteren_US
dc.relation.journalProceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '14)en_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsGhassemi, Marzyeh; Naumann, Tristan; Doshi-Velez, Finale; Brimmer, Nicole; Joshi, Rohit; Rumshisky, Anna; Szolovits, Peteren_US
dc.identifier.orcidhttps://orcid.org/0000-0001-6349-7251
dc.identifier.orcidhttps://orcid.org/0000-0003-2150-1747
dc.identifier.orcidhttps://orcid.org/0000-0001-8411-6403
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record