Show simple item record

dc.contributor.authorWiens, Jenna
dc.contributor.authorCampbell, Wayne N.
dc.contributor.authorFranklin, Ella S.
dc.contributor.authorGuttag, John V.
dc.contributor.authorHorvitz, Eric
dc.date.accessioned2016-01-05T18:54:24Z
dc.date.available2016-01-05T18:54:24Z
dc.date.issued2014-06
dc.date.submitted2014-04
dc.identifier.issn2328-8957
dc.identifier.urihttp://hdl.handle.net/1721.1/100700
dc.description.abstractBackground. Although many risk factors are well known, Clostridium difficile infection (CDI) continues to be a significant problem throughout the world. The purpose of this study was to develop and validate a data-driven, hospital-specific risk stratification procedure for estimating the probability that an inpatient will test positive for C difficile. Methods. We consider electronic medical record (EMR) data from patients admitted for ≥24 hours to a large urban hospital in the U.S. between April 2011 and April 2013. Predictive models were constructed using L2-regularized logistic regression and data from the first year. The number of observational variables considered varied from a small set of well known risk factors readily available to a physician to over 10 000 variables automatically extracted from the EMR. Each model was evaluated on holdout admission data from the following year. A total of 34 846 admissions with 372 cases of CDI was used to train the model. Results. Applied to the separate validation set of 34 722 admissions with 355 cases of CDI, the model that made use of the additional EMR data yielded an area under the receiver operating characteristic curve (AUROC) of 0.81 (95% confidence interval [CI], .79–.83), and it significantly outperformed the model that considered only the small set of known clinical risk factors, AUROC of 0.71 (95% CI, .69–.75). Conclusions. Automated risk stratification of patients based on the contents of their EMRs can be used to accurately identify a high-risk population of patients. The proposed method holds promise for enabling the selective allocation of interventions aimed at reducing the rate of CDI.en_US
dc.description.sponsorshipNational Science Foundation (U.S.)en_US
dc.description.sponsorshipQuanta Computer (Firm)en_US
dc.description.sponsorshipNatural Sciences and Engineering Research Council of Canadaen_US
dc.language.isoen_US
dc.publisherOxford University Pressen_US
dc.relation.isversionofhttp://dx.doi.org/10.1093/ofid/ofu045en_US
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivs Licenseen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/en_US
dc.sourceOxford University Pressen_US
dc.titleLearning Data-Driven Patient Risk Stratification Models for Clostridium difficileen_US
dc.typeArticleen_US
dc.identifier.citationWiens, J., W. N. Campbell, E. S. Franklin, J. V. Guttag, and E. Horvitz. “Learning Data-Driven Patient Risk Stratification Models for Clostridium Difficile.” Open Forum Infectious Diseases 1, no. 2 (June 18, 2014): ofu045–ofu045.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorWiens, Jennaen_US
dc.contributor.mitauthorGuttag, John V.en_US
dc.relation.journalOpen Forum Infectious Diseasesen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsWiens, J.; Campbell, W. N.; Franklin, E. S.; Guttag, J. V.; Horvitz, E.en_US
dc.identifier.orcidhttps://orcid.org/0000-0003-0992-0906
mit.licensePUBLISHER_CCen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record