Show simple item record

dc.contributor.authorCastro, V M
dc.contributor.authorMcCoy, T H
dc.contributor.authorPerlis, R H
dc.contributor.authorNaumann, Tristan
dc.contributor.authorSzolovits, Peter
dc.contributor.authorRumshisky, Anna A.
dc.contributor.authorGhassemi, Marzyeh
dc.date.accessioned2017-04-18T19:08:43Z
dc.date.available2017-04-18T19:08:43Z
dc.date.issued2016-10
dc.date.submitted2015-08
dc.identifier.issn2158-3188
dc.identifier.urihttp://hdl.handle.net/1721.1/108225
dc.description.abstractThe ability to predict psychiatric readmission would facilitate the development of interventions to reduce this risk, a major driver of psychiatric health-care costs. The symptoms or characteristics of illness course necessary to develop reliable predictors are not available in coded billing data, but may be present in narrative electronic health record (EHR) discharge summaries. We identified a cohort of individuals admitted to a psychiatric inpatient unit between 1994 and 2012 with a principal diagnosis of major depressive disorder, and extracted inpatient psychiatric discharge narrative notes. Using these data, we trained a 75-topic Latent Dirichlet Allocation (LDA) model, a form of natural language processing, which identifies groups of words associated with topics discussed in a document collection. The cohort was randomly split to derive a training (70%) and testing (30%) data set, and we trained separate support vector machine models for baseline clinical features alone, baseline features plus common individual words and the above plus topics identified from the 75-topic LDA model. Of 4687 patients with inpatient discharge summaries, 470 were readmitted within 30 days. The 75-topic LDA model included topics linked to psychiatric symptoms (suicide, severe depression, anxiety, trauma, eating/weight and panic) and major depressive disorder comorbidities (infection, postpartum, brain tumor, diarrhea and pulmonary disease). By including LDA topics, prediction of readmission, as measured by area under receiver-operating characteristic curves in the testing data set, was improved from baseline (area under the curve 0.618) to baseline+1000 words (0.682) to baseline+75 topics (0.784). Inclusion of topics derived from narrative notes allows more accurate discrimination of individuals at high risk for psychiatric readmission in this cohort. Topic modeling and related approaches offer the potential to improve prediction using EHRs, if generalizability can be established in other clinical cohorts.en_US
dc.language.isoen_US
dc.publisherNature Publishing Groupen_US
dc.relation.isversionofhttp://dx.doi.org/10.1038/tp.2015.182en_US
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivs Licenseen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/en_US
dc.sourceNatureen_US
dc.titlePredicting early psychiatric readmission with natural language processing of narrative discharge summariesen_US
dc.typeArticleen_US
dc.identifier.citationRumshisky, A et al. “Predicting Early Psychiatric Readmission with Natural Language Processing of Narrative Discharge Summaries.” Translational Psychiatry 6.10 (2016): e921.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.mitauthorNaumann, Tristan
dc.contributor.mitauthorSzolovits, Peter
dc.contributor.mitauthorRumshisky, Anna A.
dc.contributor.mitauthorGhassemi, Marzyeh
dc.relation.journalTranslational Psychiatryen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsRumshisky, A; Ghassemi, M; Naumann, T; Szolovits, P; Castro, V M; McCoy, T H; Perlis, R Hen_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0003-2150-1747
dc.identifier.orcidhttps://orcid.org/0000-0001-8411-6403
dc.identifier.orcidhttps://orcid.org/0000-0002-8029-0823
dc.identifier.orcidhttps://orcid.org/0000-0001-6349-7251
mit.licensePUBLISHER_CCen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record