Show simple item record

dc.contributor.authorOrfanoudaki, Agni
dc.contributor.authorChesley, Emma
dc.contributor.authorCadisch, Christian
dc.contributor.authorBertsimas, Dimitris J
dc.date.accessioned2021-03-08T18:27:49Z
dc.date.available2021-03-08T18:27:49Z
dc.date.issued2020-05
dc.date.submitted2019-12
dc.identifier.issn1932-6203
dc.identifier.urihttps://hdl.handle.net/1721.1/130097
dc.description.abstractCurrent stroke risk assessment tools presume the impact of risk factors is linear and cumulative. However, both novel risk factors and their interplay influencing stroke incidence are difficult to reveal using traditional additive models. The goal of this study was to improve upon the established Revised Framingham Stroke Risk Score and design an interactive Non-Linear Stroke Risk Score. Leveraging machine learning algorithms, our work aimed at increasing the accuracy of event prediction and uncovering new relationships in an interpretable fashion. A two-phase approach was used to create our stroke risk prediction score. First, clinical examinations of the Framingham offspring cohort were utilized as the training dataset for the predictive model. Optimal Classification Trees were used to develop a tree-based model to predict 10-year risk of stroke. Unlike classical methods, this algorithm adaptively changes the splits on the independent variables, introducing non-linear interactions among them. Second, the model was validated with a multi-ethnicity cohort from the Boston Medical Center. Our stroke risk score suggests a key dichotomy between patients with history of cardiovascular disease and the rest of the population. While it agrees with known findings, it also identified 23 unique stroke risk profiles and highlighted new non-linear relationships; such as the role of T-wave abnormality on electrocardiography and hematocrit levels in a patient’s risk profile. Our results suggested that the non-linear approach significantly improves upon the baseline in the c-statistic (training 87.43% (CI 0.85–0.90) vs. 73.74% (CI 0.70–0.76); validation 75.29% (CI 0.74–0.76) vs 65.93% (CI 0.64–0.67), even in multi-ethnicity populations. The clinical implications of the new risk score include prioritization of risk factor modification and personalized care at the patient level with improved targeting of interventions for stroke prevention.en_US
dc.language.isoen
dc.publisherPublic Library of Science (PLoS)en_US
dc.relation.isversionof10.1371/JOURNAL.PONE.0232414en_US
dc.rightsCreative Commons Attribution 4.0 International licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourcePLoSen_US
dc.titleMachine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk scoreen_US
dc.typeArticleen_US
dc.identifier.citationOrfanoudaki, Agni et al. “Machine learning provides evidence that stroke risk is not linear: The non-linear Framingham stroke risk score.” PLoS ONE, 15, 5 (May 2020): e0232414 © 2020 The Author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Operations Research Centeren_US
dc.contributor.departmentSloan School of Managementen_US
dc.relation.journalPLoS ONEen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2021-02-05T16:53:42Z
dspace.orderedauthorsOrfanoudaki, A; Chesley, E; Cadisch, C; Stein, B; Nouh, A; Alberts, MJ; Bertsimas, Den_US
dspace.date.submission2021-02-05T16:53:46Z
mit.journal.volume15en_US
mit.journal.issue5en_US
mit.licensePUBLISHER_CC
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record