Show simple item record

dc.contributor.authorFridgeirsson, Egill A.
dc.contributor.authorSontag, David
dc.contributor.authorRijnbeek, Peter
dc.date.accessioned2023-12-14T20:31:12Z
dc.date.available2023-12-14T20:31:12Z
dc.date.issued2023-12-07
dc.identifier.urihttps://hdl.handle.net/1721.1/153169
dc.description.abstractBackground Deep learning models have had a lot of success in various fields. However, on structured data they have struggled. Here we apply four state-of-the-art supervised deep learning models using the attention mechanism and compare against logistic regression and XGBoost using discrimination, calibration and clinical utility. Methods We develop the models using a general practitioners database. We implement a recurrent neural network, a transformer with and without reverse distillation and a graph neural network. We measure discrimination using the area under the receiver operating characteristic curve (AUC) and the area under the precision recall curve (AUPRC). We assess smooth calibration using restricted cubic splines and clinical utility with decision curve analysis. Results Our results show that deep learning approaches can improve discrimination up to 2.5% points AUC and 7.4% points AUPRC. However, on average the baselines are competitive. Most models are similarly calibrated as the baselines except for the graph neural network. The transformer using reverse distillation shows the best performance in clinical utility on two out of three prediction problems over most of the prediction thresholds. Conclusion In this study, we evaluated various approaches in supervised learning using neural networks and attention. Here we do a rigorous comparison, not only looking at discrimination but also calibration and clinical utility. There is value in using deep learning models on electronic health record data since it can improve discrimination and clinical utility while providing good calibration. However, good baseline methods are still competitive.en_US
dc.publisherBioMed Centralen_US
dc.relation.isversionofhttps://doi.org/10.1186/s12874-023-02112-2en_US
dc.rightsCreative Commons Attributionen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceBioMed Centralen_US
dc.titleAttention-based neural networks for clinical prediction modelling on electronic health recordsen_US
dc.typeArticleen_US
dc.identifier.citationBMC Medical Research Methodology. 2023 Dec 07;23(1):285en_US
dc.contributor.departmentMassachusetts Institute of Technology. Institute for Medical Engineering & Science
dc.identifier.mitlicensePUBLISHER_CC
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2023-12-10T04:07:47Z
dc.language.rfc3066en
dc.rights.holderThe Author(s)
dspace.date.submission2023-12-10T04:07:47Z
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record