Attention-based neural networks for clinical prediction modelling on electronic health records

Fridgeirsson, Egill A.; Sontag, David; Rijnbeek, Peter

dc.contributor.author	Fridgeirsson, Egill A.
dc.contributor.author	Sontag, David
dc.contributor.author	Rijnbeek, Peter
dc.date.accessioned	2023-12-14T20:31:12Z
dc.date.available	2023-12-14T20:31:12Z
dc.date.issued	2023-12-07
dc.identifier.uri	https://hdl.handle.net/1721.1/153169
dc.description.abstract	Background Deep learning models have had a lot of success in various fields. However, on structured data they have struggled. Here we apply four state-of-the-art supervised deep learning models using the attention mechanism and compare against logistic regression and XGBoost using discrimination, calibration and clinical utility. Methods We develop the models using a general practitioners database. We implement a recurrent neural network, a transformer with and without reverse distillation and a graph neural network. We measure discrimination using the area under the receiver operating characteristic curve (AUC) and the area under the precision recall curve (AUPRC). We assess smooth calibration using restricted cubic splines and clinical utility with decision curve analysis. Results Our results show that deep learning approaches can improve discrimination up to 2.5% points AUC and 7.4% points AUPRC. However, on average the baselines are competitive. Most models are similarly calibrated as the baselines except for the graph neural network. The transformer using reverse distillation shows the best performance in clinical utility on two out of three prediction problems over most of the prediction thresholds. Conclusion In this study, we evaluated various approaches in supervised learning using neural networks and attention. Here we do a rigorous comparison, not only looking at discrimination but also calibration and clinical utility. There is value in using deep learning models on electronic health record data since it can improve discrimination and clinical utility while providing good calibration. However, good baseline methods are still competitive.	en_US
dc.publisher	BioMed Central	en_US
dc.relation.isversionof	https://doi.org/10.1186/s12874-023-02112-2	en_US
dc.rights	Creative Commons Attribution	en_US
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	en_US
dc.source	BioMed Central	en_US
dc.title	Attention-based neural networks for clinical prediction modelling on electronic health records	en_US
dc.type	Article	en_US
dc.identifier.citation	BMC Medical Research Methodology. 2023 Dec 07;23(1):285	en_US
dc.contributor.department	Massachusetts Institute of Technology. Institute for Medical Engineering & Science
dc.identifier.mitlicense	PUBLISHER_CC
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dc.date.updated	2023-12-10T04:07:47Z
dc.language.rfc3066	en
dc.rights.holder	The Author(s)
dspace.date.submission	2023-12-10T04:07:47Z
mit.license	PUBLISHER_CC
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 12874_2023_Article_2112.pdf
Size:: 1.243Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record