Show simple item record

dc.contributor.advisorGupta, Amar
dc.contributor.advisorSzolovits, Peter
dc.contributor.advisorHu, Antonio
dc.contributor.authorZhou, Tianqi
dc.date.accessioned2022-01-14T15:21:37Z
dc.date.available2022-01-14T15:21:37Z
dc.date.issued2021-06
dc.date.submitted2021-06-25T20:21:22.607Z
dc.identifier.urihttps://hdl.handle.net/1721.1/139587
dc.description.abstractElectronic health records (EHRs) contain a wealth of data that can be used to improve patient-centered outcomes. In particular, EHRs have been used for disease prediction, data-driven clinical decision support, patient trajectory modeling, etc. However, it is common that EHRs data contain substantial missing information that could make the clinical prediction tasks more challenging if left unaddressed. This thesis focuses on the problem of multivariate data imputation in patients’ ICU EHRs, where the source of missing information comes mostly from the time-series data. The key challenge is to design flexible models that model the time-series and non-time-series data jointly such that accurate time-series imputation can be achieved. To this purpose, a multi-modal neural network for sequential regression, to accurately estimate the missing values in the time series from EHRs is proposed. Specifically, the model is trained by using a self-supervised regression objective, namely, the masked value regression task, which mimics missing data situations at test time and optimizes a supervised regression loss in each learning episode. Additionally, an adversarial training procedure is employed to further improve the proposed system, similarly as the conditional generative adversarial networks. Empirical results on the MIMIC-III dataset show that the proposed system achieves superior performance against several strong baseline methods. The proposed imputation system can also address the deficiencies from the missing data of EHR as it enables robust clinical prediction over a variety of missing rates, on two large-scale clinical prediction tasks.
dc.publisherMassachusetts Institute of Technology
dc.rightsIn Copyright - Educational Use Permitted
dc.rightsCopyright retained by author(s)
dc.rights.urihttps://rightsstatements.org/page/InC-EDU/1.0/
dc.titleAddressing Deficiencies from Missing Data in Electronic Health Records
dc.typeThesis
dc.description.degreeS.M.
dc.description.degreeS.M.
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.contributor.departmentSystem Design and Management Program.
dc.identifier.orcid0000-0002-4836-4319
mit.thesis.degreeMaster
thesis.degree.nameMaster of Science in Engineering and Management
thesis.degree.nameMaster of Science in Electrical Engineering and Computer Science


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record