Integration of survival data from multiple studies
Author(s)
Ventz, Steffen; Mazumder, Rahul; Trippa, Lorenzo
DownloadSubmitted version (511.0Kb)
Open Access Policy
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
We introduce a statistical procedure that integrates datasets from multiple biomedical studies to predict patients' survival, based on individual clinical and genomic profiles. The proposed procedure accounts for potential differences in the relation between predictors and outcomes across studies, due to distinct patient populations, treatments and technologies to measure outcomes and biomarkers. These differences are modeled explicitly with study-specific parameters. We use hierarchical regularization to shrink the study-specific parameters towards each other and to borrow information across studies. The estimation of the study-specific parameters utilizes a similarity matrix, which summarizes differences and similarities of the relations between covariates and outcomes across studies. We illustrate the method in a simulation study and using a collection of gene expression datasets in ovarian cancer. We show that the proposed model increases the accuracy of survival predictions compared to alternative meta-analytic methods.
Date issued
2021Department
Sloan School of Management; Massachusetts Institute of Technology. Operations Research Center; Statistics and Data Science Center (Massachusetts Institute of Technology)Journal
Biometrics
Publisher
Wiley
Citation
Ventz, Steffen, Mazumder, Rahul and Trippa, Lorenzo. 2021. "Integration of survival data from multiple studies." Biometrics.
Version: Original manuscript