Significant Sparse Polygenic Risk Scores across 813 traits in UK Biobank
Author(s)
Tanigawa, Yosuke; Qian, Junyang; Venkataraman, Guhan; Justesen, Johanne Marie; Li, Ruilin; Tibshirani, Robert; Hastie, Trevor; Rivas, Manuel A.; ... Show more Show less
Downloadjournal.pgen.1010105.pdf (3.257Mb)
Publisher with Creative Commons License
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
We present a systematic assessment of polygenic risk score (PRS) prediction across more than 1,500 traits using genetic and phenotype data in the UK Biobank. We report 813 sparse PRS models with significant (p < 2.5 x 10−5) incremental predictive performance when compared against the covariate-only model that considers age, sex, types of genotyping arrays, and the principal component loadings of genotypes. We report a significant correlation between the number of genetic variants selected in the sparse PRS model and the incremental predictive performance (Spearman’s ⍴ = 0.61, p = 2.2 x 10−59 for quantitative traits, ⍴ = 0.21, p = 9.6 x 10−4 for binary traits). The sparse PRS model trained on European individuals showed limited transferability when evaluated on non-European individuals in the UK Biobank. We provide the PRS model weights on the Global Biobank Engine (https://biobankengine.stanford.edu/prs).
Date issued
2021-09-06Department
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence LaboratoryPublisher
Cold Spring Harbor Laboratory
Citation
Tanigawa, Yosuke, Qian, Junyang, Venkataraman, Guhan, Justesen, Johanne Marie, Li, Ruilin et al. 2021. "Significant Sparse Polygenic Risk Scores across 813 traits in UK Biobank."
Version: Final published version