Machine learning used to study risk factors for chronic diseases: A scoping review
Author(s)
Shergill, Mahek; Durant, Steve; Birdi, Sharon; Rabet, Roxana; Ziegler, Carolyn; Ali, Shehzad; Buckeridge, David; Ghassemi, Marzyeh; Gibson, Jennifer; John-Baptiste, Ava; Macklin, Jillian; McCradden, Melissa; McKenzie, Kwame; Naraei, Parisa; ... Show more Show less
Download41997_2025_Article_1059.pdf (1.047Mb)
Publisher with Creative Commons License
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
Objectives Machine learning (ML) has received significant attention for its potential to process and learn from vast amounts of data. Our aim was to perform a scoping review to identify studies that used ML to study risk factors for chronic diseases at a population level, notably those that incorporated methods to mitigate algorithmic bias. We focused on ML applications for the most common risk factors for chronic disease: tobacco use, alcohol use, unhealthy eating, physical activity, and psychological stress. Methods We searched the peer-reviewed, indexed literature using Medline (Ovid), Embase (Ovid), Cochrane Central Register of Controlled Trials and Cochrane Database of Systematic Reviews (Ovid), Scopus, ACM Digital Library, INSPEC, and Web of Science’s Science Citation Index, Social Sciences Citation Index, and Emerging Sources Citation Index. Among the included studies, we examined whether bias was considered and identified strategies employed to mitigate bias. Synthesis The search identified 10,329 studies, and 20 met our inclusion criteria. The studies we identified used ML for a wide range of goals, from prediction of chronic disease development to automating the classification of data to identifying new associations between risk factors and disease. Nine studies (45%) included some discussion of algorithmic bias. Studies that incorporated a broad array of sociodemographic variables did so primarily to improve the performance of a ML model rather than to mitigate potential harms to populations made vulnerable by social and economic policies. Conclusion This work contributes to our understanding of how ML can be used to advance population and public health.
Date issued
2025-06-11Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science; Institute for Medical Engineering and ScienceJournal
Canadian Journal of Public Health
Publisher
Springer International Publishing
Citation
Shergill, M., Durant, S., Birdi, S. et al. Machine learning used to study risk factors for chronic diseases: A scoping review. Can J Public Health (2025).
Version: Final published version