A Multitask Deep Learning Framework for Clinical Decision-Making in Assisted Reproductive Technology

Zheng, Ruiying

Author(s)

Zheng, Ruiying

DownloadThesis PDF (6.712Mb)

Advisor

Barnett, Arnold I.

Terms of use

In Copyright - Educational Use Permitted Copyright retained by author(s) https://rightsstatements.org/page/InC-EDU/1.0/

Metadata

Show full item record

Abstract

Fertility treatment requires navigating a series of complex and connected decisions, each with tradeoffs that impact both patient outcomes and clinical workload. In in vitro fertilization (IVF), choices about stimulation protocols, embryo transfer count, and treatment timing are interdependent, and each cycle carries uncertainty. Despite increasing data availability, most clinical tools still address these decisions in isolation. As a result, care remains variable across providers, and patients often face unclear guidance at critical moments in the process. To address this gap, we present an integrative deep learning framework that simultaneously optimizes three critical IVF decision points using data from 33,000 treatment cycles (2021- 2023): (1) embryo transfer count recommendation (MAE = 0.716), (2) ovarian stimulation protocol selection (AUC = 0.85), and (3) biochemical pregnancy prediction (AUC = 0.75). Our comprehensive benchmarking pipeline evaluates classical statistical models, ensemble methods (XGBoost), and novel architectures, including TabPFN, an attention-based probabilistic model that achieved comparable performance to top-performing baselines. To enhance clinical trust, we apply SHapley Additive exPlanations (SHAP) and Local Interpretable Model-Agnostic Explanations (LIME), which consistently uncover biologically meaningful variables, including biomarkers such as Anti-Müllerian Hormone (AMH), patient age, and the day-3 high-quality embryo rate. The final architecture combines: (i) a shared representation learning backbone, (ii) task-specific prediction heads, and (iii) homoscedastic uncertainty weighting for dynamic task balancing. This multitask approach achieves near-state-of-the-art performance (protocol AUC = 0.79, pregnancy AUC = 0.74) while offering practical deployment advantages such as faster inference and fewer parameters than equivalent single-task ensembles. This system’s efficiency, interpretability, and scalability will reshape current IVF practice if adopted in clinical settings, offering a more consistent and data-driven foundation for treatment decisions.

Date issued

2025-05

URI

https://hdl.handle.net/1721.1/162969

Department

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science

Publisher

Massachusetts Institute of Technology

Collections

Graduate Theses