Sparse regression over clusters: SparClur
Author(s)
Bertsimas, Dimitris; Dunn, Jack; Kapelevich, Lea; Zhang, Rebecca
Download11590_2021_1770_ReferencePDF.pdf (332.5Kb)
Open Access Policy
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
Abstract
Prediction tasks in personalized medicine require models that combine accuracy and interpretability. We propose an integer optimization approach for building sparse regression models with enforced coordination, using data partitioned among leaves in a prediction tree. We show that the method recovers the true underlying relationship between observations and target variables in large-scale synthetic data in seconds. We apply our method to several real-world medical prediction problems and observe that the additional structure imposed provides a substantial gain in interpretability, at a low cost to accuracy.
Date issued
2021-07-08Department
Sloan School of Management; Massachusetts Institute of Technology. Operations Research CenterPublisher
Springer Berlin Heidelberg
Citation
Bertsimas, Dimitris, Dunn, Jack, Kapelevich, Lea and Zhang, Rebecca. 2021. "Sparse regression over clusters: SparClur."
Version: Author's final manuscript