Show simple item record

dc.contributor.authorBertsimas, Dimitris
dc.contributor.authorDigalakis, Vassilis
dc.date.accessioned2022-06-01T19:06:41Z
dc.date.available2022-06-01T19:06:41Z
dc.date.issued2022-01-22
dc.identifier.urihttps://hdl.handle.net/1721.1/142857
dc.description.abstractAbstract We present the backbone method, a general framework that enables sparse and interpretable supervised machine learning methods to scale to ultra-high dimensional problems. We solve sparse regression problems with $$10^7$$ 10 7 features in minutes and $$10^8$$ 10 8 features in hours, as well as decision tree problems with $$10^5$$ 10 5 features in minutes. The proposed method operates in two phases: we first determine the backbone set, consisting of potentially relevant features, by solving a number of tractable subproblems; then, we solve a reduced problem, considering only the backbone features. For the sparse regression problem, our theoretical analysis shows that, under certain assumptions and with high probability, the backbone set consists of the truly relevant features. Numerical experiments on both synthetic and real-world datasets demonstrate that our method outperforms or competes with state-of-the-art methods in ultra-high dimensional problems, and competes with optimal solutions in problems where exact methods scale, both in terms of recovering the truly relevant features and in its out-of-sample predictive performance.en_US
dc.publisherSpringer USen_US
dc.relation.isversionofhttps://doi.org/10.1007/s10994-021-06123-2en_US
dc.rightsCreative Commons Attribution-NonCommercial-ShareAlike 4.0 Internationalen_US
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourceSpringer USen_US
dc.titleThe backbone method for ultra-high dimensional sparse machine learningen_US
dc.typeArticleen_US
dc.identifier.citationBertsimas, Dimitris and Digalakis, Vassilis. 2022. "The backbone method for ultra-high dimensional sparse machine learning."
dc.contributor.departmentMassachusetts Institute of Technology. Operations Research Center
dc.contributor.departmentSloan School of Management
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2022-06-01T04:08:02Z
dc.language.rfc3066en
dc.rights.holderThe Author(s), under exclusive licence to Springer Science+Business Media LLC, part of Springer Nature
dspace.embargo.termsY
dspace.date.submission2022-06-01T04:08:01Z
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record