The backbone method for ultra-high dimensional sparse machine learning

Bertsimas, Dimitris; Digalakis, Vassilis

dc.contributor.author	Bertsimas, Dimitris
dc.contributor.author	Digalakis, Vassilis
dc.date.accessioned	2022-06-01T19:06:41Z
dc.date.available	2022-06-01T19:06:41Z
dc.date.issued	2022-01-22
dc.identifier.uri	https://hdl.handle.net/1721.1/142857
dc.description.abstract	Abstract We present the backbone method, a general framework that enables sparse and interpretable supervised machine learning methods to scale to ultra-high dimensional problems. We solve sparse regression problems with $$10^7$$ 10 7 features in minutes and $$10^8$$ 10 8 features in hours, as well as decision tree problems with $$10^5$$ 10 5 features in minutes. The proposed method operates in two phases: we first determine the backbone set, consisting of potentially relevant features, by solving a number of tractable subproblems; then, we solve a reduced problem, considering only the backbone features. For the sparse regression problem, our theoretical analysis shows that, under certain assumptions and with high probability, the backbone set consists of the truly relevant features. Numerical experiments on both synthetic and real-world datasets demonstrate that our method outperforms or competes with state-of-the-art methods in ultra-high dimensional problems, and competes with optimal solutions in problems where exact methods scale, both in terms of recovering the truly relevant features and in its out-of-sample predictive performance.	en_US
dc.publisher	Springer US	en_US
dc.relation.isversionof	https://doi.org/10.1007/s10994-021-06123-2	en_US
dc.rights	Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International	en_US
dc.rights.uri	https://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	Springer US	en_US
dc.title	The backbone method for ultra-high dimensional sparse machine learning	en_US
dc.type	Article	en_US
dc.identifier.citation	Bertsimas, Dimitris and Digalakis, Vassilis. 2022. "The backbone method for ultra-high dimensional sparse machine learning."
dc.contributor.department	Massachusetts Institute of Technology. Operations Research Center
dc.contributor.department	Sloan School of Management
dc.eprint.version	Author's final manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dc.date.updated	2022-06-01T04:08:02Z
dc.language.rfc3066	en
dc.rights.holder	The Author(s), under exclusive licence to Springer Science+Business Media LLC, part of Springer Nature
dspace.embargo.terms	Y
dspace.date.submission	2022-06-01T04:08:01Z
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 10994_2021_6123_ReferencePDF.pdf
Size:: 1.713Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record