dc.contributor.authorHazimeh, Hussein
dc.contributor.authorMazumder, Rahul
dc.date.accessioned2021-04-08T15:34:56Z
dc.date.available2021-04-08T15:34:56Z
dc.date.issued2020-08
dc.identifier.issn0030-364X
dc.identifier.issn1526-5463
dc.identifier.urihttps://hdl.handle.net/1721.1/130416
dc.description.abstractThe L₀-regularized least squares problem (a.k.a. best subsets) is central to sparse statistical learning and has attracted significant attention across the wider statistics, machine learning, and optimization communities. Recent work has shown that modern mixed integer optimization (MIO) solvers can be used to address small to moderate instances of this problem. In spite of the usefulness of L₀-based estimators and generic MIO solvers, there is a steep computational price to pay when compared with popular sparse learning algorithms (e.g., based on L₁ regularization). In this paper, we aim to push the frontiers of computation for a family of L₀-regularized problems with additional convex penalties. We propose a new hierarchy of necessary optimality conditions for these problems. We develop fast algorithms, based on coordinate descent and local combinatorial optimization, that are guaranteed to converge to solutions satisfying these optimality conditions. From a statistical viewpoint, an interesting story emerges. When the signal strength is high, our combinatorial optimization algorithms have an edge in challenging statistical settings. When the signal is lower, pure L₀ benefits from additional convex regularization. We empirically demonstrate that our family of L₀-based estimators can outperform the state-of-the-art sparse learning algorithms in terms of a combination of prediction, estimation, and variable selection metrics under various regimes (e.g., different signal strengths, feature correlations, numbers of samples and features). Our new open-source sparse learning toolkit L0Learn (available on CRAN and GitHub) reaches up to a threefold speedup (with p up to 10⁶) when compared with competing toolkits such as glmnet and ncvreg.en_US
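The abstract names the two algorithmic ingredients: cyclic coordinate descent and local combinatorial optimization. As a concrete illustration of the first, below is a minimal Python sketch of coordinate descent for the pure L₀-penalized least squares objective (no additional L₁/L₂ terms), assuming the columns of X are normalized to unit ℓ₂ norm. Under that assumption, each coordinate update reduces to hard thresholding at √(2λ₀). This is an illustrative sketch, not the L0Learn implementation; the function name and parameters are invented for this example.

import numpy as np

def l0_coordinate_descent(X, y, lam0, max_iters=100, tol=1e-8):
    # Cyclic coordinate descent for
    #     min_b 0.5 * ||y - X b||^2 + lam0 * ||b||_0,
    # assuming each column of X has unit l2 norm, so every
    # coordinate update is hard thresholding at sqrt(2 * lam0).
    n, p = X.shape
    beta = np.zeros(p)
    r = y.astype(float)                   # residual: y - X @ beta
    thresh = np.sqrt(2.0 * lam0)
    for _ in range(max_iters):
        max_change = 0.0
        for j in range(p):
            # Correlation of column j with the partial residual;
            # the "+ beta[j]" term uses X[:, j] @ X[:, j] == 1.
            rho = X[:, j] @ r + beta[j]
            new = rho if abs(rho) > thresh else 0.0
            if new != beta[j]:
                r += X[:, j] * (beta[j] - new)   # keep residual in sync
                max_change = max(max_change, abs(new - beta[j]))
                beta[j] = new
        if max_change < tol:              # converged: no coordinate moved
            break
    return beta

# Toy usage: recover a 3-sparse signal from noisy measurements.
rng = np.random.default_rng(0)
X = rng.standard_normal((100, 20))
X /= np.linalg.norm(X, axis=0)            # unit-norm columns
beta_true = np.zeros(20)
beta_true[:3] = 5.0
y = X @ beta_true + 0.1 * rng.standard_normal(100)
print(np.nonzero(l0_coordinate_descent(X, y, lam0=0.5))[0])
# With this strong signal, the printed support should be the true {0, 1, 2}.

The paper's local combinatorial optimization step goes further, swapping variables in and out of the support to escape fixed points of coordinate-wise updates like the one above; that part is omitted here for brevity.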
dc.description.sponsorshipUnited States. Office of Naval Research (ONR-N000141512342, ONR-N000141812298 (Young Investigator Award))en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (Grant NSF-IIS-1718258)en_US
dc.language.isoen
dc.publisherInstitute for Operations Research and the Management Sciences (INFORMS)en_US
dc.relation.isversionof10.1287/OPRE.2019.1919en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcearXiven_US
dc.titleFast Best Subset Selection: Coordinate Descent and Local Combinatorial Optimization Algorithmsen_US
dc.typeArticleen_US
dc.identifier.citationHazimeh, Hussein and Rahul Mazumder. “Fast Best Subset Selection: Coordinate Descent and Local Combinatorial Optimization Algorithms.” Operations Research, 68, 5 (August 2020): iii-vi, 1285-1624, C2-C3 © 2020 The Author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Operations Research Centeren_US
dc.contributor.departmentSloan School of Managementen_US
dc.relation.journalOperations Researchen_US
dc.eprint.versionAuthor's final manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dc.date.updated2021-04-08T14:28:24Z
dspace.orderedauthorsHazimeh, H; Mazumder, Ren_US
dspace.date.submission2021-04-08T14:28:25Z
mit.journal.volume68en_US
mit.journal.issue5en_US
mit.licenseOPEN_ACCESS_POLICY
mit.metadata.statusComplete

