Show simple item record

dc.contributor.authorBertsimas, Dimitris J
dc.contributor.authorKing, Angela
dc.contributor.authorMazumder, Rahul
dc.date.accessioned2017-05-03T19:24:27Z
dc.date.available2017-05-03T19:24:27Z
dc.date.issued2016-04
dc.date.submitted2015-08
dc.identifier.issn0090-5364
dc.identifier.urihttp://hdl.handle.net/1721.1/108645
dc.description.abstractn the period 1991–2015, algorithmic advances in Mixed Integer Optimization (MIO) coupled with hardware improvements have resulted in an astonishing 450 billion factor speedup in solving MIO problems. We present a MIO approach for solving the classical best subset selection problem of choosing k out of p features in linear regression given n observations. We develop a discrete extension of modern first-order continuous optimization methods to find high quality feasible solutions that we use as warm starts to a MIO solver that finds provably optimal solutions. The resulting algorithm (a) provides a solution with a guarantee on its suboptimality even if we terminate the algorithm early, (b) can accommodate side constraints on the coefficients of the linear regression and (c) extends to finding best subset solutions for the least absolute deviation loss function. Using a wide variety of synthetic and real datasets, we demonstrate that our approach solves problems with n in the 1000s and p in the 100s in minutes to provable optimality, and finds near optimal solutions for n in the 100s and p in the 1000s in minutes. We also establish via numerical experiments that the MIO approach performs better than Lasso and other popularly used sparse learning procedures, in terms of achieving sparse solutions with good predictive power.en_US
dc.language.isoen_US
dc.publisherInstitute of Mathematical Statisticsen_US
dc.relation.isversionofhttp://dx.doi.org/10.1214/15-aos1388en_US
dc.rightsCreative Commons Attribution-Noncommercial-Share Alikeen_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/4.0/en_US
dc.sourcearXiven_US
dc.titleBest subset selection via a modern optimization lensen_US
dc.typeArticleen_US
dc.identifier.citationBertsimas, Dimitris; King, Angela and Mazumder, Rahul. “Best Subset Selection via a Modern Optimization Lens.” The Annals of Statistics 44, no. 2 (April 2016): 813–852.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Operations Research Centeren_US
dc.contributor.departmentSloan School of Managementen_US
dc.contributor.mitauthorBertsimas, Dimitris J
dc.contributor.mitauthorKing, Angela
dc.contributor.mitauthorMazumder, Rahul
dc.relation.journalAnnals of Statisticsen_US
dc.eprint.versionOriginal manuscripten_US
dc.type.urihttp://purl.org/eprint/type/JournalArticleen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dspace.orderedauthorsBertsimas, Dimitris; King, Angela; Mazumder, Rahulen_US
dspace.embargo.termsNen_US
dc.identifier.orcidhttps://orcid.org/0000-0002-1985-1003
dc.identifier.orcidhttps://orcid.org/0000-0003-1384-9743
mit.licenseOPEN_ACCESS_POLICYen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record