Beyond the low-degree algorithm: mixtures of subcubes and their applications

Chen, Sitan; Moitra, Ankur

dc.contributor.author	Chen, Sitan
dc.contributor.author	Moitra, Ankur
dc.date.accessioned	2022-07-06T21:03:19Z
dc.date.available	2021-11-09T19:19:08Z
dc.date.available	2022-07-06T21:03:19Z
dc.date.issued	2019-06-23
dc.identifier.uri	https://hdl.handle.net/1721.1/138050.2
dc.description.abstract	© 2019 Association for Computing Machinery. We introduce the problem of learning mixtures of k subcubes over {0,1}^n, which contains many classic learning theory problems as a special case (and is itself a special case of others). We give a surprising n^{O(log k)}-time learning algorithm based on higher-order multilinear moments. It is not possible to learn the parameters because the same distribution can be represented by quite different models. Instead, we develop a framework for reasoning about how multilinear moments can pinpoint essential features of the mixture, like the number of components. We also give applications of our algorithm to learning decision trees with stochastic transitions (which also capture interesting scenarios where the transitions are deterministic but there are latent variables). Using our algorithm for learning mixtures of subcubes, we can approximate the Bayes optimal classifier within additive error ϵ on k-leaf decision trees with at most s stochastic transitions on any root-to-leaf path in n^{O(s + log k)} · poly(1/ϵ) time. In this stochastic setting, the classic n^{O(log k)} · poly(1/ϵ)-time algorithms of Rivest, Blum, and Ehrenfeucht-Haussler for learning decision trees with zero stochastic transitions break down because they are fundamentally Occam algorithms. The low-degree algorithm of Linial-Mansour-Nisan is able to get a constant factor approximation to the optimal error (again within an additive ϵ) and runs in time n^{O(s + log(k/ϵ))}. The quasipolynomial dependence on 1/ϵ is inherent to the low-degree approach because the degree needs to grow as the target accuracy decreases, which is undesirable when ϵ is small.
In contrast, as we will show, mixtures of k subcubes are uniquely determined by their 2 log k order moments and hence provide a useful abstraction for simultaneously achieving the polynomial dependence on 1/ϵ of the classic Occam algorithms for decision trees and the flexibility of the low-degree algorithm in being able to accommodate stochastic transitions. Using our multilinear moment techniques, we also give the first improved upper and lower bounds since the work of Feldman-O’Donnell-Servedio for the related but harder problem of learning mixtures of binary product distributions.	en_US
dc.language.iso	en
dc.publisher	ACM	en_US
dc.relation.isversionof	10.1145/3313276.3316375	en_US
dc.rights	Creative Commons Attribution-Noncommercial-Share Alike	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/	en_US
dc.source	arXiv	en_US
dc.title	Beyond the low-degree algorithm: mixtures of subcubes and their applications	en_US
dc.type	Article	en_US
dc.identifier.citation	Chen, Sitan and Moitra, Ankur. 2019. "Beyond the low-degree algorithm: mixtures of subcubes and their applications."	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Mathematics	en_US
dc.eprint.version	Original manuscript	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2019-11-15T17:58:30Z
dspace.date.submission	2019-11-15T17:58:35Z
mit.license	OPEN_ACCESS_POLICY
mit.metadata.status	Publication Information Needed	en_US

Files in this item

Name: 1803.06521.pdf
Size: 821.4Kb
Format: Unknown
Description: Submitted version

This item appears in the following Collection(s)

MIT Open Access Articles

Version	Item	Date	Summary
2	1721.1/138050.2*	2022-07-06T20:58:07Z	Metadata changed: Verified or entered author name and department authority metadata.
1	1721.1/138050	2021-11-09T19:19:08Z

*Selected version
