dc.contributor.author | Daskalakis, Constantinos | |
dc.contributor.author | Diakonikolas, Ilias | |
dc.contributor.author | Servedio, Rocco A. | |
dc.date.accessioned | 2012-10-19T14:37:31Z | |
dc.date.available | 2012-10-19T14:37:31Z | |
dc.date.issued | 2012 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/74148 | |
dc.description.abstract | A k-modal probability distribution over the domain {1,..., n} is one whose histogram has at most k "peaks" and "valleys." Such distributions are natural generalizations of monotone (k = 0) and unimodal (k = 1) probability distributions, which have been intensively studied in probability theory and statistics.
In this paper we consider the problem of learning an unknown k-modal distribution. The learning algorithm is given access to independent samples drawn from the k-modal distribution p, and must output a hypothesis distribution p such that with high probability the total variation distance between p and p is at most ε.
We give an efficient algorithm for this problem that runs in time poly(k, log(n), 1/ε). For k ≤ Õ(√ log n), the number of samples used by our algorithm is very close (within an Õ(log(1/ε)) factor) to being information-theoretically optimal. Prior to this work computationally efficient algorithms were known only for the cases k = 0, 1 [Bir87b, Bir97].
A novel feature of our approach is that our learning algorithm crucially uses a new property testing algorithm as a key subroutine. The learning algorithm uses the property tester to efficiently decompose the k-modal distribution into k (near)-monotone distributions, which are easier to learn. | en_US |
dc.description.sponsorship | National Science Foundation (U.S.) (CAREER Award CCF-0953960) | en_US |
dc.description.sponsorship | Alfred P. Sloan Foundation (Fellowship) | en_US |
dc.language.iso | en_US | |
dc.publisher | Society for Industrial and Applied Mathematics | en_US |
dc.relation.isversionof | http://dl.acm.org/citation.cfm?id=2095224 | en_US |
dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
dc.source | SIAM | en_US |
dc.title | Learning k-modal distributions via testing | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Constantinos Daskalakis, Ilias Diakonikolas, and Rocco A. Servedio. 2012. "Learning k-modal distributions via testing." In Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '12). SIAM 1371-1385. SIAM ©2012 | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.contributor.mitauthor | Daskalakis, Constantinos | |
dc.relation.journal | Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '12) | en_US |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
dc.identifier.orcid | https://orcid.org/0000-0002-5451-0490 | |
mit.license | PUBLISHER_POLICY | en_US |
mit.metadata.status | Complete | |