Learning k-modal distributions via testing

Daskalakis, Constantinos; Diakonikolas, Ilias; Servedio, Rocco A.

dc.contributor.author	Daskalakis, Constantinos
dc.contributor.author	Diakonikolas, Ilias
dc.contributor.author	Servedio, Rocco A.
dc.date.accessioned	2012-10-19T14:37:31Z
dc.date.available	2012-10-19T14:37:31Z
dc.date.issued	2012
dc.identifier.uri	http://hdl.handle.net/1721.1/74148
dc.description.abstract	A k-modal probability distribution over the domain {1,..., n} is one whose histogram has at most k "peaks" and "valleys." Such distributions are natural generalizations of monotone (k = 0) and unimodal (k = 1) probability distributions, which have been intensively studied in probability theory and statistics. In this paper we consider the problem of learning an unknown k-modal distribution. The learning algorithm is given access to independent samples drawn from the k-modal distribution p, and must output a hypothesis distribution p such that with high probability the total variation distance between p and p is at most ε. We give an efficient algorithm for this problem that runs in time poly(k, log(n), 1/ε). For k ≤ Õ(√ log n), the number of samples used by our algorithm is very close (within an Õ(log(1/ε)) factor) to being information-theoretically optimal. Prior to this work computationally efficient algorithms were known only for the cases k = 0, 1 [Bir87b, Bir97]. A novel feature of our approach is that our learning algorithm crucially uses a new property testing algorithm as a key subroutine. The learning algorithm uses the property tester to efficiently decompose the k-modal distribution into k (near)-monotone distributions, which are easier to learn.	en_US
dc.description.sponsorship	National Science Foundation (U.S.) (CAREER Award CCF-0953960)	en_US
dc.description.sponsorship	Alfred P. Sloan Foundation (Fellowship)	en_US
dc.language.iso	en_US
dc.publisher	Society for Industrial and Applied Mathematics	en_US
dc.relation.isversionof	http://dl.acm.org/citation.cfm?id=2095224	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	SIAM	en_US
dc.title	Learning k-modal distributions via testing	en_US
dc.type	Article	en_US
dc.identifier.citation	Constantinos Daskalakis, Ilias Diakonikolas, and Rocco A. Servedio. 2012. "Learning k-modal distributions via testing." In Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '12). SIAM 1371-1385. SIAM ©2012	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science	en_US
dc.contributor.mitauthor	Daskalakis, Constantinos
dc.relation.journal	Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms (SODA '12)	en_US
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
dc.identifier.orcid	https://orcid.org/0000-0002-5451-0490
mit.license	PUBLISHER_POLICY	en_US
mit.metadata.status	Complete

Files in this item

Name:: Daskalakis-Learning k-modal.pdf
Size:: 294.5Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record