| dc.contributor.author | Indyk, Piotr | |
| dc.contributor.author | Levi, Reut | |
| dc.contributor.author | Rubinfeld, Ronitt | |
| dc.date.accessioned | 2014-05-15T18:13:10Z | |
| dc.date.available | 2014-05-15T18:13:10Z | |
| dc.date.issued | 2012-05 | |
| dc.identifier.isbn | 9781450312486 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/87005 | |
| dc.description.abstract | A discrete distribution p over [n] is a k-histogram if its probability distribution function can be represented as a piecewise constant function with k pieces. Such a function is represented by a list of k intervals and k corresponding values. We consider the following problem: given a collection of samples from a distribution p, find a k-histogram that (approximately) minimizes the $\ell_2$ distance to the distribution p. We give time- and sample-efficient algorithms for this problem. We further provide algorithms that distinguish distributions that are k-histograms from distributions that are ε-far from any k-histogram in the $\ell_1$ and $\ell_2$ distances, respectively. | en_US |
| dc.description.sponsorship | David & Lucile Packard Foundation (Fellowship) | en_US |
| dc.description.sponsorship | National Science Foundation (U.S.) (Grant CCF-0728645) | en_US |
| dc.description.sponsorship | National Science Foundation (U.S.) (Grant 0732334) | en_US |
| dc.description.sponsorship | National Science Foundation (U.S.) (Grant 0728645) | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Association for Computing Machinery (ACM) | en_US |
| dc.relation.isversionof | http://dx.doi.org/10.1145/2213556.2213561 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | MIT web domain | en_US |
| dc.title | Approximating and testing k-histogram distributions in sub-linear time | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Piotr Indyk, Reut Levi, and Ronitt Rubinfeld. 2012. Approximating and testing k-histogram distributions in sub-linear time. In Proceedings of the 31st symposium on Principles of Database Systems (PODS '12), Markus Krötzsch (Ed.). ACM, New York, NY, USA, 15-22. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.contributor.mitauthor | Indyk, Piotr | en_US |
| dc.contributor.mitauthor | Rubinfeld, Ronitt | en_US |
| dc.relation.journal | Proceedings of the 31st symposium on Principles of Database Systems (PODS '12) | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dspace.orderedauthors | Indyk, Piotr; Levi, Reut; Rubinfeld, Ronitt | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0002-4353-7639 | |
| dc.identifier.orcid | https://orcid.org/0000-0002-7983-9524 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |
| mit.metadata.status | Complete | |
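
The abstract's definition of a k-histogram (k intervals with k corresponding values) and the $\ell_2$ distance it refers to can be illustrated with a minimal sketch. The function names, the 1-based inclusive interval convention, and the example values below are assumptions made for illustration and do not come from the paper. The sketch reads every element of [n], so it only shows the objects involved; it is not the paper's sub-linear algorithm.

```python
from collections import Counter
import math


def histogram_pmf(n, pieces):
    """Expand a k-histogram into a full pmf over [n] = {1, ..., n}.

    `pieces` is a list of (lo, hi, value) triples with inclusive,
    1-based endpoints. The intervals are assumed to partition [n].
    """
    pmf = [0.0] * n
    for lo, hi, value in pieces:
        for i in range(lo, hi + 1):
            pmf[i - 1] = value
    return pmf


def empirical_pmf(n, samples):
    """Empirical distribution over [n] induced by a list of samples."""
    counts = Counter(samples)
    m = len(samples)
    return [counts.get(i, 0) / m for i in range(1, n + 1)]


def l2_distance(p, q):
    """Euclidean (l2) distance between two pmfs of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


# A hypothetical 2-histogram over [6]: 0.3 on {1, 2} and 0.1 on {3, ..., 6}.
n = 6
pieces = [(1, 2, 0.3), (3, 6, 0.1)]
hist = histogram_pmf(n, pieces)

# Hypothetical samples whose empirical frequencies match the histogram.
samples = [1, 1, 2, 2, 3, 4, 5, 6, 1, 2]
emp = empirical_pmf(n, samples)

distance = l2_distance(hist, emp)
```

Here `distance` measures how well the candidate histogram fits the observed samples; the problem stated in the abstract asks for a k-histogram that approximately minimizes this quantity with respect to the true distribution p.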