Show simple item record

dc.contributor.author: Indyk, Piotr
dc.contributor.author: Levi, Reut
dc.contributor.author: Rubinfeld, Ronitt
dc.date.accessioned: 2014-05-15T18:13:10Z
dc.date.available: 2014-05-15T18:13:10Z
dc.date.issued: 2012-05
dc.identifier.isbn: 9781450312486
dc.identifier.uri: http://hdl.handle.net/1721.1/87005
dc.description.abstract: A discrete distribution p, over [n], is a k-histogram if its probability distribution function can be represented as a piece-wise constant function with k pieces. Such a function is represented by a list of k intervals and k corresponding values. We consider the following problem: given a collection of samples from a distribution p, find a k-histogram that (approximately) minimizes the $\ell_2$ distance to the distribution p. We give time and sample efficient algorithms for this problem. We further provide algorithms that distinguish distributions that have the property of being a k-histogram from distributions that are ε-far from any k-histogram in the $\ell_1$ distance and $\ell_2$ distance respectively. [en_US]
dc.description.sponsorship: David & Lucile Packard Foundation (Fellowship) [en_US]
dc.description.sponsorship: National Science Foundation (U.S.) (Grant CCF-0728645) [en_US]
dc.description.sponsorship: National Science Foundation (U.S.) (Grant 0732334) [en_US]
dc.description.sponsorship: National Science Foundation (U.S.) (Grant 0728645) [en_US]
dc.language.iso: en_US
dc.publisher: Association for Computing Machinery (ACM) [en_US]
dc.relation.isversionof: http://dx.doi.org/10.1145/2213556.2213561 [en_US]
dc.rights: Creative Commons Attribution-Noncommercial-Share Alike [en_US]
dc.rights.uri: http://creativecommons.org/licenses/by-nc-sa/4.0/ [en_US]
dc.source: MIT web domain [en_US]
dc.title: Approximating and testing k-histogram distributions in sub-linear time [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Piotr Indyk, Reut Levi, and Ronitt Rubinfeld. 2012. Approximating and testing k-histogram distributions in sub-linear time. In Proceedings of the 31st symposium on Principles of Database Systems (PODS '12), Markus Krötzsch (Ed.). ACM, New York, NY, USA, 15-22. [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science [en_US]
dc.contributor.mitauthor: Indyk, Piotr [en_US]
dc.contributor.mitauthor: Rubinfeld, Ronitt [en_US]
dc.relation.journal: Proceedings of the 31st symposium on Principles of Database Systems (PODS '12) [en_US]
dc.eprint.version: Author's final manuscript [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
eprint.status: http://purl.org/eprint/status/NonPeerReviewed [en_US]
dspace.orderedauthors: Indyk, Piotr; Levi, Reut; Rubinfeld, Ronitt [en_US]
dc.identifier.orcid: https://orcid.org/0000-0002-4353-7639
dc.identifier.orcid: https://orcid.org/0000-0002-7983-9524
mit.license: OPEN_ACCESS_POLICY [en_US]
mit.metadata.status: Complete

