Block Heavy Hitters

Title: Block Heavy Hitters
Author: Andoni, Alexandr; Ba, Khanh Do; Indyk, Piotr
Other Contributors: Theory of Computation
Advisor: Piotr Indyk
Issue Date: 2008-05-02
Abstract: We study a natural generalization of the heavy hitters problem in the streaming context. We term this generalization *block heavy hitters* and define it as follows. We are to stream over a matrix $A$, and report all *rows* that are heavy, where a row is heavy if its $\ell_1$-norm is at least a $\phi$ fraction of the $\ell_1$-norm of the entire matrix $A$. In comparison, in the standard heavy hitters problem, we are required to report the matrix *entries* that are heavy. As is common in streaming, we solve the problem approximately: we return all rows with weight at least $\phi$, but also possibly some other rows that have weight no less than $(1-\epsilon)\phi$. To solve the block heavy hitters problem, we show how to construct a linear sketch of $A$ from which we can recover the heavy rows of $A$. The block heavy hitters problem has already found applications for other streaming problems. In particular, it is a crucial building block in a streaming algorithm that constructs a small-size sketch for the Ulam metric, a metric on non-repetitive strings under the edit (Levenshtein) distance.
URI: http://hdl.handle.net/1721.1/41514
Other Identifiers: MIT-CSAIL-TR-2008-024
Related To: Massachusetts Institute of Technology Computer Science and Artificial Intelligence Laboratory
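
To make the problem statement in the abstract concrete, the following Python sketch computes heavy rows exactly, by storing the whole matrix, and checks whether a reported set of rows satisfies the approximate guarantee. It is only an illustration of the definition and is not the linear sketch constructed in the report. The update format `(row, column, delta)` and the function names `row_l1_norms`, `heavy_rows`, and `is_valid_report` are assumptions introduced here for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set, Tuple

# Hypothetical update format (an assumption, not from the report):
# (row, column, delta) adds delta to entry A[row][column].
Update = Tuple[int, int, float]


def row_l1_norms(updates: Iterable[Update]) -> Dict[int, float]:
    """Apply all updates exactly and return the l1 norm of every updated row."""
    entries: Dict[Tuple[int, int], float] = defaultdict(float)
    for row, col, delta in updates:
        entries[(row, col)] += delta
    norms: Dict[int, float] = defaultdict(float)
    for (row, _col), value in entries.items():
        norms[row] += abs(value)
    return dict(norms)


def heavy_rows(norms: Dict[int, float], phi: float) -> List[int]:
    """Rows whose l1 norm is at least a phi fraction of the matrix l1 norm."""
    total = sum(norms.values())
    if total == 0:
        return []
    return sorted(r for r, w in norms.items() if w >= phi * total)


def is_valid_report(reported: Set[int], norms: Dict[int, float],
                    phi: float, eps: float) -> bool:
    """Check the approximate guarantee: every row of weight >= phi is reported,
    and every reported row has weight >= (1 - eps) * phi."""
    total = sum(norms.values())
    must = {r for r, w in norms.items() if w >= phi * total}
    allowed = {r for r, w in norms.items() if w >= (1 - eps) * phi * total}
    return must <= reported <= allowed


if __name__ == "__main__":
    stream = [(0, 0, 5.0), (0, 1, -3.0), (1, 0, 1.0), (2, 1, 1.0), (0, 0, -1.0)]
    norms = row_l1_norms(stream)   # row 0 has norm 7, rows 1 and 2 have norm 1
    print(heavy_rows(norms, 0.5))  # [0]
```

This baseline uses space proportional to the number of nonzero entries, which is what a streaming algorithm tries to avoid; it is useful mainly as a reference against which a sketch-based answer could be validated with `is_valid_report`.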

Files in this item

Files Size Format
MIT-CSAIL-TR-2008-024.pdf 196.4Kb application/pdf
MIT-CSAIL-TR-2008-024.ps 73.87Kb application/postscript
