| dc.contributor.author | Woodruff, David P. | |
| dc.contributor.author | Backurs, Arturs | |
| dc.contributor.author | Indyk, Piotr | |
| dc.contributor.author | Razenshteyn, Ilya | |
| dc.date.accessioned | 2018-02-20T21:06:25Z | |
| dc.date.available | 2018-02-20T21:06:25Z | |
| dc.date.issued | 2016-01 | |
| dc.identifier.isbn | 978-1-611974-33-1 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/113845 | |
| dc.description.abstract | We initiate the study of trade-offs between sparsity and the number of measurements in sparse recovery schemes for generic norms. Specifically for a norm ||·||, sparsity parameter k, approximation factor K > 0, and probability of failure P > 0, we ask: what is the minimal value of m so that there is a distribution over m × n matrices A with the property that for any x, given Ax, we can recover a k-sparse approximation to x in the given norm with probability at least 1 -- P? We give a partial answer to this problem, by showing that for norms that admit efficient linear sketches, the optimal number of measurements m is closely related to the doubling dimension of the metric induced by the norm ||·|| on the set of all k-sparse vectors. By applying our result to specific norms, we cast known measurement bounds in our general framework (for the [subscript ℓ]p norms, p ∈ [1, 2]) as well as provide new, measurement-efficient schemes (for the Earth-Mover Distance norm). The latter result directly implies more succinct linear sketches for the well-studied planar k-median clustering problem. Finally our lower bound for the doubling dimension of the EMD norm enables us to resolve the open question of [Frahling-Sohler, STOC'05] about the space complexity of clustering problems in the dynamic streaming model. | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Association for Computing Machinery | en_US |
| dc.relation.isversionof | http://dl.acm.org/citation.cfm?id=2884459 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | arXiv | en_US |
| dc.title | Nearly-optimal bounds for sparse recovery in generic norms, with applications to k-median sketching | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Backurs, Arturs et al. "Nearly-optimal bounds for sparse recovery in generic norms, with applications to k-median sketching." SODA '16 Proceedings of the twenty-seventh annual ACM-SIAM symposium on Discrete algorithms, 20-12 January, 2016, Philadelphia, Pennsylvania, Association of Computing Machinery, 2016, pp. 318-337. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.contributor.mitauthor | Backurs, Arturs | |
| dc.contributor.mitauthor | Indyk, Piotr | |
| dc.contributor.mitauthor | Razenshteyn, Ilya | |
| dc.relation.journal | SODA '16 Proceedings of the twenty-seventh annual ACM-SIAM symposium on Discrete algorithms | en_US |
| dc.eprint.version | Original manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dspace.orderedauthors | Backurs, Arturs; Indyk, Piotr; Razenshteyn, Ilya; Woodruff, David P. | en_US |
| dspace.embargo.terms | N | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0001-7546-6313 | |
| dc.identifier.orcid | https://orcid.org/0000-0002-7983-9524 | |
| dc.identifier.orcid | https://orcid.org/0000-0002-3962-721X | |
| mit.license | OPEN_ACCESS_POLICY | en_US |