| dc.contributor.author | Indyk, Piotr | |
| dc.contributor.author | Mahabadi, Sepideh | |
| dc.contributor.author | Mahdian, Mohammad | |
| dc.contributor.author | Mirrokni, Vahab S. | |
| dc.date.accessioned | 2018-02-26T21:46:40Z | |
| dc.date.available | 2018-02-26T21:46:40Z | |
| dc.date.issued | 2014-06 | |
| dc.identifier.issn | 978-1-4503-2375-8 | |
| dc.identifier.uri | http://hdl.handle.net/1721.1/113896 | |
| dc.description.abstract | In this paper we consider efficient construction of "composable core-sets" for basic diversity and coverage maximization problems. A core-set for a point-set in a metric space is a subset of the point-set with the property that an approximate solution to the whole point-set can be obtained given the core-set alone. A composable core-set has the property that for a collection of sets, the approximate solution to the union of the sets in the collection can be obtained given the union of the composable core-sets for the point sets in the collection. Using composable core-sets one can obtain efficient solutions to a wide variety of massive data processing applications, including nearest neighbor search, streaming algorithms and map-reduce computation.
Our main results are algorithms for constructing composable core-sets for several notions of "diversity objective functions", a topic that attracted a significant amount of research over the last few years. The composable core-sets we construct are small and accurate: their approximation factor almost matches that of the best "off-line" algorithms for the relevant optimization problems (up to a constant factor). Moreover, we also show applications of our results to diverse nearest neighbor search, streaming algorithms and map-reduce computation. Finally, we show that for an alternative notion of diversity maximization based on the maximum coverage problem small composable core-sets do not exist. | en_US |
| dc.language.iso | en_US | |
| dc.publisher | Association for Computing Machinery | en_US |
| dc.relation.isversionof | http://dx.doi.org/10.1145/2594538.2594560 | en_US |
| dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
| dc.source | MIT Web Domain | en_US |
| dc.title | Composable core-sets for diversity and coverage maximization | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Indyk, Piotr, et al. "Composable Core-Sets for Diversity and Coverage Maximization." Proceedings of the 33rd ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems - PODS '14, 22-27 June, 2014, New York, New York, ACM Press, 2014, pp. 100–08. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.contributor.mitauthor | Indyk, Piotr | |
| dc.contributor.mitauthor | Mahabadi, Sepideh | |
| dc.relation.journal | Proceedings of the 33rd ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems - PODS '14 | en_US |
| dc.eprint.version | Author's final manuscript | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dspace.orderedauthors | Indyk, Piotr; Mahabadi, Sepideh; Mahdian, Mohammad; Mirrokni, Vahab S. | en_US |
| dspace.embargo.terms | N | en_US |
| dc.identifier.orcid | https://orcid.org/0000-0002-7983-9524 | |
| dc.identifier.orcid | https://orcid.org/0000-0001-5004-8991 | |
| mit.license | OPEN_ACCESS_POLICY | en_US |