dc.contributor.author | Agarwal, Anish | |
dc.contributor.author | Dahleh, Munther A | |
dc.contributor.author | Sarkar, Tuhin | |
dc.date.accessioned | 2020-12-09T19:45:43Z | |
dc.date.available | 2020-12-09T19:45:43Z | |
dc.date.issued | 2019-06 | |
dc.identifier.isbn | 9781450367929 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/128759 | |
dc.description.abstract | In this work, we aim to design a data marketplace; a robust real-time matching mechanism to efficiently buy and sell training data for Machine Learning tasks. While the monetization of data and pre-trained models is an essential focus of industry today, there does not exist a market mechanism to price training data and match buyers to sellers while still addressing the associated (computational and other) complexity. The challenge in creating such a market stems from the very nature of data as an asset: (i) it is freely replicable; (ii) its value is inherently combinatorial due to correlation with signal in other data; (iii) prediction tasks and the value of accuracy vary widely; (iv) usefulness of training data is difficult to verify a priori without first applying it to a prediction task. As our main contributions we: (i) propose a mathematical model for a two-sided data market and formally define the key associated challenges; (ii) construct algorithms for such a market to function and analyze how they meet the challenges defined. We highlight two technical contributions: (i) a new notion of "fairness" required for cooperative games with freely replicable goods; (ii) a truthful, zero regret mechanism to auction a class of combinatorial goods based on utilizing Myerson's payment function and the Multiplicative Weights algorithm. These might be of independent interest. | en_US |
dc.language.iso | en | |
dc.publisher | Association for Computing Machinery (ACM) | en_US |
dc.relation.isversionof | http://dx.doi.org/10.1145/3328526.3329589 | en_US |
dc.rights | Creative Commons Attribution-Noncommercial-Share Alike | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ | en_US |
dc.source | arXiv | en_US |
dc.title | A Marketplace for Data: An Algorithmic Solution | en_US |
dc.title.alternative | An Algorithmic Solution | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Agarwal, Anish et al. "A Marketplace for Data: An Algorithmic Solution." 2019 ACM Conference on Economics and Computation, June 2019, Phoenix, Arizona, Association for Computing Machinery, June 2019. © 2019 Association for Computing Machinery. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.relation.journal | 2019 ACM Conference on Economics and Computation | en_US |
dc.eprint.version | Author's final manuscript | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2020-12-07T15:09:54Z | |
dspace.orderedauthors | Agarwal, A; Dahleh, M; Sarkar, T | en_US |
dspace.date.submission | 2020-12-07T15:09:58Z | |
mit.license | OPEN_ACCESS_POLICY | |
mit.metadata.status | Complete | |