
dc.contributor.author: Bhardwaj, Anant P.
dc.contributor.author: Bhattacherjee, Souvik
dc.contributor.author: Chavan, Amit
dc.contributor.author: Deshpande, Amol
dc.contributor.author: Elmore, Aaron J.
dc.contributor.author: Madden, Samuel R.
dc.contributor.author: Parameswaran, Aditya
dc.date.accessioned: 2016-01-19T03:17:20Z
dc.date.available: 2016-01-19T03:17:20Z
dc.date.issued: 2015-01
dc.identifier.uri: http://hdl.handle.net/1721.1/100919
dc.description.abstract: Relational databases have limited support for data collaboration, where teams collaboratively curate and analyze large datasets. Inspired by software version control systems like git, we propose (a) a dataset version control system, giving users the ability to create, branch, merge, difference and search large, divergent collections of datasets, and (b) a platform, DATA HUB, that gives users the ability to perform collaborative data analysis building on this version control system. We outline the challenges in providing dataset version control at scale. [en_US]
dc.language.iso: en_US
dc.relation.isversionof: http://cidrdb.org/cidr2015/program.html [en_US]
dc.rights: Creative Commons Attribution [en_US]
dc.rights.uri: http://creativecommons.org/licenses/by/3.0/ [en_US]
dc.source: MIT web domain [en_US]
dc.title: DataHub: Collaborative Data Science & Dataset Version Management at Scale [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Bhardwaj, Anant, Souvik Bhattacherjee, Amit Chavan, Amol Deshpande, Aaron J. Elmore, Samuel Madden, Aditya Parameswaran. "DataHub: Collaborative Data Science & Dataset Version Management at Scale." 7th Biennial Conference on Innovative Data Systems Research (CIDR ’15) (January 2015). [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science [en_US]
dc.contributor.mitauthor: Bhardwaj, Anant P. [en_US]
dc.contributor.mitauthor: Elmore, Aaron J. [en_US]
dc.contributor.mitauthor: Madden, Samuel R. [en_US]
dc.contributor.mitauthor: Parameswaran, Aditya [en_US]
dc.relation.journal: Proceedings of the 7th Biennial Conference on Innovative Data Systems Research (CIDR ’15) [en_US]
dc.eprint.version: Author's final manuscript [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
eprint.status: http://purl.org/eprint/status/NonPeerReviewed [en_US]
dspace.orderedauthors: Bhardwaj, Anant; Bhattacherjee, Souvik; Chavan, Amit; Deshpande, Amol; Elmore, Aaron J.; Madden, Samuel; Parameswaran, Aditya [en_US]
dc.identifier.orcid: https://orcid.org/0000-0002-7470-3265
dc.identifier.orcid: https://orcid.org/0000-0002-4642-1869
mit.license: PUBLISHER_CC [en_US]

