Login

Evaluating and Aggregating Data Believability across Quality Sub-Dimensions and Data Lineage

Show simple item record

dc.contributor.author Prat, Nicolas
dc.contributor.author Madnick, Stuart E.
dc.date.accessioned 2008-01-11T18:15:00Z
dc.date.available 2008-01-11T18:15:00Z
dc.date.issued 2008-01-11T18:15:00Z
dc.identifier.uri http://hdl.handle.net/1721.1/40085
dc.description.abstract Data quality is crucial for operational efficiency and sound decision making. This paper focuses on believability, a major aspect of data quality. The issue of believability is particularly relevant in the context of Web 2.0, where mashups facilitate the combination of data from different sources. Our approach for assessing data believability is based on provenance and lineage, i.e. the origin and subsequent processing history of data. We present the main concepts of our model for representing and storing data provenance, and an ontology of the sub-dimensions of data believability. We then use aggregation operators to compute believability across the sub-dimensions of data believability and the provenance of data. We illustrate our approach with a scenario based on Internet data. Our contribution lies in three main design artifacts (1) the provenance model (2) the ontology of believability subdimensions and (3) the method for computing and aggregating data believability. To our knowledge, this is the first work to operationalize provenance-based assessment of data believability. en
dc.description.provenance Submitted by Peter Maher (pmaher@mit.edu) on 2008-01-11T18:13:45Z No. of bitstreams: 1 4670-07.pdf: 298643 bytes, checksum: 48ca249adefd4a62b80bd1b25c835d34 (MD5) en
dc.description.provenance Approved for entry into archive by Peter Maher(pmaher@mit.edu) on 2008-01-11T18:15:00Z (GMT) No. of bitstreams: 1 4670-07.pdf: 298643 bytes, checksum: 48ca249adefd4a62b80bd1b25c835d34 (MD5) en
dc.description.provenance Made available in DSpace on 2008-01-11T18:15:00Z (GMT). No. of bitstreams: 1 4670-07.pdf: 298643 bytes, checksum: 48ca249adefd4a62b80bd1b25c835d34 (MD5) en
dc.language.iso en_US en
dc.relation.ispartofseries MIT Sloan School of Management Working Paper en
dc.relation.ispartofseries 4670-07 en
dc.subject Data Lineage en
dc.subject Web 2.0 en
dc.title Evaluating and Aggregating Data Believability across Quality Sub-Dimensions and Data Lineage en
dc.type Working Paper en

Files in this item

Files Size Format
4670-07.pdf 298.6Kb application/pdf

This item appears in the following Collection(s)

Show simple item record

Search DSpace@MIT


Advanced Search

Browse

My Account

Links