Advanced Search

Attribution principles for data integration : technology and policy perspectives

Research and Teaching Output of the MIT Community

Show simple item record

dc.contributor.advisor Stuart E. Madnick. en_US Lee, Thomas Y. (Thomas Yupoo) en_US
dc.contributor.other Massachusetts Institute of Technology. Technology, Management, and Policy Program. en_US 2005-10-14T19:27:15Z 2005-10-14T19:27:15Z 2002 en_US 2002 en_US
dc.description Thesis (Ph. D.)--Massachusetts Institute of Technology, Engineering Systems Division, Technology, Management, and Policy Program, 2002. en_US
dc.description Includes bibliographical references (p. [229]-250). en_US
dc.description.abstract This thesis addresses problems of attribution that arise from the data integration that is exemplified by data re-use and re-distribution on the Web. We present two different perspectives. We begin with a simple definition of attribution, asking what data are we interested in and where does it come from? A formal model and its properties are defined, implementation in an extended relational algebra is described, and application to semistructured data on the Web is discussed. However, because the problem is more than simply what and where, we then expand the scope of our analysis. From the perspective of intellectual property policies, we adopt a broader view of the attribution problem space. A policy analysis that surveys the status quo policy landscape and stakeholder interests is followed by specific policy recommendations. Informed by our technology perspective, we offer two new arguments to support misappropriation as a policy approach to the attribution problem space. Our formal model of attribution is developed in the established foundation of the Domain Relational Calculus (DRC). Three distinct types of attribution are identified: comprehensive, source, and relevant. For each type, we consider the attribution of equivalent DRC expressions, attribution for composed queries, and granularity. An algebra is presented to implement the model. The extended algebra is closed, reduces to the standard relational algebra, and is a consistent extension of the standard algebra. en_US
dc.description.abstract (cont.) The policy perspective encompasses not only what and where but also integration architectures and the relationships between data providers and users. Information technologies separate the processes and products of data gathering from data selection and presentation. Where the latter is addressed by copyright, the former is not addressed at all. Based upon two traditional, legal-economic frameworks, the asymmetric Prisoner's Dilemma and Entitlement Theory, we argue for a policy of misappropriation to support integration and attribution for data. en_US
dc.description.statementofresponsibility by Thomas Y. Lee. en_US
dc.format.extent 250 p. en_US
dc.format.extent 14265560 bytes
dc.format.extent 14265360 bytes
dc.format.mimetype application/pdf
dc.format.mimetype application/pdf
dc.language.iso eng en_US
dc.publisher Massachusetts Institute of Technology en_US
dc.rights M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. en_US
dc.subject Technology, Management, and Policy Program. en_US
dc.title Attribution principles for data integration : technology and policy perspectives en_US
dc.title.alternative Principles of attribution for data integration : technology and policy perspectives en_US
dc.type Thesis en_US Ph.D. en_US
dc.contributor.department Massachusetts Institute of Technology. Technology, Management, and Policy Program. en_US
dc.identifier.oclc 51738898 en_US

Files in this item

Name Size Format Description
51738898-MIT.pdf 13.60Mb PDF Full printable version

This item appears in the following Collection(s)

Show simple item record