<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" version="2.0">
<channel>
<title>Sloan Working Papers</title>
<link>http://hdl.handle.net/1721.1/1792</link>
<description/>
<image>
<title>The Channel Image</title>
<url xmlns="http://apache.org/cocoon/i18n/2.1">http://dspace.mit.edu:80/retrieve/5829</url>
<link>http://hdl.handle.net/1721.1/1792</link>
</image>
<item>
<title>Semantic Integration Approach to Efficient Business Data Supply Chain: Integration Approach to Interoperable XBRL</title>
<link>http://hdl.handle.net/1721.1/40087</link>
<description>Semantic Integration Approach to Efficient Business Data Supply Chain: Integration Approach to Interoperable XBRL

Zhu, Hongwei

Madnick, Stuart E.

As an open standard for electronic communication of business and financial data, XBRL has the&#13;
potential of improving the efficiency of the business data supply chain. A number of jurisdictions&#13;
have developed different XBRL taxonomies as their data standards. Semantic heterogeneity&#13;
exists in these taxonomies, the corresponding instances, and the internal systems that store the&#13;
original data. Consequently, there are still substantial difficulties in creating and using XBRL&#13;
instances that involve multiple taxonomies. To fully realize the potential benefits of XBRL, we&#13;
have to develop technologies to reconcile semantic heterogeneity and enable interoperability of&#13;
various parts of the supply chain. In this paper, we analyze the XBRL standard and use examples&#13;
of different taxonomies to illustrate the interoperability challenge. We also propose a technical&#13;
solution that incorporates schema matching and context mediation techniques to improve the&#13;
efficiency of the production and consumption of XBRL data.

</description>
<pubDate>Fri, 11 Jan 2008 18:30:18 GMT</pubDate>
</item>
<item>
<title>Measuring Data Believability: A Provenance Approach</title>
<link>http://hdl.handle.net/1721.1/40086</link>
<description>Measuring Data Believability: A Provenance Approach

Prat, Nicolas

Madnick, Stuart E.

Data quality is crucial for operational efficiency&#13;
and sound decision making. This paper focuses on&#13;
believability, a major aspect of quality, measured&#13;
along three dimensions: trustworthiness,&#13;
reasonableness, and temporality. We ground our&#13;
approach on provenance, i.e. the origin and&#13;
subsequent processing history of data. We present our&#13;
provenance model and our approach for computing&#13;
believability based on provenance metadata. The&#13;
approach is structured into three increasingly complex&#13;
building blocks: (1) definition of metrics for assessing&#13;
the believability of data sources, (2) definition of&#13;
metrics for assessing the believability of data resulting&#13;
from one process run and (3) assessment of&#13;
believability based on all the sources and processing&#13;
history of data. We illustrate our approach with a&#13;
scenario based on Internet data. To our knowledge,&#13;
this is the first work to develop a precise approach to&#13;
measuring data believability and making explicit use of&#13;
provenance-based measurements.

</description>
<pubDate>Fri, 11 Jan 2008 18:24:10 GMT</pubDate>
</item>
<item>
<title>Evaluating and Aggregating Data Believability across Quality Sub-Dimensions and Data Lineage</title>
<link>http://hdl.handle.net/1721.1/40085</link>
<description>Evaluating and Aggregating Data Believability across Quality Sub-Dimensions and Data Lineage

Prat, Nicolas

Madnick, Stuart E.

Data quality is crucial for operational efficiency and sound decision making. This paper focuses on believability,&#13;
a major aspect of data quality. The issue of believability is particularly relevant in the context of Web 2.0, where&#13;
mashups facilitate the combination of data from different sources. Our approach for assessing data believability is&#13;
based on provenance and lineage, i.e. the origin and subsequent processing history of data. We present the main&#13;
concepts of our model for representing and storing data provenance, and an ontology of the sub-dimensions of data&#13;
believability. We then use aggregation operators to compute believability across the sub-dimensions of data&#13;
believability and the provenance of data. We illustrate our approach with a scenario based on Internet data. Our&#13;
contribution lies in three main design artifacts (1) the provenance model (2) the ontology of believability subdimensions&#13;
and (3) the method for computing and aggregating data believability. To our knowledge, this is the first&#13;
work to operationalize provenance-based assessment of data believability.

</description>
<pubDate>Fri, 11 Jan 2008 18:15:00 GMT</pubDate>
</item>
<item>
<title>Enabling Global Price Comparison through Semantic Integration of Web Data</title>
<link>http://hdl.handle.net/1721.1/40084</link>
<description>Enabling Global Price Comparison through Semantic Integration of Web Data

Zhu, Hongwei

Siegel, Michael

Madnick, Stuart

“Sell Globally” and “Shop Globally” have been seen as a potential&#13;
benefit of web-enabled electronic business. One important step toward realizing&#13;
this benefit is to know how things are selling in various parts of the world. A&#13;
global price comparison service would address this need. But there have not&#13;
been many such services. In this paper, we use a case study of global price&#13;
dispersion to illustrate the need and the value of a global price comparison&#13;
service. Then we identify and discuss several technology challenges, including&#13;
semantic heterogeneity, in providing a global price comparison service. We&#13;
propose a mediation architecture to address the semantic heterogeneity&#13;
problem, and demonstrate the feasibility of the proposed architecture by&#13;
implementing a prototype that enables global price comparison using data from&#13;
web sources in several countries.

</description>
<pubDate>Fri, 11 Jan 2008 18:04:18 GMT</pubDate>
</item>
</channel>
</rss>
