General Strategy for Querying Web Sources in a Data Federation Environment
Author(s)
Firat, Aykut; Wu, Lynn; Madnick, Stuart E.
DownloadMadnick-General Strategy.pdf (2.623Mb)
MIT_AMENDMENT
MIT Amendment
Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.
Terms of use
Metadata
Show full item recordAbstract
Modern database management systems are supporting the inclusion and querying of non-relational sources within a data federation environment via wrappers. Wrapper development for Web sources, however, is a convolution of code with extraction and query planning knowledge and becomes a daunting task. We use IBM DB2 federation engine to demonstrate the challenges of incorporating Web sources into a data federation. We, then, present a practical and general strategy for the inclusion and querying of Web sources without requiring any changes in the underlying data federation technology. This strategy separates the code and knowledge in wrapper development by introducing a general-purpose capabilities-aware mini query-planner and a data extraction engine. As a result, Web sources can be included in a data federation system faster, and maintained easier.
Date issued
2009-01Department
Sloan School of ManagementJournal
Journal of Database Management
Publisher
IGI Global
Citation
Firat, Aykut, Lynn Wu, and Stuart Madnick. “General Strategy for Querying Web Sources in a Data Federation Environment.” Journal of Database Management 20 (2009): 1-18. Web. 1 Dec. 2011. © 2009 IGI Global
Version: Final published version
ISSN
1533-8010
1063-8016