MIT Libraries logoDSpace@MIT

MIT
View Item 
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
  • DSpace@MIT Home
  • MIT Libraries
  • MIT Theses
  • Graduate Theses
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Learning the process of World Wide Web data retrieval

Author(s)
Manuel, Ryan A
Thumbnail
DownloadFull printable version (4.984Mb)
Alternative title
Learning the process of WWW data retrieval
Other Contributors
Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.
Advisor
David R. Karger.
Terms of use
M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582
Metadata
Show full item record
Abstract
We develop a method for extracting and internalizing web site form submissions which we refer to as web operations. To begin the process, a user performs a sample submission of the form. From that submission, our system determines all of the necessary information to store the web operation. Through a simple user interface the user can view and modify the web operation to the extent that he wants or needs to. With the operation now stored, the user can invoke the operation without browsing to the web site on which the operation was originally contained. By utilizing the web site information extraction techniques contained in the Haystack information management system, we give the user the option to extract information off of web operation results pages. Thus, when using our system to the fullest extent, a user can invoke web operations and view and make use of the results without viewing any web pages.
Description
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2005.
 
Includes bibliographical references (leaf 65).
 
Date issued
2005
URI
http://hdl.handle.net/1721.1/33288
Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Publisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.

Collections
  • Graduate Theses

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

My Account

Login

Statistics

OA StatisticsStatistics by CountryStatistics by Department
MIT Libraries
PrivacyPermissionsAccessibilityContact us
MIT
Content created by the MIT Libraries, CC BY-NC unless otherwise noted. Notify us about copyright concerns.