Learning the process of World Wide Web data retrieval
Author(s)
Manuel, Ryan A
DownloadFull printable version (4.984Mb)
Alternative title
Learning the process of WWW data retrieval
Other Contributors
Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.
Advisor
David R. Karger.
Terms of use
Metadata
Show full item recordAbstract
We develop a method for extracting and internalizing web site form submissions which we refer to as web operations. To begin the process, a user performs a sample submission of the form. From that submission, our system determines all of the necessary information to store the web operation. Through a simple user interface the user can view and modify the web operation to the extent that he wants or needs to. With the operation now stored, the user can invoke the operation without browsing to the web site on which the operation was originally contained. By utilizing the web site information extraction techniques contained in the Haystack information management system, we give the user the option to extract information off of web operation results pages. Thus, when using our system to the fullest extent, a user can invoke web operations and view and make use of the results without viewing any web pages.
Description
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2005. Includes bibliographical references (leaf 65).
Date issued
2005Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer SciencePublisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.