Automatic classification of documents with an in-depth analysis of information extraction and automatic summarization
Author(s)
Hohm, Joseph Brandon, 1982-
DownloadFull printable version (8.039Mb)
Other Contributors
Massachusetts Institute of Technology. Dept. of Civil and Environmental Engineering.
Advisor
John R. Williams.
Terms of use
Metadata
Show full item recordAbstract
Today, annual information fabrication per capita exceeds two hundred and fifty megabytes. As the amount of data increases, classification and retrieval methods become more necessary to find relevant information. This thesis describes a .Net application (named I-Document) that establishes an automatic classification scheme in a peer-to-peer environment that allows free sharing of academic, business, and personal documents. A Web service architecture for metadata extraction, Information Extraction, Information Retrieval, and text summarization is depicted. Specific details regarding the coding process, competition, business model, and technology employed in the project are also discussed.
Description
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Civil and Environmental Engineering, 2004. Includes bibliographical references (leaves 78-80).
Date issued
2004Department
Massachusetts Institute of Technology. Department of Civil and Environmental EngineeringPublisher
Massachusetts Institute of Technology
Keywords
Civil and Environmental Engineering.