dc.contributor.advisor | Walter Bender. | en_US |
dc.contributor.author | Marlow, Cameron Alexander, 1977- | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Dept. of Architecture. Program In Media Arts and Sciences. | en_US |
dc.date.accessioned | 2011-03-24T20:16:16Z | |
dc.date.available | 2011-03-24T20:16:16Z | |
dc.date.copyright | 2001 | en_US |
dc.date.issued | 2001 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/61850 | |
dc.description | Thesis (S.M.)--Massachusetts Institute of Technology, Program in Media Arts & Sciences, 2001. | en_US |
dc.description | Includes bibliographical references (p. 79-81). | en_US |
dc.description.abstract | With the digitization of media, computers can be employed to help us with the process of classification, both by learning from our behavior to perform the task for us and by exposing new ways for us to think about our information. Given that most of our media comes in the form of electronic text, research in this area focuses on building automatic text classification systems. The standard representation employed by these systems, known as the bag-of-words approach to information retrieval, represents documents as collections of words. As a byproduct of this model, automatic classifiers have difficulty distinguishing between different meanings of a single word. This research presents a new computational model of electronic text, called a synchronic imprint, which uses structural information to contextualize the meaning of words. Every concept in the body of a text is described by its relationships with other concepts in the same text, allowing classification systems to distinguish between alternative meanings of the same word. This representation is applied to both the standard problem of text classification and also to the task of enabling people to better identify large bodies of text. The latter is achieved through the development of a visualization tool named flux that models synchronic imprints as a spring network. | en_US |
dc.description.statementofresponsibility | by Cameron Alexander Marlow. | en_US |
dc.format.extent | 81 leaves | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by
copyright. They may be viewed from this source for any purpose, but
reproduction or distribution in any format is prohibited without written
permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Architecture. Program In Media Arts and Sciences. | en_US |
dc.title | A language-based approach to categorical analysis | en_US |
dc.type | Thesis | en_US |
dc.description.degree | S.M. | en_US |
dc.contributor.department | Program in Media Arts and Sciences (Massachusetts Institute of Technology) | |
dc.identifier.oclc | 49676138 | en_US |