dc.contributor.advisor | Patrick H. Winston. | en_US |
dc.contributor.author | Couturier, Martin Marcel | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. | en_US |
dc.date.accessioned | 2011-10-17T21:23:17Z | |
dc.date.available | 2011-10-17T21:23:17Z | |
dc.date.copyright | 2011 | en_US |
dc.date.issued | 2011 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/66413 | |
dc.description | Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2011. | en_US |
dc.description | Cataloged from PDF version of thesis. | en_US |
dc.description | Includes bibliographical references (p. 77). | en_US |
dc.description.abstract | Today, powerful programs readily parse English text; understanding, however, is another matter. In this thesis, I take a step toward understanding by introducing CLARIFY, a program that disambiguates words. CLARIFY identifies patterns in observed word contexts, and uses these patterns to select the optimal word sense for any specific situation. CLARIFY learns successful patterns by manipulating an accelerated Self-Organizing Map to save these example contexts and then references them to perform further context based disambiguation within the language. Through this process and after training on 125 examples, CLARIFY can now decipher that shrimp in the sentence "The shrimp goes to the store. " is a small-person, not relying on a literal definition of each word as a separate element but looking at the sentence as a fluid solution of many elements, thereby making the inference crustacean absurd. CLARIFY is implemented in 1500 lines of Java. | en_US |
dc.description.statementofresponsibility | by Martin Marcel Couturier. | en_US |
dc.format.extent | 77 p. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by
copyright. They may be viewed from this source for any purpose, but
reproduction or distribution in any format is prohibited without written
permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Electrical Engineering and Computer Science. | en_US |
dc.title | Disambiguating words with self-organizing maps | en_US |
dc.type | Thesis | en_US |
dc.description.degree | M.Eng. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
dc.identifier.oclc | 755091036 | en_US |