Show simple item record

dc.contributor.advisorPatrick H. Winston.en_US
dc.contributor.authorCouturier, Martin Marcelen_US
dc.contributor.otherMassachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.en_US
dc.date.accessioned2011-10-17T21:23:17Z
dc.date.available2011-10-17T21:23:17Z
dc.date.copyright2011en_US
dc.date.issued2011en_US
dc.identifier.urihttp://hdl.handle.net/1721.1/66413
dc.descriptionThesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2011.en_US
dc.descriptionCataloged from PDF version of thesis.en_US
dc.descriptionIncludes bibliographical references (p. 77).en_US
dc.description.abstractToday, powerful programs readily parse English text; understanding, however, is another matter. In this thesis, I take a step toward understanding by introducing CLARIFY, a program that disambiguates words. CLARIFY identifies patterns in observed word contexts, and uses these patterns to select the optimal word sense for any specific situation. CLARIFY learns successful patterns by manipulating an accelerated Self-Organizing Map to save these example contexts and then references them to perform further context based disambiguation within the language. Through this process and after training on 125 examples, CLARIFY can now decipher that shrimp in the sentence "The shrimp goes to the store. " is a small-person, not relying on a literal definition of each word as a separate element but looking at the sentence as a fluid solution of many elements, thereby making the inference crustacean absurd. CLARIFY is implemented in 1500 lines of Java.en_US
dc.description.statementofresponsibilityby Martin Marcel Couturier.en_US
dc.format.extent77 p.en_US
dc.language.isoengen_US
dc.publisherMassachusetts Institute of Technologyen_US
dc.rightsM.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.en_US
dc.rights.urihttp://dspace.mit.edu/handle/1721.1/7582en_US
dc.subjectElectrical Engineering and Computer Science.en_US
dc.titleDisambiguating words with self-organizing mapsen_US
dc.typeThesisen_US
dc.description.degreeM.Eng.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.identifier.oclc755091036en_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record