dc.contributor.advisor | Tommi Jaakkola. | en_US |
dc.contributor.author | Dhandhania, Keshav | en_US |
dc.contributor.other | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science. | en_US |
dc.date.accessioned | 2014-11-04T21:37:04Z | |
dc.date.available | 2014-11-04T21:37:04Z | |
dc.date.issued | 2014 | en_US |
dc.identifier.uri | http://hdl.handle.net/1721.1/91443 | |
dc.description | Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Computer Science, June 2014. | en_US |
dc.description | "May 23, 2014." Cataloged from PDF version of thesis. | en_US |
dc.description | Includes bibliographical references (pages 35-38). | en_US |
dc.description.abstract | In this paper, we aim to learn a semantic database from a text corpus. Specifically, we focus on predicting whether a pair of entities is related by the hypernym relation, also known as the 'is-a' or 'type-of' relation. We learn a neural network model for this task. The model takes as input a description of the words and of the context in the text corpus in which a pair of nouns (entities) occurs. Among other things, this description includes pre-trained embeddings of the words. We show that the model is able to predict hypernym noun pairs even though the dataset includes many incorrectly labeled noun pairs. Finally, we suggest ways to improve the dataset and the method. | en_US |
dc.description.statementofresponsibility | by Keshav Dhandhania. | en_US |
dc.format.extent | 38 pages | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Massachusetts Institute of Technology | en_US |
dc.rights | M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. | en_US |
dc.rights.uri | http://dspace.mit.edu/handle/1721.1/7582 | en_US |
dc.subject | Electrical Engineering and Computer Science. | en_US |
dc.title | Learning a semantic database from unstructured text | en_US |
dc.type | Thesis | en_US |
dc.description.degree | M. Eng. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | |
dc.identifier.oclc | 893679084 | en_US |