Sepia : semantic parsing for named entities

Marton, Gregory A. (Gregory Adam), 1977-

dc.contributor.advisor	Boris Katz.	en_US
dc.contributor.author	Marton, Gregory A. (Gregory Adam), 1977-	en_US
dc.contributor.other	Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.	en_US
dc.date.accessioned	2005-09-26T19:51:39Z
dc.date.available	2005-09-26T19:51:39Z
dc.date.copyright	2003	en_US
dc.date.issued	2004	en_US
dc.identifier.uri	http://hdl.handle.net/1721.1/28336
dc.description	Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, February 2004.	en_US
dc.description	Includes bibliographical references (p. 123-129).	en_US
dc.description.abstract	People's names, dates, locations, organizations, and various numeric expressions, collectively called Named Entities, are used to convey specific meanings to humans in the same way that identifiers and constants convey meaning to a computer language interpreter. Natural Language Question Answering can benefit from understanding the meaning of these expressions because answers in a text are often phrased differently from questions and from each other. For example, "9/11" might mean the same as "September 11th" and "Mayor Rudy Giuliani" might be the same person as "Rudolph Giuliani". Sepia, the system presented here, uses a lexicon of lambda expressions and a mildly context-sensitive parser to create a data structure for each named entity. The parser and grammar design are inspired by Combinatory Categorial Grammar. The data structures are designed to capture semantic dependencies using common syntactic forms. Sepia differs from other natural language parsers in that it does not use a pipeline architecture. As yet there is no statistical component in the architecture. To evaluate Sepia, I use examples to illustrate its qualitative differences from other named entity systems, I measure component performance on Automatic Content Extraction (ACE) competition held-out training data, and I assess end-to-end performance in the Infolab's TREC-12 Question Answering competition entry. Sepia will compete in the ACE Entity Detection and Tracking track at the end of September.	en_US
dc.description.statementofresponsibility	by Gregory A. Marton.	en_US
dc.format.extent	129 p.	en_US
dc.format.extent	7400780 bytes
dc.format.extent	7417300 bytes
dc.format.mimetype	application/pdf
dc.format.mimetype	application/pdf
dc.language.iso	en_US
dc.publisher	Massachusetts Institute of Technology	en_US
dc.rights	M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission.	en_US
dc.rights.uri	http://dspace.mit.edu/handle/1721.1/7582
dc.subject	Electrical Engineering and Computer Science.	en_US
dc.title	Sepia : semantic parsing for named entities	en_US
dc.title.alternative	Semantic parsing for named entities	en_US
dc.type	Thesis	en_US
dc.description.degree	S.M.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.identifier.oclc	55675517	en_US

Files in this item

Name: 55675517-MIT.pdf
Size: 7.073Mb
Format: PDF
Description: Full printable version


This item appears in the following Collection(s)

Graduate Theses
