dc.contributor.author: Berwick, Robert C.
dc.contributor.author: Malioutov, Igor Mikhailovich
dc.date.accessioned: 2012-06-15T13:38:38Z
dc.date.available: 2012-06-15T13:38:38Z
dc.date.issued: 2011-01
dc.date.submitted: 2010-12
dc.identifier.isbn: 978-1-4244-8134-7
dc.identifier.uri: http://hdl.handle.net/1721.1/71163
dc.description.abstract: Statistically-based parsers for large corpora, in particular the Penn Tree Bank (PTB), typically have not used all the linguistic information encoded in the annotated trees on which they are trained. In particular, they have not in general used information that records the effects of derivations, such as empty categories and the representation of displaced phrases, as is the case with passive, topicalization, and wh-constructions. Here we explore ways to use this information to “unwind” derivations, yielding a regularized underlying syntactic structure that can be used as an additional source of information for more accurate parsing. In effect, we make use of two joint sets of tree structures for parsing: the surface structure and its corresponding underlying structure where arguments have been restored to their canonical positions. We present a pilot experiment on passives in the PTB indicating that through the use of these two syntactic representations we can improve overall parsing performance by exploiting transformational regularities, in this way paring down the search space of possible syntactic analyses. (en_US)
dc.language.iso: en_US
dc.publisher: Institute of Electrical and Electronics Engineers (IEEE) (en_US)
dc.relation.isversionof: http://dx.doi.org/10.1109/ISDA.2010.5687043 (en_US)
dc.rights: Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. (en_US)
dc.source: IEEE (en_US)
dc.title: Improving statistical parsing by linguistic regularization (en_US)
dc.type: Article (en_US)
dc.identifier.citation: Malioutov, Igor, and Robert C. Berwick. “Improving Statistical Parsing by Linguistic Regularization.” IEEE, 2010. 1071–1076. © Copyright 2010 IEEE (en_US)
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (en_US)
dc.contributor.approver: Berwick, Robert C.
dc.contributor.mitauthor: Berwick, Robert C.
dc.contributor.mitauthor: Malioutov, Igor Mikhailovich
dc.relation.journal: 10th International Conference on Intelligent Systems Design and Applications, 2010 (en_US)
dc.eprint.version: Final published version (en_US)
dc.type.uri: http://purl.org/eprint/type/ConferencePaper (en_US)
dspace.orderedauthors: Malioutov, Igor; Berwick, Robert C. (en)
dc.identifier.orcid: https://orcid.org/0000-0002-1061-1871
dc.identifier.orcid: https://orcid.org/0000-0002-9207-4888
mit.license: PUBLISHER_POLICY (en_US)
mit.metadata.status: Complete