dc.contributor.advisor: Michael Collins. [en_US]
dc.contributor.author: Wheeler, Dan Lowe [en_US]
dc.contributor.other: Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science. [en_US]
dc.date.accessioned: 2009-08-26T16:44:27Z
dc.date.available: 2009-08-26T16:44:27Z
dc.date.copyright: 2008 [en_US]
dc.date.issued: 2008 [en_US]
dc.identifier.uri: http://hdl.handle.net/1721.1/46535
dc.description: Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2008. [en_US]
dc.description: Includes bibliographical references (p. 80-82). [en_US]
dc.description.abstract: Language pairs such as Chinese and English with largely differing word order have proved to be one of the greatest challenges in statistical machine translation. One reason is that such techniques usually work with sentences as flat strings of words, rather than explicitly attempting to parse any sort of hierarchical structural representation. Because even simple syntactic differences between languages can quickly lead to a universe of idiosyncratic surface level word reordering rules, many believe the near future of machine translation will lie heavily in syntactic modeling. The time to start may be now: advances in statistical parsing over the last decade have already started opening the door. Following the work of Cowan et al., I present a statistical tree-to-tree translation system for Chinese to English that formulates the translation step as a prediction of English clause structure from Chinese clause structure. Chinese sentences are segmented and parsed, split into clauses, and independently translated into English clauses using a discriminative feature based model. Clausal arguments, such as subject and object, are translated separately using an off-the-shelf phrase-based translator. By explicitly modeling syntax at a clausal level, but using a phrase-based (flat-sentence) method on local, reduced expressions, such as clausal arguments, I aim to address the current weakness in long-distance word reordering while still leveraging the excellent local translations that today's state of the art has to offer. [en_US]
dc.description.statementofresponsibility: by Dan Lowe Wheeler. [en_US]
dc.format.extent: 293 p. [en_US]
dc.language.iso: eng [en_US]
dc.publisher: Massachusetts Institute of Technology [en_US]
dc.rights: M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. [en_US]
dc.rights.uri: http://dspace.mit.edu/handle/1721.1/7582 [en_US]
dc.subject: Electrical Engineering and Computer Science. [en_US]
dc.title: Machine translation through clausal syntax : a statistical approach for Chinese to English [en_US]
dc.title.alternative: Incorporating syntax into machine translation : a statistical approach for Chinese to English [en_US]
dc.type: Thesis [en_US]
dc.description.degree: M.Eng. [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
dc.identifier.oclc: 416602511 [en_US]

