Combining phrase-based and tree-to-tree translation
Author(s)
Lieberman, Michael (Michael R.)
DownloadFull printable version (143.6Kb)
Other Contributors
Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.
Advisor
Michael Collins.
Terms of use
Metadata
Show full item recordAbstract
We present a novel approach to multi-engine machine translation, using a feature-based classification algorithm. Instead of just using language models, translation models, or internal confidence scores, we sought out other features that could be used to determine which of two translations to select. We combined the outputs from a phrase-based system, Moses [Koehn et al., 2007] and a tree-to-tree system [Cowan et al., 2006]. Our main result is a 0.3 to 0.4 improvement in BLEU score over the best single system used, while also improving fluency and adequacy judgments. In addition, we used the same setup to directly predict which sentences would be judged by humans to be more fluent and more adequate. In those domains, we predicted the better sentence 6% to 7% more often than a baseline of always choosing the single best system.
Description
Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2008. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Includes bibliographical references (p. 39-40).
Date issued
2008Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer SciencePublisher
Massachusetts Institute of Technology
Keywords
Electrical Engineering and Computer Science.