Selective Sharing for Multilingual Dependency Parsing

Naseem, Tahira; Barzilay, Regina; Globerson, Amir

Author(s)

Naseem, Tahira; Barzilay, Regina; Globerson, Amir

DownloadBarzilay_Selective sharing.pdf (320.2Kb)

OPEN_ACCESS_POLICY

Terms of use

Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/

Metadata

Show full item record

Abstract

We present a novel algorithm for multilingual dependency parsing that uses annotations from a diverse set of source languages to parse a new unannotated language. Our motivation is to broaden the advantages of multilingual learning to languages that exhibit significant differences from existing resource-rich languages. The algorithm learns which aspects of the source languages are relevant for the target language and ties model parameters accordingly. The model factorizes the process of generating a dependency tree into two steps: selection of syntactic dependents and their ordering. Being largely language-universal, the selection component is learned in a supervised fashion from all the training languages. In contrast, the ordering decisions are only influenced by languages with similar properties. We systematically model this cross-lingual sharing using typological features. In our experiments, the model consistently outperforms a state-of-the-art multilingual parser. The largest improvement is achieved on the non Indo-European languages yielding a gain of 14.4%.

Date issued

2012-07

URI

http://hdl.handle.net/1721.1/85954

Department

Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory; Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science

Journal

Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics

Publisher

The Association for Computational Linguistics

Citation

Naseem, Tahira, Regina Barzilay, and Amir Globerson. 2012. Selective Sharing for Multilingual Dependency Parsing. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, Republic of Korea, 8-14 July 2012, 629-637.

Version: Author's final manuscript

ISBN

978-1-937284-24-4

Collections

MIT Open Access Articles

DSpace@MIT