dc.contributor.author | Snyder, Benjamin | |
dc.contributor.author | Naseem, Tahira | |
dc.contributor.author | Eisenstein, Jacob | |
dc.contributor.author | Barzilay, Regina | |
dc.date.accessioned | 2010-10-07T13:12:43Z | |
dc.date.available | 2010-10-07T13:12:43Z | |
dc.date.issued | 2009-06 | |
dc.date.submitted | 2009-06 | |
dc.identifier.isbn | 978-1-932432-41-1 | |
dc.identifier.uri | http://hdl.handle.net/1721.1/58926 | |
dc.description.abstract | We investigate the problem of unsupervised part-of-speech tagging when raw parallel data is available in a large number of languages. Patterns of ambiguity vary greatly across languages and therefore even unannotated multilingual data can serve as a learning signal. We propose a non-parametric Bayesian model that connects related tagging decisions across languages through the use of multilingual latent variables. Our experiments show that performance improves steadily as the number of languages increases. | en_US |
dc.description.sponsorship | National Science Foundation (U.S.) (CAREER grant IIS-0448168) | en_US |
dc.description.sponsorship | National Science Foundation (U.S.) (CAREER grant IIS- 0835445) | en_US |
dc.language.iso | en_US | |
dc.publisher | Association for Computational Linguistics | en_US |
dc.relation.isversionof | http://portal.acm.org/citation.cfm?id=1620754.1620767 | en_US |
dc.rights | Attribution-Noncommercial-Share Alike 3.0 Unported | en_US |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ | en_US |
dc.source | MIT web domain | en_US |
dc.title | Adding More Languages Improves Unsupervised Multilingual Part-of-Speech Tagging: A Bayesian Non-Parametric Approach | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Snyder, Benjamin. et al. "Adding More Languages Improves Unsupervised Multilingual Part-of-Speech Tagging: A Bayesian Non-Parametric Approach." Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the ACL, pages 83–91,
Boulder, Colorado, June 2009. | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
dc.contributor.approver | Barzilay, Regina | |
dc.contributor.mitauthor | Snyder, Benjamin | |
dc.contributor.mitauthor | Naseem, Tahira | |
dc.contributor.mitauthor | Eisenstein, Jacob | |
dc.contributor.mitauthor | Barzilay, Regina | |
dc.relation.journal | Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics | en_US |
dc.eprint.version | Author's final manuscript | |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/PeerReviewed | en_US |
dspace.orderedauthors | Snyder, Benjamin; Naseem, Tahira; Eisenstein, Jacob; Barzilay, Regina | |
dc.identifier.orcid | https://orcid.org/0000-0002-2921-8201 | |
mit.license | OPEN_ACCESS_POLICY | en_US |
mit.metadata.status | Complete | |