Show simple item record

dc.contributor.authorSnyder, Benjamin
dc.contributor.authorNaseem, Tahira
dc.contributor.authorEisenstein, Jacob
dc.contributor.authorBarzilay, Regina
dc.date.accessioned2010-10-07T13:12:43Z
dc.date.available2010-10-07T13:12:43Z
dc.date.issued2009-06
dc.date.submitted2009-06
dc.identifier.isbn978-1-932432-41-1
dc.identifier.urihttp://hdl.handle.net/1721.1/58926
dc.description.abstractWe investigate the problem of unsupervised part-of-speech tagging when raw parallel data is available in a large number of languages. Patterns of ambiguity vary greatly across languages and therefore even unannotated multilingual data can serve as a learning signal. We propose a non-parametric Bayesian model that connects related tagging decisions across languages through the use of multilingual latent variables. Our experiments show that performance improves steadily as the number of languages increases.en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (CAREER grant IIS-0448168)en_US
dc.description.sponsorshipNational Science Foundation (U.S.) (CAREER grant IIS- 0835445)en_US
dc.language.isoen_US
dc.publisherAssociation for Computational Linguisticsen_US
dc.relation.isversionofhttp://portal.acm.org/citation.cfm?id=1620754.1620767en_US
dc.rightsAttribution-Noncommercial-Share Alike 3.0 Unporteden_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/en_US
dc.sourceMIT web domainen_US
dc.titleAdding More Languages Improves Unsupervised Multilingual Part-of-Speech Tagging: A Bayesian Non-Parametric Approachen_US
dc.typeArticleen_US
dc.identifier.citationSnyder, Benjamin. et al. "Adding More Languages Improves Unsupervised Multilingual Part-of-Speech Tagging: A Bayesian Non-Parametric Approach." Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the ACL, pages 83–91, Boulder, Colorado, June 2009.en_US
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratoryen_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Electrical Engineering and Computer Scienceen_US
dc.contributor.approverBarzilay, Regina
dc.contributor.mitauthorSnyder, Benjamin
dc.contributor.mitauthorNaseem, Tahira
dc.contributor.mitauthorEisenstein, Jacob
dc.contributor.mitauthorBarzilay, Regina
dc.relation.journalProceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguisticsen_US
dc.eprint.versionAuthor's final manuscript
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/PeerRevieweden_US
dspace.orderedauthorsSnyder, Benjamin; Naseem, Tahira; Eisenstein, Jacob; Barzilay, Regina
dc.identifier.orcidhttps://orcid.org/0000-0002-2921-8201
mit.licenseOPEN_ACCESS_POLICYen_US
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record