Show simple item record

dc.contributor.authorHu, Jennifer
dc.contributor.authorGauthier, Jon
dc.contributor.authorQian, Peng
dc.contributor.authorLevy, Roger P
dc.date.accessioned2021-04-07T16:02:37Z
dc.date.available2021-04-07T16:02:37Z
dc.date.issued2020-07
dc.identifier.urihttps://hdl.handle.net/1721.1/130402
dc.description.abstractWhile state-of-the-art neural network models continue to achieve lower perplexity scores on language modeling benchmarks, it remains unknown whether optimizing for broad-coverage predictive performance leads to human-like syntactic knowledge. Furthermore, existing work has not provided a clear picture about the model properties required to produce proper syntactic generalizations. We present a systematic evaluation of the syntactic knowledge of neural language models, testing 20 combinations of model types and data sizes on a set of 34 English-language syntactic test suites. We find substantial differences in syntactic generalization performance by model architecture, with sequential models underperforming other architectures. Factorially manipulating model architecture and training dataset size (1M-40M words), we find that variability in syntactic generalization performance is substantially greater by architecture than by dataset size for the corpora tested in our experiments. Our results also reveal a dissociation between perplexity and syntactic generalization performance.en_US
dc.description.sponsorshipNational Institutes of Health (U.S.) (Award T32NS105587)en_US
dc.language.isoen
dc.publisherAssociation for Computational Linguistics (ACL)en_US
dc.relation.isversionof10.18653/V1/2020.ACL-MAIN.158en_US
dc.rightsArticle is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.en_US
dc.sourceAssociation for Computational Linguisticsen_US
dc.titleA Systematic Assessment of Syntactic Generalization in Neural Language Modelsen_US
dc.typeArticleen_US
dc.identifier.citationHu, Jennifer et al. “A Systematic Assessment of Syntactic Generalization in Neural Language Models.” Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (July 2020): 1725–1744 © 2020 The Author(s)en_US
dc.contributor.departmentMassachusetts Institute of Technology. Department of Brain and Cognitive Sciencesen_US
dc.relation.journalProceedings of the 58th Annual Meeting of the Association for Computational Linguisticsen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2021-04-07T14:41:25Z
dspace.orderedauthorsHu, J; Gauthier, J; Qian, P; Wilcox, E; Levy, Ren_US
dspace.date.submission2021-04-07T14:41:33Z
mit.licensePUBLISHER_POLICY
mit.metadata.statusComplete


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record