dc.contributor.author: Gauthier, Jon
dc.contributor.author: Hu, Jennifer
dc.contributor.author: Wilcox, Ethan
dc.contributor.author: Qian, Peng
dc.contributor.author: Levy, Roger P
dc.date.accessioned: 2022-01-06T15:29:54Z
dc.date.available: 2021-12-01T17:49:30Z
dc.date.available: 2022-01-06T15:29:54Z
dc.date.issued: 2020
dc.identifier.uri: https://hdl.handle.net/1721.1/138281.2
dc.description.abstract: Targeted syntactic evaluations have yielded insights into the generalizations learned by neural network language models. However, this line of research requires an uncommon confluence of skills: both the theoretical knowledge needed to design controlled psycholinguistic experiments, and the technical proficiency needed to train and deploy large-scale language models. We present SyntaxGym, an online platform designed to make targeted evaluations accessible to both experts in NLP and linguistics, reproducible across computing environments, and standardized following the norms of psycholinguistic experimental design. This paper releases two tools of independent value for the computational linguistics community: 1. A website, syntaxgym.org, which centralizes the process of targeted syntactic evaluation and provides easy tools for analysis and visualization; 2. Two command-line tools, syntaxgym and lm-zoo, which allow any user to reproduce targeted syntactic evaluations and general language model inference on their own machine. [en_US]
dc.description.sponsorship: NIH (Award T32NS105587) [en_US]
dc.language.iso: en
dc.publisher: Association for Computational Linguistics (ACL) [en_US]
dc.relation.isversionof: 10.18653/V1/2020.ACL-DEMOS.10 [en_US]
dc.rights: Creative Commons Attribution 4.0 International license [en_US]
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/ [en_US]
dc.source: Association for Computational Linguistics [en_US]
dc.title: SyntaxGym: An Online Platform for Targeted Evaluation of Language Models [en_US]
dc.type: Article [en_US]
dc.identifier.citation: Gauthier, Jon, Hu, Jennifer, Wilcox, Ethan, Qian, Peng and Levy, Roger. 2020. "SyntaxGym: An Online Platform for Targeted Evaluation of Language Models." Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations. [en_US]
dc.contributor.department: Massachusetts Institute of Technology. Department of Brain and Cognitive Sciences [en_US]
dc.relation.journal: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations [en_US]
dc.eprint.version: Final published version [en_US]
dc.type.uri: http://purl.org/eprint/type/ConferencePaper [en_US]
eprint.status: http://purl.org/eprint/status/NonPeerReviewed [en_US]
dc.date.updated: 2021-12-01T17:47:34Z
dspace.orderedauthors: Gauthier, J; Hu, J; Wilcox, E; Qian, P; Levy, R [en_US]
dspace.date.submission: 2021-12-01T17:47:35Z
mit.license: PUBLISHER_CC
mit.metadata.status: Publication Information Needed [en_US]