Notice

This is not the latest version of this item. The latest version can be found at:https://dspace.mit.edu/handle/1721.1/138281.2

Show simple item record

dc.contributor.authorGauthier, Jon
dc.contributor.authorHu, Jennifer
dc.contributor.authorWilcox, Ethan
dc.contributor.authorQian, Peng
dc.contributor.authorLevy, Roger
dc.date.accessioned2021-12-01T17:49:30Z
dc.date.available2021-12-01T17:49:30Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/1721.1/138281
dc.description.abstractTargeted syntactic evaluations have yielded insights into the generalizations learned by neural network language models. However, this line of research requires an uncommon confluence of skills: both the theoretical knowledge needed to design controlled psycholinguistic experiments, and the technical proficiency needed to train and deploy large-scale language models. We present SyntaxGym, an online platform designed to make targeted evaluations accessible to both experts in NLP and linguistics, reproducible across computing environments, and standardized following the norms of psycholinguistic experimental design. This paper releases two tools of independent value for the computational linguistics community: 1. A website, syntaxgym.org, which centralizes the process of targeted syntactic evaluation and provides easy tools for analysis and visualization; 2. Two command-line tools, syntaxgym and lm-zoo, which allow any user to reproduce targeted syntactic evaluations and general language model inference on their own machine.en_US
dc.language.isoen
dc.publisherAssociation for Computational Linguistics (ACL)en_US
dc.relation.isversionof10.18653/V1/2020.ACL-DEMOS.10en_US
dc.rightsCreative Commons Attribution 4.0 International licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceAssociation for Computational Linguisticsen_US
dc.titleSyntaxGym: An Online Platform for Targeted Evaluation of Language Modelsen_US
dc.typeArticleen_US
dc.identifier.citationGauthier, Jon, Hu, Jennifer, Wilcox, Ethan, Qian, Peng and Levy, Roger. 2020. "SyntaxGym: An Online Platform for Targeted Evaluation of Language Models." Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations.
dc.relation.journalProceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrationsen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2021-12-01T17:47:34Z
dspace.orderedauthorsGauthier, J; Hu, J; Wilcox, E; Qian, P; Levy, Ren_US
dspace.date.submission2021-12-01T17:47:35Z
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version