Notice

This is not the latest version of this item. The latest version can be found at:https://dspace.mit.edu/handle/1721.1/138277.2

Show simple item record

dc.contributor.authorEisape, Tiwalayo
dc.contributor.authorZaslavsky, Noga
dc.contributor.authorLevy, Roger
dc.date.accessioned2021-12-01T17:36:44Z
dc.date.available2021-12-01T17:36:44Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/1721.1/138277
dc.description.abstractContemporary autoregressive language models (LMs) trained purely on corpus data have been shown to capture numerous features of human incremental processing. However, past work has also suggested dissociations between corpus probabilities and human next-word predictions. Here we evaluate several state-of-theart language models for their match to human next-word predictions and to reading time behavior from eye movements. We then propose a novel method for distilling the linguistic information implicit in human linguistic predictions into pre-trained LMs: Cloze Distillation. We apply this method to a baseline neural LM and show potential improvement in reading time prediction and generalization to held-out human cloze data.en_US
dc.language.isoen
dc.publisherAssociation for Computational Linguistics (ACL)en_US
dc.relation.isversionof10.18653/V1/2020.CONLL-1.49en_US
dc.rightsCreative Commons Attribution 4.0 International licenseen_US
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/en_US
dc.sourceAssociation for Computational Linguisticsen_US
dc.titleCloze Distillation: Improving Neural Language Models with Human Next-Word Predictionen_US
dc.typeArticleen_US
dc.identifier.citationEisape, Tiwalayo, Zaslavsky, Noga and Levy, Roger. 2020. "Cloze Distillation: Improving Neural Language Models with Human Next-Word Prediction." Proceedings of the 24th Conference on Computational Natural Language Learning.
dc.relation.journalProceedings of the 24th Conference on Computational Natural Language Learningen_US
dc.eprint.versionFinal published versionen_US
dc.type.urihttp://purl.org/eprint/type/ConferencePaperen_US
eprint.statushttp://purl.org/eprint/status/NonPeerRevieweden_US
dc.date.updated2021-12-01T17:32:40Z
dspace.orderedauthorsEisape, T; Zaslavsky, N; Levy, Ren_US
dspace.date.submission2021-12-01T17:32:41Z
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Neededen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record

VersionItemDateSummary

*Selected version