Show simple item record

dc.contributor.authorSchuster, Tal
dc.contributor.authorSchuster, Roei
dc.contributor.authorShah, Darsh J
dc.contributor.authorBarzilay, Regina
dc.date.accessioned2021-10-27T20:23:24Z
dc.date.available2021-10-27T20:23:24Z
dc.date.issued2020
dc.identifier.urihttps://hdl.handle.net/1721.1/135419
dc.description.abstract© 2020 Association for Computational Linguistics. Recent developments in neural language models (LMs) have raised concerns about their potential misuse for automatically spreading misinformation. In light of these concerns, several studies have proposed to detect machine-generated fake news by capturing their stylistic differences from human-written text. These approaches, broadly termed stylometry, have found success in source attribution and misinformation detection in human-written texts. However, in this work, we show that stylometry is limited against machine-generated misinformation. Whereas humans speak differently when trying to deceive, LMs generate stylistically consistent text, regardless of underlying motive. Thus, though stylometry can successfully prevent impersonation by identifying text provenance, it fails to distinguish legitimate LM applications from those that introduce false information. We create two benchmarks demonstrating the stylistic similarity between malicious and legitimate uses of LMs, utilized in auto-completion and editing-assistance settings.1 Our findings highlight the need for non-stylometry approaches in detecting machine-generated misinformation, and open up the discussion on the desired evaluation benchmarks.
dc.language.isoen
dc.publisherMIT Press - Journals
dc.relation.isversionof10.1162/COLI_A_00380
dc.rightsCreative Commons Attribution-NonCommercial-NoDerivs License
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.sourceMIT Press
dc.titleThe Limitations of Stylometry for Detecting Machine-Generated Fake News
dc.typeArticle
dc.contributor.departmentMassachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
dc.relation.journalComputational Linguistics
dc.eprint.versionFinal published version
dc.type.urihttp://purl.org/eprint/type/JournalArticle
eprint.statushttp://purl.org/eprint/status/PeerReviewed
dc.date.updated2020-12-01T17:58:51Z
dspace.orderedauthorsSchuster, T; Schuster, R; Shah, DJ; Barzilay, R
dspace.date.submission2020-12-01T17:58:54Z
mit.journal.volume46
mit.journal.issue2
mit.licensePUBLISHER_CC
mit.metadata.statusAuthority Work and Publication Information Needed


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record