dc.contributor.author | Leekha, Rohan | |
dc.contributor.author | Vandam, Courtland | |
dc.date.accessioned | 2024-04-04T15:20:50Z | |
dc.date.available | 2024-04-04T15:20:50Z | |
dc.date.issued | 2023-11-06 | |
dc.identifier.isbn | 979-8-4007-0409-3 | |
dc.identifier.uri | https://hdl.handle.net/1721.1/154062 | |
dc.description | ASONAM '23, November 6–9, 2023, Kusadasi, Turkiye | en_US |
dc.description.abstract | Identifying changes in style can be used to detect multi-authored social media accounts, plagiarism, compromised accounts, and author contributions in long documents. We propose an approach to recognize changes in authorship using large language models. Our approach leverages sentence-level contextual embeddings and semantic relationships. First we expand the training set by adding adversarial examples to the minority class [5], [13], [17]. Then we fine-tune a sequence classification transformer model to detect style change. Our approach outperforms all baselines of PAN21 with macro F1-scores of 0.80, 0.74, and 0.70 for detecting style changepoint between paragraphs, closed-set author ID per paragraph, and style changepoint between sentences, respectively. Our approach also performs better than the leading competitors in PAN22. Also, we achieved a five percent improvement in macro F1-score (0.78) on the newly introduced DarkReddit+ dataset for authorship verification. | en_US |
dc.publisher | ACM | en_US |
dc.relation.isversionof | 10.1145/3625007.3627589 | en_US |
dc.rights | Creative Commons Attribution | en_US |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
dc.source | Association for Computing Machinery | en_US |
dc.title | A generalized solution to verify authorship and detect style change in multi-authored documents | en_US |
dc.type | Article | en_US |
dc.identifier.citation | Leekha, Rohan and Vandam, Courtland. 2023. "A generalized solution to verify authorship and detect style change in multi-authored documents." | |
dc.contributor.department | Lincoln Laboratory | |
dc.identifier.mitlicense | PUBLISHER_CC | |
dc.eprint.version | Final published version | en_US |
dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
dc.date.updated | 2024-04-01T07:47:47Z | |
dc.language.rfc3066 | en | |
dc.rights.holder | The author(s) | |
dspace.date.submission | 2024-04-01T07:47:48Z | |
mit.license | PUBLISHER_CC | |
mit.metadata.status | Authority Work and Publication Information Needed | en_US |