LLMs in Citation Intent Classification: Progress, Precision, and Reproducibility Challenges

Fogelson, Alex; Thompson, Neil; Trišović, Ana

dc.contributor.author	Fogelson, Alex
dc.contributor.author	Thompson, Neil
dc.contributor.author	Trišović, Ana
dc.date.accessioned	2025-11-26T15:32:02Z
dc.date.available	2025-11-26T15:32:02Z
dc.date.issued	2025-10-21
dc.identifier.isbn	979-8-4007-1958-5
dc.identifier.uri	https://hdl.handle.net/1721.1/164071
dc.description	ACM REP ’25, Vancouver, BC, Canada	en_US
dc.description.abstract	Understanding the intent behind scientific citations is critical for advancing scholarly search and knowledge mapping. This paper reflects on the methodological use of large language models (LLMs) for multi-class citation intent classification. Our experiments evaluating a diverse range of models and approaches reveal striking disagreement among state-of-the-art (SotA) systems. This inconsistency suggests that citation intent classification remains a challenging task for LLMs raising questions about the robustness, reliability and replicability of current methods. Moreover, our findings highlight a concerning dependency on proprietary LLMs that, even with access to compute resources, were necessary to achieve sufficient accuracy. This introduces new challenges, as silent updates, lack of versioning, and opaque training pipelines pose threats to methodological transparency and long-term reproducibility in LLMenabled research.	en_US
dc.publisher	ACM\|ACM Conference on Reproducibility and Replicability	en_US
dc.relation.isversionof	https://doi.org/10.1145/3736731.3746137	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Association for Computing Machinery	en_US
dc.title	LLMs in Citation Intent Classification: Progress, Precision, and Reproducibility Challenges	en_US
dc.type	Article	en_US
dc.identifier.citation	Alex Fogelson, Ana Trišović, and Neil Thompson. 2025. LLMs in Citation Intent Classification: Progress, Precision, and Reproducibility Challenges. In Proceedings of the 3rd ACM Conference on Reproducibility and Replicability (ACM REP '25). Association for Computing Machinery, New York, NY, USA, 250–253.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.identifier.mitlicense	PUBLISHER_POLICY
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/ConferencePaper	en_US
eprint.status	http://purl.org/eprint/status/NonPeerReviewed	en_US
dc.date.updated	2025-11-01T07:50:20Z
dc.language.rfc3066	en
dc.rights.holder	The author(s)
dspace.date.submission	2025-11-01T07:50:20Z
mit.license	PUBLISHER_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: 3736731.3746137.pdf
Size:: 872.4Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record