| dc.contributor.author | Fogelson, Alex | |
| dc.contributor.author | Thompson, Neil | |
| dc.contributor.author | Trišović, Ana | |
| dc.date.accessioned | 2025-11-26T15:32:02Z | |
| dc.date.available | 2025-11-26T15:32:02Z | |
| dc.date.issued | 2025-10-21 | |
| dc.identifier.isbn | 979-8-4007-1958-5 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/164071 | |
| dc.description | ACM REP ’25, Vancouver, BC, Canada | en_US |
| dc.description.abstract | Understanding the intent behind scientific citations is critical for
advancing scholarly search and knowledge mapping. This paper
reflects on the methodological use of large language models (LLMs)
for multi-class citation intent classification. Our experiments evaluating a diverse range of models and approaches reveal striking
disagreement among state-of-the-art (SotA) systems. This inconsistency suggests that citation intent classification remains a challenging task for LLMs raising questions about the robustness, reliability
and replicability of current methods. Moreover, our findings highlight a concerning dependency on proprietary LLMs that, even
with access to compute resources, were necessary to achieve sufficient accuracy. This introduces new challenges, as silent updates,
lack of versioning, and opaque training pipelines pose threats to
methodological transparency and long-term reproducibility in LLMenabled research. | en_US |
| dc.publisher | ACM|ACM Conference on Reproducibility and Replicability | en_US |
| dc.relation.isversionof | https://doi.org/10.1145/3736731.3746137 | en_US |
| dc.rights | Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use. | en_US |
| dc.source | Association for Computing Machinery | en_US |
| dc.title | LLMs in Citation Intent Classification: Progress, Precision, and Reproducibility Challenges | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Alex Fogelson, Ana Trišović, and Neil Thompson. 2025. LLMs in Citation Intent Classification: Progress, Precision, and Reproducibility Challenges. In Proceedings of the 3rd ACM Conference on Reproducibility and Replicability (ACM REP '25). Association for Computing Machinery, New York, NY, USA, 250–253. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory | en_US |
| dc.identifier.mitlicense | PUBLISHER_POLICY | |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2025-11-01T07:50:20Z | |
| dc.language.rfc3066 | en | |
| dc.rights.holder | The author(s) | |
| dspace.date.submission | 2025-11-01T07:50:20Z | |
| mit.license | PUBLISHER_POLICY | |
| mit.metadata.status | Authority Work and Publication Information Needed | en_US |