| dc.contributor.author | Gourabathina, Abinitha | |
| dc.contributor.author | Gerych, Walter | |
| dc.contributor.author | Pan, Eileen | |
| dc.contributor.author | Ghassemi, Marzyeh | |
| dc.date.accessioned | 2025-12-22T20:55:09Z | |
| dc.date.available | 2025-12-22T20:55:09Z | |
| dc.date.issued | 2025-06-23 | |
| dc.identifier.isbn | 979-8-4007-1482-5 | |
| dc.identifier.uri | https://hdl.handle.net/1721.1/164428 | |
| dc.description | FAccT ’25, Athens, Greece | en_US |
| dc.description.abstract | The integration of large language models (LLMs) into clinical diagnostics necessitates a careful understanding of how clinically irrelevant aspects of user inputs directly influence generated treatment recommendations and, consequently, clinical outcomes for end-users. Building on prior research that examines the impact of demographic attributes on clinical LLM reasoning, this study explores how non-clinically relevant attributes shape clinical decision-making by LLMs. Through the perturbation of patient messages, we evaluate whether LLM behavior remains consistent, accurate, and unbiased when non-clinical information is altered. These perturbations assess the brittleness of clinical LLM reasoning by replicating structural errors that may occur when electronic systems process patient questions and by simulating patient interactions with AI systems across diverse, vulnerable patient groups. Our findings reveal notable inconsistencies in LLM treatment recommendations and significant degradation of clinical accuracy in ways that reduce care allocation to patients. Additionally, there are significant disparities in treatment recommendations between gender subgroups as well as between model-inferred gender subgroups. We also apply our perturbation framework to a conversational clinical dataset and find that, even in conversation, LLM clinical accuracy decreases post-perturbation, and disparities exist in how perturbations impact gender subgroups. By analyzing LLM outputs in response to realistic yet modified clinical contexts, our work deepens understanding of the sensitivity, inaccuracy, and biases inherent in medical LLMs, offering critical insights for the deployment of patient-AI systems. | en_US |
| dc.publisher | ACM|The 2025 ACM Conference on Fairness, Accountability, and Transparency | en_US |
| dc.relation.isversionof | https://doi.org/10.1145/3715275.3732121 | en_US |
| dc.rights | Creative Commons Attribution | en_US |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
| dc.source | Association for Computing Machinery | en_US |
| dc.title | The Medium is the Message: How Non-Clinical Information Shapes Clinical Decisions in LLMs | en_US |
| dc.type | Article | en_US |
| dc.identifier.citation | Abinitha Gourabathina, Walter Gerych, Eileen Pan, and Marzyeh Ghassemi. 2025. The Medium is the Message: How Non-Clinical Information Shapes Clinical Decisions in LLMs. In Proceedings of the 2025 ACM Conference on Fairness, Accountability, and Transparency (FAccT '25). Association for Computing Machinery, New York, NY, USA, 1805–1828. | en_US |
| dc.contributor.department | Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science | en_US |
| dc.identifier.mitlicense | PUBLISHER_POLICY | |
| dc.eprint.version | Final published version | en_US |
| dc.type.uri | http://purl.org/eprint/type/ConferencePaper | en_US |
| eprint.status | http://purl.org/eprint/status/NonPeerReviewed | en_US |
| dc.date.updated | 2025-08-01T08:34:53Z | |
| dc.language.rfc3066 | en | |
| dc.rights.holder | The author(s) | |
| dspace.date.submission | 2025-08-01T08:34:53Z | |
| mit.license | PUBLISHER_CC | |
| mit.metadata.status | Authority Work and Publication Information Needed | en_US |