AI-Augmented Predictions: LLM Assistants Improve  Human Forecasting Accuracy

Schoenegger, Philipp; Park, Peter; Karger, Ezra; Trott, Sean; Tetlock, Philip

dc.contributor.author	Schoenegger, Philipp
dc.contributor.author	Park, Peter
dc.contributor.author	Karger, Ezra
dc.contributor.author	Trott, Sean
dc.contributor.author	Tetlock, Philip
dc.date.accessioned	2025-01-24T20:49:23Z
dc.date.available	2025-01-24T20:49:23Z
dc.date.submitted	2024-12-13
dc.identifier.issn	2160-6455
dc.identifier.uri	https://hdl.handle.net/1721.1/158063
dc.description.abstract	Large language models (LLMs) match and sometimes exceed human performance in many domains. This study explores the potential of LLMs to augment human judgment in a forecasting task. We evaluate the effect on human forecasters of two LLM assistants: one designed to provide high-quality ("superforecasting") advice, and the other designed to be overconfident and base-rate neglecting, thus providing noisy forecasting advice. We compare participants using these assistants to a control group that received a less advanced model that did not provide numerical predictions or engage in explicit discussion of predictions. Participants (N = 991) answered a set of six forecasting questions and had the option to consult their assigned LLM assistant throughout. Our preregistered analyses show that interacting with each of our frontier LLM assistants significantly enhances prediction accuracy by between 24% and 28% compared to the control group. Exploratory analyses showed a pronounced outlier effect in one forecasting item, without which we find that the superforecasting assistant increased accuracy by 41%, compared with 29% for the noisy assistant. We further examine whether LLM forecasting augmentation disproportionately benefits less skilled forecasters, degrades the wisdom-of-the-crowd by reducing prediction diversity, or varies in effectiveness with question difficulty. Our data do not consistently support these hypotheses. Our results suggest that access to a frontier LLM assistant, even a noisy one, can be a helpful decision aid in cognitively demanding tasks compared to a less powerful model that does not provide specific forecasting advice. However, the effects of outliers suggest that further research into the robustness of this pattern is needed.	en_US
dc.publisher	ACM	en_US
dc.relation.isversionof	http://dx.doi.org/10.1145/3707649	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Association for Computing Machinery	en_US
dc.title	AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy	en_US
dc.type	Article	en_US
dc.identifier.citation	Schoenegger, Philipp, Park, Peter, Karger, Ezra, Trott, Sean and Tetlock, Philip. "AI-Augmented Predictions: LLM Assistants Improve Human Forecasting Accuracy." ACM Transactions on Interactive Intelligent Systems.
dc.relation.journal	ACM Transactions on Interactive Intelligent Systems	en_US
dc.identifier.mitlicense	PUBLISHER_CC
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dc.date.updated	2025-01-01T08:46:22Z
dc.language.rfc3066	en
dc.rights.holder	The author(s)
dspace.date.submission	2025-01-01T08:46:22Z
mit.license	PUBLISHER_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

Files in this item

Name:: license_rdf
Size:: 40bytes
Format:: application/rdf+xml

View/Open

Name:: 3707649.pdf
Size:: 3.401Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

MIT Open Access Articles

Show simple item record