Show simple item record

dc.contributor.author	Jorgensen, Steven
dc.contributor.author	Nadizar, Giorgia
dc.contributor.author	Pietropolli, Gloria
dc.contributor.author	Manzoni, Luca
dc.contributor.author	Medvet, Eric
dc.contributor.author	O'Reilly, Una-May
dc.contributor.author	Hemberg, Erik
dc.date.accessioned	2025-12-04T23:07:23Z
dc.date.available	2025-12-04T23:07:23Z
dc.date.issued	2025-10-31
dc.identifier.issn	2688-3007
dc.identifier.uri	https://hdl.handle.net/1721.1/164207
dc.description.abstract	Curriculum learning (CL) consists of using a diverse set of user-provided test cases, with varying levels of difficulty and organized in a suitable progression, for learning a policy. The quality of test cases is important to allow optimization techniques such as genetic programming (GP) to solve policy search problems. In this work, we evaluate large language models (LLMs) as providers of test cases for GP-based policy search. We consider two policy search tasks, a single-player and a multi-player game, and four LLMs differing in complexity and specialization, which we prompt to generate suitable test cases for the two games. We experimentally assess the intrinsic quality of LLM-generated test cases and their utility when inserted in a curriculum consumed by a GP optimization. We evaluate the robustness of the approach with respect to the way cases are scheduled in curricula and with respect to the policy representation, for which we use both graphs and linear programs evolved by GP. We observe that the effectiveness of LLM-assisted CL depends on both the choice of LLM and the design of the prompting and scheduling strategies. These findings highlight important considerations for leveraging LLMs in automated curriculum design for GP-based optimization.	en_US
dc.publisher	ACM	en_US
dc.relation.isversionof	http://dx.doi.org/10.1145/3772718	en_US
dc.rights	Article is made available in accordance with the publisher's policy and may be subject to US copyright law. Please refer to the publisher's site for terms of use.	en_US
dc.source	Association for Computing Machinery	en_US
dc.title	Policy Search through Genetic Programming and LLM-assisted Curriculum Learning	en_US
dc.type	Article	en_US
dc.identifier.citation	Steven Jorgensen, Giorgia Nadizar, Gloria Pietropolli, Luca Manzoni, Eric Medvet, Una-May O'Reilly, and Erik Hemberg. 2025. Policy Search through Genetic Programming and LLM-assisted Curriculum Learning. ACM Trans. Evol. Learn. Optim.	en_US
dc.contributor.department	Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory	en_US
dc.relation.journal	ACM Transactions on Evolutionary Learning and Optimization	en_US
dc.identifier.mitlicense	PUBLISHER_POLICY
dc.eprint.version	Final published version	en_US
dc.type.uri	http://purl.org/eprint/type/JournalArticle	en_US
eprint.status	http://purl.org/eprint/status/PeerReviewed	en_US
dc.date.updated	2025-11-01T07:58:52Z
dc.language.rfc3066	en
dc.rights.holder	The author(s)
dspace.date.submission	2025-11-01T07:58:53Z
mit.license	PUBLISHER_POLICY
mit.metadata.status	Authority Work and Publication Information Needed	en_US

