RL4CO: An Extensive Reinforcement Learning for Combinatorial Optimization Benchmark

Berto, Federico; Hua, Chuanbo; Park, Junyoung; Luttmann, Laurin; Ma, Yining; Bu, Fanchen; Wang, Jiarui; Ye, Haoran; Kim, Minsu; Choi, Sanghyeok; Zepeda, Nayeli; Hottung, Andr?; Zhou, Jianan; Bi, Jieyi; Hu, Yu; Liu, Fei; Kim, Hyeonah; Son, Jiwoo; Kim, Haeyeon; Angioni, Davide; Kool, Wouter

Author(s)

Berto, Federico; Hua, Chuanbo; Park, Junyoung; Luttmann, Laurin; Ma, Yining; ... Show more

Download3711896.3737433.pdf (2.845Mb)

Publisher with Creative Commons License

Terms of use

Creative Commons Attribution https://creativecommons.org/licenses/by/4.0/

Metadata

Show full item record

Abstract

Combinatorial optimization (CO) is fundamental to several real-world applications, from logistics and scheduling to hardware design and resource allocation. Deep reinforcement learning (RL) has recently shown significant benefits in solving CO problems, reducing reliance on domain expertise and improving computational efficiency. However, the absence of a unified benchmarking framework leads to inconsistent evaluations, limits reproducibility, and increases engineering overhead, raising barriers to adoption for new researchers. To address these challenges, we introduce RL4CO, a unified and extensive benchmark with in-depth library coverage of 27 CO problem environments and 23 state-of-the-art baselines. Built on efficient software libraries and best practices in implementation, RL4CO features modularized implementation and flexible configurations of diverse environments, policy architectures, RL algorithms, and utilities with extensive documentation. RL4CO helps researchers build on existing successes while exploring and developing their own designs, facilitating the entire research process by decoupling science from heavy engineering. We finally provide extensive benchmark studies to inspire new insights and future work. RL4CO has already attracted numerous researchers in the community and is open-sourced at https://github.com/ai4co/rl4co.

Description

KDD ’25, Toronto, ON, Canada

Date issued

2025-08-03

URI

https://hdl.handle.net/1721.1/162622

Department

Massachusetts Institute of Technology. Department of Civil and Environmental Engineering; Massachusetts Institute of Technology. Laboratory for Information and Decision Systems

Publisher

ACM|Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2

Citation

Federico Berto, Chuanbo Hua, Junyoung Park, Laurin Luttmann, Yining Ma, Fanchen Bu, Jiarui Wang, Haoran Ye, Minsu Kim, Sanghyeok Choi, Nayeli Gast Zepeda, André Hottung, Jianan Zhou, Jieyi Bi, Yu Hu, Fei Liu, Hyeonah Kim, Jiwoo Son, Haeyeon Kim, Davide Angioni, Wouter Kool, Zhiguang Cao, Qingfu Zhang, Joungho Kim, Jie Zhang, Kijung Shin, Cathy Wu, Sungsoo Ahn, Guojie Song, Changhyun Kwon, Kevin Tierney, Lin Xie, and Jinkyoo Park. 2025. RL4CO: An Extensive Reinforcement Learning for Combinatorial Optimization Benchmark. In Proceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2 (KDD '25). Association for Computing Machinery, New York, NY, USA, 5278–5289.

Version: Final published version

ISBN

979-8-4007-1454-2

Collections

MIT Open Access Articles