Dynamic Distribution of High-Rate Data Processing from CERN to Remote HPC Data Centers
Author(s)
Boccali, T.; Cameron, D.; Cardo, N.; Conciatore, D.; Di Girolamo, A.; Dissertori, G.; Fernandez, P.; Filipcic, A.; Gila, M.; Grab, C.; Elmsheuser, J.; Jankauskas, V.; ... Show more Show less
Download41781_2020_Article_52.pdf (1008.Kb)
Publisher with Creative Commons License
Publisher with Creative Commons License
Creative Commons Attribution
Terms of use
Metadata
Show full item recordAbstract
Abstract
The prompt reconstruction of the data recorded from the Large Hadron Collider (LHC) detectors has always been addressed by dedicated resources at the CERN Tier-0. Such workloads come in spikes due to the nature of the operation of the accelerator and in special high load occasions experiments have commissioned methods to distribute (spill-over) a fraction of the load to sites outside CERN. The present work demonstrates a new way of supporting the Tier-0 environment by provisioning resources elastically for such spilled-over workflows onto the Piz Daint Supercomputer at CSCS. This is implemented using containers, tuning the existing batch scheduler and reinforcing the scratch file system, while still using standard Grid middleware. ATLAS, CMS and CSCS have jointly run selected prompt data reconstruction on up to several thousand cores on Piz Daint into a shared environment, thereby probing the viability of the CSCS high performance computer site as on demand extension of the CERN Tier-0, which could play a role in addressing the future LHC computing challenges for the high luminosity LHC.
Date issued
2021-02-08Department
Massachusetts Institute of Technology. Department of PhysicsPublisher
Springer International Publishing
Citation
Computing and Software for Big Science. 2021 Feb 08;5(1):7
Version: Final published version