Nothing Special   »   [go: up one dir, main page]

CERN Accelerating science

If you experience any problem watching the video, click the download button below
Download Embed
CMS Note
Report number CMS-CR-2023-033
Title Stability of the CMS Submission Infrastructure for the LHC Run 3
Author(s) Perez-Calero Yzquierdo, Antonio Maria (Madrid, CIEMAT) ; Kizinevic, Edita (CERN) ; Khan, Farrukh Aftab (Fermilab) ; Kim, Hyunwoo (Fermilab) ; Mascheroni, Marco (UC, San Diego) ; Acosta Flechas, Maria (Fermilab) ; Tsipinakis, Nikos (CERN) ; Haleem, Saqib (Quaid-i-Azam U.)
Publication 2023
Collaboration CMS Collaboration
Imprint 11 Mar 2023
Number of pages 6
Presented at 21st International Workshop on Advanced Computing and Analysis Techniques in Physics Research, Bari, It, 24 - 28 Oct 2022
Subject category Detectors and Experimental Techniques
Accelerator/Facility, Experiment CERN LHC ; CMS
Keywords Computing
Abstract The CMS Submission Infrastructure is the main computing resource provisioning system for CMS workflows, including data processing, simulation and analysis. It currently aggregates nearly 400k CPU cores distributed worldwide from Grid, HPC and cloud providers. CMS Tier-0 tasks, such as data repacking and prompt reconstruction, critical for data-taking operations, are executed on a collection of computing resources at CERN, also managed by the CMS Submission Infrastructure. All this computing power is harnessed via a number of federated resource pools, supervised by HTCondor and GlideinWMS services. Elements such as pilot factories, job schedulers and connection brokers are deployed in high-availability mode across several ``availability zones'', providing stability to our services via hardware redundancy and numerous failover mechanisms. Right before the start of the LHC Run 3, the Submission Infrastructure stability was tested in a series of controlled exercises, performed without interruption of our services. These tests demonstrated the resilience of our systems, and additionally provided useful information in order to further refine our monitoring and alarming system. This report will describe the main elements in the CMS Submission Infrastructure design and deployment, along with the performed failover exercises, proving that our systems are ready to serve their critical role in support of CMS activities.

 


 Element opprettet 2023-04-03, sist endret 2023-04-03


Fulltekst:
Last ned fulltekst
PDF