Author(s)
|
Villaplana Perez, Miguel (INFN Milano and Universita' di Milano, Dipartimento di Fisica) ; Alexandrov, Evgeny (Joint Institute for Nuclear Research) ; Aleksandrov, Igor (Joint Institute for Nuclear Research) ; Baranowski, Zbigniew (CERN) ; Barberis, Dario (INFN Genova and Universita' di Genova, Dipartimento di Fisica) ; Dimitrov, Gancho (European Laboratory for Particle Physics, CERN) ; Fernandez Casani, Alvaro (Instituto de Fisica Corpuscular (IFIC), Centro Mixto Universidad de Valencia - CSIC) ; Gallas, Elizabeth (University of Oxford, Particle Physics) ; Garcia Montoro, Carlos (Instituto de Fisica Corpuscular (IFIC), Centro Mixto Universidad de Valencia - CSIC) ; Gonzalez de la Hoz, Santiago (Instituto de Fisica Corpuscular (IFIC), Centro Mixto Universidad de Valencia - CSIC) ; Hrivnac, Julius (LAL, Univ. Paris-Sud, IN2P3/CNRS, Universite Paris-Saclay) ; Iakovlev, Alexander (Joint Institute for Nuclear Research) ; Kazymov, Andrei (Joint Institute for Nuclear Research) ; Mineev, Mikhail (Joint Institute for Nuclear Research) ; Prokoshin, Fedor (Joint Institute for Nuclear Research) ; Rybkin, Grigori (LAL, Univ. Paris-Sud, IN2P3/CNRS, Universite Paris-Saclay) ; Sánchez, Javier (Instituto de Fisica Corpuscular (IFIC), Centro Mixto Universidad de Valencia - CSIC) ; Salt, José (Instituto de Fisica Corpuscular (IFIC), Centro Mixto Universidad de Valencia - CSIC) ; Vasileva, Petya Tsvetanova (The University of Texas at Arlington) |
Abstract
| The ATLAS experiment has so far produced hundreds of petabytes of data and expects to produce an order of magnitude more in the future. These data are spread among hundreds of computing Grid sites around the world. The EventIndex is the complete catalogue of all ATLAS events, real and simulated, keeping the references to all permanent files that contain a given event in any processing stage. It provides the means to select and access event data in the ATLAS distributed storage system, and supports completeness and consistency checks as well as trigger and offline selection overlap studies. The EventIndex employs various data handling technologies, such as Hadoop and Oracle databases, and is integrated with other systems of the ATLAS distributed computing infrastructure, including those for data, metadata, and production management. The project has been in operation since the start of LHC Run 2 in 2015, and is under continuous development in order to meet production and analysis demands and to follow the evolution of the underlying technologies. The main data store in Hadoop, based on MapFiles and HBase, has worked well during Run 2, but new solutions are being explored for the future. This paper reports on the current system performance and on studies of a new data storage prototype that can carry the EventIndex through Run 3. |