Particle physics has an ambitious and broad experimental programme for the
coming decades. This programme requires large investments in detector hardware,
either to build new facilities and experiments, or to upgrade existing ones.
Similarly, it requires commensurate investment in the R&D of software to
acquire, manage, process, and analyse the sheer amounts of data to be recorded.
In planning for the HL-LHC in particular, it is critical that all of the
collaborating stakeholders agree on the software goals and priorities, and that
the efforts complement each other. In this spirit, this white paper describes
the R&D activities required to prepare for this software upgrade.
The LHC experiments are designed to detect large numbers of physics events
produced at a very high rate. With the planned upgrades, the data acquisition
rate will become even higher, and new computing paradigms must be adopted for
fast data processing. General-purpose computing on graphics processing units
(GPGPU) is a novel approach based on massively parallel computing. The
computing power provided by graphics processing units (GPUs) is expected to
reduce the computation time and to speed up the low-latency applications used
for fast decision making. In particular, this approach could be used for
high-level triggering in very complex environments, such as the inner
tracking systems of the multi-purpose experiments at the LHC, where a large
number of charged-particle tracks will be produced after the luminosity
upgrade. In this article we discuss a track pattern recognition algorithm
based on the Hough Transform, for which a parallel approach is expected to
dramatically reduce the execution time.
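
To illustrate the voting idea behind a Hough Transform, the following is a
minimal sketch, not the algorithm discussed in the article. It assumes a
simplified straight-line track model in two dimensions, rho = x cos(theta) +
y sin(theta), whereas tracks in a real inner tracker are curved by the magnetic
field. The function names and the parameters n_theta and rho_bin are
hypothetical choices made for this example.

```python
import math
from collections import Counter


def hough_votes(hits, n_theta=180, rho_bin=0.1):
    """Fill a sparse (theta index, rho index) accumulator from 2D hits.

    Assumption: straight-line tracks, rho = x*cos(theta) + y*sin(theta).
    """
    accumulator = Counter()
    for x, y in hits:
        # Each hit votes once per theta bin. The votes of different hits
        # are independent of each other, which is the property a parallel
        # implementation can exploit.
        for i_theta in range(n_theta):
            theta = math.pi * i_theta / n_theta
            rho = x * math.cos(theta) + y * math.sin(theta)
            accumulator[(i_theta, round(rho / rho_bin))] += 1
    return accumulator


def best_candidate(hits, n_theta=180, rho_bin=0.1):
    """Return (theta, rho, votes) for the most populated accumulator bin."""
    accumulator = hough_votes(hits, n_theta, rho_bin)
    (i_theta, i_rho), votes = accumulator.most_common(1)[0]
    return math.pi * i_theta / n_theta, i_rho * rho_bin, votes


# Four hits on the line y = x plus two unrelated noise hits.
hits = [(1.0, 1.0), (2.0, 2.0), (3.0, 3.0), (4.0, 4.0), (1.0, 3.0), (4.0, 0.5)]
theta, rho, votes = best_candidate(hits)
```

In this sketch the accumulator is filled sequentially. Because every
(hit, theta) pair contributes one independent vote, the same loop could in
principle be distributed over many GPU threads, with the accumulator updated
atomically.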
The automation of operations is essential to reduce manpower costs and
improve the reliability of the system. The Site Status Board (SSB) is a
framework which allows Virtual Organizations to monitor their computing
activities at distributed sites and to evaluate site performance. The ATLAS
experiment uses the SSB intensively for distributed computing shifts, for
estimating data processing and data transfer efficiencies at a particular site,
and for automatically excluding sites from computing activities in case of
potential problems. The ATLAS SSB provides a real-time aggregated
monitoring view and keeps the history of the monitoring metrics. Based on this
history, the usability of a site from the perspective of ATLAS is calculated.
The paper will describe how the SSB is integrated into the ATLAS operations and
computing infrastructure and will cover implementation details of the ATLAS SSB
sensors and of the alarm system that is based on the information in the SSB. It
will demonstrate the positive impact of the use of the SSB on the overall
performance of ATLAS computing activities and will give an overview of future
plans.
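
As a rough illustration of how a usability figure could be derived from a
metric history and used to exclude a site automatically, consider the sketch
below. It is not the SSB implementation: the Sample record, the definition of
usability as the time-weighted fraction of "ok" intervals, and the 0.8
threshold are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass
class Sample:
    """One interval of a site's metric history (hypothetical record)."""
    start: float  # interval start, in seconds
    end: float    # interval end, in seconds
    ok: bool      # True if the metric was in a good state


def usability(history):
    """Time-weighted fraction of the history spent in a good state.

    Returns None when the history covers no time at all.
    """
    total = sum(s.end - s.start for s in history)
    if total <= 0:
        return None
    good = sum(s.end - s.start for s in history if s.ok)
    return good / total


def should_exclude(history, threshold=0.8):
    """Flag a site for exclusion if its usability is below the threshold.

    The threshold is an assumed value, not one taken from ATLAS operations.
    """
    value = usability(history)
    return value is not None and value < threshold


# One hour good, one hour bad, two hours good: usability is 3/4.
history = [
    Sample(0, 3600, True),
    Sample(3600, 7200, False),
    Sample(7200, 14400, True),
]
site_usability = usability(history)
```

A site with no recorded history is deliberately not excluded in this sketch,
since missing monitoring data is a different condition from poor performance.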