Particle physics has an ambitious and broad experimental programme for the
coming decades. This programme requires large investments in detector hardware,
either to build new facilities and experiments, or to upgrade existing ones.
Similarly, it requires commensurate investment in the R&D of software to
acquire, manage, process, and analyse the shear amounts of data to be recorded.
In planning for the HL-LHC in particular, it is critical that all of the
collaborating stakeholders agree on the software goals and priorities, and that
the efforts complement each other. In this spirit, this white paper describes
the R&D activities required to prepare for this software upgrade.
To study the performance of multi-threaded Geant4 for high-energy physics
experiments, an application has been developed which generalizes and extends
previous work. A highly-complex detector geometry is used for benchmarking on
an Intel Xeon Phi coprocessor. In addition, an implementation of parallel I/O
based on Intel SCIF and ROOT technologies is incorporated and studied.
With Geant4 Version 10.0, released in December 2013, one of the most widely
used Monte-Carlo codes has been ported to take full advantage of multi- and
many-core CPUs thanks to the introduction of event-level parallelism via
multithreading. In this paper we review recent developments to allow for a
better integration of parallel Geant4 jobs with external libraries. We have
chosen to develop examples using the popular Intel Threading Building Block
(for short TBB) as an alternative parallelization approach to the native Geant4
POSIX. To simplify the scaling of a Geant4 application across nodes on a
cluster we are improving the support of MPI in Geant4. In particular it is now
possible to run an hybrid MPI/MT application that uses MPI to scale across
nodes and MT to scale across cores. %The recent developments allow users to
easily implement parallel application resources that scale on a very large
number of nodes and cores typical of HPC resources.
Process checkpoint-restart is a technology with great potential for use in
HEP workflows. Use cases include debugging, reducing the startup time of
applications both in offline batch jobs and the High Level Trigger, permitting
job preemption in environments where spare CPU cycles are being used
opportunistically and efficient scheduling of a mix of multicore and
single-threaded jobs. We report on tests of checkpoint-restart technology using
CMS software, Geant4-MT (multi-threaded Geant4), and the DMTCP (Distributed
Multithreaded Checkpointing) package. We analyze both single- and
multi-threaded applications and test on both standard Intel x86 architectures
and on Intel MIC. The tests with multi-threaded applications on Intel MIC are
used to consider scalability and performance. These are considered an indicator
of what the future may hold for many-core computing.
We report on our investigations into the viability of the ARM processor and
the Intel Xeon Phi co-processor for scientific computing. We describe our
experience porting software to these processors and running benchmarks using
real physics applications to explore the potential of these processors for
production physics processing.
During 2003 test beam session for ATLAS Tile Calorimeter a monitoring program
has been developed to ease the setup of correct running condition and the
assessment of data quality. The program has been built using the Online
Software services provided by the ATLAS Online Software group. The first part
of this note contains a brief overview of these services followed by the full
description of Tile Calorimeter monitoring program architecture and features.
Performances and future upgrades are discussed in the final part of this note.