When processing large amounts of data, the rate at which reading and writing
can take place is a critical factor. High energy physics data processing
relying on ROOT is no exception. The recent parallelisation of the LHC
experiments' software frameworks and the analysis of the ever-increasing amount
of collision data collected by the experiments have further emphasised this
issue, underlining the need to increase the implicit parallelism expressed
within ROOT I/O. In this
contribution we highlight the improvements of the ROOT I/O subsystem which
targeted a satisfactory scaling behaviour in a multithreaded context. The
effect of parallelism on the individual steps which ROOT chains to read and
write data, namely (de)compression, (de)serialisation and access to the storage
backend, is discussed. Performance measurements are presented through
real-life examples coming from CMS production workflows on traditional server
platforms and highly parallel architectures such as Intel Xeon Phi.
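
The chain of steps named above (serialisation, compression, storage access) can be sketched in a few lines. This is a minimal illustration in Python using only the standard library, not ROOT's implementation or API: the function names, the use of `pickle` and `zlib`, and the in-memory list standing in for the storage backend are all assumptions made for the example. It shows the general idea of preparing independent batches of events concurrently while keeping the write order deterministic.

```python
import pickle
import zlib
from concurrent.futures import ThreadPoolExecutor


def serialise(batch):
    """Turn a batch of in-memory records into a flat byte buffer."""
    return pickle.dumps(batch, protocol=pickle.HIGHEST_PROTOCOL)


def compress(buffer):
    """Compress one serialised buffer (zlib stands in for the real codec)."""
    return zlib.compress(buffer, 6)


def write_batches(batches, sink, workers=4):
    """Serialise and compress batches concurrently, then append them in order."""
    def prepare(batch):
        return compress(serialise(batch))

    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map() yields results in input order, so the layout of the
        # sink does not depend on which worker finishes first.
        for payload in pool.map(prepare, batches):
            sink.append(payload)  # hypothetical storage backend: a plain list
    return sink


def read_batches(sink, workers=4):
    """Decompress and deserialise stored payloads concurrently."""
    def restore(payload):
        return pickle.loads(zlib.decompress(payload))

    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(restore, sink))


# Eight batches of 1000 toy "events" each.
batches = [
    [(event, event * 0.5) for event in range(start, start + 1000)]
    for start in range(0, 8000, 1000)
]
stored = write_batches(batches, sink=[])
restored = read_batches(stored)
```

The sketch only conveys the structure of the pipeline; it makes no claim about the speed-up that threads would deliver for any particular workload.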

Particle physics has an ambitious and broad experimental programme for the
coming decades. This programme requires large investments in detector hardware,
either to build new facilities and experiments, or to upgrade existing ones.
Similarly, it requires commensurate investment in the R&D of software to
acquire, manage, process, and analyse the sheer amounts of data to be recorded.
In planning for the HL-LHC in particular, it is critical that all of the
collaborating stakeholders agree on the software goals and priorities, and that
the efforts complement each other. In this spirit, this white paper describes
the R&D activities required to prepare for this software upgrade.

RooStats is a project to create advanced statistical tools required for the
analysis of LHC data, with emphasis on discoveries, confidence intervals, and
combined measurements. The idea is to provide the major statistical techniques
as a set of C++ classes with coherent interfaces, so that they can be used on
arbitrary models and datasets in a common way. The classes are built on top of
the RooFit package, which provides functionality for easily creating
probability models, for analysis combinations and for digital publications of
the results. We will present in detail the design and the implementation of the
different statistical methods of RooStats. We will describe the various classes
for interval estimation and for hypothesis testing, based on different
statistical techniques such as those using the likelihood function, or
frequentist or Bayesian statistics. These methods can be applied to complex
problems, including cases with multiple parameters of interest and various

RooStatsCms is an object-oriented statistical framework based on the RooFit
technology. Its scope is to allow the modelling, statistical analysis and
combination of multiple search channels for new phenomena in High Energy
Physics. It provides a variety of methods described in the literature and
implemented as classes, designed for the execution of multiple CPU-intensive
jobs on batch systems or on the Grid.
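
As an illustration of what combining search channels means in practice, the sketch below builds a joint likelihood for several independent Poisson counting channels sharing one signal-strength parameter and extracts an approximate 68% likelihood-ratio interval. It is written in plain Python and does not use or reproduce the RooStatsCms or RooStats classes: the function names, the toy channel numbers and the simple numerical minimiser are assumptions made for the example, and the interval relies on the usual asymptotic rule that the negative log-likelihood rises by 0.5 at the interval edges.

```python
import math


def nll(mu, channels):
    """Joint negative log-likelihood of independent Poisson channels.

    Each channel is (observed, expected_signal, expected_background);
    the constant ln(n!) terms are dropped.
    """
    total = 0.0
    for n, s, b in channels:
        lam = mu * s + b
        total += lam - n * math.log(lam)
    return total


def minimise(f, lo, hi, iters=200):
    """Ternary search for the minimum of a convex function on [lo, hi]."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if f(m1) < f(m2):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)


def crossing(f, a, b, iters=200):
    """Bisection for f(x) = 0, assuming f(a) and f(b) have opposite signs."""
    fa = f(a)
    for _ in range(iters):
        m = 0.5 * (a + b)
        fm = f(m)
        if (fm > 0) == (fa > 0):
            a, fa = m, fm
        else:
            b = m
    return 0.5 * (a + b)


def likelihood_interval(channels, lo=0.0, hi=10.0, threshold=1.0):
    """Best-fit signal strength and the range where -2 ln(lambda) <= threshold."""
    f = lambda mu: nll(mu, channels)
    mu_hat = minimise(f, lo, hi)
    nll_min = f(mu_hat)
    g = lambda mu: 2.0 * (f(mu) - nll_min) - threshold
    # If the curve never reaches the threshold, clip at the scan boundary.
    lower = crossing(g, lo, mu_hat) if g(lo) > 0 else lo
    upper = crossing(g, mu_hat, hi) if g(hi) > 0 else hi
    return mu_hat, lower, upper


# Toy inputs (hypothetical numbers, not taken from any analysis).
channels = [(12, 5.0, 6.0), (30, 10.0, 18.0), (7, 3.0, 5.0)]
mu_hat, lower, upper = likelihood_interval(channels)
single_hat, single_lower, single_upper = likelihood_interval(channels[:1])
```

With these toy inputs the combined interval is narrower than the interval from the first channel alone, which is the motivation for combining channels in the first place.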