-
Particle physics has an ambitious and broad experimental programme for the
coming decades. This programme requires large investments in detector hardware,
either to build new facilities and experiments, or to upgrade existing ones.
Similarly, it requires commensurate investment in the R&D of software to
acquire, manage, process, and analyse the shear amounts of data to be recorded.
In planning for the HL-LHC in particular, it is critical that all of the
collaborating stakeholders agree on the software goals and priorities, and that
the efforts complement each other. In this spirit, this white paper describes
the R&D activities required to prepare for this software upgrade.
-
ROOT is an object-oriented C++ framework conceived in the high-energy physics
(HEP) community, designed for storing and analyzing petabytes of data in an
efficient way. Any instance of a C++ class can be stored into a ROOT file in a
machine-independent compressed binary format. In ROOT the TTree object
container is optimized for statistical data analysis over very large data sets
by using vertical data storage techniques. These containers can span a large
number of files on local disks, the web, or a number of different shared file
systems. In order to analyze this data, the user can chose out of a wide set of
mathematical and statistical functions, including linear algebra classes,
numerical algorithms such as integration and minimization, and various methods
for performing regression analysis (fitting). In particular, ROOT offers
packages for complex data modeling and fitting, as well as multivariate
classification based on machine learning techniques. A central piece in these
analysis tools are the histogram classes which provide binning of one- and
multi-dimensional data. Results can be saved in high-quality graphical formats
like Postscript and PDF or in bitmap formats like JPG or GIF. The result can
also be stored into ROOT macros that allow a full recreation and rework of the
graphics. Users typically create their analysis macros step by step, making use
of the interactive C++ interpreter CINT, while running over small data samples.
Once the development is finished, they can run these macros at full compiled
speed over large data sets, using on-the-fly compilation, or by creating a
stand-alone batch program. Finally, if processing farms are available, the user
can reduce the execution time of intrinsically parallel tasks - e.g. data
mining in HEP - by using PROOF, which will take care of optimally distributing
the work over the available resources in a transparent way.
-
The Riemann Xi-function Xi(t)=xi(1/2+it) is a particularly interesting member
of a broad family of entire functions which can be expanded in terms of
symmetrized Pochhammer polynomials depending on a certain scaling parameter
beta. An entire function in this family can be expressed as a specific integral
transform of a function A(x) to which can be associated a unique minimal
beta-sequence beta(min,n)-> infinity as n-> infinity, having the property that
the Pochhammer polynomial approximant Xi(n,t,beta(n)) of order n to the
function Xi(t) has real roots only in t for all n and for all beta(n)>=
beta(min,n).
The importance of the minimal beta-sequence is related to the fact that its
asymptotic properties may, by virtue of the Hurwitz theorem of complex
analysis, allow for making inferences about the zeros of the limit function
Xi(t) in case the approximants Xi(n,t,beta(n)) converge. The objective of the
paper is to investigate numerically the properties, in particular the very
large n properties, of the minimal beta-sequences for different choices of the
function A(x) of compact support and of exponential decrease, including the
Riemann case.
-
RooStats is a project to create advanced statistical tools required for the
analysis of LHC data, with emphasis on discoveries, confidence intervals, and
combined measurements. The idea is to provide the major statistical techniques
as a set of C++ classes with coherent interfaces, so that can be used on
arbitrary model and datasets in a common way. The classes are built on top of
the RooFit package, which provides functionality for easily creating
probability models, for analysis combinations and for digital publications of
the results. We will present in detail the design and the implementation of the
different statistical methods of RooStats. We will describe the various classes
for interval estimation and for hypothesis test depending on different
statistical techniques such as those based on the likelihood function, or on
frequentists or bayesian statistics. These methods can be applied in complex
problems, including cases with multiple parameters of interest and various
nuisance parameters.