
In many settings it is critical to accurately model the extreme tail
behaviour of a random process. Nonparametric density estimation methods are
commonly implemented as exploratory data analysis techniques for this purpose
as they possess excellent visualisation properties, and can naturally avoid the
model specification biases implied by using parametric estimators. In
particular, kernelbased estimators place minimal assumptions on the data, and
provide improved visualisation over scatterplots and histograms. However kernel
density estimators are known to perform poorly when estimating extreme tail
behaviour, which is important when interest is in process behaviour above some
large threshold, and they can overemphasise bumps in the density for heavy
tailed data. In this article we develop a transformation kernel density
estimator, and demonstrate that its mean integrated squared error (MISE)
efficiency is equivalent to that of standard, nontail focused kernel density
estimators. Estimator performance is illustrated in numerical studies, and in
an expanded analysis of the ability of well known global climate models to
reproduce observed temperature extremes in Sydney, Australia.

Many developments in Mathematics involve the computation of higher order
derivatives of Gaussian density functions. The analysis of univariate Gaussian
random variables is a wellestablished field whereas the analysis of their
multivariate counterparts consists of a body of results which are more
dispersed. These latter results generally fall into two main categories:
theoretical expressions which reveal the deep structure of the problem, or
computational algorithms which can mask the connections with closely related
problems. In this paper, we unify existing results and develop new results in a
framework which is both conceptually cogent and computationally efficient. We
focus on the underlying connections between higher order derivatives of
Gaussian density functions, the expected value of products of quadratic forms
in Gaussian random variables, and Vstatistics of degree two based on Gaussian
density functions. These three sets of results are combined into an analysis of
nonparametric data smoothers.

In systems biomedicine, an experimenter encounters different potential
sources of variation in data such as individual samples, multiple experimental
conditions, and multivariable networklevel responses. In multiparametric
cytometry, which is often used for analyzing patient samples, such issues are
critical. While computational methods can identify cell populations in
individual samples, without the ability to automatically match them across
samples, it is difficult to compare and characterize the populations in typical
experiments, such as those responding to various stimulations or distinctive of
particular patients or timepoints, especially when there are many samples.
Joint Clustering and Matching (JCM) is a multilevel framework for simultaneous
modeling and registration of populations across a cohort. JCM models every
population with a robust multivariate probability distribution. Simultaneously,
JCM fits a randomeffects model to construct an overall batch template  used
for registering populations across samples, and classifying new samples. By
tackling systemslevel variation, JCM supports practical biomedical
applications involving large cohorts.

Important information concerning a multivariate data set, such as clusters
and modal regions, is contained in the derivatives of the probability density
function. Despite this importance, nonparametric estimation of higher order
derivatives of the density functions have received only relatively scant
attention. Kernel estimators of density functions are widely used as they
exhibit excellent theoretical and practical properties, though their
generalization to density derivatives has progressed more slowly due to the
mathematical intractabilities encountered in the crucial problem of bandwidth
(or smoothing parameter) selection. This paper presents the first fully
automatic, databased bandwidth selectors for multivariate kernel density
derivative estimators. This is achieved by synthesizing recent advances in
matrix analytic theory which allow mathematically and computationally tractable
representations of higher order derivatives of multivariate vector valued
functions. The theoretical asymptotic properties as well as the finite sample
behaviour of the proposed selectors are studied. {In addition, we explore in
detail the applications of the new datadriven methods for two other
statistical problems: clustering and bump hunting. The introduced techniques
are combined with the mean shift algorithm to develop novel automatic,
nonparametric clustering procedures which are shown to outperform mixturemodel
cluster analysis and other recent nonparametric approaches in practice.
Furthermore, the advantage of the use of smoothing parameters designed for
density derivative estimation for feature significance analysis for bump
hunting is illustrated with a real data example.