
Sparse principal component analysis (sPCA) has become one of the most widely
used techniques for dimensionality reduction in high-dimensional datasets. The
main challenge underlying sPCA is to estimate the first vector of loadings of
the population covariance matrix, provided that only a certain number of
loadings are nonzero. In this paper, we propose confidence intervals for
individual loadings and for the largest eigenvalue of the population covariance
matrix. Given an independent sample $X^i \in \mathbb{R}^p$, $i = 1,\ldots,n$,
generated from an unknown distribution with an unknown covariance matrix
$\Sigma_0$, our aim is to estimate the first vector of loadings and the largest
eigenvalue of $\Sigma_0$ in a setting where $p\gg n$. Next to the
high-dimensionality, another challenge lies in the inherent nonconvexity of
the problem. We base our methodology on a Lasso-penalized M-estimator which,
despite nonconvexity, may be solved by a polynomial-time algorithm such as
coordinate or gradient descent. We show that our estimator achieves the minimax
optimal rates in $\ell_1$ and $\ell_2$-norm. We identify the bias in the
Lasso-based estimator and propose a debiased sparse PCA estimator for the
vector of loadings and for the largest eigenvalue of the covariance matrix
$\Sigma_0$. Our main results provide theoretical guarantees for asymptotic
normality of the debiased estimator. The major conditions we impose are
sparsity in the first eigenvector of small order $\sqrt{n}/\log p$ and sparsity
of the same order in the columns of the inverse Hessian matrix of the
population risk.
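
As a rough illustration of how a Lasso-penalized nonconvex problem of this kind can be attacked by gradient steps, the sketch below runs proximal gradient descent on the objective $\frac14\|S - bb^T\|_F^2 + \lambda\|b\|_1$. This objective is an assumption chosen for illustration (the abstract does not state the loss), as are the function names, the step size and the initialisation. No debiasing step or confidence interval is attempted here.

```python
import numpy as np


def soft_threshold(z, t):
    """Entrywise soft-thresholding, the proximal map of t * ||.||_1."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)


def sparse_leading_component(S, lam, step=0.05, n_iter=500):
    """Proximal gradient descent on (1/4)*||S - b b^T||_F^2 + lam*||b||_1.

    Assumed objective (not taken from the text). Its unpenalized stationary
    points satisfy S b = ||b||^2 b, so b / ||b|| estimates the loadings and
    ||b||^2 the largest eigenvalue.
    """
    # Hypothetical initialisation: the scaled leading eigenvector of S.
    # The problem is nonconvex, so a starting point near the target matters.
    eigvals, eigvecs = np.linalg.eigh(S)
    b = np.sqrt(eigvals[-1]) * eigvecs[:, -1]
    for _ in range(n_iter):
        grad = (b @ b) * b - S @ b  # gradient of the smooth part
        b = soft_threshold(b - step * grad, step * lam)
    eigenvalue = b @ b
    return b / np.sqrt(eigenvalue), eigenvalue


# Toy covariance: a sparse spike plus a small dense perturbation.
u = np.array([1.0, 1.0, 0.0, 0.0, 0.0]) / np.sqrt(2.0)
S = np.eye(5) + 4.0 * np.outer(u, u) + 0.02 * (np.ones((5, 5)) - np.eye(5))
loadings, eigenvalue = sparse_leading_component(S, lam=0.1)
```

In this toy example the plain leading eigenvector of `S` is dense, while the penalized iterate keeps only the two spiked coordinates. The penalty also shrinks the eigenvalue estimate slightly, which is the kind of bias a debiasing step is meant to remove.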

We provide a selected overview of methodology and theory for estimation and
inference on the edge weights in high-dimensional directed and undirected
Gaussian graphical models. For undirected graphical models, two main explicit
constructions are provided: one based on a global method that maximizes the
joint likelihood (the graphical Lasso) and one based on a local (nodewise)
method that sequentially applies the Lasso to estimate the neighbourhood of
each node. The proposed estimators lead to confidence intervals for edge
weights and recovery of the edge structure. We evaluate their empirical
performance in an extensive simulation study. The theoretical guarantees for
the methods are achieved under a sparsity condition relative to the sample size
and regularity conditions. For directed acyclic graphs, we apply similar ideas
to construct confidence intervals for edge weights, when the directed acyclic
graph is identifiable.
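
The local (nodewise) construction for undirected graphs can be sketched as follows: each variable is regressed on all others with the Lasso, and the nonzero coefficients define its estimated neighbourhood. This is a minimal sketch under stated assumptions: the coordinate-descent solver, the function names, the tuning parameter `lam` and the "OR" rule used to symmetrize the result are illustrative choices, and the inference step (confidence intervals) is not shown.

```python
import numpy as np


def lasso_cd(X, y, lam, n_iter=200):
    """Cyclic coordinate descent for (1/2n)*||y - X b||^2 + lam*||b||_1."""
    n, p = X.shape
    b = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    r = y.astype(float)  # running residual y - X b (a copy of y)
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            r += X[:, j] * b[j]  # remove coordinate j from the fit
            rho = X[:, j] @ r / n
            b[j] = np.sign(rho) * max(abs(rho) - lam, 0.0) / col_sq[j]
            r -= X[:, j] * b[j]  # add the updated coordinate back
    return b


def nodewise_neighbourhoods(X, lam):
    """Estimated adjacency matrix from p nodewise Lasso regressions."""
    X = X - X.mean(axis=0)
    p = X.shape[1]
    adj = np.zeros((p, p), dtype=bool)
    for j in range(p):
        others = [k for k in range(p) if k != j]
        coef = lasso_cd(X[:, others], X[:, j], lam)
        adj[j, others] = coef != 0.0
    # Assumed "OR" rule: keep an edge if either regression selects it.
    return adj | adj.T
```

A call such as `nodewise_neighbourhoods(X, lam=0.2)` on an `n x p` data matrix returns a symmetric boolean matrix with `False` on the diagonal.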

Asymptotic lower bounds for estimation play a fundamental role in assessing
the quality of statistical procedures. In this paper we propose a framework for
obtaining semiparametric efficiency bounds for sparse high-dimensional models,
where the dimension of the parameter is larger than the sample size. We adopt a
semiparametric point of view: we concentrate on one-dimensional functions of a
high-dimensional parameter. We follow two different approaches to reach the
lower bounds: asymptotic Cram\'er-Rao bounds and Le Cam's type of analysis.
Both these approaches allow us to define a class of asymptotically unbiased or
"regular" estimators for which a lower bound is derived. Consequently, we show
that certain estimators obtained by de-sparsifying (or debiasing) an
$\ell_1$-penalized M-estimator are asymptotically unbiased and achieve the
lower bound on the variance: thus in this sense they are asymptotically
efficient. The paper discusses in detail the linear regression model and the
Gaussian graphical model.

We study asymptotically normal estimation and confidence regions for
low-dimensional parameters in high-dimensional sparse models. Our approach is
based on the $\ell_1$-penalized M-estimator which is used for construction of a
bias-corrected estimator. We show that the proposed estimator is asymptotically
normal, under a sparsity assumption on the high-dimensional parameter,
smoothness conditions on the expected loss and an entropy condition. This leads
to uniformly valid confidence regions and hypothesis testing for
low-dimensional parameters. The present approach is different in that it allows
for treatment of loss functions that are not sufficiently differentiable, such
as quantile loss, Huber loss or hinge loss functions. We also provide new
results for estimation of the inverse Fisher information matrix, which is
necessary for the construction of the proposed estimator. We formulate our
results for general models under high-level conditions, but investigate these
conditions in detail for generalized linear models and provide mild sufficient
conditions. As particular examples, we investigate the case of quantile loss
and Huber loss in linear regression and demonstrate the performance of the
estimators in a simulation study and on real datasets from genome-wide
association studies. We further investigate the case of logistic regression and
illustrate the performance of the estimator on simulated and real data.
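
For reference, the three loss functions named above can be written down directly. The sketch below uses their standard textbook definitions. The function names, the default parameters `tau` and `delta`, and the finite-difference helper are illustrative and not taken from the text.

```python
def quantile_loss(u: float, tau: float = 0.5) -> float:
    """Check loss: tau * u for u >= 0 and (tau - 1) * u for u < 0."""
    return u * (tau - (1.0 if u < 0 else 0.0))


def huber_loss(u: float, delta: float = 1.0) -> float:
    """Quadratic for |u| <= delta, linear beyond; the second derivative jumps."""
    a = abs(u)
    return 0.5 * u * u if a <= delta else delta * (a - 0.5 * delta)


def hinge_loss(margin: float) -> float:
    """max(0, 1 - margin), with margin = y * f(x) and y in {-1, +1}."""
    return max(0.0, 1.0 - margin)


def one_sided_slopes(loss, u: float, h: float = 1e-6):
    """Finite-difference left and right slopes of a loss at the point u."""
    left = (loss(u) - loss(u - h)) / h
    right = (loss(u + h) - loss(u)) / h
    return left, right
```

Calling `one_sided_slopes(quantile_loss, 0.0)` or `one_sided_slopes(hinge_loss, 1.0)` shows left and right slopes that disagree at the kink, which is the lack of differentiability the abstract refers to. The Huber loss has matching first-order slopes everywhere, but its second derivative is discontinuous at `|u| = delta`.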

We propose methodology for estimation of sparse precision matrices and
statistical inference for their low-dimensional parameters in a
high-dimensional setting where the number of parameters $p$ can be much larger
than the sample size. We show that the novel estimator achieves minimax rates
in supremum norm and the low-dimensional components of the estimator have a
Gaussian limiting distribution. These results hold uniformly over the class of
precision matrices with row sparsity of small order $\sqrt{n}/\log p$ and
spectrum uniformly bounded, under a sub-Gaussian tail assumption on the margins
of the true underlying distribution. Consequently, our results lead to
uniformly valid confidence regions for low-dimensional parameters of the
precision matrix. Thresholding the estimator leads to variable selection
without imposing irrepresentability conditions. The performance of the method
is demonstrated in a simulation study and on real data.

We propose methodology for statistical inference for low-dimensional
parameters of sparse precision matrices in a high-dimensional setting. Our
method leads to a non-sparse estimator of the precision matrix whose entries
have a Gaussian limiting distribution. Asymptotic properties of the novel
estimator are analyzed for the case of sub-Gaussian observations under a
sparsity assumption on the entries of the true precision matrix and regularity
conditions. Thresholding the de-sparsified estimator gives guarantees for edge
selection in the associated graphical model. Performance of the proposed method
is illustrated in a simulation study.
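
The two steps mentioned above, de-sparsifying an initial precision matrix estimate and then thresholding it for edge selection, might look as follows. The correction $2\hat\Theta - \hat\Theta\hat\Sigma\hat\Theta$ is one form commonly used in the de-sparsification literature. The abstract does not state a formula, so treating it as the estimator here is an assumption, as are the function names and the threshold `level`. The initial sparse estimate `theta_hat` is taken as given.

```python
import numpy as np


def desparsify(theta_hat, sigma_hat):
    """One-step correction 2*Theta - Theta @ Sigma @ Theta (assumed form).

    theta_hat: initial (typically sparse) precision matrix estimate.
    sigma_hat: sample covariance matrix.
    """
    return 2.0 * theta_hat - theta_hat @ sigma_hat @ theta_hat


def select_edges(t, level):
    """Boolean adjacency matrix: off-diagonal entries with |t_ij| > level."""
    edges = np.abs(t) > level
    np.fill_diagonal(edges, False)
    return edges
```

A quick sanity check on the correction: if `theta_hat` equals the exact inverse of `sigma_hat`, the output is unchanged, and if `theta_hat` is that inverse shrunk by 10 percent, the output is off by only 1 percent, since $2(0.9) - 0.9^2 = 0.99$.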