
In high-dimensional linear models, the sparsity assumption is typically made,
stating that most of the parameters are equal to zero. Under the sparsity
assumption, estimation and, recently, inference have been well studied.
However, in practice, the sparsity assumption is not checkable and, more importantly,
is often violated; a large number of covariates might be expected to be
associated with the response, indicating that possibly all, rather than just a
few, parameters are nonzero. A natural example is genome-wide gene
expression profiling, where all genes are believed to affect a common disease
marker. We show that existing inferential methods are sensitive to the sparsity
assumption and may, in turn, suffer a severe lack of control of Type I
error. In this article, we propose a new inferential method, named CorrT, which
is robust to model misspecification such as heteroscedasticity and lack of
sparsity. CorrT is shown to have Type I error approaching the nominal level for
\textit{any} models and Type II error approaching zero for sparse and many
dense models.
In fact, CorrT is also shown to be optimal in a variety of frameworks:
sparse, non-sparse and hybrid models where sparse and dense signals are mixed.
Numerical experiments show a favorable performance of the CorrT test compared
to the state-of-the-art methods.

We extend conformal inference to general settings that allow for time series
data. Our proposal is developed as a randomization method and accounts for
potential serial dependence by including block structures in the permutation
scheme. As a result, the proposed method retains the exact, model-free validity
when the data are i.i.d. or more generally exchangeable, similar to usual
conformal inference methods. When exchangeability fails, as is the case for
common time series data, the proposed approach is approximately valid under
weak assumptions on the conformity score.
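
The block-permutation idea can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the permutation set is taken to be cyclic shifts by multiples of a block size, the statistic is the conformity score in the last position, and the helper names `block_permutation_pvalue` and `conformal_set` as well as the mean-deviation score are hypothetical.

```python
import numpy as np


def block_permutation_pvalue(scores, block_size=1):
    """Randomization p-value for the conformity score in the last position.

    Assumptions for illustration only: the permutation set consists of
    cyclic shifts by multiples of `block_size`, and the statistic is the
    score that lands in the last position after each shift.
    """
    scores = np.asarray(scores, dtype=float)
    n = scores.size
    if n == 0 or block_size < 1:
        raise ValueError("need at least one score and block_size >= 1")
    # Shift 0 is the identity, so the observed score is always counted.
    shifts = np.arange(0, n, block_size)
    # np.roll(scores, k)[-1] equals scores[(n - 1 - k) % n].
    permuted = scores[(n - 1 - shifts) % n]
    return float(np.mean(permuted >= scores[-1]))


def conformal_set(history, candidates, block_size=1, alpha=0.1):
    """Keep candidate next values whose p-value exceeds alpha.

    Hypothetical conformity score: absolute deviation from the sample mean
    of the augmented series (history plus the candidate).
    """
    kept = []
    for y in candidates:
        augmented = np.append(np.asarray(history, dtype=float), y)
        scores = np.abs(augmented - augmented.mean())
        if block_permutation_pvalue(scores, block_size) > alpha:
            kept.append(float(y))
    return kept
```

Because the identity shift is always included, the p-value is never below one over the number of shifts. Larger blocks give fewer shifts, so the smallest attainable p-value becomes coarser.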

This paper introduces new inference methods for counterfactual and synthetic
control methods for evaluating policy effects. Our inference methods work in
conjunction with many modern and classical methods for estimating the
counterfactual mean outcome in the absence of a policy intervention.
Specifically, our methods work together with the difference-in-difference,
canonical synthetic control, constrained and penalized regression methods for
synthetic control, factor/matrix completion models for panel data, interactive
fixed effects panel models, time series models, as well as fused time series
panel data models. The proposed method has a double justification. (i) If the
residuals from estimating the counterfactuals are exchangeable as implied, for
example, by i.i.d. data, our procedure achieves exact finite-sample size
control without any assumption on the specific approach used to estimate the
counterfactuals. (ii) If the data exhibit dynamics and serial dependence, our
inference procedure achieves approximate uniform size control under weak and
easy-to-verify conditions on the method used to estimate the counterfactual. We
verify these conditions for representative methods from each group listed above.
Simulation experiments demonstrate the usefulness of our approach in finite
samples. We apply our method to reevaluate the causal effect of election day
registration (EDR) laws on voter turnout in the United States.
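
A permutation test on counterfactual residuals can be sketched as follows. This is only a sketch under stated assumptions, not the paper's procedure: the residuals are taken as given from some counterfactual estimator, the statistic is assumed to be the mean absolute residual over the post-intervention periods, the permutations are assumed to be all cyclic shifts of the residual series, and the function names are hypothetical.

```python
import numpy as np


def post_period_stat(residuals, n_post):
    """Mean absolute residual over the last `n_post` periods (assumed statistic)."""
    return float(np.mean(np.abs(residuals[-n_post:])))


def cyclic_shift_pvalue(residuals, n_post):
    """Permutation p-value for the null of no policy effect.

    Assumption for illustration: the permutation set is every cyclic shift
    of the residual series, including the identity (shift 0).
    """
    residuals = np.asarray(residuals, dtype=float)
    n = residuals.size
    if not 0 < n_post < n:
        raise ValueError("n_post must satisfy 0 < n_post < len(residuals)")
    observed = post_period_stat(residuals, n_post)
    # Recompute the statistic after moving each shifted window into the
    # post-period positions.
    shifted = np.array(
        [post_period_stat(np.roll(residuals, k), n_post) for k in range(n)]
    )
    return float(np.mean(shifted >= observed))
```

A small p-value indicates that the post-intervention residuals are unusually large relative to every other window of the same length, which is evidence against the null of no effect.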

This paper studies hypothesis testing and confidence interval construction in
high-dimensional linear models with possible non-sparse structures. For a given
component of the parameter vector, we show that the difficulty of the problem
depends on the sparsity of the corresponding row of the precision matrix of the
covariates, not the sparsity of the model itself. We develop new concepts of
uniform and essentially uniform non-testability that allow the study of
limitations of tests across a broad set of alternatives. Uniform
non-testability identifies an extensive collection of alternatives such that
the power of any test, against any alternative in this group, is asymptotically
at most equal to the nominal size, whereas minimaxity shows the existence of
one particularly "bad" alternative. Implications of the new constructions
include new minimax testability results that, in sharp contrast to existing
results, do not depend on the sparsity of the model parameters. We identify new
tradeoffs between testability and feature correlation. In particular, we show
that, in models with weak feature correlations, the minimax lower bound can be
attained by a confidence interval whose width has the parametric rate
regardless of the size of the model sparsity.

Models with many signals, high-dimensional models, often impose structures on
the signal strengths. The common assumption is that only a few signals are
strong and most of the signals are zero or close (collectively) to zero.
However, such a requirement might not be valid in many real-life applications.
In this article, we are interested in conducting large-scale inference in
models that might have signals of mixed strengths. The key challenge is that
the signals that are not under testing might be collectively non-negligible
(although individually small) and cannot be accurately learned. This article
develops a new class of tests that arise from a moment matching formulation. A
virtue of these moment-matching statistics is their ability to borrow strength
across features, adapt to the sparsity size and adjust for testing a
growing number of hypotheses. The GRoup-level Inference of Parameter (GRIP) test
harvests effective sparsity structures with hypothesis formulation for an
efficient multiple testing procedure. Simulated data showcase that GRIP's error
control is far better than that of the alternative methods. We develop a minimax
theory, demonstrating optimality of GRIP for a broad range of models, including
those where the model is a mixture of sparse and high-dimensional dense
signals.

We provide comments on the article "High-dimensional simultaneous inference
with the bootstrap" by Ruben Dezeure, Peter Bühlmann and Cun-Hui Zhang.

This article develops a framework for testing general hypotheses in
high-dimensional models where the number of variables may far exceed the number
of observations. Existing literature has considered less than a handful of
hypotheses, such as testing individual coordinates of the model parameter.
However, the problem of testing general and complex hypotheses remains widely
open. We propose a new inference method developed around the hypothesis
adaptive projection pursuit framework, which solves the testing problems in the
most general case. The proposed inference is centered around a new class of
estimators defined as the $\ell_1$ projection of the initial guess of the unknown parameter onto
the space defined by the null. This projection automatically takes into account
the structure of the null hypothesis and allows us to study formal inference
for a number of longstanding problems. For example, we can directly conduct
inference on the sparsity level of the model parameters and the minimum signal
strength. This is especially significant given the fact that the former is a
fundamental condition underlying most of the theoretical development in
high-dimensional statistics, while the latter is a key condition used to
establish variable selection properties. Moreover, the proposed method is
asymptotically exact and has satisfactory power properties for testing very
general functionals of the high-dimensional parameters. The simulation studies
lend further support to our theoretical claims and additionally show excellent
finite-sample size and power properties of the proposed test.

We propose a methodology for testing linear hypotheses in high-dimensional
linear models. The proposed test does not impose any restriction on the size of
the model, i.e., model sparsity, or on the loading vector representing the
hypothesis. Providing asymptotically valid methods for testing general linear
functions of the regression parameters in high dimensions is extremely
challenging, especially without making restrictive or unverifiable
assumptions on the number of nonzero elements. We propose to test the moment
conditions related to the newly designed restructured regression, where the
inputs are transformed and augmented features. These new features incorporate
the structure of the null hypothesis directly. The test statistics are
constructed in such a way that lack of sparsity in the original model parameter
does not present a problem for the theoretical justification of our procedures.
We establish asymptotically exact control on Type I error without imposing any
sparsity assumptions on the model parameter or the vector representing the linear
hypothesis. Our method is also shown to achieve certain optimality in detecting
deviations from the null hypothesis. We demonstrate the favorable finite-sample
performance of the proposed methods via a number of numerical examples and a real data
example.

In analyzing high-dimensional models, sparsity of the model parameter is a
common but often undesirable assumption. In this paper, we study the following
two-sample testing problem: given two samples generated by two high-dimensional
linear models, we aim to test whether the regression coefficients of the two
linear models are identical. We propose a framework named TIERS (short for
TestIng Equality of Regression Slopes), which solves the two-sample testing
problem without making any assumptions on the sparsity of the regression
parameters. TIERS builds a new model by convolving the two samples in such a
way that the original hypothesis translates into a new moment condition. A
self-normalization construction is then developed to form a moment test. We
provide rigorous theory for the developed framework. Under very weak conditions
on the feature covariance, we show that the accuracy of the proposed test in
controlling Type I errors is robust both to the lack of sparsity in the
features and to the heavy tails in the error distribution, even when the sample
size is much smaller than the feature dimension. Moreover, we discuss minimax
optimality and efficiency properties of the proposed test. Simulation analysis
demonstrates excellent finite-sample performance of our test. In deriving the
test, we also develop tools that are of independent interest. The test is built
upon a novel estimator, called Auto-aDaptive Dantzig Selector (ADDS), which not
only automatically chooses an appropriate scale of the error term but also
incorporates prior information. To effectively approximate the critical value
of the test statistic, we develop a novel high-dimensional plug-in approach
that complements the recent advances in Gaussian approximation theory.