
We test the distanceduality relation $\eta \equiv d_L / [ (1 + z)^2 d_A ] =
1$ between cosmological luminosity distance ($d_L$) from the JLA SNe Ia
compilation (arXiv:1401.4064) and angulardiameter distance ($d_A$) based on
Baryon Oscillation Spectroscopic Survey (BOSS; arXiv:1607.03155) and WiggleZ
baryon acoustic oscillation measurements (arXiv:1105.2862, arXiv:1204.3674).
The $d_L$ measurements are matched to $d_A$ redshift by a statistically
consistent compression procedure. With Monte Carlo methods, nontrivial and
correlated distributions of $\eta$ can be explored in a straightforward manner
without resorting to a particular evolution template $\eta(z)$. Assuming
independent constraints on cosmological parameters that are necessary to obtain
$d_L$ and $d_A$ values, we find 9% constraints consistent with $\eta = 1$ from
the analysis of SNIa + BOSS and an 18% bound results from SNIa + WiggleZ. These
results are contrary to previous claims that $\eta < 1$ has been found close to
or above the $1 \sigma$ level. We discuss the effect of different cosmological
parameter inputs and the use of the apparent deviation from distanceduality
as a proxy of systematic effects on cosmic distance measurements. The results
suggest possible systematic overestimation of SNIa luminosity distances
compared with $d_A$ data when a Planck {\Lambda}CDM cosmological parameter
inference (arXiv:1502.01589) is used to enhance the precision. If interpreted
as an extinction correction due to a gray dust component, the effect is broadly
consistent with independent observational constraints.

This paper is concerned with the problem of top$K$ ranking from pairwise
comparisons. Given a collection of $n$ items and a few pairwise comparisons
across them, one wishes to identify the set of $K$ items that receive the
highest ranks. To tackle this problem, we adopt the logistic parametric model
 the BradleyTerryLuce model, where each item is assigned a latent
preference score, and where the outcome of each pairwise comparison depends
solely on the relative scores of the two items involved. Recent works have made
significant progress towards characterizing the performance (e.g. the mean
square error for estimating the scores) of several classical methods, including
the spectral method and the maximum likelihood estimator (MLE). However, where
they stand regarding top$K$ ranking remains unsettled.
We demonstrate that under a natural random sampling model, the spectral
method alone, or the regularized MLE alone, is minimax optimal in terms of the
sample complexity  the number of paired comparisons needed to ensure exact
top$K$ identification, for the fixed dynamic range regime. This is
accomplished via optimal control of the entrywise error of the score estimates.
We complement our theoretical studies by numerical experiments, confirming that
both methods yield low entrywise errors for estimating the underlying scores.
Our theory is established via a novel leaveoneout trick, which proves
effective for analyzing both iterative and noniterative procedures. Along the
way, we derive an elementary eigenvector perturbation bound for probability
transition matrices, which parallels the DavisKahan $\sin\Theta$ theorem for
symmetric matrices. This also allows us to close the gap between the $\ell_2$
error upper bound for the spectral method and the minimax lower limit.

MultiObject Tracking (MOT) is a challenging task in the complex scene such
as surveillance and autonomous driving. In this paper, we propose a novel
tracklet processing method to cleave and reconnect tracklets on crowd or
longterm occlusion by Siamese BiGated Recurrent Unit (GRU). The tracklet
generation utilizes object features extracted by CNN and RNN to create the
highconfidence tracklet candidates in sparse scenario. Due to mistracking in
the generation process, the tracklets from different objects are split into
several subtracklets by a bidirectional GRU. After that, a Siamese GRU based
tracklet reconnection method is applied to link the subtracklets which belong
to the same object to form a whole trajectory. In addition, we extract the
tracklet images from existing MOT datasets and propose a novel dataset to train
our networks. The proposed dataset contains more than 95160 pedestrian images.
It has 793 different persons in it. On average, there are 120 images for each
person with positions and sizes. Experimental results demonstrate the
advantages of our model over the stateoftheart methods on MOT16.

This paper considers the problem of solving systems of quadratic equations,
namely, recovering an object of interest
$\mathbf{x}^{\natural}\in\mathbb{R}^{n}$ from $m$ quadratic equations/samples
$y_{i}=(\mathbf{a}_{i}^{\top}\mathbf{x}^{\natural})^{2}$, $1\leq i\leq m$. This
problem, also dubbed as phase retrieval, spans multiple domains including
physical sciences and machine learning.
We investigate the efficiency of gradient descent (or Wirtinger flow)
designed for the nonconvex least squares problem. We prove that under Gaussian
designs, gradient descent  when randomly initialized  yields an
$\epsilon$accurate solution in $O\big(\log n+\log(1/\epsilon)\big)$ iterations
given nearly minimal samples, thus achieving nearoptimal computational and
sample complexities at once. This provides the first global convergence
guarantee concerning vanilla gradient descent for phase retrieval, without the
need of (i) carefullydesigned initialization, (ii) sample splitting, or (iii)
sophisticated saddlepoint escaping schemes. All of these are achieved by
exploiting the statistical models in analyzing optimization algorithms, via a
leaveoneout approach that enables the decoupling of certain statistical
dependency between the gradient descent iterates and the data.

We consider the problem of recovering lowrank matrices from random rankone
measurements, which spans numerous applications including covariance sketching,
phase retrieval, quantum state tomography, and learning shallow polynomial
neural networks, among others. Our approach is to directly estimate the
lowrank factor by minimizing a nonconvex quadratic loss function via vanilla
gradient descent, following a tailored spectral initialization. When the true
rank is small, this algorithm is guaranteed to converge to the ground truth (up
to global ambiguity) with nearoptimal sample complexity and computational
complexity. To the best of our knowledge, this is the first guarantee that
achieves nearoptimality in both metrics. In particular, the key enabler of
nearoptimal computational guarantees is an implicit regularization phenomenon:
without explicit regularization, both spectral initialization and the gradient
descent iterates automatically stay within a region incoherent with the
measurement vectors. This feature allows one to employ much more aggressive
step sizes compared with the ones suggested in prior literature, without the
need of sample splitting.

We develop a new modeling framework for InterSubject Analysis (ISA). The
goal of ISA is to explore the dependency structure between different subjects
with the intrasubject dependency as nuisance. It has important applications in
neuroscience to explore the functional connectivity between brain regions under
natural stimuli. Our framework is based on the Gaussian graphical models, under
which ISA can be converted to the problem of estimation and inference of the
intersubject precision matrix. The main statistical challenge is that we do
not impose sparsity constraint on the whole precision matrix and we only assume
the intersubject part is sparse. For estimation, we propose to estimate an
alternative parameter to get around the nonsparse issue and it can achieve
asymptotic consistency even if the intrasubject dependency is dense. For
inference, we propose an "untangle and chord" procedure to debias our
estimator. It is valid without the sparsity assumption on the inverse Hessian
of the loglikelihood function. This inferential method is general and can be
applied to many other statistical problems, thus it is of independent
theoretical interest. Numerical experiments on both simulated and brain imaging
data validate our methods and theory.

Bayesian graphical models are an efficient tool for modelling complex data
and derive selfconsistent expressions of the posterior distribution of model
parameters. We apply Bayesian graphs to perform statistical analyses of Type Ia
supernova (SN Ia) luminosity distance measurements from the joint lightcurve
analysis (JLA) data set. In contrast to the $\chi^2$ approach used in previous
studies, the Bayesian inference allows us to fully account for the
standardcandle parameter dependence of the data covariance matrix. Comparing
with $\chi^2$ analysis results, we find a systematic offset of the marginal
model parameter bounds. We demonstrate that the bias is statistically
significant in the case of the SN Ia standardization parameters with a maximal
6 $\sigma$ shift of the SN lightcurve colour correction. In addition, we find
that the evidence for a host galaxy correction is now only 2.4 $\sigma$.
Systematic offsets on the cosmological parameters remain small, but may
increase by combining constraints from complementary cosmological probes. The
bias of the $\chi^2$ analysis is due to neglecting the parameterdependent
logdeterminant of the data covariance, which gives more statistical weight to
larger values of the standardization parameters. We find a similar effect on
compressed distance modulus data. To this end, we implement a fully consistent
compression method of the JLA data set that uses a Gaussian approximation of
the posterior distribution for fast generation of compressed data. Overall, the
results of our analysis emphasize the need for a fully consistent Bayesian
statistical approach in the analysis of future large SN Ia data sets.

In the readout electronics of the Water Cerenkov Detector Array (WCDA) in the
Large High Altitude Air Shower Observatory (LHAASO) experiment, both
highresolution charge and time measurement are required over a dynamic range
from 1 photoelectron (P.E.) to 4000 P.E. The Analog Frontend (AFE) circuit is
one of the crucial parts in the whole readout electronics. We designed and
optimized a prototype of the AFE through parameter calculation and circuit
simulation, and conducted initial electronics tests on this prototype to
evaluate its performance. Test results indicate that the charge resolution is
better than 1% @ 4000 P.E. and remains better than 10% @ 1 P.E., and the time
resolution is better than 0.5 ns RMS, which is better than application
requirement.

Estimating similarity between vertices is a fundamental issue in network
analysis across various domains, such as social networks and biological
networks. Methods based on common neighbors and structural contexts have
received much attention. However, both categories of methods are difficult to
scale up to handle large networks (with billions of nodes). In this paper, we
propose a sampling method that provably and accurately estimates the similarity
between vertices. The algorithm is based on a novel idea of random path, and an
extended method is also presented, to enhance the structural similarity when
two vertices are completely disconnected. We provide theoretical proofs for the
errorbound and confidence of the proposed algorithm. We perform extensive
empirical study and show that our algorithm can obtain topk similar vertices
for any vertex in a network approximately 300x faster than stateoftheart
methods. We also use identity resolution and structural hole spanner finding,
two important applications in social networks, to evaluate the accuracy of the
estimated similarities. Our experimental results demonstrate that the proposed
algorithm achieves clearly better performance than several alternative methods.

We study the holographic and agegraphic dark energy models without
interaction using the latest observational Hubble parameter data (OHD), the
Union2.1 compilation of type Ia supernovae (SNIa), and the energy conditions.
Scenarios of dark energy are distinguished by the cutoff of cosmic age,
conformal time, and event horizon. The bestfit value of matter density for the
three scenarios almost steadily located at $\Omega_{m0}=0.26$ by the joint
constraint. For the agegraphic models, they can be recovered to the standard
cosmological model when the constant $c$ which presents the fraction of dark
energy approaches to infinity. Absence of upper limit of $c$ by the joint
constraint demonstrates the recovery possibility. Using the fitted result, we
also reconstruct the current equation of state of dark energy at different
scenarios, respectively. Employing the model criteria
$\chi^2_{\textrm{min}}/dof$, we find that conformal time model is the worst,
but they can not be distinguished clearly. Comparing with the observational
constraints, we find that SEC is fulfilled at redshift $0.2 \lesssim z \lesssim
0.3$ with $1\sigma$ confidence level. We also find that NEC gives a meaningful
constraint for the event horizon cutoff model, especially compared with OHD
only. We note that the energy condition maybe could play an important role in
the interacting models because of different degeneracy between $\Omega_m$ and
constant $c$.