• ### Statistical Test of Distance--Duality Relation with Type Ia Supernovae and Baryon Acoustic Oscillations(1604.04631)

July 13, 2018 astro-ph.CO
We test the distance--duality relation $\eta \equiv d_L / [ (1 + z)^2 d_A ] = 1$ between cosmological luminosity distance ($d_L$) from the JLA SNe Ia compilation (arXiv:1401.4064) and angular-diameter distance ($d_A$) based on Baryon Oscillation Spectroscopic Survey (BOSS; arXiv:1607.03155) and WiggleZ baryon acoustic oscillation measurements (arXiv:1105.2862, arXiv:1204.3674). The $d_L$ measurements are matched to $d_A$ redshift by a statistically consistent compression procedure. With Monte Carlo methods, nontrivial and correlated distributions of $\eta$ can be explored in a straightforward manner without resorting to a particular evolution template $\eta(z)$. Assuming independent constraints on the cosmological parameters that are necessary to obtain $d_L$ and $d_A$ values, we find a 9% constraint consistent with $\eta = 1$ from the analysis of SNIa + BOSS, and an 18% bound from SNIa + WiggleZ. These results are contrary to previous claims that $\eta < 1$ has been found close to or above the $1 \sigma$ level. We discuss the effect of different cosmological parameter inputs and the use of the apparent deviation from distance--duality as a proxy for systematic effects on cosmic distance measurements. The results suggest possible systematic overestimation of SNIa luminosity distances compared with $d_A$ data when a Planck $\Lambda$CDM cosmological parameter inference (arXiv:1502.01589) is used to enhance the precision. If interpreted as an extinction correction due to a gray dust component, the effect is broadly consistent with independent observational constraints.
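The Monte Carlo idea in this abstract can be illustrated with a minimal sketch. It assumes independent Gaussian errors on $d_L$ and $d_A$ at a single redshift, which is a simplification; the abstract describes correlated distributions. The function names and all numerical inputs below are hypothetical and are not taken from the paper.

```python
import numpy as np

def eta(d_l, d_a, z):
    """Distance-duality ratio eta = d_L / [(1 + z)^2 d_A]."""
    return d_l / ((1.0 + z) ** 2 * d_a)

def sample_eta(d_l, sigma_l, d_a, sigma_a, z, n_draws=100_000, seed=0):
    """Draw eta samples, assuming independent Gaussian errors (an assumption)."""
    rng = np.random.default_rng(seed)
    dl = rng.normal(d_l, sigma_l, n_draws)
    da = rng.normal(d_a, sigma_a, n_draws)
    return eta(dl, da, z)

# Hypothetical inputs, chosen so that the true eta is exactly 1.
z = 0.57
d_a = 1400.0                  # Mpc, illustrative value only
d_l = (1.0 + z) ** 2 * d_a    # luminosity distance implied by eta = 1
samples = sample_eta(d_l, 0.05 * d_l, d_a, 0.02 * d_a, z)

# Summarize the sampled distribution by its 16th, 50th and 84th percentiles.
lo, med, hi = np.percentile(samples, [16, 50, 84])
print(f"eta = {med:.3f} (+{hi - med:.3f} / -{med - lo:.3f})")
```

Sampling the ratio directly avoids assuming that $\eta$ itself is Gaussian, which is the reason a Monte Carlo treatment is convenient here.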
• ### Spectral Method and Regularized MLE Are Both Optimal for Top-$K$ Ranking(1707.09971)

This paper is concerned with the problem of top-$K$ ranking from pairwise comparisons. Given a collection of $n$ items and a few pairwise comparisons across them, one wishes to identify the set of $K$ items that receive the highest ranks. To tackle this problem, we adopt the logistic parametric model --- the Bradley-Terry-Luce model, where each item is assigned a latent preference score, and where the outcome of each pairwise comparison depends solely on the relative scores of the two items involved. Recent works have made significant progress towards characterizing the performance (e.g. the mean square error for estimating the scores) of several classical methods, including the spectral method and the maximum likelihood estimator (MLE). However, where they stand regarding top-$K$ ranking remains unsettled. We demonstrate that under a natural random sampling model, the spectral method alone, or the regularized MLE alone, is minimax optimal in terms of the sample complexity --- the number of paired comparisons needed to ensure exact top-$K$ identification, for the fixed dynamic range regime. This is accomplished via optimal control of the entrywise error of the score estimates. We complement our theoretical studies by numerical experiments, confirming that both methods yield low entrywise errors for estimating the underlying scores. Our theory is established via a novel leave-one-out trick, which proves effective for analyzing both iterative and non-iterative procedures. Along the way, we derive an elementary eigenvector perturbation bound for probability transition matrices, which parallels the Davis-Kahan $\sin\Theta$ theorem for symmetric matrices. This also allows us to close the gap between the $\ell_2$ error upper bound for the spectral method and the minimax lower limit.
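A minimal sketch of a spectral ranking estimate under the Bradley-Terry-Luce model follows. It builds a Markov chain from pairwise win fractions and uses its stationary distribution as the score estimate, in the style of Rank Centrality. The function names, the normalization by the maximum degree, and the toy scores are assumptions made for illustration, not details confirmed by the abstract.

```python
import numpy as np

def spectral_scores(win_frac, compared):
    """Estimate BTL scores from pairwise comparisons.

    win_frac[i, j]: fraction of i-vs-j comparisons won by j.
    compared[i, j]: True if the pair (i, j) was compared.
    """
    compared = compared.copy()
    np.fill_diagonal(compared, False)
    d = compared.sum(axis=1).max()           # normalization: maximum degree
    P = np.where(compared, win_frac, 0.0) / d
    np.fill_diagonal(P, 1.0 - P.sum(axis=1))  # make each row sum to one
    # Stationary distribution = left eigenvector of P for eigenvalue 1.
    vals, vecs = np.linalg.eig(P.T)
    pi = np.real(vecs[:, np.argmax(np.real(vals))])
    return pi / pi.sum()                      # also fixes the overall sign

def top_k(scores, k):
    """Indices of the k largest estimated scores."""
    return set(np.argsort(-scores)[:k].tolist())

# Toy check with exact (noise-free) win probabilities w_j / (w_i + w_j).
w = np.array([1.0, 2.0, 3.0, 4.0, 5.0])       # hypothetical latent scores
win_frac = w[None, :] / (w[:, None] + w[None, :])
compared = np.ones((5, 5), dtype=bool)
scores = spectral_scores(win_frac, compared)
print(scores, top_k(scores, 2))
```

With exact win probabilities the chain satisfies detailed balance, so the stationary distribution is proportional to the latent scores; with sampled comparisons it is only an estimate.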
• ### Trajectory Factory: Tracklet Cleaving and Re-connection by Deep Siamese Bi-GRU for Multiple Object Tracking(1804.04555)

April 12, 2018 cs.CV
Multi-Object Tracking (MOT) is a challenging task in complex scenes such as surveillance and autonomous driving. In this paper, we propose a novel tracklet processing method that uses a Siamese Bi-Gated Recurrent Unit (GRU) to cleave and re-connect tracklets under crowding or long-term occlusion. The tracklet generation utilizes object features extracted by a CNN and an RNN to create high-confidence tracklet candidates in sparse scenarios. Because mis-tracking during generation can merge different objects into one tracklet, a bidirectional GRU splits such tracklets into several sub-tracklets. After that, a Siamese GRU based tracklet re-connection method links the sub-tracklets that belong to the same object to form a whole trajectory. In addition, we extract the tracklet images from existing MOT datasets and propose a novel dataset to train our networks. The proposed dataset contains more than 95160 pedestrian images of 793 different persons, with on average 120 images per person, each annotated with position and size. Experimental results demonstrate the advantages of our model over the state-of-the-art methods on MOT16.
• ### Gradient Descent with Random Initialization: Fast Global Convergence for Nonconvex Phase Retrieval(1803.07726)

March 21, 2018 cs.IT, math.IT, cs.NA, math.OC, cs.LG, stat.ML
This paper considers the problem of solving systems of quadratic equations, namely, recovering an object of interest $\mathbf{x}^{\natural}\in\mathbb{R}^{n}$ from $m$ quadratic equations/samples $y_{i}=(\mathbf{a}_{i}^{\top}\mathbf{x}^{\natural})^{2}$, $1\leq i\leq m$. This problem, also dubbed phase retrieval, spans multiple domains including physical sciences and machine learning. We investigate the efficiency of gradient descent (or Wirtinger flow) designed for the nonconvex least squares problem. We prove that under Gaussian designs, gradient descent --- when randomly initialized --- yields an $\epsilon$-accurate solution in $O\big(\log n+\log(1/\epsilon)\big)$ iterations given nearly minimal samples, thus achieving near-optimal computational and sample complexities at once. This provides the first global convergence guarantee concerning vanilla gradient descent for phase retrieval, without the need for (i) carefully-designed initialization, (ii) sample splitting, or (iii) sophisticated saddle-point escaping schemes. All of these are achieved by exploiting the statistical models in analyzing optimization algorithms, via a leave-one-out approach that enables the decoupling of certain statistical dependency between the gradient descent iterates and the data.
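The procedure described above can be sketched as plain gradient descent on the least squares loss $f(\mathbf{x})=\frac{1}{4m}\sum_{i}\big((\mathbf{a}_{i}^{\top}\mathbf{x})^{2}-y_{i}\big)^{2}$, started from a random Gaussian point. The problem sizes, step size, iteration count and the scale of the random initialization are assumptions chosen for a small demonstration, not values taken from the paper.

```python
import numpy as np

def gd_phase_retrieval(A, y, step=0.05, n_iters=2000, seed=0):
    """Vanilla gradient descent on f(x) = (1/4m) * sum((a_i^T x)^2 - y_i)^2."""
    m, n = A.shape
    rng = np.random.default_rng(seed)
    # Random initialization with entries of variance 1/n (an assumed scale).
    x = rng.normal(scale=1.0 / np.sqrt(n), size=n)
    for _ in range(n_iters):
        Ax = A @ x
        grad = A.T @ ((Ax ** 2 - y) * Ax) / m
        x = x - step * grad
    return x

# Small synthetic instance with a Gaussian design and a unit-norm signal.
rng = np.random.default_rng(1)
n, m = 10, 400
x_true = rng.normal(size=n)
x_true /= np.linalg.norm(x_true)
A = rng.normal(size=(m, n))
y = (A @ x_true) ** 2

x_hat = gd_phase_retrieval(A, y)
# The signal is identifiable only up to a global sign.
err = min(np.linalg.norm(x_hat - x_true), np.linalg.norm(x_hat + x_true))
print(f"estimation error up to sign: {err:.2e}")
```

No spectral initialization, sample splitting or saddle-escaping step is used here, which is the setting the abstract refers to as vanilla gradient descent.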
• ### Nonconvex Matrix Factorization from Rank-One Measurements(1802.06286)

Feb. 17, 2018 cs.IT, math.IT, cs.LG, stat.ML
We consider the problem of recovering low-rank matrices from random rank-one measurements, which spans numerous applications including covariance sketching, phase retrieval, quantum state tomography, and learning shallow polynomial neural networks, among others. Our approach is to directly estimate the low-rank factor by minimizing a nonconvex quadratic loss function via vanilla gradient descent, following a tailored spectral initialization. When the true rank is small, this algorithm is guaranteed to converge to the ground truth (up to global ambiguity) with near-optimal sample complexity and computational complexity. To the best of our knowledge, this is the first guarantee that achieves near-optimality in both metrics. In particular, the key enabler of near-optimal computational guarantees is an implicit regularization phenomenon: without explicit regularization, both spectral initialization and the gradient descent iterates automatically stay within a region incoherent with the measurement vectors. This feature allows one to employ much more aggressive step sizes than those suggested in prior literature, without the need for sample splitting.
• ### Inter-Subject Analysis: Inferring Sparse Interactions with Dense Intra-Graphs(1709.07036)

Sept. 20, 2017 math.ST, stat.TH, stat.ME, stat.ML
We develop a new modeling framework for Inter-Subject Analysis (ISA). The goal of ISA is to explore the dependency structure between different subjects, treating the intra-subject dependency as a nuisance. It has important applications in neuroscience, where it is used to explore the functional connectivity between brain regions under natural stimuli. Our framework is based on Gaussian graphical models, under which ISA can be converted to the problem of estimation and inference of the inter-subject precision matrix. The main statistical challenge is that we do not impose a sparsity constraint on the whole precision matrix; we only assume that the inter-subject part is sparse. For estimation, we propose to estimate an alternative parameter to get around the non-sparsity issue, and the resulting estimator achieves asymptotic consistency even if the intra-subject dependency is dense. For inference, we propose an "untangle and chord" procedure to de-bias our estimator. It is valid without the sparsity assumption on the inverse Hessian of the log-likelihood function. This inferential method is general and can be applied to many other statistical problems, so it is of independent theoretical interest. Numerical experiments on both simulated and brain imaging data validate our methods and theory.
• ### Application of Bayesian graphs to SN Ia data analysis and compression(1603.08519)

Sept. 8, 2016 astro-ph.CO
Bayesian graphical models are an efficient tool for modelling complex data and deriving self-consistent expressions of the posterior distribution of model parameters. We apply Bayesian graphs to perform statistical analyses of Type Ia supernova (SN Ia) luminosity distance measurements from the joint light-curve analysis (JLA) data set. In contrast to the $\chi^2$ approach used in previous studies, the Bayesian inference allows us to fully account for the standard-candle parameter dependence of the data covariance matrix. Compared with the $\chi^2$ analysis results, we find a systematic offset of the marginal model parameter bounds. We demonstrate that the bias is statistically significant in the case of the SN Ia standardization parameters, with a maximal 6 $\sigma$ shift of the SN light-curve colour correction. In addition, we find that the evidence for a host galaxy correction is now only 2.4 $\sigma$. Systematic offsets on the cosmological parameters remain small, but may increase when constraints from complementary cosmological probes are combined. The bias of the $\chi^2$ analysis is due to neglecting the parameter-dependent log-determinant of the data covariance, which gives more statistical weight to larger values of the standardization parameters. We find a similar effect on compressed distance modulus data. To this end, we implement a fully consistent compression method of the JLA data set that uses a Gaussian approximation of the posterior distribution for fast generation of compressed data. Overall, the results of our analysis emphasize the need for a fully consistent Bayesian statistical approach in the analysis of future large SN Ia data sets.
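The role of the log-determinant term can be seen in a toy Gaussian likelihood whose covariance grows with a standardization-like parameter $\beta$. The sketch below is an illustration under strong assumptions: the diagonal covariance model, the noise levels and the fixed residuals are all hypothetical, and in a real analysis the residuals would also depend on $\beta$.

```python
import numpy as np

def neg2_log_like(resid, cov, include_logdet=True):
    """-2 ln L for a zero-mean Gaussian; optionally drop the ln det(cov) term."""
    chi2 = resid @ np.linalg.solve(cov, resid)
    if not include_logdet:
        return chi2  # plain chi-square, as in a fit that ignores ln det
    _, logdet = np.linalg.slogdet(cov)
    return chi2 + logdet + resid.size * np.log(2.0 * np.pi)

def toy_cov(beta, sigma_m=0.1, sigma_c=0.05, n=50):
    """Hypothetical covariance: variance sigma_m^2 + beta^2 * sigma_c^2 per point."""
    return (sigma_m ** 2 + beta ** 2 * sigma_c ** 2) * np.eye(n)

rng = np.random.default_rng(0)
resid = rng.normal(scale=0.15, size=50)   # residuals held fixed for illustration
betas = np.linspace(1.0, 5.0, 41)

chi2_only = np.array([neg2_log_like(resid, toy_cov(b), False) for b in betas])
full = np.array([neg2_log_like(resid, toy_cov(b), True) for b in betas])

print("beta minimizing chi2 only:      ", betas[np.argmin(chi2_only)])
print("beta minimizing full likelihood:", betas[np.argmin(full)])
```

In this toy setting the plain $\chi^2$ keeps decreasing as $\beta$ inflates the covariance, so its minimum sits at the upper edge of the grid, while the full likelihood is penalized by the growing determinant and has an interior minimum.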
• ### The Analog Front-end Prototype Electronics Designed for LHAASO WCDA(1504.05649)

April 22, 2015 nucl-ex, physics.ins-det
In the readout electronics of the Water Cerenkov Detector Array (WCDA) in the Large High Altitude Air Shower Observatory (LHAASO) experiment, both high-resolution charge and time measurements are required over a dynamic range from 1 photoelectron (P.E.) to 4000 P.E. The Analog Front-end (AFE) circuit is one of the crucial parts of the readout electronics. We designed and optimized a prototype of the AFE through parameter calculation and circuit simulation, and conducted initial electronics tests on this prototype to evaluate its performance. Test results indicate that the charge resolution is better than 1% @ 4000 P.E. and remains better than 10% @ 1 P.E., and the time resolution is better than 0.5 ns RMS, which exceeds the application requirement.
• ### Panther: Fast Top-k Similarity Search in Large Networks(1504.02577)

April 13, 2015 cs.SI
Estimating similarity between vertices is a fundamental issue in network analysis across various domains, such as social networks and biological networks. Methods based on common neighbors and structural contexts have received much attention. However, both categories of methods are difficult to scale up to handle large networks (with billions of nodes). In this paper, we propose a sampling method that provably and accurately estimates the similarity between vertices. The algorithm is based on a novel idea of random paths, and an extended method is also presented to enhance the structural similarity when two vertices are completely disconnected. We provide theoretical proofs for the error bound and confidence of the proposed algorithm. We perform an extensive empirical study and show that our algorithm can obtain top-k similar vertices for any vertex in a network approximately 300x faster than state-of-the-art methods. We also use identity resolution and structural hole spanner finding, two important applications in social networks, to evaluate the accuracy of the estimated similarities. Our experimental results demonstrate that the proposed algorithm achieves clearly better performance than several alternative methods.
• ### Cosmological constraints on holographic dark energy models under the energy conditions(1303.0384)

Sept. 6, 2013 astro-ph.CO
We study the holographic and agegraphic dark energy models without interaction using the latest observational Hubble parameter data (OHD), the Union2.1 compilation of type Ia supernovae (SNIa), and the energy conditions. The dark energy scenarios are distinguished by their cut-off: cosmic age, conformal time, or event horizon. Under the joint constraint, the best-fit matter density for all three scenarios stays close to $\Omega_{m0}=0.26$. The agegraphic models reduce to the standard cosmological model when the constant $c$, which represents the fraction of dark energy, approaches infinity. The absence of an upper limit on $c$ under the joint constraint shows that this recovery is possible. Using the fitted results, we also reconstruct the current equation of state of dark energy in each scenario. Employing the model criterion $\chi^2_{\textrm{min}}/dof$, we find that the conformal time model performs worst, although the models cannot be clearly distinguished. Comparing with the observational constraints, we find that the SEC is fulfilled at redshift $0.2 \lesssim z \lesssim 0.3$ at the $1\sigma$ confidence level. We also find that the NEC gives a meaningful constraint for the event horizon cut-off model, especially compared with OHD only. We note that the energy conditions may play an important role in interacting models because of the different degeneracy between $\Omega_m$ and the constant $c$.