• The thermodynamics of the Universe in the Eddington-Born-Infeld (EBI) theory is revisited using the holographic-style gravitational equations that govern the dynamics of the cosmical apparent horizon $\Upsilon_{A}$ and the evolution of the Universe. We first applied the Bigravity method to rewrite the EBI action of the Palatini approach as a Bigravity-type action with an extra metric $q_{\mu\nu}$. We then derived the holographic-style dynamical equations and discussed the properties of the cosmical apparent horizon $\Upsilon_{A}$, including its timelike, spacelike and null characters, in the EBI Universe. Applying the Misner-Sharp energy, the Cai-Kim temperature $\hat{T}_{A}$ and the Hawking-Bekenstein entropy $S_{A}$, we obtained the unified first law for the gravitational thermodynamics of the EBI Universe and the total energy differential for the open system enveloped by $\Upsilon_{A}$. Finally, we used the Gibbs equation in the positive-heat-out sign convention to derive the generalized second laws of the nondecreasing entropy $S_{tot}^{(A)}$ enclosed by $\Upsilon_{A}$ in the EBI Universe.
  • The iron-based superconductors are characterized by multiple-orbital physics in which all five Fe 3$d$ orbitals are involved. The multiple-orbital nature gives rise to various novel phenomena, such as the orbital-selective Mott transition, nematicity and orbital fluctuation, that provide a new route for realizing superconductivity. The multiple-orbital complexity also calls for disentangling the relationship between orbital, spin and nematicity, and for identifying the dominant orbital ingredients that dictate superconductivity. The bulk FeSe superconductor provides an ideal platform to address these issues because of its simple crystal structure and unique coexistence of superconductivity and nematicity. However, the orbital nature of the low energy electronic excitations and its relation to the superconducting gap remain controversial. Here we report direct observation of a highly anisotropic Fermi surface and an extremely anisotropic superconducting gap in the nematic state of the FeSe superconductor by high resolution laser-based angle-resolved photoemission measurements. We find that the low energy excitations of the entire hole pocket at the Brillouin zone center are dominated by the single $d_{xz}$ orbital. The superconducting gap is anti-correlated with the $d_{xz}$ spectral weight near the Fermi level, i.e., the gap size minimum (maximum) corresponds to the maximum (minimum) of the $d_{xz}$ spectral weight along the Fermi surface. These observations provide new insights into the orbital origin of the extremely anisotropic superconducting gap in the FeSe superconductor and the relation between nematicity and superconductivity in the iron-based superconductors.
  • In this paper, novel methods are presented to measure the optical properties of muon detectors (MDs) in the Large High-Altitude Air Shower Observatory (LHAASO), which can also be used by other experiments. Each MD consists of a cylindrical water Cherenkov detector, with a Tyvek liner containing pure water, and a photomultiplier tube (PMT) mounted on the top. The arrival times of the photons collected by the PMT in the water Cherenkov detector follow an approximately exponential distribution, whose decay factor is determined by the photon absorption length in the water, the reflectivity of the inner Tyvek surface, the mean step length of photons between reflections on the inner surface, and the ratio of the reflective Tyvek area to the total inner surface area. By considering the principles of photon propagation in the water Cherenkov detector, we have developed novel methods to measure the water absorption length, the Tyvek reflectivity, and the mean step length of photons. The step length of photons can be determined by measuring the time distribution of the reflected photons hitting the PMT after multiple photons of a single wavelength are generated in the tank, with slightly different ratios of the inner Tyvek area to the total inner area. The water absorption length and Tyvek reflectivity can be measured simultaneously by the PMT by changing the height of the water, while the step length and Tyvek reflectivity in air can also be measured simultaneously. The proposed methods are supported by the derivation of formulae and verified by GEANT4 simulations and a prototype experiment.
  • This paper introduces a new and effective algorithm for learning kernels in a Multi-Task Learning (MTL) setting. Although we consider an MTL scenario here, our approach can easily be applied to standard single-task learning as well. As shown by our empirical results, our algorithm consistently outperforms traditional kernel learning algorithms such as the uniform combination solution, convex combinations of base kernels, and some kernel alignment-based models, which have been shown to give promising results in the past. We present a Rademacher complexity bound, based on which a new Multi-Task Multiple Kernel Learning (MT-MKL) model is derived. In particular, we propose a Support Vector Machine-regularized model in which, for each task, an optimal kernel is learned based on a neighborhood-defining kernel that is not restricted to be positive semi-definite. Comparative experimental results are showcased that underline the merits of our neighborhood-defining framework in both classification and regression problems.
  • We report comprehensive angle-resolved photoemission investigations of the electronic structure of single-crystal multiple-layer FeSe films grown on CaF2 substrates by the pulsed laser deposition (PLD) method. Measurements on FeSe/CaF2 samples with different superconducting transition temperatures Tc of 4 K, 9 K and 14 K reveal differences in their Fermi surface and band structure. Indications of the nematic phase transition are observed in temperature-dependent measurements of these samples; the nematic transition temperature is 140-160 K, much higher than the 90 K of bulk FeSe. Potassium is deposited onto the surface of these samples; the nematic phase is suppressed by the potassium deposition, which introduces electrons into these FeSe films and causes a pronounced change in the electronic structure. We compare and discuss the electronic structure and superconductivity of the PLD-grown FeSe/CaF2 films, the FeSe/SrTiO3 films grown by molecular beam epitaxy (MBE), and bulk FeSe. The PLD-grown multiple-layer FeSe/CaF2 films are more hole-doped than the MBE-grown multiple-layer FeSe films. Our results on PLD-grown FeSe/CaF2 films establish a link between bulk FeSe single crystals and MBE-grown FeSe/SrTiO3 films, and provide important information for understanding superconductivity in FeSe-related systems.
  • Nano-thick metallic transition metal dichalcogenides such as VS$_{2}$ are essential building blocks for constructing next-generation electronic and energy-storage applications, as well as for exploring unique physical issues associated with the dimensionality effect. However, such 2D layered materials have yet to be achieved through either mechanical exfoliation or bottom-up synthesis. Herein, we report a facile chemical vapor deposition route for direct production of crystalline VS$_{2}$ nanosheets with sub-10 nm thicknesses and domain sizes of tens of micrometers. The obtained nanosheets feature spontaneous superlattice periodicities and excellent electrical conductivities (~3$\times$10$^{3}$ S cm$^{-1}$), which have enabled a variety of applications, such as contact electrodes for monolayer MoS$_{2}$ with contact resistances ~1/4 of those of Ni/Au metals, and supercapacitor electrodes in aqueous electrolytes showing specific capacitances as high as 8.6$\times$10$^{2}$ F g$^{-1}$. This work provides fresh insights into the delicate structure-property relationship and the broad application prospects of such metallic 2D materials.
  • Topological quantum materials, including topological insulators and superconductors, Dirac semimetals and Weyl semimetals, have attracted much attention recently for their unique electronic structure, spin texture and physical properties. Very recently, a new type of Weyl semimetal has been proposed in which the Weyl fermions emerge at the boundary between electron and hole pockets in a new phase of matter, which is distinct from the standard type I Weyl semimetals with a point-like Fermi surface. The Weyl cone in these type II semimetals is strongly tilted and the related Fermi surface undergoes a Lifshitz transition, giving rise to a new kind of chiral anomaly and other new physics. MoTe2 has been proposed as a candidate type II Weyl semimetal; the sensitivity of its topological state to lattice constants and correlation also makes it an ideal platform to explore possible topological phase transitions. By performing laser-based angle-resolved photoemission (ARPES) measurements with unprecedentedly high resolution, we have uncovered electronic evidence of a type II semimetal state in MoTe2. We have established a full picture of the bulk electronic states and the surface state of MoTe2 that is consistent with band structure calculations. A single branch of surface state is identified that connects bulk hole pockets and bulk electron pockets. Detailed temperature-dependent ARPES measurements show high-intensity spot-like features that lie ~40 meV above the Fermi level, close to the momentum-space locations theoretically expected for the type II Weyl points. Our results constitute electronic evidence on the nature of the Weyl semimetal state that favors the presence of two sets of type II Weyl points in MoTe2.
  • In recent decades, a number of centrality metrics describing network properties of nodes have been proposed to rank the importance of nodes. In order to understand the correlations between centrality metrics and to approximate a high-complexity centrality metric by a strongly correlated low-complexity metric, we first study the correlation between centrality metrics in terms of their Pearson correlation coefficient and their similarity in ranking of nodes. In addition to considering the widely used centrality metrics, we introduce a new centrality measure, the degree mass. The mth-order degree mass of a node is the sum of the weighted degrees of the node and of its neighbors no further than m hops away. We find that the B_{n}, the closeness, and the components of x_{1} are strongly correlated with the degree, the 1st-order degree mass and the 2nd-order degree mass, respectively, in both network models and real-world networks. We then theoretically prove that the Pearson correlation coefficient between x_{1} and the 2nd-order degree mass is larger than that between x_{1} and a lower-order degree mass. Finally, we investigate the effect of inflexible antagonists, selected based on different centrality metrics, in helping one opinion to compete with another in the inflexible antagonists opinion model. Interestingly, we find that selecting the inflexible antagonists based on the leverage, the B_{n}, or the degree is more effective in opinion competition than using other centrality metrics in all types of networks. This finding is consistent with our earlier observations that there is a strong linear correlation between the degree and the B_{n}, as well as a high centrality similarity between the leverage and the degree.
  • Traditionally, Multi-task Learning (MTL) models optimize the average of task-related objective functions, an intuitive approach that we refer to as Average MTL. However, a more general framework, referred to as Conic MTL, can be formulated by considering conic combinations of the objective functions instead; in this framework, Average MTL arises as a special case when all combination coefficients equal 1. Although the advantage of Conic MTL over Average MTL has been shown experimentally in previous works, no theoretical justification has been provided to date. In this paper, we derive a generalization bound for the Conic MTL method and demonstrate that the tightest bound is not necessarily achieved when all combination coefficients equal 1; hence, Average MTL may not always be the optimal choice, and it is important to consider Conic MTL. As a byproduct, the generalization bound also theoretically explains the good experimental results of previous relevant works. Finally, we propose a new Conic MTL model whose conic combination coefficients minimize the generalization bound, instead of being chosen heuristically as in previous methods. The rationale and advantages of our model are demonstrated and verified via a series of experiments comparing it with several other methods.
  • Synthesis of monolayer MoS2 is essential for fulfilling the potential of MoS2 in catalysis, optoelectronics, valleytronics and related fields. Herein, we report for the first time the scalable growth of high-quality, domain-size-tunable (edge length from ~200 nm to 50 $\mu$m), strictly monolayer MoS2 on commercially available Au foils via a low-pressure chemical vapor deposition method. The nanosized triangular MoS2 flakes on Au foils were shown to be an excellent electrocatalyst for the hydrogen evolution reaction (HER), featuring a rather low Tafel slope (61 mV/decade) and a high exchange current density (38.1 $\mu$A/cm$^{2}$). The abundant active edge sites and the excellent electron coupling between MoS2 and the Au foils account for the extraordinary HER activity. Our work demonstrates that strictly monolayer MoS2 assembled on a well-selected electrode can exhibit HER performance comparable or even superior to that of nanoparticle or few-layer MoS2 electrocatalysts.
  • In this paper we present two related, kernel-based Distance Metric Learning (DML) methods. Their respective models non-linearly map data from their original space to an output space, and subsequent distance measurements are performed in the output space via a Mahalanobis metric. The dimensionality of the output space can be directly controlled to facilitate the learning of a low-rank metric. Both methods allow for simultaneous inference of the associated metric and the mapping to the output space, which can be used to visualize the data, when the output space is 2- or 3-dimensional. Experimental results for a collection of classification tasks illustrate the advantages of the proposed methods over other traditional and kernel-based DML approaches.
  • A traditional and intuitively appealing Multi-Task Multiple Kernel Learning (MT-MKL) method is to optimize the sum (thus, the average) of objective functions with a (partially) shared kernel function, which allows information sharing amongst tasks. We point out that the obtained solution corresponds to a single point on the Pareto Front (PF) of a Multi-Objective Optimization (MOO) problem, which considers the concurrent optimization of all task objectives involved in the Multi-Task Learning (MTL) problem. Motivated by this observation, and arguing that the former approach is heuristic, we propose a novel Support Vector Machine (SVM) MT-MKL framework that considers an implicitly defined set of conic combinations of task objectives. We show that solving our framework produces solutions along a path on the aforementioned PF and that it subsumes the optimization of the average of objective functions as a special case. Using algorithms we derived, we demonstrate through a series of experimental results that the framework is capable of achieving better classification performance than other similar MTL approaches.
  • We study the behavior of the restricted maximum likelihood (REML) estimator under a misspecified linear mixed model (LMM) that has received much attention in recent genome-wide association studies. The asymptotic analysis establishes consistency of the REML estimator of the variance of the errors in the LMM, and convergence in probability of the REML estimator of the variance of the random effects in the LMM to a certain limit, which is equal to the true variance of the random effects multiplied by the limiting proportion of the nonzero random effects present in the LMM. The asymptotic results also establish the convergence rate (in probability) of the REML estimators, as well as a result regarding convergence of the asymptotic conditional variance of the REML estimator. The asymptotic results are fully supported by the results of empirical studies, which include extensive simulation studies that compare the performance of the REML estimator (under the misspecified LMM) with other existing methods.
  • Over the past few years, Multi-Kernel Learning (MKL) has received significant attention among data-driven feature selection techniques in the context of kernel-based learning. MKL formulations have been devised and solved for a broad spectrum of machine learning problems, including Multi-Task Learning (MTL). Solving different MKL formulations usually involves designing algorithms that are tailored to the problem at hand, which is, typically, a non-trivial accomplishment. In this paper we present a general Multi-Task Multi-Kernel Learning (Multi-Task MKL) framework that subsumes well-known Multi-Task MKL formulations, as well as several important MKL approaches on single-task problems. We then derive a simple algorithm that can solve the unifying framework. To demonstrate the flexibility of the proposed framework, we formulate a new learning problem, namely Partially-Shared Common Space (PSCS) Multi-Task MKL, and demonstrate its merits through experimentation.
  • Genome-wide association studies (GWAS) suggest that a complex disease is typically affected by many genetic variants with small or moderate effects. Identification of these risk variants remains a very challenging problem. Traditional approaches focusing on a single GWAS dataset alone ignore relevant information that could potentially improve our ability to detect these variants. We propose a novel statistical approach, named GPA, to perform integrative analysis of multiple GWAS datasets and functional annotations. Hypothesis testing procedures were developed to facilitate statistical inference of pleiotropy and enrichment of functional annotation. We applied our approach to perform a systematic analysis of five psychiatric disorders. GPA not only identified many weak signals missed by the original single-phenotype analysis, but also revealed interesting genetic architectures of these disorders. We also applied GPA to bladder cancer GWAS data with the ENCODE DNase-seq data from 125 cell lines and showed that GPA can detect cell lines that are more biologically relevant to the phenotype of interest.
  • This paper presents an RKHS, in general of vector-valued functions, intended to be used as a hypothesis space for multi-task classification. It extends similar hypothesis spaces that have previously been considered in the literature. Assuming this space, an improved Empirical Rademacher Complexity-based generalization bound is derived. The analysis is itself extended to an MKL setting. The connection between the proposed hypothesis space and a Group-Lasso type regularizer is discussed. Finally, experimental results on some SVM-based Multi-Task Learning problems underline the quality of the derived bounds and validate the paper's analysis.
  • Epidemics have so far been mostly studied in undirected networks. However, many real-world networks, such as the social network Twitter and the WWW networks, upon which information, emotion or malware spreads, are shown to be directed networks, composed of both unidirectional links and bidirectional links. We define the directionality as the percentage of unidirectional links. The epidemic threshold for the susceptible-infected-susceptible (SIS) epidemic has been proved to be 1/lambda_{1} in directed networks by the N-intertwined Mean-field Approximation, where lambda_{1}, also called the spectral radius, is the largest eigenvalue of the adjacency matrix. Here, we propose two algorithms to generate directed networks with a given degree distribution, where the directionality can be controlled. The effect of directionality on the spectral radius lambda_{1}, the principal eigenvector x_{1}, the spectral gap (lambda_{1}-|lambda_{2}|) and the algebraic connectivity |mu_{N-1}| is studied. Important findings are that the spectral radius lambda_{1} decreases with the directionality, and the spectral gap and the algebraic connectivity increase with the directionality. The extent of the decrease of the spectral radius depends on both the degree distribution and the degree-degree correlation rho_{D}. Hence, the epidemic threshold of directed networks is larger than that of undirected networks, and a random walk converges to its steady state faster in directed networks than in undirected networks with the same degree distribution.
  • An important task of human genetics studies is to accurately predict disease risk in individuals based on genetic markers, which allows individuals at high disease risk to be identified and facilitates their disease treatment and prevention. Although hundreds of genome-wide association studies (GWAS) have been conducted on many complex human traits in recent years, there has been only limited success in translating these GWAS data into clinically useful risk prediction models. The predictive capability of GWAS data is largely bottlenecked by the available training sample size due to the presence of numerous variants carrying only small to modest effects. Recent studies have shown that different human traits may share common genetic bases. Therefore, an attractive strategy to increase the training sample size and hence improve the prediction accuracy is to integrate data of genetically correlated phenotypes. Yet the utility of genetic correlation in risk prediction has not been explored in the literature. In this paper, we analyzed GWAS data for bipolar and related disorders (BARD) and schizophrenia (SZ) with a bivariate ridge regression method, and found that jointly predicting the two phenotypes could substantially increase prediction accuracy as measured by the AUC (area under the receiver operating characteristic curve). We also found similar prediction accuracy improvements when we jointly analyzed GWAS data for Crohn's disease (CD) and ulcerative colitis (UC). The empirical observations were substantiated through our comprehensive simulation studies, suggesting that a gain in prediction accuracy can be obtained by combining phenotypes with relatively high genetic correlations. Through both real data and simulation studies, we demonstrated pleiotropy as a valuable asset that opens up a new opportunity to improve genetic risk prediction in the future.
  • In genome-wide association studies (GWAS), penalization is an important approach for identifying genetic markers associated with a trait, while the mixed model is successful in accounting for a complicated dependence structure among samples. The penalized linear mixed model is therefore a tool that combines the advantages of the penalization approach and the linear mixed model. In this study, a GWAS with multiple highly correlated traits is analyzed. For a GWAS with multiple quantitative traits that are highly correlated, analyzing the traits marginally inevitably loses some essential information shared among the traits. We propose penalized-MTMM, a penalized multivariate linear mixed model that accommodates both the within-trait and between-trait variance components simultaneously for multiple traits. The proposed penalized-MTMM estimates variance components using an AI-REML method and conducts variable selection and point estimation simultaneously using group MCP and sparse group MCP. The best linear unbiased predictor (BLUP) is used to obtain predicted values, and the Pearson correlations between the predicted values and their corresponding observations are used to evaluate prediction performance. Both the prediction and selection performance of the proposed approach, and its comparison with the uni-trait penalized-LMM, are evaluated through simulation studies. We apply the proposed approach to GWAS data from Genetic Analysis Workshop (GAW) 18.
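
The last abstract above relies on the minimax concave penalty (MCP) and a group version of it for variable selection. As a rough illustration only, the sketch below evaluates the standard scalar MCP and one common form of group MCP, in which the scalar penalty is applied to the Euclidean norm of a coefficient group. The function names, the parameter names `lam` and `gamma`, and the choice of grouping are assumptions made here for illustration; the abstract does not specify the exact formulation or weighting used in penalized-MTMM.

```python
import math


def mcp_penalty(t, lam, gamma):
    """Standard scalar MCP: lam*|t| - t^2/(2*gamma) for |t| <= gamma*lam,
    and the constant gamma*lam^2/2 beyond that point."""
    if lam < 0 or gamma <= 0:
        raise ValueError("lam must be non-negative and gamma must be positive")
    a = abs(t)
    if a <= gamma * lam:
        return lam * a - a * a / (2.0 * gamma)
    return 0.5 * gamma * lam * lam


def group_mcp_penalty(beta_group, lam, gamma):
    """Illustrative group MCP: apply the scalar MCP to the Euclidean norm
    of one coefficient group (assumed here: one marker's effects across traits)."""
    norm = math.sqrt(sum(b * b for b in beta_group))
    return mcp_penalty(norm, lam, gamma)


if __name__ == "__main__":
    # Hypothetical values: one marker with effects on two traits.
    print(mcp_penalty(1.0, lam=1.0, gamma=3.0))
    print(group_mcp_penalty([0.3, -0.4], lam=1.0, gamma=3.0))
```

Unlike the lasso penalty, MCP flattens out once a coefficient (or group norm) exceeds `gamma * lam`, so large effects are not shrunk further; this is the usual motivation for preferring it in selection problems.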