• A large number of recent genome-wide association studies (GWASs) for complex phenotypes confirm the early conjecture of polygenicity, suggesting the presence of a large number of variants with only tiny or moderate effects. However, due to the limited sample size of a single GWAS, many associated genetic variants are too weak to achieve genome-wide significance. These undiscovered variants further limit the prediction capability of GWAS. Restricted access to individual-level data and the increasing availability of published GWAS results motivate the development of methods integrating both individual-level and summary-level data. How the connection between the individual-level and summary-level data is built determines how efficiently the abundant existing summary-level resources can be used with limited individual-level data, and this issue calls for further work in the area. In this study, we propose a novel statistical approach, LEP, which provides a new way of modeling the connection between individual-level data and summary-level data. LEP integrates both types of data by \underline{LE}veraging \underline{P}leiotropy to increase the statistical power of risk variant identification and the accuracy of risk prediction. The algorithm for parameter estimation is developed to handle genome-wide-scale data. Through comprehensive simulation studies, we demonstrated the advantages of LEP over the existing methods. We further applied LEP to perform integrative analysis of Crohn's disease from WTCCC and summary statistics from GWAS of some other diseases, such as Type 1 diabetes, Ulcerative colitis and Primary biliary cirrhosis. LEP was able to significantly increase the statistical power of identifying risk variants and improve the risk prediction accuracy from 63.39\% ($\pm$ 0.58\%) to 68.33\% ($\pm$ 0.32\%) using about 195,000 variants.
  • In the medical domain, identifying and expanding abbreviations in clinical texts is a vital task for both human and machine understanding. It is a challenging task because many abbreviations are ambiguous, especially in intensive care medicine texts, in which phrase abbreviations are frequently used. Besides the fact that there is no universal dictionary of clinical abbreviations and no universal rules for abbreviation writing, such texts are difficult to acquire, expensive to annotate and sometimes confusing even to domain experts. This paper proposes a novel and effective approach -- exploiting task-oriented resources to learn word embeddings for expanding abbreviations in clinical notes. We achieved 82.27\% accuracy, close to expert human performance.
  • In the present study a mathematical model of long-crested water waves propagating mainly in one direction under the effect of the Earth's rotation is derived by following formal asymptotic procedures. This model equation is analogous to the Camassa-Holm approximation of the two-dimensional incompressible and irrotational Euler equations and has a formal bi-Hamiltonian structure. Its solution corresponding to physically relevant initial perturbations is more accurate on a much longer time scale. It is shown that the deviation of the free surface can be determined by the horizontal velocity at a certain depth in the second-order approximation. The effects of the Coriolis force caused by the Earth's rotation and of nonlocal higher nonlinearities on blow-up criteria and wave-breaking phenomena are also investigated. Our refined analysis applies the method of characteristics and conserved quantities to a Riccati-type differential inequality.
  • We consider an asymptotic 1D (in space) rotation-Camassa-Holm (R-CH) model, which could be used to describe the propagation of long-crested shallow-water waves in the equatorial ocean regions with allowance for the weak Coriolis effect due to the Earth's rotation. This model equation exhibits wave-breaking phenomena similar to those of the Camassa-Holm equation. It is analogous to the rotation-Green-Naghdi (R-GN) equations with the weak Earth's rotation effect, which model the propagation of waves of possibly large amplitude in shallow water. We provide here a rigorous justification showing that solutions of the R-GN equations tend to the associated solution of the R-CH model equation in the Camassa-Holm regime of small amplitude and large wavelength. Furthermore, we demonstrate that the R-GN model equations are locally well-posed in a Sobolev space by means of refined energy estimates.
  • A matched filter technique is applied to the Planck all-sky Compton y-parameter map to measure the thermal Sunyaev-Zel'dovich (tSZ) effect produced by galaxy groups of different halo masses selected from large redshift surveys in the low-z Universe. Reliable halo mass estimates are available for all the groups, which allows us to bin groups of similar halo masses to investigate how the tSZ effect depends on halo mass over a large mass range. Filters are simultaneously matched for all groups to minimize projection effects. We find that the integrated y-parameter and the hot gas content it implies are consistent with the predictions of the universal pressure profile model only for massive groups above $10^{14}\,{\rm M}_\odot$, but much lower than the model prediction for low-mass groups. The halo mass dependence found is in good agreement with the predictions of a set of simulations that include strong AGN feedback, but simulations including only supernova feedback significantly overpredict the hot gas content of galaxy groups. Our results suggest that hot gas in galaxy groups is either effectively ejected or in phases much below the virial temperatures of the host halos.
  • Through the integration of the power spectral density, we obtain temperature profiles of both multi-segment harmonic and anharmonic systems, showing the presence of an anomalous negative temperature gradient inside the interfacial segment. By investigating patterns of the power spectral density, we found that this counterintuitive phenomenon comes from the presence of interfacial localized phonon modes. Two out-band localized modes of the harmonic model, which make no contribution to the local temperature due to the absence of phonon interactions, result in the concave temperature profile and over-cooling effect. For the anharmonic model, thanks to the phonon-phonon interactions, the localized modes are excited and make considerable contributions to the interfacial temperature, which is clearly shown by examining the temperature accumulation function. When anharmonicity is considerably large, the negative temperature gradient is absent since the localized phonon modes are fully mixed. The presence of localized modes is clearly demonstrated by the inverse participation ratio and normal mode analysis for the isolated harmonic model.
  • Anisotropic charge carrier transport in black phosphorus limited by ionized impurity scattering at finite temperature is explored theoretically. The anisotropic electronic structure enters the calculation for the polarizability (screening), the momentum relaxation time, and the mobility. For finite temperature, scattering is not limited to the Fermi surface and the polarizability is temperature dependent. The impact of screening is investigated in detail with its dependence on carrier density and temperature. Competing with the thermal excitation effects, the temperature dependence of the polarizability is found to dominate for T<100K. As a result, the charge carrier mobility slowly decreases with increasing temperature. The weak temperature dependence of the mobility and its anisotropy ratio of 1.9-3.2 agree with published experimental data.
  • We propose a recursive algorithm for the numerical computation of the optimal value function $\inf_{t\le\tau\le T} E \Big[\sup_{0\le s\le T } Y_s / Y_{\tau} \big| {\cal F}_t\Big]$ over the stopping times $\tau$ with respect to the filtration of a geometric Brownian motion $Y_t$ with Markovian regime switching. This method allows us to determine the boundary functions of the optimal stopping set when no associated Volterra integral equation is available. It applies in particular when regime-switching drifts have mixed signs, in which case the boundary functions may not be monotone.
  • We consider a variation of the Kuramoto model with dynamic coupling, where the coupling strengths are allowed to evolve in response to the phase difference between the oscillators, a model first considered by Ha, Noh and Park. In particular we study the stability of fixed points for this model. We demonstrate a somewhat surprising fact: namely that the fixed points of this model, as well as their stability, can be completely expressed in terms of the fixed points and stability of the analogous classical Kuramoto problem where the coupling strengths are fixed to a constant (the same for all edges). In particular for the "all-to-all" network, where the underlying graph is the complete graph, the problem reduces to the problem of understanding the fixed points and stability of the all-to-all Kuramoto model with equal edge weights, a problem that has been completely solved.
  • An algebraic representation of Turing machines is given, where the configurations of Turing machines are represented by fourth-order tensors, and the transition functions by eighth-order tensors. Two types of tensor product are defined: one models the evolution of the Turing machines, and the other models the composition of transition functions. It is shown that the two types of tensor product are harmonic in the sense that the associative law is obeyed.
  • This paper deals with optimal prediction in a regime-switching model driven by a continuous-time Markov chain. We extend existing results for geometric Brownian motion by deriving optimal stopping strategies that depend on the current regime state, and prove a number of continuity properties relating to optimal value and boundary functions. Our approach replaces the use of closed form expressions, which are not available in our setting, with PDE arguments that also simplify the approach of [2] in the classical Brownian case.
  • We study the charged-impurity-limited mobility in black phosphorus, a highly anisotropic layered material. We compute the mobility within the Boltzmann transport equation under the detailed balance condition, taking into account the anisotropy in transport and electronic structure. For carrier densities accessible in experiments, we obtain an anisotropy ratio of 3 ~ 4 at zero temperature, twofold larger than that observed in experiments on multilayer samples. We also discuss how the anisotropy depends on carrier density and impurity distribution.
  • The integrable Novikov equation can be regarded as one of the Camassa-Holm-type equations with cubic nonlinearity. In this paper, we prove the global existence and uniqueness of the H\"older continuous energy conservative solutions for the Cauchy problem of the Novikov equation.
  • Lars Onsager and Richard Feynman envisioned that the three-dimensional (3D) superfluid-to-normal $\lambda$ transition in $^{4}$He occurs through the proliferation of vortices. This process should hold for every phase transition in the same universality class. The role of topological defects in symmetry-breaking phase transitions has become a prime topic in cosmology and high-temperature superconductivity, even though direct imaging of these defects is challenging. Here we show that the U(1) continuous symmetry that emerges at the ferroelectric critical point of multiferroic hexagonal manganites leads to a similar proliferation of vortices. Moreover, the disorder field (vortices) is coupled to an emergent U(1) gauge field, which becomes massive by means of the Higgs mechanism when vortices condense (span the whole system) upon heating above the ferroelectric transition temperature. Direct imaging of the vortex network in hexagonal manganites offers unique experimental access to this dual description of the ferroelectric transition, while enabling tests of the Kibble-Zurek mechanism.
  • Recently, it has been shown that under pressure, unexpected and counterintuitive chemical compounds become stable. Laser shock experiments (A. Rode, unpublished) on alumina (Al2O3) have shown non-equilibrium decomposition of alumina with the formation of free Al and a mysterious transparent phase. Inspired by these observations, we have explored the possibility of the formation of new chemical compounds in the system Al-O. Using the variable-composition structure prediction algorithm USPEX, in addition to the well-known Al2O3, we have found two extraordinary compounds Al4O7 and AlO2 to be thermodynamically stable in the pressure range 330-443 GPa and above 332 GPa, respectively. Both of these compounds contain oxide O$^{2-}$ and peroxide O$_2^{2-}$ ions at the same time, and both are insulating. Peroxo-groups are responsible for gap states, which significantly reduce the electronic band gap of both Al4O7 and AlO2.
  • In a previous paper, some of us studied general relativistic homogeneous gravitational collapses for dust and radiation, in which the density profile was replaced by an effective density justified by some quantum gravity models. It was found that the effective density introduces an effective pressure that becomes negative and dominant in the strong-field regime. With this set-up, the central singularity is replaced by a bounce, after which the cloud starts expanding. Motivated by the fact that in the classical case homogeneous and inhomogeneous collapse models have different properties, here we extend our previous work to the inhomogeneous case. As in the quantum-inspired homogeneous collapse model, the classical central singularity is replaced by a bounce, but the inhomogeneities strongly affect the structure of the bounce curve and of the trapped region.
  • In Smart Grid applications, as the number of deployed electric smart meters increases, massive amounts of valuable meter data are generated and collected every day. To enable reliable data collection and make business decisions fast, high-throughput storage and high-performance analysis of massive meter data become crucial for grid companies. Given the high efficiency, fault tolerance, and price-performance of Hadoop and Hive systems, they are frequently deployed as the underlying platform for big data processing. However, in real business use cases, these data analysis applications typically involve multidimensional range queries (MDRQ) as well as batch reading and statistics on the meter data. While Hive performs well at complex batch reading and analysis of data, it lacks efficient indexing techniques for MDRQ. In this paper, we propose DGFIndex, an index structure for Hive that efficiently supports MDRQ for massive meter data. DGFIndex divides the data space into cubes using the grid file technique. Unlike the existing indexes in Hive, which store all combinations of multiple dimensions, DGFIndex only stores the information of cubes. This leads to a smaller index size and faster query processing. Furthermore, by pre-computing user-defined aggregations for each cube, DGFIndex only needs to access the boundary region for an aggregation query. Our comprehensive experiments show that DGFIndex can save significant disk space in comparison with the existing indexes in Hive, and that query performance with DGFIndex is 2-50 times faster than existing indexes in Hive and HadoopDB for aggregation queries, 2-5 times faster than both for non-aggregation queries, and 2-75 times faster than scanning the whole table at different query selectivities.
  • Wireless networks are vulnerable to Sybil attacks, in which a malicious node poses as many identities in order to gain disproportionate influence. Many defenses based on spatial variability of wireless channels exist, but depend either on detailed, multi-tap channel estimation - something not exposed on commodity 802.11 devices - or valid RSSI observations from multiple trusted sources, e.g., corporate access points - something not directly available in ad hoc and delay-tolerant networks with potentially malicious neighbors. We extend these techniques to be practical for wireless ad hoc networks of commodity 802.11 devices. Specifically, we propose two efficient methods for separating the valid RSSI observations of behaving nodes from those falsified by malicious participants. Further, we note that prior signalprint methods are easily defeated by mobile attackers and develop an appropriate challenge-response defense. Finally, we present the Mason test, the first implementation of these techniques for ad hoc and delay-tolerant networks of commodity 802.11 devices. We illustrate its performance in several real-world scenarios.
  • We propose using the predictability of human motion to eliminate the overhead of distributed location services in human-carried MANETs, dubbing the technique location profile routing. This method outperforms the Geographic Hashing Location Service when nodes change locations 2x more frequently than they initiate connections (e.g., start new TCP streams), as in applications like text- and instant-messaging. Prior characterizations of human mobility are used to show that location profile routing achieves a 93% delivery ratio with a 1.75x first-packet latency increase relative to an oracle location service.
  • Most previous analysis of Twitter user behavior is focused on individual information cascades and the social followers graph. We instead study aggregate user behavior and the retweet graph with a focus on quantitative descriptions. We find that the lifetime tweet distribution is a type-II discrete Weibull stemming from a power law hazard function; the tweet rate distribution, although asymptotically power law, exhibits a lognormal cutoff over finite sample intervals; and the inter-tweet interval distribution is power law with exponential cutoff. The retweet graph is small-world and scale-free, like the social graph, but is less disassortative and has much stronger clustering. These differences are consistent with it better capturing the real-world social relationships of and trust between users. Beyond just understanding and modeling human communication patterns and social networks, applications for alternative, decentralized microblogging systems (both predicting real-world performance and detecting spam) are discussed.
  • Considered here is a generalized $\mu$-type integrable equation, which can be regarded as a generalization of both the $\mu$-Camassa-Holm and modified $\mu$-Camassa-Holm equations. It is shown that the proposed equation is formally integrable with a Lax pair and a bi-Hamiltonian structure, and that its scale limit is an integrable model of hydrodynamical systems describing short capillary-gravity waves. Local well-posedness of the Cauchy problem in a suitable Sobolev space is established by the viscosity method. Existence of peaked traveling-wave solutions and formation of singularities of solutions for the equation are investigated. It is found that the equation admits a single peaked soliton and multi-peakon solutions. The effects of varying $\mu$-Camassa-Holm and modified $\mu$-Camassa-Holm nonlocal nonlinearities on blow-up criteria and wave breaking are illustrated in detail. Our analysis relies on the method of characteristics and conserved quantities and proceeds with a priori differential estimates.
  • Quaternionic polynomials are generated by quaternionic variables and the quaternionic product. This paper proposes the generating ideal of quaternionic polynomials in tensor algebra, finds the Gröbner basis of the ideal in the case of pure imaginary quaternionic variables, and describes the normal forms of such quaternionic polynomials explicitly.
  • Considered in this paper is the modified Camassa-Holm equation with cubic nonlinearity, which is integrable and admits the single peaked solitons and multi-peakon solutions. The short-wave limit of this equation is known as the short-pulse equation. The main investigation is the Cauchy problem of the modified Camassa-Holm equation with qualitative properties of its solutions. It is firstly shown that the equation is locally well-posed in a range of the Besov spaces. The blow-up scenario and the lower bound of the maximal time of existence are then determined. A blow-up mechanism for solutions with certain initial profiles is described in detail and nonexistence of the smooth traveling wave solutions is also demonstrated. In addition, the persistence properties of the strong solutions for the equation are obtained.
  • We study the Cauchy problem for a one-dimensional dispersive system of Boussinesq type which models weakly nonlinear long surface waves. We establish the local well-posedness and ill-posedness of solutions to the system. We also provide criteria for the formation of singularities.
  • Considered herein is the initial-value problem for the generalized periodic Camassa-Holm equation which is related to the Camassa-Holm equation and the Hunter-Saxton equation. Sufficient conditions guaranteeing the development of breaking waves in finite time are demonstrated. On the other hand, the existence of strong permanent waves is established with certain initial profiles depending on the linear dispersive parameter in a range of the Sobolev spaces. Moreover, the admissible global weak solution in the energy space is obtained.
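
The Kuramoto abstract above reduces the dynamic-coupling problem to the fixed points and stability of the classical model with constant, equal edge weights. As a rough illustration of that classical stability question only, here is a minimal sketch. The function names, the all-to-all form with normalization K/N, and the choice of identical frequencies are assumptions made for this example; it does not implement the dynamic-coupling model of the paper.

```python
import numpy as np

def kuramoto_rhs(theta, omega, K):
    """Right-hand side of the classical all-to-all Kuramoto model (assumed form):
    d(theta_i)/dt = omega_i + (K/N) * sum_j sin(theta_j - theta_i)."""
    n = theta.size
    diff = theta[None, :] - theta[:, None]  # diff[i, j] = theta_j - theta_i
    return omega + (K / n) * np.sin(diff).sum(axis=1)

def kuramoto_jacobian(theta, K):
    """Jacobian of kuramoto_rhs with respect to theta.
    Off-diagonal: (K/N) cos(theta_j - theta_i); each row sums to zero."""
    n = theta.size
    diff = theta[None, :] - theta[:, None]
    J = (K / n) * np.cos(diff)
    np.fill_diagonal(J, 0.0)              # drop the j == i terms
    np.fill_diagonal(J, -J.sum(axis=1))   # diagonal = minus the off-diagonal row sum
    return J

def sync_state_eigenvalues(n, K):
    """Eigenvalues of the Jacobian at the fully synchronized state theta = 0
    (a fixed point when all natural frequencies are equal, here zero)."""
    theta = np.zeros(n)
    # The Jacobian is symmetric, so eigvalsh applies; values are returned in ascending order.
    return np.linalg.eigvalsh(kuramoto_jacobian(theta, K))
```

Calling `sync_state_eigenvalues(5, 2.0)` gives one zero eigenvalue, which reflects the rotational invariance of the model, and four eigenvalues equal to -K, so the synchronized state is stable up to a uniform phase shift when K > 0.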
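
The DGFIndex abstract above describes dividing the data space into grid cells ("cubes"), pre-computing an aggregate per cell, and reading raw records only in the boundary region of a range query. The idea can be sketched in memory as follows. This is a toy two-dimensional illustration under stated assumptions, not the DGFIndex implementation: the class `GridSumIndex`, its methods, the fixed cell size, and the choice of SUM as the aggregate are all hypothetical, and nothing here touches Hive or Hadoop.

```python
from collections import defaultdict
from math import floor

class GridSumIndex:
    """Toy grid index over records (x, y, value) with a pre-computed SUM per cell.
    Cell (i, j) covers [i*c, (i+1)*c) x [j*c, (j+1)*c) for cell size c."""

    def __init__(self, cell_size):
        self.cell = cell_size
        self.records = defaultdict(list)   # cell key -> raw records
        self.sums = defaultdict(float)     # cell key -> pre-computed sum of values

    def _key(self, x, y):
        return (floor(x / self.cell), floor(y / self.cell))

    def insert(self, x, y, value):
        k = self._key(x, y)
        self.records[k].append((x, y, value))
        self.sums[k] += value

    def range_sum(self, x_lo, x_hi, y_lo, y_hi):
        """Sum of values with x_lo <= x < x_hi and y_lo <= y < y_hi.
        Returns (total, scanned), where scanned counts raw records examined."""
        total, scanned = 0.0, 0
        i_lo, j_lo = self._key(x_lo, y_lo)
        i_hi, j_hi = self._key(x_hi, y_hi)
        c = self.cell
        for i in range(i_lo, i_hi + 1):
            for j in range(j_lo, j_hi + 1):
                if (i, j) not in self.sums:
                    continue  # empty cell
                inside = (x_lo <= i * c and (i + 1) * c <= x_hi and
                          y_lo <= j * c and (j + 1) * c <= y_hi)
                if inside:
                    # Inner cell: use the pre-computed aggregate, no record access.
                    total += self.sums[(i, j)]
                else:
                    # Boundary cell: fall back to filtering its raw records.
                    for x, y, v in self.records[(i, j)]:
                        scanned += 1
                        if x_lo <= x < x_hi and y_lo <= y < y_hi:
                            total += v
        return total, scanned
```

With a 40 x 40 lattice of unit-valued integer points and a cell size of 10, the query `range_sum(5, 35, 5, 35)` answers the four inner cells from their stored sums and scans only the twelve boundary cells, which is the saving the abstract attributes to pre-computed per-cube aggregations.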