
A large number of recent genome-wide association studies (GWASs) for complex
phenotypes confirm the early conjecture of polygenicity, suggesting the
presence of a large number of variants with only tiny or moderate effects.
However, due to the limited sample size of a single GWAS, many associated
genetic variants are too weak to achieve genome-wide significance. These
undiscovered variants further limit the prediction capability of GWAS.
Restricted access to individual-level data and the increasing availability
of published GWAS results motivate the development of methods integrating
both individual-level and summary-level data. How the connection between
individual-level and summary-level data is built determines how efficiently
the abundant existing summary-level resources can be used with limited
individual-level data, and this issue calls for more effort in this
area.
In this study, we propose a novel statistical approach, LEP, which provides a
new way of modeling the connection between individual-level data and
summary-level data. LEP integrates both types of data by \underline{LE}veraging
\underline{P}leiotropy to increase the statistical power of risk variant
identification and the accuracy of risk prediction. The algorithm for parameter
estimation is developed to handle genome-wide-scale data. Through comprehensive
simulation studies, we demonstrated the advantages of LEP over the existing
methods. We further applied LEP to perform integrative analysis of Crohn's
disease from WTCCC and summary statistics from GWAS of some other diseases,
such as Type 1 diabetes, Ulcerative colitis and Primary biliary cirrhosis. LEP
was able to significantly increase the statistical power of identifying risk
variants and improve the risk prediction accuracy from 63.39\% ($\pm$ 0.58\%)
to 68.33\% ($\pm$ 0.32\%) using about 195,000 variants.

In the medical domain, identifying and expanding abbreviations in clinical
texts is a vital task for both better human and machine understanding. It is a
challenging task because many abbreviations are ambiguous, especially in
intensive care medicine texts, in which phrase abbreviations are frequently
used. Besides the fact that there is no universal dictionary of clinical
abbreviations and no universal rules for abbreviation writing, such texts are
difficult to acquire, expensive to annotate and sometimes even confusing to
domain experts. This paper proposes a novel and effective approach that
exploits task-oriented resources to learn word embeddings for expanding
abbreviations in clinical notes. We achieved 82.27\% accuracy, close to expert
human performance.

In the present study a mathematical model of long-crested water waves
propagating mainly in one direction with the effect of Earth's rotation is
derived by following the formal asymptotic procedures. Such a model equation is
analogous to the Camassa-Holm approximation of the two-dimensional
incompressible and irrotational Euler equations and has a formal bi-Hamiltonian
structure. Its solution corresponding to physically relevant initial
perturbations is more accurate on a much longer time scale. It is shown that
the deviation of the free surface can be determined by the horizontal velocity
at a certain depth in the second-order approximation. The effects of the
Coriolis force caused by the Earth's rotation and nonlocal higher nonlinearities
on blow-up criteria and wave-breaking phenomena are also investigated. Our
refined analysis applies the method of characteristics and conserved
quantities to the Riccati-type differential inequality.

We consider an asymptotic 1D (in space) rotation-Camassa-Holm (RCH) model,
which could be used to describe the propagation of long-crested shallow-water
waves in the equatorial ocean regions with allowance for the weak Coriolis
effect due to the Earth's rotation. This model equation exhibits
wave-breaking phenomena similar to those of the Camassa-Holm equation. It is
analogous to the rotation-Green-Naghdi (RGN) equations with the weak Earth's
rotation effect, which model the propagation of large-amplitude waves in
shallow water. We provide here a rigorous justification showing that solutions
of the RGN equations tend to the associated solution of the RCH model equation
in the Camassa-Holm regime of small amplitude and larger wavelength.
Furthermore, we demonstrate that the RGN model equations are locally
well-posed in a Sobolev space by refined energy estimates.

A matched filter technique is applied to the Planck all-sky Compton
y-parameter map to measure the thermal Sunyaev-Zel'dovich (tSZ) effect produced
by galaxy groups of different halo masses selected from large redshift surveys
in the low-z Universe. Reliable halo mass estimates are available for all the
groups, which allows us to bin groups of similar halo masses to investigate how
the tSZ effect depends on halo mass over a large mass range. Filters are
simultaneously matched for all groups to minimize projection effects. We find
that the integrated y-parameter and the hot gas content it implies are
consistent with the predictions of the universal pressure profile model only
for massive groups above $10^{14}\,{\rm M}_\odot$, but much lower than the
model prediction for low-mass groups. The halo mass dependence found is in good
agreement with the predictions of a set of simulations that include strong AGN
feedback, but simulations including only supernova feedback significantly
over-predict the hot gas contents in galaxy groups. Our results suggest that hot gas
in galaxy groups is either effectively ejected or in phases much below the
virial temperatures of the host halos.

Through the integration of the power spectral density, we obtain temperature
profiles of both multi-segment harmonic and anharmonic systems, showing the
presence of an anomalous negative temperature gradient inside the interfacial
segment. By investigating patterns of the power spectral density, we found
that this counterintuitive phenomenon comes from the presence of interfacial
localized phonon modes. Two out-band localized modes of the harmonic model,
which make no contribution to the local temperature due to the absence of
phonon interactions, result in the concave temperature profile and overcooling
effect. For the anharmonic model, thanks to the phonon-phonon interactions, the
localized modes are excited and make considerable contributions to the
interfacial temperature, which is clearly shown by examining the temperature
accumulation function. When the anharmonicity is sufficiently large, the
negative temperature gradient is absent since the localized phonon modes are
fully mixed. The presence of localized modes is clearly demonstrated by the
inverse participation ratio and normal mode analysis for the isolated harmonic
model.
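
As a minimal illustration of the inverse participation ratio (IPR) mentioned
above, the following Python sketch uses the common definition
IPR = sum(a_i^4) / (sum(a_i^2))^2 for a mode with per-site amplitudes a_i. The
function name and the two toy modes are assumptions made for illustration and
are not taken from the study. Under this definition a mode spread evenly over
N sites gives 1/N, while a mode confined to a single site gives 1.

```python
def inverse_participation_ratio(mode):
    """Return sum(a^4) / (sum(a^2))^2 for a list of real mode amplitudes."""
    norm = sum(a * a for a in mode)
    if norm == 0:
        raise ValueError("mode must have at least one non-zero amplitude")
    return sum(a ** 4 for a in mode) / norm ** 2


# Toy modes on a hypothetical chain of n sites (illustrative only).
n = 100
extended = [1.0] * n                                          # spread over all sites
localized = [1.0 if i == n // 2 else 0.0 for i in range(n)]   # one site only

ipr_extended = inverse_participation_ratio(extended)    # equals 1/n
ipr_localized = inverse_participation_ratio(localized)  # equals 1
```

A large IPR relative to 1/N is the usual signature used to flag a mode as
localized.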

Anisotropic charge carrier transport in black phosphorus limited by ionized
impurity scattering at finite temperature is explored theoretically. The
anisotropic electronic structure enters the calculation for the polarizability
(screening), the momentum relaxation time, and the mobility. For finite
temperature, scattering is not limited to the Fermi surface and the
polarizability is temperature dependent. The impact of screening is
investigated in detail with its dependence on carrier density and temperature.
Competing with the thermal excitation effects, the temperature dependence of
the polarizability is found to dominate for T < 100 K. As a result, the charge
carrier mobility slowly decreases with increasing temperature. The weak
temperature dependence of the mobility and its anisotropy ratio of 1.9-3.2
agree with published experimental data.

We propose a recursive algorithm for the numerical computation of the optimal
value function $\inf_{t\le\tau\le T} E \Big[\sup_{0\le s\le T } Y_s / Y_{\tau}
\,\big|\, {\cal F}_t\Big]$ over the stopping times $\tau$ with respect to the
filtration of a geometric Brownian motion $Y_t$ with Markovian regime
switching. This method allows us to determine the boundary functions of the
optimal stopping set when no associated Volterra integral equation is
available. It applies in particular when regime-switching drifts have mixed
signs, in which case the boundary functions may not be monotone.

We consider a variation of the Kuramoto model with dynamic coupling, where
the coupling strengths are allowed to evolve in response to the phase
difference between the oscillators, a model first considered by Ha, Noh and
Park. In particular we study the stability of fixed points for this model. We
demonstrate a somewhat surprising fact: namely that the fixed points of this
model, as well as their stability, can be completely expressed in terms of the
fixed points and stability of the analogous classical Kuramoto problem where
the coupling strengths are fixed to a constant (the same for all edges). In
particular for the "all-to-all" network, where the underlying graph is the
complete graph, the problem reduces to the problem of understanding the fixed
points and stability of the all-to-all Kuramoto model with equal edge weights,
a problem that has been completely solved.

An algebraic representation of the Turing machines is given, where the
configurations of Turing machines are represented by 4th-order tensors, and the
transition functions by 8th-order tensors. Two types of tensor product are
defined: one models the evolution of the Turing machines, and the other
models the compositions of transition functions. It is shown that the two
types of tensor product are harmonic in the sense that the associative law is
obeyed.

This paper deals with optimal prediction in a regime-switching model driven
by a continuous-time Markov chain. We extend existing results for geometric
Brownian motion by deriving optimal stopping strategies that depend on the
current regime state, and prove a number of continuity properties relating to
optimal value and boundary functions. Our approach replaces the use of
closed-form expressions, which are not available in our setting, with PDE
arguments that also simplify the approach of [2] in the classical Brownian case.

We study the charged-impurity-limited mobility in black phosphorus, a highly
anisotropic layered material. We compute the mobility within the Boltzmann
transport equation under the detailed balance condition, taking into account
the anisotropy in transport and electronic structure. For carrier densities
accessible in experiments, we obtain an anisotropy ratio of 3 ~ 4 at zero
temperature, twofold larger than that observed in experiments on multilayer
samples. We also discuss how the anisotropy depends on carrier density and
impurity distribution.

The integrable Novikov equation can be regarded as one of the
Camassa-Holm-type equations with cubic nonlinearity. In this paper, we prove
the global existence and uniqueness of the H\"older continuous energy
conservative solutions for the Cauchy problem of the Novikov equation.

Lars Onsager and Richard Feynman envisioned that the three-dimensional (3D)
superfluid-to-normal $\lambda$ transition in $^{4}$He occurs through the
proliferation of vortices. This process should hold for every phase transition
in the same universality class. The role of topological defects in
symmetry-breaking phase transitions has become a prime topic in cosmology and
high-temperature superconductivity, even though direct imaging of these defects
is challenging. Here we show that the U(1) continuous symmetry that emerges at
the ferroelectric critical point of multiferroic hexagonal manganites leads to
a similar proliferation of vortices. Moreover, the disorder field (vortices) is
coupled to an emergent U(1) gauge field, which becomes massive by means of the
Higgs mechanism when vortices condense (span the whole system) upon heating
above the ferroelectric transition temperature. Direct imaging of the vortex
network in hexagonal manganites offers unique experimental access to this dual
description of the ferroelectric transition, while enabling tests of the
Kibble-Zurek mechanism.

Recently, it has been shown that under pressure, unexpected and
counterintuitive chemical compounds become stable. Laser shock experiments (A.
Rode, unpublished) on alumina (Al2O3) have shown nonequilibrium decomposition
of alumina with the formation of free Al and a mysterious transparent phase.
Inspired by these observations, we have explored the possibility of the
formation of new chemical compounds in the system Al-O. Using the
variable-composition structure prediction algorithm USPEX, in addition to the
well-known Al2O3, we have found two extraordinary compounds, Al4O7 and AlO2, to
be thermodynamically stable in the pressure range 330-443 GPa and above 332
GPa, respectively. Both of these compounds simultaneously contain oxide
O$^{2-}$ and peroxide O$_2^{2-}$ ions, and both are insulating. Peroxo-groups
are responsible for gap states, which significantly reduce the electronic band
gap of both Al4O7 and AlO2.

In a previous paper, some of us studied general relativistic homogeneous
gravitational collapses for dust and radiation, in which the density profile
was replaced by an effective density justified by some quantum gravity models.
It was found that the effective density introduces an effective pressure that
becomes negative and dominant in the strong-field regime. With this setup, the
central singularity is replaced by a bounce, after which the cloud starts
expanding. Motivated by the fact that in the classical case homogeneous and
inhomogeneous collapse models have different properties, here we extend our
previous work to the inhomogeneous case. As in the quantum-inspired homogeneous
collapse model, the classical central singularity is replaced by a bounce, but
the inhomogeneities strongly affect the structure of the bounce curve and of
the trapped region.

In Smart Grid applications, as the number of deployed electric smart meters
increases, massive amounts of valuable meter data are generated and collected
every day. To enable reliable data collection and make business decisions fast,
high-throughput storage and high-performance analysis of massive meter data
become crucial for grid companies. Given the high efficiency, fault
tolerance, and price-performance of the Hadoop and Hive systems,
they are frequently deployed as the underlying platform for big data processing.
However, in real business use cases, these data analysis applications typically
involve multidimensional range queries (MDRQ) as well as batch reading and
statistics on the meter data. While Hive performs well at complex data
batch reading and analysis, it lacks efficient indexing techniques for MDRQ.
In this paper, we propose DGFIndex, an index structure for Hive that
efficiently supports MDRQ for massive meter data. DGFIndex divides the data
space into cubes using the grid file technique. Unlike the existing indexes in
Hive, which store all combinations of multiple dimensions, DGFIndex only
stores the information of cubes. This leads to smaller index size and faster
query processing. Furthermore, by pre-computing user-defined aggregations of
each cube, DGFIndex only needs to access the boundary region for an aggregation
query. Our comprehensive experiments show that DGFIndex can save significant
disk space in comparison with the existing indexes in Hive, and that query
performance with DGFIndex is 2-50 times faster than existing indexes in Hive
and HadoopDB for aggregation queries, 2-5 times faster than both for
non-aggregation queries, and 2-75 times faster than scanning the whole table
at different query selectivities.

Wireless networks are vulnerable to Sybil attacks, in which a malicious node
poses as many identities in order to gain disproportionate influence. Many
defenses based on spatial variability of wireless channels exist, but depend
either on detailed, multi-tap channel estimation (something not exposed on
commodity 802.11 devices) or valid RSSI observations from multiple trusted
sources, e.g., corporate access points (something not directly available in ad
hoc and delay-tolerant networks with potentially malicious neighbors). We extend
these techniques to be practical for wireless ad hoc networks of commodity
802.11 devices. Specifically, we propose two efficient methods for separating
the valid RSSI observations of behaving nodes from those falsified by malicious
participants. Further, we note that prior signalprint methods are easily
defeated by mobile attackers and develop an appropriate challenge-response
defense. Finally, we present the Mason test, the first implementation of these
techniques for ad hoc and delay-tolerant networks of commodity 802.11 devices.
We illustrate its performance in several real-world scenarios.

We propose using the predictability of human motion to eliminate the overhead
of distributed location services in human-carried MANETs, dubbing the technique
location profile routing. This method outperforms the Geographic Hashing
Location Service when nodes change locations 2x more frequently than they
initiate connections (e.g., start new TCP streams), as in applications like
text and instant-messaging. Prior characterizations of human mobility are used
to show that location profile routing achieves a 93% delivery ratio with a
1.75x first-packet latency increase relative to an oracle location service.

Most previous analysis of Twitter user behavior is focused on individual
information cascades and the social followers graph. We instead study aggregate
user behavior and the retweet graph with a focus on quantitative descriptions.
We find that the lifetime tweet distribution is a type-II discrete Weibull
stemming from a power law hazard function; the tweet rate distribution,
although asymptotically power law, exhibits a lognormal cutoff over finite
sample intervals; and the inter-tweet interval distribution is power law with
exponential cutoff. The retweet graph is small-world and scale-free, like the
social graph, but is less disassortative and has much stronger clustering.
These differences are consistent with it better capturing the real-world social
relationships of and trust between users. Beyond just understanding and
modeling human communication patterns and social networks, applications for
alternative, decentralized microblogging systems (both predicting real-world
performance and detecting spam) are discussed.
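
To make the link between a power law hazard function and the resulting
discrete distribution concrete, the following Python sketch builds a
probability mass function from a discrete hazard h(k) = P(X = k | X >= k)
through the standard identity p(k) = h(k) * prod_{j<k} (1 - h(j)). The
parametrisation h(k) = c * k^(-a), the parameter values and the function name
are assumptions chosen for illustration; the abstract does not give the fitted
form or values.

```python
def pmf_from_power_law_hazard(c, a, k_max):
    """Build P(X = k), k = 1..k_max, from the hazard h(k) = c * k**(-a).

    Assumes 0 < c <= 1 and a > 0 so that every h(k) is a valid probability.
    Returns (pmf, tail), where tail = P(X > k_max).
    """
    if not (0 < c <= 1) or a <= 0:
        raise ValueError("need 0 < c <= 1 and a > 0")
    survival = 1.0  # P(X >= k) before step k is processed
    pmf = []
    for k in range(1, k_max + 1):
        hazard = c * k ** (-a)
        pmf.append(survival * hazard)   # stop exactly at k
        survival *= 1.0 - hazard        # continue beyond k
    return pmf, survival


# Illustrative parameters only (not fitted to any data).
pmf, tail = pmf_from_power_law_hazard(c=0.5, a=1.0, k_max=1000)
```

Because the hazard decays with k, the probability of stopping shrinks the
longer the process has already run, which is what produces the heavy tail.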

Considered here is a generalized $\mu$-type integrable equation, which can
be regarded as a generalization of both the $\mu$-Camassa-Holm and modified
$\mu$-Camassa-Holm equations. It is shown that the proposed equation is
formally integrable with a Lax pair and a bi-Hamiltonian structure, and that its
scale limit is an integrable model of hydrodynamical systems describing short
capillary-gravity waves. Local well-posedness of the Cauchy problem in a
suitable Sobolev space is established by the viscosity method. Existence of
peaked traveling-wave solutions and formation of singularities of solutions for
the equation are investigated. It is found that the equation admits a single
peaked soliton and multi-peakon solutions. The effects of varying
$\mu$-Camassa-Holm and modified $\mu$-Camassa-Holm nonlocal nonlinearities on
blow-up criteria and wave breaking are illustrated in detail. Our analysis
relies on the method of characteristics and conserved quantities and
proceeds with a priori differential estimates.

Quaternionic polynomials are generated by quaternionic variables and the
quaternionic product. This paper proposes the generating ideal of quaternionic
polynomials in tensor algebra, finds the Groebner basis of the ideal in the case
of pure imaginary quaternionic variables, and describes the normal forms of
such quaternionic polynomials explicitly.

Considered in this paper is the modified Camassa-Holm equation with cubic
nonlinearity, which is integrable and admits the single peaked solitons and
multi-peakon solutions. The short-wave limit of this equation is known as the
short-pulse equation. The main investigation is the Cauchy problem of the
modified Camassa-Holm equation with qualitative properties of its solutions. It
is firstly shown that the equation is locally well-posed in a range of the
Besov spaces. The blow-up scenario and the lower bound of the maximal time of
existence are then determined. A blow-up mechanism for solutions with certain
initial profiles is described in detail and nonexistence of the smooth
traveling wave solutions is also demonstrated. In addition, the persistence
properties of the strong solutions for the equation are obtained.

We study the Cauchy problem for a one-dimensional dispersive system of
Boussinesq type which models weakly nonlinear long-wave surface waves. We
establish the local well-posedness and ill-posedness of solutions to the
system. We also provide criteria for the formation of singularities.

Considered herein is the initial-value problem for the generalized periodic
Camassa-Holm equation which is related to the Camassa-Holm equation and the
Hunter-Saxton equation. Sufficient conditions guaranteeing the development of
breaking waves in finite time are demonstrated. On the other hand, the
existence of strong permanent waves is established with certain initial
profiles depending on the linear dispersive parameter in a range of the Sobolev
spaces. Moreover, the admissible global weak solution in the energy space is
obtained.