
Small-cell architecture is widely adopted by cellular network operators to
increase network capacity. By reducing the size of cells, operators can pack
more (low-power) base stations in an area to better serve the growing demands,
without causing extra interference. However, this approach suffers from low
spectrum temporal efficiency. When a cell becomes smaller and covers fewer
users, its total traffic fluctuates significantly due to insufficient traffic
aggregation and exhibits a large "peak-to-mean" ratio. As operators
customarily provision spectrum for peak traffic, large traffic temporal
fluctuation inevitably leads to low spectrum temporal efficiency. In this
paper, we advocate device-to-device (D2D) load balancing as a useful mechanism
to address the fundamental drawback of small-cell architecture. The idea is to
shift traffic from a congested cell to its adjacent underutilized cells by
leveraging inter-cell D2D communication, so that the traffic can be served
without using extra spectrum, effectively improving the spectrum temporal
efficiency. We provide theoretical modeling and analysis to characterize the
benefit of D2D load balancing, in terms of total spectrum requirements of all
individual cells. We also derive the corresponding cost, in terms of incurred
D2D traffic overhead. We carry out empirical evaluations based on real-world 4G
data traces to gauge the benefit and cost of D2D load balancing under practical
settings. The results show that D2D load balancing can reduce the spectrum
requirement by 25% as compared to the standard scenario without D2D load
balancing, at the expense of a negligible 0.7% D2D traffic overhead.
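
As a toy illustration of why traffic aggregation matters under peak
provisioning, the sketch below compares the spectrum needed when every cell is
provisioned for its own peak with an idealized pooled bound in which load could
be shifted freely between cells. The traffic numbers and function names are
invented for illustration; they are not the paper's model or traces, and the
pooled bound ignores D2D overhead and any topology constraints.

```python
def per_cell_peak_total(traffic):
    """Sum of each cell's own peak: provisioning without load balancing."""
    return sum(max(cell) for cell in traffic)


def pooled_peak(traffic):
    """Peak of the aggregate traffic: an idealized bound, assuming load
    could be shifted freely between cells at no cost (an assumption)."""
    return max(sum(slot) for slot in zip(*traffic))


def peak_to_mean(series):
    """Peak-to-mean ratio of one traffic time series."""
    return max(series) / (sum(series) / len(series))


# Hypothetical traffic of three cells over four time slots (arbitrary units).
traffic = [
    [8, 2, 1, 3],
    [1, 7, 2, 2],
    [2, 1, 9, 3],
]
aggregate = [sum(slot) for slot in zip(*traffic)]

print("sum of per-cell peaks:", per_cell_peak_total(traffic))
print("peak of pooled traffic:", pooled_peak(traffic))
print("per-cell peak-to-mean:", [round(peak_to_mean(c), 2) for c in traffic])
print("aggregate peak-to-mean:", round(peak_to_mean(aggregate), 2))
```

In this toy case the cells peak at different times, so the aggregate has a much
smaller peak-to-mean ratio than any single cell.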

A conjecture appears in \cite{milsteinscheme}, in the form of a remark, where
it is stated that it is possible to construct, in a specified way, explicit
numerical schemes of any high order to approximate the solutions of SDEs with
superlinear coefficients. We answer this conjecture affirmatively for the case
of order 1.5 approximations and show that the suggested methodology works.
Moreover, we explore the case of having H\"{o}lder continuous derivatives for
the diffusion coefficients.

Approximate nearest neighbor (ANN) search has achieved great success in many
tasks. However, existing popular methods for ANN search, such as hashing and
quantization methods, are designed for static databases only. They cannot
cope well with databases whose data distribution evolves dynamically, due to
the high computational effort of retraining the model on the new
database. In this paper, we address the problem by developing an online product
quantization (online PQ) model and incrementally updating the quantization
codebook to accommodate the incoming streaming data. Moreover, to further
alleviate the issue of large-scale computation for the online PQ update, we
design two budget constraints so that the model updates part of the PQ codebook
instead of all. We derive a loss bound which guarantees the performance of our
online PQ model. Furthermore, we develop an online PQ model over a sliding
window with both data insertion and deletion supported, to reflect the
real-time behaviour of the data. The experiments demonstrate that our online PQ
model is both time-efficient and effective for ANN search in dynamic large-scale
databases compared with baseline methods, and that the idea of partial PQ
codebook update further reduces the update cost.
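
The incremental codebook update can be pictured with a minimal sketch. It
assumes a plain product quantizer (the vector is split into equal sub-vectors,
each encoded by its nearest codeword) and a running-mean update of only the
codewords that the new point is assigned to. The class name `OnlinePQ` and the
running-mean rule are illustrative assumptions, not necessarily the update rule
or budget constraints derived in the paper.

```python
class OnlinePQ:
    """Toy product quantizer with a streaming codeword update (a sketch)."""

    def __init__(self, codebooks):
        # codebooks[m][k] is the k-th codeword (list of floats) of subspace m.
        self.codebooks = [[list(map(float, c)) for c in book] for book in codebooks]
        # Assumption: every initial codeword counts as one observed point.
        self.counts = [[1] * len(book) for book in self.codebooks]
        self.d_sub = len(self.codebooks[0][0])

    def _split(self, x):
        """Split x into one sub-vector per subspace."""
        d = self.d_sub
        return [list(x[m * d:(m + 1) * d]) for m in range(len(self.codebooks))]

    def encode(self, x):
        """Return, per subspace, the index of the nearest codeword."""
        codes = []
        for sub, book in zip(self._split(x), self.codebooks):
            dists = [sum((a - b) ** 2 for a, b in zip(sub, c)) for c in book]
            codes.append(dists.index(min(dists)))
        return codes

    def update(self, x):
        """Encode x, then move only the selected codewords toward x."""
        codes = self.encode(x)
        for m, (sub, k) in enumerate(zip(self._split(x), codes)):
            self.counts[m][k] += 1
            step = 1.0 / self.counts[m][k]  # running-mean step size
            word = self.codebooks[m][k]
            for i, value in enumerate(sub):
                word[i] += step * (value - word[i])
        return codes


# Two subspaces of dimension 1, two codewords each (hypothetical values).
pq = OnlinePQ([[[0.0], [10.0]], [[0.0], [10.0]]])
print(pq.update([2.0, 9.0]))  # codes chosen for the new point
print(pq.codebooks)           # only the two selected codewords have moved
```

Each update touches one codeword per subspace, which is the sense in which a
streaming update can be cheaper than retraining the whole codebook.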

We present a novel architecture, In-Database Entity Linking (IDEL), in which
we integrate the analytics-optimized RDBMS MonetDB with neural text mining
abilities. Our system design abstracts core tasks of most neural entity linking
systems for MonetDB. To the best of our knowledge, this is the first de facto
implemented system integrating entity linking in a database. We leverage the
ability of MonetDB to support in-database analytics with user-defined functions
(UDFs) implemented in Python. These functions call machine learning libraries
for neural text mining, such as TensorFlow. The system achieves zero cost for
data shipping and transformation by utilizing MonetDB's ability to embed Python
processes in the database kernel and exchange data in NumPy arrays. IDEL
represents text and relational data in a joint vector space with neural
embeddings and can compensate for errors with ambiguous entity representations. For
detecting matching entities, we propose a novel similarity function based on
joint neural embeddings which are learned via minimizing pairwise contrastive
ranking loss. This function utilizes high-dimensional index structures for
fast retrieval of matching entities. Our first implementation and experiments
using the WebNLG corpus show the effectiveness and the potential of IDEL.
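
A pairwise contrastive ranking loss over a joint embedding space can be
sketched as follows. The sketch assumes cosine similarity and a hinge with a
fixed margin; the function names, the margin value, and the choice of cosine
similarity are illustrative assumptions rather than the exact loss used in
IDEL.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def pairwise_ranking_loss(text_vec, pos_entity, neg_entity, margin=0.5):
    """Hinge loss that is zero once the matching entity scores at least
    `margin` higher than the non-matching one (margin is an assumption)."""
    gap = cosine(text_vec, pos_entity) - cosine(text_vec, neg_entity)
    return max(0.0, margin - gap)


# Hypothetical 2-d embeddings of a text mention and two candidate entities.
mention = [1.0, 0.0]
print(pairwise_ranking_loss(mention, [1.0, 0.0], [0.0, 1.0]))  # well separated
print(pairwise_ranking_loss(mention, [0.0, 1.0], [1.0, 0.0]))  # wrong order
```

Minimizing such a loss pulls a text mention toward its matching entity and
pushes it away from non-matching ones in the shared vector space.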

Multiphase flows frequently occur in many important engineering and
scientific applications, but modeling such flows is a rather challenging
task due to complex interfacial dynamics between different phases, especially when
the flow is oscillating in porous media. Using humid air as the working
fluid in the thermoacoustic refrigerator is one research focus for
improving thermoacoustic performance, but a corresponding effect is the
condensation of humid air in the thermal stack. Due to the small spacing
of the thermal stack and the need to explore the detailed condensation process in
oscillating flow, a mesoscale numerical approach needs to be developed. Over the
decades, several types of Lattice Boltzmann (LB) models for multiphase flows
have been developed under different physical pictures, for example the
color-gradient model, the Shan-Chen model, the non-ideal pressure tensor model
and the HSD model. In the current study, a pseudopotential
Multiple-Relaxation-Time (MRT) LBM simulation was utilized to simulate the
incompressible oscillating flow and condensation in parallel plates. In the
initial stage of condensation, the oscillating flow helps to accumulate the
saturated vapor at the exit regions, and the velocity vector of saturated vapor
clearly showed the flow over the droplets. It was also concluded that if the
condensate can be removed from the parallel plates, the oscillating flow
and condensation will continuously feed the cold surface to form more water
droplets. The effect of wettability on the condensation was discussed, and it
turned out that with increasing wettability, the saturated water vapor
condensed more easily on the cold walls, and the distance between each pair of
droplets was also strongly affected by the wettability.

At present, the vast majority of building blocks, techniques, and
architectures for deep learning are based on real-valued operations and
representations. However, recent work on recurrent neural networks and older
fundamental theoretical analysis suggests that complex numbers could have a
richer representational capacity and could also facilitate noise-robust memory
retrieval mechanisms. Despite their attractive properties and potential for
opening up entirely new neural architectures, complex-valued deep neural
networks have been marginalized due to the absence of the building blocks
required to design such models. In this work, we provide the key atomic
components for complex-valued deep neural networks and apply them to
convolutional feed-forward networks and convolutional LSTMs. More precisely, we
rely on complex convolutions and present algorithms for complex
batch normalization and complex weight initialization strategies for
complex-valued neural nets, and we use them in experiments with end-to-end
training schemes. We demonstrate that such complex-valued models are
competitive with their real-valued counterparts. We test deep complex models on
several computer vision tasks, on music transcription using the MusicNet
dataset and on Speech Spectrum Prediction using the TIMIT dataset. We achieve
state-of-the-art performance on these audio-related tasks.
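
A complex convolution can be built from real-valued convolutions by expanding
the product (A + iB)(x + iy) = (Ax - By) + i(Bx + Ay). The sketch below shows
this for a 1-D "valid" cross-correlation in pure Python; the helper names are
illustrative, and the 1-D setting is a simplification of the 2-D convolutions
used in vision models.

```python
def conv1d(x, w):
    """Valid-mode 1-D cross-correlation of two real sequences."""
    n = len(x) - len(w) + 1
    return [sum(x[i + j] * w[j] for j in range(len(w))) for i in range(n)]


def complex_conv1d(x_re, x_im, w_re, w_im):
    """Complex cross-correlation expressed with four real ones.

    real part: w_re * x_re - w_im * x_im
    imag part: w_re * x_im + w_im * x_re
    """
    re = [a - b for a, b in zip(conv1d(x_re, w_re), conv1d(x_im, w_im))]
    im = [a + b for a, b in zip(conv1d(x_im, w_re), conv1d(x_re, w_im))]
    return re, im


# Hypothetical input signal and filter, checked against Python's complex type.
x = [1 + 2j, 3 - 1j, 0 + 1j]
w = [1 - 1j, 2 + 0.5j]
re, im = complex_conv1d(
    [z.real for z in x], [z.imag for z in x],
    [z.real for z in w], [z.imag for z in w],
)
reference = [sum(x[i + j] * w[j] for j in range(len(w))) for i in range(2)]
print(list(zip(re, im)))
print(reference)
```

Because only real convolutions are used, the same construction can be mapped
onto standard real-valued convolution routines.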

Photovoltaic conversion efficiency (PCE) of halide perovskite solar cells has
risen spectacularly, yet the very crystalline structure of CH3NH3PbI3 remains
ambiguous after extensive research, and its polar nature remains hotly
debated. Here we present compelling evidence that CH3NH3PbI3 crystals
self-grown on FTO/TiO2 substrate consist of ferroic domains with alternating
polar and nonpolar orders, in contrast to previous experimental and theoretical
expectations, and polar domains possess reduced photocurrent. It is found that
polar and nonpolar orders of CH3NH3PbI3 can be distinguished from their
distinct lateral piezoresponse, energy dissipation, first and second harmonic
electromechanical couplings, and temperature variation, even though their
difference in crystalline lattice is very subtle, and they possess a two-way
memory effect through the cubic-tetragonal phase transition. These findings resolve
key questions regarding the polar nature of CH3NH3PbI3 and its implication for
photovoltaics, reconcile contradictory data widely reported, and point
toward engineering ferroic domains for enhanced PCE.

We analyze the Sun's shadow observed with the Tibet-III air shower array and
find that the shadow's center deviates northward (southward) from the optical
solar disc center in the "Away" ("Toward") IMF sector. By comparing with
numerical simulations based on the solar magnetic field model, we find that the
average IMF strength in the "Away" ("Toward") sector is $1.54 \pm 0.21_{\rm
stat} \pm 0.20_{\rm syst}$ ($1.62 \pm 0.15_{\rm stat} \pm 0.22_{\rm syst}$)
times larger than the model prediction. These results demonstrate that the observed
Sun's shadow is a useful tool for the quantitative evaluation of the average
solar magnetic field.

The properties of neutron stars with strangeness are studied within the relativistic
mean-field (RMF) model by including a logarithmic interaction as a function of the
scalar meson field. This logarithmic interaction, named the $\sigma$-cut
potential, can largely reduce the attractive contributions of the scalar meson
field at high density without any influence on nuclear structure around normal
saturation density. In this work, the TM1 parameter set is chosen as the RMF
interaction, while the strengths of the logarithmic interaction are constrained by
the properties of finite nuclei so that we can obtain a reasonable effective
nucleon-nucleon interaction. The hyperons $\Lambda,~\Sigma$, and $\Xi$ are
also considered in neutron stars within this framework, and their coupling
constants with mesons are determined by the latest hyperon-nucleon and
$\Lambda$-$\Lambda$ potentials extracted from the experimental data of
hypernuclei. The maximum mass of a neutron star can be larger than two solar masses
with these hyperons. Furthermore, the nucleon mass at high density will be
saturated due to this additional $\sigma$-cut potential, which is consistent
with the conclusions from microscopic calculations such as
Brueckner-Hartree-Fock theory and the quark mean-field model.

Nonzero leptonic CP phases in the neutrino sector are clear evidence for
physics beyond the Standard Model and have many implications in particle
physics and cosmology. Some clues to maximal Dirac CP violation
$\delta_{CP}=3\pi/2$ are reviewed. An approach to connect the Dirac CP
violation phase and the neutrino mass hierarchy in terms of the measurement of
CP asymmetry is proposed. Under appropriate requirements for baseline distance
and beam energy, the problem of neutrino mass hierarchy can be revealed at
maximal Dirac CP violation. General allowed parameter spaces are investigated,
within which the two unknown neutrino issues, mass hierarchy and Dirac CP
violation, can be solved simultaneously. The ability of current long-baseline
neutrino experiments to address the mass hierarchy is also discussed.

Various and spontaneous magnetic skyrmionic bubbles are experimentally
observed for the first time at room temperature in a frustrated kagome magnet
Fe3Sn2 with uniaxial magnetic anisotropy. The magnetization dynamics were
investigated using in-situ Lorentz transmission electron microscopy, revealing
that the transformation between different magnetic bubbles and domains occurs via
the motion of Bloch lines driven by an applied external magnetic field. The
results demonstrate that Fe3Sn2 facilitates a unique magnetic control of
topological spin textures at room temperature, making it a promising candidate
for further skyrmion-based spintronic devices.

Model distillation is an effective and widely used technique to transfer
knowledge from a teacher to a student network. The typical application is to
transfer from a powerful large network or ensemble to a small network that is
better suited to low-memory or fast execution requirements. In this paper, we
present a deep mutual learning (DML) strategy where, rather than one-way
transfer between a static predefined teacher and a student, an ensemble of
students learn collaboratively and teach each other throughout the training
process. Our experiments show that a variety of network architectures benefit
from mutual learning and achieve compelling results on CIFAR-100 recognition
and Market-1501 person re-identification benchmarks. Surprisingly, it is
revealed that no prior powerful teacher network is necessary: mutual learning
of a collection of simple student networks works, and moreover outperforms
distillation from a more powerful yet static teacher.
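
One common way to let two students teach each other is to add, to each
student's usual cross-entropy loss, a KL-divergence term that pulls its
predictions toward its peer's. The sketch below assumes exactly this
two-student form with equal weighting; the function names and the example
logits are illustrative, and the abstract itself does not spell out the loss.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Supervised loss for one example with an integer class label."""
    return -math.log(probs[label])


def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions with q > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def mutual_learning_losses(logits_a, logits_b, label):
    """Loss of each student: own cross-entropy plus KL from the peer's
    prediction to its own (assumed form, equal weighting)."""
    p_a, p_b = softmax(logits_a), softmax(logits_b)
    loss_a = cross_entropy(p_a, label) + kl_divergence(p_b, p_a)
    loss_b = cross_entropy(p_b, label) + kl_divergence(p_a, p_b)
    return loss_a, loss_b


# Hypothetical logits of two students on one 3-class example with label 0.
print(mutual_learning_losses([2.0, 0.0, 0.0], [0.0, 2.0, 0.0], label=0))
```

When both students already agree, the KL terms vanish and each loss reduces to
plain cross-entropy.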

Consider a full-duplex (FD) bidirectional secure communication system, where
two communication nodes, named Alice and Bob, simultaneously transmit and
receive confidential information from each other, and an eavesdropper, named
Eve, overhears the transmissions. Our goal is to maximize the sum secrecy rate
(SSR) of the bidirectional transmissions by optimizing the transmit covariance
matrices at Alice and Bob. To tackle this SSR maximization (SSRM) problem, we
develop an alternating difference-of-concave (ADC) programming approach to
alternately optimize the transmit covariance matrices at Alice and Bob. We show
that the ADC iteration has a semi-closed-form beamforming solution, and is
guaranteed to converge to a stationary solution of the SSRM problem. Besides
the SSRM design, this paper also deals with a robust SSRM transmit design under
a moment-based random channel state information (CSI) model, where only some
roughly estimated first- and second-order statistics of Eve's CSI are available,
but the exact distribution and other high-order statistics are not known. This
moment-based error model is new and different from the widely used
bounded-sphere error model and the Gaussian random error model. Under the
considered CSI error model, the robust SSRM is formulated as an outage
probability-constrained SSRM problem. By leveraging the Lagrangian duality
theory and DC programming, a tractable safe solution to the robust SSRM problem
is derived. The effectiveness and the robustness of the proposed designs are
demonstrated through simulations.

This paper considers a general data-fitting problem over a networked system,
in which many computing nodes are connected by an undirected graph. This kind
of problem has many real-world applications and has been studied
extensively in the literature. However, existing solutions either need a
central controller for information sharing or require slot synchronization
among different nodes, which increases the difficulty of practical
implementations, especially for a very large and heterogeneous system.
In contrast, in this paper, we treat the data-fitting problem over the
network as a stochastic programming problem with many constraints. By adapting
the results in a recent paper, we design a fully distributed and asynchronous
stochastic gradient descent (SGD) algorithm. We show that our algorithm can
achieve global optimality and consensus asymptotically using only local
computations and communications. Additionally, we provide a sharp lower bound
for the convergence speed in the regular graph case. This result fits the
intuition and provides guidance to design a `good' network topology to speed up
the convergence. Also, the merit of our design is validated by experiments on
both synthetic and real-world datasets.

We investigated the effects of pairing correlations on the neutron skin thickness
and the symmetry energy of finite nuclei. In this calculation we used the
Hartree-Fock-Bogoliubov (HFB) method with Skyrme forces and effective pairing
interactions. The results have been compared with available experimental data,
Hartree-Fock (HF) results, and the predictions of the droplet model (DM).
Finally, our discussion was extended to the study of the pairing interaction in
nuclear matter. The roles of isospin T = 0 pairing in nuclear matter were also
discussed.

Cohesive subgraph detection is an important graph problem that is widely
applied in many application domains, such as social community detection,
network visualization, and network topology analysis. Most existing cohesive
subgraph metrics can guarantee good structural properties but may cause the
free-rider effect. Here, by free-rider effect, we mean that some irrelevant
subgraphs are combined as one subgraph if they only share a small number of
vertices and edges. In this paper, we study the k-vertex connected component
(k-VCC), which can effectively eliminate the free-rider effect but is less studied
in the literature. A k-VCC is a connected subgraph in which the removal of any
k-1 vertices will not disconnect the subgraph. In addition to eliminating the
free-rider effect, k-VCC also has other advantages such as bounded diameter,
high cohesiveness, bounded graph overlapping, and bounded subgraph number. We
propose a polynomial-time algorithm to enumerate all k-VCCs of a graph by
recursively partitioning the graph into overlapped subgraphs. We find that the
key to improving the algorithm is reducing the number of local connectivity
tests. Therefore, we propose two effective optimization strategies, namely
neighbor sweep and group sweep, to largely reduce the number of local
connectivity tests. We conduct extensive performance studies using seven
large real datasets to demonstrate the effectiveness of this model as well as
the efficiency of our proposed algorithms.
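
The definition of k-vertex connectivity can be checked directly on small
graphs. The sketch below is a brute-force test that removes every set of k-1
vertices and checks that the rest stays connected; it is exponential in k and
only illustrates the definition, not the paper's polynomial-time enumeration
algorithm or its sweep optimizations. The requirement of more than k vertices
follows the usual graph-theory convention and is an assumption here.

```python
from itertools import combinations


def is_connected(adj, removed=frozenset()):
    """Depth-first search connectivity check, ignoring `removed` vertices."""
    nodes = [v for v in adj if v not in removed]
    if not nodes:
        return True
    seen = {nodes[0]}
    stack = [nodes[0]]
    while stack:
        u = stack.pop()
        for v in adj[u]:
            if v not in removed and v not in seen:
                seen.add(v)
                stack.append(v)
    return len(seen) == len(nodes)


def is_k_vertex_connected(adj, k):
    """True if removing any k-1 vertices leaves the graph connected."""
    if len(adj) <= k:  # assumed convention: need more than k vertices
        return False
    return all(
        is_connected(adj, frozenset(cut)) for cut in combinations(adj, k - 1)
    )


# Hypothetical example: a 4-cycle a-b-c-d-a as an adjacency dictionary.
cycle = {"a": ["b", "d"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c", "a"]}
print(is_k_vertex_connected(cycle, 2))  # any single vertex can be removed
print(is_k_vertex_connected(cycle, 3))  # removing a and c separates b from d
```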

Recurrent Neural Networks (RNNs) produce state-of-the-art performance on many
machine learning tasks but their demands on resources in terms of memory and
computational power are often high. Therefore, there is great interest in
optimizing the computations performed with these models, especially when
considering the development of specialized low-power hardware for deep networks.
One way of reducing the computational needs is to limit the numerical precision
of the network weights and biases, and this will be addressed for the case of
RNNs. We present results from the use of different stochastic and deterministic
reduced precision training methods applied to two major RNN types, which are
then tested on three datasets. The results show that the stochastic and
deterministic ternarization, pow2 ternarization, and exponential quantization
methods gave rise to low-precision RNNs that produce similar and even higher
accuracy on certain datasets, therefore providing a path towards training more
efficient implementations of RNNs in specialized hardware.

Recurrent Neural Networks (RNNs) produce state-of-the-art performance on many
machine learning tasks but their demands on resources in terms of memory and
computational power are often high. Therefore, there is great interest in
optimizing the computations performed with these models, especially when
considering the development of specialized low-power hardware for deep networks.
One way of reducing the computational needs is to limit the numerical precision
of the network weights and biases. This has led to different proposed rounding
methods which have been applied so far to only Convolutional Neural Networks
and Fully-Connected Networks. This paper addresses the question of how to best
reduce weight precision during training in the case of RNNs. We present results
from the use of different stochastic and deterministic reduced precision
training methods applied to three major RNN types which are then tested on
several datasets. The results show that the weight binarization methods do not
work with the RNNs. However, the stochastic and deterministic ternarization,
and pow2-ternarization methods gave rise to low-precision RNNs that produce
similar and even higher accuracy on certain datasets, therefore providing a path
towards training more efficient implementations of RNNs in specialized
hardware.
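
Ternarization maps each weight to one of {-1, 0, +1}. The sketch below shows
one common deterministic variant (a fixed threshold) and one common stochastic
variant (keep the sign with probability |w|, which makes the rounding unbiased
for weights in [-1, 1]). The threshold of 0.5, the clipping range, and the
function names are assumptions for illustration; the exact rules used in the
experiments may differ.

```python
import random


def ternarize_deterministic(weights, threshold=0.5):
    """Map each weight to -1, 0 or +1 using a fixed threshold (assumed 0.5)."""
    out = []
    for w in weights:
        if w > threshold:
            out.append(1)
        elif w < -threshold:
            out.append(-1)
        else:
            out.append(0)
    return out


def ternarize_stochastic(weights, rng):
    """Map each weight to sign(w) with probability |w|, else to 0.

    Weights are clipped to [-1, 1] first, so the expected value of the
    ternary weight equals the clipped weight.
    """
    out = []
    for w in weights:
        w = max(-1.0, min(1.0, w))
        sign = 1 if w > 0 else -1
        out.append(sign if rng.random() < abs(w) else 0)
    return out


weights = [0.7, -0.2, -0.9, 0.4]  # hypothetical full-precision weights
print(ternarize_deterministic(weights))
print(ternarize_stochastic(weights, random.Random(0)))
```

In training schemes of this kind, the low-precision weights are typically used
in the forward and backward passes while a full-precision copy accumulates the
updates; that training loop is not shown here.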

The influence of pairing correlations on the neutron root mean square (rms)
radius of nuclei is investigated in the framework of self-consistent Skyrme
Hartree-Fock-Bogoliubov calculations. The continuum is treated appropriately by
the Green's function techniques. As an example the nucleus $^{124}$Zr is
treated for a varying strength of pairing correlations. We find that, as the
pairing strength increases, the neutron rms radius first shrinks, reaches a
minimum and beyond this point it expands again. The shrinkage is due to the
so-called `pairing anti-halo effect', i.e. due to the decreasing of the
asymptotic density distribution with increasing pairing. However, in some
cases, increasing pairing correlations can also lead to an expansion of the
nucleus due to a growing occupation of so-called `halo' orbits, i.e. weakly
bound states and resonances in the continuum with low-$\ell$ values. In this
case, the neutron radii are extended just by the influence of pairing
correlations, since these `halo' orbits cannot be occupied without pairing. The
term `anti-halo effect' is not justified in such cases. For a full
understanding of this complicated interplay, self-consistent calculations are
necessary.

Convolutional Neural Networks (CNNs) are effective models for reducing
spectral variations and modeling spectral correlations in acoustic features for
automatic speech recognition (ASR). Hybrid speech recognition systems
incorporating CNNs with Hidden Markov Models/Gaussian Mixture Models
(HMMs/GMMs) have achieved the state of the art in various benchmarks.
Meanwhile, Connectionist Temporal Classification (CTC) with Recurrent Neural
Networks (RNNs), which was proposed for labeling unsegmented sequences, makes it
feasible to train an end-to-end speech recognition system instead of hybrid
settings. However, RNNs are computationally expensive and sometimes difficult
to train. In this paper, inspired by the advantages of both CNNs and the CTC
approach, we propose an end-to-end speech framework for sequence labeling, by
combining hierarchical CNNs with CTC directly without recurrent connections. By
evaluating the approach on the TIMIT phoneme recognition task, we show that the
proposed model is not only computationally efficient, but also competitive with
the existing baseline systems. Moreover, we argue that CNNs have the capability
to model temporal correlations with appropriate context information.
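
CTC lets a frame-level model label unsegmented sequences by mapping each
frame-wise path to a label sequence: repeated labels are merged and blanks are
removed. The sketch below shows this collapsing rule together with simple
greedy (best-path) decoding. The blank index of 0 and the example scores are
assumptions for illustration, and greedy decoding is only one possible
decoding strategy.

```python
def ctc_collapse(path, blank=0):
    """Merge consecutive repeats, then drop blanks (the CTC path mapping)."""
    out = []
    prev = None
    for label in path:
        if label != prev and label != blank:
            out.append(label)
        prev = label
    return out


def greedy_decode(frame_scores, blank=0):
    """Pick the best label per frame, then collapse the resulting path."""
    path = [max(range(len(f)), key=f.__getitem__) for f in frame_scores]
    return ctc_collapse(path, blank)


# Hypothetical per-frame scores over {blank, label 1, label 2}.
frames = [
    [0.1, 0.8, 0.1],
    [0.2, 0.7, 0.1],
    [0.9, 0.05, 0.05],
    [0.1, 0.1, 0.8],
]
print(greedy_decode(frames))
print(ctc_collapse([1, 1, 0, 1, 2, 2, 0]))
```

A blank between two identical labels keeps them distinct, which is how CTC can
emit the same label twice in a row.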

This paper presents a deep nonlinear metric learning framework for data
visualization on an image dataset. We propose the Triangular Similarity and
prove its equivalence to the Cosine Similarity in measuring a data pair. Based
on this novel similarity, a geometrically motivated loss function, the
triangular loss, is then developed for optimizing a metric learning system
comprising two identical CNNs. It is shown that this deep nonlinear system can
be efficiently trained by a hybrid algorithm based on the conventional
backpropagation algorithm. More interestingly, benefiting from classical
manifold learning theories, the proposed system offers two different views to
visualize the outputs, the second of which provides better classification
results than the state-of-the-art methods in the visualizable spaces.

The properties of nuclear matter are studied using state-of-the-art
nucleon-nucleon forces up to fifth order in chiral effective field theory. The
equations of state of symmetric nuclear matter and pure neutron matter are
calculated in the framework of the Brueckner-Hartree-Fock theory. We discuss in
detail the convergence pattern of the chiral expansion and the regulator
dependence of the calculated equations of state and provide an estimation of
the truncation uncertainty. For all employed values of the regulator, the
fifth-order chiral two-nucleon potential is found to generate nuclear
saturation properties similar to the available phenomenological high-precision
potentials. We also extract the symmetry energy of nuclear matter, which is
shown to be quite robust with respect to the chiral order and the value of the
regulator.

The use of reparameterization in the maximization of the likelihood function
of the MA(q) model is discussed. A general method for testing for the presence
of a parameter estimate on the boundary of an MA(q) model is presented. This
test is illustrated with a brief simulation experiment for the MA(q) model with
q=1,2,3,4 in which it is shown that the probability of an estimate being on the
boundary increases with q.

We introduce a general and simple structural design called Multiplicative
Integration (MI) to improve recurrent neural networks (RNNs). MI changes the
way in which information from different sources flows and is integrated in the
computational building block of an RNN, while introducing almost no extra
parameters. The new structure can be easily embedded into many popular RNN
models, including LSTMs and GRUs. We empirically analyze its learning behaviour
and conduct evaluations on several tasks using different RNN models. Our
experimental results demonstrate that Multiplicative Integration can provide a
substantial performance boost over many of the existing RNN models.
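
The core idea can be sketched by contrasting a vanilla recurrent step, which
adds the input and hidden contributions, with a multiplicative one, which takes
their element-wise product. The sketch assumes the simplest form,
tanh(Wx * Uh + b), with no extra gating or bias vectors; the function names and
the tiny matrices are illustrative only.

```python
import math


def matvec(M, v):
    """Matrix-vector product for a list-of-rows matrix."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def additive_step(W, U, b, x, h):
    """Vanilla RNN step: tanh(Wx + Uh + b)."""
    wx, uh = matvec(W, x), matvec(U, h)
    return [math.tanh(a + c + bias) for a, c, bias in zip(wx, uh, b)]


def mi_step(W, U, b, x, h):
    """Multiplicative-integration step: tanh(Wx * Uh + b), element-wise.

    Same parameters W, U, b as the additive step; only the way the two
    information sources are combined changes.
    """
    wx, uh = matvec(W, x), matvec(U, h)
    return [math.tanh(a * c + bias) for a, c, bias in zip(wx, uh, b)]


# Hypothetical 2-unit example with identity weight matrices.
identity = [[1.0, 0.0], [0.0, 1.0]]
x, h, b = [0.5, -1.0], [0.2, 0.3], [0.0, 0.1]
print(additive_step(identity, identity, b, x, h))
print(mi_step(identity, identity, b, x, h))
```

With the product form, the hidden state rescales the input contribution (and
vice versa) instead of merely shifting it.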

With the prevalence of social media and GPS-enabled devices, a massive amount
of geo-textual data has been generated in a stream fashion, leading to a
variety of applications such as location-based recommendation and information
dissemination. In this paper, we investigate a novel real-time top-k monitoring
problem over a sliding window of streaming data; that is, we continuously
maintain the top-k most relevant geo-textual messages (e.g., geo-tagged tweets)
for a large number of spatial-keyword subscriptions (e.g., registered users
interested in local events) simultaneously. To provide the most recent
information under controllable memory cost, a sliding window model is employed on
the streaming geo-textual data. To the best of our knowledge, this is the first
work to study top-k spatial-keyword publish/subscribe over a sliding window. A
novel centralized system, called Skype (Top-k Spatial-keyword
Publish/Subscribe), is proposed in this paper. In Skype, to continuously
maintain top-k results for massive subscriptions, we devise a novel indexing
structure upon subscriptions such that each incoming message can be immediately
delivered on its arrival. To reduce the expensive top-k re-evaluation cost
triggered by message expiration, we develop a novel cost-based k-skyband
technique to reduce the number of re-evaluations in a cost-effective way.
Extensive experiments verify the great efficiency and effectiveness of our
proposed techniques. Furthermore, to support better scalability and higher
throughput, we propose a distributed version of Skype, namely, DSkype, on top
of Storm, which is a popular distributed stream processing system. With the
help of fine-tuned subscription/message distribution mechanisms, DSkype can
achieve an orders-of-magnitude speedup over its centralized version.
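
The role of a k-skyband under a sliding window can be illustrated for a single
subscription. A message can be discarded as soon as at least k newer messages
score at least as high, because those newer messages will outlive it in the
window; the remaining messages are enough to answer top-k after expirations
without rescanning the whole window. The sketch below assumes a count-based
window and precomputed relevance scores, and it shows the plain k-skyband only,
not the cost-based variant or the subscription index proposed in the paper. The
class name `SkybandTopK` is hypothetical.

```python
class SkybandTopK:
    """Top-k over a count-based sliding window via a k-skyband buffer."""

    def __init__(self, k, window):
        self.k = k
        self.window = window
        self.time = 0
        # Entries are [arrival_time, score, dominated_count], oldest first.
        self.buffer = []

    def push(self, score):
        """Insert a new message score and expire/prune old entries."""
        self.time += 1
        # Expire messages that have left the window.
        self.buffer = [e for e in self.buffer if e[0] > self.time - self.window]
        # The new message dominates older ones with a score not above its own.
        for entry in self.buffer:
            if entry[1] <= score:
                entry[2] += 1
        # Entries dominated k or more times can never re-enter the top-k.
        self.buffer = [e for e in self.buffer if e[2] < self.k]
        self.buffer.append([self.time, score, 0])

    def topk(self):
        """Current top-k scores, highest first."""
        return sorted((e[1] for e in self.buffer), reverse=True)[: self.k]


# Hypothetical relevance scores of a message stream for one subscription.
monitor = SkybandTopK(k=2, window=4)
for s in [5, 3, 4, 1, 2, 0, 7, 7, 6, 2, 9, 1]:
    monitor.push(s)
    print(s, monitor.topk(), len(monitor.buffer))
```

The buffer usually holds far fewer entries than the window, which is what
reduces the work triggered by message expiration.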