
In this paper, we study the stochastic combinatorial multi-armed bandit
(CMAB) framework that allows a general nonlinear reward function, whose
expected value may not depend only on the means of the input random variables
but possibly on the entire distributions of these variables. Our framework
enables a much larger class of reward functions such as the $\max()$ function
and nonlinear utility functions. Existing techniques relying on accurate
estimations of the means of random variables, such as the upper confidence
bound (UCB) technique, do not work directly on these functions. We propose a
new algorithm called stochastically dominant confidence bound (SDCB), which
estimates the distributions of underlying random variables and their
stochastically dominant confidence bounds. We prove that SDCB can achieve
$O(\log{T})$ distribution-dependent regret and $\tilde{O}(\sqrt{T})$
distribution-independent regret, where $T$ is the time horizon. We apply our
results to the $K$-MAX problem and expected utility maximization problems. In
particular, for $K$-MAX, we provide the first polynomial-time approximation
scheme (PTAS) for its offline problem, and give the first $\tilde{O}(\sqrt T)$
bound on the $(1-\epsilon)$-approximation regret of its online problem, for any
$\epsilon>0$.
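
The following minimal sketch illustrates two points from the abstract above: the expected value of a $\max()$ reward depends on the whole distributions rather than only their means, and an optimistic estimate can be built from a CDF that stochastically dominates the empirical one. It assumes outcomes lie on a small finite grid in $[0,1]$. The function names and the confidence radius are hypothetical choices for illustration, not the paper's exact specification.

```python
from bisect import bisect_right


def empirical_cdf(samples, grid):
    """Empirical CDF of `samples`, evaluated at each point of the sorted `grid`."""
    ordered = sorted(samples)
    n = len(ordered)
    return [bisect_right(ordered, x) / n for x in grid]


def dominant_cdf(cdf_values, radius):
    """Lower the CDF by `radius` (clipped at 0) at every grid point except the
    last one, which keeps the value 1. The result is a valid CDF that
    first-order stochastically dominates the input. `radius` is an assumed
    confidence width; in practice it would shrink as observations accumulate."""
    shifted = [max(v - radius, 0.0) for v in cdf_values]
    shifted[-1] = 1.0
    return shifted


def expected_max(grid, cdfs):
    """E[max] of independent variables on a common grid: the CDF of the
    maximum is the product of the individual CDFs."""
    total, prev = 0.0, 0.0
    for j, x in enumerate(grid):
        cur = 1.0
        for cdf in cdfs:
            cur *= cdf[j]
        total += x * (cur - prev)
        prev = cur
    return total


grid = [0.0, 0.5, 1.0]
steady = empirical_cdf([0.5, 0.5, 0.5, 0.5], grid)  # always 0.5, mean 0.5
risky = empirical_cdf([0.0, 1.0, 0.0, 1.0], grid)   # 0 or 1, mean 0.5

same_mean_steady = expected_max(grid, [steady, steady])  # 0.5
same_mean_risky = expected_max(grid, [risky, risky])     # 0.75
optimistic = expected_max(grid, [dominant_cdf(risky, 0.2)])
```

Both arms have mean 0.5, yet the expected maximum of two independent copies is 0.5 for the steady arm and 0.75 for the risky one, so a mean-based confidence bound cannot rank them for this reward. Shifting the CDF downward moves probability mass toward larger outcomes, which is what makes the resulting estimate optimistic for monotone rewards.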

For the task of sub-decimeter aerial imagery segmentation, fine-grained
semantic segmentation results are usually difficult to obtain because of
complex remote sensing contents and optical conditions. In addition, remote
sensing imagery inherently suffers from imbalanced class distribution.
Recently, convolutional neural networks (CNNs) have shown outstanding
performance on this task. In this paper, we propose TreeSegNet to address the
class imbalance problem and further improve accuracy as measured by the
evaluation metrics. Based on the infrastructure of DeepUNet, a Tree-CNN model
in which each node represents a ResNeXt unit is constructed automatically
according to the confusion matrix and a minimum graph cut algorithm. By
transporting feature maps through concatenating connections, the Tree-CNN block
fuses multi-scale features and learns the best weights for the model. In
experiments on the ISPRS 2D semantic labeling Potsdam dataset, the results
obtained by TreeSegNet are better than those of published state-of-the-art
methods. The F1 scores are improved, especially for classes that are easily
confused. Complete and detailed comparisons and analyses are performed to show
that the improvement comes from the construction and embedding of the Tree-CNN
module.

We generalize the monomorphism category from quivers (with monomial relations)
to arbitrary finite-dimensional algebras by a homological definition. Given two
finite-dimensional algebras $A$ and $B$, we use the special monomorphism category
Mon(B, A-Gproj) to describe some Gorenstein projective bimodules over the
tensor product of $A$ and $B$. If one of the two algebras is Gorenstein, we
give a necessary and sufficient condition for Mon(B, A-Gproj) to be the
category of all Gorenstein projective bimodules. In addition, if both $A$ and
$B$ are Gorenstein, we can describe the category of all Gorenstein projective
bimodules via filtration categories. Similarly, in this case, we get the same
result for infinitely generated Gorenstein projective bimodules.

We revisit the question of reducing online learning to approximate
optimization of the offline problem. In this setting, we give two algorithms
with near-optimal performance in the full information setting: they guarantee
optimal regret and require only polylogarithmically many calls to the
approximation oracle per iteration. Furthermore, these algorithms apply to the
more general improper learning problems. In the bandit setting, our algorithm
also significantly improves the best previously known oracle complexity while
maintaining the same regret.

Deep convolutional neural networks (CNNs) have greatly improved the Face
Recognition (FR) performance in recent years. Almost all CNNs in FR are trained
on carefully labeled datasets containing plenty of identities. However,
such high-quality datasets are very expensive to collect, which prevents many
researchers from achieving state-of-the-art performance. In this paper, we propose
a framework, called SeqFace, for learning discriminative face features. Besides
a traditional identity training dataset, the designed SeqFace can train CNNs by
using an additional dataset which includes a large number of face sequences
collected from videos. Moreover, label smoothing regularization (LSR) and a
newly proposed discriminative sequence agent (DSA) loss are employed to enhance
the discriminative power of deep face features by making full use of the sequence
data. Our method achieves excellent performance on Labeled Faces in the Wild
(LFW) and YouTube Faces (YTF) with only a single ResNet. The code and models are
publicly available online (https://github.com/huangyangyu/SeqFace).

The problem of disturbance rejection/attenuation for constant-input delayed
linear multi-agent systems (MASs) with a directed communication topology is
tackled in this paper, where a classic model reduction technique is introduced
to transform the delayed MAS into a delay-free one. First, when the leader
has no control input, a novel adaptive predictive extended state observer (ESO)
using only relative state information of neighboring agents is designed to
achieve disturbance-rejected consensus tracking. The stabilization analysis is
presented via a Lyapunov function, and sufficient conditions are derived in
terms of linear matrix inequalities. Then the result is extended to the
disturbance-attenuated case where the leader has a bounded control input that is
known only to a portion of the followers. Finally, two numerical examples are
presented to illustrate the effectiveness of the proposed strategies. The main
contribution focuses on the design of adaptive predictive ESO protocols with
the fully distributed property.

A new consistent, spatially adaptive, smoothed particle hydrodynamics (SPH)
method for Fluid-Structure Interactions (FSI) is presented. The method combines
several attributes that have not been simultaneously satisfied by other SPH
methods. Specifically, it is second-order convergent; it allows for resolutions
spatially adapted with moving (translating and rotating) boundaries of
arbitrary geometries; and it accelerates the FSI solution, as the adaptive
approach leads to fewer degrees of freedom without sacrificing accuracy. The
key ingredients in the method are a consistent discretization of differential
operators, a \textit{posteriori} error estimator/distance-based criterion of
adaptivity, and a particle-shifting technique. The method is applied in
simulating six different flows or FSI problems. The new method's convergence,
accuracy, and efficiency attributes are assessed by comparing the results it
produces with analytical, finite element, and consistent SPH uniform
high-resolution solutions as well as experimental data.

A first line of attack in exploratory data analysis is data visualization,
i.e., generating a 2-dimensional representation of data that makes clusters of
similar points visually identifiable. Standard Johnson-Lindenstrauss
dimensionality reduction does not produce data visualizations. The t-SNE
heuristic of van der Maaten and Hinton, which is based on non-convex
optimization, has become the de facto standard for visualization in a wide
range of applications.
This work gives a formal framework for the problem of data visualization:
finding a 2-dimensional embedding of clusterable data that correctly separates
individual clusters to make them visually identifiable. We then give a rigorous
analysis of the performance of t-SNE under a natural, deterministic condition
on the "ground-truth" clusters (similar to conditions assumed in earlier
analyses of clustering) in the underlying data. These are the first provable
guarantees on t-SNE for constructing good data visualizations.
We show that our deterministic condition is satisfied by considerably general
probabilistic generative models for clusterable data such as mixtures of
well-separated log-concave distributions. Finally, we give theoretical evidence
that t-SNE provably succeeds in partially recovering cluster structure even
when the above deterministic condition is not met.

We consider the convex-concave saddle point problem $\min_{x}\max_{y}
f(x)+y^\top A x-g(y)$ where $f$ is smooth and convex and $g$ is smooth and
strongly convex. We prove that if the coupling matrix $A$ has full column rank,
the vanilla primal-dual gradient method can achieve linear convergence even if
$f$ is not strongly convex. Our result generalizes previous work which either
requires $f$ and $g$ to be quadratic functions or requires proximal mappings
for both $f$ and $g$. We adopt a novel analysis technique that in each
iteration uses a "ghost" update as a reference, and show that the iterates in
the primal-dual gradient method converge to this "ghost" sequence. Using the
same technique we further give an analysis for the primal-dual stochastic
variance reduced gradient (SVRG) method for convex-concave saddle point
problems with a finite-sum structure.
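
As a small illustration of the setting above, the sketch below runs simultaneous gradient descent in $x$ and gradient ascent in $y$ on $f(x)+y^\top A x-g(y)$. The instance is an assumption chosen for the example: $f=0$ (convex but not strongly convex), $g(y)=\frac{1}{2}\|y\|^2$, a $3\times 2$ matrix $A$ with full column rank, and hand-picked step sizes. It is not the paper's analysis or its tuned parameters.

```python
def matvec(M, v):
    """Matrix-vector product for a list-of-rows matrix."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]


def transpose(M):
    return [list(col) for col in zip(*M)]


def primal_dual_gradient(grad_f, grad_g, A, x, y, eta_x, eta_y, steps):
    """Simultaneous descent in x and ascent in y on
    L(x, y) = f(x) + y^T A x - g(y). Both gradients are evaluated at the
    current iterate before either variable is updated."""
    At = transpose(A)
    for _ in range(steps):
        gx = [a + b for a, b in zip(grad_f(x), matvec(At, y))]  # dL/dx
        gy = [a - b for a, b in zip(matvec(A, x), grad_g(y))]   # dL/dy
        x = [xi - eta_x * g for xi, g in zip(x, gx)]
        y = [yi + eta_y * g for yi, g in zip(y, gy)]
    return x, y


# Assumed toy instance: the unique saddle point is x = 0, y = 0.
A = [[1.0, 0.0], [0.0, 2.0], [1.0, 1.0]]  # full column rank


def grad_f(x):
    return [0.0 for _ in x]  # f = 0


def grad_g(y):
    return list(y)  # g(y) = 0.5 * ||y||^2


x, y = primal_dual_gradient(
    grad_f, grad_g, A, [1.0, -1.0], [0.0, 0.0, 0.0], 0.1, 0.1, 2000
)
```

On this instance the iterates contract toward the saddle point at a geometric rate even though $f$ has no curvature of its own; the coupling through the full-column-rank $A$ supplies it.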

We present an efficient way to solve the Bethe-Salpeter equation (BSE), a
model for the computation of absorption spectra in molecules and solids that
includes electron-hole excitations. Standard approaches to construct and
diagonalize the Bethe-Salpeter Hamiltonian require at least $\mathcal{O}(N_e^5)$
operations, where $N_e$ is proportional to the number of electrons in the
system, limiting their application to small systems. Our approach is based on the
interpolative separable density fitting (ISDF) technique to construct low-rank
approximations to the bare and screened exchange operators associated with the
BSE Hamiltonian. This approach reduces the complexity of the Hamiltonian
construction to $\mathcal{O}(N_e^3)$ with a much smaller pre-constant. Here, we
implement the ISDF method for the BSE calculations within the Tamm-Dancoff
approximation (TDA) in the BerkeleyGW software package. We show that ISDF-based
BSE calculations in molecules and solids reproduce accurate exciton energies
and optical absorption spectra with significantly reduced computational cost.

A perturbation-iteration method is developed for the computation of the
Hermite-Gaussian-like solitons with arbitrary peak numbers in nonlocal
nonlinear media. This method is based on the perturbed model of the
Schr\"{o}dinger equation for the harmonic oscillator, in which the minimum
perturbation is obtained by the iteration. This method takes a few tens of
iteration loops to achieve sufficiently high accuracy, and the initial condition
is fixed to the Hermite-Gaussian function. The method we developed might also be
extended to the numerical integration of the Schr\"{o}dinger equations in any
type of potentials.

We propose a rank-$k$ variant of the classical Frank-Wolfe algorithm to solve
convex optimization over a trace-norm ball. Our algorithm replaces the top
singular-vector computation ($1$-SVD) in Frank-Wolfe with a top-$k$
singular-vector computation ($k$-SVD), which can be done by repeatedly applying
$1$-SVD $k$ times. Alternatively, our algorithm can be viewed as a rank-$k$
restricted version of projected gradient descent. We show that our algorithm
has a linear convergence rate when the objective function is smooth and
strongly convex, and the optimal solution has rank at most $k$. This improves
the convergence rate and the total time complexity of the Frank-Wolfe method
and its variants.
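
The remark that a top-$k$ singular-vector computation can be obtained by applying a top-1 computation $k$ times can be sketched as follows. This is a minimal illustration under stated assumptions: the top-1 step uses plain power iteration, and the repetition uses deflation (subtracting each recovered rank-one term). The helper names `top_singular_triple` and `top_k_svd` are hypothetical, and the paper may realize this step differently.

```python
import math
import random


def top_singular_triple(M, iters=300, seed=0):
    """Approximate the top singular triple (sigma, u, v) of M by power
    iteration on M^T M, starting from a seeded random vector. Accuracy
    depends on the gap between the two largest singular values."""
    rows, cols = len(M), len(M[0])
    rng = random.Random(seed)
    v = [rng.gauss(0.0, 1.0) for _ in range(cols)]
    start = math.sqrt(sum(t * t for t in v))
    v = [t / start for t in v]
    for _ in range(iters):
        u = [sum(M[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        w = [sum(M[i][j] * u[i] for i in range(rows)) for j in range(cols)]
        norm = math.sqrt(sum(t * t for t in w))
        if norm == 0.0:  # M v = 0: nothing left to amplify
            break
        v = [t / norm for t in w]
    u = [sum(M[i][j] * v[j] for j in range(cols)) for i in range(rows)]
    sigma = math.sqrt(sum(t * t for t in u))
    if sigma > 0.0:
        u = [t / sigma for t in u]
    return sigma, u, v


def top_k_svd(M, k):
    """k-SVD by deflation: take the top triple of the residual, subtract
    sigma * u * v^T from the residual, and repeat k times."""
    residual = [[float(entry) for entry in row] for row in M]
    triples = []
    for _ in range(k):
        sigma, u, v = top_singular_triple(residual)
        triples.append((sigma, u, v))
        for i in range(len(residual)):
            for j in range(len(residual[0])):
                residual[i][j] -= sigma * u[i] * v[j]
    return triples


example = [[3.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 1.0]]
top_two = top_k_svd(example, 2)
singular_values = [s for s, _, _ in top_two]  # close to 3 and 2
```

For the diagonal example the two recovered singular values are approximately 3 and 2. Power iteration is used only because it keeps the sketch dependency-free; a Lanczos-type solver would be the usual choice for large matrices.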

Finding the electromagnetic (EM) counterpart of binary compact star mergers,
especially binary neutron star (BNS) mergers, is critically important for
gravitational wave (GW) astronomy, cosmology and fundamental physics. On Aug.
17, 2017, Advanced LIGO and \textit{Fermi}/GBM independently triggered on the
first BNS merger, GW170817, and its high-energy EM counterpart, GRB 170817A,
respectively, resulting in a global observation campaign covering gamma-ray,
X-ray, UV, optical, IR, and radio bands as well as neutrinos. The High Energy
X-ray telescope (HE) onboard \textit{Insight}-HXMT (Hard X-ray Modulation
Telescope) is the only high-energy gamma-ray telescope that monitored the
entire GW localization area, and especially the optical counterpart
(SSS17a/AT2017gfo), with a very large collection area ($\sim$1000 cm$^2$) and
microsecond time resolution in 0.2-5 MeV. In addition, \textit{Insight}-HXMT
quickly implemented a Target of Opportunity (ToO) observation to scan the GW
localization area for potential X-ray emission from the GW source. Although it
did not detect any significant high-energy (0.2-5 MeV) radiation from GW170817,
its observation helped to confirm the unexpectedly weak and soft nature of
GRB 170817A. Meanwhile, \textit{Insight}-HXMT/HE provides one of the most
stringent constraints ($\sim$10$^{-7}$ to 10$^{-6}$ erg/cm$^2$/s) for both
GRB 170817A and any other possible precursor or extended emissions in 0.2-5 MeV,
which helps us to better understand the properties of EM radiation from this
BNS merger. Therefore the observation of \textit{Insight}-HXMT constitutes an
important chapter in the full context of multi-wavelength and multi-messenger
observation of this historical GW event.

Entity alignment is the task of finding entities in two knowledge bases (KBs)
that represent the same real-world object. When facing KBs in different natural
languages, conventional cross-lingual entity alignment methods rely on machine
translation to eliminate the language barriers. These approaches often suffer
from the uneven quality of translations between languages. While recent
embedding-based techniques encode entities and relationships in KBs and do not
need machine translation for cross-lingual entity alignment, a significant
number of attributes remain largely unexplored. In this paper, we propose a
joint attribute-preserving embedding model for cross-lingual entity alignment.
It jointly embeds the structures of two KBs into a unified vector space and
further refines it by leveraging attribute correlations in the KBs. Our
experimental results on real-world datasets show that this approach
significantly outperforms the state-of-the-art embedding approaches for
cross-lingual entity alignment and could be complemented with methods based on
machine translation.

Quantum protocols require access to large-scale entangled quantum states, due
to the requirement of channel capacity. As a promising candidate,
high-dimensional orbital angular momentum (OAM) entangled states have been
implemented, but only one of the four OAM Bell states in each individual subspace
can be distinguished. Here we demonstrate the first realization of complete OAM
Bell-state measurement (OAM-BSM) in an individual subspace, by seeking a
suitable unitary matrix performable using only linear optics and breaking the
degeneracy of the four OAM Bell states in an ancillary polarization dimension. We
further realize superdense coding via our complete OAM-BSM with an average
success probability of ~82% and a channel capacity of ~1.1(4) bits. This work
opens the window for increasing the channel capacity and extending the
applications of OAM quantum states in quantum information in the future.

Semantic segmentation is a fundamental research topic in remote sensing image
processing. Because of the complex maritime environment, sea-land
segmentation is a challenging task. Although neural networks have achieved
excellent performance in semantic segmentation in recent years, few works
use CNNs for sea-land segmentation, and the results could be
further improved. This paper proposes a novel deep convolutional neural network
named DeepUNet. Like the U-Net, its structure has a contracting path and an
expansive path to get high-resolution output. Unlike the U-Net, however,
DeepUNet uses DownBlocks instead of convolution layers in the contracting path
and UpBlocks in the expansive path. The two novel blocks bring two new
connections, the U-connection and the Plus connection, which are introduced to
obtain more precise segmentation results. To verify our network architecture,
we built a new challenging sea-land dataset and compared DeepUNet on it with
SegNet and U-Net. Experimental results show that DeepUNet achieved good
performance compared with other architectures, especially on high-resolution
remote sensing imagery.

The commutator direct inversion of the iterative subspace (commutator DIIS or
C-DIIS) method developed by Pulay is an efficient and the most widely used
scheme in quantum chemistry to accelerate the convergence of self-consistent
field (SCF) iterations in Hartree-Fock theory and Kohn-Sham density functional
theory. The C-DIIS method requires the explicit storage of the density matrix,
the Fock matrix and the commutator matrix. Hence the method can only be used
for systems with a relatively small basis set, such as the Gaussian basis set.
We develop a new method that enables the C-DIIS method to be efficiently
employed in electronic structure calculations with a large basis set such as
planewaves for the first time. The key ingredient is the projection of both the
density matrix and the commutator matrix to an auxiliary matrix called the
gauge-fixing matrix. The resulting projected commutator-DIIS method (PC-DIIS)
only operates on matrices of the same dimension as the matrix that consists of
Kohn-Sham orbitals. The cost of the method is comparable to that of standard
charge mixing schemes used in large basis set calculations. The PC-DIIS method
is gauge-invariant, which guarantees that its performance is invariant with
respect to any unitary transformation of the Kohn-Sham orbitals. We demonstrate
that the PC-DIIS method can be viewed as an extension of an iterative
eigensolver for nonlinear problems. We use the PC-DIIS method for accelerating
Kohn-Sham density functional theory calculations with hybrid
exchange-correlation functionals, and demonstrate its superior performance
compared to the commonly used nested two-level SCF iteration procedure.

We present a new efficient way to perform hybrid density functional theory
(DFT) based electronic structure calculations. The new method uses an
interpolative separable density fitting (ISDF) procedure to construct a set of
numerical auxiliary basis vectors and a compact approximation of the matrix
consisting of products of occupied orbitals represented in a large basis set
such as the planewave basis. Such an approximation allows us to reduce the
number of Poisson solves from $\mathcal{O}(N_{e}^2)$ to $\mathcal{O}(N_{e})$
when we apply the exchange operator to occupied orbitals in an iterative method
for solving the Kohn-Sham equations, where $N_{e}$ is the number of electrons
in the system to be studied. We show that the ISDF procedure can be carried out
in $\mathcal{O}(N_{e}^3)$ operations, with a much smaller pre-constant compared
to methods used in existing approaches. When combined with the recently
developed adaptively compressed exchange (ACE) operator formalism, which
reduces the number of times the exchange operator needs to be updated, the
resulting ACE-ISDF method significantly reduces the computational cost
associated with the exchange operator by nearly two orders of magnitude
compared to existing approaches for a large silicon system with $1000$ atoms.
We demonstrate that the ACE-ISDF method can produce accurate energies and forces
for insulating and metallic systems, and that it is possible to obtain
converged hybrid functional calculation results for a 1000-atom bulk silicon
system within 10 minutes on 2000 computational cores. We also show that ACE-ISDF
can scale to 8192 computational cores for a 4096-atom bulk silicon system. We
use the ACE-ISDF method to geometrically optimize a 1000-atom silicon system
with a vacancy defect using the HSE06 functional and compute its electronic
structure.

Let $A$ be a unital associative algebra over a field $F$ and $V$ be a unital
left $A$-module. The module $V$ is called zero action determined if every
bilinear map $f: A\times V\rightarrow F$ with the property that $f(a,m)=0$
whenever $am=0$ is of the form $f(x,v)=\Phi(xv)$ for some linear map $\Phi:
V\rightarrow F$. In this paper, we classify the finite-dimensional irreducible
and principal projective zero action determined modules of $A$. As an
application, two classes of zero product determined algebras are obtained:
certain semiperfect algebras (infinite-dimensional in general) and
quasi-hereditary cellular algebras.

Electronic medical records (EMRs) contain multi-format electronic medical data
that hold an abundance of medical knowledge. Faced with a patient's symptoms,
experienced caregivers make the right medical decisions based on professional
knowledge that accurately grasps relationships between symptoms, diagnoses and
corresponding treatments. In this paper, we aim to capture these relationships
by constructing a large and high-quality heterogeneous graph linking patients,
diseases, and drugs (PDD) in EMRs. Specifically, we propose a novel framework
to extract important medical entities from MIMIC-III (Medical Information Mart
for Intensive Care III) and automatically link them with the existing
biomedical knowledge graphs, including the ICD-9 ontology and DrugBank. The PDD
graph presented in this paper is accessible on the Web via the SPARQL endpoint,
and provides a pathway for medical discovery and applications, such as
effective treatment recommendations.

The FFT (fast Fourier transform) plays a very important role in many fields,
such as digital signal processing and digital image processing. However, in
applications, the FFT can become a bottleneck for processing efficiency,
especially in remote sensing, where large amounts of data need to be processed
with the FFT. Shortening the FFT computation time is therefore particularly
important. The GPU (Graphics Processing Unit) has been used in many common
areas, and its acceleration effect is very pronounced compared with the CPU
(Central Processing Unit) platform. In this paper, we present a new parallel
method to execute the FFT on the GPU. Based on the GPU storage system and
hardware processing pipeline, we improve the way data are stored. We divide the
data into parts according to the size of the data to make full use of the
characteristics of the GPU. We propose a memory-optimized method based on
shared memory and texture memory to reduce the number of global memory accesses
and achieve better efficiency. The results show that the GPU-based
memory-optimized FFT implementation not only outperforms the FFTW library on
the CPU platform by over 100%, but also outperforms the CUFFT library on the
GPU platform by over 30%.

The superconducting film of (Li1-xFex)OHFeSe is reported for the first time.
The thin film exhibits a small in-plane crystal mosaic of 0.22 deg, in terms of
the FWHM (full-width-at-half-maximum) of the x-ray rocking curve, and an
excellent out-of-plane orientation by x-ray phi-scan. Its bulk superconducting
transition temperature (Tc) of 42.4 K is characterized by both zero electrical
resistance and diamagnetization measurements. The upper critical field (Hc2) is
estimated to be 79.5 T and 443 T, respectively, for the magnetic field
perpendicular and parallel to the ab plane. Moreover, a large critical current
density (Jc) of over 0.5 MA/cm2 is achieved at ~20 K. Such a (Li1-xFex)OHFeSe
film is therefore not only important to fundamental research for understanding
the high-Tc mechanism, but also promising in the field of high-Tc
superconductivity applications, especially in high-performance electronic
devices and large scientific facilities such as superconducting accelerators.

The evolution from superconducting LiTi2O4-delta to insulating Li4Ti5O12 thin
films has been studied by precisely adjusting the oxygen pressure during the
sample fabrication process. In the superconducting LiTi2O4-delta films, with
the increase of oxygen pressure, the oxygen vacancies are filled, and the
c-axis lattice constant decreases gradually. With the increase of the oxygen
pressure to a certain critical value, the c-axis lattice constant becomes
stable, which implies that the Li4Ti5O12 phase comes into being. The process of
oxygen filling is manifested by the annular bright-field images of the scanning
transmission electron microscopy techniques. The temperature at which the
magnetoresistance changes from positive to negative shows non-monotonic
behavior with the increase of oxygen pressure. The theoretical explanation of
the oxygen effects on the structure and superconductivity of LiTi2O4-delta has
also been discussed in this work.

Derived equivalences for Artin algebras (and almost $\nu$-stable derived
equivalences for finite-dimensional algebras) are constructed from Milnor
squares of algebras. Particularly, three operations of gluing vertices,
unifying arrows and identifying socle elements on derived equivalent algebras
are presented to produce new derived equivalences of the resulting algebras
from the given ones. As a byproduct, we construct a series of derived
equivalences, showing that derived equivalences may change Frobenius type of
algebras in general, though both tilting procedure and almost $\nu$-stable
derived equivalences do preserve Frobenius type of algebras.

We seek to accelerate and increase the size of simulations for
fluid-structure interactions (FSI) by using multiple resolutions in the spatial
discretization of the equations governing the time evolution of systems
displaying two-way fluid-solid coupling. To this end, we propose a
multi-resolution smoothed particle hydrodynamics (SPH) approach in which
subdomains of different resolutions are directly coupled without any overlap
region. The second-order consistent discretization of spatial differential
operators is employed to ensure the accuracy of the proposed method. As SPH
particles advect with the flow, a dynamic SPH particle refinement/coarsening is
employed via splitting/merging to maintain a predefined multi-resolution
configuration. Particle regularity is enforced via a particle-shifting
technique to ensure accuracy and stability of the Lagrangian particle-based
method embraced. The convergence, accuracy, and efficiency attributes of the
new method are assessed by simulating four different flows. In this process,
the numerical results are compared to the analytical, finite element, and
consistent SPH single-resolution solutions. We anticipate that the proposed
multi-resolution method will enlarge the class of SPH-tractable FSI
applications.