• In this paper, we study the stochastic combinatorial multi-armed bandit (CMAB) framework that allows a general nonlinear reward function, whose expected value may not depend only on the means of the input random variables but possibly on the entire distributions of these variables. Our framework enables a much larger class of reward functions such as the $\max()$ function and nonlinear utility functions. Existing techniques relying on accurate estimations of the means of random variables, such as the upper confidence bound (UCB) technique, do not work directly on these functions. We propose a new algorithm called stochastically dominant confidence bound (SDCB), which estimates the distributions of underlying random variables and their stochastically dominant confidence bounds. We prove that SDCB can achieve $O(\log{T})$ distribution-dependent regret and $\tilde{O}(\sqrt{T})$ distribution-independent regret, where $T$ is the time horizon. We apply our results to the $K$-MAX problem and expected utility maximization problems. In particular, for $K$-MAX, we provide the first polynomial-time approximation scheme (PTAS) for its offline problem, and give the first $\tilde{O}(\sqrt T)$ bound on the $(1-\epsilon)$-approximation regret of its online problem, for any $\epsilon>0$.
  • For the task of subdecimeter aerial imagery segmentation, the fine-grained semantic segmentation results are usually difficult to obtain because of complex remote sensing contents and optical conditions. In addition, remote sensing imagery has inherent limitations of imbalanced class distribution. Recently, convolutional neural networks (CNNs) have shown outstanding performance on this task. In this paper, we propose the TreeSegNet to solve the class imbalance problem and further improve the accuracy in the metrics' point of view. Based on the infrastructure of DeepUNet, a Tree-CNN model in which each node represents a ResNeXt unit is constructed automatically according to confusion matrix and minimum graph cut algorithm. By transporting feature maps by concatenating connections, the Tree-CNN block fuses the multiscale features and learning the best weights for the model. In the experiments on ISPRS 2D semantic labeling Potsdam dataset, the results gotten by TreeSegNet are better than the opened state-of-the-art methods. The F1 measure scores of classes are improved especially for those classes that are easily confused. Completely and detailed comparison and analysis are performed to show that the improvement is brought by the construction and the embedding of the Tree-CNN module.
  • We generalize the monomorphism category from quiver (with monomial relations) to arbitrary finite dimensional algebras by a homological definition. Given two finite dimension algebras $A$ and $B$, we use the special monomorphism category Mon(B, A-Gproj) to describe some Gorenstein projective bimodules over the tensor product of $A$ and $B$. If one of the two algebras is Gorenstein, we give a sufficient and necessary condition for Mon(B, A-Gproj) being the category of all Gorenstein projective bimodules. In addition, If both $A$ and $B$ are Gorenstein, we can describe the category of all Gorenstein projective bimodules via filtration categories. Similarly, in this case, we get the same result for infinitely generated Gorenstein projective bimodules.
  • We revisit the question of reducing online learning to approximate optimization of the offline problem. In this setting, we give two algorithms with near-optimal performance in the full information setting: they guarantee optimal regret and require only poly-logarithmically many calls to the approximation oracle per iteration. Furthermore, these algorithms apply to the more general improper learning problems. In the bandit setting, our algorithm also significantly improves the best previously known oracle complexity while maintaining the same regret.
  • Deep convolutional neural networks (CNNs) have greatly improved the Face Recognition (FR) performance in recent years. Almost all CNNs in FR are trained on the carefully labeled datasets containing plenty of identities. However, such high-quality datasets are very expensive to collect, which restricts many researchers to achieve state-of-the-art performance. In this paper, we propose a framework, called SeqFace, for learning discriminative face features. Besides a traditional identity training dataset, the designed SeqFace can train CNNs by using an additional dataset which includes a large number of face sequences collected from videos. Moreover, the label smoothing regularization (LSR) and a new proposed discriminative sequence agent (DSA) loss are employed to enhance discrimination power of deep face features via making full use of the sequence data. Our method achieves excellent performance on Labeled Faces in the Wild (LFW), YouTube Faces (YTF), only with a single ResNet. The code and models are publicly available on-line (https://github.com/huangyangyu/SeqFace).
  • The problem of disturbance rejection/attenuation for constant-input delayed linear multi-agent systems (MASs) with the directed communication topology is tackled in this paper, where a classic model reduction technique is introduced to transform the delayed MAS into the delay-free one. First, when the leader has no control input, a novel adaptive predictive extended state observer (ESO) using only relative state information of neighboring agents is designed to achieve disturbance-rejected consensus tracking. The stabilization analysis is presented via the Lyapunov function and sufficient conditions are derived in terms of linear matrix inequalities. Then the result is extended to the disturbance-attenuated case where the leader has bounded control input which is only known by a portion of followers. Finally, two numerical examples are presented to illustrate the effectiveness of proposed strategies. The main contribution focuses on the design of adaptive predictive ESO protocols with the fully distributed property.
  • A new consistent, spatially adaptive, smoothed particle hydrodynamics (SPH) method for Fluid-Structure Interactions (FSI) is presented. The method combines several attributes that have not been simultaneously satisfied by other SPH methods. Specifically, it is second-order convergent; it allows for resolutions spatially adapted with moving (translating and rotating) boundaries of arbitrary geometries; and, it accelerates the FSI solution as the adaptive approach leads to fewer degrees of freedom without sacrificing accuracy. The key ingredients in the method are a consistent discretization of differential operators, a \textit{posteriori} error estimator/distance-based criterion of adaptivity, and a particle-shifting technique. The method is applied in simulating six different flows or FSI problems. The new method's convergence, accuracy, and efficiency attributes are assessed by comparing the results it produces with analytical, finite element, and consistent SPH uniform high-resolution solutions as well as experimental data.
  • A first line of attack in exploratory data analysis is data visualization, i.e., generating a 2-dimensional representation of data that makes clusters of similar points visually identifiable. Standard Johnson-Lindenstrauss dimensionality reduction does not produce data visualizations. The t-SNE heuristic of van der Maaten and Hinton, which is based on non-convex optimization, has become the de facto standard for visualization in a wide range of applications. This work gives a formal framework for the problem of data visualization - finding a 2-dimensional embedding of clusterable data that correctly separates individual clusters to make them visually identifiable. We then give a rigorous analysis of the performance of t-SNE under a natural, deterministic condition on the "ground-truth" clusters (similar to conditions assumed in earlier analyses of clustering) in the underlying data. These are the first provable guarantees on t-SNE for constructing good data visualizations. We show that our deterministic condition is satisfied by considerably general probabilistic generative models for clusterable data such as mixtures of well-separated log-concave distributions. Finally, we give theoretical evidence that t-SNE provably succeeds in partially recovering cluster structure even when the above deterministic condition is not met.
  • We consider the convex-concave saddle point problem $\min_{x}\max_{y} f(x)+y^\top A x-g(y)$ where $f$ is smooth and convex and $g$ is smooth and strongly convex. We prove that if the coupling matrix $A$ has full column rank, the vanilla primal-dual gradient method can achieve linear convergence even if $f$ is not strongly convex. Our result generalizes previous work which either requires $f$ and $g$ to be quadratic functions or requires proximal mappings for both $f$ and $g$. We adopt a novel analysis technique that in each iteration uses a "ghost" update as a reference, and show that the iterates in the primal-dual gradient method converge to this "ghost" sequence. Using the same technique we further give an analysis for the primal-dual stochastic variance reduced gradient (SVRG) method for convex-concave saddle point problems with a finite-sum structure.
  • We present an efficient way to solve the Bethe-Salpeter equation (BSE), a model for the computation of absorption spectra in molecules and solids that includes electron-hole excitations. Standard approaches to construct and diagonalize the Bethe-Salpeter Hamiltonian require at least $\O(N_e^5)$ operations, where $N_e$ is proportional to the number of electrons in the system, limiting its application to small systems. Our approach is based on the interpolative separable density fitting (ISDF) technique to construct low rank approximations to the bare and screened exchange operators associated with the BSE Hamiltonian. This approach reduces the complexity of the Hamiltonian construction to $\O(N_e^3)$ with a much smaller pre-constant. Here, we implement the ISDF method for the BSE calculations within the Tamm-Dancoff approximation (TDA) in the BerkeleyGW software package. We show that ISDF-based BSE calculations in molecules and solids reproduce accurate exciton energies and optical absorption spectra with significantly reduced computational cost.
  • An perturbation-iteration method is developed for the computation of the Hermite-Gaussian-like solitons with arbitrary peak numbers in nonlocal nonlinear media. This method is based on the perturbed model of the Schr\"{o}dinger equation for the harmonic oscillator, in which the minimum perturbation is obtained by the iteration. This method takes a few tens of iteration loops to achieve enough high accuracy, and the initial condition is fixed to the Hermite-Gaussian function. The method we developed might also be extended to the numerical integration of the Schr\"{o}dinger equations in any type of potentials.
  • We propose a rank-$k$ variant of the classical Frank-Wolfe algorithm to solve convex optimization over a trace-norm ball. Our algorithm replaces the top singular-vector computation ($1$-SVD) in Frank-Wolfe with a top-$k$ singular-vector computation ($k$-SVD), which can be done by repeatedly applying $1$-SVD $k$ times. Alternatively, our algorithm can be viewed as a rank-$k$ restricted version of projected gradient descent. We show that our algorithm has a linear convergence rate when the objective function is smooth and strongly convex, and the optimal solution has rank at most $k$. This improves the convergence rate and the total time complexity of the Frank-Wolfe method and its variants.
  • Finding the electromagnetic (EM) counterpart of binary compact star merger, especially the binary neutron star (BNS) merger, is critically important for gravitational wave (GW) astronomy, cosmology and fundamental physics. On Aug. 17, 2017, Advanced LIGO and \textit{Fermi}/GBM independently triggered the first BNS merger, GW170817, and its high energy EM counterpart, GRB 170817A, respectively, resulting in a global observation campaign covering gamma-ray, X-ray, UV, optical, IR, radio as well as neutrinos. The High Energy X-ray telescope (HE) onboard \textit{Insight}-HXMT (Hard X-ray Modulation Telescope) is the unique high-energy gamma-ray telescope that monitored the entire GW localization area and especially the optical counterpart (SSS17a/AT2017gfo) with very large collection area ($\sim$1000 cm$^2$) and microsecond time resolution in 0.2-5 MeV. In addition, \textit{Insight}-HXMT quickly implemented a Target of Opportunity (ToO) observation to scan the GW localization area for potential X-ray emission from the GW source. Although it did not detect any significant high energy (0.2-5 MeV) radiation from GW170817, its observation helped to confirm the unexpected weak and soft nature of GRB 170817A. Meanwhile, \textit{Insight}-HXMT/HE provides one of the most stringent constraints (~10$^{-7}$ to 10$^{-6}$ erg/cm$^2$/s) for both GRB170817A and any other possible precursor or extended emissions in 0.2-5 MeV, which help us to better understand the properties of EM radiation from this BNS merger. Therefore the observation of \textit{Insight}-HXMT constitutes an important chapter in the full context of multi-wavelength and multi-messenger observation of this historical GW event.
  • Entity alignment is the task of finding entities in two knowledge bases (KBs) that represent the same real-world object. When facing KBs in different natural languages, conventional cross-lingual entity alignment methods rely on machine translation to eliminate the language barriers. These approaches often suffer from the uneven quality of translations between languages. While recent embedding-based techniques encode entities and relationships in KBs and do not need machine translation for cross-lingual entity alignment, a significant number of attributes remain largely unexplored. In this paper, we propose a joint attribute-preserving embedding model for cross-lingual entity alignment. It jointly embeds the structures of two KBs into a unified vector space and further refines it by leveraging attribute correlations in the KBs. Our experimental results on real-world datasets show that this approach significantly outperforms the state-of-the-art embedding approaches for cross-lingual entity alignment and could be complemented with methods based on machine translation.
  • Quantum protocols require access to large-scale entangled quantum states, due to the requirement of channel capacity. As a promising candidate, the high-dimensional orbital angular momentum (OAM) entangled states have been implemented, but only one of four OAM Bell states in each individual subspace can be distinguished. Here we demonstrate the first realization of complete OAM Bell-state measurement (OAM-BSM) in an individual subspace, by seeking the suitable unitary matrix performable using only linear optics and breaking the degeneracy of four OAM Bell states in ancillary polarization dimension. We further realize the superdense coding via our complete OAMBSM with the average success probability of ~82% and the channel capacity of ~1.1(4) bits. This work opens the window for increasing the channel capacity and extending the applications of OAM quantum states in quantum information in future.
  • Semantic segmentation is a fundamental research in remote sensing image processing. Because of the complex maritime environment, the sea-land segmentation is a challenging task. Although the neural network has achieved excellent performance in semantic segmentation in the last years, there are a few of works using CNN for sea-land segmentation and the results could be further improved. This paper proposes a novel deep convolution neural network named DeepUNet. Like the U-Net, its structure has a contracting path and an expansive path to get high resolution output. But differently, the DeepUNet uses DownBlocks instead of convolution layers in the contracting path and uses UpBlock in the expansive path. The two novel blocks bring two new connections that are U-connection and Plus connection. They are promoted to get more precise segmentation results. To verify our network architecture, we made a new challenging sea-land dataset and compare the DeepUNet on it with the SegNet and the U-Net. Experimental results show that DeepUNet achieved good performance compared with other architectures, especially in high-resolution remote sensing imagery.
  • The commutator direct inversion of the iterative subspace (commutator DIIS or C-DIIS) method developed by Pulay is an efficient and the most widely used scheme in quantum chemistry to accelerate the convergence of self consistent field (SCF) iterations in Hartree-Fock theory and Kohn-Sham density functional theory. The C-DIIS method requires the explicit storage of the density matrix, the Fock matrix and the commutator matrix. Hence the method can only be used for systems with a relatively small basis set, such as the Gaussian basis set. We develop a new method that enables the C-DIIS method to be efficiently employed in electronic structure calculations with a large basis set such as planewaves for the first time. The key ingredient is the projection of both the density matrix and the commutator matrix to an auxiliary matrix called the gauge-fixing matrix. The resulting projected commutator-DIIS method (PC-DIIS) only operates on matrices of the same dimension as the that consists of Kohn-Sham orbitals. The cost of the method is comparable to that of standard charge mixing schemes used in large basis set calculations. The PC-DIIS method is gauge-invariant, which guarantees that its performance is invariant with respect to any unitary transformation of the Kohn-Sham orbitals. We demonstrate that the PC-DIIS method can be viewed as an extension of an iterative eigensolver for nonlinear problems. We use the PC-DIIS method for accelerating Kohn-Sham density functional theory calculations with hybrid exchange-correlation functionals, and demonstrate its superior performance compared to the commonly used nested two-level SCF iteration procedure.
  • We present a new efficient way to perform hybrid density functional theory (DFT) based electronic structure calculation. The new method uses an interpolative separable density fitting (ISDF) procedure to construct a set of numerical auxiliary basis vectors and a compact approximation of the matrix consisting of products of occupied orbitals represented in a large basis set such as the planewave basis. Such an approximation allows us to reduce the number of Poisson solves from $\Or(N_{e}^2)$ to $\Or(N_{e})$ when we apply the exchange operator to occupied orbitals in an iterative method for solving the Kohn-Sham equations, where $N_{e}$ is the number of electrons in the system to be studied. We show that the ISDF procedure can be carried out in $\Or(N_{e}^3)$ operations, with a much smaller pre-constant compared to methods used in existing approaches. When combined with the recently developed adaptively compressed exchange (ACE) operator formalism, which reduces the number of times the exchange operator needs to be updated, the resulting ACE-ISDF method significantly reduces the computational cost \REV{associated with the exchange operator} by nearly two orders of magnitude compared to existing approaches for a large silicon system with $1000$ atoms. We demonstrate that the ACE-ISDF method can produce accurate energies and forces for insulating and metallic systems, and that it is possible to obtain converged hybrid functional calculation results for a 1000-atom bulk silicon within 10 minutes on 2000 computational cores. We also show that ACE-ISDF can scale to 8192 computational cores for a 4096-atom bulk silicon system. We use the ACE-ISDF method to geometrically optimize a 1000-atom silicon system with a vacancy defect using the HSE06 functional and computes its electronic structure.
  • Let $A$ be a unital associative algebra over a field $F$ and $V$ be a unital left $A$-module. The module $V$ is called zero action determined if every bilinear map $f: A\times V\rightarrow F$ with the property that $f(a,m)=0$ whenever $am=0$ is of the form $f(x,v)=\Phi(xv)$ for some linear map $\Phi: V\rightarrow F$. In this paper, we classify the finite dimensional irreducible and principal projective zero action determined modules of $A$. As an application, two classes of zero product determined algebras are shown: some semiperfect algebras (infinite dimensional in general); quasi-hereditary cellular algebras.
  • Electronic medical records contain multi-format electronic medical data that consist of an abundance of medical knowledge. Facing with patient's symptoms, experienced caregivers make right medical decisions based on their professional knowledge that accurately grasps relationships between symptoms, diagnosis and corresponding treatments. In this paper, we aim to capture these relationships by constructing a large and high-quality heterogenous graph linking patients, diseases, and drugs (PDD) in EMRs. Specifically, we propose a novel framework to extract important medical entities from MIMIC-III (Medical Information Mart for Intensive Care III) and automatically link them with the existing biomedical knowledge graphs, including ICD-9 ontology and DrugBank. The PDD graph presented in this paper is accessible on the Web via the SPARQL endpoint, and provides a pathway for medical discovery and applications, such as effective treatment recommendations.
  • FFT (fast Fourier transform) plays a very important role in many fields, such as digital signal processing, digital image processing and so on. However, in application, FFT becomes a factor of affecting the processing efficiency, especially in remote sensing, which large amounts of data need to be processed with FFT. So shortening the FFT computation time is particularly important. GPU (Graphics Processing Unit) has been used in many common areas and its acceleration effect is very obvious compared with CPU (Central Processing Unit) platform. In this paper, we present a new parallel method to execute FFT on GPU. Based on GPU storage system and hardware processing pipeline, we improve the way of data storage. We divided the data into parts reasonably according the size of data to make full use of the characteristics of the GPU. We propose the memory optimized method based on share memory and texture memory to reduce the number of global memory access to achieve better efficiency. The results show that the GPU-based memory optimized FFT implementation not only can increase over 100% than FFTW library in CPU platform, but also can improve over 30% than CUFFT library in GPU platform.
  • The superconducting film of (Li1-xFex)OHFeSe is reported for the first time. The thin film exhibits a small in-plane crystal mosaic of 0.22 deg, in terms of the FWHM (full-width-at-half-maximum) of x-ray rocking curve, and an excellent out-of-plane orientation by x-ray phi-scan. Its bulk superconducting transition temperature (Tc) of 42.4 K is characterized by both zero electrical resistance and diamagnetization measurements. The upper critical field (Hc2) is estimated to be 79.5 T and 443 T, respectively, for the magnetic field perpendicular and parallel to the ab plane. Moreover, a large critical current density (Jc) of a value over 0.5 MA/cm2 is achieved at ~20 K. Such a (Li1-xFex)OHFeSe film is therefore not only important to the fundamental research for understanding the high-Tc mechanism, but also promising in the field of high-Tc superconductivity application, especially in high-performance electronic devices and large scientific facilities such as superconducting accelerator.
  • The evolution from superconducting LiTi2O4-delta to insulating Li4Ti5O12 thin films has been studied by precisely adjusting the oxygen pressure during the sample fabrication process. In the superconducting LiTi2O4-delta films, with the increase of oxygen pressure, the oxygen vacancies are filled, and the c-axis lattice constant decreases gradually. With the increase of the oxygen pressure to a certain critical value, the c-axis lattice constant becomes stable, which implies that the Li4Ti5O12 phase comes into being. The process of oxygen filling is manifested by the angular bright-field images of the scanning transmission electron microscopy techniques. The temperature of magnetoresistance changed from positive and negative shows a non-monotonous behavior with the increase of oxygen pressure. The theoretical explanation of the oxygen effects on the structure and superconductivity of LiTi2O4-delta has also been discussed in this work.
  • Derived equivalences for Artin algebras (and almost $\nu$-stable derived equivalences for finite-dimensional algebras) are constructed from Milnor squares of algebras. Particularly, three operations of gluing vertices, unifying arrows and identifying socle elements on derived equivalent algebras are presented to produce new derived equivalences of the resulting algebras from the given ones. As a byproduct, we construct a series of derived equivalences, showing that derived equivalences may change Frobenius type of algebras in general, though both tilting procedure and almost $\nu$-stable derived equivalences do preserve Frobenius type of algebras.
  • We seek to accelerate and increase the size of simulations for fluid-structure interactions (FSI) by using multiple resolutions in the spatial discretization of the equations governing the time evolution of systems displaying two-way fluid-solid coupling. To this end, we propose a multi-resolution smoothed particle hydrodynamics (SPH) approach in which subdomains of different resolutions are directly coupled without any overlap region. The second-order consistent discretization of spatial differential operators is employed to ensure the accuracy of the proposed method. As SPH particles advect with the flow, a dynamic SPH particle refinement/coarsening is employed via splitting/merging to maintain a predefined multi-resolution configuration. Particle regularity is enforced via a particle-shifting technique to ensure accuracy and stability of the Lagrangian particle-based method embraced. The convergence, accuracy, and efficiency attributes of the new method are assessed by simulating four different flows. In this process, the numerical results are compared to the analytical, finite element, and consistent SPH single-resolution solutions. We anticipate that the proposed multi-resolution method will enlarge the class of SPH-tractable FSI applications.