• This paper presents a cost-sensitive active Question-Answering (QA) framework for learning a nine-layer And-Or graph (AOG) from web images. The AOG explicitly represents object categories, poses/viewpoints, parts, and detailed structures within the parts in a compositional hierarchy. The QA framework is designed to minimize an overall risk, which trades off the loss and query costs. The loss is defined for nodes in all layers of the AOG, including the generative loss (measuring the likelihood of the images) and the discriminative loss (measuring the fitness to human answers). The cost comprises both the human labor of answering questions and the computational cost of model learning. The cost-sensitive QA framework iteratively selects different storylines of questions to update different nodes in the AOG. Experiments showed that our method required much less human supervision (e.g., labeling parts on 3--10 training objects for each category) and achieved better performance than baseline methods.
  • Conditional Generative Adversarial Networks (GANs) for cross-domain image-to-image translation have made much progress recently. Depending on the task complexity, thousands to millions of labeled image pairs are needed to train a conditional GAN. However, human labeling is expensive, even impractical, and large quantities of data may not always be available. Inspired by dual learning from natural language translation, we develop a novel dual-GAN mechanism, which enables image translators to be trained from two sets of unlabeled images from two domains. In our architecture, the primal GAN learns to translate images from domain U to those in domain V, while the dual GAN learns to invert the task. The closed loop made by the primal and dual tasks allows images from either domain to be translated and then reconstructed. Hence a loss function that accounts for the reconstruction error of images can be used to train the translators. Experiments on multiple image translation tasks with unlabeled data show considerable performance gain of DualGAN over a single GAN. For some tasks, DualGAN can even achieve comparable or slightly better results than conditional GAN trained on fully labeled data.
  • Developing a safe and efficient collision avoidance policy for multiple robots is challenging in the decentralized scenarios where each robot generate its paths without observing other robots' states and intents. While other distributed multi-robot collision avoidance systems exist, they often require extracting agent-level features to plan a local collision-free action, which can be computationally prohibitive and not robust. More importantly, in practice the performance of these methods are much lower than their centralized counterparts. We present a decentralized sensor-level collision avoidance policy for multi-robot systems, which directly maps raw sensor measurements to an agent's steering commands in terms of movement velocity. As a first step toward reducing the performance gap between decentralized and centralized methods, we present a multi-scenario multi-stage training framework to find an optimal policy which is trained over a large number of robots on rich, complex environments simultaneously using a policy gradient based reinforcement learning algorithm. We validate the learned sensor-level collision avoidance policy in a variety of simulated scenarios with thorough performance evaluations and show that the final learned policy is able to find time efficient, collision-free paths for a large-scale robot system. We also demonstrate that the learned policy can be well generalized to new scenarios that do not appear in the entire training period, including navigating a heterogeneous group of robots and a large-scale scenario with 100 robots. Videos are available at https://sites.google.com/view/drlmaca
  • We introduce P2P-NET, a general-purpose deep neural network which learns geometric transformations between point-based shape representations from two domains, e.g., meso-skeletons and surfaces, partial and complete scans, etc. The architecture of the P2P-NET is that of a bi-directional point displacement network, which transforms a source point set to a target point set with the same cardinality, and vice versa, by applying point-wise displacement vectors learned from data. P2P-NET is trained on paired shapes from the source and target domains, but without relying on point-to-point correspondences between the source and target point sets. The training loss combines two uni-directional geometric losses, each enforcing a shape-wise similarity between the predicted and the target point sets, and a cross-regularization term to encourage consistency between displacement vectors going in opposite directions. We develop and present several different applications enabled by our general-purpose bidirectional P2P-NET to highlight the effectiveness, versatility, and potential of our network in solving a variety of point-based shape transformation problems.
  • We examine Higgs boson production and decay in heavy-ion collisions at the LHC and future colliders. Owing to the long lifetime of the Higgs boson, its hadronic decays may experience little or no screening from the hot and dense quark-gluon plasma whereas jets from hard scattering processes and from decays of the electro-weak gauge bosons and the top-quark suffer significant energy loss. This distinction can lead to enhanced sensitivity in hadronic decay channels and thus, for example, to the Yukawa coupling of the Higgs boson to the bottom quark.
  • We present a semi-supervised co-analysis method for learning 3D shape styles from projected feature lines, achieving style patch localization with only weak supervision. Given a collection of 3D shapes spanning multiple object categories and styles, we perform style co-analysis over projected feature lines of each 3D shape and then backproject the learned style features onto the 3D shapes. Our core analysis pipeline starts with mid-level patch sampling and pre-selection of candidate style patches. Projective features are then encoded via patch convolution. Multi-view feature integration and style clustering are carried out under the framework of partially shared latent factor (PSLF) learning, a multi-view feature learning scheme. PSLF achieves effective multi-view feature fusion by distilling and exploiting consistent and complementary feature information from multiple views, while also selecting style patches from the candidates. Our style analysis approach supports both unsupervised and semi-supervised analysis. For the latter, our method accepts both user-specified shape labels and style-ranked triplets as clustering constraints.We demonstrate results from 3D shape style analysis and patch localization as well as improvements over state-of-the-art methods. We also present several applications enabled by our style analysis.
  • To a torus action on a complex vector space, Gelfand, Kapranov and Zelevinsky introduce a system of differential equations, called the GKZ hypergeometric system. Its solutions are GKZ hypergeometric functions. We study the p-adic counterpart of the GKZ hypergeometric system. In the language of dagger spaces introduced by Grosse-Kl\"onne, the p-adic GKZ hypergeometric complex is a twisted relative de Rham complex of meromorphic differential forms with logarithmic poles for an affinoid toric dagger space over the dagger unit polydisc. It is a complex of ${\mathcal O}^\dagger$-modules with integrable connections and with Frobenius structures defined on the dagger unit polydisc such that traces of Frobenius on fibers at Techm\"uller points define the hypergeometric function over the finite field introduced by Gelfand and Graev.
  • A problem not well understood in video hyperlinking is what qualifies a fragment as an anchor or target. Ideally, anchors provide good starting points for navigation, and targets supplement anchors with additional details while not distracting users with irrelevant, false and redundant information. The problem is not trivial for intertwining relationship between data characteristics and user expectation. Imagine that in a large dataset, there are clusters of fragments spreading over the feature space. The nature of each cluster can be described by its size (implying popularity) and structure (implying complexity). A principle way of hyperlinking can be carried out by picking centers of clusters as anchors and from there reach out to targets within or outside of clusters with consideration of neighborhood complexity. The question is which fragments should be selected either as anchors or targets, in one way to reflect the rich content of a dataset, and meanwhile to minimize the risk of frustrating user experience. This paper provides some insights to this question from the perspective of hubness and local intrinsic dimensionality, which are two statistical properties in assessing the popularity and complexity of data space. Based these properties, two novel algorithms are proposed for low-risk automatic selection of anchors and targets.
  • Two-dimensional (2D) transition-metal dichalcogenide (TMD) MX$_2$ (M = Mo, W; X= S, Se, Te) possess unique properties and novel applications. In this work, we perform first-principles calculations on the van der Waals (vdW) stacked MX$_2$ heterostructures to investigate their electronic, optical and transport properties systematically. We perform the so-called Anderson's rule to classify the heterostructures by providing the scheme of the construction of energy band diagrams for the heterostructure consisting of two semiconductor materials. For most of the MX$_2$ heterostructures, the conduction band maximum (CBM) and valence band minimum (VBM) reside in two separate semiconductors, forming type II band structure, thus the electron-holes pairs are spatially separated. We also find strong interlayer coupling at $\Gamma$ point after forming MX$_2$ heterostructures, even leading to the indirect band gap. While the band structure near $K$ point remain as the independent monolayer. The carrier mobilities of MX$_2$ heterostructures depend on three decisive factors, elastic modulus, effective mass and deformation potential constant, which are discussed and contrasted with those of monolayer MX$_2$, respectively.
  • We introduce BranchGAN, a novel training method that enables unconditioned generative adversarial networks (GANs) to learn image manifolds at multiple scales. What is unique about BranchGAN is that it is trained in multiple branches, progressively covering both the breadth and depth of the network, as resolutions of the training images increase to reveal finer-scale features. Specifically, each noise vector, as input to the generator network, is explicitly split into several sub-vectors, each corresponding to and trained to learn image representations at a particular scale. During training, we progressively "de-freeze" the sub-vectors, one at a time, as a new set of higher-resolution images is employed for training and more network layers are added. A consequence of such an explicit sub-vector designation is that we can directly manipulate and even combine latent (sub-vector) codes that are associated with specific feature scales. Experiments demonstrate the effectiveness of our training method in multi-scale, disentangled learning of image manifolds and synthesis, without any extra labels and without compromising quality of the synthesized high-resolution images. We further demonstrate two new applications enabled by BranchGAN.
  • To train an inference network jointly with a deep generative topic model, making it both scalable to big corpora and fast in out-of-sample prediction, we develop Weibull hybrid autoencoding inference (WHAI) for deep latent Dirichlet allocation, which infers posterior samples via a hybrid of stochastic-gradient MCMC and autoencoding variational Bayes. The generative network of WHAI has a hierarchy of gamma distributions, while the inference network of WHAI is a Weibull upward-downward variational autoencoder, which integrates a deterministic-upward deep neural network, and a stochastic-downward deep generative model based on a hierarchy of Weibull distributions. The Weibull distribution can be used to well approximate a gamma distribution with an analytic Kullback-Leibler divergence, and has a simple reparameterization via the uniform noise, which help efficiently compute the gradients of the evidence lower bound with respect to the parameters of the inference network. The effectiveness and efficiency of WHAI are illustrated with experiments on big corpora.
  • We present a theoretical proposal for a physical implementation of entanglement concentration and purification protocols for two-mode squeezed microwave photons in circuit quantum electrodynamics (QED). First, we give the description of the cross-Kerr effect induced between two resonators in circuit QED. Then we use the cross-Kerr media to design the effective quantum nondemolition (QND) measurement on microwave-photon number. By using the QND measurement, the parties in quantum communication can accomplish the entanglement concentration and purification of nonlocal two-mode squeezed microwave photons. We discuss the feasibility of our schemes by giving the detailed parameters which can be realized with current experimental technology. Our work can improve some practical applications in continuous-variable microwave-based quantum information processing.
  • Strongly correlated electronic materials such as the high-$T_c$ cuprates are expected to feature unconventional transport properties, where charge, spin and heat conduction are potentially independent probes of the dynamics. However, the measurement of spin transport in such materials is - in contrast to charge transport - highly challenging. Here we observe spin diffusion in a Mott insulator of ultracold fermionic atoms with single-atom resolution. The system realizes the Fermi-Hubbard model, believed to capture the essence of the cuprate phenomenology. We find that for strong interactions, spin diffusion is driven by super-exchange and strongly violates the quantum limit of charge diffusion. The technique developed in this work can be extended to finite doping, which can shed light on the complex interplay between spin and charge in the Hubbard model.
  • One of the commonly used approaches to modeling univariate extremes is the peaks-over-threshold (POT) method. The POT method models exceedances over a (sufficiently high/low) threshold as a generalized Pareto distribution (GPD). This method requires the selection of a threshold that might affect the estimates. Here we propose an alternative method, the "Log-Histospline (LHSpline)", to explore modeling the tail behavior and the remainder of the density in one step using the full range of the data. LHSpline applies a smoothing spline model to a finely binned histogram of the log transformed data to estimate its log density. By construction, a LHSpline estimation is constrained to have polynomial tail behavior, a feature commonly observed in geophysical observations. We illustrate LHSpline method by analyzing precipitation data collected in Houston, Texas.
  • The non-adiabatic holonomic quantum computation with the advantages of fast and robustness attracts widespread attention in recent years. Here, we propose the first scheme for realizing universal single-qubit gates based on an optomechanical system working with the non-adiabatic geometric phases. Our quantum gates are robust to the control errors and the parameter fluctuations, and have unique functions to achieve the quantum state transfer and entanglement generation between cavities. We discuss the corresponding experimental parameters and give some simulations. Our scheme may have the practical applications in quantum computation and quantum information processing.
  • We for the first time combine generated adversarial network (GAN) with wide-field light microscopy to achieve deep learning super-resolution under a large field of view (FOV). By appropriately adopting prior microscopy data in an adversarial training, the network can recover a high-resolution, accurate image of new specimen from its single low-resolution measurement. This capacity has been adequately demonstrated by imaging various types of samples, such as USAF resolution target, human pathological slides and fluorescence-labelled fibroblast cells. Their gigapixel, multi-color reconstructions verify a successful GAN-based single image super-resolution procedure. Furthermore, this deep learning-based imaging approach doesn;t necessarily introduce any change to the setup of a conventional wide-filed microscope, reconstructing large FOV (about 95 mm^2), high-resolution (about 1.7 {\mu}m) image at a high speed (in 1 second). As a result, GAN-microscopy opens a new way to computationally overcome the general challenge of high-throughput, high-resolution microscopy that is originally coupled to the physical limitation of system's optics.
  • Atomistic simulations are performed to probe the anisotropic deformation in the compressions of face-centred-cubic metallic nanoparticles. In the elastic regime, the compressive load-depth behaviors can be characterized by the classical Hertzian model or flat punch model, depending on the surface configuration beneath indenter. On the onset of plasticity, atomic-scale surface steps serve as the source of heterogeneous dislocation in nanoparticle, which is distinct from indenting bulk materials. Under [111] compression, the gliding of jogged dislocation takes over the dominant plastic deformation. The plasticity is governed by nucleation and exhaustion of extended dislocation ribbons in [110] compression. Twin boundary migration mainly sustain the plastic deformation under [112] compression. This study is helpful to extract the mechanical properties of metallic nanoparticles and understand their anisotropic deformation behaviors.
  • We have analyzed multi-passband photometric observations, obtained with the {\it Hubble Space Telescope}, of the massive ($1.8 \times 10^5 M_\odot$), intermediate-age (1.8 Gyr-old) Large Magellanic Cloud star cluster NGC 1783. The morphology of the cluster's red giant branch does not exhibit a clear broadening beyond its intrinsic width; the observed width is consistent with that owing to photometric uncertainties alone and independent of our photometric selection boundaries applied to obtain our sample of red-giant stars. The color dispersion of the cluster's red-giant stars around the best-fitting ridgeline is $0.062 \pm 0.009$ mag, which is equivalent to the width of $0.080 \pm 0.001$ mag derived from artificial simple stellar population tests, that is, tests based on single-age, single-metallicity stellar populations. NGC 1783 is comparably massive as other star clusters that show clear evidence of multiple stellar populations. After incorporating mass-loss recipes from its current age of 1.8 Gyr to an age of 6 Gyr, NGC 1783 is expected to remain as massive as some other clusters that host clear multiple populations at these intermediate ages. If we were to assume that mass is an important driver of multiple population formation, then NGC 1783 should have exhibited clear evidence of chemical abundance variations. However, our results support the absence of any chemical abundance variations in NGC 1783.
  • Majorana modes are zero-energy excitations of a topological superconductor that exhibit non-Abelian statistics. Following proposals for their detection in a semiconductor nanowire coupled to an s-wave superconductor, several tunneling experiments reported characteristic Majorana signatures. Reducing disorder has been a prime challenge for these experiments because disorder can mimic the zero-energy signatures of Majoranas, and renders the topological properties inaccessible. Here, we show characteristic Majorana signatures in InSb nanowire devices exhibiting clear ballistic transport properties. Application of a magnetic field and spatial control of carrier density using local gates generates a zero bias peak that is rigid over a large region in the parameter space of chemical potential, Zeeman energy, and tunnel barrier potential. The reduction of disorder allows us to resolve separate regions in the parameter space with and without a zero bias peak, indicating topologically distinct phases. These observations are consistent with the Majorana theory in a ballistic system, and exclude for the first time the known alternative explanations that invoke disorder or a nonuniform chemical potential.
  • Majorana zero-modes hold great promise for topological quantum computing. Tunnelling spectroscopy in electrical transport is the primary tool to identify the presence of Majorana zero-modes, for instance as a zero-bias peak (ZBP) in differential-conductance. The Majorana ZBP-height is predicted to be quantized at the universal conductance value of 2e2/h at zero temperature. Interestingly, this quantization is a direct consequence of the famous Majorana symmetry, 'particle equals antiparticle'. The Majorana symmetry protects the quantization against disorder, interactions, and variations in the tunnel coupling. Previous experiments, however, have shown ZBPs much smaller than 2e2/h, with a recent observation of a peak-height close to 2e2/h. Here, we report a quantized conductance plateau at 2e2/h in the zero-bias conductance measured in InSb semiconductor nanowires covered with an Al superconducting shell. Our ZBP-height remains constant despite changing parameters such as the magnetic field and tunnel coupling, i.e. a quantized conductance plateau. We distinguish this quantized Majorana peak from possible non-Majorana origins, by investigating its robustness on electric and magnetic fields as well as its temperature dependence. The observation of a quantized conductance plateau strongly supports the existence of non-Abelian Majorana zero-modes in the system, consequently paving the way for future braiding experiments.
  • Dynamic mode decomposition (DMD) gives a practical means of extracting dynamic information from data, in the form of spatial modes and their associated frequencies and growth/decay rates. DMD can be considered as a numerical approximation to the Koopman operator, an infinite-dimensional linear operator defined for (nonlinear) dynamical systems. This work proposes a new criterion to estimate the accuracy of DMD on a mode-by-mode basis, by estimating how closely each individual DMD eigenfunction approximates the corresponding Koopman eigenfunction. This approach does not require any prior knowledge of the system dynamics or the true Koopman spectral decomposition. The method may be applied to extensions of DMD (i.e., extended/kernel DMD), which are applicable to a wider range of problems. The accuracy criterion is first validated against the true error with a synthetic system for which the true Koopman spectral decomposition is known. We next demonstrate how this proposed accuracy criterion can be used to assess the performance of various choices of kernel when using the kernel method for extended DMD. Finally, we show that our proposed method successfully identifies modes of high accuracy when applying DMD to data from experiments in fluids, in particular particle image velocimetry of a cylinder wake and a canonical separated boundary layer.
  • Our task is to generate an effective summary for a given document with specific realtime requirements. We use the softplus function to enhance keyword rankings to favor important sentences, based on which we present a number of summarization algorithms using various keyword extraction and topic clustering methods. We show that our algorithms meet the realtime requirements and yield the best ROUGE recall scores on DUC-02 over all previously-known algorithms. We show that our algorithms meet the realtime requirements and yield the best ROUGE recall scores on DUC-02 over all previously-known algorithms. To evaluate the quality of summaries without human-generated benchmarks, we define a measure called WESM based on word-embedding using Word Mover's Distance. We show that the orderings of the ROUGE and WESM scores of our algorithms are highly comparable, suggesting that WESM may serve as a viable alternative for measuring the quality of a summary.
  • The transformation from evanescent waves to propagation waves is the key mechanism for the realization of some super-resolution imaging methods. By using the recursive Green function and scattering-matrix theory, we investigated in details on the transport of evanescent waves through a random medium and analyzed quantitatively the coupling of evanescent channels to propagation channels. By numerical calculations, we found that the transmission for the incident evanescent channel is determined by both the eigenvalues of the scattering matrix and the coupling strength to the corresponding propagation channels in random medium, and the disorder strength of the random medium influences both of them.
  • Microwave photons have become very important qubits in quantum communication as the first quantum satellite has been launched successfully. Therefore, it is a necessary and meaningful task for ensuring the high security and efficiency of microwave quantum communication in practice. Here, we present an original polarization entanglement purification protocol (EPP) for nonlocal microwave photons based on the cross-Kerr effect in circuit quantum electrodynamics (QED). Our protocol can solve the problem that the purity of maximally entangled states used for constructing quantum channel will decrease due to decoherence from environment noise. This task is accomplished by means of the polarization parity-check quantum nondemolition (QND) detector, the bit-flipping operation, and the linear microwave elements. The QND detector is composed of several cross-Kerr effect systems which can be realized by coupling two superconducting transmission line resonators to a superconducting molecule with the N-type level structure. Our calculation shows that the QND detector has a high fidelity with applicable experimental parameters in circuit QED, which means this EPP can succeed with a high fidelity and has good applications in long-distance quantum communication assisted by microwave photons in the future, such as satellite quantum communication.
  • Generative Adversarial Networks (GANs) have recently achieved significant improvement on paired/unpaired image-to-image translation, such as photo$\rightarrow$ sketch and artist painting style transfer. However, existing models can only be capable of transferring the low-level information (e.g. color or texture changes), but fail to edit high-level semantic meanings (e.g., geometric structure or content) of objects. On the other hand, while some researches can synthesize compelling real-world images given a class label or caption, they cannot condition on arbitrary shapes or structures, which largely limits their application scenarios and interpretive capability of model results. In this work, we focus on a more challenging semantic manipulation task, which aims to modify the semantic meaning of an object while preserving its own characteristics (e.g. viewpoints and shapes), such as cow$\rightarrow$sheep, motor$\rightarrow$ bicycle, cat$\rightarrow$dog. To tackle such large semantic changes, we introduce a contrasting GAN (contrast-GAN) with a novel adversarial contrasting objective. Instead of directly making the synthesized samples close to target data as previous GANs did, our adversarial contrasting objective optimizes over the distance comparisons between samples, that is, enforcing the manipulated data be semantically closer to the real data with target category than the input data. Equipped with the new contrasting objective, a novel mask-conditional contrast-GAN architecture is proposed to enable disentangle image background with object semantic changes. Experiments on several semantic manipulation tasks on ImageNet and MSCOCO dataset show considerable performance gain by our contrast-GAN over other conditional GANs. Quantitative results further demonstrate the superiority of our model on generating manipulated results with high visual fidelity and reasonable object semantics.