• We consider a stochastic energy exchange model that models the 1D microscopic heat conduction in the nonequilibrium setting. In this paper, we prove the existence and uniqueness of the nonequilibrium steady state (NESS) and, furthermore, the polynomial speed of convergence to the NESS. Our result shows that the asymptotic properties of this model and its deterministic dynamical system origin are consistent. The proof uses a new technique called the induced chain method. We partition the state space and work on both the Markov chain induced by an "active set" and the tail of return time to this "active set".
  • We demonstrate a data-driven method to solve the invariant probability density function of a randomly perturbed dynamical system. The key idea is to replace the boundary condition of numerical schemes by a least square problem corresponding to a reference solution, which is generated by Monte Carlo simulation. With this method we can solve the invariant probability density function in any local area with high accuracy, regardless whether the attractor is covered by the domain.
  • This paper studies a billiards-like microscopic heat conduction model, which describes the dynamics of gas molecules in a long tube with thermalized boundary. We numerically investigate the law of energy exchange between adjacent cells. A stochastic energy exchange model that preserves these properties is then derived. We further numerically justified that the stochastic energy exchange model preserves the ergodicity and the thermal conductivity of its deterministic counterpart.
  • Good tools can bring mechanical verification to programs written in mainstream functional languages. We use hs-to-coq to translate significant portions of Haskell's containers library into Coq, and verify it against specifications that we derive from a variety of sources including type class laws, the library's test suite, and interfaces from Coq's standard library. Our work shows that it is feasible to verify mature, widely-used, highly optimized, and unmodified Haskell code. We also learn more about the theory of weight-balanced trees, extend hs-to-coq to handle partiality, and -- since we found no bugs -- attest to the superb quality of well-tested functional code.
  • Halide perovskites are promising semiconductors for optoelectronics, yet thin films show substantial microscale heterogeneity. Understanding the origins of these variations is essential for mitigating parasitic losses such as non-radiative decay. Here, we probe the structural and chemical origins of the heterogeneity by utilizing scanning X-ray diffraction beamlines at two different synchrotrons combined with high-resolution transmission electron microscopy to spatially characterize the crystallographic properties of individual micrometer-sized perovskite grains in high-quality films. We reveal new levels of heterogeneity on the ten-micrometer scale (super-grains) and even ten-nanometer scale (sub-grain domains). By directly correlating these properties with their corresponding local time-resolved photoluminescence properties, we find that regions showing the greatest luminescence losses correspond to strained regions, which arise from enhanced defect concentrations. Our work reveals remarkably complex heterogeneity across multiple length scales, shedding new light on the defect tolerance of perovskites.
  • Self-Interacting Dark Matter (SIDM) is a leading candidate to solve the puzzles of the cold dark matter paradigm on galactic scales. We present a particle-physics study on SIDM models in PandaX-II, a direct detection experiment in China JinPing underground Laboratory. We use data collected in 2016 and 2017 runs, corresponding to a total exposure of 54 ton day, the largest published data set of its kind to date. Strong combined limits are set on the mass of the dark-force mediator, its mixing with the standard model particles, and the mass of dark matter. Together with considerations from the Big-Bang Nucleosynthesis, our results put tight constraints on SIDM models.
  • Statistical inference in high dimensional settings has recently attracted enormous attention within the literature. However, most published work focuses on the parametric linear regression problem. This paper considers an important extension of this problem: statistical inference for high dimensional sparse nonparametric additive models. To be more precise, this paper develops a methodology for constructing a probability density function on the set of all candidate models. This methodology can also be applied to construct confidence intervals for various quantities of interest (such as noise variance) and confidence bands for the additive functions. This methodology is derived using a generalized fiducial inference framework. It is shown that results produced by the proposed methodology enjoy correct asymptotic frequentist properties. Empirical results obtained from numerical experimentation verify this theoretical claim. Lastly, the methodology is applied to a gene expression data set and discovers new findings for which most existing methods based on parametric linear modeling failed to observe.
  • We report a new search of weakly interacting massive particles (WIMPs) using the combined low background data sets in 2016 and 2017 from the PandaX-II experiment in China. The latest data set contains a new exposure of 77.1 live day, with the background reduced to a level of 0.8$\times10^{-3}$ evt/kg/day, improved by a factor of 2.5 in comparison to the previous run in 2016. No excess events were found above the expected background. With a total exposure of 5.4$\times10^4$ kg day, the most stringent upper limit on spin-independent WIMP-nucleon cross section was set for a WIMP with mass larger than 100 GeV/c$^2$, with the lowest exclusion at 8.6$\times10^{-47}$ cm$^2$ at 40 GeV/c$^2$.
  • Recommendation for e-commerce with a mix of durable and nondurable goods has characteristics that distinguish it from the well-studied media recommendation problem. The demand for items is a combined effect of form utility and time utility, i.e., a product must both be intrinsically appealing to a consumer and the time must be right for purchase. In particular for durable goods, time utility is a function of inter-purchase duration within product category because consumers are unlikely to purchase two items in the same category in close temporal succession. Moreover, purchase data, in contrast to ratings data, is implicit with non-purchases not necessarily indicating dislike. Together, these issues give rise to the positive-unlabeled demand-aware recommendation problem that we pose via joint low-rank tensor completion and product category inter-purchase duration vector estimation. We further relax this problem and propose a highly scalable alternating minimization approach with which we can solve problems with millions of users and items. We also show superior prediction accuracies on multiple real-world data sets.
  • Reusable model design becomes desirable with the rapid expansion of machine learning applications. In this paper, we focus on the reusability of pre-trained deep convolutional models. Specifically, different from treating pre-trained models as feature extractors, we reveal more treasures beneath convolutional layers, i.e., the convolutional activations could act as a detector for the common object in the image co-localization problem. We propose a simple but effective method, named Deep Descriptor Transforming (DDT), for evaluating the correlations of descriptors and then obtaining the category-consistent regions, which can accurately locate the common object in a set of images. Empirical studies validate the effectiveness of the proposed DDT method. On benchmark image co-localization datasets, DDT consistently outperforms existing state-of-the-art methods by a large margin. Moreover, DDT also demonstrates good generalization ability for unseen categories and robustness for dealing with noisy data.
  • According to the classical theory of elasticity, a plate subjected to a bending moment always deflects with symmetric tensile and compressive strains in its two sides, without overall deformation perpendicular to the bending moment. Here, we find by ab initio simulations that significant overall tensile strain can be induced by pure bending in a wide range of two-dimensional crystals perpendicular to the bending moment, just like an accordion being bent to open. This accordion effect is raised by asymmetric response of chemical bonds and electron density to the bending curvature, with the tensile strain being a power function of the curvature.
  • We provide a hybrid method that captures the polynomial speed of convergence and polynomial speed of mixing for Markov processes. The hybrid method that we introduce is based on the coupling technique and renewal theory. We propose to replace some estimates in classical results about the ergodicity of Markov processes by numerical simulations when the corresponding analytical proof is difficult. After that, all remaining conclusions can be derived from rigorous analysis. Then we apply our results to two 1D microscopic heat conduction models. The mixing rate of these two models are expected to be polynomial but very difficult to prove. In both examples, our numerical results match the expected polynomial mixing rate well.
  • Large-scale datasets have driven the rapid development of deep neural networks for visual recognition. However, annotating a massive dataset is expensive and time-consuming. Web images and their labels are, in comparison, much easier to obtain, but direct training on such automatically harvested images can lead to unsatisfactory performance, because the noisy labels of Web images adversely affect the learned recognition models. To address this drawback we propose an end-to-end weakly-supervised deep learning framework which is robust to the label noise in Web images. The proposed framework relies on two unified strategies -- random grouping and attention -- to effectively reduce the negative impact of noisy web image annotations. Specifically, random grouping stacks multiple images into a single training instance and thus increases the labeling accuracy at the instance level. Attention, on the other hand, suppresses the noisy signals from both incorrectly labeled images and less discriminative image regions. By conducting intensive experiments on two challenging datasets, including a newly collected fine-grained dataset with Web images of different car models, the superior performance of the proposed methods over competitive baselines is clearly demonstrated.
  • Recognizing the identities of people in everyday photos is still a very challenging problem for machine vision, due to non-frontal faces, changes in clothing, location, lighting and similar. Recent studies have shown that rich relational information between people in the same photo can help in recognizing their identities. In this work, we propose to model the relational information between people as a sequence prediction task. At the core of our work is a novel recurrent network architecture, in which relational information between instances' labels and appearance are modeled jointly. In addition to relational cues, scene context is incorporated in our sequence prediction model with no additional cost. In this sense, our approach is a unified framework for modeling both contextual cues and visual appearance of person instances. Our model is trained end-to-end with a sequence of annotated instances in a photo as inputs, and a sequence of corresponding labels as targets. We demonstrate that this simple but elegant formulation achieves state-of-the-art performance on the newly released People In Photo Albums (PIPA) dataset.
  • Since the first successful synthesis of graphene just over a decade ago, a variety of two-dimensional (2D) materials (e.g., transition metal-dichalcogenides, hexagonal boron-nitride, etc.) have been discovered. Among the many unique and attractive properties of 2D materials, mechanical properties play important roles in manufacturing, integration and performance for their potential applications. Mechanics is indispensable in the study of mechanical properties, both experimentally and theoretically. The coupling between the mechanical and other physical properties (thermal, electronic, optical) is also of great interest in exploring novel applications, where mechanics has to be combined with condensed matter physics to establish a scalable theoretical framework. Moreover, mechanical interactions between 2D materials and various substrate materials are essential for integrated device applications of 2D materials, for which the mechanics of interfaces (adhesion and friction) has to be developed for the 2D materials. Here we review recent theoretical and experimental works related to mechanics and mechanical properties of 2D materials. While graphene is the most studied 2D material to date, we expect continual growth of interest in the mechanics of other 2D materials beyond graphene.
  • Given a set of images containing objects from the same category, the task of image co-localization is to identify and localize each instance. This paper shows that this problem can be solved by a simple but intriguing idea, that is, a common object detector can be learnt by making its detection confidence scores distributed like those of a strongly supervised detector. More specifically, we observe that given a set of object proposals extracted from an image that contains the object of interest, an accurate strongly supervised object detector should give high scores to only a small minority of proposals, and low scores to most of them. Thus, we devise an entropy-based objective function to enforce the above property when learning the common object detector. Once the detector is learnt, we resort to a segmentation approach to refine the localization. We show that despite its simplicity, our approach outperforms state-of-the-art methods.
  • This paper is Part I of a two-part series devoting to the study of systematic measures in a complex biological network modeled by a system of ordinary differential equations. As the mathematical complement to our previous work [31] with collaborators, the series aims at establishing a mathematical foundation for characterizing three important systematic measures: degeneracy, complexity and robustness, in such a biological network and studying connections among them. To do so, we consider in Part I stationary measures of a Fokker-Planck equation generated from small white noise perturbations of a dissipative system of ordinary differential equations. Some estimations of concentration of stationary measures of the Fokker-Planck equation in the vicinity of the global attractor are presented. Relationship between differential entropy of stationary measures and dimension of the global attractor is also given.
  • This paper is Part II of a two-part series devoting to the study of systematic measures in a complex bio-network modeled by a system of ordinary differential equations. In this part, we quantify several systematic measures of a biological network including degeneracy, complexity and robustness. We will apply the theory of stochastic differential equations to define degeneracy and complexity for a bio-network. Robustness of the network will be defined according to the strength of attractions to the global attractor. Based on the study of stationary probability measures and entropy made in Part I of the series, we will investigate some fundamental properties of these systematic measures, in particular the connections between degeneracy, complexity and robustness.
  • The purpose of mid-level visual element discovery is to find clusters of image patches that are both representative and discriminative. Here we study this problem from the prospective of pattern mining while relying on the recently popularized Convolutional Neural Networks (CNNs). We observe that a fully-connected CNN activation extracted from an image patch typically possesses two appealing properties that enable its seamless integration with pattern mining techniques. The marriage between CNN activations and association rule mining, a well-known pattern mining technique in the literature, leads to fast and effective discovery of representative and discriminative patterns from a huge number of image patches. When we retrieve and visualize image patches with the same pattern, surprisingly, they are not only visually similar but also semantically consistent, and thus give rise to a mid-level visual element in our work. Given the patterns and retrieved mid-level visual elements, we propose two methods to generate image feature representations for each. The first method is to use the patterns as codewords in a dictionary, similar to the Bag-of-Visual-Words model, we compute a Bag-of-Patterns representation. The second one relies on the retrieved mid-level visual elements to construct a Bag-of-Elements representation. We evaluate the two encoding methods on scene and object classification tasks, and demonstrate that our approach outperforms or matches recent works using CNN activations for these tasks.
  • The rest-frame UV-optical (i.e., NUV-B) color index is sensitive to the low-level recent star formation and dust extinction, but it is insensitive to the metallicity. In this Letter, we have measured the rest-frame NUV-B color gradients in ~1400 large ($\rm r_e>0.18^{\prime\prime}$), nearly face-on (b/a>0.5) main-sequence star-forming galaxies (SFGs) between redshift 0.5 and 1.5 in the CANDELS/GOODS-S and UDS fields. With this sample, we study the origin of UV-optical color gradients in the SFGs at z~1 and discuss their link with the buildup of stellar mass. We find that the more massive, centrally compact, and more dust extinguished SFGs tend to have statistically more negative raw color gradients (redder centers) than the less massive, centrally diffuse, and less dusty SFGs. After correcting for dust reddening based on optical-SED fitting, the color gradients in the low-mass ($M_{\ast} <10^{10}M_{\odot}$) SFGs generally become quite flat, while most of the high-mass ($M_{\ast} > 10^{10.5}M_{\odot}$) SFGs still retain shallow negative color gradients. These findings imply that dust reddening is likely the principal cause of negative color gradients in the low-mass SFGs, while both increased central dust reddening and buildup of compact old bulges are likely the origins of negative color gradients in the high-mass SFGs. These findings also imply that at these redshifts the low-mass SFGs buildup their stellar masses in a self-similar way, while the high-mass SFGs grow inside out.
  • While full-duplex (FD) transmission has the potential to double the system capacity, its substantial benefit can be offset by the self-interference (SI) and non-ideality of practical transceivers. In this paper, we investigate the achievable sum rates (ASRs) of half-duplex (HD) and FD transmissions with orthogonal frequency division multiplexing (OFDM), where the non-ideality is taken into consideration. Four transmission strategies are considered, namely HD with uniform power allocation (UPA), HD with non-UPA (NUPA), FD with UPA, and FD with NUPA. For each of the four transmission strategies, an optimization problem is formulated to maximize its ASR, and a (suboptimal/optimal) solution with low complexity is accordingly derived. Performance evaluations and comparisons are conducted for three typical channels, namely symmetric frequency-flat/selective and asymmetric frequency-selective channels. Results show that the proposed solutions for both HD and FD transmissions can achieve near optimal performances. For FD transmissions, the optimal solution can be obtained under typical conditions. In addition, several observations are made on the ASR performances of HD and FD transmissions.
  • Dynamic control of conductivity and optical properties via atomic structure changes is of tremendous technological importance in information storage. Energy consumption considerations provide a driving force toward employing thin materials in devices. Monolayer transition metal dichalcogenides are nearly atomically-thin materials that can exist in multiple crystal structures, each with distinct electrical properties. Using density functional approaches, we discover that electrostatic gating device configurations have the potential to drive structural semiconductor-to-semimetal phase transitions in some monolayer transition metal dichalcogenides. For the first time, we show that the dynamical control of this phase transition can be achieved in carefully designed electronic devices. We discover that the semiconductor-to-semimetal phase transition in monolayer MoTe2 can be driven by a gate voltage of several Volts with appropriate choice of dielectric. Structural transitions in monolayer TaSe2 are predicted to occur under similar conditions. While the required field magnitudes are large for these two materials, we find that the gate voltage for the transition can be reduced arbitrarily by alloying, e.g. for MoxW1-xTe2 monolayers. We have developed a method for computing phase diagrams of monolayer materials with respect to charge and voltage, validated by comparing to direct calculations and experimental measurements. Our findings identify a new physical mechanism, not existing in bulk materials, to dynamically control structural phase transitions in two-dimensional materials, enabling potential applications in phase-change electronic devices.
  • We consider a stochastic particle system in which a finite number of particles interact with one another via a common energy tank. Interaction rate for each particle is proportional to the square root of its kinetic energy, as is consistent with analogous mechanical models. Our main result is that the rate of convergence to equilibrium for such a system is $\sim t^{-2}$, more precisely it is faster than a constant times $t^{-2+\varepsilon}$ for any $\varepsilon>0$. A discussion of exponential vs polynomial convergence for similar particle systems is included.
  • A new method of the stochastic simulation algorithm (SSA), named the Hashing-Leaping method (HLM), for exact simulations of a class of Markov jump processes, is presented in this paper. The HLM has a conditional constant computational cost per event, which is independent of the number of exponential clocks in the Markov process. The main idea of the HLM is to repeatedly implement a hash-table-like bucket sort algorithm for all times of occurrence covered by a time step with length $\tau$. This paper serves as an introduction to this new SSA method. We introduce the method, demonstrate its implementation, analyze its properties, and compare its performance with three other commonly used SSA methods in four examples. Our performance tests and CPU operation statistics show certain advantage of the HLM for large scale problems.
  • Graphene has emerged as a promising material for photonic applications fuelled by its superior electronic and optical properties. However, the photoresponsivity is limited by the low absorption cross section and ultrafast recombination rates of photoexcited carriers. Here we demonstrate a photoconductive gain of $\sim$ 10$^5$ electrons per photon in a carbon nanotube-graphene one dimensional-two dimensional hybrid due to efficient photocarriers generation and transport within the nanostructure. A broadband photodetector (covering 400 nm to 1550 nm) based on such hybrid films is fabricated with a high photoresponsivity of more than 100 AW$^{-1}$ and a fast response time of approximately 100 {\mu}s. The combination of ultra-broad bandwidth, high responsivities and fast operating speeds affords new opportunities for facile and scalable fabrication of all-carbon optoelectronic devices.