• Feature selection is a standard approach to understanding and modeling high-dimensional classification data, but the corresponding statistical methods hinge on tuning parameters that are difficult to calibrate. In particular, existing calibration schemes in the logistic regression framework lack any finite sample guarantees. In this paper, we introduce a novel calibration scheme for $\ell_1$-penalized logistic regression. It is based on simple tests along the tuning parameter path and is equipped with optimal guarantees for feature selection. It is also amenable to easy and efficient implementations, and it rivals or outmatches existing methods in simulations and real data applications.
  • Fine control of the dynamics of a quantum system is the key element in quantum information processing and in coherent manipulation of atomic and molecular systems. In this paper we propose a control protocol using a tangent-pulse driven model and demonstrate that it is a desirable design, i.e., both fast and accurate for population transfer. As opposed to other existing strategies, a remarkable feature of the present scheme is that the high velocity of the nonadiabatic evolution itself not only does not lead to unwanted transitions but can also suppress the error caused by the truncation of the driving pulse.
  • The notion of Scott distance between points and subsets in a metric space, a metric analogue of the Scott topology on an ordered set, is introduced, making a metric space into an approach space. Basic properties of Scott distance are investigated, including its topological coreflection and its relation to injective $T_0$ approach spaces. It is proved that the topological coreflection of the Scott distance is sandwiched between the $d$-Scott topology and the generalized Scott topology, and that every injective $T_0$ approach space is a cocomplete and continuous metric space equipped with its Scott distance.
  • As more and more academic papers are being submitted to conferences and journals, having professionals evaluate all of them is time-consuming and can cause inequality due to the personal factors of the reviewers. In this paper, in order to assist professionals in evaluating academic papers, we propose a novel task: automatic academic paper rating (AAPR), which automatically determines whether to accept academic papers. We build a new dataset for this task and propose a novel modularized hierarchical convolutional neural network to achieve automatic academic paper rating. Evaluation results show that the proposed model outperforms the baselines by a large margin. The dataset and code are available at \url{https://github.com/lancopku/AAPR}.
  • Let $T_{n}$ be an arc-colored tournament of order $n$. The maximum monochromatic indegree $\Delta^{-mon}(T_{n})$ (resp. outdegree $\Delta^{+mon}(T_{n})$) of $T_{n}$ is the maximum number of in-arcs (resp. out-arcs) of the same color incident to a vertex of $T_{n}$. The irregularity $i(T_{n})$ of $T_{n}$ is the maximum difference between the indegree and outdegree of a vertex of $T_{n}$. A subdigraph $H$ of an arc-colored digraph $D$ is called rainbow if every pair of arcs in $H$ has distinct colors. In this paper, we show that each vertex $v$ in an arc-colored tournament $T_{n}$ with $\Delta^{-mon}(T_n)\leq\Delta^{+mon}(T_n)$ is contained in at least $\frac{\delta(v)(n-\delta(v)-i(T_n))}{2}-[\Delta^{-mon}(T_{n})(n-1)+\Delta^{+mon}(T_{n})d^+(v)]$ rainbow triangles, where $\delta(v)=\min\{d^+(v), d^-(v)\}$. We also give some maximum monochromatic degree conditions for $T_{n}$ to contain rainbow triangles, and to contain rainbow triangles passing through a given vertex. Finally, we present some examples showing that some of the conditions in our results are best possible. Keywords: arc-colored tournament, rainbow triangle, maximum monochromatic indegree (outdegree), irregularity
  • Traditional intelligent fault diagnosis methods for rolling bearings work well only under the common assumption that the labeled training data (source domain) and unlabeled testing data (target domain) are drawn from the same distribution. However, in many real-world applications, this assumption does not hold, especially when the working condition varies. In this paper, a new adversarial adaptive 1-D CNN called A2CNN is proposed to address this problem. A2CNN consists of four parts, namely, a source feature extractor, a target feature extractor, a label classifier and a domain discriminator. The layers between the source and target feature extractors are partially untied during the training stage to take both training efficiency and domain adaptation into consideration. Experiments show that A2CNN has strong fault-discriminative and domain-invariant capacity, and can therefore achieve high accuracy under different working conditions. We also visualize the learned features and the networks to explore the reasons behind the high performance of our proposed model.
  • Ordinary least squares provides the optimal linear approximation to the true regression function under misspecification. This paper investigates the Instrumental Variables (IV) version of this problem. The resulting population parameter is called the Optimal Linear IV Approximation (OLIVA). This paper shows that a necessary condition for regular identification of the OLIVA is also sufficient for existence of an IV estimand in a linear IV model. The necessary condition holds for the important case of a binary endogenous treatment, leading also to a LATE interpretation with positive weights. The instrument in the IV estimand is unknown and is estimated in a first step. A Two-Step IV (TSIV) estimator is proposed. We establish the asymptotic normality of a debiased TSIV estimator based on locally robust moments. The TSIV estimator requires neither completeness nor identification of the instrument. As a by-product of our analysis, we robustify the classical Hausman test for exogeneity against misspecification of the linear model. Monte Carlo simulations suggest excellent finite sample performance for the proposed inferences.
  • Spatiotemporal feature learning in videos is a fundamental problem in computer vision. This paper presents a new architecture, termed Appearance-and-Relation Network (ARTNet), to learn video representation in an end-to-end manner. ARTNets are constructed by stacking multiple generic building blocks, called SMART, whose goal is to simultaneously model appearance and relation from RGB input in a separate and explicit manner. Specifically, SMART blocks decouple the spatiotemporal learning module into an appearance branch for spatial modeling and a relation branch for temporal modeling. The appearance branch is implemented based on the linear combination of pixels or filter responses in each frame, while the relation branch is designed based on the multiplicative interactions between pixels or filter responses across multiple frames. We perform experiments on three action recognition benchmarks: Kinetics, UCF101, and HMDB51, demonstrating that SMART blocks obtain an evident improvement over 3D convolutions for spatiotemporal feature learning. Under the same training setting, ARTNets outperform the existing state-of-the-art methods on these three datasets.
  • For the task of subdecimeter aerial imagery segmentation, fine-grained semantic segmentation results are usually difficult to obtain because of complex remote sensing contents and optical conditions. In addition, remote sensing imagery has the inherent limitation of imbalanced class distribution. Recently, convolutional neural networks (CNNs) have shown outstanding performance on this task. In this paper, we propose TreeSegNet to address the class imbalance problem and further improve accuracy as measured by the evaluation metrics. Based on the infrastructure of DeepUNet, a Tree-CNN model in which each node represents a ResNeXt unit is constructed automatically according to the confusion matrix and a minimum graph cut algorithm. By transporting feature maps through concatenating connections, the Tree-CNN block fuses the multiscale features and learns the best weights for the model. In experiments on the ISPRS 2D semantic labeling Potsdam dataset, the results obtained by TreeSegNet are better than those of the published state-of-the-art methods. The F1 scores are improved especially for those classes that are easily confused. Complete and detailed comparisons and analyses are performed to show that the improvement is brought by the construction and the embedding of the Tree-CNN module.
  • Electrically-pumped lasers directly grown on silicon are key devices interfacing silicon microelectronics and photonics. We report here, for the first time, an electrically-pumped, room-temperature, continuous-wave (CW) and single-mode distributed feedback (DFB) laser array fabricated in InAs/GaAs quantum-dot (QD) gain material epitaxially grown on silicon. CW threshold currents as low as 12 mA and single-mode side mode suppression ratios (SMSRs) as high as 50 dB have been achieved from individual devices in the array. The laser array, compatible with state-of-the-art coarse wavelength division multiplexing (CWDM) systems, has a well-aligned channel spacing of $20 \pm 0.2$ nm and exhibits a record wavelength coverage range of 100 nm, the full span of the O-band. These results indicate that, for the first time, the performance of lasers epitaxially grown on silicon is elevated to a point approaching real-world CWDM applications, demonstrating the great potential of this technology.
  • Generative models (GMs) such as the Generative Adversarial Network (GAN) and the Variational Auto-Encoder (VAE) have thrived in recent years and achieved high-quality results in generating new samples. Especially in computer vision, GMs have been used in image inpainting, denoising and completion, which can be treated as inference from observed pixels to corrupted pixels. However, images are hierarchically structured, which makes them quite different from many real-world inference scenarios with non-hierarchical features. These inference scenarios contain heterogeneous stochastic variables and irregular mutual dependences. Traditionally they are modeled by Bayesian Networks (BNs). However, learning and inference in BN models are NP-hard, so the number of stochastic variables in a BN is highly constrained. In this paper, we adapt typical GMs to enable heterogeneous learning and inference in polynomial time. We also propose an extended autoregressive (EAR) model and an EAR with adversary loss (EARA) model and give theoretical results on their effectiveness. Experiments on several BN datasets show that our proposed EAR model achieves the best performance in most cases compared to other GMs. Besides black-box analysis, we have also conducted a series of experiments on Markov border inference of GMs for white-box analysis and give theoretical results.
  • We report low temperature scanning tunneling microscopy and spectroscopy studies of Ni-Bi films grown by molecular beam epitaxy. Highly anisotropic and twofold symmetric superconducting gaps are revealed in two distinct composites, Bi-rich NiBi3 and near-equimolar NixBi, both sharing a quasi-one-dimensional crystal structure. We further reveal axially elongated vortices in both phases, but Caroli-de Gennes-Matricon states solely within the vortex cores of NiBi3. Intriguingly, although the localized bound state splits off energetically at a finite distance of ~10 nm from a vortex center along the minor axis of the elliptic vortex, no splitting is found along the major axis. We attribute the elongated vortices and unusual vortex behaviors to the combined effects of the twofold superconducting gap and Fermi velocity. The findings provide a comprehensive understanding of the electron pairing and vortex matter in quasi-one-dimensional superconductors.
  • Blockchain stores information in a chain of blocks, whose integrity is usually guaranteed by Proof of Work (PoW). In many blockchain applications (including cryptocurrencies), users compete with each other to win the ownership of the blocks, a process commonly referred to as mining. Mining activities consume a huge amount of power, while the outcome appears to be useless besides validating a block. Here we discuss the requirements of designing a new PoW algorithm. We also propose a PoW scheme to help solve high-dimensional, non-linear optimization problems. The revised scheme enables us to address difficult scientific questions as a byproduct of mining.
  • Monocular camera systems are prevalent in intelligent transportation systems, but so far they have rarely been used for dimensional purposes such as accurately estimating the localization information of a vehicle. In this paper, we show that this capability can be realized. By integrating a series of advanced computer vision techniques including foreground extraction, edge and line detection, etc., and by utilizing deep learning networks for fine-grained vehicle model classification, we developed an algorithm which can estimate a vehicle's location (position, orientation and boundaries) within the environment down to 3.79 percent position accuracy and 2.5 degrees orientation accuracy. With this enhancement, current massive surveillance camera systems can potentially play the role of e-traffic police and trigger many new intelligent transportation applications, for example, guiding vehicles for parking or even for autonomous driving.
  • We investigate a hybrid inverse problem in fluorescence ultrasound modulated optical tomography (fUMOT) in the diffusive regime. We prove that the absorption coefficient of the fluorophores at the excitation frequency and the quantum efficiency coefficient can be uniquely and stably reconstructed from boundary measurement of the photon currents, provided that some background medium parameters are known. Reconstruction algorithms are proposed and numerically implemented as well.
  • Most recent approaches use the sequence-to-sequence model for paraphrase generation. The existing sequence-to-sequence model tends to memorize the words and the patterns in the training dataset instead of learning the meaning of the words. Therefore, the generated sentences are often grammatically correct but semantically improper. In this work, we introduce a novel model based on the encoder-decoder framework, called Word Embedding Attention Network (WEAN). Our proposed model generates the words by querying distributed word representations (i.e. neural word embeddings), aiming to capture the meaning of the corresponding words. Following previous work, we evaluate our model on two paraphrase-oriented tasks, namely text simplification and short text abstractive summarization. Experimental results show that our model outperforms the sequence-to-sequence baseline by BLEU scores of 6.3 and 5.5 on two English text simplification datasets, and by a ROUGE-2 F1 score of 5.7 on a Chinese summarization dataset. Moreover, our model achieves state-of-the-art performance on these three benchmark datasets.
  • Most existing person re-identification (re-id) methods require supervised model learning from a separate large set of pairwise labelled training data for every single camera pair. This significantly limits their scalability and usability in real-world large scale deployments with the need for performing re-id across many camera views. To address this scalability problem, we develop a novel deep learning method for transferring the labelled information of an existing dataset to a new unseen (unlabelled) target domain for person re-id without any supervised learning in the target domain. Specifically, we introduce a Transferable Joint Attribute-Identity Deep Learning (TJ-AIDL) model for simultaneously learning an attribute-semantic and identity-discriminative feature representation space transferable to any new (unseen) target domain for re-id tasks without the need for collecting new labelled training data from the target domain (i.e. unsupervised learning in the target domain). Extensive comparative evaluations validate the superiority of this new TJ-AIDL model for unsupervised person re-id over a wide range of state-of-the-art methods on four challenging benchmarks including VIPeR, PRID, Market-1501, and DukeMTMC-ReID.
  • Deep convolutional neural networks (CNNs) have greatly improved Face Recognition (FR) performance in recent years. Almost all CNNs in FR are trained on carefully labeled datasets containing plenty of identities. However, such high-quality datasets are very expensive to collect, which prevents many researchers from achieving state-of-the-art performance. In this paper, we propose a framework, called SeqFace, for learning discriminative face features. Besides a traditional identity training dataset, the designed SeqFace can train CNNs by using an additional dataset which includes a large number of face sequences collected from videos. Moreover, label smoothing regularization (LSR) and a newly proposed discriminative sequence agent (DSA) loss are employed to enhance the discrimination power of deep face features by making full use of the sequence data. Our method achieves excellent performance on Labeled Faces in the Wild (LFW) and YouTube Faces (YTF) with only a single ResNet. The code and models are publicly available online (https://github.com/huangyangyu/SeqFace).
  • This paper presents a versatile robotic system for sewing 3D structured objects. Leveraging a customized robotic sewing device and closed-loop visual servoing control, an all-in-one solution for sewing personalized stent grafts is demonstrated. Stitch size planning and automatic knot tying are proposed as the two key functions of the system. By using effective stitch size planning, sub-millimetre sewing accuracy is achieved for stitch sizes ranging from 2mm to 5mm. In addition, a thread manipulator for thread management and tension control is also proposed to perform successive knot tying to secure each stitch. Detailed laboratory experiments have been performed to assess the proposed instruments and allied algorithms. The proposed framework can be generalised to a wide range of applications including 3D industrial sewing, as well as transferred to other clinical areas such as surgical suturing.
  • Taking into account the interplay between disorder and Coulomb interactions, the phase diagram of the three-dimensional anisotropic Weyl semimetal is studied by renormalization group theory. It is well established that weak disorder is irrelevant in the 3D anisotropic Weyl semimetal, whereas strong disorder drives a quantum phase transition from the semimetal to a compressible diffusive metal. The long-range Coulomb interaction is irrelevant in the clean anisotropic Weyl semimetal. However, we find that the long-range Coulomb interaction exerts a dramatic influence on the critical disorder strength for the phase transition to the compressible diffusive metal. Specifically, the critical disorder strength can change prominently even when an arbitrarily small Coulomb interaction is included. This novel behavior is closely related to the anisotropic screening effect of the long-range Coulomb interaction, and essentially results from the specific energy dispersion of the fermions in the three-dimensional anisotropic Weyl semimetal.
  • Nanoscale room-temperature ferroelectricity is ideal for developing advanced non-volatile high-density memories. However, reaching the thin film limit in conventional ferroelectrics is a long-standing challenge due to the possible critical thickness effect. Van der Waals materials, thanks to their stable layered structure, saturated interfacial chemistry and weak interlayer couplings, are promising for exploring ultra-thin two-dimensional (2D) ferroelectrics and device applications. Here, we demonstrate a switchable room-temperature ferroelectric diode built upon a 2D ferroelectric $\alpha$-In2Se3 layer as thin as 5 nm in the form of a graphene/$\alpha$-In2Se3 heterojunction. The intrinsic out-of-plane ferroelectricity of the $\alpha$-In2Se3 thin layers is evidenced by the observation of reversible spontaneous electric polarization with a relatively low coercive electric field of ~$2 \times 10^5$ V/cm and a typical ferroelectric domain size of around tens of $\mu$m$^2$. Owing to the out-of-plane ferroelectricity of the $\alpha$-In2Se3 layer, the Schottky barrier at the graphene/$\alpha$-In2Se3 interface can be effectively tuned by switching the electric polarization with an applied voltage, leading to a pronounced switchable double diode effect with an on/off ratio of ~$10^4$. Our results offer a new way of developing novel nanoelectronic devices based on 2D ferroelectrics.
  • A filament consists of local maximizers of a smooth function $f$ when moving in a certain direction. Filamentary structures are important features of the shape of objects and are also considered an important lower dimensional characterization of multivariate data. There have been some recent theoretical studies of filaments in the nonparametric kernel density estimation context. This paper supplements the current literature in two ways. First, we provide a Bayesian approach to filament estimation in the regression context and study the posterior contraction rates using a finite random series of B-spline basis functions. Compared with the kernel-estimation method, this has a theoretical advantage, as the bias can be better controlled when the function is smoother, which allows obtaining better rates. Assuming that $f: \mathbb{R}^2 \to \mathbb{R}$ belongs to an isotropic H\"{o}lder class of order $\alpha \geq 4$, with the optimal choice of smoothing parameters, the posterior contraction rates for the filament points on some appropriately defined integral curves and for the Hausdorff distance of the filament are both $(n/\log n)^{(2-\alpha)/(2(1+\alpha))}$. Secondly, we provide a way to construct a credible set with sufficient frequentist coverage for the filaments. Our valid credible region consists of posterior filaments that have a frequentist interpretation. We demonstrate the success of our proposed method in simulations and an application to earthquake data.
  • We report on atomic-scale visualization of the structure of infinite-layer cuprate SrCuO2 thin films grown on Nb-doped SrTiO3 substrates by molecular beam epitaxy. An in-situ scanning tunneling microscopy study reveals a stoichiometric copper oxide (CuO2) plane with a $2 \times 2$ surface reconstruction, prompted by preferential clustering of four adjacent CuO2 plaquettes. By imaging the subsurface Sr atoms, intra-unit-cell rotational symmetry breaking is observed, which, together with the adjacent CuO2 clustering, can be well accounted for by a periodic up-down buckling of oxygen ions on the CuO2 plane. Further post-annealing leads to an incommensurate stripe structure of the surface layer. Our findings provide important structural information for a deeper understanding of the electronic structure of the superconducting CuO2 plane as well as high temperature superconductivity in cuprates.
  • Over its long history, the Chinese language has evolved a great deal. Native speakers now have difficulty reading sentences written in ancient Chinese. In this paper, we propose an unsupervised algorithm that constructs sentence-aligned ancient-contemporary pairs out of the abundant passage-aligned corpus. With this method, we build a large parallel corpus. We propose to apply the sequence-to-sequence model to automatically transfer between ancient and contemporary Chinese sentences. Experiments show that both our alignment and transfer methods produce very good results, except in some circumstances where even human translators can make mistakes without background knowledge.
  • We explore the frustrated spin-$1/2$ Heisenberg model on the star lattice with antiferromagnetic (AF) couplings inside each triangle and ferromagnetic (FM) inter-triangle couplings ($J_e<0$), and calculate its magnetic and thermodynamic properties. We show that the FM couplings do not sabotage the magnetic disordering of the ground state due to the frustration from the AF interactions inside each triangle, but trigger a fully gapped inversion-symmetry-breaking trimerized valence bond crystal (TVBC) with emergent spin-1 degrees of freedom. We discover that with strengthening $J_e$, the system scales exponentially, either with or without a magnetic field $h$: the order parameter, the five critical fields that separate the $J_e$-$h$ ground-state phase diagram into six phases, and the excitation gap obtained by low-temperature specific heat, all depend exponentially on $J_e$. We calculate the temperature dependence of the specific heat, which can be directly compared with future experiments.
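
The definitions in the arc-colored tournament abstract above (maximum monochromatic in/outdegree, irregularity, rainbow triangles through a vertex) can be made concrete with a small brute-force sketch. It is only an illustration, not the authors' method: the function names and the dictionary encoding of arcs are hypothetical, and it assumes that "triangle" means a directed 3-cycle, which the abstract does not state explicitly.

```python
from collections import Counter

# Hypothetical encoding: vertices are 0..n-1 and `arcs` maps an ordered
# pair (u, w) to the color of the arc u -> w. In a tournament exactly one
# of (u, w) and (w, u) is present for every pair of distinct vertices.

def max_mono_degrees(n, arcs):
    """Return (max monochromatic indegree, max monochromatic outdegree)."""
    in_colors = [Counter() for _ in range(n)]
    out_colors = [Counter() for _ in range(n)]
    for (u, w), color in arcs.items():
        out_colors[u][color] += 1
        in_colors[w][color] += 1
    d_in = max(max(c.values(), default=0) for c in in_colors)
    d_out = max(max(c.values(), default=0) for c in out_colors)
    return d_in, d_out

def irregularity(n, arcs):
    """Maximum over vertices of |outdegree - indegree|."""
    outdeg = [0] * n
    indeg = [0] * n
    for (u, w) in arcs:
        outdeg[u] += 1
        indeg[w] += 1
    return max(abs(outdeg[x] - indeg[x]) for x in range(n))

def rainbow_triangles_through(v, n, arcs):
    """Count directed 3-cycles v -> a -> b -> v whose arcs have 3 distinct colors."""
    count = 0
    for a in range(n):
        for b in range(n):
            if len({v, a, b}) < 3:
                continue
            # Assumption: a triangle is a directed cycle v -> a -> b -> v.
            if (v, a) in arcs and (a, b) in arcs and (b, v) in arcs:
                colors = {arcs[(v, a)], arcs[(a, b)], arcs[(b, v)]}
                if len(colors) == 3:
                    count += 1
    return count

# Toy tournament on 4 vertices (made-up data for illustration only).
example_arcs = {
    (0, 1): "red", (2, 0): "blue", (0, 3): "red",
    (1, 2): "green", (1, 3): "green", (3, 2): "blue",
}

if __name__ == "__main__":
    print(max_mono_degrees(4, example_arcs))
    print(irregularity(4, example_arcs))
    print(rainbow_triangles_through(0, 4, example_arcs))
```

Because a tournament contains only one orientation of each pair, every directed 3-cycle through `v` corresponds to exactly one ordered pair `(a, b)`, so the double loop does not double count. The enumeration is quadratic in `n` per vertex and is intended for checking small examples by hand.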