
Feature selection is a standard approach to understanding and modeling
highdimensional classification data, but the corresponding statistical methods
hinge on tuning parameters that are difficult to calibrate. In particular,
existing calibration schemes in the logistic regression framework lack any
finite sample guarantees. In this paper, we introduce a novel calibration
scheme for $\ell_1$penalized logistic regression. It is based on simple tests
along the tuning parameter path and is equipped with optimal guarantees for
feature selection. It is also amenable to easy and efficient implementations,
and it rivals or outmatches existing methods in simulations and real data
applications.

Fine control of the dynamics of a quantum system is the key element to
perform quantum information processing and coherent manipulations for atomic
and molecular systems. In this paper we propose a control protocol using a
tangentpulse driven model and demonstrate that it indicates a desirable
design, i.e., of being both fast and accurate for population transfer. As
opposed to other existing strategies, a remarkable character of the present
scheme is that high velocity of the nonadiabatic evolution itself not only will
not lead to unwanted transitions but also can suppress the error caused by the
truncation of the driving pulse.

The notion of Scott distance between points and subsets in a metric space, a
metric analogy of the Scott topology on an ordered set, is introduced, making a
metric space into an approach space. Basic properties of Scott distance are
investigated, including its topological coreflection and its relation to
injective $T_0$ approach spaces. It is proved that the topological coreflection
of the Scott distance is sandwiched between the $d$Scott topology and the
generalized Scott topology; and that every injective $T_0$ approach space is a
cocomplete and continuous metric space equipped with its Scott distance.

As more and more academic papers are being submitted to conferences and
journals, evaluating all these papers by professionals is timeconsuming and
can cause inequality due to the personal factors of the reviewers. In this
paper, in order to assist professionals in evaluating academic papers, we
propose a novel task: automatic academic paper rating (AAPR), which
automatically determine whether to accept academic papers. We build a new
dataset for this task and propose a novel modularized hierarchical
convolutional neural network to achieve automatic academic paper rating.
Evaluation results show that the proposed model outperforms the baselines by a
large margin. The dataset and code are available at
\url{https://github.com/lancopku/AAPR}

Let $T_{n}$ be an arccolored tournament of order $n$. The maximum
monochromatic indegree $\Delta^{mon}(T_{n})$ (resp. outdegree
$\Delta^{+mon}(T_{n})$) of $T_{n}$ is the maximum number of inarcs (resp.
outarcs) of a same color incident to a vertex of $T_{n}$. The irregularity
$i(T_{n})$ of $T_{n}$ is the maximum difference between the indegree and
outdegree of a vertex of $T_{n}$. A subdigraph $H$ of an arccolored digraph
$D$ is called rainbow if each pair of arcs in $H$ have distinct colors. In this
paper, we show that each vertex $v$ in an arccolored tournament $T_{n}$ with
$\Delta^{mon}(T_n)\leq\Delta^{+mon}(T_n)$ is contained in at least
$\frac{\delta(v)(n\delta(v)i(T_n))}{2}[\Delta^{mon}(T_{n})(n1)+\Delta^{+mon}(T_{n})d^+(v)]$
rainbow triangles, where $\delta(v)=\min\{d^+(v), d^(v)\}$. We also give some
maximum monochromatic degree conditions for $T_{n}$ to contain rainbow
triangles, and to contain rainbow triangles passing through a given vertex.
Finally, we present some examples showing that some of the conditions in our
results are best possible.
Keywords: arccolored tournament, rainbow triangle, maximum monochromatic
indegree (outdegree), irregularity

Traditional intelligent fault diagnosis of rolling bearings work well only
under a common assumption that the labeled training data (source domain) and
unlabeled testing data (target domain) are drawn from the same distribution.
However, in many realworld applications, this assumption does not hold,
especially when the working condition varies. In this paper, a new adversarial
adaptive 1D CNN called A2CNN is proposed to address this problem. A2CNN
consists of four parts, namely, a source feature extractor, a target feature
extractor, a label classifier and a domain discriminator. The layers between
the source and target feature extractor are partially untied during the
training stage to take both training efficiency and domain adaptation into
consideration. Experiments show that A2CNN has strong faultdiscriminative and
domaininvariant capacity, and therefore can achieve high accuracy under
different working conditions. We also visualize the learned features and the
networks to explore the reasons behind the high performance of our proposed
model.

Ordinary least squares provides the optimal linear approximation to the true
regression function under misspecification. This paper investigates the
Instrumental Variables (IV) version of this problem. The resulting population
parameter is called the Optimal Linear IV Approximation (OLIVA). This paper
shows that a necessary condition for regular identification of the OLIVA is
also sufficient for existence of an IV estimand in a linear IV model. The
necessary condition holds for the important case of a binary endogenous
treatment, leading also to a LATE interpretation with positive weights. The
instrument in the IV estimand is unknown and is estimated in a first step. A
TwoStep IV (TSIV) estimator is proposed. We establish the asymptotic normality
of a debiased TSIV estimator based on locally robust moments. The TSIV
estimator does not require neither completeness nor identification of the
instrument. As a byproduct of our analysis, we robustify the classical Hausman
test for exogeneity against misspecification of the linear model. Monte Carlo
simulations suggest excellent finite sample performance for the proposed
inferences.

Spatiotemporal feature learning in videos is a fundamental problem in
computer vision. This paper presents a new architecture, termed as
AppearanceandRelation Network (ARTNet), to learn video representation in an
endtoend manner. ARTNets are constructed by stacking multiple generic
building blocks, called as SMART, whose goal is to simultaneously model
appearance and relation from RGB input in a separate and explicit manner.
Specifically, SMART blocks decouple the spatiotemporal learning module into an
appearance branch for spatial modeling and a relation branch for temporal
modeling. The appearance branch is implemented based on the linear combination
of pixels or filter responses in each frame, while the relation branch is
designed based on the multiplicative interactions between pixels or filter
responses across multiple frames. We perform experiments on three action
recognition benchmarks: Kinetics, UCF101, and HMDB51, demonstrating that SMART
blocks obtain an evident improvement over 3D convolutions for spatiotemporal
feature learning. Under the same training setting, ARTNets achieve superior
performance on these three datasets to the existing stateoftheart methods.

For the task of subdecimeter aerial imagery segmentation, the finegrained
semantic segmentation results are usually difficult to obtain because of
complex remote sensing contents and optical conditions. In addition, remote
sensing imagery has inherent limitations of imbalanced class distribution.
Recently, convolutional neural networks (CNNs) have shown outstanding
performance on this task. In this paper, we propose the TreeSegNet to solve the
class imbalance problem and further improve the accuracy in the metrics' point
of view. Based on the infrastructure of DeepUNet, a TreeCNN model in which
each node represents a ResNeXt unit is constructed automatically according to
confusion matrix and minimum graph cut algorithm. By transporting feature maps
by concatenating connections, the TreeCNN block fuses the multiscale features
and learning the best weights for the model. In the experiments on ISPRS 2D
semantic labeling Potsdam dataset, the results gotten by TreeSegNet are better
than the opened stateoftheart methods. The F1 measure scores of classes are
improved especially for those classes that are easily confused. Completely and
detailed comparison and analysis are performed to show that the improvement is
brought by the construction and the embedding of the TreeCNN module.

Electricallypumped lasers directly grown on silicon are key devices
interfacing silicon microelectronics and photonics. We report here, for the
first time, an electricallypumped, roomtemperature, continuouswave (CW) and
singlemode distributed feedback (DFB) laser array fabricated in InAs/GaAs
quantumdot (QD) gain material epitaxially grown on silicon. CW threshold
currents as low as 12 mA and singlemode side mode suppression ratios (SMSRs)
as high as 50 dB have been achieved from individual devices in the array. The
laser array, compatible with stateoftheart coarse wavelength division
multiplexing (CWDM) systems, has a wellaligned channel spacing of 20 0.2 nm
and exhibits a record wavelength coverage range of 100 nm, the full span of the
Oband. These results indicate that, for the first time, the performance of
lasers epitaxially grown on silicon is elevated to a point approaching
realworld CWDM applications, demonstrating the great potential of this
technology.

Generative models (GMs) such as Generative Adversary Network (GAN) and
Variational AutoEncoder (VAE) have thrived these years and achieved high
quality results in generating new samples. Especially in Computer Vision, GMs
have been used in image inpainting, denoising and completion, which can be
treated as the inference from observed pixels to corrupted pixels. However,
images are hierarchically structured which are quite different from many
realworld inference scenarios with nonhierarchical features. These inference
scenarios contain heterogeneous stochastic variables and irregular mutual
dependences. Traditionally they are modeled by Bayesian Network (BN). However,
the learning and inference of BN model are NPhard thus the number of
stochastic variables in BN is highly constrained. In this paper, we adapt
typical GMs to enable heterogeneous learning and inference in polynomial
time.We also propose an extended autoregressive (EAR) model and an EAR with
adversary loss (EARA) model and give theoretical results on their
effectiveness. Experiments on several BN datasets show that our proposed EAR
model achieves the best performance in most cases compared to other GMs. Except
for black box analysis, we've also done a serial of experiments on Markov
border inference of GMs for white box analysis and give theoretical results.

We report low temperature scanning tunneling microscopy and spectroscopy
studies of NiBi films grown by molecular beam epitaxy. Highly anisotropic and
twofold symmetric superconducting gaps are revealed in two distinct composites,
Birich NiBi3 and nearequimolar NixBi, both sharing quasionedimensional
crystal structure. We further reveal axially elongated vortices in both phases,
but Carolide GennesMatricon states solely within the vortex cores of NiBi3.
Intriguingly, although the localized bound state splits energetically off at a
finite distance ~10 nm away from a vortex center along the minor axis of
elliptic vortex, no splitting is found along the major axis. We attribute the
elongated vortices and unusual vortex behaviors to the combined effects of
twofold superconducting gap and Fermi velocity. The findings provide a
comprehensive understanding of the electron pairing and vortex matter in
quasionedimensional superconductors

Blockchain stores information into a chain of blocks, whose integrity is
usually guaranteed by Proof of Work (PoW). In many blockchain applications
(including cryptocurrencies), users compete with each other to win the
ownership of the blocks, a process commonly referred as mining. Mining
activities consume huge amount of power, while the outcome appears to be
useless besides validating a block. Here we discuss the requirements of
designing a new PoW algorithm. We also propose a PoW scheme to help solve
highdimension, nonlinear optimization problems. The revised scheme enables us
to address difficult scientific questions as a byproduct of mining.

Monocular camera systems are prevailing in intelligent transportation
systems, but by far they have rarely been used for dimensional purposes such as
to accurately estimate the localization information of a vehicle. In this
paper, we show that this capability can be realized. By integrating a series of
advanced computer vision techniques including foreground extraction, edge and
line detection, etc., and by utilizing deep learning networks for finegrained
vehicle model classification, we developed an algorithm which can estimate
vehicles location (position, orientation and boundaries) within the environment
down to 3.79 percent position accuracy and 2.5 degrees orientation accuracy.
With this enhancement, current massive surveillance camera systems can
potentially play the role of etraffic police and trigger many new intelligent
transportation applications, for example, to guide vehicles for parking or even
for autonomous driving.

We investigate a hybrid inverse problem in fluorescence ultrasound modulated
optical tomography (fUMOT) in the diffusive regime. We prove that the
absorption coefficient of the fluorophores at the excitation frequency and the
quantum efficiency coefficient can be uniquely and stably reconstructed from
boundary measurement of the photon currents, provided that some background
medium parameters are known. Reconstruction algorithms are proposed and
numerically implemented as well.

Most recent approaches use the sequencetosequence model for paraphrase
generation. The existing sequencetosequence model tends to memorize the words
and the patterns in the training dataset instead of learning the meaning of the
words. Therefore, the generated sentences are often grammatically correct but
semantically improper. In this work, we introduce a novel model based on the
encoderdecoder framework, called Word Embedding Attention Network (WEAN). Our
proposed model generates the words by querying distributed word representations
(i.e. neural word embeddings), hoping to capturing the meaning of the according
words. Following previous work, we evaluate our model on two
paraphraseoriented tasks, namely text simplification and short text
abstractive summarization. Experimental results show that our model outperforms
the sequencetosequence baseline by the BLEU score of 6.3 and 5.5 on two
English text simplification datasets, and the ROUGE2 F1 score of 5.7 on a
Chinese summarization dataset. Moreover, our model achieves stateoftheart
performances on these three benchmark datasets.

Most existing person reidentification (reid) methods require supervised
model learning from a separate large set of pairwise labelled training data for
every single camera pair. This significantly limits their scalability and
usability in realworld large scale deployments with the need for performing
reid across many camera views. To address this scalability problem, we develop
a novel deep learning method for transferring the labelled information of an
existing dataset to a new unseen (unlabelled) target domain for person reid
without any supervised learning in the target domain. Specifically, we
introduce an Transferable Joint AttributeIdentity Deep Learning (TJAIDL) for
simultaneously learning an attributesemantic and identitydiscriminative
feature representation space transferrable to any new (unseen) target domain
for reid tasks without the need for collecting new labelled training data from
the target domain (i.e. unsupervised learning in the target domain). Extensive
comparative evaluations validate the superiority of this new TJAIDL model for
unsupervised person reid over a wide range of stateoftheart methods on four
challenging benchmarks including VIPeR, PRID, Market1501, and DukeMTMCReID.

Deep convolutional neural networks (CNNs) have greatly improved the Face
Recognition (FR) performance in recent years. Almost all CNNs in FR are trained
on the carefully labeled datasets containing plenty of identities. However,
such highquality datasets are very expensive to collect, which restricts many
researchers to achieve stateoftheart performance. In this paper, we propose
a framework, called SeqFace, for learning discriminative face features. Besides
a traditional identity training dataset, the designed SeqFace can train CNNs by
using an additional dataset which includes a large number of face sequences
collected from videos. Moreover, the label smoothing regularization (LSR) and a
new proposed discriminative sequence agent (DSA) loss are employed to enhance
discrimination power of deep face features via making full use of the sequence
data. Our method achieves excellent performance on Labeled Faces in the Wild
(LFW), YouTube Faces (YTF), only with a single ResNet. The code and models are
publicly available online (https://github.com/huangyangyu/SeqFace).

This paper presents a versatile robotic system for sewing 3D structured
object. Leveraging on using a customized robotic sewing device and closedloop
visual servoing control, an allinone solution for sewing personalized stent
graft is demonstrated. Stitch size planning and automatic knot tying are
proposed as the two key functions of the system. By using effective stitch size
planning, submillimetre sewing accuracy is achieved for stitch sizes ranging
from 2mm to 5mm. In addition, a thread manipulator for thread management and
tension control is also proposed to perform successive knot tying to secure
each stitch. Detailed laboratory experiments have been performed to access the
proposed instruments and allied algorithms. The proposed framework can be
generalised to a wide range of applications including 3D industrial sewing, as
well as transferred to other clinical areas such as surgical suturing.

Taking into account the interplay between the disorder and Coulomb
interactions, the phase diagram of threedimensional anisotropicWeyl semimetal
is studied by renormalization group theory. It is well established that the
weak disorder is irrelevant in 3D anisotropicWeyl semimetal, while the strong
disorder makes sense which drives a quantum phase transition from semimetal to
compressible diffusive metal. The longrange Coulomb interaction is irrelevant
in clean anistropic Weyl semimetal. However, we find that the longrange
Coulomb interaction exerts a dramatic influence on the critical disorder
strength for phase transition to compressible diffusive metal. Specifically,
the critical disorder strength can receive prominent changes even though an
arbitrarily small value of Coulomb interaction is included. This novel behavior
is closely related to the anisotropic screening effect of longrange Coulomb
interaction, and essentially results from the specifical energy dispersion of
the fermions in threedimensional anisotropic Weyl semimetal.

Nanoscaled roomtemperature ferroelectricity is ideal for developing advanced
nonvolatile highdensity memories. However, reaching the thin film limit in
conventional ferroelectrics is a longstanding challenge due to the possible
critical thickness effect. Van der Waals materials, thanks to their stable
layered structure, saturate interfacial chemistry and weak interlayer
couplings, are promising for exploring ultrathin twodimensional (2D)
ferroelectrics and device applications. Here, we demonstrate a switchable
roomtemperature ferroelectric diode built upon a 2D ferroelectric
{\alpha}In2Se3 layer as thin as 5 nm in the form of graphene/{\alpha}In2Se3
heterojunction. The intrinsic outofplane ferroelectricity of the
{\alpha}In2Se3 thin layers is evidenced by the observation of reversible
spontaneous electric polarization with a relative low coercive electric field
of ~$2 X 10^5 V/cm$ and a typical ferroelectric domain size of around tens
${\mu}m^2$. Owing to the outofplane ferroelectricity of the {\alpha}In2Se3
layer, the Schottky barrier at the graphene/{\alpha}In2Se3 interface can be
effectively tuned by switching the electric polarization with an applied
voltage, leading to a pronounced switchable double diode effect with an on/off
ratio of ~$10^4$. Our results offer a new way for developing novel
nanoelectronic devices based on 2D ferroelectrics.

A filament consists of local maximizers of a smooth function $f$ when moving
in a certain direction. Filamentary structures are important features of the
shape of objects and are also considered as important lower dimensional
characterization of multivariate data. There have been some recent theoretical
studies of filaments in the nonparametric kernel density estimation context.
This paper supplements the current literature in two ways. First, we provide a
Bayesian approach to the filament estimation in regression context and study
the posterior contraction rates using a finite random series of Bsplines
basis. Compared with the kernelestimation method, this has theoretical
advantage as the bias can be better controlled when the function is smoother,
which allows obtaining better rates. Assuming that $f: \mathbb{R}^2 \mapsto
\mathbb{R}$ belongs to an isotropic H\"{o}lder class of order $\alpha \geq 4$,
with the optimal choice of smoothing parameters, the posterior contraction
rates for the filament points on some appropriately defined integral curves and
for the Hausdorff distance of the filament are both $(n/\log
n)^{(2\alpha)/(2(1+\alpha))}$. Secondly, we provide a way to construct a
credible set with sufficient frequentist coverage for the filaments. Our valid
credible region consists of posterior filaments that have frequentist
interpretation. We demonstrate the success of our proposed method in
simulations and application to earthquake data.

We report on atomicscale visualization of the structure of infinitelayer
cuprate SrCuO2 thin films grown on Nbdoped SrTiO3 substrates by molecular beam
epitaxy. Insitu scanning tunneling microscopy study reveals stoichiometric
copper oxide (CuO2) plane with a 2 x 2 surface reconstruction, prompted by
preferential clustering of four adjacent CuO2 plaquettes. By imaging the
subsurface Sr atoms, intraunitcell rotational symmetry breaking is observed,
which, together with the adjacent CuO2 clustering, can be well accounted for by
a periodic updown buckling of oxygen ions on the CuO2 plane. Further
postannealing leads to an incommensurate stripe structure of the surface
layer. Our findings provide important structural information for deeply
understanding the electronic structure of superconducting CuO2 plane as well as
high temperature superconductivity in cuprates.

During the long time of development, Chinese language has evolved a great
deal. Native speakers now have difficulty in reading sentences written in
ancient Chinese. In this paper, we propose an unsupervised algorithm that
constructs sentencealigned ancientcontemporary pairs out of the abundant
passagealigned corpus. With this method, we build a large parallel corpus. We
propose to apply the sequence to sequence model to automatically transfer
between ancient and contemporary Chinese sentences. Experiments show that both
our alignment and transfer method can produce very good result except for some
circumstances that even human translators can make mistakes without background
knowledge.

We explore the frustrated spin$1/2$ Heisenberg model on the star lattice
with antiferromagnetic (AF) couplings inside each triangle and ferromagnetic
(FM) intertriangle couplings ($J_e<0$), and calculate its magnetic and
thermodynamic properties. We show that the FM couplings do not sabotage the
magnetic disordering of the ground state due to the frustration from the AF
interactions inside each triangle, but trigger a fully gapped
inversionsymmetrybreaking trimerized valence bond crystal (TVBC) with
emergent spin1 degrees of freedom. We discover that with strengthening $J_e$,
the system scales exponentially, either with or without a magnetic field $h$:
the order parameter, the five critical fields that separate the $J_e$$h$
groundstate phase diagram into six phases, and the excitation gap obtained by
lowtemperature specific heat, all depend exponentially on $J_e$. We calculate
the temperature dependence of the specific heat, which can be directly compared
with future experiments.