-
It is a challenging and practical research problem to obtain effective
compression of lengthy product titles for E-commerce. This is particularly
important as more and more users browse mobile E-commerce apps and more
merchants make the original product titles redundant and lengthy for Search
Engine Optimization. Traditional text summarization approaches often require a
large amount of preprocessing costs and do not capture the important issue of
conversion rate in E-commerce. This paper proposes a novel multi-task learning
approach for improving product title compression with user search log data. In
particular, a pointer network-based sequence-to-sequence approach is utilized
for title compression with an attentive mechanism as an extractive method and
an attentive encoder-decoder approach is utilized for generating user search
queries. The encoding parameters (i.e., semantic embedding of original titles)
are shared among the two tasks and the attention distributions are jointly
optimized. An extensive set of experiments with both human annotated data and
online deployment demonstrate the advantage of the proposed research for both
compression qualities and online business values.
-
Systematical doping studies have been carried out to search for the possible
superconductivity in the transition metal doped Zr$_5$Ge$_3$ system.
Superconductivity up to 5.7K is discovered in the Ru-doped
Zr$_5$Ge$_{2.5}$Ru$_{0.5}$ sample. Interestingly, with the same Ru-doping,
superconductivity is only induced with doping at the Ge site, but remains
absent down to 1.8K with doping at the Zr site or interstitial site. Both
magnetic and transport studies have revealed the bulk superconductivity nature
for Ru-doped Zr$_5$Ge$_{2.5}$Ru$_{0.5}$ sample. The high upper critical field,
enhanced electron correlation, and extremely small electron-phonon coupling,
have indicated possible unconventional superconductivity in this system, which
warrants further detailed theoretical and experimental studies.
-
Sparse methods and the use of Winograd convolutions are two orthogonal
approaches, each of which significantly accelerates convolution computations in
modern CNNs. Sparse Winograd merges these two and thus has the potential to
offer a combined performance benefit. Nevertheless, training convolution layers
so that the resulting Winograd kernels are sparse has not hitherto been very
successful. By introducing a Winograd layer in place of a standard convolution
layer, we can learn and prune Winograd coefficients "natively" and obtain
sparsity level beyond 90% with only 0.1% accuracy loss with AlexNet on ImageNet
dataset. Furthermore, we present a sparse Winograd convolution algorithm and
implementation that exploits the sparsity, achieving up to 31.7 effective
TFLOP/s in 32-bit precision on a latest Intel Xeon CPU, which corresponds to a
5.4x speedup over a state-of-the-art dense convolution implementation.
-
In this paper, we propose a novel scheme for data hiding in the fingerprint
minutiae template, which is the most popular in fingerprint recognition
systems. Various strategies are proposed in data embedding in order to maintain
the accuracy of fingerprint recognition as well as the undetectability of data
hiding. In bits replacement based data embedding, we replace the last few bits
of each element of the original minutiae template with the data to be hidden.
This strategy can be further improved using an optimized bits replacement based
data embedding, which is able to minimize the impact of data hiding on the
performance of fingerprint recognition. The third strategy is an order
preserving mechanism which is proposed to reduce the detectability of data
hiding. By using such a mechanism, it would be difficult for the attacker to
differentiate the minutiae template with hidden data from the original minutiae
templates. The experimental results show that the proposed data hiding scheme
achieves sufficient capacity for hiding common personal data, where the
accuracy of fingerprint recognition is acceptable after the data hiding.
-
Phenomenally successful in practical inference problems, convolutional neural
networks (CNN) are widely deployed in mobile devices, data centers, and even
supercomputers. The number of parameters needed in CNNs, however, are often
large and undesirable. Consequently, various methods have been developed to
prune a CNN once it is trained. Nevertheless, the resulting CNNs offer limited
benefits. While pruning the fully connected layers reduces a CNN's size
considerably, it does not improve inference speed noticeably as the compute
heavy parts lie in convolutions. Pruning CNNs in a way that increase inference
speed often imposes specific sparsity structures, thus limiting the achievable
sparsity levels.
We present a method to realize simultaneously size economy and speed
improvement while pruning CNNs. Paramount to our success is an efficient
general sparse-with-dense matrix multiplication implementation that is
applicable to convolution of feature maps with kernels of arbitrary sparsity
patterns. Complementing this, we developed a performance model that predicts
sweet spots of sparsity levels for different layers and on different computer
architectures. Together, these two allow us to demonstrate 3.1--7.3$\times$
convolution speedups over dense convolution in AlexNet, on Intel Atom, Xeon,
and Xeon Phi processors, spanning the spectrum from mobile devices to
supercomputers. We also open source our project at
https://github.com/IntelLabs/SkimCaffe.
-
Word2vec is a widely used algorithm for extracting low-dimensional vector
representations of words. State-of-the-art algorithms including those by
Mikolov et al. have been parallelized for multi-core CPU architectures, but are
based on vector-vector operations with "Hogwild" updates that are
memory-bandwidth intensive and do not efficiently use computational resources.
In this paper, we propose "HogBatch" by improving reuse of various data
structures in the algorithm through the use of minibatching and negative sample
sharing, hence allowing us to express the problem using matrix multiply
operations. We also explore different techniques to distribute word2vec
computation across nodes in a compute cluster, and demonstrate good strong
scalability up to 32 nodes. The new algorithm is particularly suitable for
modern multi-core/many-core architectures, especially Intel's latest Knights
Landing processors, and allows us to scale up the computation near linearly
across cores and nodes, and process hundreds of millions of words per second,
which is the fastest word2vec implementation to the best of our knowledge.
-
Word2Vec is a widely used algorithm for extracting low-dimensional vector
representations of words. It generated considerable excitement in the machine
learning and natural language processing (NLP) communities recently due to its
exceptional performance in many NLP applications such as named entity
recognition, sentiment analysis, machine translation and question answering.
State-of-the-art algorithms including those by Mikolov et al. have been
parallelized for multi-core CPU architectures but are based on vector-vector
operations that are memory-bandwidth intensive and do not efficiently use
computational resources. In this paper, we improve reuse of various data
structures in the algorithm through the use of minibatching, hence allowing us
to express the problem using matrix multiply operations. We also explore
different techniques to distribute word2vec computation across nodes in a
compute cluster, and demonstrate good strong scalability up to 32 nodes. In
combination, these techniques allow us to scale up the computation near
linearly across cores and nodes, and process hundreds of millions of words per
second, which is the fastest word2vec implementation to the best of our
knowledge.
-
Low temperature specific heat has been measured in superconductor $\beta$-FeS
with T$_c$ = 4.55 K. It is found that the low temperature electronic specific
heat C$_e$/T can be fitted to a linear relation in the low temperature region,
but fails to be described by an exponential relation as expected by an s-wave
gap. We try fittings to the data with different gap structures and find that a
model with one or two nodal gaps can fit the data. Under a magnetic field, the
field induced specific heat $\Delta\gamma$=[C$_e$(H)-C$_e$(0)]/T shows the
Volovik relation $\Delta\gamma_e(H)\propto \sqrt{H}$, suggesting the presence
of nodal gap(s) in this material.
-
To explore new superconductors beyond the copper-based and iron-based systems
is very important. The Ru element locates just below the Fe in the periodic
table and behaves like the Fe in many ways. One of the common thread to induce
high temperature superconductivity is to introduce moderate correlation into
the system. In this paper, we report the significant enhancement of
superconducting transition temperature from 3.84K to 5.77K by using a pressure
only of 1.74 GPa in LaRu2P2 which has an iso-structure of the iron-based 122
superconductors. The ab-initio calculation shows that the superconductivity in
LaRu2P2 at ambient pressure can be explained by the McMillan's theory with
strong electron-phonon coupling. However, it is difficult to interpret the
significant enhancement of Tc versus pressure within this picture. Detailed
analysis of the pressure induced evolution of resistivity and upper critical
field Hc2(T) reveals that the increases of Tc with pressure may be accompanied
by the involvement of extra electronic correlation effect. This suggests that
the Ru-based system has some commonality as the Fe-based superconductors.
-
By using a hydrostatic pressure, we have successfully tuned the ground state
and superconductivity in LaO0.5F0.5BiSe2 single crystals. It is found that,
with the increase of pressure, the original superconducting phase with Tc about
3.5 K can be tuned to a state with lower Tc, and then a new superconducting
phase with Tc about 6.5 K emerges. Accompanied by this crossover, the ground
state is switched from a semiconducting state to a metallic one. Accordingly,
the normal state resistivity also shows a nonmonotonic change with the external
pressure. Furthermore, by applying a magnetic field, the new superconducting
state under pressure with Tc about 6.5 K is suppressed, and the normal state
reveals a weak semiconducting feature again. These results illustrate a
non-trivial relationship between the normal state property and
superconductivity in this newly discovered superconducting system.
-
There is a critical need for standard approaches to assess, report, and
compare the technical performance of genome-scale differential gene expression
experiments. We assess technical performance with a proposed "standard"
dashboard of metrics derived from analysis of external spike-in RNA control
ratio mixtures. These control ratio mixtures with defined abundance ratios
enable assessment of diagnostic performance of differentially expressed
transcript lists, limit of detection of ratio (LODR) estimates, and expression
ratio variability and measurement bias. The performance metrics suite is
applicable to analysis of a typical experiment, and here we also apply these
metrics to evaluate technical performance among laboratories. An
interlaboratory study using identical samples shared amongst 12 laboratories
with three different measurement processes demonstrated generally consistent
diagnostic power across 11 laboratories. Ratio measurement variability and bias
were also comparable amongst laboratories for the same measurement process.
Different biases were observed for measurement processes using different mRNA
enrichment protocols.
-
Superconducting condensation energy $U_0^{int}$ has been determined by
integrating the electronic entropy in various iron pnictide/chalcogenide
superconducting systems. It is found that $U_0^{int}\propto T_c^n$ with $n$ = 3
to 4, which is in sharp contrast to the simple BCS prediction
$U_0^{BCS}=1/2N_F\Delta_s^2$ with $N_F$ the quasiparticle density of states at
the Fermi energy, $\Delta_s$ the superconducting gap. A similar correlation
holds if we compute the condensation energy through
$U_0^{cal}=3\gamma_n^{eff}\Delta_s^2/4\pi^2k_B^2$ with $\gamma_n^{eff}$ the
effective normal state electronic specific heat coefficient. This indicates a
general relationship $\gamma_n^{eff} \propto T_c^m$ with $m$ = 1 to 2, which is
not predicted by the BCS scheme. A picture based on quantum criticality is
proposed to explain this phenomenon.
-
We present a systematic method for evaluation of perturbation observables in
non-canonical single-field inflation models within the slow-roll approximation,
which allied with field redefinitions enables predictions to be established for
a wide range of models. We use this to investigate various non-canonical
inflation models, including Tachyon inflation and DBI inflation. The Lambert
$W$ function will be used extensively in our method for the evaluation of
observables. In the Tachyon case, in the slow-roll approximation the model can
be approximated by a canonical field with a redefined potential, which yields
predictions in better agreement with observations than the canonical
equivalents. For DBI inflation models we consider contributions from both the
scalar potential and the warp geometry. In the case of a quartic potential, we
find a formula for the observables under both non-relativistic and relativistic
behaviour of the scalar DBI inflaton. For a quadratic potential we find two
branches in the non-relativistic case, determined by the competition of model
parameters, while for the relativistic case we find consistency with results
already in the literature. We present a comparison to the latest Planck
satellite observations. Most of the non-canonical models we investigate,
including the Tachyon, are better fits to data than canonical models with the
same potential, but we find that DBI models in the slow-roll regime have
difficulty in matching the data.
-
On 13 December 2012, Chang'e-2 conducted a successful flyby of the near-Earth
asteroid 4179 Toutatis at a closest distance of 770 $\pm$ 120 meters from the
asteroid's surface. The highest-resolution image, with a resolution of better
than 3 meters, reveals new discoveries on the asteroid, e.g., a giant basin at
the big end, a sharply perpendicular silhouette near the neck region, and
direct evidence of boulders and regolith, which suggests that Toutatis may bear
a rubble-pile structure. Toutatis' maximum physical length and width are (4.75
$\times$ 1.95 km) $\pm$10$\%$, respectively, and the direction of the +$z$ axis
is estimated to be (250$\pm$5$^\circ$, 63$\pm$5$^\circ$) with respect to the
J2000 ecliptic coordinate system. The bifurcated configuration is indicative of
a contact binary origin for Toutatis, which is composed of two lobes (head and
body). Chang'e-2 observations have significantly improved our understanding of
the characteristics, formation, and evolution of asteroids in general.
-
We discuss a new mechanism which can be responsible for the origin of the
primordial perturbation in inflationary models, the inhomogeneous DBI reheating
scenario. Light DBI fields fluctuate during inflation, and finally create the
density perturbations through modulation of the inflation decay rate. In this
note, we investigate the curvature perturbation and its non-Gaussianity from
this new mechanism. Presenting generalized expressions for them, we show that
the curvature perturbation not only depends on the particular process of decay
but is also dependent on the sound speed $c_s$ from the DBI action. More
interestingly we find that the non-Gaussianity parameter $f_{NL}$ is
independent of $c_s$. As an application we exemplify some decay processes which
give a viable and detectable non-Gaussianity. Finally we find a possible
connection between our model and the DBI-Curvaton mechanism.
-
Electric transport and scanning tunneling spectrum (STS) have been
investigated on polycrystalline samples of the new superconductor Bi4O4S3. A
weak insulating behavior in the resistive curve has been induced in the normal
state when the superconductivity is suppressed by applying a magnetic field.
Interestingly, a kink appears on the temperature dependence of resistivity near
4 K at all high magnetic fields above 1 T when the bulk superconductivity is
completely suppressed. This kink associated with the upper critical field as
well as the wide range of excess conductance at low field and high temperature
are explained as the possible evidence of strong superconducting fluctuation.
From the tunneling spectra, a superconducting gap of about 3 meV is frequently
observed yielding a ratio of 2\Delta/(kB*Tc) ~ 16.6. This value is much larger
than the one predicted by the BCS theory in the weak coupling regime
(2\Delta/(kB*Tc) ~ 3.53), which suggests the strong coupling superconductivity
in the present system. Furthermore, the gapped feature persists on the spectra
until 14 K in the STS measurement, which suggests a prominent fluctuation
region of superconductivity. Such superconducting fluctuation can survive at
very high magnetic fields, which are far beyond the critical fields for bulk
superconductivity as inferred both from electric transport and tunneling
measurements.
-
We report the successful growth and the impurity scattering effect of single
crystals of Na(Fe$_{0.97-x}$Co$_{0.03}$T$_x$)As (T=Cu, Mn). The temperature
dependence of DC magnetization at high magnetic fields is measured for
different concentrations of Cu and Mn. Detailed analysis based on the
Curie-Weiss law indicates that the Cu doping weakens the average magnetic
moments, while doping Mn enhances the local magnetic moments greatly,
suggesting that the former may be non- or very weak magnetic impurities, and
the latter give rise to magnetic impurities. However, it is found that both
doping Cu and Mn will enhance the residual resistivity and suppress the
superconductivity at the same rate in the low doping region, being consistent
with the prediction of the S$^{\pm}$ model. For the Cu-doped system, the
superconductivity is suppressed completely at a residual resistivity $\rho_0$ =
0.87 m$\Omega$ cm at which a strong localization effect is observed. However,
in the case of Mn doping, the behavior of suppression to \emph{T}$_{c}$ changes
from a fast speed to a slow one and keeps superconductive even up to a residual
resistivity of 2.86 m$\Omega$ cm. Clearly the magnetic Mn impurities are even
not as detrimental as the non- or very weak magnetic Cu impurities to
superconductivity in the high doping regime.
-
In this work, we propose a novel low-complexity reduced-rank scheme and
consider its application to linear interference suppression in direct-sequence
ultra-wideband (DS-UWB) systems. Firstly, we investigate a generic reduced-rank
scheme that jointly optimizes a projection vector and a reduced-rank filter by
using the minimum mean-squared error (MMSE) criterion. Then a low-complexity
scheme, denoted switched approximation of adaptive basis functions (SAABF), is
proposed. The SAABF scheme is an extension of the generic scheme, in which the
complexity reduction is achieved by using a multi-branch framework to simplify
the structure of the projection vector. Adaptive implementations for the SAABF
scheme are developed by using least-mean squares (LMS) and recursive
least-squares (RLS) algorithms. We also develop algorithms for selecting the
branch number and the model order of the SAABF scheme. Simulations show that in
the scenarios with severe inter-symbol interference (ISI) and multiple access
interference (MAI), the proposed SAABF scheme has fast convergence and
remarkable interference suppression performance with low complexity.
-
A novel linear blind adaptive receiver based on joint iterative optimization
(JIO) and the constrained constant modulus (CCM) design criterion is proposed
for interference suppression in direct-sequence ultra-wideband (DS-UWB)
systems. The proposed blind receiver consists of two parts, a transformation
matrix that performs dimensionality reduction and a reduced-rank filter that
produces the output. In the proposed receiver, the transformation matrix and
the reduced-rank filter are updated jointly and iteratively to minimize the
constant modulus (CM) cost function subject to a constraint. Adaptive
implementations for the JIO receiver are developed by using the normalized
stochastic gradient (NSG) and recursive least-squares (RLS) algorithms. In
order to obtain a low-complexity scheme, the columns of the transformation
matrix with the RLS algorithm are updated individually. Blind channel
estimation algorithms for both versions (NSG and RLS) are implemented. Assuming
the perfect timing, the JIO receiver only requires the spreading code of the
desired user and the received data. Simulation results show that both versions
of the proposed JIO receivers have excellent performance in suppressing the
inter-symbol interference (ISI) and multiple access interference (MAI) with a
low complexity.
-
In this paper, we propose two adaptive detection schemes based on
single-carrier frequency domain equalization (SC-FDE) for multiuser
direct-sequence ultra-wideband (DS-UWB) systems, which are termed structured
channel estimation (SCE) and direct adaptation (DA). Both schemes use the
minimum mean square error (MMSE) linear detection strategy and employ a cyclic
prefix. In the SCE scheme, we perform the adaptive channel estimation in the
frequency domain and implement the despreading in the time domain after the
FDE. In this scheme, the MMSE detection requires the knowledge of the number of
users and the noise variance. For this purpose, we propose simple algorithms
for estimating these parameters. In the DA scheme, the interference suppression
task is fulfilled with only one adaptive filter in the frequency domain and a
new signal expression is adopted to simplify the design of such a filter.
Least-mean squares (LMS), recursive least squares (RLS) and conjugate gradient
(CG) adaptive algorithms are then developed for both schemes. A complexity
analysis compares the computational complexity of the proposed algorithms and
schemes, and simulation results for the downlink illustrate their performance.
-
In this work, we propose low-complexity adaptive biased estimation
algorithms, called group-based shrinkage estimators (GSEs), for parameter
estimation and interference suppression scenarios with mechanisms to
automatically adjust the shrinkage factors. The proposed estimation algorithms
divide the target parameter vector into a number of groups and adaptively
calculate one shrinkage factor for each group. GSE schemes improve the
performance of the conventional least squares (LS) estimator in terms of the
mean-squared error (MSE), while requiring a very modest increase in complexity.
An MSE analysis is presented which indicates the lower bounds of the GSE
schemes with different group sizes. We prove that our proposed schemes
outperform the biased estimation with only one shrinkage factor and the best
performance of GSE can be obtained with the maximum number of groups. Then, we
consider an application of the proposed algorithms to single-carrier
frequency-domain equalization (SC-FDE) of direct-sequence ultra-wideband
(DS-UWB) systems, in which the structured channel estimation (SCE) algorithm
and the frequency domain receiver employ the GSE. The simulation results show
that the proposed algorithms significantly outperform the conventional unbiased
estimator in the analyzed scenarios.
-
We use spatially resolved scanning tunneling spectroscopy in Na(Fe{1-x}Cox)As
to investigate the impurity effect induced by Co dopants. The Co impurities are
successfully identified, and the spatial distributions of local density of
state at different energies around these impurities are investigated. It is
found that the spectrum shows negligible spatial variation at different
positions near the Co impurity, although there is a continuum of the in-gap
states which lifts the zero-bias conductance to a finite value. Our results put
constraints on the S+- and S++ models and sharpen the debate on the role of
scattering potentials induced by the Co dopants.
-
Resistive and magnetization properties have been measured in BiS$_2$-based
samples CeO$_{1-x}$F$_{x}$BiS$_{2}$ with a systematic substitution of O with F
(0 $<$ x $<$ 0.6). In contrast to the band structure calculations, it is found
that the parent phase of CeOBiS$_2$ is a bad metal, instead of an band
insulator. By doping electrons into the system, it is surprising to find that
superconductivity appears together with an insulating normal state. This
evolution is clearly different from the cuprate and the iron pnictide systems,
and is interpreted as approaching the von Hove singularity. Furthermore,
ferromagnetism which may arise from the Ce moments, has been observed in the
low temperature region in all samples, suggesting the co-existence of
superconductivity and ferromagnetism in the superconducting samples.
-
We extend the ModeCode software of Mortonson, Peiris and Easther to enable
numerical computation of perturbations in K-inflation models, where the scalar
field no longer has a canonical kinetic term. Focussing on models where the
kinetic and potential terms can be separated into a sum, we compute slow-roll
predictions for various models and use these to verify the numerical code. A
Markov chain Monte Carlo analysis is then used to impose constraints from WMAP7
data on the addition of a term quadratic in the kinetic energy to the
Lagrangian of simple chaotic inflation models. For a quadratic potential, the
data do not discriminate against addition of such a term, while for a quartic
(\lambda \phi^4) potential inclusion of such a term is actually favoured.
Overall, constraints on such a term from present data are found to be extremely
weak.
-
Resistivity, Hall effect and magnetization have been investigated on the new
superconductor Bi4O4S3. A weak insulating behavior has been induced in the
normal state when the superconductivity is suppressed. Hall effect measurements
illustrate clearly a multiband feature dominated by electron charge carriers,
which is further supported by the magnetoresistance data. Interestingly, a kink
appears on the temperature dependence of resistivity at about 4 K at all high
magnetic fields when the bulk superconductivity is completely suppressed. This
kink can be well traced back to the upper critical field Hc2(T) in the low
field region, and is explained as the possible evidence of residual Cooper
pairs on the one dimensional chains.