• It is a challenging and practical research problem to obtain effective compression of lengthy product titles for E-commerce. This is particularly important as more and more users browse mobile E-commerce apps and more merchants make the original product titles redundant and lengthy for Search Engine Optimization. Traditional text summarization approaches often require a large amount of preprocessing costs and do not capture the important issue of conversion rate in E-commerce. This paper proposes a novel multi-task learning approach for improving product title compression with user search log data. In particular, a pointer network-based sequence-to-sequence approach is utilized for title compression with an attentive mechanism as an extractive method and an attentive encoder-decoder approach is utilized for generating user search queries. The encoding parameters (i.e., semantic embedding of original titles) are shared among the two tasks and the attention distributions are jointly optimized. An extensive set of experiments with both human annotated data and online deployment demonstrate the advantage of the proposed research for both compression qualities and online business values.
  • Systematical doping studies have been carried out to search for the possible superconductivity in the transition metal doped Zr$_5$Ge$_3$ system. Superconductivity up to 5.7K is discovered in the Ru-doped Zr$_5$Ge$_{2.5}$Ru$_{0.5}$ sample. Interestingly, with the same Ru-doping, superconductivity is only induced with doping at the Ge site, but remains absent down to 1.8K with doping at the Zr site or interstitial site. Both magnetic and transport studies have revealed the bulk superconductivity nature for Ru-doped Zr$_5$Ge$_{2.5}$Ru$_{0.5}$ sample. The high upper critical field, enhanced electron correlation, and extremely small electron-phonon coupling, have indicated possible unconventional superconductivity in this system, which warrants further detailed theoretical and experimental studies.
  • Sparse methods and the use of Winograd convolutions are two orthogonal approaches, each of which significantly accelerates convolution computations in modern CNNs. Sparse Winograd merges these two and thus has the potential to offer a combined performance benefit. Nevertheless, training convolution layers so that the resulting Winograd kernels are sparse has not hitherto been very successful. By introducing a Winograd layer in place of a standard convolution layer, we can learn and prune Winograd coefficients "natively" and obtain sparsity level beyond 90% with only 0.1% accuracy loss with AlexNet on ImageNet dataset. Furthermore, we present a sparse Winograd convolution algorithm and implementation that exploits the sparsity, achieving up to 31.7 effective TFLOP/s in 32-bit precision on a latest Intel Xeon CPU, which corresponds to a 5.4x speedup over a state-of-the-art dense convolution implementation.
  • In this paper, we propose a novel scheme for data hiding in the fingerprint minutiae template, which is the most popular in fingerprint recognition systems. Various strategies are proposed in data embedding in order to maintain the accuracy of fingerprint recognition as well as the undetectability of data hiding. In bits replacement based data embedding, we replace the last few bits of each element of the original minutiae template with the data to be hidden. This strategy can be further improved using an optimized bits replacement based data embedding, which is able to minimize the impact of data hiding on the performance of fingerprint recognition. The third strategy is an order preserving mechanism which is proposed to reduce the detectability of data hiding. By using such a mechanism, it would be difficult for the attacker to differentiate the minutiae template with hidden data from the original minutiae templates. The experimental results show that the proposed data hiding scheme achieves sufficient capacity for hiding common personal data, where the accuracy of fingerprint recognition is acceptable after the data hiding.
  • Phenomenally successful in practical inference problems, convolutional neural networks (CNN) are widely deployed in mobile devices, data centers, and even supercomputers. The number of parameters needed in CNNs, however, are often large and undesirable. Consequently, various methods have been developed to prune a CNN once it is trained. Nevertheless, the resulting CNNs offer limited benefits. While pruning the fully connected layers reduces a CNN's size considerably, it does not improve inference speed noticeably as the compute heavy parts lie in convolutions. Pruning CNNs in a way that increase inference speed often imposes specific sparsity structures, thus limiting the achievable sparsity levels. We present a method to realize simultaneously size economy and speed improvement while pruning CNNs. Paramount to our success is an efficient general sparse-with-dense matrix multiplication implementation that is applicable to convolution of feature maps with kernels of arbitrary sparsity patterns. Complementing this, we developed a performance model that predicts sweet spots of sparsity levels for different layers and on different computer architectures. Together, these two allow us to demonstrate 3.1--7.3$\times$ convolution speedups over dense convolution in AlexNet, on Intel Atom, Xeon, and Xeon Phi processors, spanning the spectrum from mobile devices to supercomputers. We also open source our project at https://github.com/IntelLabs/SkimCaffe.
  • Word2vec is a widely used algorithm for extracting low-dimensional vector representations of words. State-of-the-art algorithms including those by Mikolov et al. have been parallelized for multi-core CPU architectures, but are based on vector-vector operations with "Hogwild" updates that are memory-bandwidth intensive and do not efficiently use computational resources. In this paper, we propose "HogBatch" by improving reuse of various data structures in the algorithm through the use of minibatching and negative sample sharing, hence allowing us to express the problem using matrix multiply operations. We also explore different techniques to distribute word2vec computation across nodes in a compute cluster, and demonstrate good strong scalability up to 32 nodes. The new algorithm is particularly suitable for modern multi-core/many-core architectures, especially Intel's latest Knights Landing processors, and allows us to scale up the computation near linearly across cores and nodes, and process hundreds of millions of words per second, which is the fastest word2vec implementation to the best of our knowledge.
  • Word2Vec is a widely used algorithm for extracting low-dimensional vector representations of words. It generated considerable excitement in the machine learning and natural language processing (NLP) communities recently due to its exceptional performance in many NLP applications such as named entity recognition, sentiment analysis, machine translation and question answering. State-of-the-art algorithms including those by Mikolov et al. have been parallelized for multi-core CPU architectures but are based on vector-vector operations that are memory-bandwidth intensive and do not efficiently use computational resources. In this paper, we improve reuse of various data structures in the algorithm through the use of minibatching, hence allowing us to express the problem using matrix multiply operations. We also explore different techniques to distribute word2vec computation across nodes in a compute cluster, and demonstrate good strong scalability up to 32 nodes. In combination, these techniques allow us to scale up the computation near linearly across cores and nodes, and process hundreds of millions of words per second, which is the fastest word2vec implementation to the best of our knowledge.
  • Low temperature specific heat has been measured in superconductor $\beta$-FeS with T$_c$ = 4.55 K. It is found that the low temperature electronic specific heat C$_e$/T can be fitted to a linear relation in the low temperature region, but fails to be described by an exponential relation as expected by an s-wave gap. We try fittings to the data with different gap structures and find that a model with one or two nodal gaps can fit the data. Under a magnetic field, the field induced specific heat $\Delta\gamma$=[C$_e$(H)-C$_e$(0)]/T shows the Volovik relation $\Delta\gamma_e(H)\propto \sqrt{H}$, suggesting the presence of nodal gap(s) in this material.
  • To explore new superconductors beyond the copper-based and iron-based systems is very important. The Ru element locates just below the Fe in the periodic table and behaves like the Fe in many ways. One of the common thread to induce high temperature superconductivity is to introduce moderate correlation into the system. In this paper, we report the significant enhancement of superconducting transition temperature from 3.84K to 5.77K by using a pressure only of 1.74 GPa in LaRu2P2 which has an iso-structure of the iron-based 122 superconductors. The ab-initio calculation shows that the superconductivity in LaRu2P2 at ambient pressure can be explained by the McMillan's theory with strong electron-phonon coupling. However, it is difficult to interpret the significant enhancement of Tc versus pressure within this picture. Detailed analysis of the pressure induced evolution of resistivity and upper critical field Hc2(T) reveals that the increases of Tc with pressure may be accompanied by the involvement of extra electronic correlation effect. This suggests that the Ru-based system has some commonality as the Fe-based superconductors.
  • By using a hydrostatic pressure, we have successfully tuned the ground state and superconductivity in LaO0.5F0.5BiSe2 single crystals. It is found that, with the increase of pressure, the original superconducting phase with Tc about 3.5 K can be tuned to a state with lower Tc, and then a new superconducting phase with Tc about 6.5 K emerges. Accompanied by this crossover, the ground state is switched from a semiconducting state to a metallic one. Accordingly, the normal state resistivity also shows a nonmonotonic change with the external pressure. Furthermore, by applying a magnetic field, the new superconducting state under pressure with Tc about 6.5 K is suppressed, and the normal state reveals a weak semiconducting feature again. These results illustrate a non-trivial relationship between the normal state property and superconductivity in this newly discovered superconducting system.
  • There is a critical need for standard approaches to assess, report, and compare the technical performance of genome-scale differential gene expression experiments. We assess technical performance with a proposed "standard" dashboard of metrics derived from analysis of external spike-in RNA control ratio mixtures. These control ratio mixtures with defined abundance ratios enable assessment of diagnostic performance of differentially expressed transcript lists, limit of detection of ratio (LODR) estimates, and expression ratio variability and measurement bias. The performance metrics suite is applicable to analysis of a typical experiment, and here we also apply these metrics to evaluate technical performance among laboratories. An interlaboratory study using identical samples shared amongst 12 laboratories with three different measurement processes demonstrated generally consistent diagnostic power across 11 laboratories. Ratio measurement variability and bias were also comparable amongst laboratories for the same measurement process. Different biases were observed for measurement processes using different mRNA enrichment protocols.
  • Superconducting condensation energy $U_0^{int}$ has been determined by integrating the electronic entropy in various iron pnictide/chalcogenide superconducting systems. It is found that $U_0^{int}\propto T_c^n$ with $n$ = 3 to 4, which is in sharp contrast to the simple BCS prediction $U_0^{BCS}=1/2N_F\Delta_s^2$ with $N_F$ the quasiparticle density of states at the Fermi energy, $\Delta_s$ the superconducting gap. A similar correlation holds if we compute the condensation energy through $U_0^{cal}=3\gamma_n^{eff}\Delta_s^2/4\pi^2k_B^2$ with $\gamma_n^{eff}$ the effective normal state electronic specific heat coefficient. This indicates a general relationship $\gamma_n^{eff} \propto T_c^m$ with $m$ = 1 to 2, which is not predicted by the BCS scheme. A picture based on quantum criticality is proposed to explain this phenomenon.
  • We present a systematic method for evaluation of perturbation observables in non-canonical single-field inflation models within the slow-roll approximation, which allied with field redefinitions enables predictions to be established for a wide range of models. We use this to investigate various non-canonical inflation models, including Tachyon inflation and DBI inflation. The Lambert $W$ function will be used extensively in our method for the evaluation of observables. In the Tachyon case, in the slow-roll approximation the model can be approximated by a canonical field with a redefined potential, which yields predictions in better agreement with observations than the canonical equivalents. For DBI inflation models we consider contributions from both the scalar potential and the warp geometry. In the case of a quartic potential, we find a formula for the observables under both non-relativistic and relativistic behaviour of the scalar DBI inflaton. For a quadratic potential we find two branches in the non-relativistic case, determined by the competition of model parameters, while for the relativistic case we find consistency with results already in the literature. We present a comparison to the latest Planck satellite observations. Most of the non-canonical models we investigate, including the Tachyon, are better fits to data than canonical models with the same potential, but we find that DBI models in the slow-roll regime have difficulty in matching the data.
  • On 13 December 2012, Chang'e-2 conducted a successful flyby of the near-Earth asteroid 4179 Toutatis at a closest distance of 770 $\pm$ 120 meters from the asteroid's surface. The highest-resolution image, with a resolution of better than 3 meters, reveals new discoveries on the asteroid, e.g., a giant basin at the big end, a sharply perpendicular silhouette near the neck region, and direct evidence of boulders and regolith, which suggests that Toutatis may bear a rubble-pile structure. Toutatis' maximum physical length and width are (4.75 $\times$ 1.95 km) $\pm$10$\%$, respectively, and the direction of the +$z$ axis is estimated to be (250$\pm$5$^\circ$, 63$\pm$5$^\circ$) with respect to the J2000 ecliptic coordinate system. The bifurcated configuration is indicative of a contact binary origin for Toutatis, which is composed of two lobes (head and body). Chang'e-2 observations have significantly improved our understanding of the characteristics, formation, and evolution of asteroids in general.
  • We discuss a new mechanism which can be responsible for the origin of the primordial perturbation in inflationary models, the inhomogeneous DBI reheating scenario. Light DBI fields fluctuate during inflation, and finally create the density perturbations through modulation of the inflation decay rate. In this note, we investigate the curvature perturbation and its non-Gaussianity from this new mechanism. Presenting generalized expressions for them, we show that the curvature perturbation not only depends on the particular process of decay but is also dependent on the sound speed $c_s$ from the DBI action. More interestingly we find that the non-Gaussianity parameter $f_{NL}$ is independent of $c_s$. As an application we exemplify some decay processes which give a viable and detectable non-Gaussianity. Finally we find a possible connection between our model and the DBI-Curvaton mechanism.
  • Electric transport and scanning tunneling spectrum (STS) have been investigated on polycrystalline samples of the new superconductor Bi4O4S3. A weak insulating behavior in the resistive curve has been induced in the normal state when the superconductivity is suppressed by applying a magnetic field. Interestingly, a kink appears on the temperature dependence of resistivity near 4 K at all high magnetic fields above 1 T when the bulk superconductivity is completely suppressed. This kink associated with the upper critical field as well as the wide range of excess conductance at low field and high temperature are explained as the possible evidence of strong superconducting fluctuation. From the tunneling spectra, a superconducting gap of about 3 meV is frequently observed yielding a ratio of 2\Delta/(kB*Tc) ~ 16.6. This value is much larger than the one predicted by the BCS theory in the weak coupling regime (2\Delta/(kB*Tc) ~ 3.53), which suggests the strong coupling superconductivity in the present system. Furthermore, the gapped feature persists on the spectra until 14 K in the STS measurement, which suggests a prominent fluctuation region of superconductivity. Such superconducting fluctuation can survive at very high magnetic fields, which are far beyond the critical fields for bulk superconductivity as inferred both from electric transport and tunneling measurements.
  • We report the successful growth and the impurity scattering effect of single crystals of Na(Fe$_{0.97-x}$Co$_{0.03}$T$_x$)As (T=Cu, Mn). The temperature dependence of DC magnetization at high magnetic fields is measured for different concentrations of Cu and Mn. Detailed analysis based on the Curie-Weiss law indicates that the Cu doping weakens the average magnetic moments, while doping Mn enhances the local magnetic moments greatly, suggesting that the former may be non- or very weak magnetic impurities, and the latter give rise to magnetic impurities. However, it is found that both doping Cu and Mn will enhance the residual resistivity and suppress the superconductivity at the same rate in the low doping region, being consistent with the prediction of the S$^{\pm}$ model. For the Cu-doped system, the superconductivity is suppressed completely at a residual resistivity $\rho_0$ = 0.87 m$\Omega$ cm at which a strong localization effect is observed. However, in the case of Mn doping, the behavior of suppression to \emph{T}$_{c}$ changes from a fast speed to a slow one and keeps superconductive even up to a residual resistivity of 2.86 m$\Omega$ cm. Clearly the magnetic Mn impurities are even not as detrimental as the non- or very weak magnetic Cu impurities to superconductivity in the high doping regime.
  • In this work, we propose a novel low-complexity reduced-rank scheme and consider its application to linear interference suppression in direct-sequence ultra-wideband (DS-UWB) systems. Firstly, we investigate a generic reduced-rank scheme that jointly optimizes a projection vector and a reduced-rank filter by using the minimum mean-squared error (MMSE) criterion. Then a low-complexity scheme, denoted switched approximation of adaptive basis functions (SAABF), is proposed. The SAABF scheme is an extension of the generic scheme, in which the complexity reduction is achieved by using a multi-branch framework to simplify the structure of the projection vector. Adaptive implementations for the SAABF scheme are developed by using least-mean squares (LMS) and recursive least-squares (RLS) algorithms. We also develop algorithms for selecting the branch number and the model order of the SAABF scheme. Simulations show that in the scenarios with severe inter-symbol interference (ISI) and multiple access interference (MAI), the proposed SAABF scheme has fast convergence and remarkable interference suppression performance with low complexity.
  • A novel linear blind adaptive receiver based on joint iterative optimization (JIO) and the constrained constant modulus (CCM) design criterion is proposed for interference suppression in direct-sequence ultra-wideband (DS-UWB) systems. The proposed blind receiver consists of two parts, a transformation matrix that performs dimensionality reduction and a reduced-rank filter that produces the output. In the proposed receiver, the transformation matrix and the reduced-rank filter are updated jointly and iteratively to minimize the constant modulus (CM) cost function subject to a constraint. Adaptive implementations for the JIO receiver are developed by using the normalized stochastic gradient (NSG) and recursive least-squares (RLS) algorithms. In order to obtain a low-complexity scheme, the columns of the transformation matrix with the RLS algorithm are updated individually. Blind channel estimation algorithms for both versions (NSG and RLS) are implemented. Assuming the perfect timing, the JIO receiver only requires the spreading code of the desired user and the received data. Simulation results show that both versions of the proposed JIO receivers have excellent performance in suppressing the inter-symbol interference (ISI) and multiple access interference (MAI) with a low complexity.
  • In this paper, we propose two adaptive detection schemes based on single-carrier frequency domain equalization (SC-FDE) for multiuser direct-sequence ultra-wideband (DS-UWB) systems, which are termed structured channel estimation (SCE) and direct adaptation (DA). Both schemes use the minimum mean square error (MMSE) linear detection strategy and employ a cyclic prefix. In the SCE scheme, we perform the adaptive channel estimation in the frequency domain and implement the despreading in the time domain after the FDE. In this scheme, the MMSE detection requires the knowledge of the number of users and the noise variance. For this purpose, we propose simple algorithms for estimating these parameters. In the DA scheme, the interference suppression task is fulfilled with only one adaptive filter in the frequency domain and a new signal expression is adopted to simplify the design of such a filter. Least-mean squares (LMS), recursive least squares (RLS) and conjugate gradient (CG) adaptive algorithms are then developed for both schemes. A complexity analysis compares the computational complexity of the proposed algorithms and schemes, and simulation results for the downlink illustrate their performance.
  • In this work, we propose low-complexity adaptive biased estimation algorithms, called group-based shrinkage estimators (GSEs), for parameter estimation and interference suppression scenarios with mechanisms to automatically adjust the shrinkage factors. The proposed estimation algorithms divide the target parameter vector into a number of groups and adaptively calculate one shrinkage factor for each group. GSE schemes improve the performance of the conventional least squares (LS) estimator in terms of the mean-squared error (MSE), while requiring a very modest increase in complexity. An MSE analysis is presented which indicates the lower bounds of the GSE schemes with different group sizes. We prove that our proposed schemes outperform the biased estimation with only one shrinkage factor and the best performance of GSE can be obtained with the maximum number of groups. Then, we consider an application of the proposed algorithms to single-carrier frequency-domain equalization (SC-FDE) of direct-sequence ultra-wideband (DS-UWB) systems, in which the structured channel estimation (SCE) algorithm and the frequency domain receiver employ the GSE. The simulation results show that the proposed algorithms significantly outperform the conventional unbiased estimator in the analyzed scenarios.
  • We use spatially resolved scanning tunneling spectroscopy in Na(Fe{1-x}Cox)As to investigate the impurity effect induced by Co dopants. The Co impurities are successfully identified, and the spatial distributions of local density of state at different energies around these impurities are investigated. It is found that the spectrum shows negligible spatial variation at different positions near the Co impurity, although there is a continuum of the in-gap states which lifts the zero-bias conductance to a finite value. Our results put constraints on the S+- and S++ models and sharpen the debate on the role of scattering potentials induced by the Co dopants.
  • Resistive and magnetization properties have been measured in BiS$_2$-based samples CeO$_{1-x}$F$_{x}$BiS$_{2}$ with a systematic substitution of O with F (0 $<$ x $<$ 0.6). In contrast to the band structure calculations, it is found that the parent phase of CeOBiS$_2$ is a bad metal, instead of an band insulator. By doping electrons into the system, it is surprising to find that superconductivity appears together with an insulating normal state. This evolution is clearly different from the cuprate and the iron pnictide systems, and is interpreted as approaching the von Hove singularity. Furthermore, ferromagnetism which may arise from the Ce moments, has been observed in the low temperature region in all samples, suggesting the co-existence of superconductivity and ferromagnetism in the superconducting samples.
  • We extend the ModeCode software of Mortonson, Peiris and Easther to enable numerical computation of perturbations in K-inflation models, where the scalar field no longer has a canonical kinetic term. Focussing on models where the kinetic and potential terms can be separated into a sum, we compute slow-roll predictions for various models and use these to verify the numerical code. A Markov chain Monte Carlo analysis is then used to impose constraints from WMAP7 data on the addition of a term quadratic in the kinetic energy to the Lagrangian of simple chaotic inflation models. For a quadratic potential, the data do not discriminate against addition of such a term, while for a quartic (\lambda \phi^4) potential inclusion of such a term is actually favoured. Overall, constraints on such a term from present data are found to be extremely weak.
  • Resistivity, Hall effect and magnetization have been investigated on the new superconductor Bi4O4S3. A weak insulating behavior has been induced in the normal state when the superconductivity is suppressed. Hall effect measurements illustrate clearly a multiband feature dominated by electron charge carriers, which is further supported by the magnetoresistance data. Interestingly, a kink appears on the temperature dependence of resistivity at about 4 K at all high magnetic fields when the bulk superconductivity is completely suppressed. This kink can be well traced back to the upper critical field Hc2(T) in the low field region, and is explained as the possible evidence of residual Cooper pairs on the one dimensional chains.