• We propose a new network architecture, Gated Attention Networks (GaAN), for learning on graphs. Unlike the traditional multi-head attention mechanism, which equally consumes all attention heads, GaAN uses a convolutional sub-network to control each attention head's importance. We demonstrate the effectiveness of GaAN on the inductive node classification problem. Moreover, with GaAN as a building block, we construct the Graph Gated Recurrent Unit (GGRU) to address the traffic speed forecasting problem. Extensive experiments on three real-world datasets show that our GaAN framework achieves state-of-the-art results on both tasks.
  • Since the invention of word2vec, the skip-gram model has significantly advanced the research of network embedding, such as the recent emergence of the DeepWalk, LINE, PTE, and node2vec approaches. In this work, we show that all of the aforementioned models with negative sampling can be unified into the matrix factorization framework with closed forms. Our analysis and proofs reveal that: (1) DeepWalk empirically produces a low-rank transformation of a network's normalized Laplacian matrix; (2) LINE, in theory, is a special case of DeepWalk when the size of vertices' context is set to one; (3) As an extension of LINE, PTE can be viewed as the joint factorization of multiple networks' Laplacians; (4) node2vec is factorizing a matrix related to the stationary distribution and transition probability tensor of a 2nd-order random walk. We further provide the theoretical connections between skip-gram based network embedding algorithms and the theory of graph Laplacian. Finally, we present the NetMF method as well as its approximation algorithm for computing network embedding. Our method offers significant improvements over DeepWalk and LINE for conventional network mining tasks. This work lays the theoretical foundation for skip-gram based network embedding methods, leading to a better understanding of latent network representation learning.
  • Fast digitisers and digital pulse processing have been widely used for spectral application and pulse shape discrimination (PSD) owing to their advantages in terms of compactness, higher trigger rates, offline analysis, etc. Meanwhile, the noise of readout electronics is usually trivial for organic, plastic, or liquid scintillator with PSD ability because of their poor intrinsic energy resolution. However, LaBr3(Ce) has been widely used for its excellent energy resolution and has been proven to have PSD ability for alpha/gamma particles. Therefore, designing a digital acquisition system for such scintillators as LaBr3(Ce) with both optimal energy resolution and promising PSD ability is worthwhile. Several experimental research studies about the choice of digitiser properties for liquid scintillators have already been conducted in terms of the sampling rate and vertical resolution. Quantitative analysis on the influence of waveform digitisers, that is, fast amplifier (optional), sampling rates, and vertical resolution, on both applications is still lacking. The present paper provides quantitative analysis of these factors and, hence, general rules about the optimal design of digitisers for both energy resolution and PSD application according to the noise analysis of time-variant gated charge integration.
  • This paper proposes a novel neural machine reading model for open-domain question answering at scale. Existing machine comprehension models typically assume that a short piece of relevant text containing answers is already identified and given to the models, from which the models are designed to extract answers. This assumption, however, is not realistic for building a large-scale open-domain question answering system, which requires both deep text understanding and the identification of relevant text from a corpus simultaneously. In this paper, we introduce the Neural Comprehensive Ranker (NCR), which integrates both passage ranking and answer extraction in a single framework. A Q&A system based on this framework allows users to issue an open-domain question without needing to provide a piece of text that must contain the answer. Experiments show that the unified NCR model is able to outperform the state of the art in both retrieval of relevant text and answer extraction.
  • Progress in science has advanced the development of human society across history, with dramatic revolutions shaped by information theory, genetic cloning, and artificial intelligence, among the many scientific achievements produced in the 20th century. However, the way that science advances itself is much less well-understood. In this work, we study the evolution of scientific development over the past century by presenting an anatomy of 89 million digitalized papers published between 1900 and 2015. We find that science has benefited from the shift from individual work to collaborative effort, with over 90% of the world-leading innovations generated by collaborations in this century, nearly four times higher than they were in the 1900s. We discover that rather than the frequent myopic- and self-referencing that was common in the early 20th century, modern scientists instead tend to look for literature further back and farther around. Finally, we also observe the globalization of scientific development from 1900 to 2015, including 25-fold and 7-fold increases in international collaborations and citations, respectively, as well as a dramatic decline in the dominant accumulation of citations by the US, the UK, and Germany, from ~95% to ~50% over the same period. Our discoveries are meant to serve as a starter for exploring the visionary ways in which science has developed throughout the past century, generating insight into and an impact upon the current scientific innovations and funding policies.
  • Previous studies have demonstrated the empirical success of word embeddings in various applications. In this paper, we investigate the problem of learning distributed representations for text documents which many machine learning algorithms take as input for a number of NLP tasks. We propose a neural network model, KeyVec, which learns document representations with the goal of preserving key semantics of the input text. It enables the learned low-dimensional vectors to retain the topics and important information from the documents that will flow to downstream tasks. Our empirical evaluations show the superior quality of KeyVec representations in two different document understanding tasks.
  • A scintillating fiber array measurement system for gross beta is developed to achieve real-time monitoring of radioactivity in drinking water. The detector consists of 1,096 scintillating fibers placed in a stainless steel tank, with both ends of the fibers connected to a photomultiplier tube. The working voltage, background counting rate, and stability of the detector were tested, and the detection efficiency was calibrated using a standard solution of potassium chloride. The measured background counting rate of the detector is 38.31 cps and the detection efficiency for $\beta$ particles is 0.37 cps per Bq per liter; the detector can reach its detection limit of 1.0 Bq per liter for $\beta$ particles within 100 minutes without pre-concentration.
  • In neutrinoless double beta ($0\nu\beta\beta$) decay experiments, the diversity of topological signatures of different particles provides an important tool to distinguish double beta events from background events and reduce background rates. Aiming at suppressing the single-electron backgrounds which are most challenging, several groups have established Monte Carlo simulation packages to study the topological characteristics of single-electron events and $0\nu\beta\beta$ events and develop methods to differentiate them. In this paper, applying the knowledge of graph theory, a new topological signature called REF track (Refined Energy-Filtered track) is proposed and proven to be an accurate approximation of the real particle trajectory. Based on the analysis of the energy depositions along the REF track of single-electron events and $0\nu\beta\beta$ events, the REF energy deposition models for both events are proposed to indicate the significant differences between them. With these differences, this paper presents a new discrimination method, which, in the Monte Carlo simulation, achieved a single-electron rejection factor of $93.8 \pm 0.3$ (stat.)% as well as a $0\nu\beta\beta$ efficiency of $85.6 \pm 0.4$ (stat.)% with optimized parameters in CdZnTe.
  • The key to photoelectric X-ray polarimetry is the determination of the emission direction of photoelectrons. Because of the low mass of an electron, the ionisation trajectory is not straight and the useful information needed for polarimetry is stored mostly in the initial part of the track where less energy is deposited. We present a new algorithm, based on the shortest path problem in graph theory, to reconstruct the 2D electron track from the measured image that is blurred due to transversal diffusion along drift and multiplication in the gas chamber. Compared with previous methods based on moment analysis, this algorithm allows us to identify the photoelectric interaction point more accurately and precisely for complicated tracks resulting from high energy photons or low pressure chambers. This leads to a better position resolution and a higher degree of modulation toward high energy X-rays. The new algorithm is justified using simulations and measurements with the gas pixel detector (GPD), and it should also work for other polarimetric techniques such as a time projection chamber (TPC). As the improvement is restricted to the high energy band, this new algorithm shows limited improvement in the sensitivity of GPD polarimeters, but it may have a larger potential for low-pressure TPC polarimeters.
  • We report the first result on Ge-76 neutrinoless double beta decay from the CDEX-1 experiment at the China Jinping Underground Laboratory. A p-type point-contact high-purity germanium detector with a mass of 994 g has been installed to search for neutrinoless double beta decay events, as well as to directly detect dark matter particles. An exposure of 304 kg·day has been analyzed. The wideband spectrum from 500 keV to 3 MeV was obtained, and the average event rate in the 2.039 MeV energy region is about 0.012 counts per keV per kg per day. The half-life of Ge-76 neutrinoless double beta decay derived from this result is $T_{1/2} > 6.4 \times 10^{22}$ yr (90% C.L.). An upper limit on the effective Majorana-neutrino mass of 5.0 eV has been achieved. Possible methods to further decrease the background level are discussed and will be pursued in the next stage of the CDEX experiment.
  • A new method of pulse shape discrimination (PSD) for BEGe detectors is developed to suppress Compton-continuum by digital pulse shape analysis (PSA), which helps reduce the Compton background level in gamma ray spectrometry. A decision parameter related to the rise time of a pulse shape was presented. The method was verified by experiments using 60Co and 137Cs sources. The result indicated that the 60Co Peak to Compton ratio and the Cs-Peak to Co-Compton ratio could be improved by more than two and three times, respectively.
  • The neutron background spectrum from thermal neutrons to 20 MeV fast neutrons was measured in the first experimental hall of the China Jinping underground laboratory with a Bonner multi-sphere spectrometer. The measurement system was validated with a 252Cf source and the inconsistency was corrected. Because of micro-discharges, the dataset was screened, and the background from the steel of the detectors was estimated by MC simulation. Using a genetic algorithm, we obtained the neutron energy distribution; the total neutron flux was $(2.69 \pm 1.02) \times 10^{-5}$ cm$^{-2}$ s$^{-1}$.
  • An underwater in situ gamma ray spectrometer based on LaBr3 was developed and optimized to monitor marine radioactivity. The intrinsic background of LaBr3, mainly from 138La and 227Ac, was well determined by low-background measurement and the pulse shape discrimination method. A method of self-calibration using three internal contaminant peaks was proposed to eliminate the peak shift during long-term monitoring. Experiments at different temperatures showed that the method helps maintain long-term stability. To monitor the marine radioactivity, the spectrometer efficiency was determined via a water tank experiment as well as Monte Carlo simulation.
  • The ability of broad-energy germanium (BEGe) detectors to discriminate background using pulse shape discrimination (PSD) makes them competitive candidates for neutrinoless double beta decay ($0\nu\beta\beta$) experiments. The measurements of key parameters for detector modeling in a commercial p-type BEGe detector are presented in this paper. Point-like sources were used to investigate the energy resolution and linearity of the detector. A cylindrical volume source was used for the efficiency calibration. With an assembled device for source positioning, a collimated 133Ba point-like source was used to scan the detector and investigate the active volume. A point-like source of 241Am was used to measure the dead layer thicknesses, which are approximately 0.17 mm on the front and 1.18 mm on the side. The described characterization method will play an important role in the $0\nu\beta\beta$ experiments with BEGe detectors at China JinPing underground Laboratory (CJPL) in the future.
  • A prototype LaBr3:Ce in situ gamma-ray spectrometer for marine environmental monitoring has been developed and applied to in situ measurement. A 3-inch LaBr3:Ce scintillator is used in the detector, and digital pulse processing electronics serve as the pulse height analyzer. For this prototype, the energy response of the spectrometer is linear and the energy resolution at 662 keV is 2.6% (much better than NaI). From measurements of the prototype in a water tank filled with 137Cs, the detection efficiency for 137Cs is ($0.288 \pm 0.01$) cps/(Bq/L), which is close to the Monte Carlo simulation result of 0.283 cps/(Bq/L). From this measurement, the MDAC for 137Cs in one hour has been calculated to be 0.78 Bq/L, better than that of the NaI(Tl) in situ gamma spectrometer, which is ~1.0 Bq/L.
  • A low background germanium gamma ray spectrometer, GeTHU, has been installed at China JinPing underground Laboratory. The integral background count rate between 40 and 2700 keV was 0.6 cpm, and the origin was studied by Monte Carlo simulation. Detection limits and efficiencies were calculated for selected gamma peaks. Boric acid and silica sand samples were measured and 137Cs contamination was found in boric acid. GeTHU will be mainly used to measure environmental samples and screen materials in dark matter experiments.
  • The China Dark Matter Experiment (CDEX) is located at the China Jinping underground laboratory (CJPL) and aims to directly detect the WIMP flux with high sensitivity in the low mass region. Here we present a study of the predicted photon and electron backgrounds, including the background contribution of the structure materials of the germanium detector, the passive shielding materials, and the intrinsic radioactivity of the liquid argon that serves as an anti-Compton active shielding detector. A detailed geometry is modeled and the background contribution has been simulated with the GEANT4 program, based on the measured radioactivities of all possible components. The photon and electron background level in the energy region of interest ($<10^{-2}$ events kg$^{-1}$ day$^{-1}$ keV$^{-1}$ (cpkkd)) is then predicted based on Monte Carlo simulations. The simulated result is consistent with the design goal of the CDEX-10 experiment, 0.1 cpkkd, which shows that the active and passive shield design of CDEX-10 is effective and feasible.
  • The China Dark Matter Experiment (CDEX) Collaboration will carry out a direct search for weakly interacting massive particles with germanium detectors. Liquid argon will be utilized as an anti-Compton and cooling material for the germanium detectors. A low-background and large-area photomultiplier tube (PMT) immersed in liquid argon will be used to read out the light signal from the argon. In this paper we carry out a careful evaluation on the performance of the PMT operating at both room and cryogenic temperatures. Based on the single photoelectron response model, the absolute gain and resolution of the PMT were measured. This has laid a foundation for PMT selection, calibration and signal analysis in the forthcoming CDEX experiments.
  • Cosmogenic nuclides inside germanium detectors contribute background noise spectra quite different from those of ordinary external sources. We propose and discuss a nuclide decay and level transition model based on graph theory to understand the background contribution of the decay of cosmogenic nuclides inside a germanium crystal. In this work, not only the level transition process but also the detector response time was taken into consideration to decide whether or not to apply coincidence summing-up. We simulated the background spectrum of the internal cosmogenic nuclides in a germanium detector, and found some unique phenomena caused by the coincidence summing-up effect in the simulated spectrum. Thus, the background spectrum of each cosmogenic nuclide can be quantitatively obtained.
  • In this work we investigate medium modifications to the interference pattern between initial and final state radiation. We compute single gluon production off a highly energetic parton that undergoes a hard scattering and subsequently crosses a dense QCD medium of finite size. We extend our previous studies obtained at first order in opacity by providing general results for multiple soft scatterings and their specific formulation within the harmonic oscillator approximation. We show that there is a gradual onset of decoherence between the initial and final state radiation due to multiple scatterings, that opens the phase space for large angle emissions. By examining the multiplicity of produced gluons, we observe a potentially large double logarithmic enhancement for dense media and small opening angles. This result points to a possible modification of the evolution equations due to a QCD medium of finite size. We briefly comment on the phenomenological consequences of this setup in high-energy nuclear collisions.
  • China JinPing underground Laboratory (CJPL) is the deepest underground laboratory presently running in the world. In such a deep underground laboratory, the cosmic ray flux is a very important and necessary parameter for rare event experiments. A plastic scintillator telescope system has been set up to measure the cosmic ray flux. The performance of the telescope system has been studied using cosmic rays in a ground-level laboratory near CJPL. Based on the underground experimental data taken from November 2010 to December 2011 in CJPL, with an effective live time of 171 days, the cosmic ray muon flux in CJPL is measured to be $(2.0 \pm 0.4) \times 10^{-10}$ cm$^{-2}$ s$^{-1}$. The ultra-low cosmic ray background guarantees an ideal environment for dark matter experiments at CJPL.
  • The CDEX Collaboration has been established for direct detection of light dark matter particles, using ultra-low energy threshold p-type point-contact germanium detectors, in China JinPing underground Laboratory (CJPL). The first 1 kg point-contact germanium detector with a sub-keV energy threshold has been tested in a passive shielding system located in CJPL. The outputs from both the point-contact p+ electrode and the outside n+ electrode make it possible to scan the lower energy range of less than 1 keV and at the same time to detect the higher energy range up to 3 MeV. The outputs from both the p+ and n+ electrodes may also provide a more powerful method of signal discrimination for dark matter experiments. Some key parameters, including energy resolution, dead time, decay times of internal X-rays, and system stability, have been tested and measured. The results show that the 1 kg point-contact germanium detector, together with its shielding system and electronics, can run smoothly with good performance. This detector system will be deployed for dark matter search experiments.
  • Weakly Interacting Massive Particles (WIMPs) are candidates for dark matter in our universe. To date, no direct interaction of WIMPs with nuclei has been observed. The experimentally obtained exclusion limits on the spin-independent WIMP-nucleon cross section are about $10^{-7}$ pb in the high mass region and only $10^{-5}$ pb in the low mass region. The China Jin-Ping underground laboratory (CJPL) is the deepest underground lab in the world and provides a very promising environment for direct observation of dark matter. The China Dark Matter Experiment (CDEX) is going to directly detect the WIMP flux with high sensitivity in the low mass region. Both CJPL and CDEX have achieved remarkable progress in the past two years. CDEX employs a point-contact germanium semiconductor detector (PCGe) whose detection threshold is less than 300 eV. We report the measurement results of the muon flux, monitoring of radioactivity, and radon concentration carried out in CJPL, and describe the structure and performance of the 1 kg PCGe detector CDEX-1 and the 10 kg detector array CDEX-10, including the detectors, electronics, shielding, and cooling systems. Finally, we discuss the physics goals of the CDEX-1, CDEX-10, and future CDEX-1T detectors.
  • Interference between different emitters in the multi-parton shower is the building block of QCD jet physics in vacuum. The presence of a hot medium made of quarks and gluons is expected to alter this interference pattern. To study such effects, we derive the gluon emission spectrum off an "asymptotic quark" traversing a hot and dense QCD medium at first order in the medium density. The resulting induced gluon distribution is modified when the new interference terms between the initial and final quark are included. We comment on the possible phenomenological consequences of this new contribution for jet observables in heavy-ion collisions.
  • We investigate the color coherence pattern between initial and final state radiation in the presence of a QCD medium. We derive the medium-induced gluon spectrum of an "asymptotic" parton which suffers a hard scattering and subsequently crosses the medium. The angular distribution of the induced gluon spectrum is modified when one includes interference terms between the incoming and the outgoing parton at finite angle between them. The coherent, incoherent and soft limits of the medium-induced gluon spectrum are studied. In the soft limit, we provide a simple and intuitive probabilistic picture which could be of interest for Monte Carlo implementations. The configuration studied here may have phenomenological consequences in high energy nuclear collisions.
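
As a rough cross-check of the figures quoted in the scintillating fiber gross beta abstract above (background of 38.31 cps, efficiency of 0.37 cps per Bq per liter, 100-minute measurement), the sketch below estimates a minimum detectable concentration. It assumes the widely used Currie formula for a paired background measurement, L_D = 2.71 + 4.65·sqrt(B); the abstract does not state which formula the authors used, so this is an illustrative assumption rather than their method. The function name `currie_mdc` and its parameter names are hypothetical.

```python
import math


def currie_mdc(background_cps, efficiency_cps_per_bq_l, live_time_s):
    """Estimate a minimum detectable concentration in Bq/L.

    Assumes Currie's detection limit for a paired background measurement
    (background counted for the same time as the sample):
        L_D = 2.71 + 4.65 * sqrt(B)   [net counts]
    where B is the expected number of background counts.
    This formula is an assumption; the source abstract does not name one.
    """
    background_counts = background_cps * live_time_s
    detectable_net_counts = 2.71 + 4.65 * math.sqrt(background_counts)
    # Convert net counts to concentration via efficiency and counting time.
    return detectable_net_counts / (efficiency_cps_per_bq_l * live_time_s)


# Values quoted in the abstract: 38.31 cps background,
# 0.37 cps per (Bq/L) efficiency, 100 minutes of counting.
mdc = currie_mdc(38.31, 0.37, 100 * 60)
print(f"Estimated detection limit: {mdc:.2f} Bq/L")
```

Under these assumptions the estimate comes out close to the 1.0 Bq per liter quoted in the abstract, which suggests the quoted numbers are mutually consistent with a Currie-style limit.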