• ### Modulation of Solar Wind Energy Flux Input on Global Tropical Cyclone Activity(1709.05917)

Studies of the Sun-climate connection have been carried out for several decades, and almost all of them have focused on the effects of total solar irradiance. As the second major terrestrial energy source from outer space, the solar wind energy flux exhibits more significant long-term variations. However, its link to global climate change has rarely been studied and remains a mystery. As a fundamental and important aspect of the Earth's weather and climate system, tropical cyclone activity has been attracting more and more attention. Here we investigate the possible modulation of global tropical cyclone activity by the total energy flux input from the solar wind into the Earth's magnetosphere during 1963--2012. From a global perspective, the accumulated cyclone energy increased gradually from 1963 and started to decrease after 1994. Compared to previously used parameters, e.g., the sunspot number, the total solar irradiance, the solar F10.7 irradiation, the tropical sea surface temperature, and the Southern Oscillation Index, the total solar wind energy flux input exhibits a better correlation with global tropical cyclone activity. Furthermore, tropical cyclones seem to be more intense during higher geomagnetic activity. A plausible modulation mechanism is thus proposed to link this terrestrial weather phenomenon to the seemingly unrelated solar wind energy input.
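As a rough illustration of the two quantities this abstract relies on, the sketch below computes an accumulated cyclone energy (ACE) value and a Pearson correlation coefficient. It assumes the conventional ACE definition ($10^{-4}$ times the sum of squared 6-hourly maximum sustained winds in knots, counted only at tropical-storm strength or above); the 34 kt threshold, the function names, and the wind values are illustrative assumptions and are not taken from the paper.

```python
from math import sqrt


def accumulated_cyclone_energy(winds_kt, threshold_kt=34.0):
    """ACE in units of 1e4 kt^2 from 6-hourly maximum sustained winds (knots).

    Only records at or above the (assumed) tropical-storm threshold count.
    """
    return 1e-4 * sum(v * v for v in winds_kt if v >= threshold_kt)


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length series."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Made-up 6-hourly wind record for a single storm (knots).
storm_winds_kt = [30, 35, 50, 65, 40, 25]
storm_ace = accumulated_cyclone_energy(storm_winds_kt)
```

In practice the yearly ACE series would be summed over all storms in all basins and then passed to `pearson_r` together with a yearly series of the candidate driver (for example, the solar wind energy flux input).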
• ### The DArk Matter Particle Explorer mission(1706.08453)

The DArk Matter Particle Explorer (DAMPE), one of the four space science missions within the framework of the Strategic Pioneer Program on Space Science of the Chinese Academy of Sciences, is a general-purpose high-energy cosmic-ray and gamma-ray observatory, which was successfully launched on December 17th, 2015 from the Jiuquan Satellite Launch Center. The DAMPE scientific objectives include the study of galactic cosmic rays up to $\sim 10$ TeV and hundreds of TeV for electrons/gammas and nuclei respectively, and the search for dark matter signatures in their spectra. In this paper we illustrate the layout of the DAMPE instrument, and discuss the results of beam tests and calibrations performed on the ground. Finally we present the expected performance in space and give an overview of the mission's key scientific goals.
• ### Evolution of Alfvénic fluctuations inside an interplanetary coronal mass ejection and their contributions to local plasma heating: Joint observations from 1.0 AU to 5.4 AU(1709.03639)

Sept. 12, 2017 physics.space-ph
Directly tracking an interplanetary coronal mass ejection (ICME) with widely separated spacecraft is a great challenge. However, such an event provides a good opportunity to study directly the evolution of embedded Alfvénic fluctuations (AFs) inside an ICME and their contributions to local plasma heating. In this study, an ICME observed by Wind at 1.0 au on March 4-6, 1998 is tracked to the location of Ulysses at 5.4 au. AFs are commonly found inside the ICME at 1.0 au, with an occurrence rate of 21.7% and at broadband frequencies from 4$\times 10^{-4}$ to 5$\times 10^{-2}$ Hz. When the ICME propagates to 5.4 au, the Alfvénicity decreases significantly, and AFs are rare and found only at a few localized frequencies, with the occurrence rate decreasing to 3.0%. At the same time, the magnetic field intensity in the AF-rich region shows extra magnetic dissipation beyond the ICME expansion effect. The energetics of the ICME at different radial distances is also investigated here. Under similar magnetic field intensities at 1.0 au, the turbulence cascade rate in the AF-rich region is much larger than that in the AF-poor region. Moreover, where AFs are lacking, the cascade rate is maintained as the magnetic field intensity decreases. However, where many AFs exist, it reduces significantly as the AFs disappear. The turbulence cascade dissipation rate within the ICME is inferred to be 1622.3 $J\cdot kg^{-1}\cdot s^{-1}$, which satisfies the requirement of the local ICME plasma heating rate, 1653.2 $J\cdot kg^{-1}\cdot s^{-1}$. We suggest that AF dissipation is responsible for the extra magnetic dissipation and local plasma heating inside the ICME.
• ### Offline software for the DAMPE experiment(1604.03219)

June 5, 2017 hep-ex, cs.SE, astro-ph.IM
A software system has been developed for the DArk Matter Particle Explorer (DAMPE) mission, a satellite-based experiment. The DAMPE software is mainly written in C++ and steered using Python scripts. This article presents an overview of the DAMPE offline software, including the major architecture design and the specific implementation for simulation, calibration and reconstruction. The whole system has been successfully applied to DAMPE data analysis, and some results obtained with it from simulation and beam test experiments are presented.
• ### Plasma heating inside ICMEs by Alfvenic fluctuations dissipation(1608.04823)

Aug. 17, 2016 physics.space-ph
The nonlinear cascade of low-frequency Alfvenic fluctuations (AFs) is regarded as a candidate energy source for heating plasma during the non-adiabatic expansion of interplanetary coronal mass ejections (ICMEs). However, AFs inside ICMEs have seldom been reported in the literature. In this study, we investigate AFs inside ICMEs using observations from Voyager 2 between 1 and 6 au. It is found that AFs with a high degree of Alfvenicity occur frequently inside ICMEs: in almost all the identified ICMEs (30 out of 33) and during 12.6% of the ICME time interval. As ICMEs expand and move outward, the percentage of AF duration generally decays linearly. The occurrence rate of AFs inside ICMEs is much lower than that in the ambient solar wind, especially within 4 au. AFs inside ICMEs are more frequently present in the center and at the boundaries of ICMEs. In addition, the proton temperature inside ICMEs has a similar distribution. These findings suggest a significant contribution of AFs to local plasma heating inside ICMEs.
• ### Properties of post-shock solar wind deduced from geomagnetic indices responses after sudden impulses(1608.04551)

Aug. 16, 2016 physics.space-ph
Interplanetary (IP) shocks play a key role in causing global dynamic changes of the geospace environment. From the perspective of the Solar-Terrestrial relationship, it is of great importance to estimate the properties of the post-shock solar wind simply and accurately. Motivated by this, we performed a statistical analysis of IP shocks during 1998-2008, focusing on the significantly different responses of two widely used geomagnetic indices (SYMH and AL) to the passage of two types of IP shocks. For the IP shocks with northward IMF (91 cases), the SYMH index remains at a high level for a long time after the sudden impulse (SI). Meanwhile, the change of the AL index is relatively small, with a mean value of only -29 nT. However, for the IP shocks with southward IMF (92 cases), the SYMH index suddenly decreases at a certain rate after the SI, and the change of the AL index is much more significant, at -316 nT. Furthermore, the change rate of the SYMH index after the SI is found to be linearly correlated with the post-shock reconnection E-field (E$_{KL}$). Based on these facts, an inversion model of the post-shock IMF orientation and E$_{KL}$ is developed. The model validity is also confirmed by studying 68 IP shocks in the period 2009-2013. The inversion accuracy of the IMF orientation is 88.24%, and the inversion efficiency of E$_{KL}$ is as high as 78%.
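For orientation, the subscript in E$_{KL}$ commonly denotes the Kan-Lee reconnection electric field; assuming that is the quantity meant here, it is usually written in terms of the solar wind speed $V$, the transverse IMF magnitude $B_T$, and the IMF clock angle $\theta$. The linear relation reported in the abstract can then be sketched with two fit coefficients $a$ and $b$, which are placeholders and are not given in the abstract:

$$E_{KL} = V\,B_T\,\sin^{2}\!\left(\frac{\theta}{2}\right), \qquad B_T = \sqrt{B_y^{2} + B_z^{2}}, \qquad \frac{d\,(\mathrm{SYMH})}{dt} \approx a\,E_{KL} + b .$$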
• ### Weighted SAMGSR: combining significance analysis of microarray-gene set reduction algorithm with pathway topology-based weights to select relevant genes(1605.03697)

May 12, 2016 stat.ME
Introduction: It has been demonstrated that a pathway-based feature selection method, which incorporates biological information within pathways into the process of feature selection, usually outperforms a gene-based feature selection algorithm in terms of predictive accuracy, stability, and biological interpretation. The significance analysis of microarray-gene set reduction algorithm (SAMGSR), an extension of a gene set analysis method with further reduction of the selected pathways to their respective core subsets, can be regarded as a pathway-based feature selection method. Results and Discussion: In SAMGSR, whether a gene is selected is mainly determined by its expression difference between the phenotypes, and partially by the number of pathways to which the gene belongs, while the topology information among pathways is ignored. In this study, we propose a weighted version of the SAMGSR algorithm by constructing weights based on the connectivity among genes and then incorporating these weights into the test statistic. Conclusions: Using both simulated and real-world data, we evaluate the performance of the proposed SAMGSR extension and demonstrate that gene connectivity is indeed informative for feature selection.
• ### Temperature Dependence Calibration and Correction of the DAMPE BGO Electromagnetic Calorimeter(1604.08060)

April 27, 2016 hep-ex, physics.ins-det
A BGO electromagnetic calorimeter (ECAL) has been built for the DArk Matter Particle Explorer (DAMPE) mission. The effect of temperature on the BGO ECAL was investigated with a thermal vacuum experiment. The light output of a BGO crystal depends significantly on temperature. The temperature coefficient of each BGO crystal bar has been calibrated, and a correction method is also presented in this paper.
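The abstract does not spell out the correction method, so the following is only a minimal sketch of one plausible approach, assuming the light output of a bar varies linearly with temperature around a reference temperature. The function names, the 20 °C reference, and the numbers in the example are hypothetical and are not taken from the paper.

```python
def fit_temperature_coefficient(temps_c, signals, t_ref_c=20.0):
    """Least-squares fit of signal = a + b*T for one crystal bar.

    Returns the relative coefficient b / signal(t_ref_c), i.e. the fractional
    change in light output per degree Celsius (assumed linear model).
    """
    n = len(temps_c)
    mean_t = sum(temps_c) / n
    mean_s = sum(signals) / n
    slope = (sum((t - mean_t) * (s - mean_s) for t, s in zip(temps_c, signals))
             / sum((t - mean_t) ** 2 for t in temps_c))
    intercept = mean_s - slope * mean_t
    signal_at_ref = intercept + slope * t_ref_c
    return slope / signal_at_ref


def correct_signal(signal, temp_c, coeff, t_ref_c=20.0):
    """Scale a measured signal back to what it would be at t_ref_c."""
    return signal / (1.0 + coeff * (temp_c - t_ref_c))


# Synthetic calibration points for one bar: -1% per degree C around 20 C.
calib_temps_c = [0.0, 10.0, 20.0, 30.0]
calib_signals = [1200.0, 1100.0, 1000.0, 900.0]
bar_coeff = fit_temperature_coefficient(calib_temps_c, calib_signals)
corrected = correct_signal(900.0, 30.0, bar_coeff)
```

With one coefficient stored per crystal bar, the same `correct_signal` call could be applied to every bar using the temperature recorded nearest to it.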
• ### The calibration and electron energy reconstruction of the BGO ECAL of the DAMPE detector(1602.07015)

Feb. 23, 2016 physics.ins-det
The DArk Matter Particle Explorer (DAMPE) is a space experiment designed to search for dark matter indirectly by measuring the spectra of photons, electrons, and positrons up to 10 TeV. The BGO electromagnetic calorimeter (ECAL) is its main sub-detector for energy measurement. In this paper, the instrumentation and development of the BGO ECAL are briefly described. The calibration on the ground with cosmic rays and beam particles, including the pedestal, minimum ionizing particle (MIP) peak, dynode ratio, and attenuation length, is discussed in detail. The energy reconstruction results for electrons from the beam test are also presented.
• ### On Sun-to-Earth Propagation of Coronal Mass Ejections: 2. Slow Events and Comparison with Others(1512.07949)

Dec. 25, 2015 physics.space-ph, astro-ph.SR
As a follow-up study on Sun-to-Earth propagation of fast coronal mass ejections (CMEs), we examine the Sun-to-Earth characteristics of slow CMEs combining heliospheric imaging and in situ observations. Three events of particular interest, the 2010 June 16, 2011 March 25 and 2012 September 25 CMEs, are selected for this study. We compare slow CMEs with fast and intermediate-speed events, and obtain key results complementing the attempt of Liu et al. (2013) to create a general picture of CME Sun-to-Earth propagation: (1) the Sun-to-Earth propagation of a typical slow CME can be approximately described by two phases, a gradual acceleration out to about 20-30 solar radii, followed by a nearly invariant speed around the average solar wind level; (2) comparison between different types of CMEs indicates that faster CMEs tend to accelerate and decelerate more rapidly and have shorter cessation distances for the acceleration and deceleration; (3) both intermediate-speed and slow CMEs would have a speed comparable to the average solar wind level before reaching 1 AU; (4) slow CMEs have a high potential to interact with other solar wind structures in the Sun-Earth space due to their slow motion, providing critical ingredients to enhance space weather; and (5) the slow CMEs studied here lack strong magnetic fields at the Earth but tend to preserve a flux-rope structure with its axis generally perpendicular to the radial direction from the Sun. We also suggest a "best" strategy for the application of a triangulation concept in determining CME Sun-to-Earth kinematics, which helps to clarify confusion about CME geometry assumptions in the triangulation and to improve CME analysis and observations.
• ### Feature selection for longitudinal microarray data by adapting a pathway analysis method(1511.08272)

Nov. 26, 2015 q-bio.QM, stat.ME, stat.AP
Introduction: Feature selection and gene set analysis are of increasing interest in bioinformatics. While these two approaches have been developed for different purposes, we describe how some gene set analysis methods can be used to conduct feature selection. Here we adapt the gene set analysis method, significance analysis of microarray gene set reduction (SAMGSR), for feature selection, and propose two extensions, simple SAMGSR and two-level SAMGSR, to identify relevant features for longitudinal microarray data. Results and Discussion: When applied to a real-world application, both simple and two-level SAMGSR work comparably well. Using simulated data, we further demonstrate that both SAMGSR extensions have the ability to identify the true relevant genes. If the relevant genes are not highly correlated with the irrelevant ones, the final models given by the two SAMGSR extensions are parsimonious as well. Conclusions: By adapting SAMGSR for feature selection and applying the proposed algorithms to a longitudinal gene expression dataset, we demonstrate that a gene set analysis method can be used for the purpose of feature selection. We believe this work paves the way for more research to bridge feature selection and gene set analysis with the development of novel algorithms.
• ### A study of energy correction for the electron beam data in the BGO ECAL of the DAMPE(1511.02998)

Nov. 10, 2015 hep-ex, physics.ins-det
The DArk Matter Particle Explorer (DAMPE) is an orbital experiment that aims to search for dark matter indirectly by measuring the spectra of photons, electrons and positrons originating from deep space. The BGO electromagnetic calorimeter is one of the key sub-detectors of DAMPE, and is designed for high-energy measurement with a large dynamic range from 5 GeV to 10 TeV. In this paper, several methods for energy correction are discussed and tested in order to reconstruct the primary energy of the incident electrons. Different methods are chosen for the appropriate energy ranges. The results of Geant4 simulation and beam test data (at CERN) are presented.
• ### No evidence of histology subtype-specific prognostic signatures among lung adenocarcinoma and squamous cell carcinoma patients at early stages(1408.2616)

Aug. 6, 2015 q-bio.QM, stat.AP, q-bio.GN
Background: Non-small cell lung cancer (NSCLC) is the predominant histological type of lung cancer, accounting for up to 85% of cases. Disease stage is commonly used to determine adjuvant treatment eligibility of NSCLC patients; however, it is an imprecise predictor of the prognosis of an individual patient. Currently, many researchers resort to microarray technology for identifying relevant genetic prognostic markers, with particular attention to trimming or extending a Cox regression model. Among NSCLC, adenocarcinoma (AC) and squamous cell carcinoma (SCC) are the two major histology subtypes. It has been demonstrated that there are fundamental differences in the underlying mechanisms between them, which motivated us to postulate that there might be specific genes relevant to the prognosis of each histology subtype. Results: In this article, we propose a simple filter feature selection algorithm with a Cox regression model as the base. Applying this method to real-world microarray data, we found no evidence to support the existence of a histology-specific prognostic gene signature. Nevertheless, a 31-gene prognostic signature for the early-stage AC and SCC samples is obtained, which performs comparably to other relevant signatures. Conclusions: Our proposal is conceptually simple and straightforward to implement. Therefore, it is expected that other researchers, especially those with less statistical knowledge and experience, can readily adapt this method to test their own research hypotheses.
• ### In situ Evidence of Breaking the Ion Frozen-in Condition via the Non-gyrotropic Pressure Effect in Magnetic Reconnection(1504.06053)

For magnetic reconnection to proceed, the frozen-in condition for both the ion fluid and the electron fluid in a localized diffusion region must be violated by inertial effects, thermal pressure effects, or inter-species collisions. It has been unclear which underlying effects unfreeze the ion fluid in the diffusion region. By analyzing in situ THEMIS spacecraft measurements at the dayside magnetopause, we present clear evidence that the off-diagonal components of the ion pressure tensor are mainly responsible for breaking the ion frozen-in condition in reconnection. The off-diagonal pressure tensor, which corresponds to a non-gyrotropic pressure effect, is a fluid manifestation of ion demagnetization in the diffusion region. From the perspective of the ion momentum equation, the reported non-gyrotropic ion pressure tensor is a fundamental aspect in specifying the reconnection electric field that controls how quickly reconnection proceeds.
• ### Scalable Topical Phrase Mining from Text Corpora(1406.6312)

Nov. 19, 2014 cs.CL, cs.LG, cs.IR
While most topic modeling algorithms model text corpora with unigrams, human interpretation often relies on inherent grouping of terms into phrases. As such, we consider the problem of discovering topical phrases of mixed lengths. Existing work either performs post processing to the inference results of unigram-based topic models, or utilizes complex n-gram-discovery topic models. These methods generally produce low-quality topical phrases or suffer from poor scalability on even moderately-sized datasets. We propose a different approach that is both computationally efficient and effective. Our solution combines a novel phrase mining framework to segment a document into single and multi-word phrases, and a new topic model that operates on the induced document partition. Our approach discovers high quality topical phrases with negligible extra cost to the bag-of-words topic model in a variety of datasets including research publication titles, abstracts, reviews, and news articles.
• ### Propagation of the 2012 March Coronal Mass Ejections from the Sun to Heliopause(1405.6086)

In 2012 March the Sun exhibited extraordinary activity. In particular, the active region NOAA AR 11429 emitted a series of large coronal mass ejections (CMEs) which were imaged by STEREO as it rotated with the Sun from east to west. These sustained eruptions are expected to generate a global shell of disturbed material sweeping through the heliosphere. A cluster of shocks and interplanetary CMEs (ICMEs) were observed near the Earth, and are propagated outward from 1 AU using an MHD model. The transient streams interact with each other, which erases memory of the source and results in a large merged interaction region (MIR) with a preceding shock. The MHD model predicts that the shock and MIR would reach 120 AU around 2013 April 22, which agrees well with the period of radio emissions and the time of a transient disturbance in galactic cosmic rays detected by Voyager 1. These results are important for understanding the "fate" of CMEs in the outer heliosphere and provide confidence that the heliopause is located around 120 AU from the Sun.
• ### Scalable and Robust Construction of Topical Hierarchies(1403.3460)

March 13, 2014 cs.DB, cs.CL, cs.LG, cs.IR
Automated generation of high-quality topical hierarchies for a text collection is a dream problem in knowledge engineering with many valuable applications. In this paper a scalable and robust algorithm is proposed for constructing a hierarchy of topics from a text collection. We divide and conquer the problem using a top-down recursive framework, based on a tensor orthogonal decomposition technique. We solve a critical challenge in performing scalable inference for our newly designed hierarchical topic model. Experiments with various real-world datasets illustrate its ability to generate robust, high-quality hierarchies efficiently. Our method reduces the construction time by several orders of magnitude, and its robustness makes it possible for users to interactively revise the hierarchy.
• ### KERT: Automatic Extraction and Ranking of Topical Keyphrases from Content-Representative Document Titles(1306.0271)

June 3, 2013 cs.LG, cs.IR
We introduce KERT (Keyphrase Extraction and Ranking by Topic), a framework for topical keyphrase generation and ranking. By shifting from the unigram-centric traditional methods of unsupervised keyphrase extraction to a phrase-centric approach, we are able to directly compare and rank phrases of different lengths. We construct a topical keyphrase ranking function which implements the four criteria that represent high quality topical keyphrases (coverage, purity, phraseness, and completeness). The effectiveness of our approach is demonstrated on two collections of content-representative titles in the domains of Computer Science and Physics.