• Equivariant cohomology Chern numbers determine equivariant unitary bordism for torus groups(1408.2134)

March 16, 2019 math.AT, math.SG
This paper shows that the integral equivariant cohomology Chern numbers completely determine the equivariant geometric unitary bordism classes of closed unitary $G$-manifolds, where $G$ is a torus. This gives an affirmative answer to the conjecture posed by Guillemin--Ginzburg--Karshon in [20, Remark H.5, $\S3$, Appendix H]. As a further application, we also obtain a satisfactory solution of [20, Question (A), $\S1.1$, Appendix H] on unitary Hamiltonian $G$-manifolds. The key ingredients in the proof are the universal toric genus defined by Buchstaber--Panov--Ray and the Kronecker pairing of bordism and cobordism. Our approach heavily exploits Quillen's geometric interpretation of homotopic unitary cobordism theory. Moreover, this method can also be applied to the study of $({\Bbb Z}_2)^k$-equivariant unoriented bordism, where it recovers the classical result of tom Dieck.
• Social contagions with communication channels alternation on multiplex networks(1708.01724)

Dec. 19, 2018 physics.soc-ph, cs.SI
Internet communication channels, e.g., Facebook, Twitter, and email, are multiplex networks that facilitate interaction and information-sharing among individuals. During brief time periods users often use a single communication channel, but then communication channel alternation (CCA) occurs. This means that we must refine our understanding of the dynamics of social contagions. We propose a non-Markovian behavior spreading model in multiplex networks that takes into account the CCA mechanism, and we develop a generalized edge-based compartmental method to describe the spreading dynamics. Through extensive numerical simulations and theoretical analyses we find that the time delays induced by CCA slow the behavior spreading but do not affect the final adoption size. We also find that the CCA suppresses behavior spreading. On two coupled random regular networks, the adoption size exhibits hybrid growth, i.e., it grows first continuously and then discontinuously with the information transmission probability. CCA in ER-SF multiplex networks, in which the two subnetworks are Erd\H{o}s-R\'{e}nyi (ER) and scale-free (SF), introduces a crossover from continuous to hybrid growth in adoption size versus information transmission probability. Our results extend our understanding of the role of CCA in spreading dynamics, and may elicit further research.
• SSGAN: Secure Steganography Based on Generative Adversarial Networks(1707.01613)

Nov. 24, 2018 cs.CV, cs.MM
In this paper, a novel strategy of Secure Steganography based on Generative Adversarial Networks is proposed to generate suitable and secure covers for steganography. The proposed architecture has one generative network and two discriminative networks. The generative network mainly evaluates the visual quality of the generated images for steganography, and the discriminative networks are utilized to assess their suitability for information hiding. Unlike existing work, which adopts Deep Convolutional Generative Adversarial Networks, we utilize another form of generative adversarial networks. By using this new form of generative adversarial networks, significant improvements are made in convergence speed, training stability, and image quality. Furthermore, a sophisticated steganalysis network is reconstructed for the discriminative network, and the network can better evaluate the performance of the generated images. Numerous experiments are conducted on publicly available datasets to demonstrate the effectiveness and robustness of the proposed method.
• Antifactors of regular bipartite graphs(1511.09277)

May 31, 2020 math.CO
Let $G=(X,Y;E)$ be a bipartite graph, where $X$ and $Y$ are color classes and $E$ is the set of edges of $G$. Lov\'asz and Plummer \cite{LoPl86} asked whether one can decide in polynomial time whether a given bipartite graph $G=(X,Y; E)$ admits a 1-anti-factor, that is, a subset $F$ of $E$ such that $d_F(v)=1$ for all $v\in X$ and $d_F(v)\neq 1$ for all $v\in Y$. Cornu\'ejols \cite{CHP} answered this question in the affirmative. Yu and Liu \cite{YL09} asked whether, for a given integer $k\geq 3$, every $k$-regular bipartite graph contains a 1-anti-factor. This paper answers this question in the affirmative.
• KV-match: A Subsequence Matching Approach Supporting Normalization and Time Warping [Extended Version](1710.00560)

Sept. 10, 2018 cs.DB
The volume of time series data has exploded due to the popularity of new applications, such as data center management and IoT. Subsequence matching is a fundamental task in mining time series data. All index-based approaches only consider raw subsequence matching (RSM) and do not support subsequence normalization. UCR Suite can deal with the normalized subsequence matching problem (NSM), but it needs to scan the full time series. In this paper, we propose a novel problem, named the constrained normalized subsequence matching problem (cNSM), which adds some constraints to the NSM problem. The cNSM problem provides a knob to flexibly control the degree of offset shifting and amplitude scaling, which enables users to build an index to process the query. We propose a new index structure, KV-index, and a matching algorithm, KV-match. With a single index, our approach can support both RSM and cNSM problems under either ED or DTW distance. KV-index is a key-value structure, which can be easily implemented on local files or HBase tables. To support queries of arbitrary lengths, we extend KV-match to KV-match$_{DP}$, which utilizes multiple varied-length indexes to process the query. We conduct extensive experiments on synthetic and real-world datasets. The results verify the effectiveness and efficiency of our approach.
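To make the "knob" on offset shifting and amplitude scaling concrete, here is a minimal brute-force sketch of one plausible reading of cNSM. The exact constraint form (a mean-difference bound `beta` and a standard-deviation ratio bound `alpha`), the function names, and the full scan are all assumptions for illustration; this is not KV-match and uses no index.

```python
import math
from statistics import fmean, pstdev


def znorm(xs):
    """Z-normalize a sequence (assumes it is not constant)."""
    mu, sigma = fmean(xs), pstdev(xs)
    return [(x - mu) / sigma for x in xs]


def cnsm_scan(series, query, eps, alpha, beta):
    """Return start offsets of windows that satisfy the assumed cNSM constraints.

    Assumed constraints (hypothetical reading of the abstract):
      |mean(window) - mean(query)| <= beta          (offset shifting)
      1/alpha <= std(window) / std(query) <= alpha  (amplitude scaling)
      Euclidean distance of z-normalized sequences <= eps
    """
    m = len(query)
    mu_q, sigma_q = fmean(query), pstdev(query)
    q_hat = znorm(query)
    matches = []
    for start in range(len(series) - m + 1):
        window = series[start:start + m]
        mu_s, sigma_s = fmean(window), pstdev(window)
        if sigma_s == 0:
            continue  # constant window cannot be normalized
        if abs(mu_s - mu_q) > beta:
            continue  # shifted too far from the query
        if not (1 / alpha <= sigma_s / sigma_q <= alpha):
            continue  # scaled too much relative to the query
        if math.dist(znorm(window), q_hat) <= eps:
            matches.append(start)
    return matches


series = [0, 1, 0, 10, 11, 10, 0, 5, 0]
query = [0, 1, 0]
tight = cnsm_scan(series, query, eps=0.1, alpha=2, beta=1)
loose = cnsm_scan(series, query, eps=0.1, alpha=2, beta=20)
print(tight, loose)
```

With a tight `beta` only the unshifted occurrence is returned; loosening `beta` also admits the copy shifted by 10, while the copy scaled by 5 stays excluded because of `alpha`.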
• On quaternionic complexes over unimodular quaternionic manifolds(1610.06445)

May 31, 2018 math.DG, math-ph, math.MP, math.CV
Penrose's two-spinor notation for $4$-dimensional Lorentzian manifolds can be extended to two-component notation for quaternionic manifolds, which is a very useful tool for calculation. We construct a family of quaternionic complexes over unimodular quaternionic manifolds by elementary calculation. On complex quaternionic manifolds, which are essentially the complexification of real analytic quaternionic K\"ahler manifolds, the existence of these complexes was established by Baston by using twistor transformations and spectral sequences. Unimodular quaternionic manifolds constitute a nice and large class of quaternionic manifolds: the conformal change of a unimodular quaternionic structure is still unimodular quaternionic, and the complexes over such manifolds are conformally invariant. This class of manifolds, which includes quaternionic K\"ahler manifolds, is the real version of the torsion-free QCFs introduced by Bailey and Eastwood. We show the ellipticity of these complexes and establish their Hodge-type decompositions. We also obtain a Weitzenb\"ock formula to establish vanishing of the cohomology groups of these complexes for quaternionic K\"ahler manifolds with negative scalar curvature.
• On the global $2$-holonomy for a $2$-connection on a $2$-bundle(1512.08680)

May 30, 2018 math-ph, math.MP
A crossed module constitutes a strict $2$-groupoid $\mathcal{G}$ and a $\mathcal{G}$-valued cocycle on a manifold defines a $2$-bundle. A $2$-connection on this $2$-bundle is given by a Lie algebra $\mathfrak g$ valued $1$-form $A$ and a Lie algebra $\mathfrak h$ valued $2$-form $B$ over each coordinate chart together with $2$-gauge transformations between them, which satisfy the compatibility condition. Locally, the path-ordered integral of $A$ gives us the local $1$-holonomy, and the surface-ordered integral of $(A ,B )$ gives us the local $2$-holonomy. The transformation of local $2$-holonomies from one coordinate chart to another is provided by the transition $2$-arrow, which is constructed from a $2$-gauge transformation. We can use the transition $2$-arrows and the $2$-arrows provided by the $\mathcal{G}$-valued cocycle to glue such local $2$-holonomies together to get a global one, which is well defined. Namely we give an explicit algorithm for calculating the global $2$-holonomy.
• The Neumann problem for the $k$-Cauchy-Fueter complexes over $k$-pseudoconvex domains in $\mathbb{R}^4$ and the $L^2$ estimate(1704.02856)

May 20, 2018 math.CV
The $k$-Cauchy-Fueter operators and complexes are quaternionic counterparts of the Cauchy-Riemann operator and the Dolbeault complex in the theory of several complex variables. To develop the function theory of several quaternionic variables, we need to solve the non-homogeneous $k$-Cauchy-Fueter equation over a domain under the compatibility condition, which naturally leads to a Neumann problem. The method of solving the $\overline{\partial}$-Neumann problem in the theory of several complex variables is applied to this Neumann problem. We introduce notions of $k$-plurisubharmonic functions and $k$-pseudoconvex domains, establish the $L^2$ estimate and solve this Neumann problem over $k$-pseudoconvex domains in $\mathbb{R}^4$. Namely, we get a vanishing theorem for the first cohomology groups of the $k$-Cauchy-Fueter complex over such domains.
• Revealing Controllable Anisotropic Magnetoresistance in Spin Orbit Coupled Antiferromagnet Sr2IrO4(1805.02394)

Antiferromagnetic spintronics actively introduces new principles of magnetic memory, in which the most fundamental spin-dependent phenomena, i.e. anisotropic magnetoresistance effects, are governed by an antiferromagnet instead of a ferromagnet. In the general scenario, antiferromagnetic anisotropic magnetoresistance effects mainly stem from the magnetocrystalline anisotropy related to spin-orbit coupling. Here we demonstrate magnetic field driven contour rotation of the fourfold anisotropic magnetoresistance in bare antiferromagnetic Sr2IrO4/SrTiO3 (001) thin films hosting a strong spin-orbit coupling induced Jeff=1/2 Mott state. Concurrently, an intriguing minimum in the magnetoresistance emerges. Through first principles calculations, the band-gap engineering due to rotation of the Ir isospins is revealed to be responsible for these emergent phenomena, in contrast to the traditional scenario, where a relatively more conductive state is usually obtained when the magnetic field is applied along the magnetic easy axis. Our findings demonstrate a new efficient route, i.e. via the novel Jeff=1/2 state, to realize controllable anisotropic magnetoresistance in antiferromagnetic materials.
• Prospects of discovering stable double-heavy tetraquarks at a Tera-$Z$ factory(1805.02535)

May 7, 2018 hep-ph, hep-ex
Motivated by a number of theoretical considerations, predicting the deeply bound double-heavy tetraquarks $T^{\{bb\}}_{[\bar u \bar d]}$, $T^{\{bb\}}_{[\bar u \bar s]}$ and $T^{\{bb\}}_{[\bar d \bar s]}$, we explore the potential of their discovery at Tera-$Z$ factories. Using the process $Z \to b \bar b b \bar b$, we calculate, employing the Monte Carlo generators MadGraph5$\_$aMC@NLO and Pythia6, the phase space configuration in which the~$b b$ pair is likely to fragment as a diquark. In a jet-cone, defined by an invariant mass interval $m_{bb} < M_{T^{\{bb\}}_{[\bar q \bar q']}} + \Delta M$, the sought-after tetraquarks $T^{\{bb\}}_{[\bar q \bar q^\prime]}$ as well as the double-bottom baryons,~$\Xi_{bb}^{0,-}$, and $\Omega_{bb}^-$, can be produced. Using the heavy quark--diquark symmetry, we estimate $\mathcal{B} (Z \to T^{\{bb\}}_{[\bar u \bar d]} + \; \bar b \bar b) = (1.4^{+1.1}_{-0.5}) \times 10^{-6}$, and about a half of this for the $T^{\{bb\}}_{[\bar{u}\bar{s}]}$ and $T^{\{bb\}}_{[\bar d \bar s]}$. We also present an estimate of their lifetimes using the heavy quark expansion, yielding $\tau(T^{\{bb\}}_{[\bar q \bar q^\prime]}) \simeq 800$~fs. Measuring the tetraquark masses would require decays, such as $T^{\{bb\} -}_{[\bar u \bar d]} \to B^- D^- \pi^+$, $T^{\{bb\} -}_{[\bar u \bar d]} \to J/\psi \overline K^0 B^-$, $T^{\{bb\} -}_{[\bar u \bar d]} \to J/\psi K^- \overline B^0$, $T^{\{bb\} -}_{[\bar u \bar s]} \to \Xi_{bc}^0 \Sigma^-$, and $T^{\{bb\} 0}_{[\bar d \bar s]} \to \Xi_{bc}^0 \bar\Sigma^0$, with subsequent decay chains in exclusive non-leptonic final states. We estimate a couple of the decay widths and find that the product branching ratios do not exceed~$10^{-5}$. Hence, a good fraction of these modes will be required for a discovery of $T^{\{bb\}}_{[\bar q \bar q']}$ at a Tera-$Z$ factory.
• Giant planets around FGK stars form probably through core accretion(1805.02721)

May 7, 2018 astro-ph.EP
We present a statistical study of the planet-metallicity (P-M) correlation, by comparing the 744 stars with candidate planets (SWPs) in the Kepler field which have been observed with LAMOST, and a sample of distance-independent, fake "twin" stars in the Kepler field with no planet reported yet (CKSNPs). With these well-defined and carefully selected large samples, we find for the first time a turn-off P-M correlation of Delta [Fe/H]_(SWPs-SNPs), which on average increases from ~0.00+-0.03 dex to 0.06+-0.03 dex, and to 0.12+-0.03 dex for stars with Earth-, Neptune-, and Jupiter-sized planets, respectively, and then declines to ~-0.01+-0.03 dex for more massive planets or brown dwarfs. Moreover, the percentage of those systems with positive Delta[Fe/H] has the same turn-off pattern. We also find that FG-type stars follow this general trend, but K-type stars are different. Moderate metal enhancement (~0.1-0.2 dex) is observed for K-type stars with planets of radii between 2 and 4 Earth radii as compared to CKSNPs, which indicates that much higher metallicities are required for super-Earths and Neptune-sized planets to form around K-type stars. We point out that the P-M correlation is actually metallicity-dependent, i.e., the correlation is positive at solar and super-solar metallicities, and negative at subsolar metallicities. No steady increase of Delta[Fe/H] against planet size is observed for rocky planets, excluding the pollution scenario as a major mechanism for the P-M correlation. All these clues suggest that giant planets probably form differently from rocky planets or more massive planets/brown dwarfs, that the core-accretion scenario is highly favoured, and that high metallicity is a prerequisite for massive planets to form.
• Skeleton-Based Action Recognition with Spatial Reasoning and Temporal Stack Learning(1805.02335)

May 7, 2018 cs.CV
Skeleton-based action recognition has made great progress recently, but many problems remain unsolved. For example, most previous methods model the representations of skeleton sequences without abundant spatial structure information and detailed temporal dynamics features. In this paper, we propose a novel model with spatial reasoning and temporal stack learning (SR-TSL) for skeleton-based action recognition, which consists of a spatial reasoning network (SRN) and a temporal stack learning network (TSLN). The SRN can capture the high-level spatial structural information within each frame by a residual graph neural network, while the TSLN can model the detailed temporal dynamics of skeleton sequences by a composition of multiple skip-clip LSTMs. During training, we propose a clip-based incremental loss to optimize the model. We perform extensive experiments on the SYSU 3D Human-Object Interaction dataset and the NTU RGB+D dataset and verify the effectiveness of each network of our model. The comparison results illustrate that our approach achieves much better results than state-of-the-art methods.
• Double transition of information spreading in a two-layered network(1805.02270)

May 6, 2018 physics.soc-ph
A great deal of significant progress has been seen in the study of information spreading on populations of networked individuals. A common point in many past studies is that there is only one transition in the phase diagram of the final accepted size versus the transmission probability. However, whether other factors alter this phenomenology is still under debate, especially for the case of information spreading through many channels and platforms. In the present study, we adopt a two-layered network to represent the interactions of multiple channels and propose an SAR (Susceptible-Accepted-Recovered) information spreading model. Interestingly, our model shows a novel double transition, consisting of a continuous transition followed by a discontinuous transition in the phase diagram, which originates from two outbreaks between the two layers of the network. Further, we reveal that the key factors are a weak coupling condition between the two layers, a large adoption threshold, and the difference of the degree distributions between the two layers. Then, an edge-based compartmental theory is developed which fully explains all numerical results. Our findings may be of significance for understanding the secondary outbreaks of information in real life.
• A model of spreading of sudden events on social networks(1710.02274)

May 6, 2018 physics.soc-ph, cs.SI
Information spreading has been studied for decades, but its underlying mechanism is still under debate, especially for information that spreads extremely fast through the Internet. By focusing on the information spreading data of six typical events on Sina Weibo, we surprisingly find that the spreading of modern information shows some new features, i.e. it is either extremely fast or slow, depending on the individual events. To understand its mechanism, we present a Susceptible-Accepted-Recovered (SAR) model with both information sensitivity and social reinforcement. Numerical simulations show that the model can reproduce the main spreading patterns of the six typical events. With this model we further reveal that the spreading can be sped up by increasing the strength of either information sensitivity or social reinforcement. Depending on the transmission probability and information sensitivity, the final accepted size can change from a continuous to a discontinuous transition when the strength of the social reinforcement is large. Moreover, an edge-based compartmental theory is presented to explain the numerical results. These findings may be of significance for the control of information spreading in modern society.
• Optimal community structure for social contagions(1805.00360)

May 1, 2018 physics.soc-ph
Community structure is an important factor in the behavior of real-world networks because it strongly affects the stability and thus the phase transition order of the spreading dynamics. We here propose a reversible social contagion model of community networks that includes the factor of social reinforcement. In our model an individual adopts a social contagion when the number of received units of information exceeds its adoption threshold. We use mean-field approximation to describe our proposed model, and the results agree with numerical simulations. The numerical simulations and theoretical analyses both indicate that there is a first-order phase transition in the spreading dynamics, and that a hysteresis loop emerges in the system when there is a variety of initially-adopted seeds. We find an optimal community structure that maximizes spreading dynamics. We also find a rich phase diagram with a triple point that separates the no-diffusion phase from the two diffusion phases.
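The adoption rule stated above (adopt once the number of received units of information exceeds the adoption threshold) can be illustrated with a small deterministic sketch. It deliberately simplifies the model: every adopted neighbor is assumed to deliver exactly one unit per round, adoption is not reversible, and the function name and toy graph are hypothetical, so this is not the authors' model.

```python
def run_threshold_cascade(neighbors, seeds, threshold, max_rounds=100):
    """Synchronous threshold cascade on an undirected graph.

    neighbors: dict mapping node -> list of adjacent nodes.
    A node adopts when its count of adopted neighbors (one unit of
    information each, an assumption) exceeds `threshold`.
    """
    adopted = set(seeds)
    for _ in range(max_rounds):
        newly_adopted = {
            node
            for node in neighbors
            if node not in adopted
            and sum(1 for nb in neighbors[node] if nb in adopted) > threshold
        }
        if not newly_adopted:
            break  # no further change
        adopted |= newly_adopted
    return adopted


# Two 4-node cliques (communities) joined by a single bridge edge 3-4.
neighbors = {
    0: [1, 2, 3], 1: [0, 2, 3], 2: [0, 1, 3], 3: [0, 1, 2, 4],
    4: [3, 5, 6, 7], 5: [4, 6, 7], 6: [4, 5, 7], 7: [4, 5, 6],
}
confined = run_threshold_cascade(neighbors, seeds={0, 1}, threshold=1)
global_spread = run_threshold_cascade(neighbors, seeds={0, 1}, threshold=0)
print(sorted(confined), sorted(global_spread))
```

In this toy graph a threshold of 1 confines adoption to the seed community because the single bridge delivers only one unit, while a threshold of 0 lets the contagion cross into the second community.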
• Configurations and Diagnosis for Ultra-Dense Heterogeneous Networks: From Empirical Measurements to Technical Solutions(1804.10505)

April 27, 2018 cs.NI
The intense demands for higher data rates and ubiquitous network coverage have raised the stakes on developing new network topologies and architectures to meet these ever-increasing demands in a cost-effective manner. The telecommunication industry and international standardization bodies have devoted considerable attention to the deployment of ultra-dense heterogeneous small-scale cells over existing cellular systems. Those small-scale cells, although they provide higher data rates and better indoor coverage by reducing the distance between base stations (BSs) and end users, have raised severe configuration concerns. As deployments become irregular and flexible, inappropriate configurations occur frequently and undermine network reliability and service quality. We envision that the fine-grained characterization of user traffic is a key pillar of diagnosing inappropriate configurations. In this article, we investigate the fine-grained traffic patterns of mobile users by analyzing network data containing millions of subscribers and covering thousands of cells in a large metropolitan area. We characterize traffic patterns and mobility behaviors of users and geospatial properties of cells, and discuss how the heterogeneity of these characteristics affects network configurations and diagnosis in future ultra-dense small cells. Based on these observations from our measurements, we investigate possible models and corresponding challenges, and propose a heterogeneity-aware scheme that takes into account the disparity of user mobility behaviors and geospatial properties among small cells.
• PANDA: Facilitating Usable AI Development(1804.09997)

April 26, 2018 cs.AI, cs.DB
Recent advances in artificial intelligence (AI) and machine learning have created a general perception that AI could be used to solve complex problems, and in some situations AI has been over-hyped as a tool that can be used easily. Unfortunately, the barrier to mass adoption of AI in various business domains is too high because most domain experts have no background in AI. Developing AI applications involves multiple phases, namely data preparation, application modeling, and product deployment. The effort of AI research has been spent mostly on new AI models (in the model training stage) to improve the performance of benchmark tasks such as image recognition. Many other factors such as usability, efficiency and security of AI have not been well addressed, and therefore form a barrier to democratizing AI. Further, for many real world applications such as healthcare and autonomous driving, learning via huge amounts of possibility exploration is not feasible since humans are involved. In many complex applications such as healthcare, subject matter experts (e.g., clinicians) are the ones who appreciate the importance of features that affect health, and their knowledge together with existing knowledge bases is critical to the end results. In this paper, we take a new perspective on developing AI solutions, and present a solution for making AI usable. We hope that this solution will enable all subject matter experts (e.g., clinicians) to exploit AI like data scientists.
• Spectral characterization of the complete graph removing a path of small length(1804.08263)

April 23, 2018 math.CO
A graph $G$ is said to be \emph{determined by its spectrum} if any graph having the same spectrum as $G$ is isomorphic to $G$. Let $K_n \setminus P_{\ell}$ be the graph obtained from $K_n$ by removing the edges of $P_\ell$, where $P_\ell$ is a path of length $\ell-1$ which is a subgraph of the complete graph $K_n$. C\'{a}mara and Haemers~\cite{MC} conjectured that $K_n \setminus P_{\ell}$ is determined by its adjacency spectrum for every $2\leq \ell \leq n$. In this paper we show that the conjecture is true for $7\leq \ell \leq 9$.
• Adversarial Training for Community Question Answer Selection Based on Multi-scale Matching(1804.08058)

April 22, 2018 cs.CL
Community-based question answering (CQA) websites represent an important source of information. As a result, the problem of matching the most valuable answers to their corresponding questions has become an increasingly popular research topic. We frame this task as a binary (relevant/irrelevant) classification problem, and propose a Multi-scale Matching model that inspects the correlation between words and ngrams (word-to-ngrams) of different levels of granularity. This is in addition to the word-to-word correlations that are used in most prior work. In this way, our model is able to capture the rich context information conveyed in ngrams, and can therefore better differentiate good answers from bad ones. Furthermore, we present an adversarial training framework to iteratively generate challenging negative samples to fool the proposed classification model. This is completely different from previous methods, where negative samples are uniformly sampled from the dataset during the training process. The proposed method is evaluated on the SemEval 2017 and Yahoo Answers datasets and achieves state-of-the-art performance.
• Two closed geodesics on compact bumpy Finsler manifolds(1804.08452)

April 20, 2018 math.DG
In this paper, we prove there are at least two closed geodesics on any compact bumpy Finsler $n$-manifold with $n\ge 2$. Thus generically there are at least two closed geodesics on compact Finsler manifolds. Furthermore, there are at least two closed geodesics on any compact Finsler $2$-manifold, and this lower bound is achieved by the Katok 2-sphere, cf. \cite{Kat}.
• Rafiki: Machine Learning as an Analytics Service System(1804.06087)

April 17, 2018 cs.AI, cs.DC, cs.DB
Big data analytics has gained massive momentum in the last few years. Applying machine learning models to big data has become an implicit requirement or an expectation for most analysis tasks, especially in high-stakes applications. Typical applications include sentiment analysis of reviews for analyzing on-line products, image classification in food logging applications for monitoring users' daily intake, and stock movement prediction. Extending traditional database systems to support the above analysis is intriguing but challenging. First, it is almost impossible to implement all machine learning models in the database engines. Second, expert knowledge is required to optimize the training and inference procedures in terms of efficiency and effectiveness, which imposes a heavy burden on the system users. In this paper, we develop and present a system, called Rafiki, to provide the training and inference service of machine learning models, and facilitate complex analytics on top of cloud platforms. Rafiki provides distributed hyper-parameter tuning for the training service, and online ensemble modeling for the inference service, which trades off between latency and accuracy. Experimental results confirm the efficiency, effectiveness, scalability and usability of Rafiki.
• Closed geodesics on positively curved spheres $S^n$ with Finsler metric induced by $(\mathbb{R}P^n,F)$(1804.06193)

April 17, 2018 math.DG, math.DS
It is well known that the $n$-sphere $S^n$ is the universal double covering of the $n$-dimensional real projective space $\mathbb{R}P^n$, and hence any Finsler metric on $\mathbb{R}P^n$ induces a Finsler metric on $S^n$. In this paper, we prove that for every Finsler $(S^n, F)$ with $n\geq3$ whose metric is induced by an irreversible Finsler $(\mathbb{R}P^n,F)$ with reversibility $\lambda$ and flag curvature $K$ satisfying $(\frac{\lambda}{\lambda+1})^2<K\leq 1$, there exist at least $n-1$ prime closed geodesics on $(S^n, F)$. Furthermore, if there exist finitely many distinct closed geodesics on $(S^n, F)$, then at least $2[\frac{n}{2}]-1$ of them are non-hyperbolic.
• Colorings v.s. list colorings of uniform hypergraphs(1804.02852)

April 9, 2018 math.CO
Let $r$ be an integer with $r\ge 2$ and $G$ be a connected $r$-uniform hypergraph with $m$ edges. By refining the broken cycle theorem for hypergraphs, we show that if $k>\frac{m-1}{\ln(1+\sqrt{2})}\approx 1.135 (m-1)$ then the $k$-list assignment of $G$ admitting the fewest colorings is the constant list assignment. This extends the previous results of Donner, Thomassen and the current authors for graphs.
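As a quick numerical check of the constant quoted above, the following sketch evaluates $1/\ln(1+\sqrt{2})$ and the smallest integer $k$ strictly above the bound for a given number of edges $m$. The helper names are hypothetical and serve only this illustration.

```python
import math

# The constant multiplying (m - 1) in the bound on k.
CONSTANT = 1 / math.log(1 + math.sqrt(2))


def list_coloring_threshold(m: int) -> float:
    """Return (m - 1) / ln(1 + sqrt(2)) for a hypergraph with m edges."""
    return (m - 1) * CONSTANT


def smallest_k(m: int) -> int:
    """Smallest integer k with k > (m - 1) / ln(1 + sqrt(2))."""
    return math.floor(list_coloring_threshold(m)) + 1


print(round(CONSTANT, 3))  # 1.135
print(smallest_k(10))      # 11
```

For example, with $m=10$ edges the bound is about $10.21$, so the statement applies from $k=11$ onward.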
• The Flash ADC system and PMT waveform reconstruction for the Daya Bay Experiment(1707.03699)

April 9, 2018 physics.ins-det
To better understand the energy response of the Antineutrino Detector (AD), the Daya Bay Reactor Neutrino Experiment installed a full Flash ADC readout system on one AD that allowed for simultaneous data taking with the current readout system. This paper presents the design, data acquisition, and simulation of the Flash ADC system, and focuses on the PMT waveform reconstruction algorithms. For liquid scintillator calorimetry, the most critical requirement for waveform reconstruction is linearity. Several common reconstruction methods were tested, but their linearity performance was not satisfactory. A new method based on the deconvolution technique was developed with 1% residual non-linearity, which fulfills the requirement. The performance was validated with both data and Monte Carlo (MC) simulations, and 1% consistency between them has been achieved.
• Discovering Communities of Malapps on Android-based Mobile Cyber-physical Systems(1804.01641)

April 5, 2018 cs.CR
Android-based devices like smartphones have become ideal mobile cyber-physical systems (MCPS) due to their powerful processors and variety of sensors. In recent years, an explosively and continuously growing number of malicious applications (malapps) have posed a great threat to Android-based MCPS as well as users' privacy. The effective detection of malapps is an emerging yet crucial task. How to establish relationships among malapps, discover their potential communities, and explore their evolution process has become a challenging issue in the effective detection of malapps. To deal with this issue, in this work we propose an automated community detection method for Android malapps by building a relation graph based on their static features. First, we construct a large feature set to profile the behaviors of malapps. Second, we propose an E-N algorithm that combines the epsilon graph and the k-nearest neighbor (k-NN) graph for graph construction. It solves the problem of an incomplete graph caused by the epsilon method and the problem of noise generated by the k-NN graph. Finally, a community detection method, Infomap, is employed to explore the underlying structures of the relation graph and obtain the communities of malapps. We evaluate our community detection method with 3996 malapp samples. Extensive experimental results show that our method outperforms traditional clustering methods and achieves the best performance, with a Rand statistic of 94.93% and an accuracy of 79.53%.
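The abstract does not spell out how the epsilon graph and the k-NN graph are combined, so the following is only a sketch of one plausible combination, not the E-N algorithm itself. It assumes Euclidean distance on feature vectors, keeps all epsilon-graph edges, and links only nodes left isolated by the epsilon rule to their `k` nearest neighbors; the function name and parameters are hypothetical.

```python
import math


def build_relation_graph(points, eps, k):
    """Combine an epsilon graph with a k-NN fallback (assumed scheme).

    points: list of equal-length numeric tuples (feature vectors).
    Returns a set of undirected edges (i, j) with i < j.
    """
    n = len(points)
    edges = set()
    # Epsilon graph: connect every pair within distance eps.
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(points[i], points[j]) <= eps:
                edges.add((i, j))
    # k-NN fallback only for nodes the epsilon rule left isolated,
    # so dense regions gain no extra (potentially noisy) k-NN edges.
    connected = {v for edge in edges for v in edge}
    for i in range(n):
        if i in connected:
            continue
        nearest = sorted(
            (j for j in range(n) if j != i),
            key=lambda j: math.dist(points[i], points[j]),
        )[:k]
        for j in nearest:
            edges.add((min(i, j), max(i, j)))
    return edges


points = [(0, 0), (0, 1), (1, 0), (10, 0.5)]
edges = build_relation_graph(points, eps=1.5, k=1)
print(sorted(edges))
```

Here the three nearby points form a triangle under the epsilon rule, and the distant fourth point, which a pure epsilon graph would leave disconnected, is attached to its single nearest neighbor.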