• ### Billion-scale Network Embedding with Iterative Random Projection(1805.02396)

May 7, 2018 cs.SI, cs.LG, stat.ML
Network embedding has attracted considerable research attention recently. However, the existing methods are incapable of handling billion-scale networks, because they are computationally expensive and, at the same time, difficult to be accelerated by distributed computing schemes. To address these problems, we propose RandNE, a novel and simple billion-scale network embedding method. Specifically, we propose a Gaussian random projection approach to map the network into a low-dimensional embedding space while preserving the high-order proximities between nodes. To reduce the time complexity, we design an iterative projection procedure to avoid the explicit calculation of the high-order proximities. Theoretical analysis shows that our method is extremely efficient, and friendly to distributed computing schemes without any communication cost in the calculation. We demonstrate the efficacy of RandNE over state-of-the-art methods in network reconstruction and link prediction tasks on multiple datasets with different scales, ranging from thousands to billions of nodes and edges.
• ### Isogeometric Least-squares Collocation Method with Consistency and Convergence Analysis(1601.07244)

April 18, 2018 math.NA
In this paper, we present the isogeometric least-squares collocation (IGA-L) method, which determines the numerical solution by making the approximate differential operator fit the real differential operator in a least-squares sense. The number of collocation points employed in IGA-L can be larger than that of the unknowns. Theoretical analysis and numerical examples presented in this paper show the superiority of IGA-L over state-of-the-art collocation methods. First, a small increase in the number of collocation points in IGA-L leads to a large improvement in the accuracy of its numerical solution. Second, IGA-L method is more flexible and more stable, because the number of collocation points in IGA-L is variable. Third, IGA-L is convergent in some cases of singular parameterization. Moreover, the consistency and convergence analysis are also developed in this paper.
• ### Fokker-Planck equation driven by asymmetric L\'evy motion(1803.00923)

March 2, 2018 math.NA, math.DS
Non-Gaussian L\'evy noises are present in many models for understanding underlining principles of physics, finance, biology and more. In this work, we consider the Fokker-Planck equation(FPE) due to one-dimensional asymmetric L\'evy motion, which is a nonlocal partial differential equation. We present an accurate numerical quadrature for the singular integrals in the nonlocal FPE and develop a fast summation method to reduce the order of the complexity from $O(J^2)$ to $O(J\log J)$ in one time-step, where $J$ is the number of unknowns. We also provide conditions under which the numerical schemes satisfy maximum principle. Our numerical method is validated by comparing with exact solutions for special cases. We also discuss the properties of the probability density functions and the effects of various factors on the solutions, including the stability index, the skewness parameter, the drift term, the Gaussian and non-Gaussian noises and the domain size.
• ### Tracking Multiple Moving Objects Using Unscented Kalman Filtering Techniques(1802.01235)

Feb. 5, 2018 cs.CV
It is an important task to reliably detect and track multiple moving objects for video surveillance and monitoring. However, when occlusion occurs in nonlinear motion scenarios, many existing methods often fail to continuously track multiple moving objects of interest. In this paper we propose an effective approach for detection and tracking of multiple moving objects with occlusion. Moving targets are initially detected using a simple yet efficient block matching technique, providing rough location information for multiple object tracking. More accurate location information is then estimated for each moving object by a nonlinear tracking algorithm. Considering the ambiguity caused by the occlusion among multiple moving objects, we apply an unscented Kalman filtering (UKF) technique for reliable object detection and tracking. Different from conventional Kalman filtering (KF), which cannot achieve the optimal estimation in nonlinear tracking scenarios, UKF can be used to track both linear and nonlinear motions due to the unscented transform. Further, it estimates the velocity information for each object to assist to the object detection algorithm, effectively delineating multiple moving objects of occlusion. The experimental results demonstrate that the proposed method can correctly detect and track multiple moving objects with nonlinear motion patterns and occlusions.
• ### Structural Deep Embedding for Hyper-Networks(1711.10146)

Jan. 31, 2018 cs.SI
Network embedding has recently attracted lots of attentions in data mining. Existing network embedding methods mainly focus on networks with pairwise relationships. In real world, however, the relationships among data points could go beyond pairwise, i.e., three or more objects are involved in each relationship represented by a hyperedge, thus forming hyper-networks. These hyper-networks pose great challenges to existing network embedding methods when the hyperedges are indecomposable, that is to say, any subset of nodes in a hyperedge cannot form another hyperedge. These indecomposable hyperedges are especially common in heterogeneous networks. In this paper, we propose a novel Deep Hyper-Network Embedding (DHNE) model to embed hyper-networks with indecomposable hyperedges. More specifically, we theoretically prove that any linear similarity metric in embedding space commonly used in existing methods cannot maintain the indecomposibility property in hyper-networks, and thus propose a new deep model to realize a non-linear tuplewise similarity function while preserving both local and global proximities in the formed embedding space. We conduct extensive experiments on four different types of hyper-networks, including a GPS network, an online social network, a drug network and a semantic network. The empirical results demonstrate that our method can significantly and consistently outperform the state-of-the-art algorithms.
• ### Semi-Supervised Convolutional Neural Networks for Human Activity Recognition(1801.07827)

Jan. 22, 2018 cs.LG, stat.ML
Labeled data used for training activity recognition classifiers are usually limited in terms of size and diversity. Thus, the learned model may not generalize well when used in real-world use cases. Semi-supervised learning augments labeled examples with unlabeled examples, often resulting in improved performance. However, the semi-supervised methods studied in the activity recognition literatures assume that feature engineering is already done. In this paper, we lift this assumption and present two semi-supervised methods based on convolutional neural networks (CNNs) to learn discriminative hidden features. Our semi-supervised CNNs learn from both labeled and unlabeled data while also performing feature learning on raw sensor data. In experiments on three real world datasets, we show that our CNNs outperform supervised methods and traditional semi-supervised learning methods by up to 18% in mean F1-score (Fm).
• ### Uniaxial and hydrostatic pressure effects in alpha-RuCl3 single crystals via thermal-expansion measurements(1712.08511)

Dec. 22, 2017 cond-mat.str-el
We present high-resolution thermal-expansion and specific-heat measurements of single crystalline alpha-RuCl3. An extremely hysteretic structural transition expanding over 100 K is observed by thermal- expansion along both crystallographic axes, which we attribute to a change of stacking sequence of the RuCl3 layers. Three magnetic transitions are observed, which we link to the different stacking sequences. Using our data and thermodynamic relations, we derive the uniaxial and hydrostatic pressure derivatives of all three magnetic transitions. Our results demonstrate that magnetic order should be totally suppressed by very moderate pressures of 0.3 GPa to 0.9 GPa. Finally, we discuss why our results differ from recent hydrostatic pressure measurements and suggest a possible route to reaching the spin-liquid state in alpha-RuCl3.
• ### On the Statistical Efficiency of Compositional Nonparametric Prediction(1704.01896)

Oct. 20, 2017 cs.LG, stat.ML
In this paper, we propose a compositional nonparametric method in which a model is expressed as a labeled binary tree of $2k+1$ nodes, where each node is either a summation, a multiplication, or the application of one of the $q$ basis functions to one of the $p$ covariates. We show that in order to recover a labeled binary tree from a given dataset, the sufficient number of samples is $O(k\log(pq)+\log(k!))$, and the necessary number of samples is $\Omega(k\log (pq)-\log(k!))$. We further propose a greedy algorithm for regression in order to validate our theoretical findings through synthetic experiments.
• ### Optimal Global Test for Functional Regression(1710.02269)

Oct. 6, 2017 stat.ME
This paper studies the optimal testing for the nullity of the slope function in the functional linear model using smoothing splines. We propose a generalized likelihood ratio test based on an easily implementable data-driven estimate. The quality of the test is measured by the minimal distance between the null and the alternative set that still allows a possible test. The lower bound of the minimax decay rate of this distance is derived, and test with a distance that decays faster than the lower bound would be impossible. We show that the minimax optimal rate is jointly determined by the smoothing spline kernel and the covariance kernel. It is shown that our test attains this optimal rate. Simulations are carried out to confirm the finite-sample performance of our test as well as to illustrate the theoretical results. Finally, we apply our test to study the effect of the trajectories of oxides of nitrogen ($\text{NO}_{\text{x}}$) on the level of ozone ($\text{O}_3$).
• ### Finite Difference Methods for the generator of 1D asymmetric alpha-stable L\'{e}vy motions(1705.03357)

Aug. 18, 2017 math.NA
Several finite difference methods are proposed for the infinitesimal generator of 1D asymmetric $\alpha$-stable L\'{e}vy motions, based on the fact that the operator becomes a multiplier in the spectral space. These methods take the general form of a discrete convolution, and the coefficients (or the weights) in the convolution are chosen to approximate the exact multiplier after appropriate transform. The accuracy and the associated advantages/disadvantages are also discussed, providing some guidance on the choice of the right scheme for practical problems, like in the calculation of mean exit time for random processes governed by general asymmetric $\alpha$-stable motions.
• ### Deep Co-Space: Sample Mining Across Feature Transformation for Semi-Supervised Learning(1707.09119)

July 28, 2017 cs.CV
Aiming at improving performance of visual classification in a cost-effective manner, this paper proposes an incremental semi-supervised learning paradigm called Deep Co-Space (DCS). Unlike many conventional semi-supervised learning methods usually performing within a fixed feature space, our DCS gradually propagates information from labeled samples to unlabeled ones along with deep feature learning. We regard deep feature learning as a series of steps pursuing feature transformation, i.e., projecting the samples from a previous space into a new one, which tends to select the reliable unlabeled samples with respect to this setting. Specifically, for each unlabeled image instance, we measure its reliability by calculating the category variations of feature transformation from two different neighborhood variation perspectives, and merged them into an unified sample mining criterion deriving from Hellinger distance. Then, those samples keeping stable correlation to their neighboring samples (i.e., having small category variation in distribution) across the successive feature space transformation, are automatically received labels and incorporated into the model for incrementally training in terms of classification. Our extensive experiments on standard image classification benchmarks (e.g., Caltech-256 and SUN-397) demonstrate that the proposed framework is capable of effectively mining from large-scale unlabeled images, which boosts image classification performance and achieves promising results compared to other semi-supervised learning methods.
• ### Electronic structure and direct observation of ferrimagnetism in multiferroic hexagonal YbFeO3(1703.08482)

June 3, 2017 cond-mat.mtrl-sci
The magnetic interaction between rare-earth and Fe ions in hexagonal rare-earth ferrites (h-REFeO3), may amplify the weak ferromagnetic moment on Fe, making these materials more appealing as multiferroics. To elucidate the interaction strength between the rare-earth and Fe ions as well as the magnetic moment of the rare-earth ions, element specific magnetic characterization is needed. Using X-ray magnetic circular dichroism, we have studied the ferrimagnetism in h-YbFeO3 by measuring the magnetization of Fe and Yb separately. The results directly show anti-alignment of magnetization of Yb and Fe ions in h-YbFeO3 at low temperature, with an exchange field on Yb of about 17 kOe. The magnetic moment of Yb is about 1.6 \muB at low-temperature, significantly reduced compared with the 4.5 \muB moment of a free Yb3+. In addition, the saturation magnetization of Fe in h-YbFeO3 has a sizable enhancement compared with that in h-LuFeO3. These findings directly demonstrate that ferrimagnetic order exists in h-YbFeO3; they also account for the enhancement of magnetization and the reduction of coercivity in h-YbFeO3 compared with those in h-LuFeO3 at low temperature, suggesting an important role for the rare-earth ions in tuning the multiferroic properties of h-REFeO3.
• ### Stochastic Quasi-Newton Methods for Nonconvex Stochastic Optimization(1607.01231)

May 21, 2017 cs.NA, math.OC, cs.LG, stat.ML
In this paper we study stochastic quasi-Newton methods for nonconvex stochastic optimization, where we assume that noisy information about the gradients of the objective function is available via a stochastic first-order oracle (SFO). We propose a general framework for such methods, for which we prove almost sure convergence to stationary points and analyze its worst-case iteration complexity. When a randomly chosen iterate is returned as the output of such an algorithm, we prove that in the worst-case, the SFO-calls complexity is $O(\epsilon^{-2})$ to ensure that the expectation of the squared norm of the gradient is smaller than the given accuracy tolerance $\epsilon$. We also propose a specific algorithm, namely a stochastic damped L-BFGS (SdLBFGS) method, that falls under the proposed framework. {Moreover, we incorporate the SVRG variance reduction technique into the proposed SdLBFGS method, and analyze its SFO-calls complexity. Numerical results on a nonconvex binary classification problem using SVM, and a multiclass classification problem using neural networks are reported.
• ### Detecting Drivable Area for Self-driving Cars: An Unsupervised Approach(1705.00451)

May 1, 2017 cs.CV
It has been well recognized that detecting drivable area is central to self-driving cars. Most of existing methods attempt to locate road surface by using lane line, thereby restricting to drivable area on which have a clear lane mark. This paper proposes an unsupervised approach for detecting drivable area utilizing both image data from a monocular camera and point cloud data from a 3D-LIDAR scanner. Our approach locates initial drivable areas based on a "direction ray map" obtained by image-LIDAR data fusion. Besides, a fusion of the feature level is also applied for more robust performance. Once the initial drivable areas are described by different features, the feature fusion problem is formulated as a Markov network and a belief propagation algorithm is developed to perform the model inference. Our approach is unsupervised and avoids common hypothesis, yet gets state-of-the-art results on ROAD-KITTI benchmark. Experiments show that our unsupervised approach is efficient and robust for detecting drivable area for self-driving cars.
• ### Rooted trees with the same plucking polynomial(1702.02004)

Feb. 7, 2017 math.CO, math.GT
In this paper we give a sufficient and necessary condition for two rooted trees with the same plucking polynomial. Furthermore, we give a criteria for a sequence of non-negative integers to be realized as a rooted tree.
• ### Numerical algorithms for mean exit time and escape probability of stochastic systems with asymmetric L\'evy motion(1702.00600)

Feb. 2, 2017 math.DS
For non-Gaussian stochastic dynamical systems, mean exit time and escape probability are important deterministic quantities, which can be obtained from integro-differential (nonlocal) equations. We develop an efficient and convergent numerical method for the mean first exit time and escape probability for stochastic systems with an asymmetric L\'evy motion, and analyze the properties of the solutions of the nonlocal equations. We also investigate the effects of different system factors on the mean exit time and escape probability, including the skewness parameter, the size of the domain, the drift term and the intensity of Gaussian and non-Gaussian noises. We find that the behavior of the mean exit time and the escape probability has dramatic difference at the boundary of the domain when the index of stability crosses the critical value of one.
• ### Separation of inverse spin Hall effect and anomalous Nernst effect in ferromagnetic metals(1701.05320)

Jan. 19, 2017 cond-mat.mes-hall
Inverse spin Hall effect (ISHE) in ferromagnetic metals (FM) can also be used to detect the spin current generated by longitudinal spin Seebeck effect in a ferromagnetic insulator YIG. However, anomalous Nernst effect(ANE) in FM itself always mixes in the thermal voltage. In this work, the exchange bias structure (NiFe/IrMn)is employed to separate these two effects. The exchange bias structure provides a shift field to NiFe, which can separate the magnetization of NiFe from that of YIG in M-H loops. As a result, the ISHE related to magnetization of YIG and the ANE related to the magnetization of NiFe can be separated as well. By comparison with Pt, a relative spin Hall angle of NiFe (0.87) is obtained, which results from the partially filled 3d orbits and the ferromagnetic order. This work puts forward a practical method to use the ISHE in ferromagnetic metals towards future spintronic applications.
• ### Search for torsion in Khovanov homology(1701.04924)

Jan. 18, 2017 math.GT
In the Khovanov homology of links, presence of $\mathbb{Z}_2$-torsion is a very common phenomenon. Finite number of examples of knots with $\mathbb{Z}_n$-torsion for $n>2$ were also known, none for $n>8$. In this paper, we prove that there are infinite families of links whose Khovanov homology contains $\mathbb{Z}_n$-torsion for $2 < n < 9$ and $\mathbb{Z}_{2^s}$-torsion for $s < 24$. We also introduce $4$-braid links with $\mathbb{Z}_3$-torsion which are counterexamples to the PS braid conjecture. We also provide an infinite family of knots with $\mathbb{Z}_5$-torsion in reduced Khovanov homology and $\mathbb{Z}_3$-torsion in odd Khovanov homology.
• ### Equivalence of two definitions of set-theoretic Yang-Baxter homology(1611.01178)

Nov. 3, 2016 math.GT
In 2004, Carter, Elhamdadi and Saito defined a homology theory for set-theoretic Yang-Baxter operators(we will call it the "algebraic" version in this article). In 2012, Przytycki defined another homology theory for pre-Yang-Baxter operators which has a nice graphic visualization(we will call it the "graphic" version in this article). We show that they are equivalent. The "graphic" homology is also defined for pre-Yang-Baxter operators, and we give some examples of it's one-term and two-term homologies. In the two-term case, we have found torsion in homology of Yang-Baxter operator that yields the Jones polynomial.
• ### Effects of biaxial strain on the improper multiferroicity in h-LuFeO3 films(1610.00073)

Oct. 1, 2016 cond-mat.mtrl-sci
Elastic strain is potentially an important approach in tuning the properties of the improperly multiferroic hexagonal ferrites, the details of which have however been elusive due to the experimental difficulties. Employing the method of restrained thermal expansion, we have studied the effect of isothermal biaxial strain in the basal plane of h-LuFeO3 (001) films. The results indicate that a compressive biaxial strain significantly enhances the ferrodistortion, and the effect is larger at higher temperatures. The compressive biaxial strain and the enhanced ferrodistortion together, cause an increase in the electric polarization and a reduction in the canting of the weak ferromagnetic moments in h-LuFeO3, according to our first principle calculations. These findings are important for understanding the strain effect as well as the coupling between the lattice and the improper multiferroicity in h-LuFeO3. The experimental elucidation of the strain effect in h-LuFeO3 films also suggests that the restrained thermal expansion can be a viable method to unravel the strain effect in many other epitaxial thin film materials.
• ### Room Temperature Ferroelectricity in Continuous Croconic Acid Thin Films(1608.02998)

Aug. 9, 2016 cond-mat.mtrl-sci
Ferroelectricity at room temperature has been demonstrated in nanometer-thin quasi 2D croconic acid thin films, by the polarization hysteresis loop measurements in macroscopic capacitor geometry, along with observation and manipulation of the nanoscale domain structure by piezoresponse force microscopy. The fabrication of continuous thin films of the hydrogen-bonded croconic acid was achieved by the suppression of the thermal decomposition using low evaporation temperatures in high vacuum, combined with growth conditions far from thermal equilibrium. For nominal coverages >=20 nm, quasi 2D and polycrystalline films, with an average grain size of 50-100 nm and 3.5 nm roughness, can be obtained. Spontaneous ferroelectric domain structures of the thin films have been observed and appear to correlate with the grain patterns. The application of this solvent-free growth protocol may be a key to the development of flexible organic ferroelectric thin films for electronic applications.
• ### Local Region Sparse Learning for Image-on-Scalar Regression(1605.08501)

May 27, 2016 cs.LG, stat.ML
Identification of regions of interest (ROI) associated with certain disease has a great impact on public health. Imposing sparsity of pixel values and extracting active regions simultaneously greatly complicate the image analysis. We address these challenges by introducing a novel region-selection penalty in the framework of image-on-scalar regression. Our penalty combines the Smoothly Clipped Absolute Deviation (SCAD) regularization, enforcing sparsity, and the SCAD of total variation (TV) regularization, enforcing spatial contiguity, into one group, which segments contiguous spatial regions against zero-valued background. Efficient algorithm is based on the alternative direction method of multipliers (ADMM) which decomposes the non-convex problem into two iterative optimization problems with explicit solutions. Another virtue of the proposed method is that a divide and conquer learning algorithm is developed, thereby allowing scaling to large images. Several examples are presented and the experimental results are compared with other state-of-the-art approaches.
• ### Simultaneous Sparse Dictionary Learning and Pruning(1605.07870)

May 25, 2016 stat.ML
Dictionary learning is a cutting-edge area in imaging processing, that has recently led to state-of-the-art results in many signal processing tasks. The idea is to conduct a linear decomposition of a signal using a few atoms of a learned and usually over-completed dictionary instead of a pre-defined basis. Determining a proper size of the to-be-learned dictionary is crucial for both precision and efficiency of the process, while most of the existing dictionary learning algorithms choose the size quite arbitrarily. In this paper, a novel regularization method called the Grouped Smoothly Clipped Absolute Deviation (GSCAD) is employed for learning the dictionary. The proposed method can simultaneously learn a sparse dictionary and select the appropriate dictionary size. Efficient algorithm is designed based on the alternative direction method of multipliers (ADMM) which decomposes the joint non-convex problem with the non-convex penalty into two convex optimization problems. Several examples are presented for image denoising and the experimental results are compared with other state-of-the-art approaches.
• ### Penalty Methods with Stochastic Approximation for Stochastic Nonlinear Programming(1312.2690)

May 19, 2016 math.OC
In this paper, we propose a class of penalty methods with stochastic approximation for solving stochastic nonlinear programming problems. We assume that only noisy gradients or function values of the objective function are available via calls to a stochastic first-order or zeroth-order oracle. In each iteration of the proposed methods, we minimize an exact penalty function which is nonsmooth and nonconvex with only stochastic first-order or zeroth-order information available. Stochastic approximation algorithms are presented for solving this particular subproblem. The worst-case complexity of calls to the stochastic first-order (or zeroth-order) oracle for the proposed penalty methods for obtaining an $\epsilon$-stochastic critical point is analyzed.
• ### On the Structural Origin of the Single-ion Magnetic Anisotropy in LuFeO3(1605.01111)

May 3, 2016 cond-mat.mtrl-sci
Electronic structures for the conduction bands of both hexagonal and orthorhombic LuFeO3 thin films have been measured using x-ray absorption spectroscopy at oxygen K (O K) edge. Dramatic differences in both the spectra shape and the linear dichroism are observed. These differences in the spectra can be explained using the differences in crystal field splitting of the metal (Fe and Lu) electronic states and the differences in O 2p-Fe 3d and O 2p-Lu 5d hybridizations. While the oxidation states has not changed, the spectra are sensitive to the changes in the local environments of the Fe3+ and Lu3+ sites in the hexagonal and orthorhombic structures. Using the crystal-field splitting and the hybridizations that are extracted from the measured electronic structures and the structural distortion information, we derived the occupancies of the spin minority states in Fe3+, which are non-zero and uneven. The single ion anisotropy on Fe3+ sites is found to originate from these uneven occupancies of the spin minority states via spin-orbit coupling in LuFeO3.