• We study the central problem in data privacy: how to share data with an analyst while providing both privacy and utility guarantees to the user that owns the data. In this setting, we present an estimation-theoretic analysis of the privacy-utility trade-off (PUT). Here, an analyst is allowed to reconstruct (in a mean-squared error sense) certain functions of the data (utility), while other private functions should not be reconstructed with distortion below a certain threshold (privacy). We demonstrate how $\chi^2$-information captures the fundamental PUT in this case and provide bounds for the best PUT. We propose a convex program to compute privacy-assuring mappings when the functions to be disclosed and hidden are known a priori and the data distribution is known. We derive lower bounds on the minimum mean-squared error of estimating a target function from the disclosed data and evaluate the robustness of our approach when an empirical distribution is used to compute the privacy-assuring mappings instead of the true data distribution. We illustrate the proposed approach through two numerical experiments.
  • Network representation learning (NRL) aims to learn low-dimensional vectors for vertices in a network. Most existing NRL methods focus on learning representations from local context of vertices (such as their neighbors). Nevertheless, vertices in many complex networks also exhibit significant global patterns widely known as communities. It's intuitive that vertices in the same community tend to connect densely and share common attributes. These patterns are expected to improve NRL and benefit relevant evaluation tasks, such as link prediction and vertex classification. Inspired by the analogy between network representation learning and text modeling, we propose a unified NRL framework by introducing community information of vertices, named as Community-enhanced Network Representation Learning (CNRL). CNRL simultaneously detects community distribution of each vertex and learns embeddings of both vertices and communities. Moreover, the proposed community enhancement mechanism can be applied to various existing NRL models. In experiments, we evaluate our model on vertex classification, link prediction, and community detection using several real-world datasets. The results demonstrate that CNRL significantly and consistently outperforms other state-of-the-art methods while verifying our assumptions on the correlations between vertices and communities.
  • Atomistic/continuum coupling methods aim to achieve optimal balance between accuracy and efficiency. Adaptivity is the key for the efficient implementation of such methods. In this paper, we carry out a rigorous a posteriori analysis of the residual, the stability constant, and the error bound, for a consistent atomistic/continuum coupling method in 2D. We design and implement the corresponding adaptive mesh refinement algorithm, and the convergence rate with respect to degrees of freedom is optimal compare with a priori error estimates.
  • In the context of machine learning, disparate impact refers to a form of systematic discrimination whereby the output distribution of a model depends on the value of a sensitive attribute (e.g., race or gender). In this paper, we propose an information-theoretic framework to analyze the disparate impact of a binary classification model. We view the model as a fixed channel, and quantify disparate impact as the divergence in output distributions over two groups. Our aim is to find a correction function that can perturb the input distributions of each group to align their output distributions. We present an optimization problem that can be solved to obtain a correction function that will make the output distributions statistically indistinguishable. We derive closed-form expressions to efficiently compute the correction function, and demonstrate the benefits of our framework on a recidivism prediction problem based on the ProPublica COMPAS dataset.
  • Consider a data publishing setting for a data set with public and private features. The objective of the publisher is to maximize the amount of information about the public features in a revealed data set, while keeping the information leaked about the private features bounded. The goal of this paper is to analyze the performance of privacy mechanisms that are constructed to match the distribution learned from the data set. Two distinct scenarios are considered: (i) mechanisms are designed to provide a privacy guarantee for the learned distribution; and (ii) mechanisms are designed to provide a privacy guarantee for every distribution in a given neighborhood of the learned distribution. For the first scenario, given any privacy mechanism, upper bounds on the difference between the privacy-utility guarantees for the learned and true distributions are presented. In the second scenario, upper bounds on the reduction in utility incurred by providing a uniform privacy guarantee are developed.
  • A new neural circuit is proposed by considering the myelin as an inductor. This new neural circuit can explain why the lump-parameter circuit used in previous C-P theory is valid. Meanwhile, it provides a new explanation of the biological function of myelin for neural signal propagation. Furthermore, a new model for magnetic nerve stimulation is built and all phenomena in magnetic nerve stimulation can be well explained. Based on this model, the coil structure can be optimized.
  • In compressed sensing, the l0-norm minimization of sparse signal reconstruction is NP-hard. Recent work shows that compared with the best convex relaxation (l1-norm), nonconvex penalties can better approximate the l0-norm and can reconstruct the signal based on fewer observations. In this paper, the original problem is relaxed by using minimax concave penalty (MCP). Then alternating direction method of multipliers (ADMM) and modified iterative hard thresholding method are used to solve the problem. Under certain reasonable assumptions, the global convergence of the proposed method is proved. The parameter setting is also discussed. Finally, through simulations and comparisons with several state-of-the-art algorithms, the effectiveness of proposed method is confirmed.
  • A new theory, named the Circuit-Probability theory, is proposed to unveil the secret of electrical nerve stimulation, essentially explain the nonlinear and resonant phenomena observed when neural and non-neural tissues are electrically stimulated. For the explanation of frequency dependent response, an inductor is involved in the neural circuit model. Furthermore, predicted response to varied stimulation strength is calculated stochastically. Based on this theory, many empirical models, such as strength-duration relationship and LNP model, can be theoretically explained, derived, and amended. This theory can explain the complex nonlinear interactions in electrical stimulation and fit in vivo experiment data on stimulation-responses of many experiments. As such, the C-P theory should be able to guide novel experiments and more importantly, offer an in-depth physical understanding of the neural tissue. As a promising neural model, we can even further explore the more accurate circuit configuration and probability equation to better describe the electrical stimulation of neural tissues in the future.
  • The Alternating Direction Method of Multipliers (ADMM) decoding of Low Density Parity Check (LDPC) codes has received many attentions due to its excellent performance at the error floor region. In this paper, we develop a parameter-free decoder based on Linear Program (LP) decoding by replacing the binary constraint with the intersection of a box and an $\ell_p$ sphere. An efficient $\ell_2$-box ADMM is designed to handle this model in a distributed fashion. Numerical experiments demonstrate that our decoder attains better adaptability to different Signal-to-Noise Ratio and channels.
  • The gravitational wave data of GW170817 favor the equation of state (EoS) models that predict compact neutron stars (NSs), consistent with the radius constraints from X-ray observations. Motivated by such a remarkable progress, we examine the fate of the remnants formed in NS mergers and focus on the roles of the angular momentum and the mass distribution of the binary NSs. In the mass shedding limit (for which the dimensionless angular momentum equals to the Keplerian value, i.e., $j=j_{\rm Kep}$), the adopted { seven EoS models, except H4 and ALF2,} yield supramassive NSs in more than half of the mergers. However, for $j\lesssim 0.7j_{\rm Kep}$, the presence or absence of a non-negligible fraction of supramassive NSs formed in the mergers depends sensitively on both the EoS and the mass distribution of the binary systems. The NS mergers with a total gravitational mass $\leq 2.6M_\odot$ are found to be able to shed valuable light on both the EoS model and the angular momentum of the remnants if supramassive NSs are still absent. We have also discussed the uncertainty on estimating the maximum gravitational mass of non-rotating NSs ($M_{\rm max}$) due to the unknown $j$ of the pre-collapse remnants. With the data of GW170817 and the assumption of the mass loss of $0.03M_\odot$, we have $M_{\rm max}<(2.19,~2.32)M_\odot$ (90\% confidence level) for $j=(1.0,~0.8)j_{\rm Kep}$, respectively.
  • Face recognition has made extraordinary progress owing to the advancement of deep convolutional neural networks (CNNs). The central task of face recognition, including face verification and identification, involves face feature discrimination. However, the traditional softmax loss of deep CNNs usually lacks the power of discrimination. To address this problem, recently several loss functions such as center loss, large margin softmax loss, and angular softmax loss have been proposed. All these improved losses share the same idea: maximizing inter-class variance and minimizing intra-class variance. In this paper, we propose a novel loss function, namely large margin cosine loss (LMCL), to realize this idea from a different perspective. More specifically, we reformulate the softmax loss as a cosine loss by $L_2$ normalizing both features and weight vectors to remove radial variations, based on which a cosine margin term is introduced to further maximize the decision margin in the angular space. As a result, minimum intra-class variance and maximum inter-class variance are achieved by virtue of normalization and cosine decision margin maximization. We refer to our model trained with LMCL as CosFace. Extensive experimental evaluations are conducted on the most popular public-domain face recognition datasets such as MegaFace Challenge, Youtube Faces (YTF) and Labeled Face in the Wild (LFW). We achieve the state-of-the-art performance on these benchmarks, which confirms the effectiveness of our proposed approach.
  • Although Faster R-CNN and its variants have shown promising performance in object detection, they only exploit simple first-order representation of object proposals for final classification and regression. Recent classification methods demonstrate that the integration of high-order statistics into deep convolutional neural networks can achieve impressive improvement, but their goal is to model whole images by discarding location information so that they cannot be directly adopted to object detection. In this paper, we make an attempt to exploit high-order statistics in object detection, aiming at generating more discriminative representations for proposals to enhance the performance of detectors. To this end, we propose a novel Multi-scale Location-aware Kernel Representation (MLKP) to capture high-order statistics of deep features in proposals. Our MLKP can be efficiently computed on a modified multi-scale feature map using a low-dimensional polynomial kernel approximation.Moreover, different from existing orderless global representations based on high-order statistics, our proposed MLKP is location retentive and sensitive so that it can be flexibly adopted to object detection. Through integrating into Faster R-CNN schema, the proposed MLKP achieves very competitive performance with state-of-the-art methods, and improves Faster R-CNN by 4.9% (mAP), 4.7% (mAP) and 5.0% (AP at IOU=[0.5:0.05:0.95]) on PASCAL VOC 2007, VOC 2012 and MS COCO benchmarks, respectively. Code is available at: https://github.com/Hwang64/MLKP.
  • This paper focuses on the design of sequential quadratic optimization (commonly known as SQP) methods for solving large-scale nonlinear optimization problems. The most computationally demanding aspect of such an approach is the computation of the search direction during each iteration, for which we consider the use of matrix-free methods. In particular, we develop a method that requires an inexact solve of a single QP subproblem to establish the convergence of the overall SQP method. It is known that SQP methods can be plagued by poor behavior of the global convergence mechanism. To confront this issue, we propose the use of an exact penalty function with a dynamic penalty parameter updating strategy to be employed within the subproblem solver in such a way that the resulting search direction predicts progress toward both feasibility and optimality. We present our parameter updating strategy and prove that, under reasonable assumptions, the strategy does not modify the penalty parameter unnecessarily. We also discuss a matrix-free subproblem solver in which our updating strategy can be incorporated. We close the paper with a discussion of the results of numerical experiments that illustrate the benefits of our proposed techniques.
  • The jet breaks in the afterglow lightcurves of short gamma-ray bursts (SGRBs), rarely detected so far, are crucial for estimating the half-opening angles of the ejecta ($\theta_{\rm j}$) and hence the neutron star merger rate. In this work we report the detection of jet decline behaviors in GRB 150424A and GRB 160821B and find $\theta_{\rm j}\sim 0.1$ rad. Together with five events reported before 2015 and other three "identified" recently (GRB 050709, GRB 060614 and GRB 140903A), we have a sample consisting of nine SGRBs and one long-short GRB with reasonably estimated $\theta_{\rm j}$. In particular, three {\it Swift} bursts in the sample have redshifts $z\leq 0.2$, with which we estimate the local neutron star merger rate density {to be $\sim 1109^{+1432}_{-657}~{\rm Gpc^{-3}~yr^{-1}}$ or $162^{+140}_{-83} {\rm Gpc^{-3}yr^{-1}}$ if the narrowly-beamed GRB 061201 is excluded}. Inspired by the typical $\theta_{\rm j}\sim 0.1$ rad found currently, we further investigate whether the off-beam GRBs (in the uniform jet model) or the off-axis events (in the structured jet model) can significantly enhance the GRB/GW association or not. For the former the enhancement is at most moderate, while for the latter the enhancement can be much greater and a high GRB/GW association probability of $\sim 10\%$ is possible. We also show that the data of GRB 160821B may contain a macronova/kilonova emission component with a temperature of $\sim 3100$ K at $\sim 3.6$ days after the burst and more data are needed to ultimately clarify.
  • The Hopf actions on vertex operator algebras are investigated. If the action is semisimple, a Schur-Weyl type decomposition is obtained. When the Hopf algebra is finite dimensional and the action is faithful, the action is a group action. Moreover if the Hopf algebra is finite dimensional and the action is semisimple and inner faithful, the action is also a group action. In this case, inner faithfulness is equivalent to faithfulness.
  • This paper considers an application of model predictive control to automotive air conditioning (A/C) system in future connected and automated vehicles (CAVs) with battery electric or hybrid electric powertrains. A control-oriented prediction model for A/C system is proposed, identified, and validated against a higher fidelity simulation model (CoolSim). Based on the developed prediction model, a nonlinear model predictive control (NMPC) problem is formulated and solved online to minimize the energy consumption of the A/C system. Simulation results illustrate the desirable characteristics of the proposed NMPC solution such as being able to enforce physical constraints of the A/C system and maintain cabin temperature within a specified range. Moreover, it is shown that by utilizing the vehicle speed preview and through coordinated adjustment of the cabin temperature constraints, energy efficiency improvements of up to 9% can be achieved.
  • Real-time bidding (RTB) is almost the most important mechanism in online display advertising, where proper bid for each page view plays a vital and essential role for good marketing results. Budget constrained bidding is a typical scenario in RTB mechanism where the advertisers hope to maximize total value of winning impressions under a pre-set budget constraint. However, the optimal strategy is hard to be derived due to complexity and volatility of the auction environment. To address the challenges, in this paper, we formulate budget constrained bidding as a Markov Decision Process. Quite different from prior model-based work, we propose a novel framework based on model-free reinforcement learning which sequentially regulates the bidding parameter rather than directly producing bid. Along this line, we further innovate a reward function which deploys a deep neural network to learn appropriate reward and thus leads the agent to deliver the optimal policy effectively; we also design an adaptive $\epsilon$-greedy strategy which adjusts the exploration behaviour dynamically and further improves the performance. Experimental results on real dataset demonstrate the effectiveness of our framework.
  • In this paper, we derive a temporal arbitrage policy for storage via reinforcement learning. Real-time price arbitrage is an important source of revenue for storage units, but designing good strategies have proven to be difficult because of the highly uncertain nature of the prices. Instead of current model predictive or dynamic programming approaches, we use reinforcement learning to design an optimal arbitrage policy. This policy is learned through repeated charge and discharge actions performed by the storage unit through updating a value matrix. We design a reward function that does not only reflect the instant profit of charge/discharge decisions but also incorporate the history information. Simulation results demonstrate that our designed reward function leads to significant performance improvement compared with existing algorithms.
  • Large-scale rumor spreading could pose severe social and economic damages. The emergence of online social networks along with the new media can even make rumor spreading more severe. Effective control of rumor spreading is of theoretical and practical significance. This paper takes the first step to understand how the blockchain technology can help limit the spread of rumors. Specifically, we develop a new paradigm for social networks embedded with the blockchain technology, which employs decentralized contracts to motivate trust networks as well as secure information exchange contract. We design a blockchain-based sequential algorithm which utilizes virtual information credits for each peer-to-peer information exchange. We validate the effectiveness of the blockchain-enabled social network on limiting the rumor spreading. Simulation results validate our algorithm design in avoiding rapid and intense rumor spreading, and motivate better mechanism design for trusted social networks.
  • An electric field can induce or modify optical birefringence in both the isotropic and nematic phases of liquid crystals (LCs). In the isotropic phase, the electric field induces birefringence with an optical axis along the field. The phenomenon is known as the Kerr effect. In the nematic, the analog of the Kerr effect is the change of existing birefringence through nanosecond electric modification of order parameters (NEMOP) that does not require realignment of the optic axis. The utility of both effects for practical applications is challenged by a relatively weak birefringence induced by the field. We address the issue by adding a non-mesogenic additive 2, 3-dicyano-4-pentyloxyphenyl 4'-pentyloxybenzoate (DPP) with a large transverse dipole moment to mesogenic materials in order to enhance their negative dielectric anisotropy. The DPP doping substantially increases the field-induced birefringence in both NEMOP and Kerr effects, up to 0.02. The doping also slows down the switching processes, but this effect can be compensated by rising working temperatures, if necessary. The enhancement of field induced birefringence by the non-mesogenic dopant paves the way for practical applications of nanosecond electro-optic effects.
  • In recent years, RTB(Real Time Bidding) becomes a popular online advertisement trading method. During the auction, each DSP(Demand Side Platform) is supposed to evaluate current opportunity and respond with an ad and corresponding bid price. It's essential for DSP to find an optimal ad selection and bid price determination strategy which maximizes revenue or performance under budget and ROI(Return On Investment) constraints in P4P(Pay For Performance) or P4U(Pay For Usage) mode. We solve this problem by 1) formalizing the DSP problem as a constrained optimization problem, 2) proposing the augmented MMKP(Multi-choice Multi-dimensional Knapsack Problem) with general solution, 3) and demonstrating the DSP problem is a special case of the augmented MMKP and deriving specialized strategy. Our strategy is verified through simulation and outperforms state-of-the-art strategies in real application. To the best of our knowledge, our solution is the first dual based DSP bidding framework that is derived from strict second price auction assumption and generally applicable to the multiple ads scenario with various objectives and constraints.
  • On 17 August 2017, a gravitational wave event (GW170817) and an associated short gamma-ray burst (GRB 170817A) from a binary neutron star merger had been detected. The followup optical/infrared observations also identified the macronova/kilonova emission (AT2017gfo). In this work we discuss some implications of the remarkable GW170817/GRB 170817A/AT2017gfo association. We show that the $\sim 1.7$s time delay between the gravitational wave (GW) and GRB signals imposes very tight constraint on the superluminal movement of gravitational waves (i.e., the relative departure of GW velocity from the speed of light is $\leq 4.3\times 10^{-16}$) or the possible violation of weak equivalence principle (i.e., the difference of the gamma-ray and GW trajectories in the gravitational field of the galaxy and the local universe should be within a factor of $\sim 3.4\times 10^{-9}$). The so-called Dark Matter Emulators and a class of contender models for cosmic acceleration ("Covariant Galileon") are ruled out, too. The successful identification of Lanthanide elements in the macronova/kilonova spectrum also excludes the possibility that the progenitors of GRB 170817A are a binary strange star system. The high neutron star merger rate (inferred from both the local sGRB data and the gravitational wave data) together with the significant ejected mass strongly suggest that such mergers are the prime sites of heavy r-process nucleosynthesis.
  • With the goal of making high-resolution forecasts of regional rainfall, precipitation nowcasting has become an important and fundamental technology underlying various public services ranging from rainstorm warnings to flight safety. Recently, the Convolutional LSTM (ConvLSTM) model has been shown to outperform traditional optical flow based methods for precipitation nowcasting, suggesting that deep learning models have a huge potential for solving the problem. However, the convolutional recurrence structure in ConvLSTM-based models is location-invariant while natural motion and transformation (e.g., rotation) are location-variant in general. Furthermore, since deep-learning-based precipitation nowcasting is a newly emerging area, clear evaluation protocols have not yet been established. To address these problems, we propose both a new model and a benchmark for precipitation nowcasting. Specifically, we go beyond ConvLSTM and propose the Trajectory GRU (TrajGRU) model that can actively learn the location-variant structure for recurrent connections. Besides, we provide a benchmark that includes a real-world large-scale dataset from the Hong Kong Observatory, a new training loss, and a comprehensive evaluation protocol to facilitate future research and gauge the state of the art.
  • We present a comprehensive set of measurements of optical, dielectric, diamagnetic, elastic and viscous properties in the nematic (N) phase formed by a liquid crystalline dimer. The studied dimer, 1,7-bis-4-(4-cyanobiphenyl) heptane (CB7CB), is composed of two rigid rod-like cyanobiphenyl segments connected by a flexible aliphatic link with seven methyl groups. CB7CB and other nematic dimers are of interest due to their tendency to adopt bent configurations and to form two states possessing a modulated nematic director structure, namely, the twist bend nematic, NTB, and the oblique helicoidal cholesteric, ChOH, which occurs when the achiral dimer is doped with a chiral additive and exposed to an external electric or magnetic field. We characterize the material parameters as functions of temperature in the entire temperature range of the N phase, including the pre-transitional regions near the N-NTB and N-to-isotropic (I) transitions. The splay constant K11 is determined by two direct and independent techniques, namely, detection of the Frederiks transition and measurement of director fluctuation amplitudes by dynamic light scattering (DLS). The bend K33 and twist K22 constants are measured by DLS. K33 being the smallest of the three constants, shows a strong non-monotonous temperature dependence with a negative slope in both N-I and N-NTB pretransitional regions. The measured ratio K11/K22 is larger than 2 in the entire nematic temperature range. The orientational viscosities associated with splay, twist and bend fluctuations in the N phase are comparable to those of nematics formed by rod-like molecules. All three show strong temperature dependence, increasing sharply near the N-NTB transition.
  • Face detection has achieved great success using the region-based methods. In this report, we propose a region-based face detector applying deep networks in a fully convolutional fashion, named Face R-FCN. Based on Region-based Fully Convolutional Networks (R-FCN), our face detector is more accurate and computational efficient compared with the previous R-CNN based face detectors. In our approach, we adopt the fully convolutional Residual Network (ResNet) as the backbone network. Particularly, We exploit several new techniques including position-sensitive average pooling, multi-scale training and testing and on-line hard example mining strategy to improve the detection accuracy. Over two most popular and challenging face detection benchmarks, FDDB and WIDER FACE, Face R-FCN achieves superior performance over state-of-the-arts.