• We desgin a novel fully convolutional network architecture for shapes, denoted by Shape Fully Convolutional Networks (SFCN). 3D shapes are represented as graph structures in the SFCN architecture, based on novel graph convolution and pooling operations, which are similar to convolution and pooling operations used on images. Meanwhile, to build our SFCN architecture in the original image segmentation fully convolutional network (FCN) architecture, we also design and implement a generating operation} with bridging function. This ensures that the convolution and pooling operation we have designed can be successfully applied in the original FCN architecture. In this paper, we also present a new shape segmentation approach based on SFCN. Furthermore, we allow more general and challenging input, such as mixed datasets of different categories of shapes} which can prove the ability of our generalisation. In our approach, SFCNs are trained triangles-to-triangles by using three low-level geometric features as input. Finally, the feature voting-based multi-label graph cuts is adopted to optimise the segmentation results obtained by SFCN prediction. The experiment results show that our method can effectively learn and predict mixed shape datasets of either similar or different characteristics, and achieve excellent segmentation results.
  • In this paper, we consider the problem of joint secure routing and transmit power optimization for a multi-hop ad-hoc network under the existence of randomly distributed eavesdroppers following a Poisson point process (PPP). Secrecy messages are delivered from a source to a destination through a multi-hop route connected by multiple legitimate relays in the network. Our goal is to minimize the end-to-end connection outage probability (COP) under the constraint of a secrecy outage probability (SOP) threshold, by optimizing the routing path and the transmit power of each hop jointly. We show that the globally optimal solution could be obtained by a two-step procedure where the optimal transmit power has a closed-form and the optimal routing path can be found by Dijkstra's algorithm. Then a friendly jammer with multiple antennas is applied to enhance the secrecy performance further, and the optimal transmit power of the jammer and each hop of the selected route is investigated. This problem can be solved optimally via an iterative outer polyblock approximation with one-dimension search algorithm. Furthermore, suboptimal transmit powers can be derived using the successive convex approximation (SCA) method with a lower complexity. Simulation results show the performance improvement of the proposed algorithms for both non-jamming and jamming scenarios, and also reveal a non-trivial trade-off between the numbers of hops and the transmit power of each hop for secure routing.
  • We propose a novel efficient tracking framework. Firstly, we incorporate DCF and SOSVM to obtain a novel formulation for training circular and structural learners (CSL). Secondly, we introduce a collaborative optimization strategy to update the learners, which significantly reduces computational complexity and improves robustness. Thirdly, we observe the fact that features extracted from only single-level are not robust to handle all challenge factors, thus we suggest to get a multi-level confidence score map with deep features in the continuous spatial domain, and we exploit an implicit interpolation model to extract multi-resolution complementary deep features based on different pre-trained CNNs, including both the deep appearance features and deep motion features of the target. Finally, in order to get an optimal confidence score map for more accurate localization, we propose a novel ensemble post-processor which based on relative entropy to combine the sing-level confidence score maps. Comprehensive evaluations are conducted on three object tracking benchmarks. Our approach obtains an absolute gain of 0.3% and 0.6% in mean AUC score compare to the top-ranked method on the OTB-2013 and OTB-2015 benchmarks, respectively, and provides a third-best performance with expected average overlap (EAO) score of 29.8% on the VOT2017 challenge, while operates at frame-rate.
  • In current visual object tracking system, the CPU or GPU-based visual object tracking systems have high computational cost and consume a prohibitive amount of power. Therefore, in this paper, to reduce the computational burden of the Camshift algorithm, we propose a novel visual object tracking algorithm by exploiting the properties of the binary classifier and Kalman predictor. Moreover, we present a low-cost FPGA-based real-time object tracking hardware architecture. Extensive evaluations on OTB benchmark demonstrate that the proposed system has extremely compelling real-time, stability and robustness. The evaluation results show that the accuracy of our algorithm is about 48%, and the average speed is about 309 frames per second.
  • The Ray-Casting algorithm is an important method for fast real-time surface display from 3D medical images. Based on the Ray-Casting algorithm, a novel parallel Ray-Casting algorithm is proposed in this paper. A novel operation is introduced and defined as a star operation, and star operations can be computed in parallel in the proposed algorithm compared with the serial chain of star operations in the Ray-Casting algorithm. The computation complexity of the proposed algorithm is reduced from $O(n)$ to $O(\log^n_2)$.
  • In ontology-based data access (OBDA), the classical database is enhanced with an ontology in the form of logical assertions generating new intensional knowledge. A powerful form of such logical assertions is the tuple-generating dependencies (TGDs), also called existential rules, where Horn rules are extended by allowing existential quantifiers to appear in the rule heads. In this paper we introduce a new language called loop restricted (LR) TGDs (existential rules), which are TGDs with certain restrictions on the loops embedded in the underlying rule set. We study the complexity of this new language. We show that the conjunctive query answering (CQA) under the LR TGDs is decid- able. In particular, we prove that this language satisfies the so-called bounded derivation-depth prop- erty (BDDP), which implies that the CQA is first-order rewritable, and its data complexity is in AC0 . We also prove that the combined complexity of the CQA is EXPTIME complete, while the language membership is PSPACE complete. Then we extend the LR TGDs language to the generalised loop restricted (GLR) TGDs language, and prove that this class of TGDs still remains to be first-order rewritable and properly contains most of other first-order rewritable TGDs classes discovered in the literature so far.
  • In this paper we introduce a new class of tuple-generating dependencies (TGDs) called triangularly-guarded TGDs, which are TGDs with certain restrictions on the atomic derivation track embedded in the underlying rule set. We show that conjunctive query answering under this new class of TGDs is decidable. We further show that this new class strictly contains some other decidable classes such as weak-acyclic, guarded, sticky and shy, which, to the best of our knowledge, provides a unified representation of all these aforementioned classes.
  • We present a semi-supervised co-analysis method for learning 3D shape styles from projected feature lines, achieving style patch localization with only weak supervision. Given a collection of 3D shapes spanning multiple object categories and styles, we perform style co-analysis over projected feature lines of each 3D shape and then backproject the learned style features onto the 3D shapes. Our core analysis pipeline starts with mid-level patch sampling and pre-selection of candidate style patches. Projective features are then encoded via patch convolution. Multi-view feature integration and style clustering are carried out under the framework of partially shared latent factor (PSLF) learning, a multi-view feature learning scheme. PSLF achieves effective multi-view feature fusion by distilling and exploiting consistent and complementary feature information from multiple views, while also selecting style patches from the candidates. Our style analysis approach supports both unsupervised and semi-supervised analysis. For the latter, our method accepts both user-specified shape labels and style-ranked triplets as clustering constraints.We demonstrate results from 3D shape style analysis and patch localization as well as improvements over state-of-the-art methods. We also present several applications enabled by our style analysis.
  • We revisit the problem of estimating depth of a scene from its single RGB image. Despite the recent success of deep learning based methods, we show that there is still room for improvement in two aspects by training a deep network consisting of two sub-networks; a base network for providing an initial depth estimate, and a refinement network for refining it. First, spatial resolution of the estimated depth maps can be improved using skip connections among the sub-networks which are trained in a sequential fashion. Second, we can improve estimation accuracy of boundaries of objects in scenes by employing the proposed loss functions using depth gradients. Experimental results show that the proposed network and methods improve depth estimation performance of baseline networks, particularly for reconstruction of small objects and refinement of distortion of edges, and outperform the state-of-the-art methods on benchmark datasets.
  • In this paper, we consider a realistic and meaningful scenario in the context of smart grids where an electricity retailer serves three different types of customers, i.e., customers with an optimal home energy management system embedded in their smart meters (C-HEMS), customers with only smart meters (C-SM), and customers without smart meters (C-NONE). The main objective of this paper is to support the retailer to make optimal day-ahead dynamic pricing decisions in such a mixed customer pool. To this end, we propose a two-level decision-making framework where the retailer acting as upper-level agent firstly announces its electricity prices of next 24 hours and customers acting as lower-level agents subsequently schedule their energy usages accordingly. For the lower level problem, we model the price responsiveness of different customers according to their unique characteristics. For the upper level problem, we optimize the dynamic prices for the retailer to maximize its profit subject to realistic market constraints. The above two-level model is tackled by genetic algorithms (GA) based distributed optimization methods while its feasibility and effectiveness are confirmed via simulation results.
  • We present an effective dynamic clustering algorithm for the task of temporal human action segmentation, which has comprehensive applications such as robotics, motion analysis, and patient monitoring. Our proposed algorithm is unsupervised, fast, generic to process various types of features, and applicable in both the online and offline settings. We perform extensive experiments of processing data streams, and show that our algorithm achieves the state-of-the-art results for both online and offline settings.
  • In scanning microscopy based imaging techniques, there is a need to develop novel data acquisition schemes that can reduce the time for data acquisition and minimize sample exposure to the probing radiation. Sparse sampling schemes are ideally suited for such applications where the images can be reconstructed from a sparse set of measurements. In particular, dynamic sparse sampling based on supervised learning has shown promising results for practical applications. However, a particular drawback of such methods is that it requires training image sets with similar information content which may not always be available. In this paper, we introduce a Supervised Learning Approach for Dynamic Sampling (SLADS) algorithm that uses a deep neural network based training approach. We call this algorithm SLADS- Net. We have performed simulated experiments for dynamic sampling using SLADS-Net in which the training images either have similar information content or completely different information content, when compared to the testing images. We compare the performance across various methods for training such as least- squares, support vector regression and deep neural networks. From these results we observe that deep neural network based training results in superior performance when the training and testing images are not similar. We also discuss the development of a pre-trained SLADS-Net that uses generic images for training. Here, the neural network parameters are pre-trained so that users can directly apply SLADS-Net for imaging experiments.
  • Visual Question Answering (VQA) models have struggled with counting objects in natural images so far. We identify a fundamental problem due to soft attention in these models as a cause. To circumvent this problem, we propose a neural network component that allows robust counting from object proposals. Experiments on a toy task show the effectiveness of this component and we obtain state-of-the-art accuracy on the number category of the VQA v2 dataset without negatively affecting other categories, even outperforming ensemble models with our single model. On a difficult balanced pair metric, the component gives a substantial improvement in counting over a strong baseline by 6.6%.
  • We study an one-dimensional non-Hermitian lattice with complex hopping rates, which can be realized by a quasi-one-dimensional sawtooth-type Hermitian lattice after adiabatic elimination with proper conditions. By means of synthetic magnetic fluxes, the imaginary parts of the complex hopping rates can be modulated by additional phase, thus a non-reciprocal structure arises. With this lattice, one can realize robust unidirectional transport for both single-site and Gaussian excitations, which is immune to defects and backscattering. Furthermore, we proposed a sandwich structure based on the non-Hermitian lattice, which can be used for realizing controllable photon storage and reversal. The storage time and range can be artificially controlled within limits, and the storage efficiency can be increased via a finite gain compensation. The proposal of controllable photon transport in this paper opens up a new path for unidirectional photon transport and provides a promising platform for optical control and manipulation.
  • This paper addresses active state estimation with a team of robotic sensors. The states to be estimated are represented by spatially distributed, uncorrelated, stationary vectors. Given a prior belief on the geographic locations of the states, we cluster the states in moderately sized groups and propose a new hierarchical Dynamic Programming (DP) framework to compute optimal sensing policies for each cluster that mitigates the computational cost of planning optimal policies in the combined belief space. Then, we develop a decentralized assignment algorithm that dynamically allocates clusters to robots based on the pre-computed optimal policies at each cluster. The integrated distributed state estimation framework is optimal at the cluster level but also scales very well to large numbers of states and robot sensors. We demonstrate efficiency of the proposed method in both simulations and real-world experiments using stereoscopic vision sensors.
  • In this paper, we address the problem of controlling a mobile stereo camera under image quantization noise. Assuming that a pair of images of a set of targets is available, the camera moves through a sequence of Next-Best-Views (NBVs), i.e., a sequence of views that minimize the trace of the targets' cumulative state covariance, constructed using a realistic model of the stereo rig that captures image quantization noise and a Kalman Filter (KF) that fuses the observation history with new information. The proposed algorithm decomposes control into two stages: first the NBV is computed in the camera relative coordinates, and then the camera moves to realize this view in the fixed global coordinate frame. This decomposition allows the camera to drive to a new pose that effectively realizes the NBV in camera coordinates while satisfying Field-of-View constraints in global coordinates, a task that is particularly challenging using complex sensing models. We provide simulations and real experiments that illustrate the ability of the proposed mobile camera system to accurately localize sets of targets. We also propose a novel data-driven technique to characterize unmodeled uncertainty, such as calibration errors, at the pixel level and show that this method ensures stability of the KF.
  • The new hemispherical photomultiplier tubes (PMTs) with 9 inch diameter from Hainan Zhanchuang Photonics Technology Co.,Ltd (HZC) have been studied. Narrow transit time spread (FWHM=2.35 ns) accompanied by small nonlinearity (750 photoelectrons at 5%) and high gain (1E7 ) with good single photoelectron (PE) resolution have been observed. 11 PMTs of this type are deployed and studied in the prototype detector for JUNO at IHEP, China.
  • We propose a feasible scheme to realize all-optical photon transmission switching in a passiveactive optomechanical system, consisting of one ordinary passive cavity, one active cavity and one common movable membrane oscillator of perfect reflection, driven by two strong control fields and two weak probe fields symmetrically. By means of the gain effect of the active cavity, many novel and valuable phenomena arise, such as frequency-independent perfect reflection (FIPR) first proposed in this paper, adjustable photon bidirectional transmission, phase-dependent non-reciprocity and so on. The relevant parameters used for controlling the all-optical switching are precisely tunable by adjusting the strengths of control fields and the relative phase of probe fields. Furthermore, tunable fast and slow light can be realized in our system by accurately adjusting relevant parameters which is readily and feasible in experiments. These novel phenomena originate from the effective optomechanical coupling and the gain effect provide a promising platform for photonic devices, quantum network node fabrication and quantum information process (QIP).
  • We present a new feasible way to flatten the axial intensity oscillations for diffraction of a finite-sized Bessel beam, through designing a cardioid-like hole. The boundary formula of the cardioid-like hole is given analytically. Numerical results by the complete Rayleigh-Sommerfeld method reveal that the Bessel beam propagates stably in a considerably long axial range, after passing through the cardioid-like hole. Compared with the gradually absorbing apodization technique in previous papers, in this paper a hard truncation of the incident Bessel beam is employed at the cardioid-like hole edges. The proposed hard apodization technique takes two advantages in suppressing the axial intensity oscillations, i.e., easier implementation and higher accuracy. It is expected to have practical applications in laser machining, light sectioning, or optical trapping.
  • Fourier ptychographic microscopy (FPM) is a computational imaging technique that overcomes the physical space-bandwidth product (SBP) limit of a conventional microscope by applying angular diversity illuminations. In the usual model of FPM, the microscopic system is approximated by being taken as space-invariant with transfer function determined by a complex pupil function of the objective. However, in real experimental conditions, several unexpected "semi-bright and semi-dark" images with strong vignetting effect can be easily observed when the sample is illuminated by the LED within the "transition zone" between bright field and dark field. These imperfect images, apparently, are not coincident with the space-invariant model and could deteriorate the reconstruction quality severely. In this Letter, we examine the impact of this space-invariant approximation on FPM image formation based on ray-based and rigorous wave optics-based analysis. Our analysis shows that for a practical FPM microscope with a low power objective and a large field of view, the space invariance is destroyed by diffraction at other stops associated with different lens elements to a large extent. A modified version of the space-variant model is derived and discussed. Two simple countermeasures are also presented and experimentally verified to bypass or partially alleviate the vignetting-induced reconstruction artifacts.
  • Almost a decade has passed since the serendipitous discovery of the iron-based high temperature superconductors (FeSCs) in 2008. The question of how much similarity the FeSCs have with the copper oxide high temperature superconductors emerged since the initial discovery of long-range antiferromagnetism in the FeSCs in proximity to superconductivity. Despite the great resemblance in their phase diagrams, there exist important disparities between FeSCs and cuprates that need to be considered in order to paint a full picture of these two families of high temperature superconductors. One of the key differences lies in the multi-orbital multi-band nature of FeSCs, in contrast to the effective single-band model for cuprates. Due to the complexity of multi-orbital band structures, the orbital degree of freedom is often neglected in formulating the theoretical models for FeSCs. On the experimental side, systematic studies of the orbital related phenomena in FeSCs have been largely lacking. In this review, we summarize angle-resolved photoemission spectroscopy (ARPES) measurements across various FeSC families in literature, focusing on the systematic trend of orbital dependent electron correlations and the role of different Fe 3d orbitals in driving the nematic transition, the spin-density-wave transition, and implications for superconductivity.
  • We introduce a large-scale 3D shape understanding benchmark using data and annotation from ShapeNet 3D object database. The benchmark consists of two tasks: part-level segmentation of 3D shapes and 3D reconstruction from single view images. Ten teams have participated in the challenge and the best performing teams have outperformed state-of-the-art approaches on both tasks. A few novel deep learning architectures have been proposed on various 3D representations on both tasks. We report the techniques used by each team and the corresponding performances. In addition, we summarize the major discoveries from the reported results and possible trends for the future work in the field.
  • Fourier ptychographic microscopy (FPM) is a computational imaging technique with both high resolution and large field-of-view. However, the effective numerical aperture (NA) achievable with a typical LED panel is ambiguous and usually relies on the repeated tests of different illumination NAs. The imaging quality of each raw image usually depends on the visual assessments, which is subjective and inaccurate especially for those dark field images. Moreover, the acquisition process is really time-consuming.In this paper, we propose a SNR-based adaptive acquisition method for quantitative evaluation and adaptive collection of each raw image according to the signal-to-noise ration (SNR) value, to improve the FPM's acquisition efficiency and automatically obtain the maximum achievable NA, reducing the time of collection, storage and subsequent calculation. The widely used EPRY-FPM algorithm is applied without adding any algorithm complexity and computational burden. The performance has been demonstrated in both USAF targets and biological samples with different imaging sensors respectively, which have either Poisson or Gaussian noises model. Further combined with the sparse LEDs strategy, the number of collection images can be shorten to around 25 frames while the former needs 361 images, the reduction ratio can reach over 90%. This method will make FPM more practical and automatic, and can also be used in different configurations of FPM.
  • Traffic flow prediction is an important research issue for solving the traffic congestion problem in an Intelligent Transportation System (ITS). Traffic congestion is one of the most serious problems in a city, which can be predicted in advance by analyzing traffic flow patterns. Such prediction is possible by analyzing the real-time transportation data from correlative roads and vehicles. This article first gives a brief introduction to the transportation data, and surveys the state-of-the-art prediction methods. Then, we verify whether or not the prediction performance is able to be improved by fitting actual data to optimize the parameters of the prediction model which is used to predict the traffic flow. Such verification is conducted by comparing the optimized time series prediction model with the normal time series prediction model. This means that in the era of big data, accurate use of the data becomes the focus of studying the traffic flow prediction to solve the congestion problem. Finally, experimental results of a case study are provided to verify the existence of such performance improvement, while the research challenges of this data-analytics-based prediction are presented and discussed.
  • Mobile edge computing (MEC) is a promising approach for enabling cloud-computing capabilities at the edge of cellular networks. Nonetheless, security is becoming an increasingly important issue in MEC-based applications. In this paper, we propose a deep-learning-based model to detect security threats. The model uses unsupervised learning to automate the detection process, and uses location information as an important feature to improve the performance of detection. Our proposed model can be used to detect malicious applications at the edge of a cellular network, which is a serious security threat. Extensive experiments are carried out with 10 different datasets, the results of which illustrate that our deep-learning-based model achieves an average gain of 6% accuracy compared with state-of-the-art machine learning algorithms.