• ### Optimal $C^{1,\alpha}$ estimates for a class of elliptic quasilinear equations(1507.06898)

Dec. 19, 2018 math.AP
In this article we establish sharp $C^{1,\alpha}$ estimates for weak solutions of singular and degenerate quasilinear elliptic equations $$-\operatorname{div} a(x, \nabla u) = f,$$ which include the standard $p$-Laplacian equation with varying coefficients as a special case. The sharp exponent $\alpha$ is asymptotically optimal and is determined by the H\"older regularity of the coefficients, the exponent $p$ and the $q$-integrability of the source term $f$.
• ### Multi-channel Weighted Nuclear Norm Minimization for Real Color Image Denoising(1705.09912)

Dec. 18, 2018 cs.CV
Most existing denoising algorithms are developed for grayscale images, and extending them to color image denoising is nontrivial because the noise statistics in the R, G, and B channels can be very different for real noisy images. In this paper, we propose a multi-channel (MC) optimization model for real color image denoising under the weighted nuclear norm minimization (WNNM) framework. We concatenate the RGB patches to make use of the channel redundancy, and introduce a weight matrix to balance the data fidelity of the three channels in consideration of their different noise statistics. The proposed MC-WNNM model does not have an analytical solution. We reformulate it into a linear equality-constrained problem and solve it with the alternating direction method of multipliers. Each alternating update step has a closed-form solution and convergence can be guaranteed. Extensive experiments on both synthetic and real noisy image datasets demonstrate the superiority of the proposed MC-WNNM over state-of-the-art denoising methods.
• ### Abundance for 3-folds with non-trivial Albanese maps in positive characteristic(1705.00847)

Oct. 8, 2019 math.AG
In this paper, we prove abundance for 3-folds with non-trivial Albanese maps, over an algebraically closed field of characteristic $p > 5$.
• ### Subadditivity of Kodaira dimensions for fibrations of three-folds in positive characteristics(1601.06907)

July 16, 2019 math.AG
In this paper, we will prove subadditivity of Kodaira dimensions for a fibration with possibly singular geometric generic fiber, under certain nefness and relative semi-ampleness conditions. As an application, for a fibration $f: X \to Y$ of a smooth projective threefold over an algebraically closed field of characteristic $p>5$, under the assumption that $Y$ is of general type and non-uniruled, we prove subadditivity of Kodaira dimensions when general fibers are smooth or when $K_{X/Y}$ is relatively big over $Y$.
• ### External Prior Guided Internal Prior Learning for Real-World Noisy Image Denoising(1705.04505)

Oct. 15, 2018 cs.CV
Most existing image denoising methods learn image priors from either external data or the noisy image itself to remove noise. However, priors learned from external data may not be adaptive to the image to be denoised, while priors learned from the given noisy image may not be accurate due to the interference of the corrupting noise. Meanwhile, the noise in real-world noisy images is very complex and hard to describe with simple distributions such as the Gaussian distribution, making real-world noisy image denoising a very challenging problem. We propose to exploit the information in both external data and the given noisy image, and develop an external prior guided internal prior learning method for real-world noisy image denoising. We first learn external priors from an independent set of clean natural images. With the aid of the learned external priors, we then learn internal priors from the given noisy image to refine the prior model. The external and internal priors are formulated as a set of orthogonal dictionaries to efficiently reconstruct the desired image. Extensive experiments are performed on several real-world noisy image datasets. The proposed method demonstrates highly competitive denoising performance, outperforming state-of-the-art denoising methods including those designed for real-world noisy images.
• ### Abundance for non-uniruled 3-folds with non-trivial Albanese maps in positive characteristics(1610.03637)

Aug. 20, 2018 math.AG
In this paper, we prove abundance for non-uniruled 3-folds with non-trivial Albanese maps, over an algebraically closed field of characteristic $p > 5$. As an application we get a characterization of abelian 3-folds.
• ### Iitaka's $C_{n,m}$ conjecture for 3-folds in positive characteristic(1604.01856)

June 25, 2018 math.AG
In this paper, we prove that for a fibration $f:X\to Z$ from a smooth projective 3-fold to a smooth projective curve, over an algebraically closed field $k$ with $\mathrm{char} k =p >5$, if the geometric generic fiber $X_{\overline\eta}$ is smooth, then subadditivity of Kodaira dimensions holds, i.e. $$\kappa(X)\ge\kappa(X_{\overline\eta})+\kappa(Z).$$
• ### PatternNet: Visual Pattern Mining with Deep Neural Network(1703.06339)

June 13, 2018 cs.CV
Visual patterns represent the discernible regularity in the visual world. They capture the essential nature of visual objects or scenes. Understanding and modeling visual patterns is a fundamental problem in visual recognition that has wide-ranging applications. In this paper, we study the problem of visual pattern mining and propose a novel deep neural network architecture called PatternNet for discovering patterns that are both discriminative and representative. The proposed PatternNet leverages the filters in the last convolution layer of a convolutional neural network to find locally consistent visual patches, and by combining these filters we can effectively discover unique visual patterns. In addition, PatternNet can discover visual patterns efficiently without performing expensive image patch sampling, and this advantage provides an order of magnitude speedup compared to most other approaches. We evaluate the proposed PatternNet subjectively by showing randomly selected visual patterns discovered by our method, and quantitatively by performing image classification with the identified visual patterns and comparing our performance with the current state of the art. We also directly evaluate the quality of the discovered visual patterns by using the identified patterns as proposed objects in an image and comparing with other relevant methods. Our proposed network and procedure, PatternNet, is able to outperform competing methods for the tasks described.
• ### A Posteriori Error Estimation and Adaptive Algorithm for the Atomistic/Continuum Coupling in 2D(1702.02701)

June 13, 2018 math.NA
Atomistic/continuum coupling methods aim to achieve an optimal balance between accuracy and efficiency. Adaptivity is the key to the efficient implementation of such methods. In this paper, we carry out a rigorous a posteriori analysis of the residual, the stability constant, and the error bound, for a consistent atomistic/continuum coupling method in 2D. We design and implement the corresponding adaptive mesh refinement algorithm, and the convergence rate with respect to degrees of freedom is optimal compared with a priori error estimates.
• ### MU-UFMC System Performance Analysis and Optimal Filter Length and Zero Padding Length Design(1603.09169)

June 5, 2018 cs.IT, math.IT
Universal filtered multi-carrier (UFMC) systems offer a flexibility of filtering arbitrary number of subcarriers to suppress out of band (OoB) emission, while keeping the orthogonality between subbands and subcarriers within one subband. However, subband filtering may affect system performance and capacity in a number of ways. In this paper, we first propose the conditions for interference-free one-tap equalization and corresponding signal model in the frequency domain for multi-user (MU) UFMC system. Based on this ideal interference-free case, impact of subband filtering on the system performance is analyzed in terms of average signal-to-noise ratio (SNR) per subband, capacity per subcarrier and bit error rate (BER) and compared with the orthogonal frequency division multiplexing (OFDM) system. This is followed by filter length selection strategies to provide guidelines for system design. Next, by taking carrier frequency offset (CFO), timing offset (TO), insufficient guard interval between symbols and filter tail cutting (TC) into consideration, an analytical system model is established. New channel equalization algorithms are proposed by considering the errors and imperfections based on the derived signal models. In addition, a set of optimization criteria in terms of filter length and guard interval/filter TC length subject to various constraints is formulated to maximize the system capacity. Numerical results show that the analytical and corresponding optimal approaches match the simulation results, and the proposed equalization algorithms can significantly improve the BER performance.
• ### FFDNet: Toward a Fast and Flexible Solution for CNN based Image Denoising(1710.04026)

May 22, 2018 cs.CV
Due to the fast inference and good performance, discriminative learning methods have been widely studied in image denoising. However, these methods mostly learn a specific model for each noise level, and require multiple models for denoising images with different noise levels. They also lack flexibility to deal with spatially variant noise, limiting their applications in practical denoising. To address these issues, we present a fast and flexible denoising convolutional neural network, namely FFDNet, with a tunable noise level map as the input. The proposed FFDNet works on downsampled sub-images, achieving a good trade-off between inference speed and denoising performance. In contrast to the existing discriminative denoisers, FFDNet enjoys several desirable properties, including (i) the ability to handle a wide range of noise levels (i.e., [0, 75]) effectively with a single network, (ii) the ability to remove spatially variant noise by specifying a non-uniform noise level map, and (iii) faster speed than benchmark BM3D even on CPU without sacrificing denoising performance. Extensive experiments on synthetic and real noisy images are conducted to evaluate FFDNet in comparison with state-of-the-art denoisers. The results show that FFDNet is effective and efficient, making it highly attractive for practical denoising applications.
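The input arrangement described in this abstract (downsampled sub-images plus a tunable noise level map) can be illustrated with a minimal sketch. It assumes a single-channel image stored as a list of rows and a downsampling factor of 2; the function names `to_subimages`, `from_subimages`, and `build_network_input` are hypothetical and are not taken from the FFDNet code. The denoising network itself is not shown.

```python
def to_subimages(image, scale=2):
    """Split an H x W image (list of rows) into scale*scale sub-images by pixel position."""
    h, w = len(image), len(image[0])
    if h % scale or w % scale:
        raise ValueError("height and width must be divisible by scale")
    subs = []
    for dy in range(scale):
        for dx in range(scale):
            # Keep every scale-th row starting at dy and every scale-th column starting at dx.
            subs.append([row[dx::scale] for row in image[dy::scale]])
    return subs


def from_subimages(subs, scale=2):
    """Inverse of to_subimages: interleave the sub-images back into one image."""
    sh, sw = len(subs[0]), len(subs[0][0])
    image = [[0] * (sw * scale) for _ in range(sh * scale)]
    for idx, sub in enumerate(subs):
        dy, dx = divmod(idx, scale)
        for y in range(sh):
            for x in range(sw):
                image[y * scale + dy][x * scale + dx] = sub[y][x]
    return image


def build_network_input(image, sigma, scale=2):
    """Stack the sub-images with a uniform noise level map (assumed layout)."""
    subs = to_subimages(image, scale)
    sh, sw = len(subs[0]), len(subs[0][0])
    noise_map = [[sigma] * sw for _ in range(sh)]
    return subs + [noise_map]
```

Because the split is a pure rearrangement of pixels, it is reversible and no information is lost. A spatially variant noise setting would replace the constant `sigma` with a per-pixel map of the same size as one sub-image.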
• ### Homocentric Hypersphere Feature Embedding for Person Re-identification(1804.08866)

May 1, 2018 cs.CV
Person re-identification (Person ReID) is a challenging task due to the large variations in camera viewpoint, lighting, resolution, and human pose. Recently, with the advancement of deep learning technologies, the performance of Person ReID has been improved swiftly. Feature extraction and feature matching are two crucial components in the training and deployment stages of Person ReID. However, many existing Person ReID methods have measure inconsistency between the training stage and the deployment stage, and they couple magnitude and orientation information of feature vectors in feature representation. Meanwhile, traditional triplet loss methods focus on samples within a mini-batch and lack knowledge of global feature distribution. To address these issues, we propose a novel homocentric hypersphere embedding scheme to decouple magnitude and orientation information for both feature and weight vectors, and reformulate classification loss and triplet loss to their angular versions and combine them into an angular discriminative loss. We evaluate our proposed method extensively on the widely used Person ReID benchmarks, including Market1501, CUHK03 and DukeMTMC-ReID. Our method demonstrates leading performance on all datasets.
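The idea of decoupling magnitude from orientation for both feature and weight vectors can be sketched as follows. This is only an illustration of scoring by angle (cosine similarity between normalized vectors), not the paper's loss; the helper names `split_magnitude_orientation` and `cosine_logits` are hypothetical.

```python
import math


def split_magnitude_orientation(v):
    """Return (norm, unit vector) so magnitude and direction can be handled separately."""
    norm = math.sqrt(sum(x * x for x in v))
    if norm == 0.0:
        raise ValueError("zero vector has no orientation")
    return norm, [x / norm for x in v]


def cosine_logits(feature, class_weights):
    """Score a feature against each class weight using only the angle between them."""
    _, f_dir = split_magnitude_orientation(feature)
    logits = []
    for w in class_weights:
        _, w_dir = split_magnitude_orientation(w)
        logits.append(sum(a * b for a, b in zip(f_dir, w_dir)))
    return logits
```

With this scoring, rescaling a feature vector leaves the logits unchanged, so the same angular measure can be used for classification at training time and for matching at deployment time.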
• ### Vanishing estimates for fully bubbling solutions of $SU(n+1)$ Toda Systems at a singular source(1804.07685)

April 20, 2018 math.AP
For the Gauss curvature equation (or more general Toda systems) defined on two-dimensional spaces, the vanishing rate of certain curvature functions at blowup points is a key estimate for numerous applications. However, if these equations have singular sources, very few vanishing estimates can be found. In this article we consider a Toda system with singular sources defined on a Riemann surface and we prove very surprising vanishing estimates and a reflection phenomenon for certain functions involving the Gauss curvature.
• ### Spectrum Efficient MIMO-FBMC System using Filter Output Truncation(1711.08842)

April 18, 2018 eess.SP
Due to the use of an appropriately designed pulse shaping prototype filter, filter bank multicarrier (FBMC) system can achieve low out of band (OoB) emissions and is also robust to the channel and synchronization errors. However, it comes at a cost of long filter tails which may reduce the spectral efficiency significantly when the block size is small. Filter output truncation (FOT) can reduce the overhead by discarding the filter tails but may also significantly destroy the orthogonality of FBMC system, by introducing inter carrier interference (ICI) and inter symbol interference (ISI) terms in the received signal. As a result, the signal to interference ratio (SIR) is degraded. In addition, the presence of intrinsic interference terms in FBMC also proves to be an obstacle in combining multiple input multiple output (MIMO) with FBMC. In this paper, we present a theoretical analysis on the effect of FOT in an MIMO-FBMC system. First, we derive the matrix model of MIMO-FBMC system which is subsequently used to analyze the impact of finite filter length and FOT on the system performance. The analysis reveals that FOT can avoid the overhead in time domain but also introduces extra interference in the received symbols. To combat the interference terms, we then propose a compensation algorithm that considers odd and even overlapping factors as two separate cases, where the signals are interfered by the truncation in different ways. The general form of the compensation algorithm can compensate all the symbols in a MIMO-FBMC block and can improve the SIR values of each symbol for better detection at the receiver. It is also shown that the proposed algorithm requires no overhead and can still achieve a comparable BER performance to the case with no filter truncation.
• ### Interference Analysis of QAM based Filter Bank Multicarrier System with Index Modulation(1804.04770)

April 17, 2018 eess.SP
Index modulation (IM) has recently emerged as a promising concept for spectrum and energy-efficient next generation wireless communications systems since it strikes a good balance among error performance, complexity, and spectral efficiency. IM technique, when applied to multicarrier waveforms, yields the ability to convey the information not only by M-ary signal constellations as in conventional multicarrier systems but also by the indexes of the subcarriers, which are activated according to the incoming bit stream. Although IM is well studied for OFDM based systems, FBMC with index modulation has not been thoroughly investigated. In this paper, we shed light on the potential and implementation of IM technique for QAM based FBMC system. We start with a mathematical model of the IM based QAM-FBMC system (FBMC/QAM-IM) along with the derivation of interference terms at the receiver due to channel distortions and noise. The interference terms including the ones introduced by the multipath channel are analyzed in terms of MSE and output SINR. It is shown with analytical and simulation results that the interference power in FBMC/QAM-IM is smaller compared to that of the conventional FBMC/QAM system as some of the subcarriers are inactive. The performance of FBMC/QAM with IM is investigated by comparing the SIR and output SINR with that of the conventional FBMC/QAM system along with the BER performance which shows that the FBMC/QAM-IM is a promising transmission technique for future wireless networks.
• ### Simultaneous Fidelity and Regularization Learning for Image Restoration(1804.04522)

April 12, 2018 cs.CV
Most existing non-blind restoration methods are based on the assumption that a precise degradation model is known. As the degradation process can only be partially known or inaccurately modeled, images may not be well restored. Rain streak removal and image deconvolution with inaccurate blur kernels are two representative examples of such tasks. For rain streak removal, although an input image can be decomposed into a scene layer and a rain streak layer, there exists no explicit formulation for modeling rain streaks and their composition with the scene layer. For blind deconvolution, as estimation error in the blur kernel is usually introduced, the subsequent non-blind deconvolution process does not restore the latent image well. In this paper, we propose a principled algorithm within the maximum a posteriori framework to tackle image restoration with a partially known or inaccurate degradation model. Specifically, the residual caused by a partially known or inaccurate degradation model is spatially dependent and complexly distributed. With a training set of degraded and ground-truth image pairs, we parameterize and learn the fidelity term for a degradation model in a task-driven manner. Furthermore, the regularization term can also be learned along with the fidelity term, thereby forming a simultaneous fidelity and regularization learning model. Extensive experimental results demonstrate the effectiveness of the proposed model for image deconvolution with inaccurate blur kernels and rain streak removal. Furthermore, for image restoration with a precise degradation process, e.g., Gaussian denoising, the proposed model can be applied to learn the proper fidelity term for optimal performance based on visual perception metrics.
• ### Generating Diverse and Accurate Visual Captions by Comparative Adversarial Learning(1804.00861)

April 11, 2018 cs.CV
We study how to generate captions that are not only accurate in describing an image but also discriminative across different images. The problem is both fundamental and interesting, as most machine-generated captions, despite phenomenal research progress in the past several years, are expressed in a very monotonic and featureless format. While such captions are normally accurate, they often lack important characteristics of human languages - distinctiveness for each caption and diversity for different images. To address this problem, we propose a novel conditional generative adversarial network for generating diverse captions across images. Instead of estimating the quality of a caption solely on one image, the proposed comparative adversarial learning framework better assesses the quality of captions by comparing a set of captions within the image-caption joint space. By contrasting with human-written captions and image-mismatched captions, the caption generator effectively exploits the inherent characteristics of human languages, and generates more discriminative captions. We show that our proposed network is capable of producing accurate and diverse captions across images.
• ### Thermal rectification in a double quantum dots system with polaron effect(1804.03400)

April 10, 2018 cond-mat.mes-hall
We investigate the rectification of heat current carried by electrons through a double quantum dot (DQD) system under a temperature bias. The DQD can be realized by molecules such as suspended carbon nanotube and be described by the Anderson-Holstein model in presence of electron-phonon interaction. Strong electron-phonon interaction can lead to formation of polaronic states in which electronic states are dressed by phonon cloud. Dressed tunneling approximation (DTA), which is nonperturbative in dealing with strong electron-phonon interaction, is employed to obtain the heat current expression. In DTA, self-energies are dressed by phonon cloud operator and are temperature dependent. The temperature dependency of imaginary part of dressed retarded self-energy gives rise to the asymmetry of the system and is the necessary condition of thermal rectification. On top of this, one can either tune DQD effective energy levels such that $|\bar{\epsilon}_1|\neq |\bar{\epsilon}_2|$ or have asymmetric dot-lead couplings to achieve thermal rectification. We numerically find that increasing electron-phonon coupling and reducing inter dot coupling can both improve thermal rectification effect, while the electronic heat current is reduced.
• ### Real-world Noisy Image Denoising: A New Benchmark(1804.02603)

April 7, 2018 cs.CV
Most previous image denoising methods focus on additive white Gaussian noise (AWGN). However, the real-world noisy image denoising problem has become increasingly important with the advance of computer vision techniques. In order to promote the study of this problem while complementing the existing real-world image denoising datasets, we construct a new benchmark dataset which contains comprehensive real-world noisy images of different natural scenes. These images are captured by different cameras under different camera settings. We evaluate different denoising methods on our new dataset as well as on previous datasets. Extensive experimental results demonstrate that the recently proposed methods designed specifically for realistic noise removal based on sparse or low rank theories achieve better denoising performance and are more robust than other competing methods, and that the newly proposed dataset is more challenging. The constructed dataset of real photographs is publicly available at \url{https://github.com/csjunxu/PolyUDataset} for researchers to investigate new real-world image denoising methods. We will add more analysis on the noise statistics in the real photographs of our new dataset in the next version of this article.
• ### End-to-End Detection and Re-identification Integrated Net for Person Search(1804.00376)

April 2, 2018 cs.CV
This paper proposes a pedestrian detection and re-identification (re-id) integration net (I-Net) in an end-to-end learning framework. The I-Net is used in real-world video surveillance scenarios, where the target person needs to be searched for in whole-scene videos while annotations of pedestrian bounding boxes are unavailable. Compared with OIM, an existing method for joint detection and re-id, we make three distinct contributions. First, we introduce a Siamese architecture for I-Net instead of a single stream, such that a verification task can be implemented. Second, we propose a novel on-line pairing loss (OLP) and hard example priority softmax loss (HEP), such that the loss computation concentrates on the hard negatives. Third, an on-line dictionary for storing negative samples is designed in I-Net without recording the positive samples. Results on person search datasets show that the gap between detection and re-identification is narrowed and that superior performance can be achieved.
• ### Towards Human-Machine Cooperation: Self-supervised Sample Mining for Object Detection(1803.09867)

March 27, 2018 cs.CV
Though quite challenging, leveraging large-scale unlabeled or partially labeled images in a cost-effective way has attracted increasing interest because of its great importance to computer vision. To tackle this problem, many Active Learning (AL) methods have been developed. However, these methods mainly define their sample selection criteria within a single image context, leading to suboptimal robustness and impractical solutions for large-scale object detection. In this paper, aiming to remedy the drawbacks of existing AL methods, we present a principled Self-supervised Sample Mining (SSM) process accounting for the real challenges in object detection. Specifically, our SSM process concentrates on automatically discovering and pseudo-labeling reliable region proposals for enhancing the object detector via the introduced cross image validation, i.e., pasting these proposals into different labeled images to comprehensively measure their values under different image contexts. By resorting to the SSM process, we propose a new AL framework for gradually incorporating unlabeled or partially labeled data into the model learning while minimizing the annotation effort of users. Extensive experiments on two public benchmarks clearly demonstrate that our proposed framework can achieve performance comparable to the state-of-the-art methods with significantly fewer annotations.
• ### CleanNet: Transfer Learning for Scalable Image Classifier Training with Label Noise(1711.07131)

March 25, 2018 cs.AI, cs.CV, cs.LG
In this paper, we study the problem of learning image classification models with label noise. Existing approaches depending on human supervision are generally not scalable as manually identifying correct or incorrect labels is time-consuming, whereas approaches not relying on human supervision are scalable but less effective. To reduce the amount of human supervision for label noise cleaning, we introduce CleanNet, a joint neural embedding network, which only requires a fraction of the classes to be manually verified to provide the knowledge of label noise that can be transferred to other classes. We further integrate CleanNet and a conventional convolutional neural network classifier into one framework for image classification learning. We demonstrate the effectiveness of the proposed algorithm on both the label noise detection task and the task of image classification on noisy data, using several large-scale datasets. Experimental results show that CleanNet can reduce the label noise detection error rate on held-out classes where no human supervision is available by 41.5% compared to current weakly supervised methods. It also achieves 47% of the performance gain of verifying all images with only 3.2% of images verified on an image classification task. Source code and dataset will be available at kuanghuei.github.io/CleanNetProject.
• ### Learning Spatial-Temporal Regularized Correlation Filters for Visual Tracking(1803.08679)

March 23, 2018 cs.CV
Discriminative Correlation Filters (DCF) are efficient in visual tracking but suffer from unwanted boundary effects. Spatially Regularized DCF (SRDCF) has been suggested to resolve this issue by enforcing a spatial penalty on DCF coefficients, which, inevitably, improves the tracking performance at the price of increased complexity. To tackle online updating, SRDCF formulates its model on multiple training images, which makes it even harder to improve efficiency. In this work, motivated by the online Passive-Aggressive (PA) algorithm, we introduce temporal regularization to SRDCF with a single sample, resulting in our spatial-temporal regularized correlation filters (STRCF). The STRCF formulation can not only serve as a reasonable approximation to SRDCF with multiple training samples, but also provide a more robust appearance model than SRDCF in the case of large appearance variations. Besides, it can be efficiently solved via the alternating direction method of multipliers (ADMM). By incorporating both temporal and spatial regularization, our STRCF can handle boundary effects without much loss in efficiency and achieve superior performance over SRDCF in terms of accuracy and speed. Experiments are conducted on three benchmark datasets: OTB-2015, Temple-Color, and VOT-2016. Compared with SRDCF, STRCF with hand-crafted features provides a 5 times speedup and achieves a gain of 5.4% and 3.6% AUC score on OTB-2015 and Temple-Color, respectively. Moreover, STRCF combined with CNN features also performs favorably against state-of-the-art CNN-based trackers and achieves an AUC score of 68.3% on OTB-2015.
We present an automatic moment capture system that runs in real time on mobile cameras. The system is designed to run in the viewfinder mode and capture a burst sequence of frames before and after the shutter is pressed. For each frame, the system predicts in real time a "goodness" score, based on which the best moment in the burst can be selected immediately after the shutter is released, without any user interference. To solve the problem, we develop a highly efficient deep neural network ranking model, which implicitly learns a "latent relative attribute" space to capture subtle visual differences within a sequence of burst images. The overall goodness is then computed as a linear aggregation of the goodnesses of all the latent attributes. The latent relative attributes and the aggregation function can be seamlessly integrated in one fully convolutional network and trained in an end-to-end fashion. To obtain a compact model which can run on mobile devices in real time, we have explored and evaluated a wide range of network design choices, taking into account the constraints of model size, computational cost, and accuracy. Extensive studies show that the best frame predicted by our model matches users' top-1 choice (out of 11 on average) in $64.1\%$ of cases and their top-3 choices in $86.2\%$ of cases. Moreover, the model (only 0.47M bytes) can run in real time on mobile devices, e.g., only 13ms on iPhone 7 for one frame prediction.
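The selection step described above (a per-frame goodness computed as a linear aggregation of latent attribute scores, followed by picking the best frame in the burst) can be sketched as follows. The attribute values and weights are made-up placeholders, and the names `goodness` and `best_frame` are hypothetical; in the described system both the attributes and the aggregation are learned by the network.

```python
def goodness(attributes, weights, bias=0.0):
    """Linear aggregation of per-attribute scores into one goodness value."""
    if len(attributes) != len(weights):
        raise ValueError("attributes and weights must have the same length")
    return bias + sum(w * a for w, a in zip(weights, attributes))


def best_frame(burst_attributes, weights):
    """Return (index of the highest-scoring frame, list of all frame scores)."""
    scores = [goodness(a, weights) for a in burst_attributes]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores


# Placeholder burst of three frames with two latent attribute scores each.
burst = [[0.2, 0.9], [0.8, 0.4], [0.5, 0.5]]
index, scores = best_frame(burst, weights=[1.0, 0.5])
```

Since each frame's score depends only on that frame, scores can be computed as frames arrive and the best index is available as soon as the burst ends.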