
Community-based question answering (CQA) websites represent an important
source of information. As a result, the problem of matching the most valuable
answers to their corresponding questions has become an increasingly popular
research topic. We frame this task as a binary (relevant/irrelevant)
classification problem, and propose a Multiscale Matching model that inspects
the correlation between words and n-grams (word-to-n-grams) of different levels
of granularity. This is in addition to the word-to-word correlations used
in most prior work. In this way, our model is able to capture rich context
information conveyed in n-grams, and can therefore better differentiate good answers
from bad ones. Furthermore, we present an adversarial training framework to
iteratively generate challenging negative samples to fool the proposed
classification model. This is completely different from previous methods, where
negative samples are uniformly sampled from the dataset during the training
process. The proposed method is evaluated on the SemEval 2017 and Yahoo Answers
datasets and achieves state-of-the-art performance.

Daily engagement in life experiences is increasingly interwoven with mobile
device use. Screen capture at the scale of seconds is being used in behavioral
studies and to implement "just-in-time" health interventions. The increasing
psychological breadth of digital information will continue to make the actual
screens that people view a preferred if not required source of data about life
experiences. Effective and efficient Information Extraction and Retrieval from
digital screenshots is a crucial prerequisite to successful use of screen data.
In this paper, we present the experimental workflow we exploited to: (i)
preprocess a unique collection of screen captures, (ii) extract unstructured
text embedded in the images, (iii) organize image text and metadata based on a
structured schema, (iv) index the resulting document collection, and (v) allow
for Image Retrieval through a dedicated vertical search engine application. The
adopted procedure integrates different open source libraries for traditional
image processing, Optical Character Recognition (OCR), and Image Retrieval. Our
aim is to assess whether and how state-of-the-art methodologies can be applied
to this novel data set. We show how combining OpenCV-based preprocessing
modules with a Long Short-Term Memory (LSTM)-based release of Tesseract OCR,
without ad hoc training, led to a 74% character-level accuracy of the extracted
text. Further, we used the processed repository as a baseline for a dedicated
Image Retrieval system, for immediate use and application by behavioral
and prevention scientists. We discuss issues of Text Information Extraction and
Retrieval that are particular to the screenshot image case and suggest
important future work.
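
The abstract reports a character-level accuracy without stating how it is computed. A minimal sketch of one common definition follows, assuming accuracy = 1 - (edit distance / reference length); this definition and the function names `levenshtein` and `char_accuracy` are illustrative assumptions, not taken from the paper.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of single-character insertions, deletions, and substitutions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete a reference character
                            curr[j - 1] + 1,      # insert a hypothesis character
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def char_accuracy(ref: str, hyp: str) -> float:
    """Assumed definition: 1 - edit_distance / len(reference), floored at 0."""
    if not ref:
        return 1.0 if not hyp else 0.0
    return max(0.0, 1.0 - levenshtein(ref, hyp) / len(ref))
```

Here `ref` would be the manually transcribed screenshot text and `hyp` the OCR output; other normalisations (for example, dividing by the longer of the two strings) are equally plausible.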

The Soil Moisture Active Passive (SMAP) mission has delivered valuable
sensing of surface soil moisture since 2015. However, it has a short time span
and an irregular revisit schedule. Utilizing a state-of-the-art time-series deep
learning neural network, Long Short-Term Memory (LSTM), we created a system
that predicts SMAP level-3 soil moisture data with atmospheric forcing,
model-simulated moisture, and static physiographic attributes as inputs. The
system removes most of the bias with model simulations and improves predicted
moisture climatology, achieving small test root-mean-squared error (<0.035) and
high correlation coefficient (>0.87) for over 75% of the Continental United States,
including the forested Southeast. As the first application of LSTM in
hydrology, we show the proposed network avoids overfitting and is robust for
both temporal and spatial extrapolation tests. LSTM generalizes well across
regions with distinct climates and physiography. With high fidelity to SMAP,
LSTM shows great potential for hindcasting, data assimilation, and weather
forecasting.
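
The two evaluation metrics quoted above, root-mean-squared error and the correlation coefficient, can be sketched as follows. This is a plain-Python illustration under the assumption that the correlation is the Pearson coefficient between predicted and observed time series at one location; the function names are hypothetical.

```python
import math


def rmse(pred, obs):
    """Root-mean-squared error between two equal-length sequences."""
    return math.sqrt(sum((p - o) ** 2 for p, o in zip(pred, obs)) / len(obs))


def pearson_r(x, y):
    """Pearson linear correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)
```

Both functions would be applied per grid cell to the predicted and SMAP-observed soil moisture over the test period.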

Piezoelectric and ferroelectric properties in the two-dimensional (2D) limit
are highly desired for nanoelectronic, electromechanical, and optoelectronic
applications. Here we report the first experimental evidence of out-of-plane
piezoelectricity and ferroelectricity in van der Waals layered
${\alpha}$-In2Se3 nanoflakes. The non-centrosymmetric R3m symmetry of the
${\alpha}$-In2Se3 samples is confirmed by scanning transmission electron
microscopy, second-harmonic generation, and Raman spectroscopy measurements.
Domains with opposite polarizations are visualized by piezoresponse force
microscopy. Single-point poling experiments suggest that the polarization is
potentially switchable for ${\alpha}$-In2Se3 nanoflakes with thicknesses down
to ~10 nm. The piezotronic effect is demonstrated in two-terminal devices,
where the Schottky barrier can be modulated by the strain-induced
piezopotential. Our work on polar ${\alpha}$-In2Se3, one of the model 2D
piezoelectrics and ferroelectrics with simple crystal structures, shows its
great potential in electronic and photonic applications.

We propose a method for learning Markov network structures for continuous
data without invoking any assumptions about the distribution of the variables.
The method makes use of previous work on a nonparametric estimator for mutual
information which is used to create a nonparametric test for multivariate
conditional independence. This independence test is then combined with an
efficient constraint-based algorithm for learning the graph structure. The
performance of the method is evaluated on several synthetic data sets and it is
shown to learn considerably more accurate structures than competing methods
when the dependencies between the variables involve nonlinearities.

Mobile edge computing (MEC) is expected to be an effective solution to
deliver 360-degree virtual reality (VR) videos over wireless networks. In
contrast to previous computation-constrained MEC frameworks, which reduce the
computation-resource consumption at the mobile VR device by increasing the
communication-resource consumption, we develop a communications-constrained MEC
framework that reduces communication-resource consumption by increasing the
computation-resource consumption and exploiting the caching resources at the
mobile VR device. Specifically, according to the task
modularization, the MEC server can deliver only the components which have not
been stored in the VR device, and then the VR device uses the received
components and the corresponding cached components to construct the task,
resulting in low communication-resource consumption but high delay. The MEC
server can also compute the task by itself to reduce the delay; however, this
consumes more communication resources due to the delivery of the entire task.
Therefore, we propose a task scheduling strategy to decide which
computation model the MEC server should operate, in order to minimize the
communication-resource consumption under the delay constraint. Finally, we
discuss the tradeoffs between communications, computing, and caching in the
proposed system.
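
The scheduling decision described above can be illustrated with a toy sketch: among the available computation modes, pick the one with the lowest communication cost whose delay still meets the constraint. The `Mode` type, the `choose_mode` function, and the numeric costs are hypothetical and only illustrate the constraint structure; they are not the paper's actual strategy.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Mode:
    name: str
    comm_cost: float  # communication-resource consumption (arbitrary units)
    delay: float      # end-to-end delay (arbitrary units)


def choose_mode(modes: Sequence[Mode], max_delay: float) -> Optional[Mode]:
    """Return the feasible mode with minimum communication cost, or None."""
    feasible = [m for m in modes if m.delay <= max_delay]
    return min(feasible, key=lambda m: m.comm_cost) if feasible else None


# Illustrative numbers only: device-side construction sends fewer components
# but takes longer; server-side computation sends the entire task but is faster.
device_side = Mode("device constructs task", comm_cost=2.0, delay=30.0)
server_side = Mode("server computes task", comm_cost=10.0, delay=12.0)
```

With a loose delay budget the device-side mode is chosen; tightening the budget below its delay forces the server-side mode, and no mode is returned if neither meets the constraint.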

This paper introduces Quicksilver, a fast deformable image registration
method. Quicksilver registration for image pairs works by patch-wise prediction
of a deformation model based directly on image appearance. A deep
encoder-decoder network is used as the prediction model. While the prediction
strategy is general, we focus on predictions for the Large Deformation
Diffeomorphic Metric Mapping (LDDMM) model. Specifically, we predict the
momentum-parameterization of LDDMM, which facilitates a patch-wise prediction
strategy while maintaining the theoretical properties of LDDMM, such as
guaranteed diffeomorphic mappings for sufficiently strong regularization. We
also provide a probabilistic version of our prediction network which can be
sampled during the testing time to calculate uncertainties in the predicted
deformations. Finally, we introduce a new correction network which greatly
increases the prediction accuracy of an already existing prediction network. We
show experimental results for uni-modal atlas-to-image as well as uni-/multi-modal
image-to-image registrations. These experiments demonstrate that our
method accurately predicts registrations obtained by numerical optimization, is
very fast, achieves state-of-the-art registration results on four standard
validation datasets, and can jointly learn an image similarity measure.
Quicksilver is freely available as open-source software.

We present an end-to-end, multimodal, fully convolutional network for
extracting semantic structures from document images. We consider document
semantic structure extraction as a pixel-wise segmentation task, and propose a
unified model that classifies pixels based not only on their visual appearance,
as in the traditional page segmentation task, but also on the content of
underlying text. Moreover, we propose an efficient synthetic document
generation process that we use to generate pretraining data for our network.
Once the network is trained on a large set of synthetic documents, we fine-tune
the network on unlabeled real documents using a semi-supervised approach. We
systematically study the optimum network architecture and show that both our
multimodal approach and the synthetic data pretraining significantly boost the
performance.

We introduce a deep encoder-decoder architecture for image deformation
prediction from multimodal images. Specifically, we design an image-patch-based
deep network that jointly learns (i) an image similarity measure and (ii) the
relationship between image patches and deformation parameters. While our method
can be applied to general image registration formulations, we focus on the
Large Deformation Diffeomorphic Metric Mapping (LDDMM) registration model. By
predicting the initial momentum of the shooting formulation of LDDMM, we
preserve its mathematical properties and drastically reduce the computation
time, compared to optimization-based approaches. Furthermore, we create a
Bayesian probabilistic version of the network that allows evaluation of
registration uncertainty via sampling of the network at test time. We evaluate
our method on a 3D brain MRI dataset using both T1- and T2-weighted images. Our
experiments show that our method generates accurate predictions and that
learning the similarity measure leads to more consistent registrations than
relying on generic multimodal image similarity measures, such as mutual
information. Our approach is an order of magnitude faster than
optimization-based LDDMM.

Registration involving one or more images containing pathologies is
challenging, as standard image similarity measures and spatial transforms
cannot account for common changes due to pathologies. Low-rank/Sparse (LRS)
decomposition removes pathologies prior to registration; however, LRS is
memory-demanding and slow, which limits its use on larger data sets.
Additionally, LRS blurs normal tissue regions, which may degrade registration
performance. This paper proposes an efficient alternative to LRS: (1) normal
tissue appearance is captured by principal component analysis (PCA) and (2)
blurring is avoided by an integrated model for pathology removal and image
reconstruction. Results on synthetic and BRATS 2015 data demonstrate its
utility.

Word embeddings and convolutional neural networks (CNN) have attracted
extensive attention in various classification tasks for Twitter, e.g. sentiment
classification. However, the effect of the configuration used to train and
generate the word embeddings on the classification performance has not been
studied in the existing literature. In this paper, using a Twitter election
classification task that aims to detect election-related tweets, we investigate
the impact of the background dataset used to train the embedding models, the
context window size and the dimensionality of word embeddings on the
classification performance. By comparing the classification results of two word
embedding models, which are trained using different background corpora (e.g.
Wikipedia articles and Twitter microposts), we show that the background data
type should align with the Twitter classification dataset to achieve a better
performance. Moreover, by evaluating the results of word embedding models
trained using various context window sizes and dimensionalities, we find that
larger context window and dimension sizes are preferable for improving
performance. Our experimental results also show that using word embeddings and
CNN leads to statistically significant improvements over various baselines such
as random, SVM with TF-IDF and SVM with word embeddings.

Physical library collections are valuable and long-standing resources for
knowledge and learning. However, managing books on a large bookshelf and
finding books on it often lead to tedious manual work, especially for large
book collections where books might be missing or misplaced. Recently, deep
neural models, such as Convolutional Neural Networks (CNN) and Recurrent Neural
Networks (RNN) have achieved great success for scene text detection and
recognition. Motivated by these recent successes, we aim to investigate their
viability in facilitating book management, a task that introduces further
challenges including large amounts of cluttered scene text, distortion, and
varied lighting conditions. In this paper, we present a library inventory
building and retrieval system based on scene text reading methods. We
specifically design our scene text recognition model using rich supervision to
accelerate training and achieve state-of-the-art performance on several
benchmark datasets. Our proposed system has the potential to greatly reduce the
amount of human labor required in managing book inventories as well as the
space needed to store book information.

We present a method to predict image deformations based on patch-wise image
appearance. Specifically, we design a patch-based deep encoder-decoder network
which learns the pixel/voxel-wise mapping between image appearance and
registration parameters. Our approach can predict general deformation
parameterizations; however, we focus on the large deformation diffeomorphic
metric mapping (LDDMM) registration model. By predicting the LDDMM
momentum-parameterization we retain the desirable theoretical properties of
LDDMM, while reducing computation time by orders of magnitude: combined with
patch pruning, we achieve a 1500x/66x speed-up compared to GPU-based
optimization for 2D/3D image registration. Our approach has better prediction
accuracy than predicting deformation or velocity fields and results in
diffeomorphic transformations. Additionally, we create a Bayesian probabilistic
version of our network, which allows evaluation of deformation field
uncertainty through Monte Carlo sampling using dropout at test time. We show
that deformation uncertainty highlights areas of ambiguous deformations. We
test our method on the OASIS brain image dataset in 2D and 3D.

A procedure is introduced to recognise sunspots automatically in solar
full-disk photosphere images obtained from Huairou Solar Observing Station,
National Astronomical Observatories of China. The images are first
preprocessed through a Gaussian algorithm. Sunspots are then recognised by the
morphological Bot-hat operation and Otsu thresholding. Wrongly selected sunspots
are eliminated by a criterion based on sunspot properties. In addition, in order to
calculate the sunspot areas and the solar centre, the solar limb is extracted
by a procedure using morphological closing and erosion operations and setting
an adaptive threshold. Results of sunspot recognition reveal that the number of
sunspots detected by our procedure agrees well with the
manual method. The sunspot recognition rate is 95% and the error rate is 1.2%. The
sunspot areas calculated by our method have high correlation (95%) with the
area data from USAF/NOAA.
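
The bottom-hat-plus-Otsu step can be illustrated on a one-dimensional intensity profile. This is a minimal pure-Python sketch, assuming a flat structuring element of radius `half` and 8-bit intensities; the paper's actual two-dimensional structuring element and parameters are not given in the abstract, and all function names here are hypothetical.

```python
def _slide(values, half, fn):
    """Apply fn (max or min) over a window of radius `half`, truncated at the edges."""
    n = len(values)
    return [fn(values[max(0, i - half):min(n, i + half + 1)]) for i in range(n)]


def bottom_hat_1d(values, half=1):
    """Bottom-hat = closing(values) - values; closing = dilation followed by erosion."""
    closed = _slide(_slide(values, half, max), half, min)
    return [c - v for c, v in zip(closed, values)]


def otsu_threshold(pixels, levels=256):
    """Return the level t maximising between-class variance (class 0 is pixels <= t)."""
    hist = [0] * levels
    for p in pixels:
        hist[p] += 1
    total = len(pixels)
    sum_all = sum(i * h for i, h in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = sum0 = 0
    for t in range(levels):
        w0 += hist[t]
        sum0 += t * hist[t]
        w1 = total - w0
        if w0 == 0 or w1 == 0:
            continue  # one class is empty, so the variance is undefined
        mu0, mu1 = sum0 / w0, (sum_all - sum0) / w1
        var_between = w0 * w1 * (mu0 - mu1) ** 2
        if var_between > best_var:
            best_t, best_var = t, var_between
    return best_t


# Toy intensity profile: a dark "spot" on a bright background.
profile = [100, 100, 20, 100, 100]
response = bottom_hat_1d(profile)           # large where the profile dips
threshold = otsu_threshold(response)
spots = [r > threshold for r in response]   # True marks candidate spot pixels
```

The bottom-hat transform turns dark features narrower than the structuring element into bright peaks, which is why a single global Otsu threshold on its output can separate candidate spots from the background.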

This paper introduces a convenient strategy for coding and predicting
sequences of independent, identically distributed random variables generated
from a large alphabet of size $m$. In particular, the size of the sample is
allowed to be variable. The employment of a Poisson model and tilting method
simplifies the implementation and analysis through independence. The resulting
strategy is optimal within the class of distributions satisfying a moment
condition, and is close to optimal for the class of all i.i.d. distributions on
strings of a given length. Moreover, the method can be used to code and predict
strings with a condition on the tail of the ordered counts. It can also be
applied to distributions in an envelope class.

Based on several magnetic nonpotentiality parameters derived from the vector
photospheric active region magnetograms obtained with the Solar Magnetic Field
Telescope at the Huairou Solar Observing Station over two solar cycles, a
machine learning model has been constructed to predict the occurrence of flares
in the corresponding active region within a certain time window. The Support
Vector Classifier, a widely used general classifier, is applied to build and
test the prediction models. Several classical verification measures are adopted
to assess the quality of the predictions. We investigate different flare levels
within various time windows, and thus it is possible to estimate the rough
classes and erupting times of flares for particular active regions. Several
combinations of predictors have been tested in the experiments. The True Skill
Statistics are higher than 0.36 in 97% of cases and the Heidke Skill Scores
range from 0.23 to 0.48. The predictors derived from longitudinal magnetic
fields perform well; however, they are less sensitive in predicting large
flares. Employing the nonpotentiality predictors from vector fields improves
the performance of predicting large flares of magnitude $\geq$M5.0 and
$\geq$X1.0.
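
The two verification measures quoted above can be computed from a 2x2 contingency table. The sketch below uses the standard textbook definitions of the True Skill Statistic and the Heidke Skill Score; whether the paper uses exactly these variants is an assumption, and the function names are hypothetical.

```python
def true_skill_statistic(tp: int, fn: int, fp: int, tn: int) -> float:
    """TSS = hit rate - false alarm rate; ranges from -1 to 1."""
    return tp / (tp + fn) - fp / (fp + tn)


def heidke_skill_score(tp: int, fn: int, fp: int, tn: int) -> float:
    """HSS: improvement over the number of correct forecasts expected by chance."""
    numerator = 2 * (tp * tn - fn * fp)
    denominator = (tp + fn) * (fn + tn) + (tp + fp) * (fp + tn)
    return numerator / denominator
```

Here `tp` counts flaring regions predicted to flare, `fn` missed flares, `fp` false alarms, and `tn` correctly predicted quiet regions. A forecast with no skill scores 0 on both measures, and a perfect forecast scores 1.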

A statistical study is carried out on the photospheric magnetic
nonpotentiality in solar active regions and its relationship with associated
flares. We select 2173 photospheric vector magnetograms from 1106 active
regions observed by the Solar Magnetic Field Telescope at Huairou Solar
Observing Station, National Astronomical Observatories of China, in the period
of 1988-2008, which covers most of the 22nd and 23rd solar cycles. We have
computed the mean planar magnetic shear angle ($\bar{\Delta\phi}$), mean shear
angle of the vector magnetic field ($\bar{\Delta\psi}$), mean absolute vertical
current density ($\bar{J_{z}}$), mean absolute current helicity density
($\bar{h_{c}}$), absolute twist parameter ($\alpha_{av}$), mean free magnetic
energy density ($\bar{\rho_{free}}$), effective distance of the longitudinal
magnetic field ($d_{E}$), and modified effective distance ($d_{Em}$) of each
photospheric vector magnetogram. Parameters $\bar{h_{c}}$, $\bar{\rho_{free}}$,
and $d_{Em}$ show higher correlation with the evolution of the solar cycle. The
Pearson linear correlation coefficients between these three parameters and the
yearly mean sunspot number are all larger than 0.59. Parameters
$\bar{\Delta\phi}$, $\bar{\Delta\psi}$, $\bar{J_{z}}$, $\alpha_{av}$, and $d_{E}$
show only weak correlations with the solar cycle, though the nonpotentiality
and the complexity of active regions are greater in the activity maximum
periods than in the minimum periods. All of the eight parameters show positive
correlations with the flare productivity of active regions, and the combination
of different nonpotentiality parameters may be effective in predicting the
flaring probability of active regions.

It was realized two decades ago that the two-dimensional diffusive Fermi
liquid phase is unstable against arbitrarily weak electron-electron
interactions. Recently, using the nonlinear sigma model developed by
Finkelstein, several authors have shown that the instability leads to a
ferromagnetic state. In this paper, we consider diffusing electrons interacting
through a ferromagnetic exchange interaction. Using the Hartree-Fock
approximation to directly calculate the electron self-energy, we find that the
total energy is minimized by a finite ferromagnetic moment for arbitrarily weak
interactions in two dimensions and for interaction strengths exceeding a
critical value proportional to the conductivity in three dimensions. We discuss the
relation between our results and previous ones.

We show that several well-known one-dimensional quantum systems possess a
hidden nonlocal supersymmetry. The simplest example is the open XXZ spin chain
with \Delta=-1/2. We use the supersymmetry to place lower bounds on the ground
state energy with various boundary conditions. For an odd number of sites in
the periodic chain, and with a particular boundary magnetic field in the open
chain, we can derive the ground state energy exactly. The supersymmetry thus
explains why it is possible to solve the Bethe equations for the ground state
in these cases. We also show that a similar spacetime supersymmetry holds for
the t-J model at its integrable ferromagnetic point, where the spacetime
supersymmetry and the Hamiltonian it yields coexist with a global u(1|2) graded
Lie algebra symmetry. Possible generalizations to other algebras are discussed.

In a dirty metal, electron-electron interactions in the spin-triplet channel
lead to singular corrections to a variety of physical quantities. We show that
these singularities herald the emergence of ferromagnetism. We calculate the
effective action for the magnetic moment of weakly interacting electrons in a
dirty metal and show that a state with finite ferromagnetic moment minimizes
this effective action. The saddle-point approximation is exact in an
appropriate large-N limit. We discuss the physics of the ferromagnetic state
with particular regard to thermal fluctuations and localization effects.

We study the behavior of the Hall coefficient, $R_H$, in a system exhibiting
$d_{x^2-y^2}$ density-wave (DDW) order in a regime in which the carrier
concentration, $x$, is tuned to approach a quantum critical point at which the
order is destroyed. At the mean-field level, we find that $n_{\rm Hall}=1/R_H$
evinces a sharp signature of the transition. There is a kink in $n_{\rm Hall}$
at the critical value of the carrier concentration, $x_c$; as the critical
point is approached from the ordered side, the slope of $n_{\rm Hall}$
diverges. Hall transport experiments in the cuprates, at high magnetic fields
sufficient to destroy superconductivity, should reveal this effect.

We compute the electrical and thermal conductivities and Hall conductivities
of the $d$-density wave (DDW) state in the low-temperature
impurity-scattering-dominated regime for low dopings, at which they are
dominated by nodal quasiparticles. We show that the longitudinal conductivity
in this limit in the DDW state is not Drude-like. However, the thermal
conductivity is Drude-like; this is a reflection of the discrepancy between
electrical and thermal transport at finite frequency in the DDW state. An
extreme example of this occurs in the $\mu=0$, $\tau\to\infty$ limit, where
there is a strong violation of the Wiedemann-Franz law:
${\kappa_{xx}}/{\sigma_{xx}} \propto {T^2}$ at $\omega=0$ and
${\kappa_{xx}}/{\sigma_{xx}}=0$ at finite frequency. The DDW electrical and
thermal Hall conductivities are linear in the magnetic field, $B$, for weak
fields. The formation of Landau levels at the nodes leads to the quantization
of these Hall conductivities at high fields. In all of these ways, the
quasiparticles of the DDW state differ from those of the $d_{x^2-y^2}$
superconducting (DSC) state.
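
For comparison, the Wiedemann-Franz law in its standard form fixes the ratio of thermal to electrical conductivity to be linear in temperature, with the Lorenz number $L_0$ as the constant of proportionality. The $T^2$ scaling and the vanishing finite-frequency ratio quoted above both depart from this form. The block below simply restates the textbook law in the notation of this abstract:

```latex
\frac{\kappa_{xx}}{\sigma_{xx}\,T} = L_0 = \frac{\pi^2}{3}\left(\frac{k_B}{e}\right)^2
\quad\Longrightarrow\quad
\frac{\kappa_{xx}}{\sigma_{xx}} \propto T .
```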