
We investigate whether quantum annealers with select chip layouts can
outperform classical computers in reinforcement learning tasks. We associate a
transverse field Ising spin Hamiltonian with a layout of qubits similar to that
of a deep Boltzmann machine (DBM) and use simulated quantum annealing (SQA) to
numerically simulate quantum sampling from this system. We design a
reinforcement learning algorithm in which the set of visible nodes representing
the states and actions of an optimal policy are the first and last layers of
the deep network. In absence of a transverse field, our simulations show that
DBMs are trained more effectively than restricted Boltzmann machines (RBM) with
the same number of nodes. We then develop a framework for training the network
as a quantum Boltzmann machine (QBM) in the presence of a significant
transverse field for reinforcement learning. This method also outperforms the
reinforcement learning method that uses RBMs.

In this paper we generalize the 1bit matrix completion problem to higher
order tensors. We prove that when $r=O(1)$ a bounded rank$r$, order$d$ tensor
$T$ in $\mathbb{R}^{N} \times \mathbb{R}^{N} \times \cdots \times
\mathbb{R}^{N}$ can be estimated efficiently by only $m=O(Nd)$ binary
measurements by regularizing its maxqnorm and Mnorm as surrogates for its
rank. We prove that similar to the matrix case, i.e., when $d=2$, the sample
complexity of recovering a lowrank tensor from 1bit measurements of a subset
of its entries is the same as recovering it from unquantized measurements.
Moreover, we show the advantage of using 1bit tensor completion over
matricization both theoretically and numerically. Specifically, we show how the
1bit measurement model can be used for contextaware recommender systems.

Recent theoretical and experimental results suggest the possibility of using
current and nearfuture quantum hardware in challenging sampling tasks. In this
paper, we introduce free energybased reinforcement learning (FERL) as an
application of quantum hardware. We propose a method for processing a quantum
annealer's measured qubit spin configurations in approximating the free energy
of a quantum Boltzmann machine (QBM). We then apply this method to perform
reinforcement learning on the gridworld problem using the DWave 2000Q quantum
annealer. The experimental results show that our technique is a promising
method for harnessing the power of quantum sampling in reinforcement learning
tasks.

In this paper we address the recovery conditions of weighted $\ell_p$
minimization for signal reconstruction from compressed sensing measurements
when partial support information is available. We show that weighted $\ell_p$
minimization with $0<p<1$ is stable and robust under weaker sufficient
conditions compared to weighted $\ell_1$ minimization. Moreover, the sufficient
recovery conditions of weighted $\ell_p$ are weaker than those of regular
$\ell_p$ minimization if at least $50%$ of the support estimate is accurate. We
also review some algorithms which exist to solve the nonconvex $\ell_p$
problem and illustrate our results with numerical experiments.