
Neural-network models of high-level brain functions such as memory recall and
reasoning often rely on the presence of stochasticity. The majority of these
models assumes that each neuron in the functional network is equipped with its
own private source of randomness, often in the form of uncorrelated external
noise. However, both in vivo and in silico, the number of noise sources is
limited due to space and bandwidth constraints. Hence, neurons in large
networks usually need to share noise sources. Here, we show that the resulting
shared-noise correlations can significantly impair the performance of
stochastic network models. We further demonstrate that this problem can be
overcome by using deterministic recurrent neural networks as sources of
stochasticity, exploiting the decorrelating effect of inhibitory feedback.
Consequently, even a single recurrent network of a few hundred neurons can
serve as a natural noise source for large ensembles of functional networks,
each comprising thousands of units. We test the proposed framework for a
diverse set of networks with different dimensionalities and entropies. A
network reproducing handwritten digits with distinct predefined frequencies
demonstrates the practical relevance. Finally, we show that the same design
transfers to functional networks of spiking neurons.
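
The origin of shared-noise correlations can be illustrated with a minimal sketch. It is not the model used in the study: it assumes that each neuron simply sums `k_inputs` independent, unit-variance Gaussian sources, of which `n_shared` are common to both neurons. All function and parameter names are hypothetical. Under these assumptions the correlation of the two summed inputs is expected to be close to `n_shared / k_inputs`.

```python
import numpy as np

def shared_noise_correlation(k_inputs, n_shared, n_steps=20000, seed=0):
    """Empirical correlation of the summed noise input of two neurons.

    Each neuron sums k_inputs independent unit-variance sources, n_shared of
    which are common to both. Names and values are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    n_sources = 2 * k_inputs - n_shared
    sources = rng.standard_normal((n_steps, n_sources))
    # Neuron A reads the first k_inputs sources, neuron B the last k_inputs;
    # the two index ranges overlap in exactly n_shared sources.
    input_a = sources[:, :k_inputs].sum(axis=1)
    input_b = sources[:, k_inputs - n_shared:].sum(axis=1)
    return np.corrcoef(input_a, input_b)[0, 1]

corr_half_shared = shared_noise_correlation(k_inputs=100, n_shared=50)
corr_private = shared_noise_correlation(k_inputs=100, n_shared=0)
```

With private sources only, the measured correlation should fluctuate around zero; it grows with the fraction of shared sources.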

Spiking networks that perform probabilistic inference have been proposed both
as models of cortical computation and as candidates for solving problems in
machine learning. However, the evidence for spike-based computation being in
any way superior to non-spiking alternatives remains scarce. We propose that
short-term plasticity can provide spiking networks with distinct computational
advantages compared to their classical counterparts. In this work, we use
networks of leaky integrate-and-fire neurons that are trained to perform both
discriminative and generative tasks in their forward and backward information
processing paths, respectively. During training, the energy landscape
associated with their dynamics becomes highly diverse, with deep attractor
basins separated by high barriers. Classical algorithms cope with such
landscapes by employing various tempering techniques, which are computationally
demanding and require global state updates. We demonstrate how similar results
can be achieved in spiking networks endowed with local short-term synaptic
plasticity. Additionally, we discuss how these networks can even outperform
tempering-based approaches when the training data is imbalanced. We thereby
show how biologically inspired, local, spike-triggered synaptic dynamics based
simply on a limited pool of synaptic resources can allow spiking networks to
outperform their non-spiking relatives.
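
The idea of spike-triggered dynamics on a limited pool of synaptic resources can be sketched as follows. This is a simplified depression-only model in the spirit of Tsodyks-Markram short-term plasticity, chosen here as an assumption; the abstract does not specify the exact synapse model, and the names `u_release` and `tau_rec` as well as their values are hypothetical.

```python
import math

def depressing_synapse_efficacies(spike_times, u_release=0.2, tau_rec=0.5):
    """Relative efficacy of each presynaptic spike for a depressing synapse.

    Each spike consumes a fraction u_release of the available resources;
    between spikes, resources recover exponentially toward 1 with time
    constant tau_rec (same time unit as spike_times). Illustrative values.
    """
    resources = 1.0
    last_time = None
    efficacies = []
    for t in spike_times:
        if last_time is not None:
            # Exponential recovery of the resource pool since the last spike.
            resources = 1.0 - (1.0 - resources) * math.exp(-(t - last_time) / tau_rec)
        efficacies.append(u_release * resources)
        resources -= u_release * resources  # resources used up by this spike
        last_time = t
    return efficacies

burst = depressing_synapse_efficacies([0.0, 0.05, 0.10, 0.15, 0.20])
after_pause = depressing_synapse_efficacies([0.0, 0.05, 100.0])
```

During a burst the efficacy decreases from spike to spike, which weakens the currently active attractor; after a long pause the synapse recovers its full strength.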

A steadily increasing body of evidence suggests that the brain performs
probabilistic inference to interpret and respond to sensory input and that
trial-to-trial variability in neural activity plays an important role. The
neural sampling hypothesis interprets stochastic neural activity as sampling
from an underlying probability distribution and has been shown to be compatible
with biologically observed firing patterns. In many studies, uncorrelated noise
is used as a source of stochasticity, discounting the fact that cortical
neurons may share a significant portion of their presynaptic partners, which
impacts their computation. This is relevant in biology and for implementations
of neural networks where bandwidth constraints limit the amount of independent
noise. When neurons receive correlated noise, the resulting correlations cannot
be directly countered by changes in synaptic weights $W$. We show that this is
contingent on the chosen coding: when changing the state space from
$z\in\{0,1\}$ to $z'\in\{-1,1\}$, correlated noise has the exact same effect as
changes in $W'$. The translation of the problem to the $\{-1,1\}$ space allows
us to find a weight configuration that compensates for the induced correlations.
For an artificial embedding of sampling networks, this allows a straightforward
transfer between platforms with different bandwidth constraints. The existence
of the mapping is important for learning. Since in the $\{-1,1\}$-coding the
correlated noise can be compensated by parameter changes and the probability
distribution can be kept invariant when changing the coding, the distribution
will be found in the $\{0,1\}$-coding as well during learning, as demonstrated
in simulations. In conclusion, sampling spiking networks are impervious to noise
correlations when trained. If such computation happens in cortex, network
plasticity does not need to take account of shared noise inputs.
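
The invariance of the distribution under a change of coding can be checked numerically. The sketch below assumes a Boltzmann distribution $p(z)\propto\exp(\frac{1}{2}z^T W z + b^T z)$ with symmetric, zero-diagonal $W$, which the abstract does not state explicitly. Substituting $z = (z'+1)/2$ then gives $W' = W/4$ and $b'_i = b_i/2 + \frac{1}{4}\sum_j W_{ij}$. The function names are hypothetical, and the compensation of correlated noise itself is not modeled here.

```python
import itertools
import numpy as np

def boltzmann(states, weights, biases):
    """Normalized p(s) proportional to exp(0.5 * s^T W s + b^T s)."""
    log_p = 0.5 * np.einsum("ki,ij,kj->k", states, weights, states) + states @ biases
    p = np.exp(log_p - log_p.max())
    return p / p.sum()

def to_pm1_coding(weights, biases):
    """Map {0,1}-coding parameters (W, b) to {-1,1}-coding parameters (W', b')."""
    return weights / 4.0, biases / 2.0 + weights.sum(axis=1) / 4.0

rng = np.random.default_rng(1)
n = 4
upper = np.triu(rng.normal(size=(n, n)), 1)
w = upper + upper.T              # symmetric weights with zero diagonal
b = rng.normal(size=n)

z = np.array(list(itertools.product([0, 1], repeat=n)), dtype=float)
z_pm1 = 2.0 * z - 1.0            # same states, expressed in {-1,1}
w_pm1, b_pm1 = to_pm1_coding(w, b)

p_01 = boltzmann(z, w, b)
p_pm1 = boltzmann(z_pm1, w_pm1, b_pm1)
```

Both parameter sets assign the same probability to every state, so a solution found in one coding corresponds to a solution in the other.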

Despite being originally inspired by the central nervous system, artificial
neural networks have diverged from their biological archetypes as they have
been remodeled to fit particular tasks. In this paper, we review several
possibilities to reverse-map these architectures to biologically more realistic
spiking networks with the aim of emulating them on fast, low-power neuromorphic
hardware. Since many of these devices employ analog components, which cannot be
perfectly controlled, finding ways to compensate for the resulting effects
represents a key challenge. Here, we discuss three different strategies to
address this problem: the addition of auxiliary network components for
stabilizing activity, the utilization of inherently robust architectures and a
training method for hardware-emulated networks that functions without perfect
knowledge of the system's dynamics and parameters. For all three scenarios, we
corroborate our theoretical considerations with experimental results on
accelerated analog neuromorphic platforms.

How spiking networks are able to perform probabilistic inference is an
intriguing question, not only for understanding information processing in the
brain, but also for transferring these computational principles to neuromorphic
silicon circuits. A number of computationally powerful spiking network models
have been proposed, but most of them have only been tested, under ideal
conditions, in software simulations. Any implementation in an analog, physical
system, be it in vivo or in silico, will generally lead to distorted dynamics
due to the physical properties of the underlying substrate. In this paper, we
discuss several such distortive effects that are difficult or impossible to
remove by classical calibration routines or parameter training. We then argue
that hierarchical networks of leaky integrate-and-fire neurons can offer the
required robustness for physical implementation and demonstrate this with both
software simulations and emulation on an accelerated analog neuromorphic
device.

Emulating spiking neural networks on analog neuromorphic hardware offers
several advantages over simulating them on conventional computers, particularly
in terms of speed and energy consumption. However, this usually comes at the
cost of reduced control over the dynamics of the emulated networks. In this
paper, we demonstrate how iterative training of a hardware-emulated network can
compensate for anomalies induced by the analog substrate. We first convert a
deep neural network trained in software to a spiking network on the BrainScaleS
wafer-scale neuromorphic system, thereby enabling an acceleration factor of
10 000 compared to the biological time domain. This mapping is followed by
in-the-loop training, where in each training step, the network activity is
first recorded in hardware and then used to compute the parameter updates in
software via backpropagation. An essential finding is that the parameter
updates do not have to be precise, but only need to approximately follow the
correct gradient, which simplifies the computation of updates. Using this
approach, after only several tens of iterations, the spiking network shows an
accuracy close to the ideal software-emulated prototype. The presented
techniques show that deep spiking networks emulated on analog neuromorphic
devices can attain good computational performance despite the inherent
variations of the analog substrate.
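
The in-the-loop principle can be illustrated with a toy example. It is a sketch under strong assumptions, not the procedure run on BrainScaleS: a single-layer logistic classifier stands in for the deep spiking network, and a fixed, unknown per-weight `gain` and `offset` (both hypothetical) stand in for substrate distortions. The software update uses the recorded activity but ignores the distortions, so it only approximately follows the true gradient.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: two Gaussian blobs (illustrative only).
n_samples, n_features = 200, 2
targets = rng.integers(0, 2, size=n_samples).astype(float)
inputs = rng.normal(size=(n_samples, n_features)) + 2.0 * targets[:, None] - 1.0

# Stand-in for the analog substrate: fixed distortions unknown to the trainer.
gain = rng.uniform(0.7, 1.3, size=n_features)
offset = 0.3

def hardware_forward(weights):
    """Pretend to record output activity from the distorted 'hardware'."""
    return 1.0 / (1.0 + np.exp(-(inputs @ (weights * gain) + offset)))

def cross_entropy(outputs):
    """Mean binary cross-entropy between recorded outputs and targets."""
    eps = 1e-12
    return -np.mean(targets * np.log(outputs + eps)
                    + (1.0 - targets) * np.log(1.0 - outputs + eps))

weights = np.zeros(n_features)
losses = []
for step in range(50):
    recorded = hardware_forward(weights)   # 1) record activity "in hardware"
    losses.append(cross_entropy(recorded))
    # 2) compute the update in software with an idealized model that
    #    ignores gain and offset, i.e. an approximate gradient
    approx_grad = inputs.T @ (recorded - targets) / n_samples
    weights -= 0.5 * approx_grad
losses.append(cross_entropy(hardware_forward(weights)))
```

Because the forward pass always runs on the distorted system, the loss measured there still decreases even though the update rule never sees the distortions.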

The highly variable dynamics of neocortical circuits observed in vivo have
been hypothesized to represent a signature of ongoing stochastic inference but
stand in apparent contrast to the deterministic response of neurons measured in
vitro. Based on a propagation of the membrane autocorrelation across spike
bursts, we provide an analytical derivation of the neural activation function
that holds for a large parameter space, including the high-conductance state.
On this basis, we show how an ensemble of leaky integrate-and-fire neurons with
conductance-based synapses embedded in a spiking environment can attain the
correct firing statistics for sampling from a well-defined target distribution.
For recurrent networks, we examine convergence toward stationarity in computer
simulations and demonstrate sample-based Bayesian inference in a mixed
graphical model. This points to a new computational role of high-conductance
states and establishes a rigorous link between deterministic neuron models and
functional stochastic dynamics on the network level.

The apparent stochasticity of in-vivo neural circuits has long been
hypothesized to represent a signature of ongoing stochastic inference in the
brain. More recently, a theoretical framework for neural sampling has been
proposed, which explains how sample-based inference can be performed by
networks of spiking neurons. One particular requirement of this approach is
that the neural response function closely follows a logistic curve.
Analytical approaches to calculating neural response functions have been the
subject of many theoretical studies. In order to make the problem tractable,
particular assumptions regarding the neural or synaptic parameters are usually
made. However, biologically significant activity regimes exist which are not
covered by these approaches: Under strong synaptic bombardment, as is often the
case in cortex, the neuron is shifted into a high-conductance state (HCS)
characterized by a small membrane time constant. In this regime, synaptic time
constants and refractory periods dominate membrane dynamics.
The core idea of our approach is to separately consider two different "modes"
of spiking dynamics: burst spiking and transient quiescence, in which the
neuron does not spike for longer periods. We treat the former by propagating
the PDF of the effective membrane potential from spike to spike within a burst,
while using a diffusion approximation for the latter. We find that our
prediction of the neural response function closely matches simulation data.
Moreover, in the HCS scenario, we show that the neural response function
becomes symmetric and can be well approximated by a logistic function, thereby
providing the correct dynamics in order to perform neural sampling. We hereby
provide not only a normative framework for Bayesian inference in cortex, but
also powerful applications of low-power, accelerated neuromorphic systems to
relevant machine learning tasks.

The means by which cortical neural networks are able to efficiently solve
inference problems remains an open question in computational neuroscience.
Recently, abstract models of Bayesian computation in neural circuits have been
proposed, but they lack a mechanistic interpretation at the single-cell level.
In this article, we describe a complete theoretical framework for building
networks of leaky integrate-and-fire neurons that can sample from arbitrary
probability distributions over binary random variables. We test our framework
for a model inference task based on a psychophysical phenomenon (the
Knill-Kersten optical illusion) and further assess its performance when applied
to randomly generated distributions. As the local computations performed by the
network strongly depend on the interaction between neurons, we compare several
types of couplings mediated by either single synapses or interneuron chains.
Due to its robustness to substrate imperfections such as parameter noise and
background noise correlations, our model is particularly interesting for
implementation on novel, neuro-inspired computing architectures, which can
thereby serve as a fast, low-power substrate for solving real-world inference
problems.
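
Sampling from a distribution over binary random variables with a logistic activation function can be sketched at an abstract level. The code below uses plain Gibbs sampling on a small Boltzmann distribution as a stand-in; it does not model leaky integrate-and-fire dynamics, refractoriness, or synaptic interactions, and the weights, biases, and function names are hypothetical.

```python
import itertools
import numpy as np

def gibbs_sample(weights, biases, n_sweeps=20000, seed=0):
    """Sample binary states z in {0,1}^n via logistic single-unit updates."""
    rng = np.random.default_rng(seed)
    n = len(biases)
    z = np.zeros(n)
    samples = np.empty((n_sweeps, n))
    for t in range(n_sweeps):
        for k in range(n):
            u_k = biases[k] + weights[k] @ z  # abstract "membrane potential"
            # Unit k is active with probability given by the logistic function.
            z[k] = float(rng.random() < 1.0 / (1.0 + np.exp(-u_k)))
        samples[t] = z
    return samples

def exact_marginals(weights, biases):
    """Exact p(z_k = 1) by enumerating all 2^n states."""
    states = np.array(list(itertools.product([0, 1], repeat=len(biases))), dtype=float)
    log_p = 0.5 * np.einsum("ki,ij,kj->k", states, weights, states) + states @ biases
    p = np.exp(log_p - log_p.max())
    p /= p.sum()
    return p @ states

# Symmetric weights with zero diagonal (illustrative values).
w = np.array([[0.0, 1.0, -0.5],
              [1.0, 0.0, 0.5],
              [-0.5, 0.5, 0.0]])
b = np.array([-0.5, 0.2, 0.0])

sampled = gibbs_sample(w, b).mean(axis=0)
exact = exact_marginals(w, b)
```

For this small example the sampled marginals can be compared directly with the exact ones; for larger networks only the sampled estimate is tractable.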

Advancing the size and complexity of neural network models leads to an
ever-increasing demand for computational resources for their simulation.
Neuromorphic devices offer a number of advantages over conventional computing
architectures, such as high emulation speed or low power consumption, but this
usually comes at the price of reduced configurability and precision. In this
article, we investigate the consequences of several such factors that are
common to neuromorphic devices, more specifically limited hardware resources,
limited parameter configurability and parameter variations. Our final aim is to
provide an array of methods for coping with such inevitable distortion
mechanisms. As a platform for testing our proposed strategies, we use an
executable system specification (ESS) of the BrainScaleS neuromorphic system,
which has been designed as a universal emulation backend for neuroscientific
modeling. We address the most essential limitations of this device in detail
and study their effects on three prototypical benchmark network models within a
well-defined, systematic workflow. For each network model, we start by defining
quantifiable functionality measures by which we then assess the effects of
typical hardware-specific distortion mechanisms, both in idealized software
simulations and on the ESS. For those effects that cause unacceptable
deviations from the original network dynamics, we suggest generic compensation
mechanisms and demonstrate their effectiveness. Both the suggested workflow and
the investigated compensation mechanisms are largely backend-independent and
do not require additional hardware configurability beyond the one required to
emulate the benchmark networks in the first place. We hereby provide a generic
methodological environment for configurable neuromorphic devices that are
targeted at emulating large-scale, functional neural networks.

The seemingly stochastic transient dynamics of neocortical circuits observed
in vivo have been hypothesized to represent a signature of ongoing stochastic
inference. In vitro neurons, on the other hand, exhibit a highly deterministic
response to various types of stimulation. We show that an ensemble of
deterministic leaky integrate-and-fire neurons embedded in a spiking noisy
environment can attain the correct firing statistics in order to sample from a
well-defined target distribution. We provide an analytical derivation of the
activation function on the single-cell level; for recurrent networks, we
examine convergence towards stationarity in computer simulations and
demonstrate sample-based Bayesian inference in a mixed graphical model. This
establishes a rigorous link between deterministic neuron models and functional
stochastic dynamics on the network level.

In this study, we present a highly configurable neuromorphic computing
substrate and use it for emulating several types of neural networks. At the
heart of this system lies a mixed-signal chip, with analog implementations of
neurons and synapses and digital transmission of action potentials. Major
advantages of this emulation device, which has been explicitly designed as a
universal neural network emulator, are its inherent parallelism and high
acceleration factor compared to conventional computers. Its configurability
allows the realization of almost arbitrary network topologies and the use of
widely varied neuronal and synaptic parameters. Fixed-pattern noise inherent to
analog circuitry is reduced by calibration routines. An integrated development
environment allows neuroscientists to operate the device without any prior
knowledge of neuromorphic circuit design. As a showcase for the capabilities of
the system, we describe the successful emulation of six different neural
networks which cover a broad spectrum of both structure and functionality.

In this paper we present a methodological framework that meets novel
requirements emerging from upcoming types of accelerated and highly
configurable neuromorphic hardware systems. We describe in detail a device with
45 million programmable and dynamic synapses that is currently under
development, and we sketch the conceptual challenges that arise from taking
this platform into operation. More specifically, we aim at the establishment of
this neuromorphic system as a flexible and neuroscientifically valuable
modeling tool that can be used by non-hardware-experts. We consider various
functional aspects to be crucial for this purpose, and we introduce a
consistent workflow with detailed descriptions of all involved modules that
implement the suggested steps: The integration of the hardware interface into
the simulator-independent model description language PyNN; a fully automated
translation between the PyNN domain and appropriate hardware configurations; an
executable specification of the future neuromorphic system that can be
seamlessly integrated into this biology-to-hardware mapping process as a test
bench for all software layers and possible hardware design modifications; an
evaluation scheme that deploys models from a dedicated benchmark library,
compares the results generated by virtual or prototype hardware devices with
reference software simulations and analyzes the differences. The integration of
these components into one hardware-software workflow provides an ecosystem for
ongoing preparative studies that support the hardware design process and
represents the basis for the maturity of the model-to-hardware mapping
software. The functionality and flexibility of the latter are demonstrated with a
variety of experimental results.