
Synaptic connections between neurons in the brain are dynamic because of
continuously ongoing spine dynamics, axonal sprouting, and other processes. In
fact, it was recently shown that the spontaneous synapseautonomous component
of spine dynamics is at least as large as the component that depends on the
history of pre and postsynaptic neural activity. These data are inconsistent
with common models for network plasticity, and raise the questions how neural
circuits can maintain a stable computational function in spite of these
continuously ongoing processes, and what functional uses these ongoing
processes might have. Here, we present a rigorous theoretical framework for
these seemingly stochastic spine dynamics and rewiring processes in the context
of rewardbased learning tasks. We show that spontaneous synapseautonomous
processes, in combination with reward signals such as dopamine, can explain the
capability of networks of neurons in the brain to configure themselves for
specific computational tasks, and to compensate automatically for later changes
in the network or task. Furthermore we show theoretically and through computer
simulations that stable computational performance is compatible with
continuously ongoing synapseautonomous changes. After reaching good
computational performance it causes primarily a slow drift of network
architecture and dynamics in taskirrelevant dimensions, as observed for neural
activity in motor cortex and other areas. On the more abstract level of
reinforcement learning the resulting model gives rise to an understanding of
rewarddriven network plasticity as continuous sampling of network
configurations.

Previous theoretical studies on the interaction of excitatory and inhibitory
neurons proposed to model this cortical microcircuit motif as a socalled
WinnerTakeAll (WTA) circuit. A recent modeling study however found that the
WTA model is not adequate for databased softer forms of divisive inhibition as
found in a microcircuit motif in cortical layer 2/3. We investigate here
through theoretical analysis the role of such softer divisive inhibition for
the emergence of computational operations and neural codes under spiketiming
dependent plasticity (STDP). We show that in contrast to WTA models  where the
network activity has been interpreted as probabilistic inference in a
generative mixture distribution  this network dynamics approximates inference
in a noisyORlike generative model that explains the network input based on
multiple hidden causes. Furthermore, we show that STDP optimizes the parameters
of this model by approximating online the expectation maximization (EM)
algorithm. This theoretical analysis corroborates a preceding modelling study
which suggested that the learning dynamics of this layer 2/3 microcircuit motif
extracts a specific modular representation of the input and thus performs blind
source separation on the input statistics.

Cortical microcircuits are very complex networks, but they are composed of a
relatively small number of stereotypical motifs. Hence one strategy for
throwing light on the computational function of cortical microcircuits is to
analyze emergent computational properties of these stereotypical microcircuit
motifs. We are addressing here the question how spiketiming dependent
plasticity (STDP) shapes the computational properties of one motif that has
frequently been studied experimentally: interconnected populations of pyramidal
cells and parvalbuminpositive inhibitory cells in layer 2/3. Experimental
studies suggest that these inhibitory neurons exert some form of divisive
inhibition on the pyramidal cells. We show that this databased form of
feedback inhibition, which is softer than that of winnertakeall models that
are commonly considered in theoretical analyses, contributes to the emergence
of an important computational function through STDP: The capability to
disentangle superimposed firing patterns in upstream networks, and to represent
their information content through a sparse assembly code.

General results from statistical learning theory suggest to understand not
only brain computations, but also brain plasticity as probabilistic inference.
But a model for that has been missing. We propose that inherently stochastic
features of synaptic plasticity and spine motility enable cortical networks of
neurons to carry out probabilistic inference by sampling from a posterior
distribution of network configurations. This model provides a viable
alternative to existing models that propose convergence of parameters to
maximum likelihood values. It explains how priors on weight distributions and
connection probabilities can be merged optimally with learned experience, how
cortical networks can generalize learned information so well to novel
experiences, and how they can compensate continuously for unforeseen
disturbances of the network. The resulting new theory of network plasticity
explains from a functional perspective a number of experimental data on
stochastic aspects of synaptic plasticity that previously appeared to be quite
puzzling.

Network of neurons in the brain apply  unlike processors in our current
generation of computer hardware  an eventbased processing strategy, where
short pulses (spikes) are emitted sparsely by neurons to signal the occurrence
of an event at a particular point in time. Such spikebased computations
promise to be substantially more powerefficient than traditional clocked
processing schemes. However it turned out to be surprisingly difficult to
design networks of spiking neurons that are able to carry out demanding
computations. We present here a new theoretical framework for organizing
computations of networks of spiking neurons. In particular, we show that a
suitable design enables them to solve hard constraint satisfaction problems
from the domains of planning  optimization and verification  logical
inference. The underlying design principles employ noise as a computational
resource. Nevertheless the timing of spikes (rather than just spike rates)
plays an essential role in the resulting computations. Furthermore, one can
demonstrate for the Traveling Salesman Problem a surprising computational
advantage of networks of spiking neurons compared with traditional artificial
neural networks and Gibbs sampling. The identification of such advantage has
been a wellknown open problem.