
Cosmological parameter estimation is entering a new era. Large collaborations
need to coordinate high-stakes analyses using multiple methods; furthermore
such analyses have grown in complexity due to sophisticated models of cosmology
and systematic uncertainties. In this paper we argue that modularity is the key
to addressing these challenges: calculations should be broken up into
interchangeable modular units with inputs and outputs clearly defined. We
present a new framework for cosmological parameter estimation, CosmoSIS,
designed to connect together, share, and advance development of inference tools
across the community. We describe the modules already available in CosmoSIS,
including CAMB, Planck, cosmic shear calculations, and a suite of samplers. We
illustrate it using demonstration code that you can run out-of-the-box with the
installer available at http://bitbucket.org/joezuntz/cosmosis
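
The modular design argued for above can be illustrated with a minimal sketch in plain Python. It does not use the CosmoSIS API; the names `run_pipeline`, `distance_module` and `likelihood_module`, and all numbers, are hypothetical. Each module reads named inputs from a shared dictionary and writes named outputs back, so any module can be swapped for another with the same inputs and outputs.

```python
def distance_module(block):
    """Toy 'theory' module: low-redshift Hubble-law distance in Mpc."""
    c_km_s = 299792.458  # speed of light in km/s
    block["distance_mpc"] = c_km_s * block["z"] / block["h0"]

def likelihood_module(block):
    """Toy Gaussian likelihood comparing the predicted distance to one datum."""
    resid = (block["distance_mpc"] - block["obs_distance_mpc"]) / block["obs_sigma_mpc"]
    block["log_like"] = -0.5 * resid ** 2

def run_pipeline(modules, params):
    """Run the modules in order on a copy of the input parameters."""
    block = dict(params)
    for module in modules:
        module(block)
    return block

# Illustrative inputs only; none of these values come from the paper.
params = {"h0": 70.0, "z": 0.01, "obs_distance_mpc": 43.0, "obs_sigma_mpc": 2.0}
result = run_pipeline([distance_module, likelihood_module], params)
print(result["distance_mpc"], result["log_like"])
```

A sampler would call `run_pipeline` repeatedly with different `params`; replacing `distance_module` by a more accurate calculation leaves the likelihood module untouched.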

We present the first stable release of Halotools (v0.2), a community-driven
Python package designed to build and test models of the galaxy-halo connection.
Halotools provides a modular platform for creating mock universes of galaxies
starting from a catalog of dark matter halos obtained from a cosmological
simulation. The package supports many of the common forms used to describe
galaxy-halo models: the halo occupation distribution (HOD), the conditional
luminosity function (CLF), abundance matching, and alternatives to these models
that include effects such as environmental quenching or variable galaxy
assembly bias. Satellite galaxies can be modeled to live in subhalos, or to
follow custom number density profiles within their halos, including spatial
and/or velocity bias with respect to the dark matter profile. The package has
an optimized toolkit to make mock observations on a synthetic galaxy
population, including galaxy clustering, galaxy-galaxy lensing, galaxy group
identification, RSD multipoles, void statistics, pairwise velocities and
others, allowing direct comparison to observations. Halotools is
object-oriented, enabling complex models to be built from a set of simple,
interchangeable components, including those of your own creation. Halotools has
an automated testing suite and is exhaustively documented on
http://halotools.readthedocs.io, which includes quickstart guides, source code
notes and a large collection of tutorials. The documentation is effectively an
online textbook on how to build and study empirical models of galaxy formation
with Python.
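
To make the HOD idea concrete, the following is a minimal sketch of one commonly used functional form: an error-function step for central galaxies and a power law for satellites. It is written in plain Python and does not use the Halotools API; the function names and all parameter values are illustrative assumptions.

```python
import math

def mean_ncen(log_mass, log_mmin=12.0, sigma_logm=0.3):
    """Mean number of central galaxies in a halo of mass 10**log_mass."""
    return 0.5 * (1.0 + math.erf((log_mass - log_mmin) / sigma_logm))

def mean_nsat(log_mass, log_m0=12.2, log_m1=13.3, alpha=1.0):
    """Mean number of satellites: a power law above a cutoff mass M0."""
    mass, m0, m1 = 10.0 ** log_mass, 10.0 ** log_m0, 10.0 ** log_m1
    if mass <= m0:
        return 0.0
    return ((mass - m0) / m1) ** alpha

# Occupation statistics for a few halo masses (log10 of mass in Msun/h).
for log_mass in (11.5, 12.0, 13.0, 14.0):
    print(log_mass, mean_ncen(log_mass), mean_nsat(log_mass))
```

A mock catalog would then draw the number of centrals from a Bernoulli distribution and the number of satellites from a Poisson distribution with these means, halo by halo.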

Given the complexity of modern cosmological parameter inference where we are
faced with non-Gaussian data and noise, correlated systematics and multi-probe
correlated data sets, the Approximate Bayesian Computation (ABC) method is a
promising alternative to traditional Markov Chain Monte Carlo approaches in the
case where the Likelihood is intractable or unknown. The ABC method is called
"Likelihood free" as it avoids explicit evaluation of the Likelihood by using a
forward model simulation of the data which can include systematics. We
introduce astroABC, an open source ABC Sequential Monte Carlo (SMC) sampler for
parameter estimation. A key challenge in astrophysics is the efficient use of
large multi-probe datasets to constrain high-dimensional, possibly correlated
parameter spaces. With this in mind astroABC allows for massive parallelization
using MPI, a framework that handles spawning of jobs across multiple nodes. A
key new feature of astroABC is the ability to create MPI groups with different
communicators, one for the sampler and several others for the forward model
simulation, which speeds up sampling time considerably. For smaller jobs the
Python multiprocessing option is also available. Other key features include: a
Sequential Monte Carlo sampler, a method for iteratively adapting tolerance
levels, local covariance estimation using scikit-learn's KDTree, modules for
specifying an optimal covariance matrix for a component-wise or multivariate
normal perturbation kernel, output and restart files that are backed up every
iteration, user-defined metric and simulation methods, a module for specifying
heterogeneous parameter priors including non-standard prior PDFs, a module for
specifying a constant, linear, log or exponential tolerance level, and
well-documented examples and sample scripts. This code is hosted online at
https://github.com/EliseJ/astroABC
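
The likelihood-free idea can be sketched with the simplest ABC variant, rejection sampling, in plain Python. This is not the astroABC API and not the SMC algorithm described above; the forward model, the metric, the prior range and the tolerance are all illustrative assumptions.

```python
import random
import statistics

def simulate(mu, n, rng):
    """Forward model: n draws from a unit-variance Gaussian with mean mu."""
    return [rng.gauss(mu, 1.0) for _ in range(n)]

def distance(sim, obs):
    """Metric: absolute difference of the sample means."""
    return abs(statistics.fmean(sim) - statistics.fmean(obs))

def abc_rejection(obs, n_accept, tolerance, rng):
    """Keep prior draws whose simulated data lie within the tolerance."""
    accepted = []
    while len(accepted) < n_accept:
        mu = rng.uniform(-5.0, 5.0)  # flat prior on the mean (assumed)
        if distance(simulate(mu, len(obs), rng), obs) < tolerance:
            accepted.append(mu)
    return accepted

rng = random.Random(42)
observed = simulate(1.5, 100, rng)  # mock "data" with true mean 1.5
posterior = abc_rejection(observed, n_accept=200, tolerance=0.2, rng=rng)
print(statistics.fmean(posterior), statistics.stdev(posterior))
```

No likelihood is evaluated anywhere: the accepted values of `mu` approximate the posterior. An SMC sampler improves on this by shrinking the tolerance over successive iterations and proposing new particles from the previously accepted ones.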

DESI (Dark Energy Spectroscopic Instrument) is a Stage IV ground-based dark
energy experiment that will study baryon acoustic oscillations (BAO) and the
growth of structure through redshift-space distortions with a wide-area galaxy
and quasar redshift survey. To trace the underlying dark matter distribution,
spectroscopic targets will be selected in four classes from imaging data. We
will measure luminous red galaxies up to $z=1.0$. To probe the Universe out to
even higher redshift, DESI will target bright [O II] emission line galaxies up
to $z=1.7$. Quasars will be targeted both as direct tracers of the underlying
dark matter distribution and, at higher redshifts ($ 2.1 < z < 3.5$), for the
Ly$\alpha$ forest absorption features in their spectra, which will be used to
trace the distribution of neutral hydrogen. When moonlight prevents efficient
observations of the faint targets of the baseline survey, DESI will conduct a
magnitude-limited Bright Galaxy Survey comprising approximately 10 million
galaxies with a median $z\approx 0.2$. In total, more than 30 million galaxy
and quasar redshifts will be obtained to measure the BAO feature and determine
the matter power spectrum, including redshift space distortions.

DESI (Dark Energy Spectroscopic Instrument) is a Stage IV ground-based dark
energy experiment that will study baryon acoustic oscillations and the growth
of structure through redshift-space distortions with a wide-area galaxy and
quasar redshift survey. The DESI instrument is a robotically-actuated,
fiber-fed spectrograph capable of taking up to 5,000 simultaneous spectra over
a wavelength range from 360 nm to 980 nm. The fibers feed ten three-arm
spectrographs with resolution $R= \lambda/\Delta\lambda$ between 2000 and 5500,
depending on wavelength. The DESI instrument will be used to conduct a
five-year survey designed to cover 14,000 deg$^2$. This powerful instrument
will be installed at prime focus on the 4-m Mayall telescope at Kitt Peak,
Arizona, along with a new optical corrector, which will provide a three-degree
diameter field of view. The DESI collaboration will also deliver a
spectroscopic pipeline and data management system to reduce and archive all
data for eventual public use.

Cosmological parameter estimation techniques that robustly account for
systematic measurement uncertainties will be crucial for the next generation of
cosmological surveys. We present a new analysis method, superABC, for obtaining
cosmological constraints from Type Ia supernova (SN Ia) light curves using
Approximate Bayesian Computation (ABC) without any likelihood assumptions. The
ABC method works by using a forward model simulation of the data where
systematic uncertainties can be simulated and marginalized over. A key feature
of the method presented here is the use of two distinct metrics, the `Tripp'
and `Light Curve' metrics, which allow us to compare the simulated data to the
observed data set. The Tripp metric takes as input the parameters of models fit
to each light curve with the SALT-II method, whereas the Light Curve metric
uses the measured fluxes directly without model fitting. We apply the superABC
sampler to a simulated data set of $\sim$1000 SNe corresponding to the first
season of the Dark Energy Survey Supernova Program. Varying $\Omega_m, w_0,
\alpha$ and $\beta$ and a magnitude offset parameter, with no systematics we
obtain $\Delta(w_0) = w_0^{\rm true} - w_0^{\rm best \, fit} = 0.036\pm0.109$
(a $\sim11$% 1$\sigma$ uncertainty) using the Tripp metric and $\Delta(w_0) =
0.055\pm0.068$ (a $\sim7$% 1$\sigma$ uncertainty) using the Light Curve
metric. Including 1% calibration uncertainties in four passbands, adding 4 more
parameters, we obtain $\Delta(w_0) = 0.062\pm0.132$ (a $\sim14$% 1$\sigma$
uncertainty) using the Tripp metric. Overall we find a $17$% increase in the
uncertainty on $w_0$ with systematics compared to without. We contrast this
with an MCMC approach where systematic effects are approximately included. We
find that the MCMC method slightly underestimates the impact of calibration
uncertainties for this simulated data set.
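
For orientation, the light-curve fit parameters that enter a Tripp-style comparison are usually combined through the standard Tripp relation, $\mu = m_B - M + \alpha x_1 - \beta c$. The sketch below evaluates it in Python; whether the Tripp metric of superABC uses exactly this combination is an assumption here, and the function name and all input values are illustrative.

```python
def tripp_distance_modulus(m_b, x1, c, alpha, beta, abs_mag):
    """Standardized distance modulus from light-curve fit parameters.

    m_b: peak apparent B-band magnitude, x1: stretch, c: color,
    alpha, beta: standardization coefficients, abs_mag: absolute magnitude.
    """
    return m_b - abs_mag + alpha * x1 - beta * c

# Illustrative values only, not taken from the simulated data set.
mu = tripp_distance_modulus(m_b=23.0, x1=0.5, c=0.05,
                            alpha=0.14, beta=3.1, abs_mag=-19.3)
print(mu)
```

A distance metric between simulated and observed samples can then be built from the differences of such distance moduli as a function of redshift.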

The Effective Field Theory of Large-Scale Structure (EFTofLSS) provides a
novel formalism that is able to accurately predict the clustering of
large-scale structure (LSS) in the mildly nonlinear regime. Here we provide
the first computation of the power spectrum of biased tracers in redshift space
at one-loop order, and we make the associated code publicly available. We
compare the multipoles $\ell=0,2$ of the redshift-space halo power spectrum,
together with the real-space matter and halo power spectra, with data from
numerical simulations at $z=0.67$. For the samples we compare to, which have a
number density of $\bar n=3.8 \cdot 10^{-2}(h \ {\rm Mpc}^{-1})^3$ and $\bar
n=3.9 \cdot 10^{-4}(h \ {\rm Mpc}^{-1})^3$, we find that the calculation at
one-loop order matches numerical measurements to within a few percent up to
$k\simeq 0.43 \ h \ {\rm Mpc}^{-1}$, a significant improvement with respect to
former techniques. By performing the so-called IR-resummation, we find that the
Baryon Acoustic Oscillation peak is accurately reproduced. Based on the results
presented here, long-wavelength statistics that are routinely observed in LSS
surveys can finally be computed in the EFTofLSS. This formalism is thus ready
to be compared directly to observational data.

We present the nonlinear 2D galaxy power spectrum, $P(k,\mu)$, in redshift
space, measured from the Dark Sky simulations, using galaxy catalogs
constructed with both halo occupation distribution and subhalo abundance
matching methods, chosen to represent an intermediate redshift sample of
luminous red galaxies. We find that the information content in individual $\mu$
(cosine of the angle to the line of sight) bins is substantially richer than
multipole moments, and show that this can be used to isolate the impact of
nonlinear growth and redshift space distortion (RSD) effects. Using the
$\mu<0.2$ simulation data, which we show is not impacted by RSD effects, we can
successfully measure the nonlinear bias to an accuracy of $\sim 5$% at $k<0.6
h$Mpc$^{-1}$. This use of individual $\mu $ bins to extract the nonlinear bias
successfully removes a large parameter degeneracy when constraining the linear
growth rate of structure. We carry out a joint parameter estimation, using the
low $\mu$ simulation data to constrain the nonlinear bias, and $\mu\ge0.2$ to
constrain the growth rate and show that $f$ can be constrained to $\sim 26\,
(22)$% to a $k_{\rm max}< 0.4\, (0.6) h$Mpc$^{-1}$ from clustering alone using
a simple dispersion model, for a range of galaxy models. Our analysis of
individual $\mu $ bins also reveals interesting physical effects which arise
simply from different methods of populating halos with galaxies. We find a
prominent turnaround scale, at which RSD damping effects are greater than the
nonlinear growth, which differs not only for each $\mu$ bin but also for each
galaxy model. These features may provide unique signatures which could be used
to shed light on the galaxy-dark matter connection.
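
A dispersion model of the kind mentioned above is often written as a Kaiser-like factor times a damping term. The sketch below assumes a linear bias and a Gaussian damping function; the exact form used in the analysis is not given here, and the function name, `sigma_v` and all numbers are illustrative.

```python
import math

def dispersion_model(p_matter, k, mu, bias, growth_rate, sigma_v):
    """Redshift-space power P(k, mu) = (b + f mu^2)^2 exp(-(k mu sigma_v)^2) P_m(k)."""
    kaiser = (bias + growth_rate * mu ** 2) ** 2
    damping = math.exp(-(k * mu * sigma_v) ** 2)
    return kaiser * damping * p_matter

p_m = 1.0e4  # illustrative matter power in (Mpc/h)^3 at the chosen k
for mu in (0.0, 0.5, 1.0):
    print(mu, dispersion_model(p_m, k=0.1, mu=mu, bias=2.0,
                               growth_rate=0.7, sigma_v=4.0))
```

At $\mu = 0$ both the growth-rate term and the damping drop out and the model reduces to $b^2 P_m(k)$, which is why low-$\mu$ bins can constrain the bias separately from the growth rate.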

We determine the concentration-mass relation of 19 X-ray selected galaxy
clusters from the CLASH survey in theories of gravity that directly modify the
lensing potential. We model the clusters as NFW haloes and fit their lensing
signal, in the Cubic Galileon and Nonlocal gravity models, to the lensing
convergence profiles of the clusters. We discuss a number of important issues
that need to be taken into account, associated with the use of non-parametric
and parametric lensing methods, as well as assumptions about the background
cosmology. Our results show that the concentration and mass estimates in the
modified gravity models are, within the error bars, the same as in $\Lambda$CDM.
This result demonstrates that, for the Nonlocal model, the modifications to
gravity are too weak at the cluster redshifts, and for the Galileon model, the
screening mechanism is very efficient inside the cluster radius. However, at
distances $\sim \left[2-20\right] {\rm Mpc}/h$ from the cluster center, we find
that the surrounding force profiles are enhanced by $\sim20-40\%$ in the Cubic
Galileon model. This has an impact on dynamical mass estimates, which means
that tests of gravity based on comparisons between lensing and dynamical masses
can also be applied to the Cubic Galileon model.
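
The NFW halo model referred to above has a simple closed form, sketched here in Python for the standard (unmodified gravity) profile. The function names and the choice of $r_{200}$ as the halo boundary are assumptions for illustration; the lensing fit itself is not shown.

```python
import math

def nfw_density(r, rho_s, r_s):
    """NFW density rho(r) = rho_s / [(r/r_s) (1 + r/r_s)^2]."""
    x = r / r_s
    return rho_s / (x * (1.0 + x) ** 2)

def nfw_enclosed_mass(r, rho_s, r_s):
    """Mass within radius r: 4 pi rho_s r_s^3 [ln(1 + x) - x / (1 + x)]."""
    x = r / r_s
    return 4.0 * math.pi * rho_s * r_s ** 3 * (math.log(1.0 + x) - x / (1.0 + x))

def concentration(r_200, r_s):
    """Concentration c = r_200 / r_s."""
    return r_200 / r_s

# Illustrative halo in arbitrary units: scale radius 0.5, boundary at 2.0.
print(concentration(2.0, 0.5), nfw_enclosed_mass(2.0, 1.0, 0.5))
```

A concentration-mass relation is obtained by fitting `rho_s` and `r_s` (equivalently mass and concentration) to each cluster and plotting one against the other.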

The linear growth rate is commonly defined through a simple deterministic
relation between the velocity divergence and the matter overdensity in the
linear regime. We introduce a formalism that extends this to a nonlinear,
stochastic relation between $\theta = \nabla \cdot v({\bf x},t)/aH$ and
$\delta$. This provides a new phenomenological approach that examines the
conditional mean $\langle \theta|\delta \rangle$, together with the fluctuations of $\theta$
around this mean. We measure these stochastic components using N-body
simulations and find they are non-negative and increase with decreasing scale
from $\sim$10% at $k<0.2 h $Mpc$^{-1}$ to 25% at $k\sim0.45h$Mpc$^{-1}$ at $z =
0$. Both the stochastic relation and nonlinearity are more pronounced for
halos, $M \le 5 \times 10^{12}M_\odot h^{-1}$, compared to the dark matter at
$z=0$ and $1$. Nonlinear growth effects manifest themselves as a rotation of
the mean $\langle \theta|\delta \rangle$ away from the linear theory prediction $f_{\tiny
\rm LT}\delta$, where $f_{\tiny \rm LT}$ is the linear growth rate. This
rotation increases with wavenumber, $k$, and we show that it can be
well described by second-order Lagrangian perturbation theory (2LPT) for $k <
0.1 h$Mpc$^{-1}$. The stochasticity in the $\theta$-$\delta$ relation is not
so simply described by 2LPT, and we discuss its impact on measurements of
$f_{\tiny \rm LT}$ from two-point statistics in redshift space. Given that the
relationship between $\delta$ and $\theta$ is stochastic and nonlinear, this
will have implications for the interpretation and precision of $f_{\tiny \rm
LT}$ extracted using models which assume a linear, deterministic expression.
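
The conditional mean and the scatter around it can be estimated from paired samples by binning in $\delta$. The sketch below uses synthetic data with a linear mean relation plus Gaussian noise; it is not the estimator used in the paper, and the names, the value of `f_true` and the noise level are illustrative assumptions.

```python
import random
import statistics

def conditional_stats(delta, theta, edges):
    """Mean and standard deviation of theta in bins of delta."""
    stats = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        in_bin = [t for d, t in zip(delta, theta) if lo <= d < hi]
        if len(in_bin) > 1:
            stats.append((0.5 * (lo + hi),
                          statistics.fmean(in_bin),
                          statistics.stdev(in_bin)))
    return stats

rng = random.Random(1)
f_true = 0.5  # slope of the toy deterministic relation
delta = [rng.gauss(0.0, 1.0) for _ in range(20000)]
theta = [f_true * d + rng.gauss(0.0, 0.1) for d in delta]  # add stochastic scatter
edges = [-2.0, -1.0, 0.0, 1.0, 2.0]
binned = conditional_stats(delta, theta, edges)
for centre, mean, scatter in binned:
    print(centre, mean, scatter)
```

With bins this wide the reported scatter mixes the true stochasticity with the variation of the mean across the bin, so narrower bins are needed to separate the two.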

The nonlinear, scale-dependent bias in the mass distribution of galaxies and
the underlying dark matter is a key systematic affecting the extraction of
cosmological parameters from galaxy clustering. Using 95 million halos from the
Millennium-XXL N-body simulation, we find that the mass bias is scale
independent only for $k<0.1 h{\rm Mpc}^{-1}$ today ($z=0$) and for $k<0.2 h{\rm
Mpc}^{-1}$ at $z=0.7$. We test analytic halo bias models against our simulation
measurements and find that the model of Tinker et al. 2005 is accurate to
better than 5% at $z=0$. However, the simulation results are better fit by an
ellipsoidal collapse model at $z=0.7$. We highlight, for the first time,
another potentially serious systematic due to a sampling bias in the halo
velocity divergence power spectra which will affect the comparison between
observations and any redshift space distortion model which assumes dark matter
velocity statistics with no velocity bias. By measuring the velocity divergence
power spectra for different sized halo samples, we find that there is a
significant bias which increases with decreasing number density. This bias is
approximately 20% at $k=0.1h$Mpc$^{-1}$ for a halo sample of number density
$\bar{n} = 10^{-3} (h/$Mpc$)^3$ at both $z=0$ and $z=0.7$ for the velocity
divergence auto power spectrum. Given the importance of redshift space
distortions as a probe of dark energy and the ongoing major effort to advance
models for the clustering signal in redshift space, our results show this
velocity bias introduces another systematic, alongside scale-dependent halo
mass bias, which cannot be neglected.
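
A common way to quantify the scale dependence of the mass bias is $b(k) = \sqrt{P_{hh}(k)/P_{mm}(k)}$. The sketch below assumes this definition and finds the largest $k$ at which the bias stays within a tolerance of its large-scale value. The function names are hypothetical and the input spectra are made-up numbers for illustration, not measurements from the simulation.

```python
import math

def halo_bias(p_halo, p_matter):
    """Bias b(k) = sqrt(P_hh / P_mm) at each wavenumber."""
    return [math.sqrt(ph / pm) for ph, pm in zip(p_halo, p_matter)]

def max_k_scale_independent(k, bias, tol=0.05):
    """Largest k up to which b(k) stays within tol of its lowest-k value."""
    b0 = bias[0]
    k_max = None
    for ki, bi in zip(k, bias):
        if abs(bi / b0 - 1.0) > tol:
            break
        k_max = ki
    return k_max

# Made-up spectra in (Mpc/h)^3, for illustration only.
k = [0.02, 0.05, 0.1, 0.2, 0.4]
p_matter = [2.0e4, 1.5e4, 6.0e3, 1.8e3, 4.0e2]
p_halo = [8.0e4, 6.0e4, 2.46e4, 8.5e3, 2.5e3]

bias = halo_bias(p_halo, p_matter)
k_limit = max_k_scale_independent(k, bias)
print(bias, k_limit)
```

The same ratio applied to halo and dark matter velocity divergence spectra would expose a sampling bias of the kind described above.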

We present measurements of the number density of voids in the dark matter
distribution from a series of N-body simulations of a $\Lambda$CDM cosmology. We
define voids as spherical regions of $\rho_v = 0.2\rho_m$ around density minima
in order to relate our results to the predicted abundances using the excursion
set formalism. Using a linear underdensity of $\delta_v = -2.7$, from a spherical
evolution model, we find that a volume conserving model, which does not
conserve number density in the mapping from the linear to nonlinear regime,
matches the measured abundance to within 16% for a range of void radii 1<
r(Mpc/h)<15. This model fixes the volume fraction of the universe which is in
voids and assumes that voids of a similar size merge as they expand by a factor
of 1.7 to achieve a nonlinear density of $\rho_v = 0.2\rho_m$ today. We find that
the model of Sheth & van de Weygaert (2004) for the number density of voids
greatly overpredicts the abundances over the same range of scales. We find that
the volume conserving model works well at matching the number density of voids
measured from the simulations at higher redshifts, z=0.5 and 1, as well as
correctly predicting the abundances to within 25% in a simulation of a matter
dominated $\Omega_m = 1$ universe. We examine the abundance of voids in the halo
distribution and find fewer small, r<10 Mpc/h, voids and many more large, r>10
Mpc/h, voids compared to the dark matter. These results indicate that voids
identified in the halo or galaxy distribution are related to the underlying
void distribution in the dark matter in a complicated way which merits further
study if voids are to be used as a precision probe of cosmology.
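
The expansion factor of 1.7 quoted above follows from mass conservation: a region that starts at the mean density and ends at $0.2\rho_m$ must grow in comoving radius by $0.2^{-1/3}$. A short Python check, with a hypothetical function name:

```python
def expansion_factor(density_ratio):
    """Comoving radius growth of a region whose mean density falls to
    density_ratio times the mean matter density, at fixed enclosed mass."""
    return density_ratio ** (-1.0 / 3.0)

factor = expansion_factor(0.2)
print(round(factor, 2))  # 1.71
```

The enclosed volume therefore grows by a factor of $1/0.2 = 5$, which is why a model that fixes the total volume fraction in voids cannot also conserve their number.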

We use N-body simulations to study the statistics of massive halos and
redshift space distortions for theories with a standard $\Lambda$CDM expansion
history and a galileon-type scalar field. The extra scalar field increases the
gravitational force, leading to enhanced structure formation. We compare our
measurements of the real space matter power spectrum and halo properties with
fitting formulae for estimating these quantities analytically. We find that a
model for the power spectrum, halo mass function and halo bias, derived from
$\Lambda$CDM simulations, can fit the results from our simulations of modified
gravity when $\sigma_8$ is appropriately adjusted. We also study the redshift
space distortions in the two-point correlation function measured from these
simulations, finding a difference in the ratio of the redshift space to real
space clustering amplitude relative to standard gravity on all scales. We find
enhanced clustering on scales r>10 Mpc/h and increased damping of the
correlation function for scales r<9 Mpc/h. The boost in the clustering on large
scales due to the enhanced gravitational forces cannot be mimicked in a
standard gravity model by simply changing $\sigma_8$. This result illustrates the
usefulness of redshift space distortion measurements as a probe of
modifications to General Relativity.

We use large volume N-body simulations to predict the clustering of dark
matter in redshift space in f(R) modified gravity cosmologies. This is the
first time that the nonlinear matter and velocity fields have been resolved to
such a high level of accuracy over a broad range of scales in this class of
models. We find significant deviations from the clustering signal in standard
gravity, with an enhanced boost in power on large scales and stronger damping
on small scales in the f(R) models compared to GR at redshifts z<1. We measure
the velocity divergence ($P_{\theta\theta}$) and matter ($P_{\delta\delta}$) power
spectra and find a large deviation in the ratios $\sqrt{P_{\theta\theta}/P_{\delta\delta}}$
and $P_{\delta\theta}/P_{\delta\delta}$, between the f(R) models and GR for
0.03<k/(h/Mpc)<0.5. In linear theory these ratios equal the growth rate of
structure on large scales. Our results show that the simulated ratios agree
with the growth rate for each cosmology (which is scale dependent in the case
of modified gravity) only for extremely large scales, k<0.06h/Mpc at z=0. The
velocity power spectrum is substantially different in the f(R) models compared
to GR, suggesting that this observable is a sensitive probe of modified
gravity. We demonstrate how to extract the matter and velocity power spectra
from the 2D redshift space power spectrum, $P(k,\mu)$, and can recover the
nonlinear matter power spectrum to within a few percent for k<0.1h/Mpc.
However, the model fails to describe the shape of the 2D power spectrum,
demonstrating that an improved model is necessary in order to reconstruct the
velocity power spectrum accurately. The same model can match the monopole
moment to within 3% for GR and 10% for the f(R) cosmology at k<0.2 h/Mpc at
z=1. Our results suggest that the extraction of the velocity power spectrum
from future galaxy surveys is a promising method to constrain deviations from
GR.
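
The two ratios discussed above can be turned into growth-rate estimators directly. The sketch below computes both and, for comparison, the widely used GR approximation $f \approx \Omega_m(z)^{0.55}$ for a flat $\Lambda$CDM background. The function names, the sign convention (a velocity divergence defined so that the cross spectrum is positive) and the input values are assumptions for illustration.

```python
import math

def growth_rate_estimators(p_dd, p_dt, p_tt):
    """Two linear-theory estimators of f from the matter (dd), cross (dt)
    and velocity divergence (tt) power spectra at one wavenumber."""
    f_from_auto = math.sqrt(p_tt / p_dd)
    f_from_cross = p_dt / p_dd
    return f_from_auto, f_from_cross

def growth_rate_gr(omega_m0, z, gamma=0.55):
    """Approximate GR growth rate Omega_m(z)**gamma for flat LambdaCDM."""
    a3 = (1.0 + z) ** 3
    omega_m_z = omega_m0 * a3 / (omega_m0 * a3 + 1.0 - omega_m0)
    return omega_m_z ** gamma

f_lin = growth_rate_gr(0.3, 0.0)
p_dd = 1.0e4  # illustrative matter power in (Mpc/h)^3
# In linear theory P_dt = f P_dd and P_tt = f^2 P_dd, so both estimators return f.
estimates = growth_rate_estimators(p_dd, f_lin * p_dd, f_lin ** 2 * p_dd)
print(estimates, f_lin)
```

Applied to measured spectra, any scale at which the two estimators disagree with each other or with the linear growth rate marks the onset of nonlinear velocity effects.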

The distribution of angles subtended between pairs of galaxies and the line
of sight, which is uniform in real space, is distorted by their peculiar
motions, and has been proposed as a probe of cosmic expansion. We test this
idea using N-body simulations of structure formation in a cold dark matter
universe with a cosmological constant and in two variant cosmologies with
different dark energy models. We find that the distortion of the distribution
of angles is sensitive to the nature of dark energy. However, for the first
time, our simulations also reveal dependences of the normalization of the
distribution on both redshift and cosmology that have been neglected in
previous work. This introduces systematics that severely limit the usefulness
of the original method. Guided by our simulations, we devise a new, improved
test of the nature of dark energy. We demonstrate that this test does not
require prior knowledge of the background cosmology and that it can even
distinguish between models that have the same baryonic acoustic oscillations
and dark matter halo mass functions. Our technique could be applied to the
completed BOSS galaxy redshift survey to constrain the expansion history of the
Universe to better than 2%. The method will also produce different signals for
dark energy and modified gravity cosmologies even when they have identical
expansion histories, through the different peculiar velocities induced in these
cases.

Future galaxy surveys hope to distinguish between the dark energy and
modified gravity scenarios for the accelerating expansion of the Universe using
the distortion of clustering in redshift space. The aim is to model the form
and size of the distortion to infer the rate at which large scale structure
grows. We test this hypothesis and assess the performance of current
theoretical models for the redshift space distortion using large volume N-body
simulations of the gravitational instability process. We simulate competing
cosmological models which have identical expansion histories (one is a
quintessence dark energy model with a scalar field and the other is a modified
gravity model with a time-varying gravitational constant) and demonstrate that
they do indeed produce different redshift space distortions. This is the first
time this approach has been verified using a technique that can follow the
growth of structure at the required level of accuracy. Our comparisons show
that theoretical models for the redshift space distortion based on linear
perturbation theory give a surprisingly poor description of the simulation
results. Furthermore, the application of such models can give rise to
catastrophic systematic errors leading to incorrect interpretation of the
observations. We show that an improved model is able to extract the correct
growth rate. Further enhancements to theoretical models of redshift space
distortions, calibrated against simulations, are needed to fully exploit the
forthcoming high precision clustering measurements.

The anisotropy of clustering in redshift space provides a direct measure of
the growth rate of large scale structure in the Universe. Future galaxy
redshift surveys will make high precision measurements of these distortions,
and will potentially allow us to distinguish between different scenarios for
the accelerating expansion of the Universe. Accurate predictions are needed in
order to distinguish between competing cosmological models. We study the
distortions in the redshift space power spectrum in $\Lambda$CDM and
quintessence dark energy models, using large volume N-body simulations, and
test predictions for the form of the redshift space distortions. We find that
the linear perturbation theory prediction by Kaiser (1987) is a poor fit to the
measured distortions, even on surprisingly large scales $k \ge 0.05
h$Mpc$^{-1}$. An improved model for the redshift space power spectrum,
including the nonlinear velocity divergence power spectrum, is presented and
agrees with the power spectra measured from the simulations up to $k \sim 0.2
h$Mpc$^{-1}$. We have found a density-velocity relation which is cosmology
independent and which relates the nonlinear velocity divergence spectrum to
the nonlinear matter power spectrum. We provide a formula which generates the
nonlinear velocity divergence $P(k)$ at any redshift, using only the
nonlinear matter power spectrum and the linear growth factor at the desired
redshift. This formula is accurate to better than 5% on scales $k<0.2 h
$Mpc$^{-1}$ for all the cosmological models discussed in this paper. Our
results will extend the statistical power of future galaxy surveys.
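
For reference, the linear-theory (Kaiser) prediction tested above multiplies the real-space power spectrum by $(1 + \beta\mu^2)^2$, with $\beta = f/b$. Its monopole and quadrupole have the closed forms $1 + 2\beta/3 + \beta^2/5$ and $4\beta/3 + 4\beta^2/7$. The Python sketch below checks these against a direct numerical Legendre integration; the function names and the value of $\beta$ are illustrative.

```python
def kaiser_factor(mu, beta):
    """Linear-theory redshift-space boost (1 + beta mu^2)^2."""
    return (1.0 + beta * mu ** 2) ** 2

def legendre(ell, mu):
    """Legendre polynomials needed for the monopole and quadrupole."""
    if ell == 0:
        return 1.0
    if ell == 2:
        return 0.5 * (3.0 * mu ** 2 - 1.0)
    raise ValueError("only ell = 0 and 2 are implemented")

def multipole(ell, beta, n=20000):
    """(2 ell + 1) / 2 times the integral over mu in [-1, 1], midpoint rule."""
    dmu = 2.0 / n
    total = 0.0
    for i in range(n):
        mu = -1.0 + (i + 0.5) * dmu
        total += kaiser_factor(mu, beta) * legendre(ell, mu) * dmu
    return 0.5 * (2 * ell + 1) * total

beta = 0.5
monopole = multipole(0, beta)
quadrupole = multipole(2, beta)
print(monopole, 1.0 + 2.0 * beta / 3.0 + beta ** 2 / 5.0)
print(quadrupole, 4.0 * beta / 3.0 + 4.0 * beta ** 2 / 7.0)
```

The same `multipole` routine applies to any model of the $\mu$ dependence, so an improved model can be compared with the Kaiser form simply by swapping `kaiser_factor`.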

We study the nonlinear growth of cosmic structure in different dark energy
models, using large volume N-body simulations. We consider a range of
quintessence models which feature both rapidly and slowly varying dark energy
equations of state, and compare the growth of structure to that in a universe
with a cosmological constant. The adoption of a quintessence model changes the
expansion history of the universe, the form of the linear theory power spectrum
and can alter key observables, such as the horizon scale and the distance to
last scattering. We incorporate these effects into our simulations in stages to
isolate the impact of each on the growth of structure. The difference in
structure formation can be explained to first order by the difference in growth
factor at a given epoch; this scaling also accounts for the nonlinear growth at
the 15% level. We find that quintessence models that are different from
$\Lambda$CDM both today and at high redshifts $(z \sim 1000)$ and which feature
late $(z<2)$, rapid transitions in the equation of state, can have identical
baryonic acoustic oscillation (BAO) peak positions to those in $\Lambda$CDM. We
find that these models have higher abundances of dark matter haloes at $z>0$
compared to $\Lambda$CDM and so measurements of the mass function should allow
us to distinguish these quintessence models from a cosmological constant.
However, we find that a second class of quintessence models, whose equation of
state makes an early $(z>2)$ rapid transition to $w=-1$, cannot be
distinguished from $\Lambda$CDM using measurements of the mass function or the
BAO, even if these models have non-negligible amounts of dark energy at early
times.