
The DESI Legacy Imaging Surveys are a combination of three public projects
(the Dark Energy Camera Legacy Survey, the Beijing-Arizona Sky Survey, and the
Mayall z-band Legacy Survey) that will jointly image ~14,000 square degrees of
the extragalactic sky visible from the northern hemisphere in three optical
bands (g, r, and z) using telescopes at the Kitt Peak National Observatory and
the Cerro Tololo Inter-American Observatory. The combined survey footprint is
split into two contiguous areas by the Galactic plane. The optical imaging is
conducted using a unique strategy of dynamic observing that results in a survey
of nearly uniform depth. In addition to calibrated images, the project is
delivering an inference-based catalog which includes photometry from the grz
optical bands and from four mid-infrared bands (at 3.4um, 4.6um, 12um and 22um)
observed by the Wide-field Infrared Survey Explorer (WISE) satellite during its
full operational lifetime. The project plans two public data releases each
year. All the software used to generate the catalogs is also released with the
data. This paper provides an overview of the Legacy Surveys project.

We use the Fisher matrix formalism to study the expansion and growth history
of the Universe using galaxy clustering with 2D angular cross-correlation
tomography in spectroscopic or high resolution photometric redshift surveys.
The radial information is contained in the cross correlations between narrow
redshift bins. We show how multiple tracers with redshift space distortions
cancel sample variance and arbitrarily improve the constraints on the dark
energy equation of state $\omega(z)$ and the growth parameter $\gamma$ in the
noiseless limit. The improvement for multiple tracers quickly increases with
the bias difference between the tracers, up to a factor $\sim4$ in
$\text{FoM}_{\gamma\omega}$. We model a magnitude limited survey with realistic
density and bias using a conditional luminosity function, finding a factor
1.3-9.0 improvement in $\text{FoM}_{\gamma\omega}$, depending on global
density, with a split in a halo mass proxy. Partly overlapping redshift bins
improve the constraints in multiple tracer surveys by a factor $\sim1.3$ in
$\text{FoM}_{\gamma\omega}$. These findings also apply to photometric surveys,
where the effect of using multiple tracers is magnified. We also show a large
improvement in the FoM with increasing density, which could be used as a
trade-off to compensate for some possible loss in radial resolution.
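
To make the figure of merit concrete, the following minimal sketch computes a FoM for a parameter pair from a Fisher matrix, assuming the common definition FoM = 1/sqrt(det C), where C is the marginalised covariance of the chosen parameters. The 3x3 matrix and the function name `figure_of_merit` are illustrative assumptions, not values or code from the paper.

```python
import numpy as np

def figure_of_merit(fisher, idx):
    """FoM = 1/sqrt(det C_sub), where C_sub is the covariance of the
    parameters in `idx` after marginalising over all the others."""
    cov = np.linalg.inv(fisher)          # full parameter covariance
    sub = cov[np.ix_(idx, idx)]          # marginalised sub-block
    return 1.0 / np.sqrt(np.linalg.det(sub))

# Toy Fisher matrix for (gamma, omega, one nuisance parameter).
# The numbers are made up for illustration only.
fisher = np.array([[40.0, 10.0, 5.0],
                   [10.0, 90.0, 8.0],
                   [5.0, 8.0, 20.0]])

# Marginalising over the nuisance parameter.
fom_marg = figure_of_merit(fisher, [0, 1])
# Fixing the nuisance parameter: drop its row and column first.
fom_fixed = figure_of_merit(fisher[:2, :2], [0, 1])
print(fom_marg, fom_fixed)
```

Fixing a nuisance parameter can only raise the FoM relative to marginalising over it, which is the sense in which better-constrained bias improves the forecast.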

We compare reduced three-point correlations $Q$ of matter, haloes (as proxies
for galaxies) and their cross correlations, measured in a total simulated
volume of $\sim100 \ (h^{-1} \text{Gpc})^{3}$, to predictions from leading
order perturbation theory on a large range of scales in configuration space.
Predictions for haloes are based on the non-local bias model, employing linear
($b_1$) and non-linear ($c_2$, $g_2$) bias parameters, which have been
constrained previously from the bispectrum in Fourier space. We also study
predictions from two other bias models, one local ($g_2=0$) and one in which
$c_2$ and $g_2$ are determined by $b_1$ via an approximately universal
relation. Overall, measurements and predictions agree when $Q$ is derived for
triangles with $(r_1r_2r_3)^{1/3} \gtrsim 60 h^{-1}\text{Mpc}$, where $r_{1-3}$
are the sizes of the triangle legs. Predictions for $Q_{matter}$, based on the
linear power spectrum, show significant deviations from the measurements at the
BAO scale (given our small measurement errors), which strongly decrease when
adding a damping term or using the non-linear power spectrum, as expected.
Predictions for $Q_{halo}$ agree best with measurements at large scales when
considering non-local contributions. The universal bias model works well for
haloes and might therefore be also useful for tightening constraints on $b_1$
from $Q$ in galaxy surveys. Such constraints are independent of the amplitude
of matter density fluctuation ($\sigma_8$) and hence break the degeneracy
between $b_1$ and $\sigma_8$, present in galaxy two-point correlations.
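
The way $Q$ constrains $b_1$ can be sketched with the local bias model ($g_2=0$), assuming the usual leading-order relation $Q_{halo} = (Q_{matter} + c_2)/b_1$. Under that assumption a straight-line fit of $Q_{halo}$ against $Q_{matter}$ gives $1/b_1$ as the slope and $c_2/b_1$ as the intercept, with no reference to $\sigma_8$. The function name and the synthetic input values below are hypothetical.

```python
import numpy as np

def fit_local_bias(q_matter, q_halo):
    """Fit Q_halo = (Q_matter + c2) / b1 as a straight line.
    Returns (b1, c2) under the local bias assumption g2 = 0."""
    slope, intercept = np.polyfit(q_matter, q_halo, 1)
    b1 = 1.0 / slope
    c2 = intercept * b1
    return b1, c2

# Synthetic, noise-free example: Q_matter sampled over triangle shapes.
q_m = np.linspace(0.5, 1.5, 20)
b1_true, c2_true = 2.0, 0.5
q_h = (q_m + c2_true) / b1_true

b1_fit, c2_fit = fit_local_bias(q_m, q_h)
print(b1_fit, c2_fit)
```

With real measurements the fit would need the covariance between triangle configurations and, as the text notes, a non-local term multiplying $g_2$.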

We present a clustering comparison of 12 galaxy formation models (including
Semi-Analytic Models (SAMs) and Halo Occupation Distribution (HOD) models) all
run on halo catalogues and merger trees extracted from a single {\Lambda}CDM
N-body simulation. We compare the results of the measurements of the mean halo
occupation numbers, the radial distribution of galaxies in haloes and the
2-Point Correlation Functions (2PCF). We also study the implications of the
different treatments of orphan (galaxies not assigned to any dark matter
subhalo) and non-orphan galaxies in these measurements. Our main result is that
the galaxy formation models generally agree in their clustering predictions but
they disagree significantly between HOD and SAMs for the orphan satellites.
Although there is a very good agreement between the models on the 2PCF of
central galaxies, the scatter between the models when orphan satellites are
included can be larger than a factor of 2 for scales smaller than 1 Mpc/h. We
also show that galaxy formation models that do not include orphan satellite
galaxies have a significantly lower 2PCF on small scales, consistent with
previous studies. Finally, we show that the 2PCF of orphan satellites is
remarkably different between SAMs and HOD models. Orphan satellites in SAMs
present a higher clustering than in HOD models because they tend to occupy more
massive haloes. We conclude that orphan satellites have an important role in
galaxy clustering and they are the main cause of the differences in the
clustering between HOD models and SAMs.

We report the observation and physical characterization of the possible dwarf
planet 2014 UZ224 ("DeeDee"), a dynamically detached trans-Neptunian object
discovered at 92 AU. This object is currently the second-most distant known
trans-Neptunian object with reported orbital elements, surpassed in distance
only by the dwarf planet Eris. The object was discovered with an $r$-band
magnitude of 23.0 in data collected by the Dark Energy Survey between 2014 and
2016. Its 1140-year orbit has $(a,e,i) = (109~\mathrm{AU}, 0.65,
26.8^{\circ})$. It will reach its perihelion distance of 38 AU in the year
2142. Integrations of its orbit show it to be dynamically stable on Gyr
timescales, with only weak interactions with Neptune. We have performed
follow-up observations with ALMA, using 3 hours of on-source integration time to
measure the object's thermal emission in the Rayleigh-Jeans tail. The signal is
detected at 7$\sigma$ significance, from which we determine a $V$-band albedo
of $13.1^{+3.3}_{-2.4}\mathrm{(stat)}^{+2.0}_{-1.4}\mathrm{(sys)}$ percent and
a diameter of $635^{+57}_{-61}\mathrm{(stat)}^{+32}_{-39}\mathrm{(sys)}$~km,
assuming a spherical body with uniform surface properties.
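
The quoted orbital period and perihelion distance follow from the orbital elements. The sketch below is a quick consistency check, assuming a two-body Keplerian orbit around a one-solar-mass Sun so that the period in years is $a^{3/2}$ with $a$ in AU; the function names are chosen for this example.

```python
def orbital_period_years(a_au):
    """Kepler's third law for a body orbiting the Sun:
    P [yr] = a^(3/2) with a in AU (planet mass neglected)."""
    return a_au ** 1.5

def perihelion_au(a_au, eccentricity):
    """Perihelion distance q = a (1 - e)."""
    return a_au * (1.0 - eccentricity)

# Rounded elements quoted in the text.
a, e = 109.0, 0.65
period = orbital_period_years(a)
q = perihelion_au(a, e)
print(f"P = {period:.0f} yr, q = {q:.1f} AU")
```

Both values agree with the quoted 1140-year period and 38 AU perihelion to within the rounding of $a$ and $e$.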

It is usually assumed that in the linear regime the two-point correlation
function of galaxies contains only a monopole, quadrupole and hexadecapole.
Looking at cross-correlations between different populations of galaxies, this
turns out not to be the case. In particular, the cross-correlations between a
bright and a faint population of galaxies contain also a dipole. In this paper
we present the first attempt to measure this dipole. We discuss the four types
of effects that contribute to the dipole: relativistic distortions, evolution
effect, wide-angle effect and large-angle effect. We show that the first three
contributions are intrinsic antisymmetric contributions that do not depend on
the choice of angle used to measure the dipole. On the other hand the
large-angle effect appears only if the angle chosen to extract the dipole
breaks the symmetry of the problem. We show that the relativistic distortions,
the evolution effect and the wide-angle effect are too small to be detected in
the LOWz and CMASS samples of the BOSS survey. On the other hand with a specific
combination of angles we are able to measure the large-angle effect with high
significance. We emphasise that this large-angle dipole does not contain new
physical information, since it is just a geometrical combination of the
monopole and the quadrupole. However this measurement, which is in excellent
agreement with theoretical predictions, validates our method for extracting the
dipole from the two-point correlation function and it opens the way to the
detection of relativistic effects in future surveys such as DESI.
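
As an illustration of how a dipole is separated from the even multipoles, the sketch below projects a toy correlation function onto Legendre polynomials, $\xi_\ell = \frac{2\ell+1}{2}\int_{-1}^{1}\xi(\mu)L_\ell(\mu)\,d\mu$, integrating over the full range of $\mu$ because a cross-correlation need not be symmetric under $\mu \to -\mu$. The function `multipole`, the toy model `toy_xi` and its coefficients are assumptions made for this example; this is not the estimator applied to the BOSS data.

```python
import numpy as np
from numpy.polynomial.legendre import leggauss, legval

def multipole(xi_of_mu, ell, n_nodes=32):
    """Legendre multipole of xi(mu) at fixed separation, using
    Gauss-Legendre quadrature over -1 <= mu <= 1."""
    mu, weights = leggauss(n_nodes)
    coeffs = np.zeros(ell + 1)
    coeffs[ell] = 1.0                      # selects L_ell in legval
    integrand = xi_of_mu(mu) * legval(mu, coeffs)
    return 0.5 * (2 * ell + 1) * np.sum(weights * integrand)

def toy_xi(mu):
    """Toy model: monopole 1.0, dipole 0.1, quadrupole 0.5."""
    return 1.0 + 0.1 * mu + 0.5 * 0.5 * (3.0 * mu**2 - 1.0)

monopole = multipole(toy_xi, 0)
dipole = multipole(toy_xi, 1)
quadrupole = multipole(toy_xi, 2)
print(monopole, dipole, quadrupole)
```

The odd term survives only because the integral runs over both signs of $\mu$; folding the pair counts onto $0 \le \mu \le 1$, as is common for auto-correlations, would erase it.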

It has been shown recently that relativistic distortions generate a dipolar
modulation in the two-point correlation function of galaxies. To measure this
relativistic dipole it is necessary to cross-correlate different populations of
galaxies with for example different luminosities or colours. In this paper, we
construct an optimal estimator to measure the dipole with multiple populations.
We show that this estimator increases the signal-to-noise of the dipole by up
to 35 percent. Using 6 populations of galaxies, in a survey with halos and
number densities similar to those of the Millennium simulation, we forecast a
cumulative signal-to-noise of 4.4. For the main galaxy sample of SDSS at low
redshift z<0.2 our optimal estimator predicts a cumulative signal-to-noise of
2.4. Finally we forecast a cumulative signal-to-noise of 7.4 in the upcoming
DESI survey. These forecasts indicate that with the appropriate choice of
estimator the relativistic dipole should be detectable in current and future
surveys.

DESI (Dark Energy Spectroscopic Instrument) is a Stage IV ground-based dark
energy experiment that will study baryon acoustic oscillations (BAO) and the
growth of structure through redshift-space distortions with a wide-area galaxy
and quasar redshift survey. To trace the underlying dark matter distribution,
spectroscopic targets will be selected in four classes from imaging data. We
will measure luminous red galaxies up to $z=1.0$. To probe the Universe out to
even higher redshift, DESI will target bright [O II] emission line galaxies up
to $z=1.7$. Quasars will be targeted both as direct tracers of the underlying
dark matter distribution and, at higher redshifts ($ 2.1 < z < 3.5$), for the
Ly$\alpha$ forest absorption features in their spectra, which will be used to
trace the distribution of neutral hydrogen. When moonlight prevents efficient
observations of the faint targets of the baseline survey, DESI will conduct a
magnitude-limited Bright Galaxy Survey comprising approximately 10 million
galaxies with a median $z\approx 0.2$. In total, more than 30 million galaxy
and quasar redshifts will be obtained to measure the BAO feature and determine
the matter power spectrum, including redshift space distortions.

DESI (Dark Energy Spectroscopic Instrument) is a Stage IV ground-based dark
energy experiment that will study baryon acoustic oscillations and the growth
of structure through redshift-space distortions with a wide-area galaxy and
quasar redshift survey. The DESI instrument is a robotically-actuated,
fiber-fed spectrograph capable of taking up to 5,000 simultaneous spectra over
a wavelength range from 360 nm to 980 nm. The fibers feed ten three-arm
spectrographs with resolution $R= \lambda/\Delta\lambda$ between 2000 and 5500,
depending on wavelength. The DESI instrument will be used to conduct a
five-year survey designed to cover 14,000 deg$^2$. This powerful instrument
will be installed at prime focus on the 4-m Mayall telescope in Kitt Peak,
Arizona, along with a new optical corrector, which will provide a three-degree
diameter field of view. The DESI collaboration will also deliver a
spectroscopic pipeline and data management system to reduce and archive all
data for eventual public use.

We study the linear and non-linear bias parameters which determine the
mapping between the distributions of galaxies and the full matter density
fields, comparing different measurements and predictions. Associating galaxies
with dark matter haloes in the MICE Grand Challenge N-body simulation we
directly measure the bias parameters by comparing the smoothed density
fluctuations of haloes and matter in the same region at different positions as
a function of smoothing scale. Alternatively we measure the bias parameters by
matching the probability distributions of halo and matter density fluctuations,
which can be applied to observations. These direct bias measurements are
compared to corresponding measurements from two-point and different third-order
correlations, as well as predictions from the peak-background model, which we
presented in previous articles using the same data. We find an overall
variation of the linear bias measurements and predictions of $\sim 5 \%$ with
respect to results from two-point correlations for different halo samples with
masses between $\sim 10^{12} - 10^{15}$ $h^{-1}M_\odot$ at the redshifts
$z=0.0$ and $0.5$. The second- and third-order bias parameters from the
different methods show larger variations, but with
consistent trends in mass and redshift. The various bias measurements reveal a
tight relation between the linear and the quadratic bias parameters, which is
consistent with results from the literature based on simulations with different
cosmologies. Such a universal relation might improve constraints on
cosmological models, derived from second-order clustering statistics at small
scales or higher-order clustering statistics.

Using dark matter simulations we show how halo bias is determined by local
density and not by halo mass. This is not totally surprising, as according to
the peak-background split model, local density is the property that constrains
bias at large scales. Massive haloes have a high clustering because they reside
in high density regions. Small haloes can be found in a wide range of
environments which determine their clustering amplitudes differently. This
contradicts the assumption of standard Halo Occupation Distribution (HOD)
models that the bias and occupation of haloes are determined solely by their
mass. We show that the bias of central galaxies from semi-analytic models of
galaxy formation as a function of luminosity and colour is not correctly
predicted by the standard HOD model. Using local density instead of halo mass
the HOD model correctly predicts galaxy bias. These results indicate the need
to include information about local density and not only mass in order to
correctly apply HOD analysis in these galaxy samples. This new model can be
readily applied to observations and has the advantage that the galaxy density
can be directly observed, in contrast with the dark matter halo mass.

We present a new method to measure the redshift-dependent galaxy bias by
combining information from the galaxy density field and the weak lensing field.
This method is based on Amara et al. (2012), where they use the galaxy density
field to construct a bias-weighted convergence field kg. The main difference
between Amara et al. (2012) and our new implementation is that here we present
another way to measure galaxy bias using tomography instead of bias
parameterizations. The correlation between kg and the true lensing field k
allows us to measure galaxy bias using different zero-lag correlations, such as
<kgk>/<kk> or <kgkg>/<kgk>. Our method measures the linear bias factor on
linear scales under the assumption of no stochasticity between galaxies and
matter. We use the MICE simulation to measure the linear galaxy bias for a
flux-limited sample (i < 22.5) in tomographic redshift bins using this method.
This paper is the first that studies the accuracy and systematic uncertainties
associated with the implementation of the method, and the regime where it is
consistent with the linear galaxy bias defined by projected 2-point correlation
functions (2PCF). We find that our method is consistent with linear bias at the
percent level for scales larger than 30 arcmin, while non-linearities appear at
smaller scales. This measurement is a good complement to other measurements of
bias, since, unlike the 2PCF measurements, it does not depend strongly on
sigma8. We apply this method to the Dark Energy Survey Science Verification
data in a follow-up paper.
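
The two zero-lag ratios can be sketched on pixelised maps. The example below assumes noise-free maps on a common grid and a deterministic relation kg = b k (no stochasticity), in which case both <kg k>/<k k> and <kg kg>/<kg k> return b. The function name, map size and bias value are hypothetical, and a real analysis would also need smoothing, masking and noise corrections.

```python
import numpy as np

def zero_lag_bias(kappa_g, kappa):
    """Estimate linear bias from zero-lag (same-pixel) moments of a
    bias-weighted convergence map kappa_g and a lensing map kappa."""
    kg = kappa_g - kappa_g.mean()
    k = kappa - kappa.mean()
    b_cross = np.mean(kg * k) / np.mean(k * k)    # <kg k> / <k k>
    b_auto = np.mean(kg * kg) / np.mean(kg * k)   # <kg kg> / <kg k>
    return b_cross, b_auto

# Synthetic maps: Gaussian convergence and a perfectly biased tracer.
rng = np.random.default_rng(0)
kappa = rng.normal(0.0, 0.01, size=(64, 64))
kappa_g = 1.3 * kappa

b_cross, b_auto = zero_lag_bias(kappa_g, kappa)
print(b_cross, b_auto)
```

If the two fields were only partially correlated, the two ratios would differ, which is why the no-stochasticity assumption matters for the method.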

Host galaxy identification is a crucial step for modern supernova (SN)
surveys such as the Dark Energy Survey (DES) and the Large Synoptic Survey
Telescope (LSST), which will discover SNe by the thousands. Spectroscopic
resources are limited, so in the absence of real-time SN spectra these surveys
must rely on host galaxy spectra to obtain accurate redshifts for the Hubble
diagram and to improve photometric classification of SNe. In addition, SN
luminosities are known to correlate with host-galaxy properties. Therefore,
reliable identification of host galaxies is essential for cosmology and SN
science. We simulate SN events and their locations within their host galaxies
to develop and test methods for matching SNe to their hosts. We use both real
and simulated galaxy catalog data from the Advanced Camera for Surveys General
Catalog and MICECATv2.0, respectively. We also incorporate "hostless" SNe
residing in undetected faint hosts into our analysis, with an assumed hostless
rate of 5%. Our fully automated algorithm is run on catalog data and matches
SNe to their hosts with 91% accuracy. We find that including a machine learning
component, run after the initial matching algorithm, improves the accuracy
(purity) of the matching to 97% with a 2% cost in efficiency (true positive
rate). Although the exact results are dependent on the details of the survey
and the galaxy catalogs used, the method of identifying host galaxies we
outline here can be applied to any transient survey.
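
The purity and efficiency quoted above can be computed from two boolean flags per SN. The sketch below assumes one flag saying whether the matched host is truly correct and another saying whether a classifier accepted the match; the function name and the five-object sample are hypothetical, and the matching algorithm itself is not reproduced here.

```python
import numpy as np

def purity_and_efficiency(is_correct, accepted):
    """Purity: fraction of accepted matches that are correct.
    Efficiency (true positive rate): fraction of correct matches
    that are accepted."""
    is_correct = np.asarray(is_correct, dtype=bool)
    accepted = np.asarray(accepted, dtype=bool)
    true_positives = np.sum(is_correct & accepted)
    purity = true_positives / np.sum(accepted)
    efficiency = true_positives / np.sum(is_correct)
    return purity, efficiency

# Hypothetical sample: four correct matches and one wrong match.
is_correct = [True, True, True, True, False]
# The classifier rejects the wrong match and one correct match.
accepted = [True, True, True, False, False]

purity, efficiency = purity_and_efficiency(is_correct, accepted)
print(purity, efficiency)
```

Rejecting uncertain matches raises purity at the expense of efficiency, which is the trade-off described in the text.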

We present cosmological constraints from the Dark Energy Survey (DES) using a
combined analysis of angular clustering of red galaxies and their
cross-correlation with weak gravitational lensing of background galaxies. We
use a 139 square degree contiguous patch of DES data from the Science
Verification (SV) period of observations. Using large scale measurements, we
constrain the matter density of the Universe as Omega_m = 0.31 +/- 0.09 and the
clustering amplitude of the matter power spectrum as sigma_8 = 0.74 +/- 0.13
after marginalizing over seven nuisance parameters and three additional
cosmological parameters. This translates into S_8 = sigma_8(Omega_m/0.3)^{0.16}
= 0.74 +/- 0.12 for our fiducial lens redshift bin at 0.35 <z< 0.5, while S_8 =
0.78 +/- 0.09 using two bins over the range 0.2 <z< 0.5. We study the
robustness of the results under changes in the data vectors, modelling and
systematics treatment, including photometric redshift and shear calibration
uncertainties, and find consistency in the derived cosmological parameters. We
show that our results are consistent with previous cosmological analyses from
DES and other data sets and conclude with a joint analysis of DES angular
clustering and galaxy-galaxy lensing with Planck CMB data, Baryon Acoustic
Oscillations and Supernova type Ia measurements.
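
The S_8 combination can be evaluated directly from its definition. The sketch below plugs in the quoted central values only to illustrate the formula; the published S_8 constraints come from the full posterior, so they need not equal this point estimate. The function name `s8` and its keyword arguments are chosen for this example.

```python
def s8(sigma_8, omega_m, alpha=0.16, omega_ref=0.3):
    """S_8 = sigma_8 * (Omega_m / omega_ref)^alpha, with the exponent
    and pivot used in the text as defaults."""
    return sigma_8 * (omega_m / omega_ref) ** alpha

# Central values quoted for the fiducial lens bin.
value = s8(0.74, 0.31)
print(f"S_8 = {value:.3f}")
```

Because the exponent is small and Omega_m is close to the pivot, S_8 stays close to sigma_8 here.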

We study halo clustering bias with second- and third-order statistics of halo
and matter density fields in the MICE Grand Challenge simulation. We verify
that two-point correlations deliver reliable estimates of the linear bias
parameters at large scales, while estimations from the variance can be
significantly affected by non-linear and possibly non-local contributions to
the bias function. Combining three-point auto- and cross-correlations we find,
for the first time in configuration space, evidence for the presence of such
non-local contributions. These contributions are consistent with predicted
second-order non-local effects on the bias functions originating from the dark
matter tidal field. Samples of massive haloes show indications of bias (local
or non-local) beyond second order. Ignoring non-local bias causes $20-30$\% and
$5-10$\% overestimation of the linear bias from three-point auto- and
cross-correlations respectively. We study two third-order bias estimators which
are not affected by second-order non-local contributions. One is a combination
of three-point auto- and cross-correlation. The other is a combination of
third-order one- and two-point cumulants. Both methods deliver accurate
estimations of the linear bias. Furthermore their estimations of second-order
bias agree mutually. Ignoring non-local bias causes higher values of the
second-order bias from three-point correlations. Our results demonstrate that
third-order statistics can be employed for breaking the growth-bias degeneracy.

In the first paper of this series, we studied the effect of baryon acoustic
oscillations (BAO), redshift space distortions (RSD) and weak lensing (WL) on
measurements of angular cross-correlations in narrow redshift bins. Paper-II
presented a multi-tracer forecast as Figures of Merit (FoM), combining a
photometric and spectroscopic stage-IV survey. The uncertainty from galaxy
bias, the way light traces mass, is an important ingredient in the forecast.
Fixing the bias would increase our FoM by an amount equivalent to a 3.3 times
larger area for the combined constraints. This paper focuses on how the
modelling of bias affects these results. In the combined forecast, lensing both
helps and benefits from the
improved bias measurements in overlapping surveys after marginalizing over the
cosmological parameters. Adding a second lens population in counts-shear does
not have a large impact on bias error, but removing all counts-shear
information increases the bias error in a significant way. We also discuss the
relative impact of WL, magnification, RSD and BAO, and how results change as a
function of bias amplitude, photo-z error and sample density. By default we use
one bias parameter per bin (with 72 narrow bins), but we show that the results
do not change much when we use other parameterizations, with at least 3
parameters in total. Bias stochasticity, even when added as one new free
parameter per bin, produces only a moderate decrease in the FoM. In general, we
find that the degradation in the figure of merit caused by the uncertainties in
the knowledge of bias is significantly smaller for overlapping surveys.

We model the abundance of haloes in the $\sim(3 \ \text{Gpc}/h)^3$ volume of
the MICE Grand Challenge simulation by fitting the universal mass function with
an improved Jack-Knife error covariance estimator that matches theory
predictions. We present unifying relations between different fitting models and
new predictions for linear ($b_1$) and non-linear ($c_2$ and $c_3$) halo
clustering bias. Different mass function fits show strong variations in their
performance when including the low mass range ($M_h \lesssim 3 \times 10^{12} \
M_{\odot}/h$) in the analysis. Together with fits from the literature we find
an overall variation in the amplitudes of around $10$% in the low mass and up
to $50$% in the high mass (galaxy cluster) range ($M_h > 10^{14} \
M_{\odot}/h$). These variations propagate into a $10$% change in $b_1$
predictions and a $50$% change in $c_2$ or $c_3$. Despite these strong
variations we find universal relations between $b_1$ and $c_2$ or $c_3$ for
which we provide simple fits. Excluding low mass haloes, different models
fitted with reasonable goodness in this analysis, show percent level agreement
in their $b_1$ predictions, but are systematically $5-10$% lower than the bias
directly measured with two-point halo-mass clustering. This result confirms
previous findings derived from smaller volumes (and smaller masses).
Inaccuracies in the bias predictions lead to $5-10$% errors in growth
measurements. They also affect any HOD fitting or (cluster) mass calibration
from clustering measurements.
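
For orientation, the sketch below implements the standard delete-one Jack-Knife covariance, $C_{ij} = \frac{N-1}{N}\sum_k (x^k_i - \bar{x}_i)(x^k_j - \bar{x}_j)$, where $x^k$ is the statistic measured with sub-volume $k$ removed. This is the textbook estimator, not the improved variant referred to in the text, and the function name and toy numbers are assumptions for this example.

```python
import numpy as np

def jackknife_covariance(samples):
    """Delete-one Jack-Knife covariance.
    samples: array of shape (n_jk, n_bins); row k holds the statistic
    (for example the mass function in each mass bin) measured with
    sub-volume k left out."""
    samples = np.asarray(samples, dtype=float)
    n_jk = samples.shape[0]
    diff = samples - samples.mean(axis=0)
    return (n_jk - 1) / n_jk * diff.T @ diff

# Toy input: three Jack-Knife realisations of a two-bin statistic.
toy = [[1.0, 2.0],
       [3.0, 4.0],
       [5.0, 6.0]]
cov = jackknife_covariance(toy)
print(cov)
```

The prefactor (N-1)/N, rather than 1/(N-1), accounts for the strong overlap between the delete-one samples.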

Future spectroscopic and photometric surveys will measure accurate positions
and shapes of an increasing number of galaxies. In the previous paper of this
series we studied the effects of Redshift Space Distortions (RSD), baryon
acoustic oscillations (BAO) and Weak gravitational Lensing (WL) using angular
cross-correlation. Here, we provide a new forecast that explores the
contribution of including different observables, physical effects (galaxy bias,
WL, RSD, BAO) and approximations (non-linearities, Limber approximation,
covariance between probes). The radial information is included by using the
cross-correlation of separate narrow redshift bins. For the auto correlation
the separation of galaxy pairs is mostly transverse, while the
cross-correlations also include a radial component. We study how this
information adds to our figure of merit (FoM), which includes the dark energy
equation of state $w(z)$ and the growth history, parameterized by $\gamma$. We
show that the Limber approximation and galaxy bias are the most critical
ingredients to the modelling of correlations. Adding WL increases our FoM by
4.8, RSD by 2.1 and BAO by 1.3. We also explore how overlapping surveys perform
under the different assumptions and for different figures of merit. Our
qualitative conclusions depend on the survey choices and scales included, but
we find some clear tendencies that highlight the importance of combining
different probes and can be used to guide and optimise survey strategies.

This article looks at the combined constraints from a photometric and
spectroscopic survey. These surveys will measure cosmology using weak lensing
(WL), galaxy clustering, baryon acoustic oscillations (BAO) and redshift space
distortions (RSD). We find, contrary to some findings in the recent literature,
that overlapping surveys can give important benefits when measuring dark
energy. We therefore try to clarify the status of this issue with a full
forecast of two stage-IV surveys using a new approach to properly account for
covariance between the different probes in the overlapping samples. The benefit
of the overlapping survey can be traced back to two factors: additional
observables and sample variance cancellation. Both need to be taken into
account and contribute equally when combining 3D power spectrum and 2D
correlations for lensing. With an analytic example we also illustrate that for
optimal constraints, one should minimize the (Pearson) correlation coefficient
between cosmological and nuisance parameters and maximize the one among
nuisance parameters (e.g. galaxy bias) in the two samples. This can be achieved
by increasing the overlap between the spectroscopic and photometric surveys. We
show how BAO, WL and RSD contribute to this benefit and also look at some other
survey designs, such as photometric redshift errors and spectroscopic density.

Measurements of the linear growth factor $D$ at different redshifts $z$ are
key to distinguish among cosmological models. One can estimate the derivative
$dD(z)/d\ln(1+z)$ from redshift space measurements of the 3D anisotropic galaxy
two-point correlation $\xi(z)$, but the degeneracy of its transverse (or
projected) component with galaxy bias $b$, i.e. $\xi_{\perp}(z) \propto\ D^2(z)
b^2(z)$, introduces large errors in the growth measurement. Here we present a
comparison between two methods which break this degeneracy by combining second-
and third-order statistics. One uses the shape of the reduced three-point
correlation and the other a combination of third-order one- and two-point
cumulants. These methods use the fact that, for Gaussian initial conditions and
scales larger than $20$ $h^{-1}$Mpc, the reduced third-order matter
correlations are independent of redshift (and therefore of the growth factor)
while the third-order galaxy correlations depend on $b$. We use matter and halo
catalogs from the MICE-GC simulation to test how well we can recover $b(z)$ and
therefore $D(z)$ with these methods in 3D real space. We also present a new
approach, which enables us to measure $D$ directly from the redshift evolution
of second- and third-order galaxy correlations without the need of modelling
matter correlations. For haloes with masses lower than $10^{14}$
$h^{-1}$M$_\odot$, we find $10\%$ deviations between the different estimates of
$D$, which are comparable to current observational errors. At higher masses we
find larger differences that can probably be attributed to the breakdown of the
bias model and non-Poissonian shot noise.

Weak lensing (WL) clustering is studied using 2D (angular) coordinates, while
redshift space distortions (RSD) and baryon acoustic oscillations (BAO) use 3D
coordinates, which requires a model dependent conversion of angles and
redshifts into comoving distances. This is the first paper of a series, which
explores modelling multi-tracer galaxy clustering (of WL, BAO and RSD), using
only angular (2D) cross-correlations in thin redshift bins. This involves
evaluating many thousands of cross-correlations, each a multidimensional integral,
which is computationally demanding. We present a new algorithm that performs
these calculations as matrix operations.
Nearby narrow redshift bins are intrinsically correlated, which can be used
to recover the full (radial) 3D information. We show that the Limber
approximation does not work well for this task. In the exact calculation, both
the clustering amplitude and the RSD effect increase when decreasing the
redshift bin width. For narrow bins, the cross-correlation has a larger BAO
peak than the auto-correlation because smaller scales are filtered out by the
radial redshift separation. Moreover, the BAO peak shows a second (ghost) peak,
shifted to smaller angles. We explore how WL, RSD and BAO contribute to the
cross-correlations as a function of the redshift bin width and present a first
exploration of non-linear effects and signal-to-noise ratio on these
quantities. This illustrates that the new approach to clustering analysis
provides new insights and is potentially viable in practice.

Several papers have recently highlighted the possibility of measuring
redshift space distortions from angular auto-correlations of galaxies in
photometric redshift bins. In this work we extend this idea to include as
observables the cross-correlations between redshift bins, as an additional way
of measuring radial information. We show that this extra information allows us
to reduce the recovered error in the growth rate index \gamma by a factor of ~2.
Although the final error in \gamma depends on the bias and the mean photometric
accuracy of the galaxy sample, the improvement from adding cross-correlations
is robust in different settings. Another factor of 2-3 improvement in the
determination of \gamma can be achieved by considering two galaxy populations
over the same photometric sky area but with different biases. This additional
gain is shown to be much larger than the one from the same populations when
observed over different areas of the sky (with twice the combined area). The
total improvement of ~5 implies that a photometric survey such as the Dark
Energy Survey should be able to recover \gamma at the 5-10% level from the
angular clustering in linear scales of two different tracers. It can also
constrain the evolution of f(z)x\sigma_8(z) in a few bins beyond z~0.8-0.9 at
the 10-15% level per-bin, compatible with recent constraints from lower-z
spectroscopic surveys.
We also show how further improvement can be achieved by reducing the
photometric redshift error.
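
The growth rate index \gamma enters through the common parameterisation f(z) = Omega_m(z)^\gamma, with \gamma close to 0.55 in General Relativity. The sketch below evaluates it assuming a flat LCDM background; the default Omega_m = 0.25 and the function names are illustrative choices, not values taken from the paper.

```python
def omega_m_of_z(z, omega_m0=0.25):
    """Matter density parameter at redshift z in flat LCDM."""
    matter = omega_m0 * (1.0 + z) ** 3
    return matter / (matter + 1.0 - omega_m0)

def growth_rate(z, gamma=0.55, omega_m0=0.25):
    """Growth rate f(z) = Omega_m(z)^gamma."""
    return omega_m_of_z(z, omega_m0) ** gamma

for z in (0.0, 0.5, 1.0):
    print(z, round(growth_rate(z), 3))
```

A larger \gamma lowers f(z) at every redshift where Omega_m(z) < 1, which is what measurements of redshift space distortions are sensitive to.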

We study the shapes of subhalo distributions from four dark-matter-only
simulations of Milky Way type haloes. Comparing the shapes derived from the
subhalo distributions at high resolution to those of the underlying dark matter
fields we find the former to be more triaxial if the analysis is restricted to
massive subhaloes. For three of the four analysed haloes the increased
triaxiality of the distributions of massive subhaloes can be explained by a
systematic effect caused by the low number of objects. Subhaloes of the fourth
halo show indications for anisotropic accretion via their strong triaxial
distribution and orbit alignment with respect to the dark matter field. These
results are independent of the employed subhalo finder. Comparing the shape of
the observed Milky Way satellite distribution to those of high-resolution
subhalo samples from simulations, we find an agreement for samples of bright
satellites, but significant deviations if faint satellites are included in the
analysis. These deviations might result from observational incompleteness.
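
One common way to quantify the shape of a discrete distribution is through the eigenvalues of its second-moment tensor. The sketch below assumes that approach, with positions measured relative to their mean and triaxiality defined as T = (a^2 - b^2)/(a^2 - c^2) for axes a >= b >= c; the exact shape definition used in the paper may differ, and the function name and test points are hypothetical.

```python
import numpy as np

def axis_ratios(positions):
    """Return (b/a, c/a, T) from the second-moment tensor of a set of
    3D positions, where a >= b >= c are the principal axis lengths."""
    x = np.asarray(positions, dtype=float)
    x = x - x.mean(axis=0)
    shape_tensor = x.T @ x / len(x)
    eigenvalues = np.linalg.eigvalsh(shape_tensor)   # ascending order
    c, b, a = np.sqrt(eigenvalues)
    triaxiality = (a**2 - b**2) / (a**2 - c**2)
    return b / a, c / a, triaxiality

# Six points on the axes of an ellipsoid with semi-axes 3, 2 and 1.
points = [[3, 0, 0], [-3, 0, 0],
          [0, 2, 0], [0, -2, 0],
          [0, 0, 1], [0, 0, -1]]
b_over_a, c_over_a, triax = axis_ratios(points)
print(b_over_a, c_over_a, triax)
```

With only a handful of objects the eigenvalues scatter strongly, which is the low-number systematic effect mentioned in the text.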

We study how well we can reconstruct the 2-point clustering of galaxies on
linear scales, as a function of mass and luminosity, using the halo occupation
distribution (HOD) in several semi-analytical models (SAMs) of galaxy formation
from the Millennium Simulation. We find that HOD with Friends of Friends groups
can reproduce galaxy clustering better than gravitationally bound haloes. This
indicates that Friends of Friends groups are more directly related to the
clustering of these regions than the bound particles of the overdensities. In
general we find that the reconstruction works at best to 5% accuracy: it
underestimates the bias for bright galaxies. This translates to an
overestimation of 50% in the halo mass when we use clustering to calibrate
mass. We also found a degeneracy on the mass prediction from the clustering
amplitude that affects all the masses. This effect is due to the clustering
dependence on the host halo substructure, an indication of assembly bias. We
show that the clustering of haloes of a given mass increases with the number of
subhaloes, a result that only depends on the underlying matter distribution. As
the number of galaxies increases with the number of subhaloes in SAMs, this
results in a low bias for the HOD reconstruction. We expect this effect to
apply to other models of galaxy formation, including the real universe, as long
as the number of galaxies increases with the number of subhaloes. We have also
found that the reconstruction of galaxy bias from the HOD model fails for low
mass haloes with M = 3-5x10^11 Msun/h. We find that this is because galaxy
clustering is more strongly affected by assembly bias for these low masses.

We study the two-point cross-correlation function between two populations of
galaxies: for instance a bright population and a faint population. We show that
this cross-correlation is asymmetric under the exchange of the line-of-sight
coordinate of the galaxies, i.e. that the correlation is different if the
bright galaxy is in front of, or behind, the faint galaxy. We give an
intuitive, quasi-Newtonian derivation of all the effects that contribute to
such an asymmetry in large-scale structure: gravitational redshift, Doppler
shift, lensing, light-cone, evolution and Alcock-Paczynski effects;
interestingly, the gravitational redshift term is exactly canceled by some of
the others, assuming geodesic motion. Most of these effects are captured by
previous calculations of general relativistic corrections to the observed
galaxy density fluctuation; the asymmetry arises from terms that are suppressed
by the ratio H/k (H is the Hubble constant and k is the wavenumber), which are
more readily observable than the terms suppressed by (H/k)^2. Some of the
contributions to the asymmetry, however, arise from terms that are generally
considered 'Newtonian' (the lensing and evolution) and thus represent a
contaminant in the search for general relativistic corrections. We propose
methods to disentangle these different contributions. A simple method reduces
the contamination to a level of < 10% for redshifts z<1. We also clarify the
relation to recent work on measuring gravitational redshifts by stacking
clusters.