Professional Documents
Culture Documents
Abstract
One-point probability distribution functions (PDFs) of the cosmic matter density are powerful
cosmological probes that extract non-Gaussian properties of the matter distribution and complement
two-point statistics. Computing the covariance of one-point PDFs is key for building a robust galaxy
survey analysis for upcoming surveys like Euclid and the Rubin Observatory LSST and requires good
models for the two-point PDFs characterising spatial correlations. In this work, we obtain accurate
PDF covariances using effective shifted lognormal two-point PDF models for the mildly non-Gaussian
weak lensing convergence and validate our predictions against large sets of Gaussian and non-Gaussian
maps. We show how the dominant effects in the covariance matrix capturing super-sample covariance
arise from a large-separation expansion of the two-point PDF and discuss differences between the
covariances obtained from small patches and full sky maps. Finally, we describe how our formalism
can be extended to characterise the PDF covariance for 3D-dimensional spectroscopic fields using the
3D matter PDF as an example. We describe how covariances from simulated boxes with fixed overall
density can be supplemented with the missing super-sample covariance effect by relying on theoretical
predictions validated against separate-universe style simulations.
1. INTRODUCTION skewness at one given scale (c.f. Section 5.2 and Figure
The recent rise of non-Gaussian statistics for galaxy 4 of Friedrich et al. 2020). This however also highlights
clustering and weak lensing has highlighted the need for a limitation of those simulations: they potentially fail
accurate models of the likelihood function and, in par- to realistically capture the hierarchy of moments beyond
ticular, the covariance matrices of such statistics. Aside the skewness (for an extension see Baratta et al. 2020) as
from the challenge to model non-Gaussian statistics and well as the full shape of higher order N-point functions.
capture their response to changing cosmological param- With the large data vectors that future multi-probe
eters in the nonlinear regime, getting accurate covari- analyses of upcoming surveys like Euclid (Laureijs et al.
ance matrices can prove difficult and require thousands 2011; Amendola et al. 2018) and the Rubin Observatory
of numerical simulations to ensure convergence (Taylor & LSST (Ivezić et al. 2019) are aiming for, estimated co-
Joachimi 2014; Colavincenzo et al. 2019). For statistics variances - whether from log-normal simulations or other
relying on higher-order correlation functions involving N mock data - can lead to a huge degradation of cosmolog-
points, modelling the covariance generally requires as- ical constraining power (Dodelson & Schneider 2013a;
sumptions for the shape of 2N -point correlations. Even Friedrich & Eifler 2017; Percival et al. 2022). This makes
for Gaussian fields, this task has only been achieved re- it desirable to have an analytic understanding of mea-
cently at all orders N (Hou et al. 2021). To go beyond the surement uncertainties. One-point statistics such as the
assumption of Gaussianity, log-normal models of the cos- PDF of the weak lensing convergence have the advan-
mic density field have become a tool of choice in higher- tage that both their dependence on cosmological param-
order analyses such as studies of the bispectrum (Mar- eters and their covariance, being given in terms of the
tin et al. 2012; Halder et al. 2021), moments of cosmic two-point PDF, can be modelled theoretically (Codis
random fields (Gatti et al. 2019), density split statis- et al. 2016b; Uhlemann et al. 2017). In this work, we
tics (Friedrich et al. 2018; Gruen et al. 2018), the full will focus on the weak lensing convergence smoothed on
probability distribution function (PDF) of cosmic ran- mildly nonlinear scales (of order 100 at source redshifts of
dom fields (Uhlemann et al. 2020; Friedrich et al. 2020; z ' 1 − 2), where the bulk of the PDF and its cosmology
Boyle et al. 2021), and even map-based inference ap- dependence can be predicted at the percent level using
proaches (Sarma Boruah et al. 2022). The reason why large deviation statistics (Bernardeau & Valageas 2000;
(shifted) log-normal simulations (Xavier et al. 2016) have Bernardeau & Reimberg 2016; Barthelemy et al. 2020a;
become so popular is that they obey a similar hierarchy Boyle et al. 2021) and its covariance is well-approximated
between variance and skewness as the physical cosmic by a shifted lognormal distribution (Boyle et al. 2021).
density field. In particular, they can be tuned to match In the opposite regime for small smoothing scales, the
both a desired power spectrum at all scales and a desired high-convergence tail of the weak lensing convergence
PDF and its covariance have been modelled using a halo-
? cora.uhlemann@newcastle.ac.uk
model formalism in Thiele et al. (2020), finding qualita-
2
tive agreement with simulations but significant quanti- ∑ PDF covariance, full sky, O
tative discrepancies attributed to strong sensitivity to Takahashi simulation
small-scale effects. -0.02 0.004
For galaxy clustering, the error on and correlation
among the factorial moments of galaxy counts in cells
Fk = hN (N − 1) . . . (N − k)i have been computed for -0.01 0.002
hierarchical models in Szapudi & Colombi (1996) and
Szapudi et al. (1999). Those studies included cosmic
errors due to a finite volume and discreteness errors 0.0
∑2
0.000
due to a finite number of tracers relevant for cluster-
ing but not the weak lensing convergence, which is in-
stead affected by shape noise due to shear measure- 0.01
°0.002
ments. In Repp & Szapudi (2021a), the variance and
covariance among galaxy counts in cells was obtained for
0.02
the case of a constant separation-independent cell cor- °0.004
relation, which has previously been considered in Codis lognormal theory
et al. (2016b). As we explain here, and derive in Ap- -0.02 -0.01 0.0 0.01 0.02
pendix B, the large-separation expansion in terms of ∑1
bias functions adopted in Codis et al. (2016b); Uhlemann
et al. (2017); Repp & Szapudi (2021a); Repp & Szapudi Fig. 1.— PDF covariance matrix for circular apertures of radius
θs = 7.50 at source redshift zs = 2 comparing the measurement
(2021b) can be generalised to the physically more re- from 108 full sky Takahashi lensing maps (upper triangle) to the
alistic case of a separation-dependent correlation. The shifted lognormal prediction (lower triangle).
large-separation two-point PDF and the resulting super-
sample covariance effect can also be related to the idea of
the position-dependent matter PDF studied in Jamieson
zs = 2, A = 15000 deg2
& Loverde (2020). Recent work of Bernardeau (2022) Lognormal model
determined covariances of density PDFs in hierarchical Takahashi
models, which can be used to understand the dominant FLASK Takahashi
terms in the covariance matrix of cosmological density
fields in the strongly non-Gaussian regime. In this work,
we build on this foundation but focus on the weakly non-
Gaussian weak lensing PDF for which hierarchical mod-
els are less suitable and better results can be achieved
using shifted lognormal models.
86
0.
STRUCTURE
84
20
22
24
26
78
80
82
84
86
models using several quantitative tests. In Section 5, we
0.
0.
0.
0.
0.
0.
0.
0.
present the basics for generalising our findings to the case Ωcdm σ8 0.
of the three-dimensional matter (or spectroscopic galaxy
clustering) PDF. We conclude in Section 6 and provide Fig. 2.— Fisher forecasts validating the predicted shifted lognor-
an outlook for how our results can be applied to related mal PDF covariance (purple) against measured covariances from
108 realistic full sky simulated maps (red) and 500 associated mock
one-point observables, include tomography and informa- FLASK maps (green). The parameter errors obtained from the
tion from two-point statistics. raw measured covariances have additionally been corrected by the
Hartlap factor (72). To highlight the variation that can result from
EXECUTIVE SUMMARY small samples, we also show contours for two subsamples of 100 of
the 500 mock FLASK maps (green dotted).
We develop a theoretical model for the covariance of
measurements of the one-point weak lensing convergence
PDF in finite bins. Our model is based on an integra-
tion over the two-point PDF (6) at different separations
and we rely on a joint shifted lognormal model for that inverse of the survey area. We validate our predictions
two-point PDF (28) tuned to match the PDF skewness against large sets of simulated convergence maps both
and shown to capture the density-dependence of the two- from N-body simulations (Takahashi et al. 2017) and us-
point correlation (12) in simulations, as illustrated in ing the FLASK tool for generating log-normal random
Figure 8. We establish expressions for the finite sam- fields (Xavier et al. 2016). To verify the quality of our
pling (11) and super-sample covariance (13) contribu- covariance model we employ a range of tests including
tions to the PDF covariance and their scaling with the visual comparisons, an eigendecomposition of the covari-
3
ance, a band-decomposition of the precision matrix, χ2 et al. 2016) to match the convergence PDF and ob-
tests and Fisher forecasts, with Figures 1 and 2 sum- tain precise covariance estimates in good agreement with
marising two main results. In Figure 1 we show the maps from N-body simulations, of which often only a
measured covariance for the central region of the binned very limited number is available. Following this work, we
κ-PDF for a smoothing radius of θs = 7.50 at source red- will study the covariance of the central region of the one-
shift zs = 2 from Nsim = 108 simulated full sky κ maps point PDF of the weak lensing convergence smoothed
from Takahashi et al. (2017) (upper triangle) along with with a top-hat filter of radius θs = 7.30 at a fixed source
the shifted lognormal prediction (lower triangle, using redshift zs = 2 using 26 linearly spaced bins in the range
equations (6) and (28)). For the predictions, we used the −0.024 < κ < 0.026. In this Section we will use both
cosmological parameters of the simulation along with the Gaussian and shifted lognormal fields generated with
shift parameter s ' 0.085 obtained from the second and FLASK to test our theoretical framework for predict-
third moments measured from the PDF. In Figure 8 in ing the weak lensing PDF covariance based on different
the main text we show that the environment-dependent models for the two-point PDF of the weak lensing con-
clustering that enters the eigendecomposition of the co- vergence in cells of a given separation. While we focus
variance is well captured by the shifted lognormal model. on the two-dimensional case of the weak lensing PDF
Aside from the good visual agreement, we quantify here, the formalism can be adapted to describe the three-
their consistency using two tests. First, we test the com- dimensional clustering PDF as we lay out in Section 5.
patibility of the diagonal elements of the two covariances
shown in Figure 1 with a χ2 test, taking into account 2.1. Covariance prediction from joint 2-point PDF
their uncertainty and correlation among each other1 (co-
variance of the covariance). We use the covariance be- Before calculating the covariance matrix of a PDF mea-
tween the numerically estimated diagonal covariance el- surement, let us first specify how this measurement is
conducted. Let us assume that we have observed the
ements Ĉii and Ĉjj as given in equation (36) following
gravitational lensing convergence field on a part of the
Taylor et al. (2013), where Nsim = 108 is the number of
sky with the area Asurvey . Now let us smooth this con-
simulations used to estimate and the covariance elements
vergence field with a circular aperture of some angular
Cij are evaluated with our theoretical model. Using the
radius θs (potentially cutting away parts of the edges of
above expression to compute a χ2 between the diago- the survey, so that each aperture falls fully inside the
nal of the estimated and predicted covariance we find survey area), and let us pixelise the resulting map. Each
χ2 = 36.8 for a number of 26 bins. This corresponds to pixel of that map contains the value of the average con-
a marginal ∼ 1.50σ deviation between the two matrix vergence within a radius θs around the pixel center.
diagonals. We would now like to estimate the probability density
Finally, we assess the quality of the off-diagonal ele- of a certain value κ for the smoothed convergence field.
ments of our covariance model by performing a Fisher In practice one will need to compute this PDF within a
forecast. Figure 2 compares Fisher contours on the pa- set of finite bins, say around a set of central values κi
rameters σ8 and Ωcdm derived from a number of different and with constant bin width ∆κ, which results in the
covariance matrices: our covariance model, the covari- bins bini = [κi − ∆κ/2, κi + ∆κ/2] . The PDF at each
ance estimated from the N-body simulations of Taka- κi can then be approximated as
hashi et al. (2017) and the covariance estimated from
log-normal simulations. The differences in the width and P (κ ∈ [κi − ∆κ/2, κi + ∆κ/2])
alignment of the constraints obtained from these matri- P(κi ) ≈ , (1)
∆κ
ces is negligible compared to the overall parameter uncer-
tainties. Hence, our covariance model is sufficiently ac- where P(κ) is the PDF (obtained in the limit ∆κ → 0)
curate for application to observational data. We demon- and P (A) is the probability of statement A. With our
strate the flexibility of our covariance model by adapting pixelised map we can estimate the right-hand side of this
it to predict super sample covariance effects for the 3- to obtain the measurement
dimensional matter PDF paving the way to spectroscopic
galaxy clustering. #{pix with |κ − κi | ≤ ∆κ/2}
P̂(κi ) ≡ . (2)
#{pix}∆κ
2. WEAK LENSING PDF COVARIANCE
Following Bernardeau (2022), we introduce the weight
The weak lensing convergence PDF is a well-suited function
complementary summary statistic that can extract non-
Gaussian information from cosmic shear observations, 1 if κ ∈ [κi − ∆κ/2, κi + ∆κ/2]
thus complementing two-point statistics. In this work, χi (κ) = , (3)
0 else
we focus on mildly nonlinear scales, where weak lens-
ing convergence PDF and its response to fundamental such that the estimator P̂(κi ) is given by
cosmological parameters can be predicted at percent-
level accuracy from large-deviation statistics as shown
P
P χi (κP ) 1 χi (κ(θ))
Z
in Barthelemy et al. (2020a); Boyle et al. (2021). In the P̂(κi ) = ≈ d2 θ , (4)
same work it was shown how one can use shifted log- NP ∆κ Asurvey ∆κ
survey
normal convergence fields generated by FLASK (Xavier
where P labels the NP = Asurvey /Apix pixels of area Apix ,
1 Note that typically only the diagonal elements of estimated θ are angular coordinates on the sky, and κ(θ) is the
covariances are well described with multi-variate Gaussian noise. smoothed, continuous convergence field at that location.
4
7 θ/Θside , is
continuous square patch
discrete, non-overlapping grid
6
continuous full sky
Pd,square (θ̂) = (8)
(
5 2θ̂[π + (θ̂ − 4)
pθ̂] θ̂ ∈ [0, 1[ ,
2 −1
√
4
2θ̂[π − 2 + 4 θ̂2 − 1 − θ̂ − 4sec (θ̂)] θ̂ ∈ [1, 2] .
Pd (r)
3
This distribution is illustrated as black line in Figure 3,
with a maximum at roughly half the patch size where
2 θ̂max ' 0.479 and a linear growth for small distances,
Pd,square (θ̂) ' 2π θ̂.
1
Before looking at PDF covariance matrix examples in
0
Section 2.2 and introducing models for the two-point
0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 PDF in Section 2.3, let us discuss the main universal
r = θ/Θside
effects affecting the PDF covariance: i) the shot-noise
effect related to the finite sampling of cells and ii) the
Fig. 3.— Comparison of the continuous distribution of rela-
tive distances in a square Cartesian survey patch of side length super-sample covariance effect related to a finite survey
Θside = 5deg= 3000 (black) and the discrete version for a coarse area.
grid corresponding to non-overlapping circular cells of radius 7.30
(blue). We also show the distribution of relative distances on the 2.1.1. Finite sampling (shot noise) effects
full sky with maximum Θside = π = 180deg (red).
Previously we discussed the covariance of the PDF in
the continuum limit. In the more realistic discrete case,
we estimate the PDF in a bin centred at κi with width
The second moment of our above estimator is given by ∆i (in our case linearly spaced bins ∆i = ∆), so κ ∈
[κi − ∆i /2, κi + ∆i /2] using a finite sample of NT cells
hP̂(κi )P̂(κj )i that are placed on a regular grid in the survey area. The
histogram-value of the PDF in a bin is obtained from
1 hχi (κ(θ1 ))χj (κ(θ2 ))i
Z Z
≈ 2 d2 θ1 d2 θ2 averaging the continuous PDF P(κ) over that bin
Asurv. (∆κ)2 Z ∆i /2
surv. surv. P(κi + δκi )
1
Z Z Pi = dδκi ' P(κi ) , (9)
≈ 2 2
d θ1 d2 θ2 P(κi , κj ; |θ1 − θ2 |) , (5) −∆i /2 ∆i
Asurv.
surv. surv. and for small bin sizes close to the value of the contin-
where P(κi , κj ; θ) is the joint PDF of two points of the uous PDF at the bin center κ̄i . To compute the co-
smoothed convergence field at distance θ. This equation variance for the discrete case, we can discretise equa-
can be simplified to tion (6) by averaging the continuous joint PDF over
Z the κ-bins and replacing the integration over the dis-
tance distribution by a sum-based average over the set of
hP(κi )P(κj )i = dθ Pd (θ)P(κi , κj ; θ) , (6a) Nd = NT2 = θ∈Θ Nd (θ) distances Θ of all grid cells with
P
multiplicities Nd (θ) that are part of the survey patch
with Pd (θ) indicating the distribution of angular dis-
tances in a given survey area. The covariance of the X Nd (θ)Z ∆i /2 Z ∆j /2 P(κi + δκi , κj + δκj ; θ)
PDF estimator can then be computed from this moment hPi Pj i = dδκi dδκj
Nd −∆i /2 −∆j /2 ∆i ∆ j
via θ∈Θ
X Nd (θ)
cov(P(κi ), P(κj )) = hP̂(κi )P̂(κj )i − P̄(κi )P̄(κj ) , (6b) = Pij (θ) , (10)
Nd
θ∈Θ
where P̄ ≡ hP̂i ≈ P . We see that covariance predictions
rely on two key ingredients: the distance distribution of where for small enough bins Pij (θ) ' P(κi , κj ; θ). In
angular separations and the two-point PDF of the lensing Figure 3 we show a comparison between the continuous
convergence at those separations, which will be discussed distribution of distances in a square patch and the dis-
in more detail in Section 2.3. crete case for a coarser grid of non-overlapping cells. We
Distance distributions. The relevant distance distribu- now split this sum over cell separations showing that the
tions for weak lensing describe a two-dimensional field. contribution from zero separation leads to a finite sam-
For the full sky, the uniform distribution of angular sep- pling or shot-noise term, and the remainder resembles the
arations (measured in radians) reads super-sample covariance contribution that dominates for
1 the case of well-separated cells. Since the one-point PDF
Pd,full sky (θ) =
sin θ , (7) can be obtainedRfrom the two-point PDF by marginali-
2 sation, P(κi ) = dκj P(κi , κj ; θ), for zero separation we
with a maximum at θmax = π/2. This distribution when require that P(κi , κj ; θ = 0) = P(κi )δD (κi − κj ). In the
written in terms of distances relative to the maximal dis- discrete case the joint two-point PDF at zero separation
tance Θside = π is illustrated as red line in Figure 3. is given byP Pij (θ = 0) = P̄i δij /∆i , required by the rela-
For a square survey patch, the distribution of angular tion P̄i = j ∆j Pij (θ = 0). The zero separation mul-
distances relative to the size of the survey patch, θ̂ = tiplicity equals the cell number Nd (θ = 0) = NT , such
5
that the zero-separation term in equation (10) produces cells are at sufficiently wide separations θ & 2θs , the co-
a finite sampling (or shot-noise) covariance contribution2 variance is obtained from integrating the large-separation
limit of the two-point PDF (12) to obtain a super-sample
NT P̄i covariance (SSC) covariance term
covFS (Pi , Pj ) = Pij (θ = 0) = δij . (11)
Nd ∆ i NT
covSSC (P(κi ), P(κj )) = ξ¯ (b1 P)(κi )(b1 P)(κj ) , (13a)
Unsurprisingly, this finite sampling term is inversely pro-
portional to the number of cells. If the cells are placed where we defined the mean correlation
at a fixed separation, an increase of the area corresponds Z θmax
to a suppression of the shot-noise covariance term as ξ¯ = dθ Pd (θ)ξ(θ) , (13b)
2
σFS (P) ∝ NT−1 ∝ Θ−2 −1
side ∝ Asurvey , which is inversely
proportional to the area.
While in principle the finite sampling term can be ren- with Pd (θ) being the probability distribution of angular
dered arbitrarily small by using an increasing number distances and θmax being the maximum distance set by
of cells, this comes at the price of heavy cell overlaps the size of the survey patch, respectively. This mean
at small separations. In this regime, a small-scale ex- correlation agrees with the variance of the mean of κ
1/2 p
2
measured across different patches, so ξ¯ = σκ̄2 .
pansion with small ∆ξ (θ) = σ − ξ(θ) ξ(θ) 1 For a square patch of side length√Θside , the maxi-
can be used to capture effects of heavy cell overlaps as mal distance in the patch is θmax ' 2Θside where the
discussed in Bernardeau (2022). In the closed-form two-
normalised distance distribution Pd (θ̂) is given in Equa-
point PDF models discussed in Section 2.3 those overlap
tion (8), by conservation of probability we have
effects will be automatically included.
2.1.2. Super-sample covariance effects dθ̂ θ 1 θ̂1 −2
Pd (θ) = Pd (θ̂) = Pd θ̂ = ∝ Θside .
Beyond assuming specific (log-)normal models for the dθ Θside Θside
bivariate PDF, one can obtain accurate estimates of the (14)
two-point PDF in the large separation regime illustrated The main contribution in the mean correlation integral
in Figure 4. A large-scale expansion with small corre- comes from small angles θ, where the correlation func-
lation ξ(θ) ∆ξ (θ) 1 is suitable for well-separated tion is largest and Pd (θ̂) ∝ θ̂. Hence, the SSC covari-
cells where the joint PDF at leading order can be ap- ance term from equation (13) scales as covSSC (Pi , Pj ) ∝
proximated as Codis et al. (2016b) Θ−2 −1
side ∝ Asurvey , which is inversely proportional to the
P(κi , κj ; θ & 2θs ) ' P(κi )P(κj )[1 + ξ(θ)b1 (κi )b1 (κj )] , survey area.
(12) In the discrete case, the covariance can be expressed
as a sum of the previously discussed finite sampling
with some bias function b1 , which describes how the two- term (11) and a super-sample covariance like term
point correlation at a given separation is modulated by
the local weak-lensing convergences. This functional has cov(Pi , Pj ) = hPj Pj i − P̄i P̄j (15a)
been found to be very robust and the previous result = covFS (Pi , Pj ) (15b)
extends what is expected in cosmological density fields
(Bernardeau 1996). When cells are at sufficiently large 1 X h Nd i
+ Pij (θ) − P̄i P̄j .
separations and the initial conditions are Gaussian, this Nd NT (NT − 1)
θ∈Θ\{0}
modulation is independent of separation. We will intro-
| {z }
'1
duce a discretised estimator in equation (16c) and derive
the functional form of those bias functions for specific If cells at separation θ were completely independent, such
forms of the two-point PDF discussed in Section 2.3 by that Pij (θ) = P̄i P̄j , then only the shot-noise term from
an expansion in the correlation ξ(θ). For the related the finite sampling of cells would remain. The second
case of densities in spheres and cylinders, this bias func- term in equation (15) includes significant contributions
tion can be accurately predicted by spherical or cylindri- from cells at wide separation θ & 2θs , where we can use
cal collapse, respectively. The large-deviation statistics the large-separation PDF expansion (12). This leads to a
techniques used to predict the one-point weak lensing super-sample covariance term analogous to equation (13)
PDF (Barthelemy et al. 2020a; Boyle et al. 2021) based
on cylindrical collapse for thin redshift slices could be ¯ 1 P)i (b1 P)j ,
covSSC (Pi , Pj ) = ξ(b (16a)
extended to derive the functional form of this bias, al-
though this is beyond the scope of this work. This large- where (b1 P)i are obtained from a bin-average of the in-
separation expansion predicts a leading order super- volved functions resembling equation (9) and the mean
sample covariance term and further sub-leading contri- correlation can be obtained as
butions that we will describe in Section 3. When all X Nd (θ)
ξ¯ = ξ(θ) . (16b)
2 This shot-noise term was already described in (Codis et al. Nd
2016b). Even if there was no spatial correlation in the sample, in θ∈Θ\{0}
the Poisson limit the covariance of finding Ni cells in convergence
¯ the value
Using the relationship κ0j |κi κ0 = b1 (κi )/ξ,
bin covSN (Ni , Nj ) = N̄i δij , where N̄i is the mean number of cells
expected to be in the convergence bin centred on κ̄i . The number j
of cells, Ni in a given convergence bin of width ∆i centered on κi is of the bias function in the bin κi can be estimated by
related to the histogram-value of the binned PDF from equation (9) comparing the mean of the convergence at separation
such that Ni = NT ∆i Pi and similarly N̄i = NT ∆i P̄i . θ = 2θs , κ0j , from this given convergence κi to the average
6
σ 2 (P (κ))
O κ − κ̄ FLASK
()
κ PDF correlation, (5deg)2 patches, NO κ PDF correlation, (5deg)2 patches, O κ − κ̄ PDF correlation, (5deg)2 patches, O
1.00
FLASK FLASK FLASK
-0.02
0.75
0.50
-0.01
0.25
0.0
κ2
0.00
−0.25
0.01
−0.50
0.02 −0.75
Gauss Gauss Gauss
−1.00
-0.02 -0.01 0.0 0.01 0.02 -0.02 -0.01 0.0 0.01 0.02 -0.02 -0.01 0.0 0.01 0.02
κ1 κ1 κ1
Fig. 6.— Correlation matrix between the bins of the weak lensing convergence κ PDF measured at a smoothing scale ' 7.30 from (5
deg)2 patches of a full sky FLASK map of κ including the nonzero mean in the patches (left and middle panel) with non-overlapping (NO)
cells (left) and overlapping cells (middle) and κ − κ̄ after subtracting the mean with overlapping cells (right panel). The upper triangles
labelled by FLASK show results from shifted-lognormal realisations, while the lower triangles those of Gaussian realisations. The diagonal
of the covariance matrix is shown in Figure 5.
• mean-subtracted κ̃ = κ − κ̄ from overlapping cells and a correlation structure, for example specified by an
(purple lines in Figure 5, right panel in Figure 6) . expansion for large separations or heavy overlaps.
We see that while the lognormal maps have a visible 2.3.1. Bivariate Gaussian
asymmetry in the covariance diagonal compared to the A Gaussian joint PDF is parametrised by the one-point
Gaussian maps, both lead to very similar correlation ma- PDF variance σ 2 and the two-point correlation between
trices. Comparing the red and blue lines shows that given cells ξ = ξ12 (θ) such that
a fixed total area, increasing the number of cells for the
PDF measurement decreases the finite sampling (or shot PG (κ1 , κ2 ; θ) = PG (κ1 , κ2 ; σ 2 , ξ12 (θ)) (19)
noise term) that determines the covariance matrix diag- h 2
ξ12 (θ)
i
onal, as we explained in Section 2.1.1. The suppression exp − σ2d [κ1 2 + κ2 2 ] + d κ1 κ2
of the diagonal for the case of an increased number of = √ ,
cells comes at the price of making cells overlap which 2π d
creates a strong positive correlation band around the di- (20)
agonal, as can be seen from comparing the left and mid- with the determinant d = σ − ξ12 = σ [1 − 4 2 4
dle panel in Figure 6. Comparing the blue and purple (ξ12 /σ 2 )2 ]. One can perform a large-separation expan-
lines in Figure 5 illustrates that subtracting the mean
in small patches significantly reduces the impact of the sion for ξ12 /σ 2 1 valid in the regime of no overlaps as
super-sample covariance term discussed in Section 2.1.2 shown in Figure 4. At leading order, one can approxi-
and causes the 4-tiled pattern in the middle panel of Fig- mate d ' σ 4 and expand the remaining ξ12 (θ) term in
ure 6 to transform to a 9-tiled pattern in the right panel. the exponential to obtain
Additionally, the removal of this dominant term causes PG (κ1 , κ2 ; θ) κ1 κ2
the overlap contribution in the band along the diago- = 1 + ξ12 (θ) 2 2 + O(ξ 2 ) , (21)
PG (κ1 )PG (κ2 ) σ σ
nal to become more apparent. As we will see later, the
mean-subtracted convergence in small patches provides a which is a meaningful expansion in the regime κ . O(σ)
proxy for the convergence on the full sky, as evident from and reproduces the linear ‘Kaiser’ bias function (Kaiser
Figure 9. While most of our discussion is focused on the 1984; Codis et al. 2016a; Uhlemann et al. 2017)
underlying weak lensing convergence field, we discuss the
inclusion of shape noise arising from cosmic shear mea- b1,G (κ) = κ/σ 2 . (22)
surements in Section 2.3.4. We will discuss an expansion up to higher orders in
ξ(θ)/σ 2 in the context of the large-separation expansion
2.3. Specific models for the joint two-point PDF in Section 3.1.2.
As evident from equation (6), the PDF covariance can The impact of mean-subtraction. For small patches,
be computed directly if the two-point PDF P(κi , κj ; θ) the mean convergence κs will significantly fluctuate from
is known a priori for all separations. In the following patch to patch with a sizeable average correlation ξ¯ = σκ2 s
we will discuss three possible cases relying on closed- causing the super-sample covariance. To better emulate
form expressions for that joint PDF, a bivariate Gaussian measurements of modern surveys on a significant frac-
(normal) PDF, a bivariate shifted lognormal PDF and tion of the full sky, subtracting this mean convergence
a bivariate PDF built from a marginal one-point PDF in each patch in the spirit of the peak-background split
8
(Bardeen et al. 1986; Mo & White 1996; Sheth & Tor- zero-crossings each, see equation (56)].
men 1999) can be desirable. The two-point PDF of the
mean-subtracted lensing convergence κ̃i = κi − κ̄ is ob- 2.3.2. Bivariate shifted lognormal
tained from a trivariate Gaussian PDF for κ1 , κ2 and A shifted lognormal joint PDF for the lensing conver-
the background value in the patch κs = κ̄, which is then gence κ is parametrised by the variance σκ2 and two-point
integrated out correlation ξκ = ξ12,κ (θ) along with a shift parameter s
Z related to the skewness, which we will keep fixed through-
PG (κ̃1 , κ̃2 ) = dκs PG (κ1 = κ̃1 + κs , κ2 = κ̃2 + κs , κs ) . out here. We can define a Gaussian-distributed variable
g(κ) with zero-mean
(23a)
The covariance matrix is of the form κ κ σ2
g(κ) = ln + 1 − µG = ln +1 + G , (25)
s s 2
σ 2 ξ12 ξ1s
Σκ1 ,κ2 ,κs = ξ12 σ 2 ξ2s , (23b) where µG = −σG 2
/2 is fixed to ensure hκi
ξ1s ξ2s σs2 2
= 0 and the
Gaussian variance is σG = ln 1 + σκ2 /s2 . This can be
used to construct the shifted lognormal univariate PDF
where we assume ξ¯ = σs2 = ξ1s = ξ2s , which holds for as
large enough patches compared to the cell size. The in-
−g(κ)2
tegration yields a bivariate Gaussian PDF for the mean- 2 Θ(κ + s)
PLN (κ; σG , s) = √ exp , (26)
subtracted convergence P(κ̃1 , κ̃2 ) with reduced variance 2πσG (κ + s) 2σG2
σκ̃2 = σ 2 − ξ¯ and reduced correlation ξκ̃ (θ) = ξ(θ) − ξ, ¯
where the reduction is set by the variance at the patch where Θ indicates the Heaviside step function, such that
scale, ξ¯ = σs2 . The mean-subtracted correlation ξ − ξ¯ the PDF is zero if κ ≤ −s. The shift parameter, s, can
now leads to a vanishing leading order super-sample co- be set to replicate the desired skewness (see the formula
variance term from equation (24) and creates a new ef- for λ in Xavier et al. 2016)
fective leading order term that we will discuss in Sec-
tion 3.2.1. Obviously this above described formalism has σκ σ4
1 + y(µ̃3 )−1 + y(µ̃3 ) − hκi ≈ 3 κ3 , (27)
s=
a long history which goes back to early papers on con- µ̃3 hκ i
strained Gaussian random fields e.g (Bertschinger 1987) s p
2
3 2 + µ̃ + µ̃3 4 + µ̃23
and (Hoffman & Ribak 1991) 3
y(µ̃3 ) = ≈ 1,
Covariance. When obtaining the covariance according 2
to equation (6) from the large-separation expansion of
the Gaussian (21), the leading 1 disappears and the cor- where hκi is the desired mean, σ the target variance and
relation ξ12 (θ) is integrated over the distance distribution µ̃3 = hκ3 i/σκ3 the skewness. The shift parameter is typ-
to yield ξ¯ which sets the amplitude of a super-sample co- ically positive and for our fiducial weak lensing conver-
variance like term we will discuss further in Section 2.1.2 gence case we use s = 0.1145 throughout the main text
as done in Boyle et al. (2021). In the executive summary,
¯ 1 P)G (κi )(b1 P)G (κj ) + O(ξ 2 ) .
covG (P(κi ), P(κj )) = ξ(b we use a different shift parameter because the Takahashi
(24) et al. (2017) simulations have a different cosmology. The
In Figure 7 we compare the correlation matrices as Gaussian limit can be reproduced by sending s/σκ → ∞.
measured (upper triangle) and predicted (lower trian- The bivariate shifted lognormal distribution also de-
gle) for Gaussian fields of the three cases discussed pends on the correlation ξG = ln 1 + ξκ /s2 and the
before (generated with FLASK, with the same initial determinant d = σG 4
− ξG2
power spectrum and smoothing as before), from non-
overlapping cells with raw κ (left) along with overlapping 2 Θ(κ1 + s)Θ(κ2 + s)
cells of raw κ (middle) and mean-subtracted κ̃ = κ − κ̄ PLN (κ1 , κ2 ; σG √
, ξG , s) = (28)
2π d(κ1 + s)(κ2 + s)
(right panel). In the non-overlapping case on the left-
σ2
hand side, we can see how the finite sampling term (11) ξG
× exp − G [g(κ1 )2 + g(κ2 )2 ] + g(κ1 )g(κ2 ) .
causes the diagonal to be most pronounced with a sub- 2d d
dominant 4-tile pattern showing the leading order super-
sample covariance like term from equation (24). When The joint shifted lognormal PDF can be expanded at
2
making cells overlap as shown in the middle panel, the large separations where ξG (ξκ )/σG 1 to obtain up to
finite sampling term becomes irrelevant and the 4-tile leading order in ξκ
pattern [due to the factorisation of that covariance into a PLN (κ1 , κ2 ; θ) ξκ (θ) g(κ1 ) g(κ2 )
product of two independent bias functions with one zero- =1+ 2 2 2 + O(ξκ2 ) .
crossing each] as seen in equation (24)] becomes more PLN (κ1 )PLN (κ2 ) s σG σG
pronounced, while the resulting cell overlap causes a sig- (29)
nificant increase of the correlation in nearby bins as can
be seen in a band along the diagonal. Considering the In particular, it is useful to define a leading order bias
mean-subtracted convergence shown in the right panel function that resembles the Gaussian expression b1,G =
will remove the leading order, super-sample covariance κ/σ 2
term from equation (24) and instead lead to a 9-tile pat- g(κ) sg(κ)
tern [due to the factorisation of that covariance contri- b1,LN (κ) = 2 ' , (30)
bution into a products of two bias functions with two sσG σκ2
9
0.50
-0.01
0.25
0.0
κ2
0.00
−0.25
0.01
−0.50
0.02 −0.75
Gauss theory Gauss theory Gauss theory
−1.00
-0.02 -0.01 0.0 0.01 0.02 -0.02 -0.01 0.0 0.01 0.02 -0.02 -0.01 0.0 0.01 0.02
κ1 κ1 κ1
Fig. 7.— Correlation matrix between the bins of the Gaussian weak lensing convergence κ PDF at a smoothing scale ' 7.30 measured
from 1200 (5 deg)2 patches for each of 10 full sky Gaussian maps (upper triangle) and the theory prediction (lower triangle) for κ including
the nonzero mean in the patches (left and middle panel) with non-overlapping (NO) cells (left) and overlapping cells (middle) and κ − κ̄
after subtracting the mean with overlapping cells (right panel). The predictions for the diagonal of the covariance matrices are shown in
Figure 5.
mal variable X. The relevant relations are hY (eX−1)i = κ PDF correlation, (5deg)2 patches, O κ PDF correlation, full sky, O
1.00
FLASK FLASK
−2
ξXY and hY 2 (eX − 1)i = ξXY 2
σX2
+ 1 − σX which -0.02
0.75
X 2
yields hY (e − 1) i = 0. This can be used to obtain -0.01
0.50
this into equations (33) and (34) leads to a relation be- 0.0
κ2
0.00
σ 2 (P(κ))
theory lognormal full sky
seen in Figure 6 while the diagonal is modified due to
the non-Gaussian shape of the final convergence PDF as 0.010
seen in Figure 5.
In Figure 9 we compare the measurements from
FLASK maps to the theoretical predictions for shifted 0.005
ables obtained
Rκ from the cumulative distribution function diagonal covariance matrix PDF shape noise only
C(κ) = dκ̃ P(κ̃), such that theory
8 measurement
P(κ1 , κ2 ; θ) = P(κ1 )P(κ2 )c12 (C(κ1 ), C(κ2 ); θ) , (37)
where θ indicates angular separation as before. If the
6
κi and κ2 are independent, then the copula density is
σ 2 (P(κSNonly ))
unity c12,indep = 1. A Gaussian copula only depends
on the distance through the pairwise correlation ξ12 (θ), 4
c12,G (C(κ1 ), C(κ2 ); ξ12 (θ)) and is a known function which
can be computed numerically although it has no ana-
lytical expression. Beyond those two cases, systemati- 2
cally constructing the dependence structure is possible
in the large-separation regime, where the copula can be
expanded in powers of the pairwise correlation ξ12 (θ) as 0
we will further discuss in Section 3. −4 −2 0 2 4
κSNonly /σSN
2.3.4. Inclusion of shape noise
Observationally, the weak lensing convergence map is correlation matrix PDF shape noise only
1.00
obtained from cosmic shear measurements, which are measurement
−4
subject to shape noise often constituting the dominant 0.75
source of noise. Since galaxies are intrinsically elliptical, −3
the observed shear contains a contribution from this in- 0.50
trinsic signal, quantified by the variance of the intrinsic −2
ellipticity σ . In simulated maps, shape noise can be in- −1 0.25
cluded by adding a white noise – modeled by a Gaussian κSNonly /σSN
random map κSN – to the ‘raw’ simulated convergence 0 0.00
map κ before smoothing. The standard deviation of this
Gaussian is typically assumed to be 1 −0.25
σ 2
σSN = p ∝ θs−1 , (38) −0.50
n g · Ωθ s 3
−0.75
with the shape-noise parameter σ (here taken to be 4
theory
σ = 0.30), the galaxy density ng (here taken to be 30 −1.00
arcmin−2 ) and the solid angle Ωθs (here taken to corre- −4 −3 −2 −1 0 1 2 3 4
spond to circular apertures of radius θs = 7.30 ). κSNonly /σSN
Smoothing induced shape noise correlations and covari-
ance. At the raw κ map level, the shape noise values Fig. 10.— PDF covariance diagonal (upper panel) and correlation
matrix (lower panel) for the pure shape-noise lensing convergence
in different pixels are uncorrelated and ξ ≡ 0. Top- in circular apertures of radius θs comparing the prediction from
hat smoothing the field at angular size θs will correlate equation (40) to the measurement from shape noise realisations.
shape noise values in overlapping cells (being at separa- The slight apparent asymmetry between positive and negative κ is
tion θ < 2θs ). Their correlation is related to the area of caused by the noise due to the finite number of realisations.
the cell overlap (which is a symmetric lens)
2 θ θp 2
A(θ < 2θs ) = 2θs arccos − 4θs − θ2 . (39a)
2θs 2
The resulting correlation of the top-hat smoothed shot for small distances Pd (θ) ≈ 2πθ. This gives the PDF
noise contribution to the weak lensing convergence is covariance for the case of pure shape noise as
2
ξSN (θ < 2θs ) = σSN A(θ)/(πθs2 ) , ξSN (θ ≥ 2θs ) = 0 . Z 2θs
(39b) covSN (P(κi ), P(κj )) = 2
dθ Pd (θ)PG (κi , κj ; σSN , ξSN (θ))
To obtain the covariance of the shot noise contribution 0
alone, we integrate the bivariate Gaussian with variance Z 2θs
2 2 2
σSN and covariance (39) over distances following equa- − PG (κi ; σSN )PG (κj ; σSN ) dθ Pd (θ) .
tion (6) to get 0
(40)
Z 2θs
2
hPSN (κi )PSN (κj )i = dθ Pd (θ)PG (κi , κj ; σSN , ξSN (θ)) In Figure 10 we compare this theoretical result to a mea-
0
Z ∞ surement across 1000 shape noise realisations for square
2 2 maps of length 5deg with a top-hat smoothing of 7.30
+ PG (κi ; σSN )PG (κj ; σSN ) dθ Pd (θ) , finding excellent agreement.
2θs
Shape noise induced convergence PDF convolution.
where Pd (θ) indicates the distribution of angular dis- Shape noise impacts the one-point weak lensing conver-
tances in the survey area, which is approximately linear gence PDF as if it was convolved with a zero-centred
12
0.005
case, illustrated by results for normal and shifted lognor-
0.004
mal fields. This will naturally lead to an expansion of the
0.003 covariance matrix that resembles an eigendecomposition,
which we will discuss in Section 4.1.
0.002
where ϕ0,G = λ2 σ 2 /2, ϕ1,G (λ) = λ and ϕn≥2,G (λ) = 0. averages 2-point cell correlation vs. variance powers
By expanding the last term exp[ξλ1 λ2 ] in a power series ξ n /σ 2n 5deg rescaled
and converting λi to derivatives w.r.t. κi , we obtain ¯ n /(σ 2 − ξ)
¯ n 5deg rescaled
(ξ − ξ)
∞ 10−5
ξ n /σ 2n full sky
PG (κ1 , κ2 , θ) X ξ n (θ)
=1+ bn,G (κ1 )bn,G (κ2 ) (47)
PG (κ1 )PG (κ2 ) n=1
n!
(−1)n ∂ n PG (κ) 1 κ
bn,G (κ) = = He n ,
PG (κ) ∂κn σn σ
(48)
10−6
which can be written in terms of the probabilist’s Her-
mite polynomials Hen
2 2
x d x
Hen (x) = (−1)n exp exp − , (49)
2 dxn 2 2 4 6 8 10
n
when using x = κ/σ. The first two terms read (Des-
jacques et al. 2018, see also) Fig. 12.— Scaling of the average of powers of the two-point cell
correlation function compared to the variance for the raw κ maps
κ κ2 − σ 2 1 in 5deg patches (red plusses) and the full sky result rescaled by a
b1,G (κ) = 2
, b2,G (κ) = = b21,G − 2 , (50) factor of Apatch /Afullsky (blue plusses). Aside from the removal of
σ σ4 σ the first order term, those hierarchies closely resemble each other
as well as the mean-subtracted κ̃ maps in patches (red crosses).
which can be also obtained from a direct expansion of the
joint PDF (19). This leads to the following expansion for
the covariance following equation (6)
∞
X ξ n ∂ n PG (κ1 ) ∂ n PG (κ2 )
cov(PG (κ1 ), PG (κ2 )) =
n=1
n! ∂κ1 n ∂κ2 n
(51)
∞ hierarchical ordering of ξ n /σ 2n will reflect in the eigen-
X ξn κ
1 2
κ
values and the bias functions multiplied by the Gaussian
= 2n
Hen PG (κ1 )Hen PG (κ2 ) ,
n=1
n!σ σ σ PDF evaluated at a given κ-value will yield the eigen-
Z X Nd (θ) vectors (after orthogonalisation and appropriate normal-
ξ := dθPd (θ)ξ n (θ) '
n ξ n (θ) , (52) isation). In this context, the predicted zero-crossings of
Nd the first bias function b1,G (κ) at κ = 0 and of the sec-
θ∈Θ\{0}
ond b2,G (κ) at κ = σ 2 will translate into the approximate
where in the second line we have defined the average of zero crossings of the first two eigenvectors as we will show
powers of the two-point correlation function over the sur- later in Figure 15.
vey patch in analogy to the mean from equation (13b).
The averages of powers of the correlation function ξ n 3.1.3. Beyond leading order: general non-Gaussian case
compared to powers of the variance σ 2n control the large-
separation expansion of the covariance and their hierar- Higher-order terms are obtained from expanding the
chy is illustrated in Figure 12. This plot also shows that joint two-point cumulant generating function in powers
the covariance contributions scale inverse proportional to of the two-point correlation ξ = ξ12 (θ) as follows
the survey area ξ n ∝ A−1 .
Due to the hierarchical ordering of averages of the cor- ϕ(λ1 , λ2 ) = ϕ0 (λ1 ) + ϕ0 (λ2 ) + ξϕ1 (λ1 )ϕ1 (λ2 ) (54)
relation function we expect the two leading order terms ξ2
in the regime κ . O(σ) to be + [ϕ2 (λ1 )ϕ2 (λ2 ) + ϕ2 (λ1 )ϕ21 (λ2 )] + O(ξ 3 )
2 1
cov(PG (κ1 ), PG (κ2 )) ξ¯ κ1 κ2 We are then in position to compute the (connected part)
= 2 (53a)
PG (κ1 )PG (κ2 ) σ σ σ of the joint PDF,
ξ 2 κ21 − σ 2 κ22 − σ 2 Z
dλ1
Z
dλ2
+ (53b) P(κ1 , κ2 ) − P(κ1 )P(κ2 ) =
2σ 4 σ !
2 σ2 2πi 2πi
ξ3 exp [−λ1 κ1 − λ2 κ2 + ϕ(λ1 , −λ1 ) + ϕ(λ2 , −λ2 )] ×
+O ,
σ6 ξ2
ξϕ1 (λ1 )ϕ1 (λ2 ) + ϕ21 (λ1 )ϕ21 (λ2 ) (55a)
which predicts a super-sample covariance leading order 2
term similar to the separate universe picture (81), being ξ2
proportional to the variance of the mean κ in different + [ϕ21 (λ1 )ϕ2 (λ2 ) + ϕ2 (λ1 )ϕ21 (λ2 )] + O(ξ 3 ) (55b)
2
patches, ξ¯ = σ 2 (κs ), and the derivative of the PDF. This
large-separation expansion of the covariance will be con- We obtain the covariance according to equation (6) af-
nected to an eigendecomposition in Section 4.1. Here, the ter integrating over λi and the distance distribution and
14
symmetrising
cov(P(κ1 ), P(κ2 )) 0.004 σ 2 =10x10-4
¯ 1 (κ1 )b1 (κ2 )
= ξb (56a)
P(κ1 )P(κ2 ) σ 2 =5x10-4
0.003
b2,LN (κ)σ 4 +σ 2
ξ2 σ 2 =2x10-4
+ [(b2 + q1 )(κ1 )(b2 + q1 )(κ2 ) − q1 (κ1 )q1 (κ2 )] (56b)
2 s2 ln(1+κ/s)2
0.002
+ O(ξ 3 ) , κ2
where we defined 0.001
dλ
Z
(b1 P)(κ) = ϕ1 (λ) exp[−λκ + ϕ(λ)] (56c)
2πi 0.000
dλ
Z
-0.04 -0.02 0.00 0.02 0.04
(b2 P)(κ) = ϕ1 (λ)2 exp[−λκ + ϕ(λ)] (56d) κ
2πi
dλ Fig. 13.— A comparison of a rescaled version of the second order
Z
(q1 P)(κ) = ϕ2 (λ) exp[−λκ + ϕ(λ)] . (56e) bias functions from the shifted lognormal model used here b2,LN
2πi (solid) along with (b2 + q1 )LN (dot-dashed) from equations (58)
(colours for different variances) and the limiting behaviour (59)
For a saddle-point approximation of the integral, we (black solid) in comparison to the Gaussian result (black dashed).
would typically expect b2 ' b21 , although that does not We also show lines of constant σ 2 (dotted) to indicate the zero-
reproduce the Gaussian or shifted lognormal result. The crossings of b2,LN (κ) which appear in the second eigenvectors for
expression (56) consists of dyadic products that resem- the κ PDF covariance in the lower panel of Figure 16.
ble the structure of an eigendecomposition, which we will
look at in Section 4.1, although eigenvectors are addition-
ally pairwise orthogonal.
where we used the result for b1,LN from equation (30).
3.1.4. Beyond leading order: shifted lognormal case We see that in the limit of s → ∞ we recover sg(κ) → κ
The recipe described in the previous paragraph for gen- and the Gaussian result with q1,G = 0, while for mildly
eral non-Gaussian distributions is not valid for a lognor- non-Gaussian fields, we typically have q1 b2 . In a
mal PDF, which is not well-described in terms of a CGF. small variance expansion where σ/s 1, we obtain the
Here we detail an approach that can be used to obtain limiting behaviour
lognormal bias functions by applying a large-separation 2
expansion to the shifted lognormal two-point PDF (28).
σ
s 1
s2 ln 1 + κs − σ 2 1
b2,LN (κ) −→ 4
− 2 (59)
This bias expansion is similar in spirit to the Gaussian σ 4s
case, but requires an additional expansion of the Gaus- Using the all-order results for bn,G (48) together
sian field correlation ξG in terms of the underlying κ n with
P (−1)n−1 ξκ
correlation ξκ . The joint shifted lognormal PDF can be the expansion of ξG (ξκ ) = n=1 n s2 allows
2
expanded at large separations where ξG (ξκ )/σG 1 to to compute all order expressions for the lognormal bias
obtain up to next-to-leading order in ξκ functions. Note that due to the difference between ξG
PLN (κ1 , κ2 ; ξκ ) g(κ1 ) g(κ2 ) and ξκ , the higher-order terms receive contributions from
= 1 + ξG (ξκ ) 2 2 (57a) lower-order bias functions, as predicted by the CGF-
PLN (κ1 )PLN (κ2 ) σG σG based expansion from before. In Figure 13 we show a
(ξκ ) g(κ1 )2 − σG2
g(κ2 )2 − σG
2
2 comparison of the lognormal terms b2 (solid) and b2 + q1
ξG
+ 4 4 (dashed) to the Gaussian result for the shift parameter
2 σG σG
adopted in this work and different variances. We can
3
+ O(ξG ) see that for our case (closely corresponding to the green
ξκ ξκ2 g(κ1 ) g(κ2 )
line), the q1 contribution is negligible. The crossing be-
=1+ − 2 2 tween the solid b2 line and the dotted line of constant σ 2
s2 2s4 σG σG
indicates the zero-crossing of b2 (κ), which is reflected in
(57b) the second eigenvector for the κ covariance shown in the
2 2 2 2
ξκ2 g(κ1 ) − σG g(κ2 ) − σG lower panel of Figure 16. For the case of a survey area
+ 4 4 being a significant portion of the sky, the mean correla-
2s4 σG σG
tion ξ¯ will be extremely small and this second order bias
+ O(ξκ3 ) , function b2 (κ) will determine the covariance structure.
In particular, it is useful to define two additional bias The two zero-crossings of b2 cause the 9-tiled correlation
functions q1,LN and b2,LN following the previous expan- matrix structure displayed earlier, notably in the upper
sion (56), with their combination resembling the Gaus- right panel of Figure 9.
sian expression from equation (50)
3.2. Covariance expansion for mean-subtracted
g(κ)2 − σG
2
[sg(κ)]2 − σκ2 κ̃ = κ − κs
(b2 + q1 )LN (κ) = 4 ' (58a)
s2 σG σκ4 To determine the PDF covariance for the case of a
1 g(κ) g(κ) mean-subtracted κ field, one needs to consider the 3 vari-
q1,LN (κ) = b1,LN (κ) = 2 2 ' 2 , (58b) ables κ1 , κ2 and κs , where κ1/2 are as before and κs is
s s σG σκ the mean convergence in the survey patch. Like the con-
15
vergence field, those variables all have zero expectation to powers of the variances σ 2n and (σ 2 − ξ) ¯ n control
value and we are then interested in the joint PDF of the the large-separation expansion of the covariance (in the
mean-subtracted κ̃i Gaussian case given in equation (51) for raw κ and in
equation (65) for the mean-subtracted κ̃ = κ − κ̄) follow
κ̃1 = κ1 − κs , κ̃2 = κ2 − κs . (60) similar hierarchies as illustrated in Figure 12.
Let us note that this tilde variable is formally equivalent In particular, the leading order term will be
to the slope variable that was introduced in Bernardeau ¯ 2 κ̃2 2
cov(PG (κ̃1 ), PG (κ̃2 )) (ξ − ξ)
et al. (2014, 2015). The CGF for the three variables de- 1 κ̃2
= −1 −1
fined above is written here up to linear order in the two- PG (κ̃1 )PG (κ̃2 ) 2σκ̃4 σ2 σ2
point function at the survey scale, ξs , and up to quadratic | κ̃ {z κ̃ }
4b
σκ̃ 2,G (κ̃1 )b2,G (κ̃2 )
order in the two-point correlation at the given cell sepa- !
ration ξ12 (ξ − ξ)¯3
+O , (66)
1 2 2 σκ̃6
ϕ(λ1 , λ2 , λs ) = σs λs + ϕ0 (λ1 ) + ϕ0 (λ2 ) + ξ12 ϕ1 (λ1 )ϕ1 (λ2 )
2
+ λs [ξ1s ϕ1 (λ1 ) + ξ2s ϕ1 (λ2 )] (61) which predicts a first eigenvalue/-vector similar to the
2
second eigenvalue/-vector of the Gaussian case without
ξ12 2 2 3 ¯ 2 /σ 4 ' ξ 2 /σ 4 as
+ [ϕ (λ1 )ϕ2 (λ2 ) + ϕ2 (λ1 )ϕ1 (λ2 )] + O(ξ12 ) mean-subtraction (53b) because (ξ − ξ) κ̃
2 1 shown in Figure 12. The zero-crossings of the first eigen-
1 2 ¯ 2
vector will be at κ̃ ≈ σ − ξ ≈ σ , closely following the
ϕ(λ1 , λs ) = ξs λ2s + ϕ0 (λ1 ) + λs ξ1s ϕ1 (λ1 ) , (62)
2 result for full sky maps where ξ¯ is negligibly small and
where we assumed that the patches are big enough such eigenvectors are shown as green/olive lines in the middle
that the κs PDF is Gaussian. The one- and two-point panel of Figure 16, as discussed in Section 4.1. Note that
PDFs are obtained from this CGF by performing the as before the joint two-point PDF of the mean-subtracted
change of variable (60) and then marginalising over the κ̃ values could be written in terms of the marginal Gaus-
background κs , leading to sian PDFs and a series of Hermite polynomials as in
equation (48) with prefactors set by averages of powers
dλ1 −λ1 κ̃1 +ϕ(λ1 ,−λ1 ) ¯ 2.
Z
of (ξ − ξ)/σ κ̃
P(κ̃1 ) = e (63)
2πi
ZZ
dλ1 dλ2 −λ1 κ̃1 −λ2 κ̃2 +ϕ(λ1 ,λ2 ,−λ1 −λ2 ) 3.2.2. Leading order: general non-Gaussian case
P(κ̃1 , κ̃2 ) = e . Similarly as for the Gaussian case, we will rewrite the
2πi 2πi
(64) joint CGF of the three variables λ1 , λ2 , λs in terms of
the joint CGF for just two variables λi and λs , where
When all computations are made at linear order in ξ, λs = −(λ1 + λ2 ). We will first discuss the leading order
we can set σs2 = ξ1s = ξ2s = ξ12 = ξ(θ). To obtain result which reads
the second order, one needs to start from ξ¯ = σs2 =
ξ1s = ξ2s 6= ξ12 = ξ(θ), which we will use to illustrate ϕ(λ1 , λ2 , −λ1 − λ2 ) − ϕ(λ1 , −λ1 ) − ϕ(λ2 , −λ2 ) (67)
the similarities and differences between the Gaussian and ¯ 1 λ2 − λ1 ϕ1 (λ2 ) − λ2 ϕ1 (λ1 )] + O(ξ 2 )
=ξϕ1 (λ1 )ϕ1 (λ2 ) + ξ[λ
non-Gaussian cases. ¯ 1 (λ1 )ϕ1 (λ2 ) + ξ[ϕ
¯ 1 (λ1 ) − λ1 ][ϕ1 (λ2 ) − λ2 ] + O(ξ 2 ) ,
=(ξ − ξ)ϕ
3.2.1. Gaussian case
where we used ξ = ξ(θ) for brevity and reorganised the
For a Gaussian field, the two terms λ2s and λs [ϕ1 (λ1 ) + formula such that the for a Gaussian only the first term
ϕ1 (λ2 )] can be summarised and act to reduce the vari- remains, because ϕ1 (λ) = λ. We are then in position to
ance and correlation function by ξ, ¯ such that joint PDF compute the connected part of the joint PDF, given by
P(κ̃1 , κ̃2 ) is a Gaussian with variance σκ̃2 = σ 2 − ξ¯ and
¯ Hence, the covariance can P(κ̃1 , κ̃2 ) − P(κ̃1 )P(κ̃2 )
correlation ξκ̃ (θ) = ξ(θ) − ξ.
dλ1 dλ2 −λ1 κ̃1 −λ2 κ̃2 +ϕ(λ1 ,−λ1 )+ϕ(λ2 ,−λ2 )
ZZ
be obtained in full analogy to the previous computation = e
that led to equation (51) which is modified to read 2πi 2πi
h
X∞ ¯ n ∂ n PG (κ̃1 ) ∂ n PG (κ̃2 )
(ξ − ξ) × (ξ − ξ)ϕ ¯ 1 (λ1 )ϕ1 (λ2 ) + ξ[ϕ
¯ 1 (λ1 ) − λ1 ][ϕ1 (λ2 ) − λ2 ]
cov(PG (κ̃1 ), PG (κ̃2 )) = ,
n! ∂κ̃n1 ∂κ̃n2 i
n=2 + O({ξ 2 , ξ¯2 , (ξ − ξ)¯ 2 }) .
(65)
where due to the definition of the average of powers of When integrated over the distance distribution, the term
the correlation function (52) the n = 1 term disappears. linear in ξ − ξ¯ will vanish and the term involving products
Note that PG (κ̃1 ) does not denote the same PDF as of ϕ1 (λi ) − λi will create a new bias function b̃1,NG (κ̃) for
PG (κ1 ), for the Gaussian case they differ by the vari- the mean-subtracted convergence κ̃ only present for non-
ance being σκ̃2 = σκ2 − ξ¯ rather than σκ2 . Aside from this Gaussian fields. In terms of this, the PDF covariance can
(small) difference and the removal of the n = 1 term, we be written as
now have a different prefactor consisting of averages of ¯ b̃1,NG P)(κ̃1 )(b̃1,NG P)(κ̃2 ) (68a)
cov(P(κ̃1 ), P(κ̃2 )) =ξ(
powers of ξ − ξ¯ rather than ξ. Those averages of pow-
¯ n compared
ers of the correlation function ξ n and (ξ − ξ) + O({ξ 2 , ξ¯2 , (ξ − ξ)
¯ 2 }) ,
16
150
∂ log PLN
∂κ
b̃1,LN
b1,LN P(κ̃1 , κ̃2 ) − P(κ̃1 )P(κ̃2 )
dλ1 dλ2 −λ1 κ̃1 −λ2 κ̃2 +ϕ(λ1 ,−λ1 )+ϕ(λ2 ,−λ2 )
ZZ
100 = e
2πi 2πi
50 h
× (ξ − ξ)ϕ ¯ 1 (λ1 )ϕ1 (λ2 ) + ξ[ϕ
¯ 1 (λ1 ) − λ1 ][ϕ1 (λ2 ) − λ2 ]
0
ξ¯2
−50 + [ϕ1 (λ1 ) − λ1 ]2 [ϕ1 (λ2 ) − λ2 ]2 (70a)
2
−100 ¯ − ξ)ϕ
+ ξ(ξ ¯ 1 (λ1 )ϕ1 (λ2 )[ϕ1 (λ1 ) − λ1 ][ϕ1 (λ2 ) − λ2 ]
−150
ξ2 2
−0.03 −0.02 −0.01 0.00 0.01 0.02 0.03 + [ϕ (λ1 )ϕ2 (λ2 ) + ϕ2 (λ1 )ϕ21 (λ2 )] (70b)
κ̃ 2 1
¯2
(ξ − ξ) i
Fig. 14.— Additional first-order bias b̃1,NG (κ̃) (green) of the + ϕ21 (λ1 )ϕ21 (λ2 ) + O(ξ 3 ) .
mean-subtracted κ̃ = κ − κ̄ obtained from the sum of the ‘raw’ 2
bias b1 (green) and the response of the PDF ∂ log P̄/∂κ (red) as When integrated over the distance distribution, all terms
given in equation (68b). We also show predictions from the shifted
lognormal case (dashed) relying on the PDF from equation (26) linear in ξ − ξ¯ will vanish. The remaining contributions
and the bias b1,LN from equation (30) are
a) terms in the first two lines, involving powers of
ϕ1 (λ) − λ creating new bias functions, b̃n,NG , only
present for non-Gaussian fields.3
b) terms in the last two lines containing products of
powers of ϕn that yield the bias functions encoun-
tered for the covariance of the raw κ PDF, at sec-
where we introduced the leading order composite bias ond order b2 from equation (56d) and q1 from equa-
tion (56e).
∂
b̃1,NG (κ̃) = b1 (κ̃) + log P(κ̃) . (68b)
∂κ̃ This yields
In Figure 14 we illustrate the first composite non- cov(P(κ̃1 ), P(κ̃2 )) = ξ( ¯ b̃1,NG P)(κ̃1 )(b̃1,NG P)(κ̃2 )
Gaussian bias function (green) along with its ingredi- ξ¯2
ents, the first order bias b1 (blue) and the logarithmic + (b̃2,NG P)(κ̃1 )(b̃2,NG P)(κ̃2 ) (71a)
derivative of the PDF (red) as measured in one FLASK 2
realisation (data points) and predicted for a shifted log- ξ2
normal (dashed lines). We can see that for a mildly non- + [(b2 P)(κ̃1 )(q1 P)(κ̃2 ) + (q1 P)(κ̃1 )(b2 P)(κ̃2 )]
2
Gaussian field, this composite non-Gaussian bias func- ¯2
tion is more than an order of magnitude smaller than (ξ − ξ)
+ (b2 P)(κ̃1 )(b2 P)(κ̃2 ) (71b)
the raw bias b1 , such that there is no guarantee that 2
our result at leading order in ξ¯ constitutes the most + O({ξ 3 , ξ¯ ξ 2 , ξ¯3 , ξ¯ (ξ − ξ)
¯ 2 , (ξ − ξ)
¯ 3 }) ,
relevant correction. This can be established analyti-
cally for a shifted lognormal distribution using equa- where we defined the following NLO composite bias func-
tions (26) and (30), which gives tion only present for non-Gaussian fields
1 x(κ̃)
1
1 ∂(b1 P) 1 ∂2P
b̃1,LN (κ̃) = − + 2 2 1− . (68c) b̃2,NG (κ̃) = b2 (κ̃) + 2 + . (71c)
κ̃ + s sσG (σκ̃ ) 1 + κ̃/s P(κ̃) ∂κ̃ P(κ̃) ∂κ̃2
For a Gaussian field, the first three lines in equation (71)
The minimum value is b̃1,LN (κ̃ ' 0) ' −1/s and in the do not contribute because ϕ1 (λ) = λ and ϕn≥2 = 0, so
limit of s → ∞ we recover the Gaussian result with van- we recover the leading order of equation (65). Note that
ishing b̃1 . while we ordered terms according to powers of ξ here,
there is no guarantee that the leading order term pro-
3.2.3. Next-to-leading order: general non-Gaussian case portional to ξ¯ will be larger than the next-to-leading or-
der terms proportional to ξ 2 and (ξ − ξ)¯ 2 , as highlighted
We now generalise the previous result up to next-to- by the Gaussian case where the leading order term van-
leading order, where we find ishes and the shifted lognormal case where we found the
ϕ(λ1 , λ2 , −λ1 − λ2 ) − ϕ(λ1 , −λ1 ) − ϕ(λ2 , −λ2 ) (69) term to be subdominant. Typically, power of averages
of the mean correlation are much smaller than averages
¯ 1 (λ1 )ϕ1 (λ2 ) + ξ[ϕ
=(ξ − ξ)ϕ ¯ 1 (λ1 ) − λ1 ][ϕ1 (λ2 ) − λ2 ]
3 At higher orders, this will generalise to a class of terms with
ξ2 products of positive powers of ϕ1 (λ − λ) and powers of ϕ1 (λ)2
+ [ϕ21 (λ1 )ϕ2 (λ2 ) + ϕ2 (λ1 )ϕ21 (λ2 )] + O(ξ 3 ) . and/or ϕn≥2 (λ).
2
17
of powers, so ξ¯2 ξ 2 ≈ (ξ − ξ) ¯ 2 as evident from Fig- which can partially counteract the previous effect, but
ure 12 and for a weakly non-Gaussian field the b̃2,NG term was found to be negligible in our setting. Beyond direct
will only give a small contribution. For the mildly non- effects for the width of the likelihood contours, noise also
Gaussian κ PDF we are considering here, we observe that leads to an uncertainty for their location in parameter
the first order term is effectively removed by the mean- space (Dodelson & Schneider 2013b), which is only neg-
subtraction, as evident from the visual inspection of the ligible if Nsim − Nd Nd − Np as is the case for our
covariance matrix shown in Figure 6 and the dominant simplified setting with a short data vector.
eigenvectors, which resemble the full sky case shown in
4.1. Eigendecomposition of the covariance matrix
green/olive in the lower panel of Figure 16. Given the
similarity between the Gaussian and FLASK correlation The eigendecomposition of the covariance matrix of di-
matrices, we can deduce that the b2 term given by equa- mension n × n (set by the length n of the data vector)
tion (58) for the shifted lognormal case is in fact the can be written in terms of a complete set of eigenval-
dominant contribution. ues λi (for convenience ordered in descending order, so
Our results from this section can largely be carried over λ1 > . . . > λn ) and associated normalised eigenvectors
to the covariance modelling for the 3-dimensional mat- ei arranged in an orthogonal matrix O
ter PDF on mildly nonlinear scales (roughly 10Mpc/h
at low redshifts), as we will discuss in Section 5 be- cov(P, P) = O · diag(λ1 , . . . , λn ) · OT , O = (e1 , . . . , en )
n
low. Typically, the matter PDF features stronger non- X
Gaussianities, such that the lognormal shift parameter = λk ek ek T . (73)
will be close to unity s3D . 1. In Appendix A we ad- k=1
ditionally discuss tree models as a useful class of non- In terms of components, we can write this as
Gaussian models for which all order covariance expansion
n n
results can be obtained. X X
λk ek ek T
cov(Pi , Pj ) = ij
= λk (ek )i (ek )j ,
4. QUANTITATIVE COVARIANCE MODEL TESTS k=1 k=1
0.1
10−2
0.0
Fig. 15.— First (solid), second (dashed) and third (dotted) eigenvectors covariance κ, overlapping cells
eigenvectors of the Gaussian κ PDF covariance predicted for small
Gauss
patches (cyan) and the normalised bias functions (bn P)G (κ) (light 0.3
grey). We also show the eigenvectors obtained from a Hermite ex-
pansion (51) up to order nmax (shaded from dark grey to cyan). 0.2
The vertical black dotted lines indicate where κ = ±σ.
0.1
0.0
−0.1
the smallness (or vanishing) of the mean correlation ξ¯ −0.2
κ patches
κ patches, pred
will practically remove the contribution of b1 P such that κ full sky
the first and second eigenvectors will be resembling b2 P −0.3
κ full sky, pred
and b3 P, respectively. This qualitative statement re- −0.4
mains true for mildly non-Gaussian fields such as our FLASK
shifted lognormal (FLASK) case. We show the eigen- 0.3
values and dominant eigenvectors for the Gaussian and 0.2
FLASK maps in Figure 16. The eigenvalues in the up-
0.1
per panel of Figure 16 show a good agreement between
measurements and theory and suggest a clear ordering 0.0
between contributions from different bias functions aris- −0.1
ing from a large-separation expansion, qualitatively re- κ patches
sembling the behaviour seen in Figure 12 for the aver- −0.2 κ patches, pred
ages of powers of the cell correlation in comparison with −0.3 κ full sky
the powers of the variance. The middle and lower pan- κ full sky, pred
−0.4
els of Figure 16 show the leading three eigenvectors for −0.02 −0.01 0.00 0.01 0.02
Gaussian (middle) and lognormal (bottom) κ fields in κ
small patches (blue/cyan) and the full sky (green/olive).
The first eigenvector (solid) is determined by b1 P and Fig. 16.— (Upper panel) Eigenvalues of the Gaussian κ PDF
covariance for κ from small patches (measurement in blue, predic-
hence shows one zero-crossing close to κ ' 0 as pre- tion in cyan) and the full sky (measurement in green, prediction
dicted by b1,G (22) and b1,LN (30). The second eigen- in olive). (Middle and lower panel) The dominant eigenvectors of
vector (dashed) is driven by b2 P with two zero-crossings the Gaussian (middle) and FLASK (lower) κ PDF covariance, in-
around κ ' σκ as predicted by b2,G (50) and b2,G (58). cluding the 1st eigenvector (solid lines), 2nd eigenvector (dashed
lines) and 3rd eigenvector for raw κ from small patches (dotted
The third eigenvector (dotted) involves b3 P, which is pre- lines). The move from small patches to the full sky (solid and
dicted to have three zero-crossings for the Gaussian case dashed lines) essentially removes the first term in the eigendecom-
from equation (48), but is also sensitive to higher-order position and replaces it with the second term as evident from the
terms as we illustrated in Figure 15. For the covariance agreement of the first green cross with the second blue cross, and
the agreement of the solid green and the dashed blue lines. The
of the κ PDF on the full sky we show the first two eigen- FLASK results are qualitatively similar to the Gaussian case ex-
vectors (measurements in green, predictions in olive). cept for subtle shifts in the location of the zero-crossings and the
We observe that the first eigenvector closely resembles asymmetry between positive and negative κ, both related to the
non-Gaussianity of the underlying field.
the second eigenvector of the κ PDF from small patches
(measurements in blue, predictions in cyan), as predicted
for Gaussian fields in equation (65) and for mildly non-
Gaussian fields in equation (71). Overall, the theoreti-
cal predictions are in good agreement with the measure- ing the skewness of the PDF. Due to the nature of the
ments from Gaussian and FLASK maps. The Gaussian eigendecomposition, the contribution of the finite sam-
and the lognormal cases are qualitatively similar with a pling term on the diagonal of the covariance matrix (11)
few important differences appearing mostly as shifts to- to individual eigenvectors is typically small, such that
wards negative κ (reflecting the PDF peak location) and the leading order results for the non-overlapping case
an asymmetry between positive and negative κ reflect- (not shown) resemble the overlapping case aside from
19
additional noise caused by the small number of sampling precision matrix, lognormal full sky
cells. FLASK 8000
In principle, one can reconstruct the covariance matrix -0.02
from the terms in the eigendecomposition, although in 6000
the presence of an incomplete set of eigenvectors (a num-
ber less than the length of the data vector), this matrix -0.01 4000
will not be invertible and hence unsuitable for statistical
2000
analysis. For the case of non-overlapping cells, there is
a large contribution to the diagonal from shot noise, so 0.0
κ2
0
one can attempt a covariance reconstruction using only
the first few eigenvectors along with a matched diago- −2000
nal mostly arising from shot noise. In practice, we opt 0.01
for the approach to use an invertible covariance baseline −4000
model, such as the ones obtained from the Gaussian or −6000
shifted lognormal model. If this was found to be insuffi- 0.02
ciently accurate, it could be successively improved using theory −8000
the precision matrix expansion (Friedrich & Eifler 2017)
-0.02 -0.01 0.0 0.01 0.02
or iteratively corrected by adjusting the dominant eigen- κ1
vectors to their desired values as predicted by a large-
scale expansion discussed in Section 3.
precision matrix diagonal bands, lognormal fullsky
using the mean data vector, µ (also of length n) and data Fig. 18.— χ2 -tests for the validation of our predicted covariances
covariance C (of dimension n × n). The distribution of and the Gaussianity of our data vector showing the χ2 distribution
(black line) and the result for a data vector drawn from a Gaussian
χ2 follows a χ2 distribution with the appropriate number with the measured covariance (black histogram). (Upper panel)
of degrees of freedom n (here the number of PDF bins). Covariance matrix test for the PDF of the convergence κ (blue) or
To test the Gaussianity of the data vector distribution, mean-subtracted convergence κ − κ̄ (purple) measured from over-
one can create another sample of data vectors, DG , by lapping cells computed in small patches comparing the FLASK
measurements (solid) to the predictions from a shifted lognormal
drawing N times from a multivariate Gaussian using the (dotted). (Lower panel) Covariance matrix test for overlapping
mean data vector, µ and covariance C. Then one recom- cells computed for the full sky comparing the measurements from
putes the χ2 for every sample using the covariance to 500 FLASK realisations (solid) to the lognormal prediction (dot-
check whether it follows the same distribution ted).
90
0.
denotes the precision matrix introduced in Section 4.2,
which is the matrix-inverse of the data covariance C
σ8
85
0.
given in equation (17) and is was assumed that the covari-
ance matrix is independent of cosmology. The Fisher for-
80
0.
malism rests on the assumption that the summary data
vector is Gaussian distributed which we have explicitly
75
0.
checked in the last subsection for the central region of
20
24
28
32
36
75
80
85
90
the weak lensing PDF considered here. The parameter
0.
0.
0.
0.
0.
0.
0.
0.
0.
covariance matrix C(~ p) is then obtained as inverse of the Ωcdm σ8
Fisher matrix, CijP
= (F −1 )ij . In the Fisher formalism,
Fig. 19.— Fisher forecasts comparing different FLASK-generated
marginalisation over a subset of parameters is achieved covariances using non-overlapping cells (cyan), overlapping cells
by simply selecting the appropriate sub-elements of the (blue), overlapping cells with mean-subtraction (purple) and over-
parameter covariance. lapping cells on full sky maps (green). All covariances were rescaled
To further validate our theoretical covariances, we per- to mimic the same survey area.
form simple Fisher forecasts for two cosmological param-
eters {Ωcdm , σ8 } with the measured and predicted co-
variance matrices, while keeping the data vector and its zs = 2, A = 15000 deg2
dependence on cosmological parameters in (80) as pre- FLASK patches κ − κ̄
dicted from large-deviation statistics (see Boyle et al. Lognormal patches κ − κ̄
2021).4 The Fisher forecasts in Figure 19 demonstrate FLASK full sky
Lognormal full sky
that measuring non-overlapping cells only (cyan) leads FLASK full sky noisy
to much wider parameter errors than including cell over- Lognormal full sky noisy
laps (blue), such that capturing cell overlaps is imper-
ative to avoid artificially inflating the error budgets (as
already pointed out in Szapudi & Colombi 1996). Addi-
tionally we show that when measuring covariances from
small patches obtained from FLASK and subtracting the
88
0.
24
26
28
30
80
82
84
86
88
0.
0.
0.
0.
0.
0.
0.
0.
0.
Ωcdm σ8
5. EXTENSION TO 3D CLUSTERING PDF
COVARIANCE Fig. 20.— Fisher forecasts validating the predicted covariances
(dotted) against FLASK measurements (solid) for shifted lognor-
In this Section, we describe the essential steps needed mal fields on small patches (purple), the full sky (green) and in-
to adapt our covariance model to 3-dimensional fields cluding shape noise (orange).
using the 3D matter PDF as an example. While the
line-of-sight projection involved in computing the weak
4 The predicted derivatives rely on large-deviations theory for
the convergence PDF (Barthelemy et al. 2020a). Using CLASS lensing convergence and photometric galaxy density on
(Blas et al. 2011), linear and non-linear (Halofit, Peacock & Smith
2014) matter density variances are calculated to rescale the matter
mildly nonlinear scales tends to Gaussianise the under-
density CGF in each redshift slice. The convergence CGF is ob- lying field, 3D matter and spectroscopic galaxy densities
tained by integration over the redshift slices, and translated to the typically show a greater degree of non-Gaussianity on
final PDF by an inverse Laplace transform. mildly nonlinear scales. This might complicate the two-
22
1.2
2020), and can be improved by including a shift factor
large-deviation theory, σμ =0.61 below unity (dashed black line).6 The good match for
1.0
the fiducial PDF in Figure 21 and the leading order bias
shifted lognormal, σμ =0.61, s=0.9
in Figure 23 suggests that the shifted lognormal model
(ρ), z=0, R=10 Mpc/h
matter PDF covariance P(Ω), z=0, R=10 Mpc/h super-sample covariance ingredients, z=0, R=10 Mpc/h
3
0.3 SSC ∂ log P
pred b̂1 pred b1 pred
∂ log ρ̂
∂ log P b̂1 sim b1 lognormal
2 sim
0.0004 ∂ log ρ̂
∂ log P b1 sim
0.39 ∂δb sim
1
0.52 0.0002
0
Ω2
0.76 0.0000
−1
1.0
°0.0002
−2
1.32
°0.0004
−3
0.3 0.39 0.52 0.76 1.0 1.32 1.92
1.92 sim x10 ρ̂
0.3 0.39 0.52 0.76 1.0 1.32 1.92
Ω1 Fig. 23.— Comparison of the theoretically predicted sphere bias
(blue solid), the lognormal approximation (blue dashed) and the
measurement in one realisation of the fiducial cosmology of Qui-
Fig. 22.— Comparison of the PDF covariance matrix contribu- jote (blue data points) at radius R = 10 Mpc/h, distance d = 20
tions for z = 0, R = 10 Mpc/h as measured from the 1(Gpc/h)3 Mpc/h and redshift z = 0. Additionally shown is the logarithmic
Quijote simulations and the super-sample covariance contribution derivative of the PDF (red) as predicted (solid) and measured (data
constructed from the DC mode following equation (81). The upper points) and the resulting prediction from the effective bias (83) us-
triangle shows the super-sample covariance contribution while the ing the theoretical ingredients (green solid) or the simulated ones
lower triangle shows the measured covariance from the simulations (green dotted). The predictions agrees very well with the mea-
enhanced by a factor of 10, while the diagonal from the upper panel surements from separate universe simulations (green data points).
is masked. The vertical dotted line indicates the peak location of the PDF.
The ρ-range shown equals the one used in the covariance plots in
Figure 22.
6. CONCLUSION & OUTLOOK Based on our PDF estimator, one can further attempt to
Conclusion. In this work, we have shown how to com- model cross-covariances between the one-point PDF and
pute covariances for the one-point PDF of the weak lens- 2-point statistics, which in the mildly non-linear regime
ing convergence on mildly nonlinear scales. Our covari- are dominated by the correlation between the PDF and
ance predictions (6) rely on the two-point PDF, which the variance (see Figure 12 in Uhlemann et al. 2020).
is integrated over all separations according to the distri- Note that our covariance model and its possible exten-
bution of distances in the survey area. Our formalism sions can serve as a starting point for improving covari-
includes predictions for the key effects including finite ance estimates from simulations, even in regimes where
sampling (11) and super-sample covariance (13) which the model itself becomes inaccurate. The Gaussian and
were shown to scale with the inverse survey area. We shifted lognormal covariance models can be used as a
presented effective models for this joint PDF for a Gaus- starting point for a precision matrix expansion (Friedrich
sian field (19), a shifted lognormal field (28) matching & Eifler 2017) in which the leading order eigenvectors
the PDF skewness and density-dependence of the two- could iteratively refine this baseline covariance model.
point clustering (Figure 8), and a field affected by shape Additionally, analytical models can be used to construct
noise (42) from galaxy shape measurements. Going be- priors for a Bayesian covariance estimation with the help
yond (log-)normal models, we laid out a large-separation of cheap surrogates (Chartier & Wandelt 2022).
expansion for the two-point PDF that captures the key
effects of the PDF covariance for general non-Gaussian ACKNOWLEDGEMENTS
fields. The validity of our predictions has been estab- This work has made use of the Infinity Cluster
lished through a set of tests including visual inspection hosted by Institut d’Astrophysique de Paris. We thank
of the covariance, leading order eigenvalues and eigen- Stéphane Rouberol for smoothly running this cluster for
vectors (Figure 16), the precision matrix (Figure 17), χ2 us. We thank the authors of Takahashi et al. (2017)
tests (Figure 18) and a two-parameter Fisher forecast for their publicly available lensing maps, Villaescusa-
(Figure 20). With this suite of tests we have demon- Navarro et al. (2019) for their publicly available simu-
strated that our PDF covariance model not only accu- lation suite products, and Xavier et al. (2016) for their
rately describes large sets of simulated FLASK (Xavier publicly available FLASK code. The figures in this work
et al. 2016) maps, but also measurements of realistic full were created with matplotlib (Hunter et al. 2007) mak-
sky convergence maps obtained from full N -body simu- ing use of the numpy (Harris et al. 2020), scipy (Vir-
lations (Takahashi et al. 2017). tanen et al. 2020), sci-kit image (van der Walt et al.
Finally, we have sketched how the formalism described 2014) and ChainConsumer Python packages.
here for mildly non-Gaussian weak lensing observables is CU is supported by the STFC Astronomy Theory Con-
also applicable to more strongly non-Gaussian 3D clus- solidated Grant ST/W001020/1 from UK Research & In-
tering quantities. Most importantly, we showed how this novation. CU’s work was performed in part at Aspen
can yield predictions for the super-sample covariance ef- Center for Physics, which is supported by National Sci-
fect (Figures 22 and 23) to complement covariance mea- ence Foundation grant PHY-1607611 and partially sup-
surements from finite-box simulations. ported by a grant from the Simons Foundation. AG
Outlook. We expect our formalism can be straightfor- is supported by an EPSRC studentship under Project
wardly adapted to describe the one-point PDF of related 2441314 from UK Research & Innovation. SC thanks
weak-lensing observables such as the CMB weak lensing Fondation Merac and the French Agence Nationale de la
convergence (Barthelemy et al. 2020b) and aperture mass Recherche (grant ANR-18-CE31-0009) for funding. The
(Barthelemy et al. 2021), as well as photometric galaxy authors thank the Euclid work packages for additional
clustering. Our model can be extended to describe the galaxy clustering probes and higher-order weak lensing
cross-correlations between weak lensing PDFs at differ- statistics, as well as the Rubin LSST:DESC higher-order
ent smoothing scales and/or tomographic redshift bins. statistics group for useful discussions.
REFERENCES
Amendola L., et al., 2018, Living Reviews in Relativity, 21 Chartier N., Wandelt B. D., 2022, MNRAS, 515, 1296
Baratta P., Bel J., Plaszczynski S., Ealet A., 2020, A&A, 633, A26 Clerkin L., et al., 2016, Monthly Notices of the Royal
Bardeen J. M., Bond J. R., Kaiser N., Szalay A. S., 1986, ApJ, Astronomical Society, 466, 1444–1461
304, 15 Codis S., Pichon C., Bernardeau F., Uhlemann C., Prunet S.,
Barthelemy A., Codis S., Uhlemann C., Bernardeau F., Gavazzi 2016a, MNRAS, 460, 1549
R., 2020a, MNRAS, 492, 3420 Codis S., Bernardeau F., Pichon C., 2016b, MNRAS, 460, 1598
Barthelemy A., Codis S., Bernardeau F., 2020b, MNRAS, 494, Colavincenzo M., et al., 2019, MNRAS, 482, 4883
3368 Coles P., Jones B., 1991, MNRAS, 248, 1
Barthelemy A., Codis S., Bernardeau F., 2021, MNRAS, 503, Desjacques V., Jeong D., Schmidt F., 2018, Phys. Rep., 733, 1
5204 Dodelson S., Schneider M. D., 2013a, Phys. Rev. D, 88, 063537
Bernardeau F., 1996, A&A, 312, 11 Dodelson S., Schneider M. D., 2013b, Phys. Rev. D, 88, 063537
Bernardeau F., 2022, arXiv e-prints, p. arXiv:2207.02500 Friedrich O., Eifler T., 2017, Monthly Notices of the Royal
Bernardeau F., Reimberg P., 2016, Phys. Rev. D, 94, 063520 Astronomical Society, 473, 4150
Bernardeau F., Valageas P., 2000, A&A, 364, 1 Friedrich O., et al., 2018, Phys. Rev. D, 98, 023508
Bernardeau F., Pichon C., Codis S., 2014, Phys. Rev. D, 90, Friedrich O., Uhlemann C., Villaescusa-Navarro F., Baldauf T.,
103519 Manera M., Nishimichi T., 2020, MNRAS, 498, 464
Bernardeau F., Codis S., Pichon C., 2015, MNRAS, 449, L105 Friedrich O., et al., 2021, MNRAS, 508, 3125
Bertschinger E., 1987, ApJ, 323, L103 Gatti M., et al., 2019, arXiv e-prints, p. arXiv:1911.05568
Blas D., Lesgourgues J., Tram T., 2011, Journal of Cosmology Gruen D., et al., 2018, Phys. Rev. D, 98, 023507
and Astroparticle Physics, 2011, 034–034 Halder A., Friedrich O., Seitz S., Varga T. N., 2021, MNRAS,
Boyle A., Uhlemann C., Friedrich O., Barthelemy A., Codis S., 506, 2780
Bernardeau F., Giocoli C., Baldi M., 2021, Monthly Notices of Harris et al. C. R., 2020, Nature, 585, 357
the Royal Astronomical Society, 505, 2886
25
Hartlap J., Simon P., Schneider P., 2006, Astronomy & Sellentin E., Heavens A. F., 2017, MNRAS, 464, 4658
Astrophysics, 464, 399–404 Sherman J., Morrison W. J., 1950, The Annals of Mathematical
Hilbert S., Hartlap J., Schneider P., 2011, Astronomy & Statistics, 21, 124
Astrophysics, 536, A85 Sheth R. K., Tormen G., 1999, MNRAS, 308, 119
Hoffman Y., Ribak E., 1991, ApJ, 380, L5 Sirko E., 2005, ApJ, 634, 728
Hou J., Cahn R. N., Philcox O. H. E., Slepian Z., 2021, arXiv Szapudi I., Colombi S., 1996, ApJ, 470, 131
e-prints, p. arXiv:2108.01714 Szapudi I., Colombi S., Bernardeau F., 1999, Monthly Notices of
Hunter et al. J. D., 2007, Computing in Science & Engineering, 9, the Royal Astronomical Society, 310, 428
90 Takahashi R., Hamana T., Shirasaki M., Namikawa T.,
Ivezić Ž., et al., 2019, ApJ, 873, 111 Nishimichi T., Osato K., Shiroyama K., 2017, The
Jamieson D., Loverde M., 2020, Physical Review D, 102 Astrophysical Journal, 850, 24
Kaiser N., 1984, ApJ, 284, L9 Taylor A., Joachimi B., 2014, MNRAS, 442, 2728
Kaufman G. M., 1967, Report No. 6710, Center for Operations Taylor A., Joachimi B., Kitching T., 2013, MNRAS, 432, 1928
Research and Econometrics, Catholic University of Louvain, Thiele L., Hill J. C., Smith K. M., 2020, Phys. Rev. D, 102,
Heverlee, Belgium 123545
Lacasa F., Grain J., 2019, Astronomy & Astrophysics, 624, A61 Uhlemann C., Codis S., Pichon C., Bernardeau F., Reimberg P.,
Laureijs R., et al., 2011, preprint, (arXiv:1110.3193) 2016, MNRAS, 460, 1529
Martin S., Schneider P., Simon P., 2012, Astronomy & Uhlemann C., Codis S., Kim J., Pichon C., Bernardeau F.,
Astrophysics, 540, A9 Pogosyan D., Park C., L’Huillier B., 2017, MNRAS, 466, 2067
Mathai A. M., Moschopoulos P., Pederzoli G., 1999, Statistica, Uhlemann C., Pichon C., Codis S., L’Huillier B., Kim J.,
59, 61 Bernardeau F., Park C., Prunet S., 2018, MNRAS, 477, 2772
Mo H. J., White S. D. M., 1996, MNRAS, 282, 347 Uhlemann C., Friedrich O., Villaescusa-Navarro F., Banerjee A.,
Peacock J. A., Smith R. E., 2014, HALOFIT: Nonlinear Codis S., 2020, MNRAS, 495, 4006
distribution of cosmological mass and galaxies (ascl:1402.032) Villaescusa-Navarro F., et al., 2019, arXiv e-prints, p.
Percival W. J., et al., 2014, MNRAS, 439, 2531 arXiv:1909.05273
Percival W. J., Friedrich O., Sellentin E., Heavens A., 2022, Virtanen et al. P., 2020, Nature Methods, 17, 261
MNRAS, 510, 3207 Weisstein E. W., 2022, Cube Line Picking. From MathWorld—A
Repp A., Szapudi I., 2021a, MNRAS, 500, 3631 Wolfram Web Resource,
Repp A., Szapudi I., 2021b, Monthly Notices of the Royal http://mathworld.wolfram.com/CubeLinePicking.html
Astronomical Society, 509, 586 Xavier H. S., Abdalla F. B., Joachimi B., 2016, MNRAS, 459,
Sarma Boruah S., Rozo E., Fiedorowicz P., 2022, arXiv e-prints, 3693
p. arXiv:2204.13216 van der Walt et al. S., 2014, PeerJ, 2, e453
1.0
and for the minimal tree model the Ij (ρ) can be obtained
σ 2 =0.45 as
σ 2 =0.1 4j 2 4jρ
0.5 Ij (ρ) = 4 exp − 2 (j + ρ) 0 F1 2, 4 . (A9)
σ σ σ
2( ρ -1)
b1,t (ρ)σ 2
0.0
LDT
We can therefore obtain a closed form equation for bn (ρ)
by differentiation of equation A9. The nth derivative of
ρ-1 Ij is given by
-0.5
n X k
dn Ij X n k (n−k)
-1.0 = f (j)g (k−`) (j)h(`) (j) ,
dj n k `
k=0 `=0
0.5 1.0 1.5 2.0 (A10)
ρ where we’ve split Ij into three functions, f, g, h, which
1.5 we can differentiate separately as follows
σ 2 =0.45 4j 2
f (j) = 4 , g(j) = exp − 2 (j + ρ) , (A11a)
σ 2 =0.1
σ σ
1.0
dn f 4
b2,t (ρ)σ 4 +σ 2
B. REFINED DERIVATION OF COVARIANCE distribution of the N cells. Let us assume that the value
FOR CELLS IN LARGE-SEPARATION ρ̂I is fixed to a grid point I, such that the distance be-
LIMIT tween the densities ρ̂I and ρ̂J are given by the distance
In Appendix C of Codis et al. (2016b) the result of of grid point J to grid point I. Hence, we will drop
{rIJ } from the argument of the joint PDF in the follow-
the covariance of finding Ni cells with density ρi and Nj
spheres of density ρj is given as (with no summation over ing. Then considering different spatial distributions of
the cells corresponds to selecting a subset S of length N
repeated indices implied)
from the set of indices {1, . . . , NT }
¯ i )b(ρj ) + δij N̄i ,
cov(Ni , Nj ) = N̄i N̄j ξb(ρ (B1)
where b(ρ) is the cell bias and ξ¯ is the mean correlation X Y Z Y Z
P b (N ) = dρ̂I dρ̂J P(ρ̂1 , . . . , ρ̂NT ) .
of spheres defined in equation (B10). We can rewrite ˆ
∆ ˆ
¬∆
S I∈S J ∈S
/
the ratio of the number of cells in the bin to the total
number in terms of the measured PDF value in the bin (B7)
and associated density bin size ∆i as Ni ' NT P(ρi )∆i
and similarly N̄i ' NT P̄(ρi )∆i . Hence, we have for the Now we can insert equation (B6) and simplify using
covariance of the histogram probability Pi :
The probability to find exactly N cells in a density bin where in the second step we have converted p = N̄ /NT .
ˆ = [ρ̂ − ∆ρ/2, ρ̂ + ∆ρ/2] is given by a) integrating over
∆ Note that one can rewrite the sums over all N -
the probabilities of finding N of the values of ρ1 , . . . , component subsets S (of which there are NNT ) in terms
ρNT within ∆ˆ and NT − N outside ¬∆ ˆ = R+ \ ∆, ˆ and of the mean spatial correlation (which agrees with the
b) summing over all different possibilities for the spatial definition used in Codis et al. (2016b))
var(N ) = hN 2 i − hN i2 = N̄ + N̄ 2 ξ¯ b̄2 (B15) The sums over the different subsets SN1 and SN2 can be
simplified using combinatorical properties.
P We use the
9 Note that without the Poisson limit, the variance of the bino-
average correlation ξ¯ and its numerator ξ as defined in
mial distribution becomes N̄ (1 − p). equation (B10), ¬i to indicate the opposite index ¬i = 2
29
NT − 2 NT − N1 − 2 X
= ξ With this we reproduce the result from Codis et al.
N1 N2 (2016a) for the Poisson limit and the associated covari-
NT NT − N1 ¯∆NT,12 (∆NT,12 − 1)
ance
= ξ . (B21) hN1 N2 i = N̄1 N̄2 (1 + ξ¯ b̄1 b̄2 ) . (B23)
N1 N2 2