Predicting the 21 cm field with a Hybrid Effective Field Theory approach

Danial Baradaran Berkeley Center for Cosmological Physics, Department of Physics, University of California, Berkeley, CA 94720, USA    Boryana Hadzhiyska boryanah@berkeley.edu Physics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA Berkeley Center for Cosmological Physics, Department of Physics, University of California, Berkeley, CA 94720, USA    Martin White Physics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA Berkeley Center for Cosmological Physics, Department of Physics, University of California, Berkeley, CA 94720, USA    Noah Sailer Berkeley Center for Cosmological Physics, Department of Physics, University of California, Berkeley, CA 94720, USA Physics Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
Abstract

A detection of the 21 cm signal can provide a unique window of opportunity for uncovering complex astrophysical phenomena at the epoch of reionization and placing constraints on cosmology at high redshifts, which are usually elusive to large-scale structure surveys. In this work, we provide a theoretical model based on a quadratic bias expansion capable of recovering the 21 cm power spectrum with high accuracy sufficient for upcoming ground-based radio interferometer experiments. In particular, we develop a hybrid effective field theory (HEFT) model in redshift space that leverages the accuracy of N𝑁Nitalic_N-body simulations with the predictive power of analytical bias expansion models, and test it against the Thesan suite of radiative transfer hydrodynamical simulations. We make predictions of the 21 cm brightness temperature field at several distinct redshifts, ranging between z=6.5𝑧6.5z=6.5italic_z = 6.5 and 11, thus probing a large fraction of the reionization history of the Universe (xHI=0.30.9subscript𝑥HI0.3similar-to0.9x_{\rm HI}=0.3\sim 0.9italic_x start_POSTSUBSCRIPT roman_HI end_POSTSUBSCRIPT = 0.3 ∼ 0.9), and compare our model to the ‘true’ 21 cm brightness in terms of the correlation coefficient, power spectrum and modeling error. We find percent-level agreement at large and intermediate scales, k0.5h/Mpcless-than-or-similar-to𝑘0.5Mpck\lesssim 0.5h/{\rm Mpc}italic_k ≲ 0.5 italic_h / roman_Mpc, and favorable behavior down to small scales, k1h/Mpcsimilar-to𝑘1Mpck\sim 1h/{\rm Mpc}italic_k ∼ 1 italic_h / roman_Mpc, outperforming pure perturbation-theory-based models. To put our findings into context, we show that even in the absence of any foreground contamination the thermal noise of a futuristic HERA-like experiment is comparable with the theoretical uncertainty in our model in the allowed ‘wedge’ of observations, providing further evidence in support of using HEFT-based models to approximate a range of cosmological observables.

I Introduction

Detecting the hyperfine transition of neutral hydrogen in the form of emitted radiation at a rest wavelength of 21 cm offers a viable path for mapping out structure in the Universe at high redshifts, which are otherwise hard to probe due to the small number of detectable luminous sources. Currently, most of the cosmological information used to test our theories about the Universe comes from either large-scale structure (LSS) surveys at low redshifts z3less-than-or-similar-to𝑧3z\lesssim 3italic_z ≲ 3 or the cosmic microwave background (CMB), emitted at z1100𝑧1100z\approx 1100italic_z ≈ 1100 during the epoch of recombination. As such, much of the formation history of the Universe at intermediate redshifts remains unexplored. The ideal tracer of the period spanning between the epoch of recombination and the epoch of reionization (EoR) is the diffuse neutral hydrogen gas, detectable in the form of 21 cm emission.

The EoR refers to the period during which the first sources of (ionizing) UV radiation turned on and ionized the neutral hydrogen in the Universe (2016ARA&A..54..313M, ). In addition to providing avenues for testing central assumptions in cosmology, studying the EoR is invaluable to understanding the formation of galaxies and other complex astrophysical phenomena. As indicated by CMB and Lyα𝛼\alphaitalic_α forest measurements, the mean redshift of reionization is z78𝑧78z\approx 7-8italic_z ≈ 7 - 8 (2016A&A…596A.108P, ), and it is mostly complete by z6similar-to𝑧6z\sim 6italic_z ∼ 6 (2015MNRAS.447..499M, ). Improvements in the measurements of the reionization process could reveal information about the properties of the ionizing sources (2006MNRAS.365..115F, ; 2007MNRAS.377.1043M, ), the abundance of small-scale structures that absorb many of the ionizing photons (2006MNRAS.366..689C, ; 2009MNRAS.394..960C, ; 2008ApJ…682…14F, ), and overall help us to better grasp the process of galaxy formation.

Several experiments are already well on their way to directly image the neutral hydrogen and map the 21 cm signal from the EoR. A number of them attempt to do so at the level of the mean global signal (monopole) such as EDGES (2017ApJ…835…49M, ), LEDA (2018MNRAS.478.4193P, ), PRIZM (2019JAI…..850004P, ), and SARAS (2018ExA….45..269S, ), while others are after detecting the fluctuations in the signal such as PAPER (2010AJ….139.1468P, ), the MWA (2013PASA…30….7T, ; 2013PASA…30…31B, ), LOFAR (2013A&A…556A…2V, ), HERA (2017PASP..129d5001D, ), and eventually the Square Kilometre Array (SKA) (2020PASA…37….2W, ). Additionally, efforts such as CHIME (2014SPIE.9145E..22B, ), HIRAX (2016SPIE.9906E..5XN, ), and CHORD (2019clrp.2020…28V, ) attempt to trace the post-reionization history of the Universe by adopting intensity mapping techniques. Once a detection of the redshifted 21 cm signal is confirmed, the challenge is to robustly extract information about the EoR and provide constraints on our cosmological model.

To allow for a reliable interpretation of the data, a number of numerical methods have been developed, including semi-analytic reionization models such as 21CMFAST (2011MNRAS.411..955M, ) and SimFast21 (2010MNRAS.406.2421S, ). The advantage of these methods is that they are computationally inexpensive and can therefore be scaled up to cosmological volumes. In these models, ionized regions are typically ‘painted’ on top of a realization of the matter density distribution at the locations where sources are likely to reside (2005MNRAS.363.1031F, ; 2012ApJ…747..126A, ; 2014MNRAS.440.1662S, ). While some of these prescriptions can also incorporate models for the photon sinks that retard reionization, they do not directly resolve them and as such are not as faithful on small scales.

The most reliable method for simulating the reionization process is through the use of radiative transfer simulations (1997ApJ…486..581G, ; 2003MNRAS.343.1101C, ; 2006MNRAS.369.1625I, ; 2007ApJ…671….1T, ; 2014ApJ…793…30G, ; 2017MNRAS.466..960P, ), which are much more computationally expensive and are therefore often limited to smaller box sizes (similar-to\sim100 Mpc) (2005ApJ…624L..65B, ). The challenge is that these simulations need to have a sufficiently large number of resolution elements in order to capture the smallest intergalactic structures that act as photon sinks (2000ApJ…530….1M, ; 2000ApJ…542..535G, ; 2004MNRAS.348..753S, ), and at the same time have a sufficiently large volume to provide a representative sample of the largest structures found in the Universe. Nonethlesess, these simulations are prohibitively expensive to survey the vast parameter space of potential source models. However, perturbative models, inspired by Effective Field Theory (EFT) (e.g., 2005MNRAS.363.1031F, ; 2007ApJ…654…12Z, ; 2011MNRAS.414..727Z, ), can potentially provide a fast way of analyzing 21 cm observables and thus, elucidate many astrophysical and cosmological conundrums.

Until recently, it was believed that the 21 cm signal cannot be treated perturbatively in the regime probed by 21 cm experiments due to the presence of large ionized bubbles. However, developments in the last few years (2007MNRAS.375..324Z, ; 2015PhRvD..91h3015M, ; 2018JCAP…10..016M, ; 2022PhRvD.106l3506Q, ; 2022JCAP…10..007S, ) suggest otherwise: the 21cm signal is likely perturbative on many of the scales that future interferometric instruments will be sensitive to. On these scales, the signal is modeled in terms of ‘bias’ parameters, which connect it to the underlying matter density field and which can be tied to physical quantities relevant to the physics of reionization.

Apart from being able to model the 21 cm field in real space, it is of utmost importance that we develop techniques that work in redshift space as well, where the 21 cm field is affected by the presence of redshift space distortions (RSDs). These contributions to the observed redshift result from the peculiar velocities of the neutral hydrogen along the line-of-sight (LOS) and as such, make it difficult to disentangle the ‘real-space’ distribution of the field from its intrinsic motion along the LOS. Interestingly, future experiments such as HERA will predominantly observe modes that are nearly parallel to the LOS for three main reasons: i) the instrument needs to be shielded from ground radio sources; ii) obtaining high spectral resolution is easier than obtaining high angular resolution; iii) modes parallel to the LOS are less affected by foreground contamination due to the chromatic instrument response, which even in the case of spectrally smooth foregrounds induces an unsmooth ringing that is mostly confined to the low ksubscript𝑘parallel-tok_{\parallel}italic_k start_POSTSUBSCRIPT ∥ end_POSTSUBSCRIPT and high ksubscript𝑘perpendicular-tok_{\perp}italic_k start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT regime (see e.g., 1997ApJ…486..581G, ; 2003MNRAS.343.1101C, ; 2015aska.confE…1K, ). This underscores the importance of developing redshift-space methods such as the one proposed in this work.

In this paper, we develop a method originally from cosmological perturbation theory for modeling the 21 cm signal in redshift space. Our method provides a viable path towards extracting the cosmological information about the composition of the Universe, encapsulated in the large-scale modes, and the astrophysical information about the morphology of reionization, encoded in the small-scale modes. Since the 21 cm field obeys basic symmetries such as rotational invariance and the equivalence principle, we can write it as an expansion in powers of the density field, tidal field and low-order derivatives on scales larger than the typical size of the ionized bubbles (2009JCAP…08..020M, ; 2018JCAP…10..016M, ; 2018PhR…733….1D, ; 2022PhRvD.106l3506Q, ). To enable us to work to smaller scales, we combine the power of such a symmetries-based bias expansion and the accuracy of N𝑁Nitalic_N-body simulations in the non-linear regime. The free (bias) parameters of our theory can be linked to valuable astrophysical quantities and can reveal the answers to questions such as when reionization occurred, what fraction of the Universe is neutral, what the bias of the sources is, and what the characteristic bubble sizes are.

This paper is organized as follows. In Section II, we provide details about our hybrid perturbative model and describe the radiative transfer simulation Thesan-1, against which we test our model. In Section III, we present our main findings in terms of the two-point correlation function and the relative modeling error, putting it in the context of the sensitivity of future experiments. Finally, we summarize our main conclusions in Section IV.

II Methods

In this section, we describe the radiative transfer simulations adopted in this study and our theoretical model for approximating the 21 cm field.

II.1 Thesan simulations

We test the theoretical model developed in this work via the high-resolution radiation-magneto-hydrodynamical simulation suite, Thesan. This suite is designed to faithfully reproduce both the complex physics of reionization as well as galaxy formation at high redshifts. The comoving box size of these simulations is 65h1absent65superscript1\approx 65\,h^{-1}≈ 65 italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPTMpc, and they are sufficiently well-resolved to capture the formation of atomic cooling halos, which are the smallest structures significantly contributing to the process of reionization. The galaxy formation model is built on top of the well-tried and tested IllustrisTNG model (2017MNRAS.465.3291W, ; 2018MNRAS.473.4077P, ), whereas the initial conditions are produced according to Ref. (2016MNRAS.462L…1A, ). The simulations are run using the radiative transfer version of the Arepo code, Arepo-RT (2010MNRAS.401..791S, ; 2019MNRAS.485..117K, ). Radiation-magnetohydrodynamics equations are solved on a mesh, with mesh-generating points that approximately follow the gas flow via their Voronoi tessellation.

The Thesan simulations have been shown to reproduce a number of observables of the reionization history of the Universe, the evolution of the temperature of the intergalactic medium (IGM), the optical depth to the CMB, the UV luminosity function at z6𝑧6z\geq 6italic_z ≥ 6 (2022MNRAS.514.3857K, ), the photo-ionization rate, the mean-free-path of ionizing photons, the IGM opacity, and the temperature-density relation (2022MNRAS.512.4909G, ). In this work, we employ Thesan-1, the highest-resolution simulation, which contains 21003 dark matter particles with mass 3.12×106M3.12superscript106subscript𝑀direct-product3.12\times 10^{6}\ M_{\odot}3.12 × 10 start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT and 21003 gas particles of mass 5.82×105M5.82superscript105subscript𝑀direct-product5.82\times 10^{5}\ M_{\odot}5.82 × 10 start_POSTSUPERSCRIPT 5 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT, and its dark-matter-only (N𝑁Nitalic_N-body) counterpart Thesan-Dark-1, which contains 21003 dark matter particles with mass 3.702×106M3.702superscript106subscript𝑀direct-product3.702\times 10^{6}\ M_{\odot}3.702 × 10 start_POSTSUPERSCRIPT 6 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT. The cosmology of the suite is identical to that of IllustrisTNG: Ωm=0.3089subscriptΩ𝑚0.3089\Omega_{m}=0.3089roman_Ω start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT = 0.3089, Ωb=0.0486subscriptΩ𝑏0.0486\Omega_{b}=0.0486roman_Ω start_POSTSUBSCRIPT italic_b end_POSTSUBSCRIPT = 0.0486, ΩΛ=0.6911subscriptΩΛ0.6911\Omega_{\Lambda}=0.6911roman_Ω start_POSTSUBSCRIPT roman_Λ end_POSTSUBSCRIPT = 0.6911, h=0.67740.6774h=0.6774italic_h = 0.6774, σ8=0.8159subscript𝜎80.8159\sigma_{8}=0.8159italic_σ start_POSTSUBSCRIPT 8 end_POSTSUBSCRIPT = 0.8159, ns=0.9667subscript𝑛𝑠0.9667n_{s}=0.9667italic_n start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = 0.9667.

In this work, we utilize outputs at several different redshifts: z=10.8𝑧10.8z=10.8italic_z = 10.8, 9, 8.3, 7.5, 7, and 6.5, with neutral hydrogen fraction of xHI=0.92subscript𝑥HI0.92x_{\rm HI}=0.92italic_x start_POSTSUBSCRIPT roman_HI end_POSTSUBSCRIPT = 0.92, 0.8, 0.71, 0.57, 0.45 and 0.3, respectively, thus sampling a large fraction of the reionization history of the Universe. We show the reionization history predicted by the Thesan-1 simulation in Fig. 1. Consistent with CMB experiments, the mid-point in reionization according to Thesan-1 occurs around z78similar-to-or-equals𝑧78z\simeq 7-8italic_z ≃ 7 - 8. Most of the redshifts at which we probe our theoretical model are in the first half of reionization, where the bubble sizes are smaller, and we expect the 21 cm field to be more perturbative. We also display the characteristic bubble size in the same figure, identified through the mean-free path analysis of Ref. (2024MNRAS.tmp.1328N, ). Towards the end of reionization xHI<0.5subscript𝑥HI0.5x_{\rm HI}<0.5italic_x start_POSTSUBSCRIPT roman_HI end_POSTSUBSCRIPT < 0.5, the bubbles expand exponentially, increasing the scale range affected by non-local physics. Identifying the scale of non-locality is crucial for perturbative models, as it indicates the regime in which we expect the model to break.

Refer to caption
Figure 1: Reionization history of the Thesan-1 simulation, shown as the mean neutral hydrogen fraction as a function of redshift. The blue diamond markers indicate the redshifts at which we test the perturbative method developed in this work. The right axis shows the characteristic bubble size (median and 16th to 84th percentile band), identified through the mean-free path analysis of Ref. (2024MNRAS.tmp.1328N, ). Towards the end of reionization xHI<0.5subscript𝑥HI0.5x_{\rm HI}<0.5italic_x start_POSTSUBSCRIPT roman_HI end_POSTSUBSCRIPT < 0.5, the bubbles expand exponentially, increasing the scale range affected by non-local physics.

II.2 Theoretical model

In this section, we summarize our approach for modeling the 21 cm signal, which is inspired by the treatment of biased tracers in Lagrangian Perturbation Theory (LPT). For a review of perturbation theory, see Ref. 2002PhR…367….1B ; 2018PhR…733….1D ; 2020moco.book…..D .

In the Lagrangian picture, we work with infinitesimal fluid elements labeled by their initial (Lagrangian) positions 𝒒𝒒\bm{q}bold_italic_q. Their dynamics are encoded in a displacement vector 𝚿(𝒒,η)𝚿𝒒𝜂\bm{\Psi}(\bm{q},\eta)bold_Ψ ( bold_italic_q , italic_η ), sourced by the gravitational potential and defined such that the Eulerian (comoving) positions 𝒙𝒙\bm{x}bold_italic_x of the fluid element at some conformal time η𝜂\etaitalic_η is 𝒙(𝒒,η)=𝒒+𝚿(𝒒,η)𝒙𝒒𝜂𝒒𝚿𝒒𝜂\bm{x}(\bm{q},\eta)=\bm{q}+\bm{\Psi}(\bm{q},\eta)bold_italic_x ( bold_italic_q , italic_η ) = bold_italic_q + bold_Ψ ( bold_italic_q , italic_η ) 2015JCAP…09..014V ; 2014JCAP…05..022P . The 3D distribution of cosmological observables such as the galaxy field or the 21cm signal is, in general, determined by complex small-scale astrophysical processes and their response to the large-scale matter distribution (2018PhR…733….1D, ).

We can approximate the behavior of any field obeying simple symmetries such as rotation invariance and the equivalence principle in terms of the matter density, velocity gradients and tidal fields in a neighborhood around its trajectory, i.e., within the scale of ‘nonlocality’. In the case of galaxies, this roughly corresponds to the size of the Lagrangian halo radius, since the dominant physical process is the gravitational collapse of the Lagrangian patch into a halo. On the other hand, the 21cm signal is sensitive to potentially larger nonlocalities resulting from the mean free path of ionizing photons, which can travel significant distances in the absence of absorption. In that case, the size of the nonlocality is roughly set by the size of the ionized bubbles. Thus, at higher redshifts, when the bubbles are smaller, we can use perturbation theory to model the signal over a wider range of scales 2018JCAP…10..016M .

We can thus write the dependence of the 21cm signal along its trajectory as an expansion to second order in the initial conditions adopting the Lagrangian EFT framework 2009JCAP…08..020M ; 2015JCAP…11..007S ; 2014JCAP…08..056A ; 2016JCAP…12..007V :

F(𝒒)=𝐹𝒒absent\displaystyle F(\bm{q})=italic_F ( bold_italic_q ) = 1+b1δL(𝒒)+b2(δL2(𝒒)δL2)1subscript𝑏1subscript𝛿𝐿𝒒subscript𝑏2superscriptsubscript𝛿𝐿2𝒒delimited-⟨⟩superscriptsubscript𝛿𝐿2\displaystyle 1+b_{1}\delta_{L}(\bm{q})+b_{2}\big{(}\delta_{L}^{2}(\bm{q})-% \langle\delta_{L}^{2}\rangle\big{)}1 + italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ( bold_italic_q ) + italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( bold_italic_q ) - ⟨ italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⟩ ) (1)
+bs(sL2(𝒒)sL2)+b2δL(𝒒),subscript𝑏𝑠superscriptsubscript𝑠𝐿2𝒒delimited-⟨⟩superscriptsubscript𝑠𝐿2subscript𝑏superscript2subscript𝛿𝐿𝒒\displaystyle+b_{s}\big{(}s_{L}^{2}(\bm{q})-\langle s_{L}^{2}\rangle\big{)}+b_% {\nabla}\nabla^{2}\delta_{L}(\bm{q}),+ italic_b start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( bold_italic_q ) - ⟨ italic_s start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ⟩ ) + italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ( bold_italic_q ) ,

where b1subscript𝑏1b_{1}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, b2subscript𝑏2b_{2}italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, bssubscript𝑏𝑠b_{s}italic_b start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and bsubscript𝑏b_{\nabla}italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT are free bias parameters and sL2=sijsijsuperscriptsubscript𝑠𝐿2subscript𝑠𝑖𝑗superscript𝑠𝑖𝑗s_{L}^{2}=s_{ij}s^{ij}italic_s start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_s start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT italic_i italic_j end_POSTSUPERSCRIPT is the shear/tidal field, with sij(ii/2δij/3)δLsubscript𝑠𝑖𝑗subscript𝑖subscript𝑖superscript2subscript𝛿𝑖𝑗3subscript𝛿𝐿s_{ij}\equiv(\partial_{i}\partial_{i}/\partial^{2}-\delta_{ij}/3)\ \delta_{L}italic_s start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ≡ ( ∂ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∂ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT / ∂ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_δ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT / 3 ) italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT. Note that the functional F(𝒒)𝐹𝒒F(\bm{q})italic_F ( bold_italic_q ) can only depend on scalar combinations of the density field and the tidal field. Nonlocality effects are handled by performing a Taylor expansion of the δLsubscript𝛿𝐿\delta_{L}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT field around 𝒒𝒒\bm{q}bold_italic_q. To lowest order, this calls for the inclusion of 2δLsuperscript2subscript𝛿𝐿\nabla^{2}\delta_{L}∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT. We note that on nonlocal scales, i.e., smaller than the bubble size, the Lagrangian approximation breaks down.

The functional can then be advected to the real-space (Eulerian) position 𝒙𝒙\bm{x}bold_italic_x 2008PhRvD..77f3530M :

1+δ21(𝒙)=d3𝒒F(𝒒)δD(𝒙𝒒𝚿(𝒒))1subscript𝛿21𝒙superscript𝑑3𝒒𝐹𝒒superscript𝛿𝐷𝒙𝒒𝚿𝒒1+\delta_{\rm 21}(\bm{x})=\int d^{3}\bm{q}\,F(\bm{q})\,\delta^{D}(\bm{x}-\bm{q% }-\bm{\Psi}(\bm{q}))1 + italic_δ start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ( bold_italic_x ) = ∫ italic_d start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT bold_italic_q italic_F ( bold_italic_q ) italic_δ start_POSTSUPERSCRIPT italic_D end_POSTSUPERSCRIPT ( bold_italic_x - bold_italic_q - bold_Ψ ( bold_italic_q ) ) (2)

Thus, the 21cm clustering signal contains a dynamics piece, 𝚿𝚿\bm{\Psi}bold_Ψ, as well as a piece depending on the initial conditions, F(𝒒)𝐹𝒒F(\bm{q})italic_F ( bold_italic_q ).

Refer to caption
Figure 2: 21 cm field (top left) and Lagrangian operators (remaining five panels) at z=9𝑧9z=9italic_z = 9 at the fiducial resolution of this analysis, Nmesh=1283subscript𝑁meshsuperscript1283N_{\rm mesh}=128^{3}italic_N start_POSTSUBSCRIPT roman_mesh end_POSTSUBSCRIPT = 128 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT. The Lagrangian operator fields are obtain by advecting the initial condition density fields, δLsubscript𝛿𝐿\delta_{L}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT, δL2subscriptsuperscript𝛿2𝐿\delta^{2}_{L}italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT, sL2subscriptsuperscript𝑠2𝐿s^{2}_{L}italic_s start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT and 2δLsuperscript2subscript𝛿𝐿\nabla^{2}\delta_{L}∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT, via a time-evolved N𝑁Nitalic_N-body simulation. Visible in the 21 cm field are the bubble nucleation sites (dark spots) and the regions of high HI density (lighter spots). Our theoretical model of the 21 cm field, described in Section II.2, is constructed by linear combinations of the five advected fields. We note this is an xy𝑥𝑦xyitalic_x italic_y cross-section of the redshfit-space 21 cm field with LOS along the z𝑧zitalic_z axis.
Refer to caption
Figure 3: Cross- power spectra of all Lagrangian operators in real space, 15 unique combinations in total. Shown here is the comparison between the 2nd order LPT theoretical prediction (computed via velocileptors, dashed lines) and the numerical result (solid lines) obtained using the dark-matter-only counterpart of Thesan-1. We see good agreement between the two for all combinations except those involving the 2δsuperscript2𝛿\nabla^{2}\delta∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ due to numerical instabilities on small scales. These differences are well understood and irrelevant to our later analysis, as they can be reconciled via a renormalization. As the size of the simulation box is small, we see a significant amount of noise especially in the squared operators.

To account for the fact that we observe the signal in redshift space, 𝒔𝒔\bm{s}bold_italic_s, and thus pick up contributions in the form of redshift-space distortions (RSD) due to the peculiar velocity along the LOS, 𝒖=𝒏^(𝒏^𝒗pec)/𝒖^𝒏^𝒏subscript𝒗pec\bm{u}=\hat{\bm{n}}\ (\hat{\bm{n}}\cdot\bm{v}_{\rm pec})/\mathcal{H}bold_italic_u = over^ start_ARG bold_italic_n end_ARG ( over^ start_ARG bold_italic_n end_ARG ⋅ bold_italic_v start_POSTSUBSCRIPT roman_pec end_POSTSUBSCRIPT ) / caligraphic_H, we write the redshift-space signal as 2008PhRvD..78j9901M ; 2019JCAP…03..007V ; 2021JCAP…03..100C :

1+δ21(𝒔)=d3𝒒F(𝒒)δD(𝒔𝒒𝚿s(𝒒)),1subscript𝛿21𝒔superscript𝑑3𝒒𝐹𝒒superscript𝛿𝐷𝒔𝒒subscript𝚿𝑠𝒒1+\delta_{\rm 21}(\bm{s})=\int d^{3}\bm{q}\,F(\bm{q})\,\delta^{D}(\bm{s}-\bm{q% }-\bm{\Psi}_{s}(\bm{q})),1 + italic_δ start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ( bold_italic_s ) = ∫ italic_d start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT bold_italic_q italic_F ( bold_italic_q ) italic_δ start_POSTSUPERSCRIPT italic_D end_POSTSUPERSCRIPT ( bold_italic_s - bold_italic_q - bold_Ψ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( bold_italic_q ) ) , (3)

where 𝚿ssubscript𝚿𝑠\bm{\Psi}_{s}bold_Ψ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is the redshift-space displacement field, defined as:

𝚿s(𝒒)=𝚿+𝒖.subscript𝚿𝑠𝒒𝚿𝒖\bm{\Psi}_{s}(\bm{q})=\bm{\Psi}+\bm{u}.bold_Ψ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( bold_italic_q ) = bold_Ψ + bold_italic_u . (4)

In this work, instead of adopting the usual LPT steps of perturbatively expanding the displacement fields 𝚿𝚿\bm{\Psi}bold_Ψ and 𝚿ssubscript𝚿𝑠\bm{\Psi}_{s}bold_Ψ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, we use an N𝑁Nitalic_N-body simulation as an accurate solver of the dynamics of the matter field. This method has recently been referred to in the literature as Hybrid Effective Field Theory (HEFT) (2020MNRAS.492.5754M, ; 2021JCAP…09..020H, ; 2021MNRAS.505.1422K, ). While in real space, this is a trivial operation as described below, in redshift space, we need to be a bit more careful, as taking directly the peculiar velocity field 𝒖𝒖\bm{u}bold_italic_u and adding it to the displacement field 𝚿𝚿\bm{\Psi}bold_Ψ can incur unwanted contributions from the motions of particles on small scales, also known as Finger-of-God (FoG) effects, which can affect observables such as the power spectrum at the scales of cosmological interest. This is exacerbated by the fact that we use the highest-resolution simulation with 21003 particles, for which such FoG effects are inevitable. To mitigate this effect, we low-pass filter the LOS velocity field 𝒖𝒖\bm{u}bold_italic_u in Fourier space with the following kernel:

β(k)=1tanh[kk0Δk]𝛽𝑘1𝑘subscript𝑘0Δ𝑘\beta(k)=1-\tanh\left[\frac{k-k_{0}}{\Delta k}\right]italic_β ( italic_k ) = 1 - roman_tanh [ divide start_ARG italic_k - italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG roman_Δ italic_k end_ARG ] (5)

We adopt the following values for the free parameters: k0=0.7hMpc1subscript𝑘00.7superscriptMpc1k_{0}=0.7\,h\,{\rm Mpc}^{-1}italic_k start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0.7 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT and Δk=0.1hMpc1Δ𝑘0.1superscriptMpc1\Delta k=0.1\,h\,{\rm Mpc}^{-1}roman_Δ italic_k = 0.1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT and check that our results are unchanged if we vary these parameters by 50%. We then add the low-pass filtered velocities sampled at the location of the DM particles to the displacement field. Our approach for handling redshift-space displacements is not unique, and other methods have been proposed to model the redshift-space clustering of large-scale structure tracers using the HEFT model (see e.g., 2023MNRAS.520.3725P, , for an application on galaxies). A nice property of the field-level HEFT model is that in principle it can be used to model not only the two-point correlation function (power spectrum in Fourier space), but also higher-order statistics such as the 21 cm bispectrum, which are expected to contain much of the information, since the 21 cm field is highly non-Gaussian (2022MNRAS.510.3838W, ).

We summarize the empirical steps we employ in order to obtain the advected operators as follows:

  1. 1.

    We use the initial conditions at zIC=49subscript𝑧IC49z_{\rm IC}=49italic_z start_POSTSUBSCRIPT roman_IC end_POSTSUBSCRIPT = 49 to calculate the 4 fields (δL,δL2,sL2,2δL)subscript𝛿𝐿superscriptsubscript𝛿𝐿2superscriptsubscript𝑠𝐿2superscript2subscript𝛿𝐿(\delta_{L},\delta_{L}^{2},s_{L}^{2},\nabla^{2}\delta_{L})( italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT , italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_s start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT ) on a cubic grid of size 1283superscript1283128^{3}128 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT in Fourier space. We note that we have tested our entire pipeline at a resolution of 10243superscript102431024^{3}1024 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT and find that our results are unchanged at the scales of interest.

  2. 2.

    We then evolve the initial condition fields linearly to some redshift of choice, z𝑧zitalic_z, by tagging each particle at z𝑧zitalic_z by the values of the four initial conditions fields and then performing TSC interpolation, using these tags as weights. Including a case in which we set the weights to one, this yields the following five advected bias operators: (1adv.,δLadv.,δL2,adv.,sL2,adv.,2δLadv.)superscript1advsuperscriptsubscript𝛿𝐿advsuperscriptsubscript𝛿𝐿2advsuperscriptsubscript𝑠𝐿2advsuperscript2superscriptsubscript𝛿𝐿adv(1^{\rm adv.},\delta_{L}^{\rm adv.},\delta_{L}^{2,\ \rm adv.},s_{L}^{2,\ \rm adv% .},\nabla^{2}\delta_{L}^{\rm adv.})( 1 start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT , italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT , italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT , italic_s start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT , ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT ) by assigning each particle a weight given by the value of its position in the initial conditions.

  3. 3.

    Finally, we store all the advected fields at that redshift z𝑧zitalic_z and construct a model of our observable as a linear combination of these operators:

    1+δ211subscript𝛿21\displaystyle 1+\delta_{21}1 + italic_δ start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT =1adv.+b1δLadv.+b2δL2,adv.absentsuperscript1advsubscript𝑏1superscriptsubscript𝛿𝐿advsubscript𝑏2superscriptsubscript𝛿𝐿2adv\displaystyle=1^{\rm adv.}+b_{1}\delta_{L}^{\rm adv.}+b_{2}\delta_{L}^{2,\ \rm adv.}= 1 start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT + italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT + italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT
    +bssL2,adv.+b2δLadv..subscript𝑏𝑠superscriptsubscript𝑠𝐿2advsubscript𝑏superscript2superscriptsubscript𝛿𝐿adv\displaystyle+b_{s}s_{L}^{2,\ \rm adv.}+b_{\nabla}\nabla^{2}\delta_{L}^{\rm adv% .}.+ italic_b start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT + italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT ∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT .

We note that our expansion is performed in the product of the neutral hydrogen fraction and the gas density rather than as a combined expansion in both fields separately, which is more natural from the perspective of perturbation theory applied to large-scale structure 2022PhRvD.106l3506Q ; 2022JCAP…10..007S .

II.3 Hybrid Lagrangian operators

Refer to caption
Figure 4: Cross-correlation coefficient, r𝑟ritalic_r, between the redshift-space 21 cm field and the matter field and advected fields at z=8.3𝑧8.3z=8.3italic_z = 8.3. The correlation with the DM field (red line) peaks at 60% on the largest available scales for our box, k0.1hMpc1similar-to𝑘0.1superscriptMpc1k\sim 0.1\,h\,{\rm Mpc}^{-1}italic_k ∼ 0.1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT then crosses zero around k1hMpc1𝑘1superscriptMpc1k\approx 1\,h\,{\rm Mpc}^{-1}italic_k ≈ 1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, due to bubble formation, and finally peaks on very small scales k6hMpc1𝑘6superscriptMpc1k\approx 6\,h\,{\rm Mpc}^{-1}italic_k ≈ 6 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, where the Universe is mostly ionized and only small pockets of neutral hydrogen survive. Interestingly, the δL2,adv.superscriptsubscript𝛿𝐿2adv\delta_{L}^{\rm 2,\ adv.}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT operator exhibits the strongest correlation (r0.9greater-than-or-equivalent-to𝑟0.9r\gtrsim 0.9italic_r ≳ 0.9) on large scales and then drops steadily on small scales. At this redshift, sL2,adv.subscriptsuperscript𝑠2adv𝐿s^{\rm 2,\ adv.}_{L}italic_s start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT is the least correlated operator on large scales. We note that to create this plot, we use a finer grid size than in the rest of the paper similar-to\sim60 h1Mpcsuperscript1Mpc\,h^{-1}{\rm Mpc}italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Mpc for all operators except for 2δLadv.superscript2superscriptsubscript𝛿𝐿adv\nabla^{2}\delta_{L}^{\rm adv.}∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT, for which we opt for a grid size of 0.5 h1Mpcsuperscript1Mpc\,h^{-1}{\rm Mpc}italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Mpc due to its numerical instability.

The end product of our theoretical model for the 21 cm field is the Lagrangian fields, δLsubscript𝛿𝐿\delta_{L}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT, δL2subscriptsuperscript𝛿2𝐿\delta^{2}_{L}italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT, sL2subscriptsuperscript𝑠2𝐿s^{2}_{L}italic_s start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT and 2δLsuperscript2subscript𝛿𝐿\nabla^{2}\delta_{L}∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT, advected to some redshift of interest using an N𝑁Nitalic_N-body simulation. In this section, we compare their power spectra to a theoretical prediction coming from the second-order LPT code, velocileptors 2020JCAP…07..062C and study how strongly correlated these fields are with our observable, the 21 cm field.

We first visualize the advected fields (Lagrangian operators) as well as the 21 cm field in Fig. 2. These are shown at z=9𝑧9z=9italic_z = 9 when the Universe is 80% neutral and the bubbles have not yet started to coalesce into large ionized regions. Visible in this plot are the bubble nucleation sites, the sharp ionization fronts where ionizing photons get absorbed, and the regions of high density in neutral hydrogen.

Looking at the higher-order Lagrangian operators, we notice that the δL2,adv.superscriptsubscript𝛿𝐿2adv\delta_{L}^{2,\ {\rm adv.}}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT field highlights the regions with high bias, the sL2,adv.superscriptsubscript𝑠𝐿2advs_{L}^{2,\ {\rm adv.}}italic_s start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT emphasizes regions with large anisotropies, and 2δLadv.superscript2superscriptsubscript𝛿𝐿adv\nabla^{2}\delta_{L}^{{\rm adv.}}∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT accentuates the smallest scales aiming to capture non-local effects. We display our observable as well as the Lagrangian operators at the fiducial resolution of this analysis, Nmesh=1283subscript𝑁meshsuperscript1283N_{\rm mesh}=128^{3}italic_N start_POSTSUBSCRIPT roman_mesh end_POSTSUBSCRIPT = 128 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT, which corresponds to a cell size of 0.5h1Mpc0.5superscript1Mpc0.5\,h^{-1}{\rm Mpc}0.5 italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Mpc. This is sufficiently fine to resolve the scales of cosmological interest for future radio interferometer experiments, i.e. Fourier modes smaller than k1hMpc1less-than-or-similar-to𝑘1superscriptMpc1k\lesssim 1\,h\,{\rm Mpc}^{-1}italic_k ≲ 1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, or configuration-space scales larger than r5h1Mpcsimilar-to𝑟5superscript1Mpcr\sim 5\,h^{-1}{\rm Mpc}italic_r ∼ 5 italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Mpc.

We next compare the 15 combinations of cross-power spectra constructed from our five numerically advected fields against the predictions from second-order LPT obtained using the python package velocileptors111https://github.com/sfschen/velocileptors. We compute the power spectrum using 20 bins between kmin=0subscript𝑘min0k_{\rm min}=0italic_k start_POSTSUBSCRIPT roman_min end_POSTSUBSCRIPT = 0 and kmax=1hMpc1subscript𝑘max1superscriptMpc1k_{\rm max}=1\,h\,{\rm Mpc}^{-1}italic_k start_POSTSUBSCRIPT roman_max end_POSTSUBSCRIPT = 1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, and note that the fundamental mode of the box is 2π/Lbox0.1hMpc12𝜋subscript𝐿box0.1superscriptMpc12\pi/L_{\rm box}\approx 0.1\,h\,{\rm Mpc}^{-1}2 italic_π / italic_L start_POSTSUBSCRIPT roman_box end_POSTSUBSCRIPT ≈ 0.1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. We show this comparison at z=9𝑧9z=9italic_z = 9 in Fig. 3, but we find a consistent picture for all other redshifts used in this study.

We see good agreement overall between theory and numerics: terms involving the δLadv.superscriptsubscript𝛿𝐿adv\delta_{L}^{\rm adv.}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT and sL2,adv.superscriptsubscript𝑠𝐿2advs_{L}^{\rm 2,\ adv.}italic_s start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT tend to agree best with theory, whereas terms involving the 2δLadv.superscript2superscriptsubscript𝛿𝐿adv\nabla^{2}\delta_{L}^{\rm adv.}∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT and δL2,adv.superscriptsubscript𝛿𝐿2adv\delta_{L}^{\rm 2,\ adv.}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT exhibit the largest deviations from theory likely because both are particularly sensitive to small-scale noise due to aliasing effects. The numerical noise is particularly pronounced on all scales for the δ𝛿\deltaitalic_δ-squared operators, δL2,adv.superscriptsubscript𝛿𝐿2adv\delta_{L}^{\rm 2,\ adv.}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT and sL2,adv.superscriptsubscript𝑠𝐿2advs_{L}^{\rm 2,\ adv.}italic_s start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT. We note that smoothing of the initial conditions fields before performing the advection helps in reconciling these differences. Thus, the cause of the differences in all of these operators is well-understood from a theoretical point of view and irrelevant to the analysis performed in this work, as it can be diminished via a renormalization.

We are also interested in understanding how correlated the advected fields are with our observable of interest, the 21 cm field δ21subscript𝛿21\delta_{21}italic_δ start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT. To this end, we compute the cross-correlation coefficient between δ21subscript𝛿21\delta_{21}italic_δ start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT and the advected fields, which allows us to quantify the amount of correlation, as follows:

rAB(k)=PAB(k)PAA(k)PBB(k).superscript𝑟𝐴𝐵𝑘superscript𝑃𝐴𝐵𝑘superscript𝑃𝐴𝐴𝑘superscript𝑃𝐵𝐵𝑘r^{AB}(k)=\frac{P^{AB}(k)}{\sqrt{P^{AA}(k)P^{BB}(k)}}.italic_r start_POSTSUPERSCRIPT italic_A italic_B end_POSTSUPERSCRIPT ( italic_k ) = divide start_ARG italic_P start_POSTSUPERSCRIPT italic_A italic_B end_POSTSUPERSCRIPT ( italic_k ) end_ARG start_ARG square-root start_ARG italic_P start_POSTSUPERSCRIPT italic_A italic_A end_POSTSUPERSCRIPT ( italic_k ) italic_P start_POSTSUPERSCRIPT italic_B italic_B end_POSTSUPERSCRIPT ( italic_k ) end_ARG end_ARG . (7)

Qualitatively, we expect the r(k)𝑟𝑘r(k)italic_r ( italic_k ) coefficient to drop sharply on small scales once we approach the typical scale of bubbles at the given redshift, which corresponds to the scale of non-locality. In Fig. 4, we show this quantity at z=8.3𝑧8.3z=8.3italic_z = 8.3 for all five advected fields. We note that qualitatively this plot looks very similar at the higher and lower redshifts we consider, with the higher redshifts yielding generally higher correlation and the lower redshifts generating a lower correlation. This can be easily understood as at these higher redshifts there are fewer bubbles to decorrelate the signal from the underlying density fields as well as the Universe is more linear as structure has barely begun to collapse gravitationally.

While on large scales, k0.1hMpc1similar-to𝑘0.1superscriptMpc1k\sim 0.1\,h\,{\rm Mpc}^{-1}italic_k ∼ 0.1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, we are limited by cosmic variance due to the small volume of the simulation, recently a powerful technique known as Zel’dovich Control Variates has been developed and applied to cosmological observables such as the power spectrum and the correlation function to significantly reduce the sample variance of simulation-measured quantities (2022JCAP…09..059K, ; 2023JCAP…02..008D, ; 2023OJAp….6E..38H, ). This method performs best in the regime where the cross-correlation coefficient r(k)𝑟𝑘r(k)italic_r ( italic_k ) between the observable of interest and the Lagrangian/Zel’dovich operators is close to one, as is the case for the 21 cm field at z7greater-than-or-equivalent-to𝑧7z\gtrsim 7italic_z ≳ 7 (i.e., when the bubble size is smaller than the box, and r(k)𝑟𝑘r(k)italic_r ( italic_k ) is therefore close to one). Since radiative transfer hydrodynamical simulations are prohibitively expensive to run in Gpc-sized boxes, the Control Variates technique could be extremely useful for partially overcoming the volume limitation.

Looking at Fig. 4, we see that the correlation with the DM field peaks at 60% on the largest available scales for our box, k0.1hMpc1similar-to𝑘0.1superscriptMpc1k\sim 0.1\,h\,{\rm Mpc}^{-1}italic_k ∼ 0.1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT then crosses zero around k1hMpc1𝑘1superscriptMpc1k\approx 1\,h\,{\rm Mpc}^{-1}italic_k ≈ 1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, the characteristic bubble size at this redshift, and finally peaks at very small scales k6hMpc1𝑘6superscriptMpc1k\approx 6\,h\,{\rm Mpc}^{-1}italic_k ≈ 6 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. We note that on these scales the Universe is mostly ionized and only small pockets of neutral hydrogen exist due to recombination and self-shielding. Interestingly, these surviving pockets of neutral hydrogen are traced well by the regions of high density. Furthermore, we see that the δL2,adv.superscriptsubscript𝛿𝐿2adv\delta_{L}^{\rm 2,\ adv.}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT operator exhibits the strongest correlation (r0.9greater-than-or-equivalent-to𝑟0.9r\gtrsim 0.9italic_r ≳ 0.9) on large scales and then drops steadily on small scales, crossing zero at k4hMpc1𝑘4superscriptMpc1k\approx 4\,h\,{\rm Mpc}^{-1}italic_k ≈ 4 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, which occurs on smaller scales than the bubble size due to the mode-coupling of Fourier modes. The sL2,adv.subscriptsuperscript𝑠2adv𝐿s^{\rm 2,\ adv.}_{L}italic_s start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT is the least correlated operator on large scales. As will be discussed later, at these redshifts z89similar-to-or-equals𝑧89z\simeq 8-9italic_z ≃ 8 - 9, the linear bias appears to vanish, i.e. b1=1subscript𝑏11b_{1}=-1italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = - 1 in the Lagrangian picture (see Fig. 6), which implies that the higher-order operators dominate, which provides a plausible explanation for the high cross-correlation coefficient between the δL2,adv.subscriptsuperscript𝛿2adv𝐿\delta^{2,{\rm adv.}}_{L}italic_δ start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT field and δ21subscript𝛿21\delta_{\rm 21}italic_δ start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT.

We interpret this due to the fact that the matter displacements are much smaller than the distances photons travel during reionization (i.e., the bubble sizes). In addition, because the sources driving reionization are highly biased, we expect their tidal bias, which is proportional to the 21 cm tidal bias, to be small (2018JCAP…10..016M, ). Finally, the 2δLadv.superscript2superscriptsubscript𝛿𝐿adv\nabla^{2}\delta_{L}^{\rm adv.}∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT behaves similarly to the δLadv.superscriptsubscript𝛿𝐿adv\delta_{L}^{\rm adv.}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT term. This is because 2δLadv.superscript2superscriptsubscript𝛿𝐿adv\nabla^{2}\delta_{L}^{\rm adv.}∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT can be approximated to first order as k2δLadv.superscript𝑘2superscriptsubscript𝛿𝐿advk^{2}\delta_{L}^{\rm adv.}italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT. Thus, the bias parameter associated with that field can be linked to the characteristic bubble size Rsubscript𝑅R_{\ast}italic_R start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT, as bR1/2k1/2similar-tosubscript𝑏superscriptsubscript𝑅12similar-tosuperscriptsubscript𝑘12b_{\nabla}\sim R_{\ast}^{1/2}\sim k_{\ast}^{-1/2}italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT ∼ italic_R start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 / 2 end_POSTSUPERSCRIPT ∼ italic_k start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 / 2 end_POSTSUPERSCRIPT, where ksubscript𝑘k_{\ast}italic_k start_POSTSUBSCRIPT ∗ end_POSTSUBSCRIPT is the scale at which this term surges up (and r(k)𝑟𝑘r(k)italic_r ( italic_k ) reaches zero). To create this plot, we use a finer grid size than used in the rest of the paper: similar-to\sim60 h1Mpcsuperscript1Mpc\,h^{-1}{\rm Mpc}italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Mpc for all operators except for 2δLadv.superscript2superscriptsubscript𝛿𝐿adv\nabla^{2}\delta_{L}^{\rm adv.}∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT, for which we adopt a grid size of 0.5 h1Mpcsuperscript1Mpc\,h^{-1}{\rm Mpc}italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Mpc due to its numerical instability on small scales.

III Results

Refer to caption
Figure 5: Cross-correlation coefficient between halos and the 21 cm field. We split the halos by virial mass into three samples: logMvir>9.5, 10, 10.5subscript𝑀vir9.51010.5\log M_{\rm vir}>9.5,\ 10,\ 10.5roman_log italic_M start_POSTSUBSCRIPT roman_vir end_POSTSUBSCRIPT > 9.5 , 10 , 10.5, where the masses are expressed in units of h1Msuperscript1subscript𝑀direct-producth^{-1}M_{\odot}italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT. Since on large scales the 21 cm signal traces large-scale structure, we expect these fields to be strongly correlated. The cross-correlation is highest for the lowest-mass sample, as it has the lowest bias, i.e. is closest to the matter field that the neutral hydrogen traces, and has the highest number density, i.e. the lowest level of shotnoise, which additionally decorrelates the two fields.

In this section, we present the main findings of this study. We first describe our bias-fitting method and then assess its performance in terms of the cross-correlation coefficient between our model and the 21 cm field and the error power spectrum. To put our findings into context, we also draw a comparison between the thermal noise of a futuristic radio interferometer experiment and the theoretical noise of our model.

III.1 Fitting method

Here we elaborate on the methodology to extract and minimize the stochastic contributions in LSS models by enhancing the accuracy of bias parameter estimates (see PhysRevD.100.043514, ; 2022MNRAS.514.2198K, , for more details). The goal is to estimate the bias parameters bisubscript𝑏𝑖b_{i}italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT of our model by minimizing the difference between the model and the observable δ21subscript𝛿21\delta_{\rm 21}italic_δ start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT. Our result can be expressed through the stochastic field ϵ(𝐤)italic-ϵ𝐤\epsilon(\mathbf{k})italic_ϵ ( bold_k ), which represents residuals after removing deterministic contributions from the model:

ϵ(𝐤)=δ21(𝐤)δm(𝐤)ibi(𝐤)𝒪i(𝐤),italic-ϵ𝐤subscript𝛿21𝐤subscript𝛿𝑚𝐤subscript𝑖subscript𝑏𝑖𝐤subscript𝒪𝑖𝐤\epsilon(\mathbf{k})=\delta_{\rm 21}(\mathbf{k})-\delta_{m}(\mathbf{k})-\sum_{% i}b_{i}(\mathbf{k})\mathcal{O}_{i}(\mathbf{k}),italic_ϵ ( bold_k ) = italic_δ start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ( bold_k ) - italic_δ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_k ) - ∑ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_b start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_k ) caligraphic_O start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_k ) , (8)

We obtain the bias parameters by solving the least squares problem of minimizing the error power spectrum, which leads us to the following expression for the bias transfer functions:

b^i(𝐤)=𝒪i𝒪j1(𝐤)𝒪j(𝐤)[δh(𝐤)δm(𝐤)].subscript^𝑏𝑖𝐤superscriptdelimited-⟨⟩subscript𝒪𝑖subscript𝒪𝑗1𝐤delimited-⟨⟩subscript𝒪𝑗𝐤delimited-[]subscript𝛿𝐤subscript𝛿𝑚𝐤\hat{b}_{i}(\mathbf{k})=\left\langle\mathcal{O}_{i}\mathcal{O}_{j}\right% \rangle^{-1}(\mathbf{k})\left\langle\mathcal{O}_{j}(-\mathbf{k})\left[\delta_{% h}(\mathbf{k})-\delta_{m}(\mathbf{k})\right]\right\rangle.over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_k ) = ⟨ caligraphic_O start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT caligraphic_O start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ⟩ start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( bold_k ) ⟨ caligraphic_O start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( - bold_k ) [ italic_δ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( bold_k ) - italic_δ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_k ) ] ⟩ . (9)

In particular, we do this as follows:

b^i(𝐤)=Mij1(𝐤)Aj(𝐤).subscript^𝑏𝑖𝐤superscriptsubscript𝑀𝑖𝑗1𝐤subscript𝐴𝑗𝐤\hat{b}_{i}(\mathbf{k})=M_{ij}^{-1}(\mathbf{k})A_{j}(\mathbf{k}).over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_k ) = italic_M start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ( bold_k ) italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_k ) . (10)

where Aj(𝐤)subscript𝐴𝑗𝐤A_{j}(\mathbf{k})italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_k ) and Mij(𝐤)subscript𝑀𝑖𝑗𝐤M_{ij}(\mathbf{k})italic_M start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( bold_k ) are defined as

Aj(𝐤)=[𝒪j(𝐱)(δh(𝐱)δm(𝐱))]kb,kb+1,subscript𝐴𝑗𝐤delimited-⟨⟩subscriptdelimited-[]subscript𝒪𝑗𝐱subscript𝛿𝐱subscript𝛿𝑚𝐱subscript𝑘bsubscript𝑘b+1A_{j}(\mathbf{k})=\left\langle[\mathcal{O}_{j}(\mathbf{x})(\delta_{h}(\mathbf{% x})-\delta_{m}(\mathbf{x}))]_{k_{\text{b}},k_{\text{b+1}}}\right\rangle,italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_k ) = ⟨ [ caligraphic_O start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_x ) ( italic_δ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT ( bold_x ) - italic_δ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ( bold_x ) ) ] start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT b end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT b+1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⟩ , (11)
=kb<|𝐤|<kb+1d3k(2π)3𝒪j(𝐤)[δhδm](𝐤),absentsubscriptsubscript𝑘b𝐤subscript𝑘b+1superscript𝑑3𝑘superscript2𝜋3subscript𝒪𝑗𝐤superscriptdelimited-[]subscript𝛿subscript𝛿𝑚𝐤=\int_{k_{\text{b}}<{|\mathbf{k}|<k_{\text{b+1}}}}\frac{d^{3}k}{(2\pi)^{3}}% \mathcal{O}_{j}(\mathbf{k})[\delta_{h}-\delta_{m}]^{*}(\mathbf{k}),= ∫ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT b end_POSTSUBSCRIPT < | bold_k | < italic_k start_POSTSUBSCRIPT b+1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_d start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_k end_ARG start_ARG ( 2 italic_π ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG caligraphic_O start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_k ) [ italic_δ start_POSTSUBSCRIPT italic_h end_POSTSUBSCRIPT - italic_δ start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT ] start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_k ) , (12)

and

Mij(𝐤)=[𝒪i(𝐱)𝒪j(𝐱)]kb,kb+1,subscript𝑀𝑖𝑗𝐤delimited-⟨⟩subscriptdelimited-[]subscript𝒪𝑖𝐱subscript𝒪𝑗𝐱subscript𝑘bsubscript𝑘b+1M_{ij}(\mathbf{k})=\left\langle[\mathcal{O}_{i}(\mathbf{x})\mathcal{O}_{j}(% \mathbf{x})]_{k_{\text{b}},k_{\text{b+1}}}\right\rangle,italic_M start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ( bold_k ) = ⟨ [ caligraphic_O start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_x ) caligraphic_O start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ( bold_x ) ] start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT b end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT b+1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⟩ , (13)
=kb<|𝐤|<kb+1d3k(2π)3𝒪i(𝐤)𝒪j(𝐤),absentsubscriptsubscript𝑘b𝐤subscript𝑘b+1superscript𝑑3𝑘superscript2𝜋3subscript𝒪𝑖𝐤superscriptsubscript𝒪𝑗𝐤=\int_{k_{\text{b}}<{|\mathbf{k}|<k_{\text{b+1}}}}\frac{d^{3}k}{(2\pi)^{3}}% \mathcal{O}_{i}(\mathbf{k})\mathcal{O}_{j}^{*}(\mathbf{k}),= ∫ start_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT b end_POSTSUBSCRIPT < | bold_k | < italic_k start_POSTSUBSCRIPT b+1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_d start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT italic_k end_ARG start_ARG ( 2 italic_π ) start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_ARG caligraphic_O start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_k ) caligraphic_O start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ( bold_k ) , (14)

where the subscript b represents the index that runs from 1 to Nbsubscript𝑁bN_{\rm b}italic_N start_POSTSUBSCRIPT roman_b end_POSTSUBSCRIPT, which corresponds to the number of bins into which we have divided our Fourier modes. Once the b^i(𝐤)subscript^𝑏𝑖𝐤\hat{b}_{i}(\mathbf{k})over^ start_ARG italic_b end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_k ) coefficients are obtained, they can be substituted back into our model to estimate the error power spectrum as follows:

Perr(𝐤)ϵ(𝐤)ϵ(𝐤).subscript𝑃err𝐤delimited-⟨⟩italic-ϵ𝐤italic-ϵ𝐤P_{\text{err}}(\mathbf{k})\equiv\left\langle\epsilon(\mathbf{k})\epsilon(-% \mathbf{k})\right\rangle.italic_P start_POSTSUBSCRIPT err end_POSTSUBSCRIPT ( bold_k ) ≡ ⟨ italic_ϵ ( bold_k ) italic_ϵ ( - bold_k ) ⟩ . (15)

While typically in cosmological analyses we marginalize over the bias parameters, they can also be linked to astrophysical quantities relevant to reionization. Following Ref. (2018JCAP…10..016M, ), we can draw the following connections (though we note that our analysis is performed in Lagrangian space):

  • b1subscript𝑏1b_{1}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT constrains the source bias and the global neutral fraction and within a simple model is roughly b1(1xHI)(2+b1s)similar-tosubscript𝑏11subscript𝑥HI2superscriptsubscript𝑏1𝑠b_{1}\sim-(1-x_{\rm HI})(2+b_{1}^{s})italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∼ - ( 1 - italic_x start_POSTSUBSCRIPT roman_HI end_POSTSUBSCRIPT ) ( 2 + italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT ), where b1ssuperscriptsubscript𝑏1𝑠b_{1}^{s}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT is the source bias. If we assume that the source bias varies more slowly than xHIsubscript𝑥HIx_{\rm HI}italic_x start_POSTSUBSCRIPT roman_HI end_POSTSUBSCRIPT, then we roughly expect b1subscript𝑏1b_{1}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT to increase (in absolute magnitude), as we plunge deeper into the EoR.

  • Because reionization is a patchy process and highly correlated with regions of high density (and thus large source bias), the b2subscript𝑏2b_{2}italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT parameter is expected to be quite large and in fact be the dominant term when b10subscript𝑏10b_{1}\approx 0italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≈ 0.

  • bssubscript𝑏𝑠b_{s}italic_b start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is linked to bsssuperscriptsubscript𝑏𝑠𝑠b_{s}^{s}italic_b start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_s end_POSTSUPERSCRIPT of the source and is likely subdominant, as the scale of displacements in the matter field are smaller than the reionization front scale.

  • Finally, the bsubscript𝑏b_{\nabla}italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT bias associated with the 2δLadv.superscript2superscriptsubscript𝛿𝐿adv\nabla^{2}\delta_{L}^{\rm adv.}∇ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT tells us how significant and what scale non-localities occur, which in the case of reionization is related to the characteristic bubble size.

At the end of reionization (z6similar-to𝑧6z\sim 6italic_z ∼ 6), there is some residual neutral hydrogen found in galaxies, and thus the 21 cm field effectively becomes a positively biased tracer of large-scale structure. In this regime, perturbative bias expansion methods have been extensively tested and can be straightforwardly applied to model the late-time 21 cm field 2018arXiv181009572C ; 2019JCAP…09..024M ; 2019JCAP…11..023M ; 2021JCAP…12..049S ; 2023PhRvD.108h3528O ; 2024arXiv240518559F .

III.2 Field-level fits

Refer to caption
Figure 6: Scale-dependent bias parameter, b1(k)subscript𝑏1𝑘b_{1}(k)italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_k ), multiplying the first-order Lagrangian operator δLadv.superscriptsubscript𝛿𝐿adv\delta_{L}^{\rm adv.}italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT in our expansion model outlined in Section II.2. As expected, b1subscript𝑏1b_{1}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, is stable on large scales, k0.4hMpc1less-than-or-similar-to𝑘0.4superscriptMpc1k\lesssim 0.4\,h\,{\rm Mpc}^{-1}italic_k ≲ 0.4 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, and then becomes scale-dependent on smaller scales where the quadratic bias model starts to break down as we reach the scale corresponding to the typical bubble size.

We first explore the scale dependence of the first-order bias parameter b1(k)subscript𝑏1𝑘b_{1}(k)italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_k ) in Fig. 6, which as argued in Section III.1 can be connected to the mean reionization fraction and the source bias. As claimed above, in the case of a nearly constant (in time) source bias, we expect b1subscript𝑏1b_{1}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT to become more and more negative as we go to lower redshifts. This is also what we find in this figure, though we note that these arguments break down towards the end of reionization where the 21 cm field becomes highly anisotropic and non-linear and the bubbles merge to form large ionized regions. We see that as a function of scale, b1subscript𝑏1b_{1}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is stable on scales where k0.4hMpc1less-than-or-similar-to𝑘0.4superscriptMpc1k\lesssim 0.4\,h\,{\rm Mpc}^{-1}italic_k ≲ 0.4 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, corresponding to scales larger than the bubble sizes. This characteristic wave number is larger at high redshifts (k0.8hMpc1less-than-or-similar-to𝑘0.8superscriptMpc1k\lesssim 0.8\,h\,{\rm Mpc}^{-1}italic_k ≲ 0.8 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT at z=10.8𝑧10.8z=10.8italic_z = 10.8) and smaller at low redshifts, at which the neutral regions occupy an increasingly smaller volume, making both the measurement more noisy and the theory more inaccurate.

In Fig. 7, we explore the connection between the characteristic bubble size, Reffsubscript𝑅effR_{\rm eff}italic_R start_POSTSUBSCRIPT roman_eff end_POSTSUBSCRIPT, and bsubscript𝑏b_{\nabla}italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT, which as conjectured in Section III.1 is related to Reffsubscript𝑅effR_{\rm eff}italic_R start_POSTSUBSCRIPT roman_eff end_POSTSUBSCRIPT as b1/3Reff2similar-tosubscript𝑏13superscriptsubscript𝑅eff2b_{\nabla}\sim 1/3R_{\rm eff}^{2}italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT ∼ 1 / 3 italic_R start_POSTSUBSCRIPT roman_eff end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (note that bsubscript𝑏b_{\nabla}italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT is thus expected to be positive). We find that bsubscript𝑏b_{\nabla}italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT is fairly scale independent at high redshifts, where the approximation works quite well. At low redshifts (z7.5less-than-or-similar-to𝑧7.5z\lesssim 7.5italic_z ≲ 7.5), it starts to break down likely because the k2δLsuperscript𝑘2subscript𝛿𝐿k^{2}\delta_{L}italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_δ start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT term struggles to catch up when the bubble size grows exponentially (see Fig. 1).

Refer to caption
Figure 7: Comparison between the theoretical prediction for b1/3Reff2similar-tosubscript𝑏13superscriptsubscript𝑅eff2b_{\nabla}\sim 1/3R_{\rm eff}^{2}italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT ∼ 1 / 3 italic_R start_POSTSUBSCRIPT roman_eff end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (see Section III.1), and its numerical value from the scale-dependent fits to the Thesan-1 21 cm signal. We find that at high redshifts, this approximation works pretty well, but at low redshifts (z7.5less-than-or-similar-to𝑧7.5z\lesssim 7.5italic_z ≲ 7.5), where the bubble size grows exponentially (see Fig. 1), it starts to break down.

We next study the monopole of the redshift-space power spectrum and the cross-correlation coefficient between model and observable in Fig. 8 for the best-fit bias parameters obtained as described in Section III.1. At very high redshifts z129similar-to-or-equals𝑧129z\simeq 12-9italic_z ≃ 12 - 9, the 21 cm power spectrum drops on large scales as ionization bubbles start to form and take out regions with high bias. Around z9similar-to𝑧9z\sim 9italic_z ∼ 9, the power spectrum amplitude picks up again as the density fluctuations in the 21 cm field start to evolve. At the same time, due to the growing reionization fronts, the 21 cm signal begins to downgrade on small scales, leading to the characteristic flattening of P21(k)k3subscript𝑃21𝑘superscript𝑘3P_{21}(k)k^{3}italic_P start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT ( italic_k ) italic_k start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT (see e.g., 2008ApJ…680..962L, , for a discussion). This is exactly what we observe in the top panel. We also see that the perturbative model provides a good description of the true 21 cm power spectrum on large scales and breaks down as we reach the scales of non-locality, where the model power spectrum becomes deficient. As expected, our model is more successful at high redshifts compared with low redshifts.

Luckily, the cosmological information to be gained from these lower-redshift slices is likely quite negligible. One can understand this by noticing that the 21 cm power spectrum, P21subscript𝑃21P_{\rm 21}italic_P start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT, is very flat towards the end of reionization (see Fig. 8), so the power spectrum is effectively ‘featureless’ and all we can extract is an amplitude (2008ApJ…680..962L, ). This is also made worse by the fact that current and near-future experiments will only measure the signal in a few bandpowers, thus additionally smearing any features that could otherwise yield information. A lot of the power found on small scales is likely due to non-linear astrophysical effects related to the growing and merging population of ionization bubbles. It is thus uncorrelated with the large-scale density field and effectively contributes ‘noise’ from the perspective of our perturbation model. We verify that indeed at z6.5𝑧6.5z\approx 6.5italic_z ≈ 6.5 the correlation between the dark matter field and the 21 cm signal drops off sharply (and more so than at higher redshifts) at k0.2hMpc1greater-than-or-equivalent-to𝑘0.2superscriptMpc1k\gtrsim 0.2h{\rm Mpc}^{-1}italic_k ≳ 0.2 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, providing evidence in support of this conjecture. For completeness, we provide the wavenumber values, kr=0.5subscript𝑘𝑟0.5k_{r=0.5}italic_k start_POSTSUBSCRIPT italic_r = 0.5 end_POSTSUBSCRIPT, at which the cross-correlation coefficient with the DM field, r(k)𝑟𝑘r(k)italic_r ( italic_k ), drops below 0.5: kr=0.50.4h1Mpcsubscript𝑘𝑟0.50.4superscript1Mpck_{r=0.5}\approx 0.4\,h^{-1}{\rm Mpc}italic_k start_POSTSUBSCRIPT italic_r = 0.5 end_POSTSUBSCRIPT ≈ 0.4 italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Mpc at z=6.5𝑧6.5z=6.5italic_z = 6.5, kr=0.50.6h1Mpcsubscript𝑘𝑟0.50.6superscript1Mpck_{r=0.5}\approx 0.6\,h^{-1}{\rm Mpc}italic_k start_POSTSUBSCRIPT italic_r = 0.5 end_POSTSUBSCRIPT ≈ 0.6 italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Mpc at z=7.5𝑧7.5z=7.5italic_z = 7.5, kr=0.52h1Mpcsubscript𝑘𝑟0.52superscript1Mpck_{r=0.5}\approx 2\,h^{-1}{\rm Mpc}italic_k start_POSTSUBSCRIPT italic_r = 0.5 end_POSTSUBSCRIPT ≈ 2 italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT roman_Mpc at z=10.8𝑧10.8z=10.8italic_z = 10.8, and note that kr=0.5subscript𝑘𝑟0.5k_{r=0.5}italic_k start_POSTSUBSCRIPT italic_r = 0.5 end_POSTSUBSCRIPT increases smoothly with redshift.

The cross-correlation coefficient between our model and the 212121\,21cm field exhibits a similar trend (Fig. 8). On large scales, we see that all redshifts display an almost perfect correlation r1𝑟1r\approx 1italic_r ≈ 1, with the z=10.8𝑧10.8z=10.8italic_z = 10.8 curve being consistently larger than r>0.9𝑟0.9r>0.9italic_r > 0.9 all the way until k1hMpc1similar-to𝑘1superscriptMpc1k\sim 1\,h\,{\rm Mpc}^{-1}italic_k ∼ 1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. Even at low redshifts, close to the end of reionization, the cross-correlation coefficient is still very close to one up until k0.3hMpc1similar-to𝑘0.3superscriptMpc1k\sim 0.3\,h\,{\rm Mpc}^{-1}italic_k ∼ 0.3 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. This indicates that our model does not simply match the power spectrum out of shear luck but does so at the field level, matching the phases of the 21 cm Fourier modes. This is important, because the bias parameters affect only the amplitudes of the terms, not their phases. There is a significant amount of noise in all curves, which we attribute to the limited volume of the simulation. We expect r(k)𝑟𝑘r(k)italic_r ( italic_k ) to be more stable and consistently closer to one in a larger-box simulation. We note that when r𝑟ritalic_r becomes small, r(k)<1𝑟𝑘1r(k)<1italic_r ( italic_k ) < 1, then we can say that the 21 cm field is not faithfully tracing large-scale structure in the Universe.

Refer to caption
Refer to caption
Figure 8: Monopole of the redshift-space power spectrum (top) and cross-correlation coefficient (bottom) of the 21 cm field and our model, described in Section II.2. Top panel: overall the amplitude of the power spectrum increases on large scales with time as the fluctuations in the matter field continue to evolve linearly. At the same time, bubble formation also causes the power spectrum to drop on large scales, explaining the behavior of the z=10.8𝑧10.8z=10.8italic_z = 10.8 curve relative to the rest. Visually, our model (dashed line) provides a good description of the true 21 cm power spectrum down to small scales, k0.6hMpc1similar-to𝑘0.6superscriptMpc1k\sim 0.6\,h\,{\rm Mpc}^{-1}italic_k ∼ 0.6 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, at high redshifts and starts to deviate from the truth earlier at low redshifts. Bottom panel: consequently, we see this reflected in the cross-correlation coefficient. On large scales we see all redshifts displaying an almost perfect correlation r1𝑟1r\approx 1italic_r ≈ 1, with the z=10.8𝑧10.8z=10.8italic_z = 10.8 curve being consistently larger than r>0.9𝑟0.9r>0.9italic_r > 0.9 on the scales shown, indicating that our model is successful at recovering the scales relevant for cosmological analyses. There is also a significant amount of noise in all curves due to the limited volume of the simulation. The vertical dotted lines correspond to 1/Reff1subscript𝑅eff1/R_{\rm eff}1 / italic_R start_POSTSUBSCRIPT roman_eff end_POSTSUBSCRIPT at each redshift and indicates the scale of non-locality due to bubble growth.

In addition to the lower cross-correlation coefficient with the matter field, we also verify that this holds for its cross-correlation coefficient with halos (and galaxies, by extension), as expected, since they are biased tracers of large-scale structure. This needs to be taken into account for science cases that rely on cross-correlation of 212121\,21cm surveys with surveys of galaxies or star-formation-based line emission. We show the cross-correlation coefficient between halos and the 21 cm field at z=8.3𝑧8.3z=8.3italic_z = 8.3 in Fig. 5. We split the halos by virial mass into three samples: logMvir>9.5, 10, 10.5subscript𝑀vir9.51010.5\log M_{\rm vir}>9.5,\ 10,\ 10.5roman_log italic_M start_POSTSUBSCRIPT roman_vir end_POSTSUBSCRIPT > 9.5 , 10 , 10.5, where the masses are expressed in units of h1Msuperscript1subscript𝑀direct-producth^{-1}M_{\odot}italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT, corresponding to number densities of nhalo5×102subscript𝑛halo5superscript102n_{\rm halo}\approx 5\times 10^{-2}italic_n start_POSTSUBSCRIPT roman_halo end_POSTSUBSCRIPT ≈ 5 × 10 start_POSTSUPERSCRIPT - 2 end_POSTSUPERSCRIPT, 7×1037superscript1037\times 10^{-3}7 × 10 start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT, and 7×104h3Mpc37superscript104superscript3superscriptMpc37\times 10^{-4}h^{3}{\rm Mpc}^{-3}7 × 10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT italic_h start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT roman_Mpc start_POSTSUPERSCRIPT - 3 end_POSTSUPERSCRIPT, respectively. The cross-correlation is highest for the lowest-mass sample, as it has the lowest bias, i.e. is closest to the matter field that the neutral hydrogen traces, and has the highest number density, i.e. the lowest level of shotnoise, which additionally decorrelates the two fields. Additionally, the 109.5h1Msuperscript109.5superscript1subscript𝑀direct-product10^{9.5}\ h^{-1}M_{\odot}10 start_POSTSUPERSCRIPT 9.5 end_POSTSUPERSCRIPT italic_h start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT ⊙ end_POSTSUBSCRIPT threshold corresponds to roughly a virial temperature of 104.1Ksuperscript104.1K10^{4.1}\ {\rm K}10 start_POSTSUPERSCRIPT 4.1 end_POSTSUPERSCRIPT roman_K, which is close to the expected virial temperature of 104Ksuperscript104K10^{4}\ {\rm K}10 start_POSTSUPERSCRIPT 4 end_POSTSUPERSCRIPT roman_K to form ionizing bubbles (2022GReGr..54..102C, ). An open question in EoR studies is the connection between the properties of the ionizing halos and the characteristic bubble size. As proposed in Ref. (2005MNRAS.363.1031F, ), the mean free path of ionizing photons and thus the characteristic bubble size are likely related to the number density and mass of the photon sinks. It is thus through cross-correlations between the 21 cm emission signal and large-scale structure tracers such as galaxies and quasars that we can reveal information about the UV radiation sources such as their characteristic mass.

Due to this limitation in the number of Fourier modes on large scales, we also find a significant amount of noise in the ratio between the error power spectrum (of the difference between model and observable) and the observable power spectrum, studied in Fig. 9. We find that the error is smallest on large scales, as expected, and is around similar-to\sim1%. It then rapidly increases as we push to smaller scales near the scale of non-locality at that redshift. Generally, the highest redshifts have the smallest error power spectra, with the z=10.8𝑧10.8z=10.8italic_z = 10.8 curve reaching the 10% mark only at k0.8hMpc1𝑘0.8superscriptMpc1k\approx 0.8\,h\,{\rm Mpc}^{-1}italic_k ≈ 0.8 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT. Moreover, we note that the error on the power spectrum is well-behaved even on small scales, indicating that our method might be well-suited for analysis even on scales beyond k0.5hMpc1greater-than-or-equivalent-to𝑘0.5superscriptMpc1k\gtrsim 0.5\,h\,{\rm Mpc}^{-1}italic_k ≳ 0.5 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT and overall seems to perform better in that regime than pure perturbative methods (2018JCAP…10..016M, ; 2022PhRvD.106l3506Q, ; 2022JCAP…10..007S, ), where the error tends to increase exponentially on these scales. We do not find a strong dependence of this ratio on the angle with respect to the LOS, suggesting that our method for handling redshift-space distortions is adequate. We thus opt to only show the monopole result, as k𝑘kitalic_k-μ𝜇\muitalic_μ slices suffer from a significant amount of cosmic variance. We note that the breakdown of the bias expansion towards the end of reionization also suggests that the (small) box does not contain a representative sample of bubble morphologies and sizes at that epoch, i.e.  keffsubscript𝑘effk_{\rm eff}italic_k start_POSTSUBSCRIPT roman_eff end_POSTSUBSCRIPT is approaching kfund=2π/Lboxsubscript𝑘fund2𝜋subscript𝐿boxk_{\rm fund}=2\pi/L_{\rm box}italic_k start_POSTSUBSCRIPT roman_fund end_POSTSUBSCRIPT = 2 italic_π / italic_L start_POSTSUBSCRIPT roman_box end_POSTSUBSCRIPT at late times. In fact, as seen in Fig. 1, by z6𝑧6z\approx 6italic_z ≈ 6, the bubbles cover almost the entire volume of the Thesan-1 simulation, i.e. ReffLboxsimilar-tosubscript𝑅effsubscript𝐿boxR_{\rm eff}\sim L_{\rm box}italic_R start_POSTSUBSCRIPT roman_eff end_POSTSUBSCRIPT ∼ italic_L start_POSTSUBSCRIPT roman_box end_POSTSUBSCRIPT.

Refer to caption
Figure 9: Ratio between the error power spectrum (i.e., of the difference between model and observable) and the power spectrum of our observable, the redshift-space 21 cm field, averaged over the LOS angle (monopole). The black dashed curve serves to guide the eye and indicate the 10% error threshold. As expected, the error is smallest on large scales (similar-to\sim1%) and increases on small scales, with the highest redshifts exhibiting more perturbative behavior than the lowest ones. Moreover, we note that the error on the power spectrum is behaved well even on small scales, indicating that our method might be well-suited for analysis even on scales beyond k0.5hMpc1greater-than-or-equivalent-to𝑘0.5superscriptMpc1k\gtrsim 0.5\,h\,{\rm Mpc}^{-1}italic_k ≳ 0.5 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT in contrast with pure perturbative methods. We note that there is a significant amount of noise on large scales due to the limited number of Fourier modes.

III.3 Survey realism

In this section, we present a comparison between the thermal noise of a futuristic experiment and the theoretical noise associated with our model. The goal is to qualitatively explore the question of whether our model is sufficiently good at predicting the 21 cm field signal at the level that meets the sensitivity of next-generation experiments. We note that the frequency coverage of existing and planned experiments is sufficient to encompass most of the relevant range of redshifts. In order to probe the regime of k0.1hMpc1similar-tosubscript𝑘perpendicular-to0.1superscriptMpc1k_{\perp}\sim 0.1\,h\,\mathrm{Mpc}^{-1}italic_k start_POSTSUBSCRIPT ⟂ end_POSTSUBSCRIPT ∼ 0.1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT at z10similar-to𝑧10z\sim 10italic_z ∼ 10, we need hundred-meter baseline interferometers, which next-generation facilities aim to provide. However, at these intermediate scales and high redshifts, we are sky noise dominated, so the exact baseline distribution is of secondary importance, and we are mostly sensitive to the total collecting area and the integration time. The minimum ksubscript𝑘parallel-tok_{\parallel}italic_k start_POSTSUBSCRIPT ∥ end_POSTSUBSCRIPT is limited by the ability of the experiment to subtract foregrounds that are many orders of magnitude brighter than the signal. We note that in this analysis, we ignore other sources of contamination from e.g., Galactic and extragalactic foregrounds such as synchrotron radiation and radio point sources as well as non-uniform instrument noise such as cross-talk, confusion from spurious Earth and electronics radio signals, which can have a spatially anisotropic contribution and thus be confused with the 21 cm signal, especially for large incidence angles, an effect referred to as ‘foreground leakage.’

Refer to caption
Refer to caption
Figure 10: Comparison between the error and the thermal noise ratio with respect to the 21 cm power spectrum for the two slices: μ=0.1𝜇0.1\mu=0.1italic_μ = 0.1 and μ=0.95𝜇0.95\mu=0.95italic_μ = 0.95, at three different redshifts. It is evident that for the futuristic HERA-like experiment described in this work (and ignoring the presence of foreground leakage and other systematic errors), the thermal noise is comparable with the theory error, indicating that our model provides a viable description of the 21 cm field on the scales accessible by terrestrial radio interferometers.

Here, we consider a baseline interferometer instrument that covers 10% of the sky with a cadence of Δz=0.2Δ𝑧0.2\Delta z=0.2roman_Δ italic_z = 0.2 at 7<z<107𝑧107<z<107 < italic_z < 10 and a hexagonally packed array of 103superscript10310^{3}10 start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT 14 m dishes. This design is in effect a scaled-up version of the Hydrogen Epoch of Reionization Array (HERA 2017PASP..129d5001D, ). The integration time we consider is 2 years (inversely proportional to the thermal noise), and we adopt a power spectrum binning of Δk=0.05Δ𝑘0.05\Delta k=0.05roman_Δ italic_k = 0.05, Δμ=0.01Δ𝜇0.01\Delta\mu=0.01roman_Δ italic_μ = 0.01. In this analysis, we ignore all contributions to the noise budget apart from thermal noise, and we direct the reader to Ref. (2012ApJ…752..137M, ) for a more thorough discussion of ‘foreground wedges’. We compute the thermal noise power spectrum following Appendix D (see Eq. D4) of Ref. 2018arXiv181009572C (see also Refs. 2015ApJ…803…21B ; 2010ApJ…721..164S ) with the baseline distribution calculated with 21cmSense222https://github.com/steven-murray/21cmSense. We then divide the thermal noise power spectrum by the mean 21cm brightness temperature squared to convert from (temperature)2×volumevolumesuperscripttemperature2volumevolume({\rm temperature})^{2}\times{\rm volume}\to{\rm volume}( roman_temperature ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT × roman_volume → roman_volume units. We note that because the foregrounds vary smoothly in frequency, most of their power is confined to large-scale modes parallel to the line of sight, so we typically filter out the lowest ksubscript𝑘parallel-tok_{\parallel}italic_k start_POSTSUBSCRIPT ∥ end_POSTSUBSCRIPT, which removes the bulk of the foreground contamination.

As noted above, we do not explicitly model a foreground ‘wedge’. Our understanding of instrument calibration and data analysis techniques for minimizing the influence of the wedge are evolving rapidly, so any choice would be quickly rendered obsolete. We will note that in the presence of a significant wedge only the high k𝑘kitalic_k and high μ𝜇\muitalic_μ regime would be accessible. This is unfortunately where the cosmological information is minimized compared to the complex details of the reionization process and where our models perform the worst. Thus, qualitatively, our ability to extract large-scale structure information depends upon our success in handling the wedge. We direct the reader to Refs. (2014PhRvD..90b3019L, ; 2015MNRAS.447.1705P, ; 2016MNRAS.456.3142S, ; 2020PASP..132f2001L, ; 2022ApJ…925..221A, ) for extensive discussions of foreground issues in current and future experiments.

In Fig. 10, we show the comparison between the error and the thermal noise ratio with respect to the 21 cm power spectrum for the two slices: μ=0.1𝜇0.1\mu=0.1italic_μ = 0.1 and μ=0.95𝜇0.95\mu=0.95italic_μ = 0.95, at three different redshifts. It is evident that for the futuristic HERA-like experiment described in this work (and ignoring the presence of foreground leakage and other systematic errors), the thermal noise, which is the absolute minimum noise threshold, is comparable with the error of the model we propose. This indicates that the HEFT method developed in this work provides a viable description of the 21 cm field on the scales accessible by terrestrial radio interferometers, and even in idealized conditions (of no foregrounds) can be utilized to analyze the signal with a high degree of success.

IV Discussion and conclusions

In this work, we have developed a hybrid effective field theory (HEFT) model that combines N𝑁Nitalic_N-body simulations and a quadratic bias model to describe the redshift-space spatial fluctuations in the 21 cm radiation field during the epoch of reionization. We compare our model predictions to the radiative transfer hydrodynamical simulation Thesan and demonstrate that the 21 cm signal is well-approximated by our HEFT model over the observationally relevant wavenumber range. There exist other methods for modeling the 21 cm signal, including directly through the use of radiative transfer hydrodynamical simulations (2017MNRAS.466..960P, ; 2022MNRAS.514.3857K, ), semi-analytic (2022MNRAS.511.3657M, ), semi-numerical (2022ApJ…927..186T, ), and purely perturbative bias expansion (2019MNRAS.487.3050H, ; 2018JCAP…10..016M, ; 2022PhRvD.106l3506Q, ) models. Our approach performs a symmetries-based bias expansion in Lagrangian space and advects the initial conditions fields by directly looking up the non-linear displacements in an N𝑁Nitalic_N-body simulation. Our method thus takes advantage of the fact that despite the complexity of the physical processes that govern reionization, the 21 cm field must still obey certain symmetries (e.g. rotational invariance and the equivalence principle), allowing us to construct a model that is valid on scales larger than the typical size of an ionized bubble.

In particular, we study the performance of our model in terms of metrics relevant to cosmological analyses such as power spectrum, cross-correlation coefficient and model error at the field-level. The performance at the power spectrum and correlation function level is summarized in Fig. 8 for 6 different redshifts, ranging from z=6.5𝑧6.5z=6.5italic_z = 6.5 and 10.8, with corresponding neutral hydrogen fraction of 0.92 and 0.3. Visually, we see excellent cross-correlation on the largest scales available in the Thesan simulation (k0.1hMpc1𝑘0.1superscriptMpc1k\approx 0.1\,h\,\mathrm{Mpc}^{-1}italic_k ≈ 0.1 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT), which rapidly falls off on small scales. As expected, at the dawn of reionization, when the bubble sizes are much smaller, the HEFT model works very well. As can be seen in Fig. 9 where we study the error power spectrum ratio, even deep into reionization our model is capable of recovering the 21 cm power spectrum well. We show that our method works best at high redshifts and on large scales, where we find agreement between model and the observable at less-than-or-similar-to\lesssim 10% down to k0.5hMpc1less-than-or-similar-to𝑘0.5superscriptMpc1k\lesssim 0.5\,h\,{\rm Mpc}^{-1}italic_k ≲ 0.5 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT, with that wavenumber increasing to k0.8hMpc1𝑘0.8superscriptMpc1k\approx 0.8\,h\,{\rm Mpc}^{-1}italic_k ≈ 0.8 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT at z=10.8𝑧10.8z=10.8italic_z = 10.8 and dropping down to k0.2hMpc1𝑘0.2superscriptMpc1k\approx 0.2\,h\,{\rm Mpc}^{-1}italic_k ≈ 0.2 italic_h roman_Mpc start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT at z=6.5𝑧6.5z=6.5italic_z = 6.5.

Refer to caption
Figure 11: Evolution of the Eulerian bias coefficients, obtained by converting our best-fit Lagrangian biases to the Eulerian picture according to App. A of Ref. (2024arXiv240518559F, ). Qualitatively, we find very similar trends to Ref. (2022PhRvD.106l3506Q, ) for the parameters b1Esuperscriptsubscript𝑏1Eb_{1}^{\rm E}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_E end_POSTSUPERSCRIPT, b2Esuperscriptsubscript𝑏2Eb_{2}^{\rm E}italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_E end_POSTSUPERSCRIPT and bsEsuperscriptsubscript𝑏𝑠Eb_{s}^{\rm E}italic_b start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_E end_POSTSUPERSCRIPT, while bEsuperscriptsubscript𝑏Eb_{\nabla}^{\rm E}italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_E end_POSTSUPERSCRIPT grows steadily as the bubble size squared in our model and is exponentially increasing and negative in the model of Ref. (2022PhRvD.106l3506Q, ). Remaining differences may be attributed to renormalization and scale cut choices.

We compare our findings for the bias parameter evolution to Ref. (2022PhRvD.106l3506Q, ), which develops a Standard perturbation theory and tests it against the Thesan suite of simulations. In Fig. 11, we display the evolution of the Eulerian bias coefficients, obtained by converting our best-fit Lagrangian biases to the Eulerian picture biases according to App. A of Ref. (2024arXiv240518559F, ). Qualitatively, we find very similar trends to Ref. (2022PhRvD.106l3506Q, ). Namely, b1Esuperscriptsubscript𝑏1Eb_{1}^{\rm E}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_E end_POSTSUPERSCRIPT is close to zero at high redshifts and becomes increasingly more negative as reionization progresses. b2Esuperscriptsubscript𝑏2Eb_{2}^{\rm E}italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_E end_POSTSUPERSCRIPT is negative throughout the EoR and is largest in terms of its absolute value. This also provides a plausible explanation for the high cross-correlation coefficient between δL2subscriptsuperscript𝛿2𝐿\delta^{2}_{L}italic_δ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_L end_POSTSUBSCRIPT and δ21subscript𝛿21\delta_{\rm 21}italic_δ start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT we find in Fig. 4. As expected from previous arguments in Section III.1, bsEsuperscriptsubscript𝑏𝑠Eb_{s}^{\rm E}italic_b start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_E end_POSTSUPERSCRIPT is positive and small in magnitude. Specifically, bEsuperscriptsubscript𝑏Eb_{\nabla}^{\rm E}italic_b start_POSTSUBSCRIPT ∇ end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_E end_POSTSUPERSCRIPT differs the most: we find that in our model it is proportional to the bubble size squared and increases steadily in magnitude, whereas in Ref. (2022PhRvD.106l3506Q, ) it is negative and grows exponentially. We attribute remaining differences to renormalization choices and scale cuts used in the fits. The relatively large values of b2subscript𝑏2b_{2}italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT compared with b1subscript𝑏1b_{1}italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT are not concerning, as the |δadv.|superscript𝛿adv|\delta^{\rm adv.}|| italic_δ start_POSTSUPERSCRIPT roman_adv . end_POSTSUPERSCRIPT | field is quite small and b2subscript𝑏2b_{2}italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT multiplies an even smaller quantity, |δ2,adv.|superscript𝛿2adv|\delta^{2,\ {\rm adv.}}|| italic_δ start_POSTSUPERSCRIPT 2 , roman_adv . end_POSTSUPERSCRIPT |. We caution that at the lowest redshifts, where the size of the bubbles is comparable to the size of the simulation box, the bias parameters cannot be reliably measured.

Finally, we study the thermal noise of a futuristic baseline interferometer experiment and compare it with the theoretical noise associated with our model in Fig. 10. We find that our model performs well in the range of scales that such an experiment would be sensitive to. This suggests that a HEFT-like model is more than suitable to model the observed data in the range of interest.

In the future, we aim to further validate our model in simulations of larger volumes and different resolutions. We conjecture that the photon sinks in high-resolution simulations (which are reflective of photon sinks in the Universe) may be very small which would lead to a decrease in the characteristic size of bubbles relative to Thesan-1 and thus would extend our method deep into the large-wavenumber regime. We also aim to investigate the effect of foregrounds and foreground leakage more thoroughly by e.g., running the ‘clean’ output of Thesan-1 through an end-to-end simulator of a future 21 cm experiment.

Finally, a potential simplification of our model involves using a ‘cheaper’ and purely theoretical forward model that uses e.g. Zel’dovich or second-order LPT displacements as opposed to N𝑁Nitalic_N-body displacements333See https://github.com/andrejobuljen/Hi-Fi_mocks for an example of such a model., since at most redshifts during reionization, the non-linear scale occurs at larger k𝑘kitalic_k modes than the scale at which the bias expansion breaks down due to the reionization bubbles growing.

Acknowledgements.
We would like to thank Aaron Smith, Rahul Kannan, Nick Kokron, Ruediger Pakmor and Wenzer Qin both for several fruitful discussions during the preparation of this paper, and we are especially grateful to Enrico Garaldi and Meredith Neyer for providing additional information for running the initial conditions code and studying the bubble size distribution. We wish to acknowledge the support of the N3AS undergraduate research program at UC Berkeley for supporting D.B. through the academic 2023-2024 year. B.H. is grateful to the Miller Institute for their generous support.

References

  • (1) M. McQuinn, ARA&A54, 313 (2016).
  • (2) Planck Collaboration et al., A&A596, A108 (2016).
  • (3) I. D. McGreer, A. Mesinger, and V. D’Odorico, MNRAS447, 499 (2015).
  • (4) S. R. Furlanetto, M. McQuinn, and L. Hernquist, MNRAS365, 115 (2006).
  • (5) M. McQuinn et al., MNRAS377, 1043 (2007).
  • (6) B. Ciardi et al., MNRAS366, 689 (2006).
  • (7) T. R. Choudhury, M. G. Haehnelt, and J. Regan, MNRAS394, 960 (2009).
  • (8) S. R. Furlanetto and S. P. Oh, ApJ682, 14 (2008).
  • (9) R. A. Monsalve, A. E. E. Rogers, J. D. Bowman, and T. J. Mozdzen, ApJ835, 49 (2017).
  • (10) D. C. Price et al., MNRAS478, 4193 (2018).
  • (11) L. Philip et al., Journal of Astronomical Instrumentation 8, 1950004 (2019).
  • (12) S. Singh et al., Experimental Astronomy 45, 269 (2018).
  • (13) A. R. Parsons et al., AJ139, 1468 (2010).
  • (14) S. J. Tingay et al., PASA30, e007 (2013).
  • (15) J. D. Bowman et al., PASA30, e031 (2013).
  • (16) M. P. van Haarlem et al., A&A556, A2 (2013).
  • (17) D. R. DeBoer et al., PASP129, 045001 (2017).
  • (18) A. Weltman et al., PASA37, e002 (2020).
  • (19) K. Bandura et al., in Ground-based and Airborne Telescopes V, Vol. 9145 of Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, edited by L. M. Stepp, R. Gilmozzi, and H. J. Hall (PUBLISHER, ADDRESS, 2014), p. 914522.
  • (20) L. B. Newburgh et al., in Ground-based and Airborne Telescopes VI, Vol. 9906 of Society of Photo-Optical Instrumentation Engineers (SPIE) Conference Series, edited by H. J. Hall, R. Gilmozzi, and H. K. Marshall (PUBLISHER, ADDRESS, 2016), p. 99065X.
  • (21) K. Vanderlinde et al., in Canadian Long Range Plan for Astronomy and Astrophysics White Papers (PUBLISHER, ADDRESS, 2019), Vol. 2020, p. 28.
  • (22) A. Mesinger, S. Furlanetto, and R. Cen, MNRAS411, 955 (2011).
  • (23) M. G. Santos et al., MNRAS406, 2421 (2010).
  • (24) S. R. Furlanetto and S. P. Oh, MNRAS363, 1031 (2005).
  • (25) M. A. Alvarez and T. Abel, ApJ747, 126 (2012).
  • (26) E. Sobacchi and A. Mesinger, MNRAS440, 1662 (2014).
  • (27) N. Y. Gnedin and J. P. Ostriker, ApJ486, 581 (1997).
  • (28) B. Ciardi, F. Stoehr, and S. D. M. White, MNRAS343, 1101 (2003).
  • (29) I. T. Iliev et al., MNRAS369, 1625 (2006).
  • (30) H. Trac and R. Cen, ApJ671, 1 (2007).
  • (31) N. Y. Gnedin and A. A. Kaurov, ApJ793, 30 (2014).
  • (32) A. H. Pawlik et al., MNRAS466, 960 (2017).
  • (33) R. Barkana and A. Loeb, ApJ624, L65 (2005).
  • (34) J. Miralda-Escudé, M. Haehnelt, and M. J. Rees, ApJ530, 1 (2000).
  • (35) N. Y. Gnedin, ApJ542, 535 (2000).
  • (36) P. R. Shapiro, I. T. Iliev, and A. C. Raga, MNRAS348, 753 (2004).
  • (37) O. Zahn et al., ApJ654, 12 (2007).
  • (38) O. Zahn et al., MNRAS414, 727 (2011).
  • (39) J. Zhang, L. Hui, and Z. Haiman, MNRAS375, 324 (2007).
  • (40) Y. Mao et al., Phys. Rev. D91, 083015 (2015).
  • (41) M. McQuinn and A. D’Aloisio, J. Cosmology Astropart. Phys2018, 016 (2018).
  • (42) W. Qin et al., Phys. Rev. D106, 123506 (2022).
  • (43) N. Sailer, S.-F. Chen, and M. White, J. Cosmology Astropart. Phys2022, 007 (2022).
  • (44) L. Koopmans et al., in Advancing Astrophysics with the Square Kilometre Array (AASKA14) (PUBLISHER, ADDRESS, 2015), p. 1.
  • (45) P. McDonald and A. Roy, J. Cosmology Astropart. Phys2009, 020 (2009).
  • (46) V. Desjacques, D. Jeong, and F. Schmidt, Phys. Rep.733, 1 (2018).
  • (47) R. Weinberger et al., MNRAS465, 3291 (2017).
  • (48) A. Pillepich et al., MNRAS473, 4077 (2018).
  • (49) R. E. Angulo and A. Pontzen, MNRAS462, L1 (2016).
  • (50) V. Springel, MNRAS401, 791 (2010).
  • (51) R. Kannan et al., MNRAS485, 117 (2019).
  • (52) R. Kannan et al., MNRAS514, 3857 (2022).
  • (53) E. Garaldi et al., MNRAS512, 4909 (2022).
  • (54) M. Neyer et al., MNRAS(2024).
  • (55) F. Bernardeau, S. Colombi, E. Gaztañaga, and R. Scoccimarro, Phys. Rep.367, 1 (2002).
  • (56) S. Dodelson and F. Schmidt, Modern Cosmology (PUBLISHER, ADDRESS, 2020).
  • (57) Z. Vlah, M. White, and A. Aviles, J. Cosmology Astropart. Phys2015, 014 (2015).
  • (58) R. A. Porto, L. Senatore, and M. Zaldarriaga, J. Cosmology Astropart. Phys2014, 022 (2014).
  • (59) L. Senatore, J. Cosmology Astropart. Phys2015, 007 (2015).
  • (60) V. Assassi, D. Baumann, D. Green, and M. Zaldarriaga, J. Cosmology Astropart. Phys2014, 056 (2014).
  • (61) Z. Vlah, E. Castorina, and M. White, J. Cosmology Astropart. Phys2016, 007 (2016).
  • (62) T. Matsubara, Phys. Rev. D77, 063530 (2008).
  • (63) T. Matsubara, Phys. Rev. D78, 109901 (2008).
  • (64) Z. Vlah and M. White, J. Cosmology Astropart. Phys2019, 007 (2019).
  • (65) S.-F. Chen, Z. Vlah, E. Castorina, and M. White, J. Cosmology Astropart. Phys2021, 100 (2021).
  • (66) C. Modi, S.-F. Chen, and M. White, MNRAS492, 5754 (2020).
  • (67) B. Hadzhiyska et al., J. Cosmology Astropart. Phys2021, 020 (2021).
  • (68) N. Kokron et al., MNRAS505, 1422 (2021).
  • (69) M. Pellejero Ibañez et al., MNRAS520, 3725 (2023).
  • (70) C. A. Watkinson, B. Greig, and A. Mesinger, MNRAS510, 3838 (2022).
  • (71) S.-F. Chen, Z. Vlah, and M. White, J. Cosmology Astropart. Phys2020, 062 (2020).
  • (72) N. Kokron et al., J. Cosmology Astropart. Phys2022, 059 (2022).
  • (73) J. DeRose, S.-F. Chen, N. Kokron, and M. White, J. Cosmology Astropart. Phys2023, 008 (2023).
  • (74) B. Hadzhiyska et al., The Open Journal of Astrophysics 6, 38 (2023).
  • (75) M. Schmittfull, M. Simonović, V. Assassi, and M. Zaldarriaga, Phys. Rev. D 100, 043514 (2019).
  • (76) N. Kokron et al., MNRAS514, 2198 (2022).
  • (77) Cosmic Visions 21 cm Collaboration et al., arXiv e-prints arXiv:1810.09572 (2018).
  • (78) C. Modi, E. Castorina, Y. Feng, and M. White, J. Cosmology Astropart. Phys2019, 024 (2019).
  • (79) C. Modi, M. White, A. Slosar, and E. Castorina, J. Cosmology Astropart. Phys2019, 023 (2019).
  • (80) N. Sailer, E. Castorina, S. Ferraro, and M. White, J. Cosmology Astropart. Phys2021, 049 (2021).
  • (81) A. Obuljen, M. Simonović, A. Schneider, and R. Feldmann, Phys. Rev. D108, 083528 (2023).
  • (82) S. Foreman, A. Obuljen, and M. Simonović, arXiv e-prints arXiv:2405.18559 (2024).
  • (83) A. Lidz et al., ApJ680, 962 (2008).
  • (84) T. R. Choudhury, General Relativity and Gravitation 54, 102 (2022).
  • (85) M. F. Morales, B. Hazelton, I. Sullivan, and A. Beardsley, ApJ752, 137 (2012).
  • (86) P. Bull, P. G. Ferreira, P. Patel, and M. G. Santos, ApJ803, 21 (2015).
  • (87) H.-J. Seo et al., ApJ721, 164 (2010).
  • (88) A. Liu, A. R. Parsons, and C. M. Trott, Phys. Rev. D90, 023019 (2014).
  • (89) J. C. Pober, MNRAS447, 1705 (2015).
  • (90) H.-J. Seo and C. M. Hirata, MNRAS456, 3142 (2016).
  • (91) A. Liu and J. R. Shaw, PASP132, 062001 (2020).
  • (92) Z. Abdurashidova et al., ApJ925, 221 (2022).
  • (93) J. B. Muñoz et al., MNRAS511, 3657 (2022).
  • (94) H. Trac et al., ApJ927, 186 (2022).
  • (95) K. Hoffmann et al., MNRAS487, 3050 (2019).