Also at ]INFN-Roma Tre, 00146 Roma, ItalyNow at ]Department of Physics & Astronomy, Bucknell University, Lewisburg, PA, USAAlso at ]Coimbra Polytechnic - ISEC, 3030-199 Coimbra, PortugalAlso at ]Physikalisches Institut, Universität Heidelberg, Heidelberg, GermanyXENON Collaboration
XENONnT WIMP Search: Signal & Background Modeling and Statistical Inference
Abstract
The XENONnT experiment searches for weakly-interacting massive particle (WIMP) dark matter scattering off a xenon nucleus. In particular, XENONnT uses a dual-phase time projection chamber with a 5.9-tonne liquid xenon target, detecting both scintillation and ionization signals to reconstruct the energy, position, and type of recoil. A blind search for nuclear recoil WIMPs with an exposure of 1.1 tonne-years yielded no signal excess over background expectations, from which competitive exclusion limits were derived on WIMP-nucleon elastic scatter cross sections, for WIMP masses ranging from 6 GeV/ up to the TeV/ scale. This work details the modeling and statistical methods employed in this search. By means of calibration data, we model the detector response, which is then used to derive background and signal models. The construction and validation of these models is discussed, alongside additional purely data-driven backgrounds. We also describe the statistical inference framework, including the definition of the likelihood function and the construction of confidence intervals.
I Introduction
Astrophysical and cosmological observations at various scales provide clear evidence for the existence of dark matter (DM) as a fundamental building block of our universe [1, 2]. Numerous extensions of the standard model of particle physics predict additional fundamental particles as potential candidates for DM [3, 4]. Measuring the direct interaction of particles from the DM halo of the Milky Way with a suitable detector could provide conclusive evidence for the particle DM hypothesis. The XENONnT experiment aims to detect a signal from weakly interacting massive particles (WIMPs) elastically scattering off xenon nuclei with a detector operated underground at the INFN Laboratori Nazionali del Gran Sasso (LNGS) in Italy. The collaboration published the first WIMP search results from a data-taking period between July and November 2021 with a total live time of days and a fiducial mass of , referred to as science run 0 (SR0). We performed a blind analysis and observed no significant excess above background expectations [5]. This led to a minimum upper limit on the spin-independent WIMP-nucleon cross section of for a WIMP mass of at confidence level.
The detector used for the XENONnT experiment is a dual-phase xenon time projection chamber (TPC). A particle interaction in the active liquid xenon (LXe) target results in prompt vacuum ultraviolet scintillation light as well as free ionization electrons. The instantaneous light signal, called the S1 signal, is detected by two arrays of photomultiplier tubes (PMTs) at the top and bottom of the approximately cylindrical TPC. An electric field ( average electric drift field during SR0) is applied to the target volume to drift the ionization electrons toward the liquid xenon surface. Here, they are extracted and accelerated into a xenon gas volume above the liquid via a stronger electric field ( electric extraction field during SR0). The kinetic energy of the accelerated electrons in the gas phase is sufficient for the emission of electroluminescent light, which is proportional to the number of extracted electrons. This second light signal is referred to as the S2 signal. The distribution of the S2 signal detected by the PMTs in the top array is used to infer the position of the initial interaction in the horizontal plane parallel to the liquid surface, which defines the radial coordinate . The vertical component follows from the drift time of electrons, which is determined as the time difference between the S1 and S2 signals. This three-dimensional position reconstruction of the interaction vertex enables the discrimination of multi-site events as well as the selection of an inner fiducial volume (FV) within the TPC. This volume benefits from a particularly low background level thanks to the excellent self-shielding properties of LXe, given its high density. Moreover, the relative magnitudes of the S1 and S2 signals can be used to determine if a particle interacted with the xenon nucleus (expected from WIMPs) or xenon shell electrons (typical of the dominant sources of background), i.e. nuclear recoil (NR) or electronic recoil (ER) events. The electric drift and extraction fields are established by three parallel-wire electrodes (cathode, gate, and anode from bottom to top). To reduce wire sagging, two and four horizontal perpendicular wires support the gate and anode electrodes, respectively.
The TPC is nested in an active neutron veto, which in turn is housed in an active muon veto. Both veto systems are water Cherenkov detectors. More details on the TPC, the veto systems, and other subsystems of the detector are provided in [6, 7].
The data analysis chain of the WIMP DM search in XENONnT is subdivided into two major parts, like in XENON1T [8, 9]. The first part is discussed in [10] and comprises signal and event reconstruction, corrections, event selection, and energy scale calibration. Here we present the second part of the analysis chain: The detector response to the different interaction types is modeled based on calibration data and discussed in Sec. II. The derived best-fit models are important inputs for the definition of signal and background models, detailed in Sec. III. This section also covers data-driven background models that do not rely on the detector response. Finally, the statistical methods used to compute constraints on DM given the XENONnT data are discussed in Sec. IV.
II Modeling the detector response
Modeling the detector response to energy depositions allows for a powerful discrimination between NR signal and ER background in S1–S2 space. NR events are produced by particles scattering elastically off xenon nuclei, while in ER events, particles scatter off xenon shell electrons. The different energy loss processes involved after the two types of recoil cause ER and NR events to form separate populations in S1–S2 space, which is exploited in the WIMP DM search.
In this section, we describe the TPC response model to energy depositions in LXe via ER or NR interactions, which is obtained from fits to calibration data. This detector response model is separated into two parts. The first one is the empirically parametrized LXe emission model, which describes the production processes of the detectable quanta, i.e. the conversion of deposited energy into scintillation photons and ionization electrons. The second part is the detector reconstruction model, which covers the conversion from the produced photons and electrons into the observed and spacetime-dependence corrected S1 and S2 signals (cS1 and cS2). Due to its complexity, it is unfeasible in the fit to simulate processes of photon and electron propagation on a per-quantum basis. Instead, toy-Monte-Carlo (toy-MC) simulations of the detector reconstruction model are used which sample from effective maps provided either by data-driven analyses described in [10] or by simulation-driven analyses using the XENONnT Monte Carlo (MC) framework [11, 12]. Then all the model parameters in the simulations are fit to both ER and NR calibration data. The two parts of the detector response model together with the toy-MC simulation workflow are described in Secs. II.1 and II.2, and the fit to ER and NR calibration data is detailed in Sec. II.3.
II.1 Liquid Xenon Emission Model
The production of quanta from energy depositions in a xenon target is complex and lacks a comprehensive description derived from first principles. Thus, a semi-empirical parametrization of the emission model is commonly used. The parametrization used in XENONnT SR0 is similar to XENON1T [9], which is based on the Noble Element Simulation Technique (NEST) model described in [13, 14]. The simulation workflow of the emission model is described in detail in Appendices A and B, and is summarized in the following.
In an ER event, the recoil energy is transmitted into the excitation and ionization of xenon atoms. Recoiling xenon nuclei from an NR event, on the other hand, lose a significant amount (roughly 80 %) of their kinetic energy to atomic motion in collisions with other xenon atoms [13]. This thermalization process is undetectable in LXe TPCs, resulting in an effective signal loss. For both recoil types, the total number of detectable quanta equals the sum of the number of produced excitons and ions . A fraction of ions and electrons recombine, resulting in the production of additional excitons. An exciton can combine with a ground-state xenon atom forming an excimer, which de-excites, producing a scintillation photon. In an event, photons are generated, part of which are then detected by PMTs as the signal. The electrons that do not recombine are drifted to the liquid-gas interface and eventually form an signal.
In the region of interest (ROI) for the SR0 WIMP search, in [0, 100] PE and in [, ] PE, NR interactions have a larger recombination probability than ER interactions given the same total energy deposition. This leads to smaller ratios of S2 to S1 areas, which is the principle behind ER-NR discrimination in LXe TPCs. We parametrize the mean recombination fraction for ER interactions using the Thomas-Imel box model [15] with an additional Fermi-Dirac term, as in [9]. For NR interactions, the emission model follows the parametrization used in the NEST v1 model detailed in [14]. While there are newer versions of fits available in the NEST framework that use a different parametrization for NR interactions (which we refer to as NEST v2 in the following), we choose to stay with the previous version (referred to as NEST v1). The model is simpler, fits our data well, and the best-fit values and uncertainties of parameters of NEST v2 were not published at the time of the analysis.
II.2 Detector Reconstruction Model
The detector reconstruction model covers all aspects of the signal reconstruction, starting from the produced scintillation photons and ionization electrons up to the measured S1 and S2 signal sizes. The XENONnT detector reconstruction model is almost identical to the one presented in [9] and is briefly outlined here. The detailed simulation workflow of the detector reconstruction model is described in Appendix C.
For S1 signals, the spatial dependence of geometric effects during photon propagation, the photon detection efficiency of the PMTs, and the effect of double photoelectron emission (DPE) from the PMT photocathode [16, 17] are all modeled in the simulations. For S2 signals, we model the loss of electrons along their drift path due to attachment to electronegative impurities as well as due to electric field effects close to the TPC wall, the efficiency of electron extraction from the liquid into the gas phase, and the single-electron gain of extracted electrons to detected photoelectrons (PE) via the electroluminescence process in gas. In addition to these physical processes, we also account for the influence of software reconstruction effects on the signals. For both signal types, the software reconstruction process can introduce biases and fluctuations in the recorded signal size. We also account for the S1 signal reconstruction efficiency, which vanishes at about 3 PE, as we require that at least 3 PMTs detect a photon within 50 ns around the maximal amplitude. Finally, we model the uncertainty of the event position reconstruction and the acceptances of data quality selections [10].
The modeling of ER interactions relies on calibration data from sources dissolved in xenon, in particular 220Rn and 37Ar. The 220Rn progeny 212Pb undergoes a -decay with a Q-value at about 0.6 MeV [18], giving an approximately flat ER energy spectrum in the WIMP ROI, i.e. below about keV. This dataset is used for the fit of the ER model. The 37Ar source provides a mono-energetic ER peak when its K-shell electron capture process leads to a fast cascade of X-rays and Auger-Meitner electrons, with a total energy of about 2.8 keV [19]. Both ER sources provide single-scatter (SS) events, producing one S1 and one S2 signal for the total deposited energy, and the above-described detector reconstruction processes are simulated accordingly.
The NR model is calibrated using an external 241AmBe neutron source (referred to as AmBe source in the following). It emits a fast neutron via the 9BeC capture reaction. Neutrons often scatter multiple times in the LXe target in so-called multi-scatter (MS) events, creating distinct energy depositions at each interaction site, thus the topologies of neutron events must be considered in the modeling. Since the neutrons travel at about 10 % of the speed of light and the S1 width is , the S1 signals from the separate energy depositions in a MS event get reconstructed into one merged S1 signal. The S2 signals might be distinguished based on the separation in spacetime between the energy depositions of the individual scatters, but can get (partially) merged if the interaction sites are close together. Due to the low drift field of in XENONnT SR0, the separation resolution between two S2 signals is reduced with respect to the design because of the increased drift time and consequently larger S2 spread due to the diffusion of electrons. This effect is accounted for in the fit of the NR model to the neutron calibration data: we first produce the photons and electrons for each energy deposition in a MS event separately, based on the emission model. While the primary scintillation photons are summed to produce the merged S1 signal, we only sum S2 signals that will become part of the largest S2 signal after going through the remaining detector reconstruction processes. This information is provided by PMT waveform simulations of neutron scatters from the AmBe source, using the Monte Carlo (MC) framework of XENONnT [11, 12] (see also Sec. III.3 for details on neutron simulations). The results of these waveform simulations are used as inputs to the toy-MC simulations and fitting framework. Data selections that remove MS events (both resolved and unresolved) are directly applied to these inputs, because their acceptances can only be determined from full simulations, and are correspondingly dropped from the selection acceptance curves for the NR fit.
II.3 Fit to Calibration Data
The parameters in the full detector response model are fitted to 220Rn, 37Ar, and AmBe calibration data. The ROI selection is the same as the SR0 WIMP search region defined in Sec. II.1. All data quality selections are applied to the calibration data within the ROI. For the AmBe neutron data, an additional selection is applied using coincident gammas detected in the neutron veto. In 9BeC capture reactions in the AmBe source, 12C is left in the first excited state after neutron emission in about 50 % of the cases, estimated with neutron veto data [20]. The 12C excited state promptly decays via the emission of a 4.44 MeV gamma ray, coincident with the neutron emission. The requirement of the time coincidence within between the 4.44 MeV gamma ray detected in the neutron veto and the event in the TPC, leads to a clean calibration dataset, with only about one expected event from accidental coincidences between the two detectors in the dataset (0.05 % of the resulting sample). The events recorded in the ROI that pass (fail) the neutron veto coincidence selection are shown in red (gray) in Fig. 1.
For the fit to the ER data, an additional background component is added to account for accidental coincidence (AC, see subsection III.5 for details) events formed by a wrong pairing of S1 and S2 signals in the calibration dataset.
After all selections, the 220Rn and AmBe datasets consist of 2000 events each. We use a downsampled dataset of 10 000 37Ar events to perform a combined 220Rn-37Ar ER fit. With the downsampling we avoid overconstraining the fit to the narrow, very low energy range of 37Ar.
For the fit, an affine-invariant Markov chain Monte Carlo (MCMC) algorithm [35, 36] is used to sample from the high-dimensional posterior distribution of the parameters. In each step of the sampling, a GPU-accelerated toy-MC simulation is performed for every set of the ER and NR model parameters, following the steps described in the previous two sections, producing a model of the detector response in the space of corrected signals, i.e. –. The model parameters are described in detail in the appendix and are listed in Tab. 2. These toy-MC models are then compared to calibration data by calculating the binned Poisson likelihood in – space. The likelihoods are multiplied by the prior distributions of the parameters to yield the posteriors. After sufficient iterations of the MCMC, the samples in the chain then converge to the posterior distribution of the parameters.
In the ER emission model, the parametrization is the same as in XENON1T [9] with very wide flat priors, referred to as free priors. For the NR emission model, the parameter priors are taken from [14]. A Gaussian-like distribution with different widths on either side is used as the prior of parameters with asymmetric uncertainties. However, due to the low drift field of 23 V/cm, where no literature data on NR yields exist, the validity of the field dependence in the model is unverified. Therefore, the field dependence in the emission model from [14] was modified. Two scaling parameters were freed ( and in Eq. 24 and Eq. 27 in the appendix), and two field dependence parameters ( and ) were fixed to the reported best-fit values. The widths of all other parameter priors were doubled in order to allow more freedom in the fit. Because the dependence of NR signals on the drift field has been shown to be small (see also literature measurements shown in Fig. 3), we consider this a reasonable choice.
The fit results of all emission model parameters are summarized in Tab. 2 in the appendix. No significant tension between the posteriors and the priors of any parameter is found. Suitable goodness-of-fit (GOF) tests are performed to assess whether the best-fit models adequately describe the 220Rn and AmBe calibration data. Specifically, we employ binned Poisson likelihood tests in the – space. We adopt an equiprobable binning scheme using the GOFevaluation package [37], ensuring that the number of expected events under the best-fit model is the same in each bin, . In Fig. 2 the results of the 2D – Poisson likelihood test are shown for both the ER and the NR fit, overlaid with the 220Rn and the AmBe calibration data, respectively. The resulting p-values of the tests indicate no significant discrepancy between the best-fit models and the calibration data.
Figure 3 shows the photon and charge yields as a function of deposited energy for the ER and NR emission models using our best-fit parameters (dark red). For comparison, NEST models calculated at a drift field of 23 V/cm and several published measurements at different drift fields are shown. For the NR yields, we also compare our results to the NEST v1 model at a drift field of 23 V/cm (red dashed lines in Fig. 3) which uses the same parametrization as the model in this work (see Sec. II.1), showing very good agreement.
III Signal and background modeling
The dominant sources of background in the WIMP search are ER events from intrinsic -decays from materials and neutrinos (Sec. III.2), NR events from radiogenic neutrons from detector materials (Sec. III.3) and coherent neutrino scattering (Sec. III.4), AC events (Sec. III.5), and surface background (Sec. III.6). Besides their total expected rates, the distribution of these background events in the analysis space cS1–cS2– is derived. In Fig. 4, the distributions of background models in the – space are shown, compared to the signal model for elastic spin-independent scattering of a 200 GeV mass WIMP.
III.1 WIMP Signal Model
For non-relativistic, spin-independent elastic WIMP-nucleus coherent scattering, the event rate scales with the square of the atomic mass number of the target and can be written as [38]
(1) |
where is the recoil energy, is the local DM density of 0.3 GeV/(cm3), is the WIMP mass, is the WIMP-nucleon reduced mass, is the WIMP velocity in the lab-frame, is the WIMP-nucleon cross section, and is the nuclear form factor as a function of the momentum transfer to the xenon nucleus [39]. The DM velocity distribution is averaged using the parameter values of the standard halo model with values from [40, 41, 42, 43, 44, 45], as recommended in [46]. Fig. 5 shows the cS1–cS2 distribution of WIMP-nucleus scattering for different WIMP masses. For increasing masses, the contours extend to higher cS1 and cS2 values, up to about 200 GeV/, where the shape does not change significantly anymore due to kinematic effects. For this reason, confidence intervals in (see Sec. IV) for approximately scale proportional to the inverse of the assumed WIMP mass .
For the spin-dependent WIMP-nucleus interaction, assuming natural abundances of xenon isotopes, only 129Xe and 131Xe can contribute since they are the only two stable isotopes of xenon with non-zero nuclear spin . The cross section is usually written as
(2) |
The axial-vector structure factor of xenon is taken from [47]. In the XENONnT SR0 analysis, the mean of the structure factor is used and the uncertainty is neglected since it is much smaller than the uncertainty from the NR fit.
Uncertainties from the posterior distribution of the NR model parameters and other efficiencies can be propagated to the NR rate uncertainty. For a 6 GeV (50 GeV) WIMP, the relative rate uncertainty is 30% (10%). The NR model shape can also be affected by the posterior distribution. However, because of the low statistics of NR events in SR0 (for both background and signal expectation), the impact of the shape uncertainty of the NR model is negligible compared to the uncertainty of its absolute rate. Thus, the NR model shape uncertainty is not propagated to the final likelihood.
III.2 Electronic Recoil Background Model
ER events are one of the dominant background sources in the XENONnT SR0 WIMP search, primarily originating from -decays of intrinsic radioactive contaminants such as 214Pb (a product of the 222Rn decay chain) and 85Kr. Contributions from 136Xe double- decays, solar neutrino interactions, and gamma events from the materials are also taken into account. In the ROI for the WIMP search, the total ER energy spectrum is approximately flat. The distribution of ER background events in – space is generated from the ER model fitted to 220Rn and 37Ar calibration data, as discussed in Sec. II.3. The total ER rate is left unconstrained and is fitted in the final WIMP likelihood, which will be discussed in Sec. IV.
In contrast to the NR model, the uncertainty in the shape of the ER model can affect the sensitivity of the WIMP search. Ideally, the posterior distribution of all parameters should be propagated to the ER model as nuisance parameters in the WIMP search likelihood function. However, an excessive number of correlated nuisance parameters becomes computationally challenging. In XENONnT SR0, a principal component analysis (PCA) [48] is used to remove correlation among parameters from the MCMC sampler. All ER parameters shown in Tab. 2, together with , , and (see details in the appendix), are included in the PCA. Different from the original PCA, the variance of
(3) |
along each principal component is used to quantify its importance. Here, and are the probability of a 50 GeV WIMP and ER background in the bin of cS1–cS2 space, respectively. A larger variance of means that the uncertainty of that component can be more impactful for the WIMP sensitivity. In the end, the two most important principal components are kept as nuisance parameters. Their shape uncertainty, constrained by the calibration data, is shown in Fig. 6.
III.3 Neutron Background Model
Radiogenic neutrons are mainly produced through spontaneous fission and reactions in the detector materials due to intrinsic traces of radioactive impurities. The cosmogenic neutron background is subdominant [49], hence it is neglected in XENONnT SR0 analysis. The neutron yield and energy per isotope and material are calculated with the SOURCES-4A software [50], using the radioactive impurity levels of the relevant detector components obtained via the combined XENON1T and XENONnT radioassay campaigns [51, 52]. A full-chain simulation pipeline [11] is used to estimate the neutron background rate, the geometrical distribution, and the cS1–cS2 distributions for SR0.
The propagation of the neutrons in the XENONnT detector is simulated with the Geant4 toolkit [53, 54], where the recoil type and energy deposition per interaction in the target are recorded. We compute the number of photons and electrons for a given interaction via the custom-developed epix package [55], which utilizes the energy-dependent LXe response derived from the SR0 calibrations, shown in Fig. 3. This package also handles the clustering of the individual energy-depositing steps at the LXe microphysics scale before the quanta generation. The photons and electrons produced by epix are passed to the waveform simulator (WFSim) [56], which computes the S1 and S2 signals up to the waveform level, by means of a precise set of simulations- and data-driven corrections which characterize the XENONnT detector response. The event-by-event simulated waveforms share the same data structure as the science data after applying PMT and DAQ effects, which allows us to process them with the same software used for the real data (straxen [57]), as well as to apply the same data selections. The final SS neutron rate arises from weighting the rates obtained via the entire waveform simulation pipeline with the specific activities of the corresponding material and isotopic neutron yield.
When a neutron scatters in the 250 kg of LXe between the cathode and the bottom PMT array, only the scintillation light for these events can be detected. The electrons, in turn, are lost due to an electric field pointing in the opposite direction to the active volume. Neutron events that consist of scatters above and below the cathode are referred to as neutron-X events, where “X” means additional S1 contribution from charge-insensitive scatters. They are modeled as a separate background since they have a larger cS1 to cS2 ratio than normal neutron scatters.
The event parameters having discrimination power on MS interactions, such as S2 pulse shape and PMT hit patterns of S1, are matched and validated with calibrations, such that the relevant data quality selections can be applied to the simulation outputs. Notably, a validation of the multiple-to-single scatter ratio and total rate in the TPC of the AmBe calibration data was conducted, from where we obtained the systematic uncertainty associated with the accuracy of the full-chain simulations.
With this agreement between simulations and calibration data, we decided before unblinding the WIMP ROI to proceed with the sideband unblinding of the events tagged by the neutron veto. Initially, this confirmed the simulation-driven neutron background prediction. However, a mistake in the definition of the neutron veto time window was found after the unblinding of the WIMP ROI. After fixing this issue, a mismatch was found, with the neutron background rate being larger than predicted in the ROI [5]. We therefore decided to constrain the neutron background rate in a purely data-driven way based on the aforementioned sideband unblinding with the correct neutron veto tagging window. The results of the sideband unblinding are shown in Fig. 7. Based on the multiple-to-single scatter ratio of 2.2, the neutron veto tagging efficiency, and the three observed MS events and one SS neutron event tagged by the neutron veto in the fiducial volume, a data-driven prediction of events was derived for the SR0 exposure. We used the simulation-driven ratio between normal neutron background and neutron+X events to estimate the neutron+X event rate.
No further modification was propagated into the analysis after the data-simulations rate mismatch was identified: the time veto window between the TPC and the neutron veto, chosen due to the reduced background rate initially predicted, and the fiducial volume of the WIMP search remained as defined prior to unblinding. An underestimated contamination from some of the surrounding materials is considered as a possible cause of the discrepancy between the expected rate and the observed neutron background. Studies to constrain the material’s radiopurity by means of the high-energy gamma rays for specific detector regions are ongoing.
III.4 CENS Background Model
Neutrinos can also contribute to the NR background via coherent elastic neutrino-nucleus scattering (CENS). Signals from solar, atmospheric, and diffuse supernova neutrinos (DSN) will be in the WIMP search ROI. Due to the weak interaction between neutrinos and nuclei, CENS events are expected to be single-scatter and spatially uniform.
The recoil energy spectrum of solar CENS is almost identical to that of a 6 GeV spin-independent WIMP [58], and the flux is [59, 60]. After applying all selections and their corresponding efficiencies, the expected number of events is in SR0, also shown in Tab. 1. Atmospheric neutrinos and DSN mainly affect the search for heavier WIMPs. Their recoil energy spectra are taken from [49], and the SR0 expectation value is events. Due to the low cross section and the similarity to WIMP interactions, CENS background will be the major limitation to the WIMP search sensitivity of the next generation of LXe experiments.
III.5 Accidental Coincidence Background Model
AC events consist of incorrectly paired S1 and S2 signals. These S1 and S2 signals can occur, for example, when either the S1 or the S2 signal of a physical event is not reconstructed due to detector effects, or when a single electron S2 signal is misclassified as an S1 signal. Such signals are referred to as “isolated” S1 and S2 signals. If an isolated S1 and an isolated S2 fall within the event-building time interval [8], they form an AC event.
The AC background is modeled with a data-driven approach. Isolated S1 and S2 signals as well as their surrounding S1 and S2 signals that are close in time are sampled and paired into events. S1 signals 150 PE are selected as isolated S1 signals, and the isolated S2 signals are taken from events whose S1 area is 150 PE, together with all pulses in the event window. The AC event rate is computed as
(4) |
where and are the isolated S1 and S2 rates, and is the event-building time interval, which is defined according to the maximum drift time of 2.2 ms in XENONnT. Occasionally during SR0, some localized high rates of S2 signals appeared in the TPC. Excluding these periods from the analysis, and remained stable throughout SR0 at an average rate of 1.5 Hz and 80 mHz, respectively. In XENONnT, the isolated S2 rate is an order of magnitude higher compared to XENON1T [61]. This can be explained by the lower electron extraction efficiency which causes an increased rate of delayed electrons. The isolated S1 and S2 signals are fed into the data processing pipeline [62, 57] to reconstruct the events. This provides the background distribution of all relevant event properties, especially the distribution in cS1 and cS2, as shown in Fig. 4 (lower right).
Validation performed on calibration data provides the rate and shape uncertainty of the AC model. For this, we compared the predicted and observed AC events in a dedicated dataset by performing both 1D and 2D GOF tests with equiprobable bins in all relevant parameters. The AC prediction is provided by the AC modeling method discussed above, and the AC dataset is selected by inverting the data selections to avoid other physical events. Because of the large statistics of 37Ar calibration data, it delivers the most stringent constraint on the AC model. 37Ar events with S2 areas smaller than 400 PE are selected to test the AC model. The predicted number of AC events was 731.6, while 733 events were observed in the data. The statistics in the AC model is very large so the uncertainty of the predicted AC rate is neglected in the following GOF tests. The 1D GOF tests in S1, S2, , , and the 2D test in S1–S2 all yield p-values between 0.05 and 0.95. The result of the 2D test with a p-value of 0.61 is shown in LABEL:fig:ac_${}^{37}$Ar. The model was further validated with events removed by selections targeting the AC background inside the WIMP ROI. Similar tests were also performed on 220Rn calibration data. All these tests show the model and data to be compatible.
To suppress AC events, a selection criterion based on a gradient-boosted decision tree (GBDT) classifier utilizing the S2 pulse shape and the drift time was developed. Due to an insufficient model for the S2 pulse shape near the perpendicular wires, the GBDT classifier is only applied in the region at least 4.45 cm far from the wires (far-wire region), while in the region near the wires (near-wire region), a data-driven S2 shape selection is applied instead. The resulting AC background rate per ton-year in the near-wire region is about 5.6 times higher compared to the rate in the far-wire region. Consequently, the modeling of the TPC response is split into two parts. Due to an over-fitting issue found in the GBDT training process, we conservatively estimated the rate uncertainty to be 30% for both the near- and far-wire regions. The expectation value of AC events in the WIMP search dataset in the ROI is .
III.6 Surface Background Model
In XENONnT, the TPC wall is made of PTFE, which is known to accumulate isotopes from the decay chain of atmospheric 222Rn, which decays down to 210Pb, an isotope with a half-life of 22.2 years. Radioactive decays on the wall surface can result in events with reduced S2 signals due to charge losses, which gives rise to a background that can leak into the WIMP signal region. Three components contribute to the surface background in the WIMP ROI: the two -decays of 210Pb and 210Bi, and the recoiling 206Pb following the -decay of 210Po. Due to the complexity of electric field conditions near the surface and the loss of ionization electrons to the detector walls, we employ a data-driven approach to model this background component in the space of cS1–cS2––.
The surface background model in the space of – was developed using 210Po -events. These events are mono-energetic (5.4 MeV) and thus easily identifiable through their characteristic S1 signal, yet they present a wide range of S2 sizes due to variable charge loss to the walls. The and distributions of 210Po -events were seen to match those of the lower-energy -events. The radial profiles were modeled using a Student’s generalized skew-t distribution. The radial distribution was fitted independently for different S2 sizes as shown in Fig. 9 to account for the S2-size dependent position resolution [10]. The profile was modeled using a linear function, to account for lower rates of surface events observed at the bottom of the TPC, likely due to the increased charged insensitivity near the walls at the bottom of the detector [7].
The cS1–cS2 distribution was modeled using a 2D Gaussian adaptive kernel density estimation (aKDE), built using events reconstructed outside the detector walls. The resulting model was then validated against the events reconstructed between the walls and the fiducial volume, in order to rule out any dependence between cS1, cS2, and R. Figure 4 (lower left) shows the projection of the four-dimensional model on cS1–cS2 in the ROI. The absolute rate of events in the blinded region was inferred from the radial distributions of adjacent, non-blinded regions in cS1–cS2.
Uncertainties in the model were obtained from the and fit parameter uncertainties, as those are the parameters of interest in the development of the FV. Uncertainties on the overall measured surface event rate were also propagated. Uncertainties in the (cS1, cS2) aKDE were neglected, as toy-MC tests were performed to show that they had little impact on overall expectation. For the FV, a maximum radius of compared to the used in [63] was chosen to reduce the number of surface events to , which improves the robustness against mismodeling.
IV Statistical Inference
In this section, we describe the statistical methods used to derive the WIMP search results, which generally follow the recommendations formulated in [46]. We first describe the likelihood function used for the search in Sec. IV.1 and its nuisance parameters in Sec. IV.2, followed by an illustration of the procedure for constructing confidence intervals and computing the discovery significance in Sec. IV.3. The GOF test to validate the best-fit models and the blinding procedure are described in Sec. IV.4. We omit some details already presented in [9], as the methods here presented build upon that previous work.
IV.1 Definition of the Likelihood Function
The fundamental ingredient for the statistical analysis of our WIMP search data is the likelihood function . It depends on the WIMP-nucleon cross section , which is our parameter of interest, and a set of nuisance parameters . The likelihood function is defined as a product of four terms: two for the WIMP search dataset ( and ), one for the 220Rn calibration dataset (), and one for ancillary measurements of nuisance parameters ():
(5) | ||||
In the WIMP search data, we categorize events based on their proximity to the perpendicular wires of the gate and anode electrodes, distinguishing between those that are near (, corresponding to about of the total FV) and far from the wires in the – plane. This approach allows us to account for the higher AC rate observed near the wires as discussed in Sec. III.5, without introducing a full position-dependence in the likelihood. Other backgrounds, in particular radiogenic neutron and surface background, exhibit a substantial radial dependence. Thus, the likelihood is modeled and evaluated in , in addition to cS1 and cS2. For the near-wire region, the radial component is not included in the modeling. Each WIMP search term is an extended unbinned likelihood function of the form
(6) | ||||
where the index “region” runs over “near-wire” and “far-wire”. The index runs over all observed events in the WIMP search far-wire (near-wire) dataset, where is a vector with entries cS1, cS2, and (cS1 and cS2). The PDFs of each signal and background component with expectation values are evaluated for each event. The total expectation value is given by .
The term is the extended unbinned likelihood function of the 220Rn calibration dataset. It depends on the two shape parameters introduced in Sec. III.2 that parameterize the range of models consistent with the ER calibration data, selected using the PCA method. By incorporating this likelihood term, we simultaneously fit the shape parameters to the 220Rn calibration dataset and the ER background model in the WIMP search dataset. Through this procedure, the constraint from the calibration data on the shape of the ER model is propagated to the final inference.
Finally, the ancillary likelihood function is a product of constraint terms for some of the nuisance parameters , which are detailed in the following section.
IV.2 Nuisance Parameters and Constraints
Rate Parameter | Constraint | Nominal | Best Fit |
---|---|---|---|
ER | WIMP search data | ||
Neutron | Ancillary measurement | ||
Neutron-X | Ancillary measurement | ||
CENS (Solar ) | Ancillary measurement | ||
CENS (Atm + DSN) | Ancillary measurement | ||
AC (near-wire) | Ancillary measurement | ||
AC (far-wire) | Ancillary measurement | ||
Surface | Ancillary measurement | ||
220Rn calibration | 220Rn dataset | ||
Shape Parameter | Constraint | Nominal | Best Fit |
ER shape parameter 1 | 220Rn dataset | ||
ER shape parameter 2 | 220Rn dataset | ||
Signal Parameter | Constraint | Nominal | Best Fit |
Signal efficiency | NR model uncertainty | ||
WIMP cross section [] | Parameter of interest | 3.22 |
In addition to our parameter of interest , we parameterize the space of background hypotheses using nuisance parameters in the likelihood function of Eq. 5. These parameters control both background rates and shape as well as the signal efficiency. In total, twelve nuisance parameters are defined, which are listed in Tab. 1. The uncertainties of the nominal expectation values correspond to the width of a Gaussian constraint term. The ancillary likelihood function is the product of all constraint terms. Most parameters are constrained via ancillary measurements, which are obtained from sidebands (e.g. data outside the ROI) or external measurements in combination with simulations. More details on the constraints on neutron, CENS, AC, and surface background rates were discussed in Sec. III. The ER rate is a free parameter in the fit and is fully constrained by the WIMP search data. The ER shape parameters obtained with PCA are constrained via the simultaneous fit of the calibration dataset via the term in Eq. 5.
The best-fit values from the XENONnT SR0 WIMP search data [5] for a fit including an unconstrained signal component are given in the last column of Tab. 1. The corresponding uncertainties represent the two-sided one-sigma confidence intervals derived from a profile likelihood scan in the respective parameter. More details on the construction of confidence intervals are discussed in the following.
IV.3 Confidence Intervals and Discovery Significance
The construction of confidence intervals is based on the profile likelihood test statistic
(7) |
Quantities with a single hat denote the global maximum likelihood estimator of a parameter, while denotes the set of nuisance parameters that maximize the likelihood if the cross section is fixed to a given . The likelihood is defined for a specific signal model, for example a with a certain mass and interaction type (e.g. spin-independent or spin-dependent coupling). In our statistical inference, we consider signal models across a range of WIMP masses, from to . For higher WIMP masses, the PDF remains constant and the flux for a given cross section decreases approximately linearly with mass, as discussed in Sec. III.1. According to this, limits can be extrapolated even beyond .
Knowing the distribution of under different hypotheses is essential for calculating discovery significances and confidence intervals. Due to the low rate of background resembling the signal, asymptotic formulae as discussed in [64] are not applicable. Instead, toy-MCs are used to estimate the distribution of the test statistic given that is the true cross section. We use the custom-developed blueice [65] framework to define the likelihood function. This Python package provides an efficient interpolation (“template morphing”) between PDFs evaluated for different discrete values of shape nuisance parameters. This allows shape parameters to be considered for which we have no analytical description.
For each signal model considered, we compute the discovery significance and the confidence interval by comparing the measured test statistic with the distribution under each hypothesis [66, 67]. Testing yields the (local) discovery significance, while confidence intervals are computed by finding the region where is rejected at a confidence level (CL) given the data, which is illustrated for three background-only toy-MCs (three black parabola-like lines) in Fig. 10 (top). The minimum of each curve corresponds to the respective best-fit value . Upper limits (ULs) and lower limits are obtained by finding the cross sections at which reaches the critical region. The threshold of the 90 CL critical region is precomputed as the 90th percentile of the test statistic distribution from toy-MCs with injected signal corresponding to the respective cross section. For low cross sections, it deviates from the asymptotic two-sided threshold indicated as a horizontal gray dotted line.
Repeating this procedure times yields the expected distribution of ULs that can be obtained in the absence of any signal, shown in Fig. 10 (bottom). The experiment’s sensitivity is given by the median of these ULs and the sensitivity band (“Brazil band”) marked by the 2-sigma and 1-sigma quantiles indicates the spread. The distribution of lower limits for background-only toy-MCs is also shown in Fig. 10 (bottom). As recommended in [46], we decide on a discovery significance threshold for reporting two-sided confidence intervals of sigma, following [9].
Statistical fluctuations as well as mismodeling, such as overestimated background rates, can yield arbitrarily low ULs, which may result in the spurious exclusion of models beyond the experiment’s sensitivity. To mitigate this issue, various methods have been proposed [68, 69, 70]. The XENONnT SR0 WIMP search follows the recommendation of [46] to use power-constrained limits (PCL) [70]. In this method, a signal size threshold is selected, at which the experiment has a “rejection power” – the probability of excluding a given signal under the background-only hypothesis. If an UL falls below this threshold, the threshold value is reported instead. These thresholds correspond to the quantiles of the UL distribution used to compute the sensitivity band illustrated in Fig. 10. For instance, choosing sets the threshold to the -1 sigma quantile of the band, while truncates ULs at the band’s median. In [46], the fiducial choice of was suggested. However, the choice was erroneously based on the discovery power, which corresponds to the probability of rejecting the background-only hypothesis given an alternative hypothesis with a specific signal size. In the absence of an updated recommendation, we chose the very conservative choice of for the first WIMP search of XENONnT.
IV.4 Goodness of Fit and Blinding Procedure
To verify that the best-fit models describe our WIMP search data well, we defined a GOF test before unblinding, and after studies to optimize the power to reject mismodeling. The test uses a binned Poisson likelihood with 15 equiprobable bins in the cS1–cS2 space, defined from the model being tested. To compute a p-value, the distribution of the test statistic under the best-fit hypothesis is derived through toy-MCs. Specifying the test and acceptance threshold (90% CL) before unblinding ensured that the results were not influenced by statistical fluctuations expected from the low-statistic WIMP search dataset. For the background-only fit we found a p-value of and for a fit with an additional WIMP signal the p-value is . Both indicate no strong mismodeling in the predefined parameter space. Using GOF tests with well-studied power to discover signal-like mismodeling was also designed to replace the “safeguard” ER shape parameter defined in [71], due to its computational cost and susceptibility to some kinds of mismodeling.
When searching for new signals in data, it is crucial to avoid the experimenter’s bias on the result [72]. In our WIMP search, we adopt a common strategy for bias mitigation: blinding the signal region until all steps of the data analysis are finalized. Initially, events in the ER and NR bands with a reconstructed ER energy below were blinded. This involved defining the ER and NR bands in cS1 and cS2 based on quantiles in cS2 for slices in cS1 of calibration data. In the first step, all events above and those above the sigma quantile of the ER band were unblinded for the analysis presented in [63]. The signal region for the WIMP search (indicated in Fig. 5) remained blinded until the analysis procedure was finalized. The unblinded SR0 WIMP search data showed no significant excess for any of the tested WIMP masses with local discovery p-values . For this reason, we reported new ULs on the spin-independent WIMP-nucleon cross section across WIMP masses ranging from to , with a minimum of at .
V Summary
The XENONnT WIMP dark matter search relies on a detector response model as well as simulation- and data-driven background models. These were combined to construct a statistical model in cS1, cS2, and , which was used to infer limits on WIMP-nucleus scattering cross sections.
The full detector response model for electronic recoil (ER) and nuclear recoil (NR) interactions, including both the xenon emission and detector reconstruction model, was successfully fitted to calibration data. Accurate simulations of particle interactions up to the data acquisition waveform level made this possible, in particular, to correctly model the S2 multiplicity of events with several, potentially unresolved energy deposits. Using these models, we derived the distributions for ER and NR backgrounds, as well as the signals, in our analysis space. Except for ER, the background rates were constrained with ancillary measurements. The radiogenic neutron background rate was constrained by first matching the simulated ratio of multiple-to-single scatter interactions and the neutron veto tagging efficiency with NR calibration data. After unblinding the neutron veto-tagged events, these three inputs were combined to derive a prediction for the remaining non-vetoed single-scatter neutron background. Two shape parameters were propagated to the final inference to account for the uncertainty of the ER model in cS1–cS2 space. Accidental coincidence and surface background were modeled with data-driven approaches. The validity of the models was confirmed in calibration data as well as science data outside the region of interest of the WIMP search.
A blind analysis was performed for the first science data from XENONnT. The statistical methods largely follow the previous XENON1T approach and community recommendations, by using a toy-MC-calibrated profile log-likelihood ratio test statistic. One departure from these recommendations was raising the minimal required power of the power-constrained limit (PCL) threshold from 0.15 to 0.5, corresponding to placing upper limits only at or above the median unconstrained upper limit. Both the calibration fits and the final best-fit model were assessed and found acceptable with goodness-of-fit tests that were chosen based on their mismodeling rejection power, and defined prior to unblinding the data. Analysis of the data with an exposure of 1.1 tonne-years revealed no signal excess over backgrounds. Therefore, new upper limits on the spin-independent WIMP-nucleon cross section were derived.
Acknowledgements.
We gratefully acknowledge support from the National Science Foundation, Swiss National Science Foundation, German Ministry for Education and Research, Max Planck Gesellschaft, Deutsche Forschungsgemeinschaft, Helmholtz Association, Dutch Research Council (NWO), Fundacao para a Ciencia e Tecnologia, Weizmann Institute of Science, Binational Science Foundation, Région des Pays de la Loire, Knut and Alice Wallenberg Foundation, Kavli Foundation, JSPS Kakenhi and JST FOREST Program ERAN in Japan, Tsinghua University Initiative Scientific Research Program, DIM-ACAV+ Région Ile-de-France, and Istituto Nazionale di Fisica Nucleare. This project has received funding/support from the European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No 860881-HIDDeN. We gratefully acknowledge support for providing computing and data-processing resources of the Open Science Pool and the European Grid Initiative, in the following computing centers: the CNRS/IN2P3 (Lyon - France), the Dutch national e-infrastructure with the support of SURF Cooperative, the Nikhef Data-Processing Facility (Amsterdam - Netherlands), the INFN-CNAF (Bologna - Italy), the San Diego Supercomputer Center (San Diego - USA) and the Enrico Fermi Institute (Chicago - USA). We acknowledge the support of the Research Computing Center (RCC) at The University of Chicago for providing computing resources for data analysis. We are grateful to Laboratori Nazionali del Gran Sasso for hosting and supporting the XENON project.References
- Bertone and Hooper [2018] G. Bertone and D. Hooper, History of dark matter, Rev. Mod. Phys. 90, 045002 (2018).
- Aghanim et al. [2020] N. Aghanim et al. (Planck collaboration), Planck 2018 results: Vi. cosmological parameters, A&A 641, A6 (2020).
- Bertone et al. [2005] G. Bertone, D. Hooper, and J. Silk, Particle dark matter: evidence, candidates and constraints, Phys. Rept. 405, 279 (2005).
- Roszkowski et al. [2018] L. Roszkowski, E. M. Sessolo, and S. Trojanowski, WIMP dark matter candidates and searches—current status and future prospects, Rep. Prog. Phys. 81, 066201 (2018).
- E. Aprile et al. [2023a] E. Aprile et al. (XENON collaboration), First dark matter search with nuclear recoils from the XENONnT experiment, Phys. Rev. Lett. 131, 041003 (2023a).
- E. Aprile et al. [2024a] E. Aprile et al. (XENON collaboration), The XENONnT Dark Matter Experiment, preprint (2024a), arXiv:2402.10446 [physics.ins-det] .
- Aprile et al. [2024] E. Aprile et al. (XENON collaboration), Design and performance of the field cage for the XENONnT experiment, Eur. Phys. J. C 84, 138 (2024).
- E. Aprile et al. [2019a] E. Aprile et al. (XENON collaboration), XENON1T dark matter data analysis: Signal reconstruction, calibration, and event selection, Phys. Rev. D 100, 052014 (2019a).
- E. Aprile et al. [2019b] E. Aprile et al. (XENON collaboration), XENON1T dark matter data analysis: Signal and background models and statistical inference, Phys. Rev. D 99, 112009 (2019b).
- E. Aprile et al. [2024b] E. Aprile et al. (XENON collaboration), XENONnT Analysis: Signal Reconstruction, Calibration and Event Selection; In preparation (2024b).
- Ramírez García [2022] D. Ramírez García, Ph.D. thesis, Albert-Ludwigs-Universität Freiburg (2022).
- Zhu [2022] T. Zhu, Ph.D. thesis, Columbia University (2022).
- Szydagis et al. [2011] M. Szydagis, N. Barry, K. Kazkaz, J. Mock, D. Stolp, M. Sweany, M. Tripathi, S. Uvarov, N. Walsh, and M. Woods, NEST: a comprehensive model for scintillation yield in liquid xenon, JINST 6, P10002 (2011).
- Lenardo et al. [2015] B. Lenardo, K. Kazkaz, A. Manalaysay, J. Mock, M. Szydagis, and M. Tripathi, A global analysis of light and charge yields in liquid xenon, IEEE TNS 62, 3387 (2015).
- Thomas and D. A. Imel [1987] J. Thomas and D. A. Imel, Recombination of electron-ion pairs in liquid argon and liquid xenon, Phys. Rev. A 36, 614 (1987).
- López Paredes et al. [2018] B. López Paredes, H. Araújo, F. Froborg, N. Marangou, I. Olcina, T. Sumner, R. Taylor, A. Tomás, and A. Vacheret, Response of photomultiplier tubes to xenon scintillation light, Astropart. Phys. 102, 56 (2018).
- Faham et al. [2015] C. Faham, V. Gehman, A. Currie, A. Dobi, P. Sorensen, and R. Gaitskell, Measurements of wavelength-dependent double photoelectron emission from single photons in VUV-sensitive photomultiplier tubes, JINST 10, P09010 (2015).
- Jörg et al. [2023] F. Jörg, S. Li, J. Schreiner, H. Simgen, and R. F. Lang, Characterization of a Rn source for low-energy electronic recoil calibration of the XENONnT detector, JINST 18, P11009 (2023).
- E. Aprile et al. [2023b] E. Aprile et al. (XENON collaboration), Low-energy calibration of XENON1T with an internal Ar source, Eur. Phys. J. C 83, 1 (2023b).
- D. Wenz [2023] D. Wenz, Ph.D. thesis, Johannes Gutenberg-Universität Mainz (2023).
- E. Aprile et al. [2024c] E. Aprile et al. (XENON collaboration), Erratum to XENON1T Dark Matter Data Analysis: Signal & Background Models, and Statistical Inference, Phys. Rev. D (2024c), in preparation.
- Baudis et al. [2021] L. Baudis, P. Sanchez-Lucas, and K. Thieme, A measurement of the mean electronic excitation energy of liquid xenon, Eur. Phys. J. C 81, 1060 (2021).
- Boulton et al. [2017] E. M. Boulton et al., Calibration of a two-phase xenon time projection chamber with a 37Ar source, JINST 12, P08004 (2017).
- D. S. Akerib et al. [2017] D. S. Akerib et al. (LUX collaboration), Ultralow energy calibration of LUX detector using 127Xe electron capture, Phys. Rev. D 96, 112011 (2017).
- D. S. Akerib et al. [2016a] D. S. Akerib et al. (LUX collaboration), Tritium calibration of the LUX dark matter experiment, Phys. Rev. D 93, 072009 (2016a).
- Aprile et al. [2005] E. Aprile, K. L. Giboni, P. Majewski, K. Ni, M. Yamashita, R. Hasty, A. Manzur, and D. N. McKinsey, Scintillation response of liquid xenon to low energy nuclear recoils, Phys. Rev. D 72, 072006 (2005).
- E. Aprile et al. [2006] E. Aprile et al. (XENON collaboration), Simultaneous measurement of ionization and scintillation from nuclear recoils in liquid xenon as target for a dark matter experiment, Phys. Rev. Lett. 97, 081302 (2006).
- E. Aprile et al. [2009] E. Aprile et al. (XENON collaboration), New Measurement of the Relative Scintillation Efficiency of Xenon Nuclear Recoils Below 10 keV, Phys. Rev. C 79, 045807 (2009).
- E. Aprile et al. [2013] E. Aprile et al. (XENON collaboration), Response of the XENON100 Dark Matter Detector to Nuclear Recoils, Phys. Rev. D 88, 012006 (2013).
- Plante et al. [2011] G. Plante, E. Aprile, R. Budnik, B. Choi, K.-L. Giboni, L. W. Goetzke, R. F. Lang, K. E. Lim, and A. J. Melgarejo Fernandez, New Measurement of the Scintillation Efficiency of Low-Energy Nuclear Recoils in Liquid Xenon, Phys. Rev. C 84, 045805 (2011).
- P. Sorensen et al. [2009] P. Sorensen et al. (XENON collaboration), The scintillation and ionization yield of liquid xenon for nuclear recoils, Nucl. Instrum. Meth. A 601, 339 (2009).
- Manzur et al. [2010] A. Manzur, A. Curioni, L. Kastens, D. N. McKinsey, K. Ni, and T. Wongjirad, Scintillation efficiency and ionization yield of liquid xenon for mono-energetic nuclear recoils down to 4 keV, Phys. Rev. C 81, 025808 (2010).
- D. S. Akerib et al. [2016b] D. S. Akerib et al. (LUX collaboration), Low-energy (0.7-74 keV) nuclear recoil calibration of the LUX dark matter experiment using D-D neutron scattering kinematics, preprint (2016b), arXiv:1608.05381 [physics.ins-det] .
- D. S. Akerib et al. [2022] D. S. Akerib et al. (LUX collaboration), Improved Dark Matter Search Sensitivity Resulting from LUX Low-Energy Nuclear Recoil Calibration, preprint (2022), arXiv:2210.05859 [physics.ins-det] .
- Foreman-Mackey et al. [2013] D. Foreman-Mackey, D. W. Hogg, D. Lang, and J. Goodman, emcee: The MCMC hammer, PASP 125, 306 (2013).
- Foreman-Mackey, Daniel and others [2019] Foreman-Mackey, Daniel and others, emcee v3: A Python ensemble sampling toolkit for affine-invariant MCMC, J. Open Source Softw. 4, 1864 (2019), arXiv:1911.07688 [astro-ph.IM] .
- XENON collaboration [2022] XENON collaboration, XENONnT/GOFevaluation: v0.1.1 (2022).
- Lewin and Smith [1996a] J. Lewin and P. Smith, Review of mathematics, numerical factors, and corrections for dark matter experiments based on elastic nuclear recoil, Astropart. Phys. 6, 87 (1996a).
- R. H. Helm [1956] R. H. Helm, Inelastic and elastic scattering of 187-MeV electrons from selected even-even nuclei, Phys. Rev. 104, 1466 (1956).
- Lewin and Smith [1996b] J. D. Lewin and P. F. Smith, Review of mathematics, numerical factors, and corrections for dark matter experiments based on elastic nuclear recoil, Astropart. Phys. 6, 87 (1996b).
- Smith et al. [2007] M. C. Smith et al., The RAVE Survey: Constraining the Local Galactic Escape Speed, Mon. Not. Roy. Astron. Soc. 379, 755 (2007).
- McCabe [2014] C. McCabe, The Earth’s velocity for direct detection experiments, JCAP 02, 027 (2014).
- Schoenrich et al. [2010] R. Schoenrich, J. Binney, and W. Dehnen, Local Kinematics and the Local Standard of Rest, Mon. Not. Roy. Astron. Soc. 403, 1829 (2010).
- Bland-Hawthorn and Gerhard [2016] J. Bland-Hawthorn and O. Gerhard, The galaxy in context: Structural, kinematic, and integrated properties, Annu. Rev. Astron. Astrophys. 54, 529 (2016).
- Abuter, R. et al. [2021] Abuter, R. et al. (GRAVITY Collaboration), Improved gravity astrometric accuracy from modeling optical aberrations, A&A 647, A59 (2021).
- Baxter et al. [2021] D. Baxter et al., Recommended conventions for reporting results from direct dark matter searches, Eur. Phys. J. C 81, 907 (2021).
- Klos et al. [2013] P. Klos, J. Menéndez, D. Gazit, and A. Schwenk, Large-scale nuclear structure calculations for spin-dependent wimp scattering with chiral effective field theory currents, Phys. Rev. D 88, 083516 (2013).
- Pearson [1901] K. Pearson, LIII. On lines and planes of closest fit to systems of points in space, London Edinburgh Dublin Philos. Mag. & J. Sci. 2, 559 (1901).
- Aprile et al. [2020] E. Aprile et al. (XENON collaboration), Projected WIMP sensitivity of the XENONnT dark matter experiment, JCAP 11, P031 (2020).
- Madland et al. [1999] D. G. Madland et al., SOURCES 4A: A code for calculating (alpha,n), spontaneous fission, and delayed neutron sources and spectra 10.2172/15215 (1999).
- E. Aprile et al. [2017] E. Aprile et al. (XENON collaboration), Material radioassay and selection for the XENON1T dark matter experiment, Eur. Phys. J. C 77, 890 (2017).
- E. Aprile et al. [2022a] E. Aprile et al. (XENON collaboration), Material radiopurity control in the XENONnT experiment, Eur. Phys. J. C 82, 599 (2022a).
- S. Agostinelli et al. [2003] S. Agostinelli et al. (Geant4 collaboration), Geant4: A Simulation toolkit, Nucl. Instrum. Meth. A 506, 250 (2003).
- J. Allison et al. [2016] J. Allison et al. (Geant4 collaboration), Recent developments in Geant4, Nucl. Instrum. Meth. A 835, 186 (2016).
- XENON collaboration [2023] XENON collaboration, XENONnT/epix: v0.3.4 (2023).
- XENON collaboration [2022a] XENON collaboration, XENONnT/WFSim: v1.0.2 (2022a).
- XENON collaboration [2022b] XENON collaboration, XENONnT/straxen: v1.8.3 (2022b).
- E. Aprile et al. [2021] E. Aprile et al. (XENON collaboration), Search for Coherent Elastic Scattering of Solar 8B Neutrinos in the XENON1T Dark Matter Experiment, Phys. Rev. Lett. 126, 091301 (2021).
- B. Aharmim et al. [2013] B. Aharmim et al. (SNO collaboration), Combined Analysis of all Three Phases of Solar Neutrino Data from the Sudbury Neutrino Observatory, Phys. Rev. C 88, 025501 (2013).
- M. Agostini et al. [2018] M. Agostini et al. (BOREXINO collaboration), Comprehensive measurement of -chain solar neutrinos, Nature 562, 505 (2018).
- E. Aprile et al. [2018] E. Aprile et al. (XENON collaboration), Dark Matter Search Results from a One TonneYear Exposure of XENON1T, Phys. Rev. Lett. 121, 111302 (2018).
- Aalbers et al. [2022] J. Aalbers et al., AxFoundation/strax: v1.2.3 (2022).
- E. Aprile et al. [2022b] E. Aprile et al. (XENON collaboration), Search for new physics in electronic recoil data from XENONnT, Phys. Rev. Lett. 129, 161805 (2022b).
- Cowan et al. [2011a] G. Cowan, K. Cranmer, E. Gross, and O. Vitells, Asymptotic formulae for likelihood-based tests of new physics, Eur. Phys. J. C 71, 1554 (2011a).
- Aalbers et al. [2021] J. Aalbers, K. D. Morå, and B. Pelssers, JelleAalbers/blueice: v1.1.0 (2021).
- G. J. Feldman and R. D. Cousins [1998] G. J. Feldman and R. D. Cousins, Unified approach to the classical statistical analysis of small signals, Phys. Rev. D 57, 3873 (1998).
- Patrignani [2016] C. Patrignani (Particle Data Group collaboration), Review of Particle Physics, Chin. Phys. C40, 100001 (2016).
- A. L. Read [2002] A. L. Read, Presentation of search results: the cls technique, J. Phys. G: Nucl. Part. Phys. 28, 2693 (2002).
- Kashyap et al. [2010] V. L. Kashyap, D. A. van Dyk, A. Connors, P. E. Freeman, A. Siemiginowska, J. Xu, and A. Zezas, On computing upper limits to source intensities, ApJ 719, 900 (2010).
- Cowan et al. [2011b] G. Cowan, K. Cranmer, E. Gross, and O. Vitells, Power-constrained limits, preprint (2011b), arXiv:1105.3166 [physics.data-an] .
- Priel et al. [2017] N. Priel, L. Rauch, H. Landsman, A. Manfredini, and R. Budnik, A model independent safeguard against background mismodeling for statistical inference, JCAP 2017, 013 (2017).
- J. R. Klein and Roodman [2005] J. R. Klein and A. Roodman, Blind analysis in nuclear and particle physics, Annu. Rev. Nucl. Part. Sci. 55, 141 (2005).
- Aprile and Doke [2010] E. Aprile and T. Doke, Liquid Xenon Detectors for Particle Physics and Astrophysics, Rev. Mod. Phys. 82, 2053 (2010).
- Doke et al. [1976] T. Doke, A. Hitachi, S. Kubota, A. Nakamoto, and T. Takahashi, Estimation of fano factors in liquid argon, krypton, xenon and xenon-doped liquid argon, Nucl. Instr. and Meth. 134, 353 (1976).
- Lindhard et al. [1963] J. Lindhard, M. Scharff, and H. E. Schioett, Range concepts and heavy ion ranges (notes on atomic collisions, II), Kgl. Danske Videnskab. Selskab. Mat. Fys. Medd. 33, (1963).
- Thomas and Imel [1987] J. Thomas and D. A. Imel, Recombination of electron-ion pairs in liquid argon and liquid xenon, Phys. Rev. A 36, 614 (1987).
- D. M. Mei et al. [2008] D. M. Mei, Z. B. Yin, L. C. Stonehill, and A. Hime, A model of nuclear recoil scintillation efficiency in noble liquids, Astropart. Phys. 30, 12 (2008).
Appendix A Details on the ER Emission Model
Let be the recoil energy of an ER event. The total number of detectable quanta is sampled from a normal distribution
(8) |
is the mean energy to generate a quantum and is the Fano factor [73, 74]. The ER energy deposit will produce both excitons and electron-ion pairs. Using the mean ratio between number of excitons and ions , we simulate their numbers from a binomial distribution:
(9) |
(10) |
A fraction of ions recombine with electrons, depending on the electric drift field . We parametrize the mean value of in the same way as in XENON1T [9]:
(11) |
where
(12) |
The fluctuation of is parametrized via
(13) |
and the true recombination fraction is then sampled from
(14) |
All fitted ER parameters are listed in Tab. 2. Finally, the numbers of produced electrons and photons are given by
(15) |
(16) |
Appendix B Details on the NR Emission Model
Let be the recoil energy of an NR event. The total number of produced quanta is
(17) |
In contrast to electron recoils, recoiling xenon nuclei lose a significant amount of their recoil energy via atomic motion and collisions with other xenon atoms, which are processes that are undetectable in a LXe TPC. This quanta loss is modeled following the Lindhard theory [75], with the so-called Lindhard quenching factor , such that the number of detectable quanta becomes
(18) |
Following the parametrization in [14], the Lindhard factor is given by
(19) |
where is a function of deposited energy via
(20) |
(21) |
with the nuclear charge number for xenon. The numbers of produced ions and excitons are then simulated by
(22) |
(23) |
with the exciton-to-ion ratio parametrized as
(24) |
Unlike ERs, the recombination fluctuation in NRs is usually small, thus the number of photons produced from the recombination of electron-ion pairs is
(25) |
where follows the Thomas-Imel box model [76]
(26) |
(27) |
The number of excitons is further reduced by bi-excitonic and Penning quenching effects [77], such that the number of photons produced from de-excitation becomes
(28) |
with the scintillation quenching factor
(29) |
Then the final numbers of photons and electrons are
(30) |
(31) |
Appendix C Details on the Detector Response Model
The simulations of and signals from and are almost independent of each other.
To simulate signals, we first introduce the spatially dependent scintillation gain with DPE excluded:
(32) |
where is the averaged scintillation gain, is the relative spatial correction factor, and is the probability of the double photoelectron emission effect [17]. Then, for a given number of photons and the position of the recoil , , , the number of photons detected by the PMTs is
(33) |
Now accounting for the DPE effect, the expected number of photoelectrons (PE) in the S1 signal is simulated via
(34) |
(35) |
However, the is not always equal to the number of PEs due to reconstruction bias. This is modeled as
(36) |
Here, both , are functions of obtained from the XENONnT MC. In the end, the relative spatial correction is applied back to the signal, yielding
(37) |
where are the event positions as reconstructed in data, which are smeared by the S2-size-dependent resolution of the position reconstruction.
To get the signal from , the first step is to simulate the electron loss during the drift. The survival probability due to attachment to electronegative impurities in LXe is given by
(38) |
where is the electron drift survival time (“electron lifetime”) and is the drift velocity of electrons in LXe. Inside the FV the electric field is uniform enough to approximate as a constant. Electron losses due to drift field effects close to the wall are accounted for via a spatially dependent charge-insensitive-volume (CIV) probability function from field simulations of XENONnT [7]. The number of surviving electrons is then given by
(39) |
At the liquid-gas interface, a fraction of electrons are extracted,
(40) |
The extraction efficiency is - dependent and can be calculated by
(41) |
where is the averaged ionization gain, is the relative spatial correction factor, and is the single electron gain. Assuming the fluctuation of the secondary scintillation process is Poisson-like, the expected number of PEs is
(42) |
Similar to , the reconstruction bias is modeled by
(43) |
Finally, the correction is applied to the ,
(44) |
Here, is the electron lifetime used in the correction, which could be slightly off from the true value. The difference is one of the parameters to fit in the band fitting.
For both S1 and S2 signals, data quality selection efficiencies are applied which depend on the respective signal sizes. For S1 signals, we additionally apply the S1 reconstruction efficiency which is a function of the number of detected photons .
For NR multi-scatter events, the simulation is slightly altered. Here the total number of detected photons is defined as the sum of the detected photons for each single energy deposition ,
(45) |
Similarly, we sum over the number of S2 PEs of only those scatters that contribute to the S2,
(46) |
Reconstruction biases, signal corrections, and selection efficiencies are correspondingly applied to the merged signals.
Appendix D Details on the Calibration Fit Results
The parameters in the ER and NR band fits are summarized in Tab. 2, showing both the applied prior and the marginal posterior values. Parameters where prior values are given without uncertainties are fixed in the fit. The prior for is a uniform distribution in the stated range. For the posteriors, the stated values and uncertainties correspond to the median and the central 68% of the marginal posterior distributions.