Preprint typeset using LATEX style emulateapj v. 08/22/09
A NEW MAXIMUM-LIKELIHOOD TECHNIQUE FOR RECONSTRUCTING COSMIC-RAY ANISOTROPY AT ALL ANGULAR SCALES
arXiv:1601.07877v2 [astro-ph.HE] 18 May 2016
´lez1,3 , D.W.Fiorino1,4 , and S.Westerhoff1 M.Ahlers1,† , S.Y.BenZvi2 , P.Desiati1 , J.C.D´ıaz–Ve
ABSTRACT The arrival directions of TeV–PeV cosmic rays show weak but significant anisotropies with relative intensities at the level of one per mille. Due to the smallness of the anisotropies, quantitative studies require careful disentanglement of detector effects from the observation. We discuss an iterative maximum-likelihood reconstruction that simultaneously fits cosmic ray anisotropies and detector acceptance. The method does not rely on detector simulations and provides an optimal anisotropy reconstruction for ground-based cosmic ray observatories located in the middle latitudes. It is particularly well suited to the recovery of the dipole anisotropy, which is a crucial observable for the study of cosmic ray diffusion in our Galaxy. We also provide general analysis methods for recovering large- and small-scale anisotropies that take into account systematic effects of the observation by ground-based detectors. Subject headings: cosmic rays — reference systems — methods: data analysis 1. INTRODUCTION
During the past decade, a number of cosmic-ray, γray, and neutrino observatories have found anisotropies in the arrival directions of Galactic cosmic rays at TeV and PeV energies (Guillian et al. 2007; Amenomori et al. 2006; Abdo et al. 2008, 2009; Abbasi et al. 2011, 2012; Amenomori et al. 2012; Aartsen et al. 2014; Bartoli et al. 2013; Abeysekara et al. 2014; Bartoli et al. 2015). For a recent summary of experimental results, we refer to Di Sciascio & Iuppa (2014). The statistics and resolution of these experiments allow for a two-dimensional reconstruction in the form of anisotropy sky maps. The studies have revealed significant anisotropies at both large and small angular scales. At the largest scales, the anisotropy is approximately dipolar and has a relative intensity on the order of 10−3 . An explanation of the amplitude and phase of the dipole anisotropy is challenging, but the observations are qualitatively consistent with diffusive propagation of cosmic rays from Galactic sources (Ptuskin et al. 2006; Erlykin & Wolfendale 2006; Blasi & Amato 2012; Mertsch & Funk 2015). Medium- and small-scale anisotropies have been observed at the 10−4 level. These features are less understood but could be a combined effect of nearby cosmicray sources (Salvati & Sacco 2008; Biermann et al. 2013) and the local interstellar magnetic field structure, which can introduce an energy-dependent magnetic mirror leakage (Drury & Aharonian 2008), preferred cosmic-ray transport directions (Malkov et al. 2010), or magnetic lenses (Battaner et al. 2011, 2015). The observed power spectrum of cosmic-ray anisotropies agrees well with the expected effect of cosmic-ray scattering in the local magnetic field (Giacinti & Sigl 2012; Ahlers 2014; Ahlers & 1 WIPAC & Department of Physics, University of Wisconsin– Madison, Madison, WI 53706, USA 2 Department of Physics & Astronomy, University of Rochester, Rochester, NY 14627, USA 3 Centro Universitario de los Valles, Universidad de Guadalajara, Guadalajara, Jalisco 44130, M´ exico 4 now at Department of Physics, University of Maryland, College Park, MD, USA †
[email protected]
Mertsch 2015; L´opez-Barquero et al. 2015). The possible influence of the heliosphere via magnetic reconnections in the heliotail (Lazarian & Desiati 2010), nonisotropic particle transport in the heliosheath (Desiati & Lazarian 2013) or the heliospheric electric field structure (Drury 2013) has also been considered. More exotic origin models invoke strangelet production in molecular clouds (Kotera et al. 2013) or in neutron stars (PerezGarcia et al. 2014). The measurement of cosmic-ray anisotropies with relative intensity below 10−3 is an observational challenge, since it is necessary to account for minuscule variations in the acceptance and uptime of the detector carrying out the measurement. For illustration, Fig. 1 shows a simulated realization of the cosmic-ray anisotropy following the model of Ahlers (2014). This model was chosen due to its qualitative agreement with the observation of anisotropy in TeV cosmic rays by IceCube (Aartsen et al. 2014) and HAWC (Abeysekara et al. 2014). The cosmic-ray anisotropy is expected to remain constant in the celestial coordinate system over the period of the observation. Ground-based cosmic observatories with a limited field of view, however, are exposed to different parts of the celestial sphere as the Earth rotates during one sidereal day. As an example, the sky map of Fig. 1 indicates the instantaneous field of view of the HAWC detector (Abeysekara et al. 2014) (at latitude 19◦ N) at a local sidereal time of 9h. Hence, the observed event distribution accumulated over many sidereal days depends not only on the cosmic-ray anisotropy but also on the nonuniform and time-dependent detector exposure. It is therefore necessary to reconstruct a reference map that represents the response of the detector to an isotropic cosmic-ray flux. For ground-based detectors, in which the atmosphere acts as part of the observatory, sufficiently accurate simulations of the detector exposure are usually not achievable. Therefore, most analyses use the data themselves to estimate the relative intensity and detector exposure simultaneously. Examples of this technique are the timescrambling (Alexandreas et al. 1993) and directintegration (Atkins et al. 2003) methods, in which the
2
Ahlers, BenZvi, Desiati, D´ıaz–V´elez, Fiorino & Westerhoff anisotropy
tion 5 before concluding in Section 6.
North
60◦
2. MAXIMUM LIKELIHOOD METHOD 30◦
Zenith East 300◦
240◦
West 180◦
120◦
60◦
−30◦
E(t, ϕ, θ) ' E(t)A(ϕ, θ) .
South −60◦
-0.0008
In the following, we will assume that the total accumulated detector exposure E can be expressed as a product of its angular-integrated exposure E and relative acceptance A in terms of azimuth angle ϕ (from north increasing to the east) and zenith angle θ as
Equatorial
0.0008
Figure 1. Simulated cosmic ray anisotropy in equatorial coordinates using the model of Ahlers (2014). For illustration, we indicate the instantaneous field of view of the HAWC observatory (at latitude 19◦ north) at a local sidereal time of 9h and a zenith angle cut at 60◦ . The time-integrated field of view corresponds to the declination range −41◦ < δ < 79◦ .
rate of events observed in a detector as a function of local sidereal time is integrated against the relative acceptance of the detector during an integration period (or scrambling interval) ∆t. The idea behind this method is that variations in the event rates introduced by the cosmic-ray anisotropies will average out to some extent as the detector observes different parts of the celestial sphere over the course of each sidereal day. The result is an estimate of the number of events expected in the detector between time t and t + ∆t. Subtracting the expected number of events from the actual observations yields a residual counts map which can be explored for anisotropy. The integration time interval ∆t acts as an effective smoothing parameter for the counts map, since the method averages cosmic-ray arrival directions over angular scales of 15◦ (∆t/1hr). In principle, choosing ∆t = 24h would produce a residual counts map with features covering the full sky (360◦ ). However, for detectors located in the middle latitudes, the instantaneous exposure of the detector does not match the full daily exposure, since the 24h integrated field of view is much larger than the instantaneous field of view, cf. Fig 1. As a result, large-scale structures in the residual counts map, in particular the dipole, are strongly attenuated when using these methods. To improve estimates of large-scale anisotropy using detectors in the middle latitudes, we describe a maximum likelihood construction that can be used to disentangle the anisotropy from detector effects. The technique is based on the same ansatz used by the time-scrambling or direct-integration method, that the total accumulated exposure of the detector can be factorized into a timedependent event rate and a time-independent relative acceptance map. We begin by describing the technique in Section 2. In Section 3, we apply the maximumlikelihood method to simulated data and show that largeand small-scale anisotropies can be reconstructed with relatively little of the distortion observed in the directintegration or time-scrambling techniques. We compare our method to alternative techniques in Section 4. We then discuss analysis methods of large- and small-scale anisotropies of the reconstructed anisotropy maps in Sec-
(1)
Without loss of generality, we R require that the relative acceptance is normalized to dΩA(ϕ, θ) = 1. This approximation assumes that the relative acceptance of the detector remains approximately constant over time. The ansatz is identical to the approach used in direct integration or time scrambling. Let us also assume that the flux of cosmic rays at the energies of interest remains constant as a function of time, varying only as a function of celestial longitude α (right ascension) and latitude δ (declination). Due to the strong diffusion of cosmic rays in the Galactic environment, the flux is dominated by an isotropic term φiso . Hence, the total flux can be expressed as φ(α, δ) = φiso I(α, δ) ,
(2)
where I(α, δ) is the relative intensity of the flux as a function of position in the sky. The anisotropy is defined as the deviation δI = I − 1 1. Note that this ansatz ignores anisotropies associated with the relative motion of the Earth with respect to the Sun. We will come back to this subtlety in the discussion section. The local horizontal coordinate system and the celestial (or equatorial) coordinate system are related via a time-dependent transformation. We define n = (cos α cos δ, sin α cos δ, sin δ) as the unit vector corresponding to the coordinates (α, δ) in the right-handed equatorial system. Similarly, the unit vector corresponding to the coordinates (θ, ϕ) in the right-handed local system is n0 = (cos ϕ sin θ, − sin ϕ sin θ, cos θ). The two unit vectors are related via a time-dependent coordinate transformation n0 = R(t)n. For an experiment located at geographic latitude Φ and longitude Λ (measured east from Greenwich), the transformation is ! − cos ωt sin Φ − sin ωt sin Φ cos Φ sin ωt − cos ωt 0 R(t) = , (3) cos ωt cos Φ sin ωt cos Φ sin Φ where ω = 2π/24h and the local sidereal time t is related to the sidereal time at Greenwich t0 by t = t0 + Λ/ω. To simplify calculations on the local and celestial spheres, the sky is binned into pixels of equal area ∆Ω using the HEALPix parametrization of the unit sphere (Gorski et al. 2005). To make the equations more transparent, we use roman indices for pixels in the local sky map and fraktur indices for pixels in the celestial sky map. Time bins are indicated by greek indices. For instance, the data observed at a fixed sidereal time bin τ can be described in terms of the observation in local horizontal sky with bin i as nτ i or transformed into the celestial sky map with bin a as nτ a . Consider an angular element of the local coordinate sphere ∆Ωi corresponding to coordinates (θi , ϕi ). The
Maximum-Likelihood Technique for Reconstructing Cosmic-Ray Anisotropy number of cosmic rays expected from this location in a sidereal time interval ∆tτ with central value tτ is µτ i ' Iτ i Nτ Ai ,
(4)
where Nτ ≡ ∆tτ φiso E(tτ ) gives the expected number of isotropic background events in sidereal time bin τ . The quantity Ai ≡ ∆Ωi A(θi , ϕi ) is the binned relative acceptance of the detector for angular element i, and Iτ i ≡ I(R(tτ )n0 (Ωi )) is the relative intensity observed in the local horizontal system during time bin τ . For simplicity, all expressions that follow assume equal bin sizes ∆Ω = 4π/Npix on the local and celestial spheres and Ntime equal sidereal bins with ∆t = 24h/Ntime . However, a more general binning is also possible. Given µτ i , the likelihood of observing n cosmic rays is given by the product of Poisson probabilities Y (µτ i )nτ i e−µτ i L(n|I, N , A) = , (5) nτ i ! τi
3
Because of this degeneracy, we must choose whether to account for the local excess of cosmic rays as originating in an anisotropy in their relative intensity or originating in a variation in the local acceptance. AR natural choice is that the anisotropy is normalized to dαδI(α, δ)R = 0 for all declinations δ, consistent with the definition dΩδI(α, δ) = 0. This condition can also be formulated in terms of a spherical harmonics expansion of the relative intensity in the equatorial coordinate system (α, δ), as pointed out by Iuppa & Di Sciascio (2013). In general, the relative intensity can be decomposed as a sum over spherical harmonics Y `m , δI a =
` X X
b a`m Ya`m .
(12)
`≥1 m=−`
(7)
Our normalization condition can then be expressed as the condition b a`0 = 0 for all `. This projection significantly reduces the reconstruction of the low-` multipole components of the anisotropy, as we will discuss in the following sections. Note that the true multipole moments b a`m are an (infinite) superposition of the pseudo multipole moments a`m , which are defined as in Eq. (12), but for the product of the relative intensity with the weight function w of the field of view. Provided that the weight function is azimuthally symmetric, w(α, δ) = w(δ), the true multipole moments b a`0 are a linear superposition of pseudo multipole moments a`0 0 . In practice, we can hence use the normalization condition a`0 = 0 for all ` to ensure b a`0 = 0 for all `. In terms of the binned relative intensity and function this is equivalent to the condition P weight `0 a wa Ya δIa = 0 for all `.
To allow for the possibility of anisotropy, we maximize the likelihood ratio L(n|I, N , A) λ= (8) L(n|I (0) , N (0) , A(0) )
2.2. Maximum Likelihood Algorithm The maximum (I ? , N ? , A? ) of the likelihood ratio (8) must obey the implicit relations .X X Ia? = nτ a A?κa Nκ? , (13)
where nτ i is the number of events observed in the local pixel i during time bin τ . This likelihood can be maximized to provide estimators of the relative acceptance function Ai and the expected isotropic background count Nτ . Consider the case of the null hypothesis of no (0) anisotropy, which P we denote Ia = 1. Given the boundary condition i Ai = 1, the maximum likelihood estimators of Ai and Nτ are X Nτ(0) = nτ i , (6) i (0) Ai
=
X τ
nτ i
.X
nκj .
κj
τ
of signal over null hypothesis in N , A, and I. 2.1. Invariance under Declination-Scaling We will demonstrate in the following that the maximum-likelihood method can be used to search for anisotropy on all angular scales. However, before we continue, note that events recorded in a fixed position in the local coordinate system can only probe the cosmic-ray flux in a fixed declination band δ. Hence, the expectation values (4) are invariant under the rescaling I → I 0 ≡ I/a(δ)/b , N → N 0 ≡ N bc , A → A0 ≡ Aa(δ)/c ,
(9) (10) (11)
where a(δ) is an arbitrary function of declination and the normalization factors b and c are defined such that P P 0 0 τ Aτ = 1 and a δIa = 0 for the new values. In other words, the maximum-likelihood method is sensitive to anisotropy in right ascension but is insensitive to variations in intensity across declination bands. This is also known to be a limitation of other reconstruction methods, like direct integration or time scrambling (Amenomori et al. 2005; Iuppa & Di Sciascio 2013).
Nτ?
=
X
κ
nτ i
i
A?i =
X τ
.X
A?j Iτ?j ,
(14)
? Nκ? Iκi ,
(15)
j
nτ i
.X κ
P P ? `0 ? together with a wa Ya δIa = 0 and i Ai = 1. In Eq. (13), we introduced the binned quantity Aτ a ≡ ∆Ωa A(RT (tτ )n(Ωa )) corresponding to the relative acceptance seen in the equatorial coordinate system in pixel a during time bin τ . Equations (13) to (15) correspond to a nonlinear set of equations that cannot be solved in an explicit form. However, one can approach the best-fit solution via the following iterative method: (i) Initialize at the maximum of the null hypothesis, (I (0) , N (0) , A(0) ). (ii) Evaluate I (n+1) by inserting (I (n) , N (n) , A(n) ) into the right-hand side of Eq. (13). (iii) Remove the m = 0 (pseudo) multipole moments
4
Ahlers, BenZvi, Desiati, D´ıaz–V´elez, Fiorino & Westerhoff 3
of δI (n+1) , i.e., in the equatorial coordinate system P (n+1) `0 → 0. a wa Ya δIa ,N
(n)
,A
)
(v) Evaluate A(n+1) by inserting (I (n+1) , N (n+1) , A(n) ) into the right-hand side of Eq. (15). (n+1)
(n+1)
(n+1)
(vi) Renormalize N and A as N → N (n+1) c and A(n+1) → A(n+1) /c with normalizaP (n+1) tion factor c = i Ai .
2 [10−4 ]
(iv) Evaluate N by inserting (I into the right-hand side of Eq. (14).
(n)
3. SIMULATION AND PERFORMANCE To demonstrate the reconstruction of a cosmic-ray anisotropy with the maximum likelihood technique, we simulated a set of arrival directions based on the anisotropy shown in Fig. 1. The simulation is based on a random realization of a relative intensity δI, which follows a power-law spectrum with ` ≥ 1 of the form C` = 10−7 (18/(2` + 1)/(` + 1)/(` + 2)), after the model of Ahlers (2014). This model was chosen due to its qualitative agreement with the observation of anisotropy in TeV cosmic rays by IceCube (Aartsen et al. 2014) and HAWC (Abeysekara et al. 2014). The celestial sphere is binned following the HEALPix parametrization (Gorski et al. 2005) with parameter nside = 64 corresponding to Npix = 49, 152 pixels with a binsize of about 1◦ diameter. The simulated detector is located at geographic coordinates (Φ = 19◦ N and Λ = 97◦ W), the location of the HAWC observatory (Abeysekara et al. 2014). The instantaneous field of view is restricted to zenith angles below 60◦ . A projection of the instantaneous field of view at 9h local sidereal time onto the equatorial coordinate system was already shown in Fig. 1. We assume that the relative detector acceptance at any time follows A(θ, ϕ) ∝ cos θ[1 + A sin θ sin2 (ϕ − ϕ0 )] with A = 0.2 and ϕ0 = 10◦ . The local acceptance maps are reconstructed with the same resolution as the anisotropy map. The expected isotropic event number is binned in Ntime = 360 sidereal time bin size and P bins with 4 min 10 normalized such that τ Nτ ' 5 × 10 , corresponding to the integrated event number of the recent HAWC analysis (Abeysekara et al. 2014). We introduce a simple statistical toy model for the variation of the expected background rate Nτ ; the accumulated data are expected to be stable over periods sampled from an exponential distribution with expectation value of 20 min (5 sidereal
0
−1 −2 −3
(vii) Repeat from step (ii) until the solution has sufficient convergence, i.e., the likelihood ratio in Eq. (8) has ∆χ2 ' 2 ln(λ(n+1) /λ(n) ) 1.
0
3
6
9 12 15 local sidereal time [h]
18
21
24
300 250 2 ln(λ (n) /λ (1) )
Note that the cosmic-ray anisotropy obtained in the first iteration step, δI (1) , corresponds to the result that would be obtained by the method of direct integration (Atkins et al. 2003). This method was also used in the recent HAWC analysis (Abeysekara et al. 2014). The successive iteration steps of the maximum-likelihood method reoptimize the relative acceptance A and the isotropic background rate N for a given anisotropy. We will study this optimization process more quantitatively in the following section.
1
(n) /N (0) − 1
(n+1)
N
(n+1)
200 150 100 50 0
5
10 iteration step
15
20
Figure 2. Top panel: The relative optimization of the isotropic expectation value in terms of N (n) /N (0) − 1 for 20 iteration steps (light to dark colors). Bottom panel: The progressive loglikelihood values of the iteration. Note that the method already converges after about 10 iteration steps.
time bins). In each stable period, we assume that Nτ has a normal fluctuation of 1% around the background expectation. This model mimics the data variation of the analysis by Abeysekara et al. (2014). The results of the reconstruction6 are shown in Figs. 2 and 3. The top panel of Fig. 2 shows the optimization of the reconstructed background expectation N (n) as the relative quantity N (n) /N (0) − 1. The iteration renormalizes N (n) at the level of 10−4 depending on the local sidereal time. The relative features introduced by the optimization process are easy to understand. For instance, the bump at 9h local sidereal time corresponds to the instantaneous field of view indicated in Fig. 1. In comparison with the simulated anisotropy shown in Fig. 1, one can notice that at this time the detector observes a part of the sky with a strong underfluctuation. The initial estimate N (0) is therefore too low and the iteration compensates for this effect. The bottom panel of Fig. 2 shows the consecutive log-likelihood values of the iteration. Note that for this simulated data set the iteration only requires about 10 steps to converge. We run the reconstruction for 10 more iteration steps to verify the convergence and stability of the method. The map in Fig. 3a shows the expected relative intensity smoothed with a top-hat function with 10◦ radius after application of the m = 0 filter. The maps 6 We implemented the iterative algorithm as a Python program, which we can provide to the interested reader upon request.
Maximum-Likelihood Technique for Reconstructing Cosmic-Ray Anisotropy
5
(a) anisotropy with a`0 → 0 and 10◦ smoothing 60◦
30◦
300◦
240◦
180◦
120◦
60◦
−30◦
−60◦
-0.0006
Equatorial
0.0006
(b)
(c) = (b)−(a)
reconstructed anisotropy (10◦ smoothed) / iteration 1
residual anisotropy (10◦ smoothed) / iteration 1 60◦
60◦
30◦
300◦
240◦
180◦
120◦
60◦
30◦
300◦
240◦
180◦
120◦
60◦
−30◦
−60◦
-0.0006
−30◦
−60◦
Equatorial
0.0006
-0.0003
Equatorial
0.0003
(d)
(e) = (d)−(a)
reconstructed anisotropy (10◦ smoothed) / iteration 20
residual anisotropy (10◦ smoothed) / iteration 20 60◦
60◦
30◦
300◦
240◦
180◦
120◦
60◦
30◦
300◦
240◦
180◦
120◦
60◦
−30◦
−60◦
-0.0006
−30◦
−60◦
Equatorial
0.0006
-0.0003
Equatorial
0.0003
10◦
Figure 3. Panel (a): Expected relative intensity with m = 0 multipole filter (see text) and smoothing. Panel (b/d): Reconstructed relative intensity for the first and last iteration step, respectively. Panel (c/e): Residual anisotropy after subtracting the expected anisotropy in panel (a) from the reconstructed anisotropy in panels (b) and (d), respectively.
in Figs. 3b and 3d show the reconstructed anisotropy in the first iteration step (corresponding to the result from direct integration) and after 20 iterations, respectively. Comparing the maps in Figs. 3b and 3a one notices that, qualitatively, the expected small-scale features are already reproduced in the first iteration step. However, the difference map in Fig. 3c indicates, that the residual map has large-scale features that are misreconstructed. On the other hand, the corresponding difference map in Fig. 3e after 20 iterations is closer to the expected map in Fig. 3a. The residual anisotropy map shown in Fig. 3e is re-
lated to the Poisson variation of the event rate. We can make this statement more quantitative via a power spectrum analysis of expected and reconstructed anisotropy maps. The relative intensity can be decomposed as a sum over spherical harmonics, as in Eq. (12). Unfortunately, due to the limited integrated field of view, the true coefficients b a`m cannot be unambiguously reconstructed. However, for the present discussion of residual anisotropies in the iterative method, it is sufficient to study the pseudo multipole moments, a`m , corresponding to the harmonic expansion of the anisotropy multiplied by the weight function of the field of view. In our
6
Ahlers, BenZvi, Desiati, D´ıaz–V´elez, Fiorino & Westerhoff 10−7
pseudo power spectrum
iteration 20 iteration 1 expected
10−8
10−9
10−10
Nevents = 5 × 1010 & θzenith < 60◦ 2
4 6 multipole moment `
8
10
Figure 4. Pseudo power spectra of reconstructed and expected anisotropy maps. The error bars on the power of the reconstructed anisotropy is indicating the variance from Poisson statistics estimated from 100 runs with the same anisotropy and detector model. Note that the small bias of the reconstructed pseudo power toward higher values for ` ≥ 6 is due to the noise level of the maps as we will show in Section 5.
example, the weight function is simply equal to 1 for declinations −41◦ < δ < 79◦ and 0 otherwise. From these pseudo multipole moments, we can compute the pseudo power spectrum, 1 X C` = |a`m |2 . (16) 2` + 1 m Figure 4 shows the pseudo power spectrum of the anisotropy maps for the first and last iteration step in comparison to the expected spectrum corresponding to the true anisotropy with b a`0 = 0 and multiplied by the weight function. To estimate the variance of the reconstructed power spectra due to Poisson statistics, we repeat the analysis 100 times with low-resolution maps (nside = 16 with bin size ∆θ ' 4◦ ) and show the central 68% range of the data. One can see that the first iteration step drastically underestimates the power of the dipole (` = 1) and quadrupole (` = 2). This was already qualitatively visible in the map of Fig. 3c. On the other hand, the pseudo power spectrum of the last iteration step agrees well with the low-` multipoles. One can also notice a small bias of the ` ≥ 6 multipoles towards larger values. This is due to the noise level of the maps as we will discuss in detail in Section 5. 4. COMPARISON TO OTHER METHODS
Various reconstruction methods have been developed previously to extract the anisotropy from the isotropic background. We have already mentioned the methods of direct integration Atkins et al. (2003) and time scrambling (Alexandreas et al. 1993) in the introduction, which have been used in cosmic-ray anisotropy studies of Super-Kamiokande (Guillian et al. 2007), TibetASγ (Amenomori et al. 2006), Milagro (Abdo et al. 2008), IceCube (Abbasi et al. 2011, 2012; Aartsen et al. 2014), IceTop (Aartsen et al. 2013), HAWC (Abeysekara et al. 2014), and ARGO-YBJ (Bartoli et al. 2013). The time-integration method is closely related to the maximum-likelihood method presented here. The opti-
mal relative detector acceptance (7) and the isotropic background expectation (6) of the null hypothesis (δI = 0) are equivalent to the estimates via direct integration with an integration period of ∆t = 24 h. The map of Fig. 3b shows the resulting anisotropy estimate of this method. Another two-dimensional reconstruction technique is the Forward-Backward method used by Milagro (Abdo et al. 2009), which is closely related to the onedimensional East-West method (Bonino et al. 2011). These methods analyze the relative right-ascension derivative of event rates, ∂α n/n, at each sidereal time, either for individual declination bands (in the case of Forward-Backward) or the entire field of view (in the case of East-West). The anisotropy can then be reconstructed from the first derivative, noting that ∂α δI ' ∂α n/n, up to an overall normalization constant in each declination band. This again reflects the invariance (9) to (11) and the inability to reconstruct the a`0 moments in equatorial coordinates. In general, the Forward-Backward (and the East-West) method has the advantage that the quantity ∂α n/n does not explicitly depend on the instantaneous background level. For instance, the local deficit of background events at a local sidereal time of 9h in the example shown in Fig. 1 does not affect the estimate of the derivative ∂α n/n. However, the reconstruction methods discussed in Abdo et al. (2009) assume that the relative detector acceptance is quasi-symmetric under Forward-Backward (or East-West) reflection, corresponding to the transformation (θ, ϕ) → (θ, −ϕ). This condition can be expressed in terms of the asymmetry ≡ [A(θ, ϕ) − A(θ, −ϕ)]/[A(θ, ϕ) + A(θ, −ϕ)] as || 1. After proper renormalization of the reconstructed derivative terms (Abdo et al. 2009) the leading order effect is an O(2 ) correction of the anisotropy amplitude. We expect that the condition || 1 is met by most observatories. However, our method is also applicable for large asymmetries approaching 2 ' 1. Iterative methods for the reconstruction of the anisotropy have also been developed for and applied to data of Tibet-ASγ (Amenomori et al. 2005, 2010, 2012) and ARGO-YBJ (Bartoli et al. 2015). The Equi-Zenith Angle method (Cui & Yan 2003) estimates the isotropic cosmic-ray background at a celestial bin a to be the average of those acceptance-corrected events that arrived from the same zenith angle band as the field of view wraps around the celestial equator. This corresponds P to a generalization of the ansatz (1) to E = s E s As , where the sum runs over the different equi-zenith-angle sectors with individual background rates E s and relative acceptance As . It is straightforward to generalize our likelihood-based method with this ansatz (see Appendix A). The likelihood-based iteration method presented in this paper has the advantage that it can be derived from a firm statistical approach. If the relative detector acceptance can be regarded as stable in local sidereal time over the entire field of view, it provides an even simpler iterative reconstruction method. 5. HARMONIC ANALYSIS
We now turn to the harmonic analysis of the anisotropy. We have already introduced the harmonic expansion of the relative intensity in Eq. (12). Of par-
Maximum-Likelihood Technique for Reconstructing Cosmic-Ray Anisotropy ticular relevance for the theory of cosmic-ray diffusion is the strength of the dipole components (` = 1), but note that TeV–PeV cosmic-ray data also show significant multipole moments at smaller angular scales, like quadrupole (` = 2), octupole (` = 3), etc. We have already shown in Section 3 that traditional anisotropy methods can significantly underestimate the low-` pseudo power spectrum of the data. This effect can be compensated by the iterative method presented in Section 2. However, this method will not compensate for the loss in power of the m = 0 coefficients of the analysis. The observed spectrum is hence always a systematic underestimation of the true anisotropy. An additional uncertainty comes from the limited integrated field of view of most observatories. In the ideal case of a 4π sky coverage, the multipole moments a`m of the reconstructed anisotropy would carry all the information of the anisotropy (except b a`0 ). However, as already discussed earlier, the partial sky coverage of individual experiments does only allow reconstructing the pseudo multipole moments spectrum of the (reduced) anisotropy multiplied by the weight function. The pseudo multipole moments a`m are related to the true multipole moments b a`m via a linear transformation (see, e.g., the review by Efstathiou (2004)) X a`m = K`m`0 m0 b a`0 m0 , (17)
7
(a) large-scale anisotropy (truncated, ` ≤ 3) / iteration 20 60◦
30◦
300◦
240◦
180◦
120◦
60◦
−30◦
−60◦
-0.0005
0.0005
(b) small-scale anisotropy (` ≤ 3 removed, 10◦ smoothed) / iteration 20 60◦
30◦
300◦
240◦
180◦
120◦
60◦
−30◦
−60◦
Equatorial
`0 m0
where the coupling matrix K depends on the multipole spectrum b`m of the weight function of the field of view (see Appendix B). For an azimuthally symmetric weight function the strength of the mixing between moments a`m and b a`0 m is determined by the moments bk0 with |`−`0 | ≤ k ≤ `+`0 . This mixing can be small for individual moments as pointed out by Denton & Weiler (2015). However, it is important to emphasize that the full multipole spectrum cannot be unambiguously reconstructed from a partial sky coverage since the infinite-dimensional matrix K is not invertible.
-0.0004
0.0004
(c) significance (units of σ , 10◦ smoothed) / iteration 20 60◦
30◦
300◦
240◦
180◦
120◦
60◦
−30◦
5.1. Large-Scale Anisotropy Whereas the full transition matrix K cannot be inverted, we can attempt an approximate reconstruction of the low-` anisotropy via a truncation of the multipole expansion after a maximum `. The corresponding truncated matrix K0 is then invertible. For instance, assuming a pure dipole anisotropy, ` ≤ 1, and a uniform sky coverage between declination δ1 and δ2 gives the dipole transition elements 1 0 K1010 = sin3 δ2 − sin3 δ1 , (18) 2 1 0 K1111 = 3(sin δ2 − sin δ1 ) + sin3 δ1 − sin3 δ2 , (19) 4 0 0 = K1111 . and K1-11-1 The dipole strength perpendicular to the Earth’s rotation axis can then be estimated as r p |a11 |2 + |a1-1 |2 3 b A⊥ = . (20) 0 4π K1111
For instance, in our MC simulation with a zenith cut of 60◦ and a latitude Φ ' 19◦ , we have δ1 = −41◦ and δ2 =
−60◦
-9
Equatorial
9
Figure 5. Panel (a): Reconstructed large-scale anisotropy from the solution of the matrix equation Eq. (17) truncated after ` = 3. Panel (b): Residual small-scale anisotropy after subtracting the map in panel (a) from the full reconstructed anisotropy Panel (c): Significance of the small-scale structure using Eq. (24). To distinguish excesses from deficits we multiply by the sign of the smoothed residual anisotropy δI small . 0 79◦ giving K1111 ' 0.92. Hence, the pseudo multipole moments of the projected dipole, a11 and a1-1 , have to be 0 ' 1.09 to recover corrected by a moderate factor 1/K1111 the true moments a ˆ11 and a ˆ1-1 . On the other hand, using the same zenith cut at the location of IceCube/IceTop 0 gives δ1 = −90◦ and δ2 = −30◦ , yielding K1111 ' 0.16 0 and a correction factor 1/K1111 ' 6.4. However, we emphasize that this treatment is only correct under the assumption that the true anisotropy is dominated by a dipole. In Fig. 5a, we show a map of the reconstructed large-scale anisotropy including dipole
Ahlers, BenZvi, Desiati, D´ıaz–V´elez, Fiorino & Westerhoff
5.2. Small-Scale Anisotropy The statistical significance of residual anisotropy features in the final reconstructed map is usually estimated using the method introduced by Li & Ma (1983) for applications in gamma-ray astronomy. A direct application of this method does not account for the optimization process of the time-dependent exposure. However, it is rather straightforward to generalize the method of Li & Ma (1983) to our case. We begin by dividing the reconstructed relative intensity into a contribution of large-scale features and smallscale features, I = I large + I small . For each pixel a in the celestial sky, we define expected on-source and off-source event counts in a disc of radius ψ centered on that pixel. Given the set of pixels Da , the observed and expected counts are X X nτ b , (21) na = b∈Da
µa,on =
b∈Da
µa,off =
τ
X X
(22)
Aτ b Nτ Iblarge .
(23)
exp
0
−6 10−6 Nevents = 5 × 1010 & θzenith < 60◦ 10−7
expected C` noise N sample C` pseudo C` estimated Cb`
10−8
10−9
10−10
10−11
5
10
15 20 multipole moment `
25
30
Figure 6. Multipole power spectra of the final iteration map of the example. The dashed line shows the model expectation of the input power and the open circles show the sampled power spectrum of the input anisotropy map. The open boxes are the pseudo power spectrum for the reconstructed anisotropy. The filled diamonds show the estimated power spectrum accounting for the noise level (dotted line) and the field of view using the methods described in the main text. The upper plot shows the relative scatter of the reconstructed power spectrum from the expected input power.
τ
X X b∈Da
Aτ b Nτ Ib ,
6 C` /C` − 1
(` = 1), quadrupole (` = 2), and octupole (` = 3) with a truncation of the matrix in Eq. (17) after ` = 3. Note that the model of Ahlers (2014) used for this simulation also assumes significant power at higher multipoles with ` > 3. The truncation after ` = 3 is hence not per se justified by this model. However, we can still use the reconstructed large-scale anisotropy map in Fig. 5a as a background model to define the small-scale anisotropy in the full anisotropy map via subtraction.
power spectrum
8
τ
The significance map (in units of Gaussian σ) is then calculated as 1/2 √ µa,on Sa = 2 −µa,on + µa,off + na log . (24) µa,off In Fig. 5b, we show the residual anisotropy map after subtraction of the large-scale anisotropy map shown in Fig. 5a. The corresponding significance map using (24) is shown in Fig. 5c. Usually, the significance is multiplied by the sign of the (smoothed) anisotropy (middle panel) to distinguish deficits (blue) from excesses (red), and we follow here the same convention. We can see that after the subtraction of large-scale features there is still significant power in small-scale features at a significance level of 8σ. Another approach to study small-scale structure is via the power spectrum (16), which quantifies the absolute amplitude of the multipole components but ignores their phases. We have already used the power spectrum to quantify the performance of our iterative method described in Section 2. We can also use this quantity in situations where the small-scale anisotropy structure is dominated by random scattering of cosmic rays in local magnetic fields (Giacinti & Sigl 2012; Ahlers 2014; Ahlers & Mertsch 2015; L´ opez-Barquero et al. 2015). As mentioned in the previous section, the true multipole spectrum cannot be recovered unambiguously from a partial sky coverage, and the same is true for the power
spectrum. However, in certain situations one can make additional assumptions about the ensemble-averaged expectation values of the multipole components. In the following, we will assume that the harmonic coefficients are Gaussian random fields, which in the ensemble-average b` i. In this particular case, we follow hb a`m b a∗op i = δmp δ`o hC b` i can recover the ensemble-averaged power spectrum hC via the relation X b`0 i + N` . hC` i = M``0 hC (25) `0
The transfer matrix M is known from the study of temperature anisotropies in the cosmic microwave background (Efstathiou 2004). However, for our situation of cosmic-ray anisotropies we have again to account for the fact that the m = 0 moments are filtered out by the reconstruction. This leads to a modified expression for M that we discuss and provide in Appendix B. In Eq. (25), we have also introduced the noise power spectrum N` , which can in general be calculated from the relative intensity variance from the likelihood function (5). In the following, we will use an approximation that only depends on pixel-by-pixel Poisson noise, which gives a flat spectrum N` = N with 1 X wa2 ∆Ω2 P N ' . (26) 4π a τ nτ a In Fig. 6, we show the estimated power spectrum inferred from the relative intensity map of the last iteration step shown in Fig. 3d. The dashed line corre-
Maximum-Likelihood Technique for Reconstructing Cosmic-Ray Anisotropy sponds to the expected input power from which the input power spectrum (black data points) is sampled as a Gaussian random field. The sampling introduces a scatter of (∆C` )2 = 2C`2 /(2` + 1) around the expected input power (cosmic variance). The green data points show the pseudo power spectrum, not accounting for the weight function. The horizontal blue dotted line shows the noise level of Eq. (26). Note that the noise level scales as N ∝ 1/Ntot with the total number of events. The red data points are the best estimators of the true b` i. The variance of the pseudo and power spectrum hC estimated power spectra are given in Appendix B. 6. SUMMARY
In this paper, we have discussed a novel twodimensional cosmic-ray reconstruction method. It is based on a maximum likelihood analysis that provides implicit best-fit expressions for the relative intensity, relative detector acceptance, and background expectations. We have provided a detailed iterative method onhow the relative intensity can be reconstructed from these implicit maximum likelihood solutions. The performance of this likelihood-based method was studied via a simulated example, mimicking the position and performance of the HAWC observatory. In general, ground-based observatories are insensitive to cosmic-ray anisotropy variations across declination bands. In terms of a spherical harmonic expansion of the anisotropy in equatorial coordinates this corresponds to a filter of m = 0 multipole moments, which introduces a systematic underestimation of the observed anisotropy. In particular, the dipole anisotropy, which is a crucial observable for the study of cosmic ray diffusion in our Galaxy, can only be observed as a projection onto the celestial equator. This has important consequences for the interpretation of experimental dipole data. If the dipole orientation changes with rigidity the projected dipole can exhibit rigidity modulations introduced by the projection on top of a true rigidity dependence of the dipole amplitude. If the dipole vector aligns with the celestial poles these projection effects can lead to a drastic reduction of the observed dipole accompanied by a phase-flip. In addition, the limited integrated field of view of observatories affects the reconstruction of the multipole moments. As as consequence, even the m = 0 filtered anisotropy cannot be unambiguously reconstructed. This can introduce a large systematic uncertainty, in particular, for the low-` multipole moments of the anisotropy. We have discussed strategies how to account for this effect in the multipole reconstruction. We would like to conclude with a few remarks. In our anisotropy ansatz (2), we have not accounted for varia-
9
tions in the primary cosmic-ray flux during the observation period. The solar potential, with a variation on time scales of 11 years, only affects low-energy data and can be neglected for cosmic rays in the TeV to PeV energy range (Munakata et al. 2010). On the other hand, the relative motion of the Earth in the solar system is expected to be visible as a solar dipole (Compton & Getting 1935). This effect would be visible if the data is binned in terms of solar time. Indeed, a solar dipole anisotropy has been observed at the level of 10−4 in multi-TeV cosmic-ray data (Amenomori et al. 2004, 2006; Abdo et al. 2009; Abbasi et al. 2011, 2012; Bartoli et al. 2015). However, the solar dipole is expected to average out for the data binned in sidereal time as long as the observation period covers an integer number of full years. The method presented in this paper is designed for midlatitude observatories that are exposed to different parts of the celestial sky as the Earth rotates. The method is also applicable to the case of IceCube/IceTop located at the South Pole. However, for these observatories, the instantaneous field of view is identical to the time-integrated one. As a result, the iteration does not improve the estimates from direct integration or time scrambling. On the other hand, the special location in combination with a limited field of view makes these observatories particularly insensitive to low-` multipoles due to projection effects of the anisotropy discussed in the text. In this paper, we have only considered the case of the data analysis of individual observatories. A great advantage of the likelihood-based reconstruction method is the straightforward generalization to combined anisotropy studies of data sets from multiple observatories with overlapping integrated field of view, see e.g. (D´ıaz–V´elez, J. C. 2015). The expectation value (4) can then be simply generalized to a sum over data sets with individual detector exposures but same anisotropy, as long as the rigidity distributions of the data sets are very similar. We would also like to emphasize that cosmic-ray observations via satellites like Fermi could have an advantage since the observatory can be tilted during the observation. In this case, it is possible to break the degeneracy between local acceptance effects and cosmicray anisotropy. In principle, it is then possible to have a full reconstruction of the anisotropy without projection effects, provided that additional systematic effects are under control. Acknowledgements. The authors acknowledge support by the National Science Foundation (PHY-1306465, PHY-1306958, and PHY-1308033), by the Department of Energy (DE-SC0008475) and by the Wisconsin Alumni Research Foundation.
APPENDIX A. GENERALIZATION OF THE EXPOSURE ANSATZ
We can generalize the ansatz (1) by expressing the total accumulated exposure E as a sum over disjoint sky sectors, whose union covers the entire field of view. As before, we assume that the exposure in each sector can be expressed as a product of its angular-integrated exposure E s and relative acceptance in terms of azimuth ϕ and zenith angle θ as E(t, ϕ, θ) '
X sector s
E s (t)As (ϕ, θ) .
(A1)
10
Ahlers, BenZvi, Desiati, D´ıaz–V´elez, Fiorino & Westerhoff
The partition of the field of view into multiple sectors is at this stage arbitrary and should in general be guided by the property of the data. For instance, the Equal-Zenith-Angle Method divides the sky into ring-like sectors with limited zenith angle range, i.e., we have sector weight functions defined by ws (θ) = 1 if θs < θ < θs+1 and otherwise ws (θ) = 0. With this new ansatz, the best-fit relative acceptance and background rate of the null hypothesis become .X X X s (0) Nτs (0) = wis nτ i , Ai = wis nτ i wjs nκj . (A2) τ
i
κj
wis
Here, we have introduced the weight function of the sector s which is equal to 1 if the pixel i is located in the sector and 0 otherwise. The maximum of the signal hypothesis now obeys the implicit relation .X .X .X X X X ? Ia? = nτ a Asκa? Nκs ? , Nτs ? = wis nτ i Asj ? Iτ?j , Asi ? = wis nτ i Nκs ? Iκi . (A3) τ
sκ
i
j
τ
κ
B. POWER SPECTRUM ESTIMATOR AND VARIANCE
In the following, we will assume that the weight function is azimuthally symmetric. In this case, the transfer function m is block-diagonal K`m`0 m0 = δmm0 T`` 0 (w) with block elements defined via a sum over Wigner-3j coefficients, r 0 `+` X (2` + 1)(2`0 + 1)(2k + 1) ` `0 k ` `0 k m m (B1) T``0 (b) = (−1) bk0 m −m 0 . 0 0 0 4π k=|`−`0 |
For the ensemble-averaged multipole moments, we can evaluate the transfer matrix to 2 2`0 + 1 X [T 0 0 (w)]2 ` `0 k 0 M`` = (2k + 1)Wk 0 0 0 − `` . 4π 2` + 1
(B2)
k
Note that the unfamiliar last term in the previous equation accounts for the projection of the pseudo angular momentum onto m 6= 0 terms. 1 2 3 The variance can be expressed as h∆C` ∆C`0 i = V`` 0 + V``0 + V``0 with X XX 2 1 m m m bk ihC bk0 iTk` V`` hC (w)T`m (B3) 0 = 0 k (w)T`k 0 (w)Tk 0 `0 (w) , (2` + 1)(2`0 + 1) 0 m6=0 k
k
XX 2 m m m bk i [T m0 (u)T m (w)T m hC (B4) = `` k` `0 k (w) + T`0 ` (u)Tk`0 (w)T`k (w)] , 0 (2` + 1)(2` + 1) m6=0 k 2 1 X ` `0 k 3 V`` (2k + 1)Uk 0 0 0 , (B5) 0 = 2π k P where u`m is the multipole coefficient of the distribution ∆Ωwa2 / τ nτ a and U` the corresponding power spectrum. Since the variance matrix of C` is the same as for C` − N , we can express the variance of the true spectrum as 2 V`` 0
b` ∆C b`0 i = M −1 M −1 0 h∆C `k `0 k0 h∆Ck ∆Ck i . REFERENCES Aartsen, M. G., et al. 2013, Astrophys. J., 765, 55 —. 2014, Braz. J. Phys., 44, 415 Abbasi, R., et al. 2011, Astrophys.J., 740, 16 —. 2012, Astrophys. J., 746, 33 Abdo, A., Allen, B., Aune, T., et al. 2008, Phys.Rev.Lett., 101, 221101 —. 2009, Astrophys.J., 698, 2121 Abeysekara, A. U., et al. 2014, Astrophys. J., 796, 108 Ahlers, M. 2014, Phys.Rev.Lett., 112, 021101 Ahlers, M., & Mertsch, P. 2015, Astrophys.J.Lett., 815, L2 Alexandreas, D. E., Berley, D., Biller, S., et al. 1993, Nucl. Instrum. Meth. A, 328, 570 Amenomori, M., et al. 2004, Phys. Rev. Lett., 93, 061101 —. 2005, Astrophys. J., 633, 1005 —. 2006, Science, 314, 439 —. 2010, Astrophys. J., 711, 119 —. 2012, Astropart. Phys., 36, 237 Atkins, R. W., et al. 2003, Astrophys.J., 595, 803 Bartoli, B., et al. 2013, Phys.Rev., D88, 082001 —. 2015, Astrophys. J., 809, 90 Battaner, E., Castellano, J., & Masip, M. 2011, A&A, 527, A79 —. 2015, Astrophys.J., 799, 157 Biermann, P., Becker, J., Seo, E.-S., & Mandelartz, M. 2013, Astrophys.J., 768, 124 Blasi, P., & Amato, E. 2012, JCAP, 1201, 011
(B6)
Maximum-Likelihood Technique for Reconstructing Cosmic-Ray Anisotropy Bonino, R., Alekseenko, V. V., Deligny, O., et al. 2011, Astrophys. J., 738, 67 Compton, A. H., & Getting, I. A. 1935, Physical Review, 47, 817 Cui, S. W., & Yan, C. T. 2003, in Proceedings of ICRC 2003 (Tokyo, Japan: Universal Academy Press) Denton, P. B., & Weiler, T. J. 2015, Astrophys. J., 802, 25 Desiati, P., & Lazarian, A. 2013, Astrophys.J., 762, 44 Di Sciascio, G., & Iuppa, R. 2014, arXiv:1407.2144 D´ıaz–V´ elez, J. C. 2015, in Proceedings of the ICRC 2015, arXiv:1510.04134 Drury, L. 2013, arXiv:1305.6752 Drury, L., & Aharonian, F. 2008, Astropart.Phys., 29, 420 Efstathiou, G. 2004, Mon. Not. Roy. Astron. Soc., 349, 603 Erlykin, A. D., & Wolfendale, A. 2006, Astropart.Phys., 25, 183 Giacinti, G., & Sigl, G. 2012, Phys.Rev.Lett., 109, 071101 Gorski, K., Hivon, E., Banday, A., et al. 2005, Astrophys.J., 622, 759 Guillian, G., et al. 2007, Phys.Rev., D75, 062003 Iuppa, R., & Di Sciascio, G. 2013, Astrophys. J., 766, 9286 Kotera, K., Perez-Garcia, M. A., & Silk, J. 2013, Phys.Lett., B725, 196 Lazarian, A., & Desiati, P. 2010, Astrophys.J., 722, 188 Li, T. P., & Ma, Y. Q. 1983, Astrophys. J., 272, 317 L´ opez-Barquero, V., Farber, R., Xu, S., Desiati, P., & Lazarian, A. 2015, arXiv:1509.00892 Malkov, M., Diamond, P., Drury, L., & Sagdeev, R. 2010, Astrophys.J., 721, 750 Mertsch, P., & Funk, S. 2015, Phys.Rev.Lett., 114, 021101 Munakata, K., Mizoguchi, Y., Kato, C., et al. 2010, Astrophys. J., 712, 1100 Perez-Garcia, M. A., Kotera, K., & Silk, J. 2014, Nucl.Instrum.Meth., A742, 237 Ptuskin, V. S., Jones, F. C., Seo, E. S., & Sina, R. 2006, Adv. Space Res., 37, 1909 Salvati, M., & Sacco, B. 2008, Astron.Astrophys., 485, 527
11