LOCAL SAMPLING AND APPROXIMATION OF

LOCAL SAMPLING AND APPROXIMATION OF OPERATORS WITH BANDLIMITED KOHN-NIRENBERG SYMBOLS ¨ FELIX KRAHMER AND GOTZ E. PFANDER

Abstract. Recent sampling theorems allow for the recovery of operators with bandlimited Kohn-Nirenberg symbols from their response to a single discretely supported identifier signal. The available results are inherently non-local. For example, we show that in order to recover a bandlimited operator precisely, the identifier cannot decay in time nor in frequency. Moreover, a concept of local and discrete representation is missing from the theory. In this paper, we develop tools that address these shortcomings. We show that to obtain a local approximation of an operator, it is sufficient to test the operator on a truncated and mollified delta train, that is, on a compactly supported Schwarz class function. To compute the operator numerically, discrete measurements can be obtained from the response function which are localized in the sense that a local selection of the values yields a local approximation of the operator. Further, we exhibit that the derived mesurements allow for signal processing methods based on local features, such as coarse quantization. Central to our analysis is to conceptualize the meaning of localization for operators with bandlimited Kohn-Nirenberg symbol. Keywords. Operator identification, channel measurement, local approximation, pseudodifferential operators, Sigma-Delta quantization. 2010 Mathematics Subject Classification. 47G30, 94A20.

Primary 41A35, 94A20; Secondary 42B35, 47B35,

Contents 1. Introduction 2. Main Results 3. Bandlimited pseudodifferential operators and operator Paley-Wiener spaces 4. Local approximation of bandlimited operators 5. Operator identification using localized identifiers 6. Reconstruction of bandlimited operators from discrete measurements 7. Quantization of bandlimited operators Acknowledgments References

1 3 9 10 14 18 24 26 26

1. Introduction In communications engineering, the effect of a slowly time-varying communication channel is commonly modeled as superposition of translations (time shifts due to multipath propagation) and modulations (frequency shifts caused by Doppler effects). In order to recover transmitted signals from their channel outputs, precise knowledge of the nature of the channel is required. A common procedure for channel identification in this sense is to periodically send short duration test signals. Felix Krahmer is with the University of G¨ ottingen, Institute for Numerical and Applied Mathematics, Lotzestraße 16-18, 37083 G¨ ottingen, Germany, Tel.: +49 551 39 10584, Fax: +49 551 39 3944, [email protected]. G¨ otz E. Pfander is with Jacobs University Bremen, School of Engineering and Science, Campus Ring 12, 28759 Bremen, Germany. Tel.: +49 421 200 3211, Fax: +49 421 200 49 3211, [email protected]. Date: November 26, 2012. 1

The resulting outputs are then used to estimate channel parameters which allow for an inversion of the operator [14, 2, 15, 25, 1, 13]. Kailath [14] and Bello [2] analyzed the identifiability of such channels. In mathematical terms, these channels are characterized by bandlimited Kohn-Nirenberg symbols and the channel identification problem becomes an operator identification problem: can an operator with bandlimited Kohn-Nirenberg symbol be identified from the output corresponding to a given test input signal? Kozek and Pfander [15], and Pfander and Walnut [25] gave mathematical proof of the assertions by Kailath and Bello that a suitable test signal exists as long as the band support of the symbol of the operator has outer Jordan content less than one. The suggested test signals are periodically weighted regularly spaced Dirac-delta distributions. In [22], Pfander coined the term operator sampling as the resulting theory has many direct parallels to the sampling theory for bandlimited functions. For example, an operator sampling reconstruction formula was established which generalizes the reconstruction formula in the classical sampling theorem for bandlimited functions (see [22] and Theorem 2.2 below). The operator sampling results in [15, 25, 22, 24] rely on using test functions as those described above. These decay neither in time nor in frequency and cannot be realized in practice. In this paper, we show that indeed, for stable identification of operator classes defined by a bandlimitation of the Kohn-Nirenberg symbol, test signals that lack decay in time and frequency are necessary. When seeking to recover only the operator’s action on a time-frequency localized subspace, then this ideal but impractical signal can be replaced with a mollified and truncated copy; the test signal can thereby be chosen to be a compactly supported Schwartz function as shown below. Furthermore, an important difference to the sampling theory for bandlimited functions is that the response to a test signal in operator sampling is a square-integrable function rather than a discrete set of sample values. Of course, one can obtain a discrete representation using any basis or frame of the range space of square integrable functions, but the question remains which of the multitude of commonly considered representations allow to recover the operator most efficiently. In the case of a bandlimited function, one feature that distinguishes the representation by samples is locality: a sample is the function value at a given location; due to the smoothness of bandlimited functions it represents the function in the neighborhood of the sampling point. This feature has many fundamental approximation theoretic consequences. First, it allows to approximate the function in a given region using only samples taken in a fixed-size neighborhood of it. Second, locality is a key ingredient for many signal processing methods. Examples are coarse quantization schemes such as Sigma-Delta modulation, where the quantization accuracy depends on the good control over finite differences between neighboring samples. In this paper we develop discrete representations of operators with bandlimited Kohn-Nirenberg symbols that, on the one hand, can be computed in a direct and simple way from the output corresponding to a test signal and, on the other hand, have locality properties analogous to those we appreciate in the classical sampling theory. We work with the same concept of locality as in the localized sampling results mentioned above, namely, locality will be defined through the action of the operator on time-frequency localized functions. Combining the results of our analysis, we show that a suitable local discrete representation arises from time-frequency measurements of the output corresponding to a truncated and mollified weighted sum of Dirac delta distributions. Moreover, these discrete measurements allow for arbitrary oversampling rates, which allows to design coarse quantization schemes based on the resulting representations. The paper is organized as follows. We summarize our main results and put them in the context of previous work in Section 2. In Section 3 we recall operator sampling terminology in some detail. Section 4 provides results on local approximations of operators; in Section 5 we discuss identification using smooth and finite duration test signals. Section 6 uses Gabor frames to derive our novel discretization scheme for operators with bandlimited Kohn-Nirenberg symbols, and in Section 7 we use the resulting representations to devise a coarse quantization scheme for such operators. 2

2. Main Results Every bounded linear operator on L2 (R) has a formal Kohn-Nirenberg symbol representation Z (2.1) Hf (x) = σH (x, ξ) fb(ξ) e2πixξ dξ where σH is a tempered Rdistribution on R2 , the integral is understood to converge weakly, and here and in the following denotes integration over R. The Fourier transform fb is normalized by Z b Ff (ξ) = f (ξ) = f (t) e−2πitξ dt for integrable f . The space of bounded operators whose Kohn-Nirenberg symbols are bandlimited to a given set M — we will also use the shorthand terminology bandlimited operators — is called operator Paley-Wiener space1; it is denoted by OP W (M ) = {H ∈ L(L2 (R)) : supp Fs σH ⊆ M }, where the symplectic Fourier transform Fs is defined densely by ZZ (2.2) Fs σ(t, γ) = σ(x, ξ)e−2πi(xγ−tξ) dx dξ . The Kohn-Nirenberg symbol of an L2 -bounded operator with supp Fs σH compact is bounded. In fact, for some A, B > 0 we have, (2.3)

AkσH kL∞ (R) ≤ kHkL(L2 (R)) ≤ BkσH kL∞ (R) ,

for all H ∈ OP W (M ), where kHkL(L2 (R)) is the operator norm of H (Proposition 4.1 below). Certainly, if we have direct access to σH , then some of our approximation theoretic goals can be accomplished using classical two-dimensional sampling results applied to σH . In the model considered here, however, we do not have access to any of the values of the symbol σH of the operator H directly, but we must rely on the operator output Hw resulting from applying H to a single test input w. Due to stability consideration, we say that the linear space OP W (M ) is identifiable by w if for A, B > 0 we have (2.4)

AkHkL(L2 (R)) ≤ kHwkL2 (R) ≤ BkHkL(L2 (R)) ,

for all H ∈ OP W (M ) [15]. “Sampling” the operator means that the identifier w in (2.4) is a weighted sequence of Dirac delta distributions, that is, X w= ck δkT , k∈Z

where ck is an appropriately chosen periodic sequence [18, 25, 22]. A guiding paradigm in the sampling theory of operators is the direct analogy to sampling of bandlimited functions. To illustrate this analogy, we compare the classical sampling theorem (often credited to Cauchy, Kotelnikov, Shannon, and Whittaker, among others), Theorem 2.1, with the corresponding result for operators, Theorem 2.2 [22]. Note that Theorem 2.1 formally follows from Theorem 2.2 by choosing the operator H in Theorem 2.2 to be the pointwise multiplication operator f 7→ σ · f [22]. The engineering intuition underlying sampling theorems is that reducing a function to periodic samples at a rate of 1/T samples per unit interval corresponds to a periodization with shift 1/T in 1In general terms, operator Paley-Wiener spaces are defined by requiring its members to have bandlimited KohnNirenberg symbol which are in a prescribed weighted and mixed Lp space [22]. For example, to restrict the attention to bandlimited Hilbert-Schmidt operators, we would consider only operators with square integrable symbols. These form a subset of the operators considered in this paper. 3

frequency space [20]. Thus, as long as T Ω ≤ 1, a function bandlimited to [− Ω2 , Ω2 ] can be recovered via a convolution with a low-pass kernel, that is, a function φ that satisfies ( 1/Ω, if |ξ| ≤ Ω2 , b (2.5) φ(ξ) = 1 0, if |ξ| ≥ 2T . ) If T Ω = 1, the only such function is φ(t) = sinc(πt/T ) = sin(πt/T πt/T . For T Ω < 1, there are many such functions; in particular φ in the Schwartz class is possible. With this notion, the classical sampling theorem reads as follows.

Theorem 2.1. For g ∈ L2 (R) with supp Fg ⊆ [− Ω2 , Ω2 ] and T Ω ≤ 1, we have X (2.6) g(x) = g(nT ) φ(x − nT ) n∈Z

with uniform convergence and convergence in L2 (R). Here, φ is any low-pass kernel satisfying (2.5). Recall that every operator H on L2 (R) is in one-to-one correspondence with its kernel κH , that R is, for a unique tempered distribution κH , we have Hf (x) = κH (x, y) f (y) dy weakly. In the following, χA denotes the characteristic function of a set A. Theorem 2.2. [[22]] For H : L2 (R) −→ L2 (R) with σH ∈ L2 (R2 ), supp Fs σH ⊆ [0, T ]×[− Ω2 , Ω2 ], and T Ω ≤ 1, we have X X (2.7) κH (x + t, x) = χ[0,T ] (t) H δkT (t + nT ) φ(x − nT ), n∈Z 2

k∈Z

2

with convergence in L (R ) and uniform convergence in x for each t. Again, φ is any low-pass kernel satisfying (2.5). We point to an important difference between the applicability of Theorems 2.1 and 2.2: in Theorem 2.1, a bandlimitation to a large set [− Ω2 , Ω2 ] can be resolved by choosing a small T ; on the other side, Theorem 2.2 is not applicable if the bandlimiting set [0, T ]×[− Ω2 , Ω2 ] has area greater than one. Indeed, in [25, 23] the following is shown. P Theorem 2.3. OP W (M ) is identifiable in the sense of (2.4) with appropriate w = n∈Z cn δnT if M is compact with measure less than 1. If M is open and has area greater than 1, then exists no tempered distribution w identifying OP W (M ). Hence, it is necessary to restrict ourselves to operator Paley-Wiener spaces defined by compact sets M with Lebesgue measure one. For such spaces, one can extend Theorem 2.2 to the following. Theorem 2.4. [[24]] Let M be compact with Lebesgue measure less than one. Then exists P T, Ω > 0 with T Ω = L1 , L prime, δ > 0, and an L-periodic sequence (cn ) so that with w = n cn δnT , we have for H ∈ OP W (M ) (2.8)

κH (x + t, x) = LT

L−1 X

X r(t − kj T ) bjq Hw(t − (kj − q)T )φ(x + (kj − q)T ) e2πinj Ωx ,

j=0

q∈Z

where r, φ are Schwartz class functions that satisfy b r(t)φ(γ) = 0 if (t, γ) ∈ / (−δ, T + δ) × (−δ, Ω + δ), and (2.9)

X

r(t − kT ) ≡ 1 ≡

X

b − nΩ) . φ(γ

n∈Z

k∈Z 2

Moreover, (2.8) converges in L (R) and uniformly in t. 4

In both sampling scenarios, working with Schwartz class kernels r,φ is of advantage. Indeed, in the classical sampling theorem, the slow decay of the sinc function sin x /x in (2.6) implies that a small perturbation of just a few coefficients g(nT ) can lead to significant deviations of all values g(t) outside of the sampling grid T Z; this includes values achieved at locations far from the sampling points nT . Hence to approximately recover the function values locally, that is, on an interval [a, b], it does not suffice to know the function samples in a constant size neighborhood of that interval. When working with Schwartz class kernels, in contrast, such a local approximate reconstruction is possible; one has X (2.10) g(x) ≈ g(nT ) φ(x − nT ), n∈[a−d,b+d]

where the neighborhood size d does not depend on the interval [a, b]. A corresponding possibility of using local P information for local reconstruction is not given in Theorem 2.2. Moreover, the identifier w = n∈Z δnT neither decays in time or in frequency, clearly showing that in practice, this input signal is not usable. However, in the framework of Theorem 2.2, this is unavoidable, as the following theorem shows. Theorem 2.5. If the tempered distribution w identifies OP W ([0, T ]×[−Ω/2, Ω/2]), T Ω > 0, then w decays weakly neither in time nor in frequency, that is, we have neither x→±∞

hw, ϕ(· − x)i −→ 0

nor

ξ→±∞

hw, b ϕ(· − ξ)i −→ 0

for all Schwartz class functions ϕ. We address this problem by developing a concept of “local recovery” of an operator, in analogy to the local recovery of a function in (2.10). Indeed, the key to most results presented in this paper is to aim only for the recovery of the operator restricted to a set of functions “localized” on a prescribed set S in the time-frequency plane. This is indeed reasonable in communications where band and time constraints on transmitted signals are frequently present. In [13], for example, operators that map bandlimited input signals to finite duration output signals are considered. Bivariate Fourier series expansions of such an operator’s compactly supported Kohn-Nirenberg symbol allows the authors to discretize the a-priori continuous input-output relations (2.1) and (3.1). The definition of functions localized in time and frequency is based on Gabor frames. Their definition involves translation and modulation operators, Tt f : f 7→ f (· − t) and Mν : f 7→ e2πiν(·) f. These operators are unitary on L2 (R) and isomorphisms on all function and distribution spaces considered in this paper. For any g ∈ L2 (R) and a, b > 0, we say that the Gabor system (g, aZ × bZ) = {Tka M`b g}k,`∈Z is a tight frame for L2 (Rd ) if for some A > 0, we have X f =A hf, Tka M`b gi Tka M`b g k,`∈Z 2

d

for all f ∈ L (R ). Each coefficient in this expansion can be interpreted to reflect the local behavior of the function near the indexing point in time-frequency space. Hence, a natural way to define time-frequency localized functions is that all but certain expansion coefficients are small. Definition 2.6. Let (g, aZ × bZ), g ∈ S(R), be a tight frame for L2 (R) with frame bound 1. We say that f ∈ L2 (R) is –time-frequency localized on the set S if X X |hf, M`b Tka gi|2 ≥ (1 − 2 ) |hf, M`b Tka gi|2 . (ka,`b)∈R2

(ka,`b)∈S 5

Our first result states that a sufficient condition for two operators to approximately agree on functions –time-frequency localized on a set S is that their Kohn-Nirenberg symbols almost agree on a neighborhood of S. Below, B(r) denotes the Euclidean unit ball with radius r and center 0; the dimension be clear from the context. For brevity of notation, we set S − B(r) = c will always c 2 S + B(r) for S ⊆ R . Theorem 2.7. Fix M compact with µ(M ) < 1 and let (g, aZ × bZ), g ∈ S(R), be a tight frame for L2 (R) with frame bound 1. Then exists C > 0 and a strictly monotone function d : (0, 1) −→ R+ , e ∈ OP W (M ) satisfy on a set S ⊆ R2 the bounds lim→0 d() = +∞, with the property that if H, H kσH kL∞ (R2 ) , kσHe kL∞ (R2 ) ≤ µ

and

kσH − σHe kL∞ (S) ≤ µ,

then e kL2 (R) ≤ C µ kf kL2 (R) kHf − Hf for all f ∈ L2 (R) that are –time-frequency localized on S − B d() in the sense of Definition 2.6. This observation is a key ingredient P in the proof of our next main result. It concerns truncated and mollified versions of the identifier n cn δnT and provides localized versions of Theorems 2.2 and 2.4. For S = R2 , it reduces to Theorems 2.2 and 2.4. Theorem 2.8. Fix M compact with µ(M ) < 1 and let (g, aZ × bZ), g ∈ S(R), be a tight frame for L2 (R) with frame bound 1. Choose δ > 0 such that µ(M + [−3δ, 3δ]2 ) < 1. Then exists C > 0 and a strictly monotone function d : (0, 1) −→ R+ , lim→0 d() = +∞, with the following property: Let H ∈ OP W (M ) satisfy the bound kσH kL∞ (R2 ) ≤ µ and let S ⊆ I1 × I2 ⊆ R2 , where I1 and I2 may coincide with R. Furthermore, let X w e= cn ϕ(· − nT ), nT ∈I1

where the tempered distribution ϕ is chosen such that ϕ ≥ 0 and ϕ b ≡ 1 on I2 and define for e via H ∈ OP W (M ) the operator H (2.11) κHe (x + t, x) = LT

L−1 X

r(t − kj T )

X

j=0

bjq H w(t e − (kj − q)T )φ(x + (kj − q)T ) e2πinj Ωx ,

q∈Z

where r, φ are Schwartz class functions defined as in Theorem 2.4, but for the above choice of δ. For rectangular bandlimitation domains M = [0, T ]×[− Ω2 , Ω2 ] one can choose the identifier P e via the formula ϕ(· − nT ) and define H nT ∈I1

κHe (x + t, x) = T

X n∈Z

H

X

ϕ(· − nT ) (t + nT ) φ(x − nT ) .

nT ∈I1

In both cases one has e kL2 (R) ≤ C µ kf kL2 (R) kHf − Hf for all f ∈ L2 (R) that are –time-frequency localized on S − B d() in the sense of Definition 2.6. Note that this theorem is completely analogous to the condition (2.10) for localized function sampling. Due to the two-dimensional nature of the operator, however, localization is an issue in both time (restricting to a finite number of deltas) and frequency (replacing the deltas by approximate identities). If one is interested in localization only in time or only in frequency, one can choose one of the Ii to be R and thus consider X X w e= cn δnT or w e= cn ϕ(· − nT ), n

nT ∈I1

and with (cn ) ≡ 1 in case of rectangular domains M . 6

An additional important structural difference between classical sampling and operator sampling remains: in Theorems 2.2 and 2.8, the reconstruction formulas (2.7) and (2.8) involve as “coefficients” functions, not scalars. Among the many possibilities to discretely represent the operator’s response to the identifier w, we consider Gabor representations of this sample function. A timefrequency localized subset of the coefficients will then yield a corresponding local approximation of the operator. Theorem 2.9 below establishes a reconstruction formula based on Gabor coefficients that allows for the exact recovery of the operator; Theorem 2.10 shows that a local subset of the coefficients yields a local approximation of the operator. Again, one obtains considerably simpler formulas for rectangular domains, but for reasons of brevity, we focus on the comprehensive setup of arbitrary domains. For a Schwartz class function φ and a tempered distribution f on R we call Vφ f (x, ξ) = hf, Mξ Tx φi,

(2.12)

x, ξ ∈ R,

the short-time Fourier transform of f with respect to the window function φ. Throughout this paper, all pairings h·, ·i are taken to be linear in the first component and antilinear in the second. Theorem 2.9. For M compact with µ(M ) < 1 exists L prime, δ > 0, T, Ω > 0 with T Ω = 1/L, an L-periodic sequence (cn ), and a sequence {B jq }j=0,...,L−1,q∈Z (which is L periodic in q and P depends only on the sequence (cn )), such that with w = n cn δnT we have for H ∈ OP W (M ) (2.13) σH (x, ξ) =

L−1 mL `L LT X −2πi(xnj Ω+ξkj T ) 2πinj Ωkj T X (j) e e σm,` Vφ r x− + kj T, ξ − + nj Ω , β1 β2 j=0 β1 β2 m,`∈Z

where (j)

σm,` =

X

B jq φ (−q − kj − mL/β1 )T hHw, TqT M`ΩL/β2 ri,

q∈Z

and r, φ are Schwartz class functions such that r and φb are real valued and satisfy2 b (2.14) r(t) = 0 if t ∈ / (−δ, δ + T ), φ(γ) = 0 if γ ∈ / (−δ − Ω/2, δ + Ω/2), and (2.15)

X

|r(t + kT )|2 ≡ 1 ≡

X

b + nΩ)|2 , |φ(γ

n∈Z

k∈Z

with oversampling rates β2 ≥ 1 + 2δ/T and β1 ≥ 1 + 2δ/Ω.3 Observe that the reconstruction formulas given in Theorems 2.4 and 2.8 require r and φb to generate partitions of unity (2.15), while (2.9) above requires that their modulus squared form partitions of unity. Theorem 2.10. Fix M compact with µ(M ) < 1, let T, Ω, L and w, r, φ be defined in Theorem 2.4, and let (g, aZ × bZ), g ∈ S(R), be a tight frame for L2 (R) with frame bound 1. Then exists C > 0 and a strictly monotone function d : (0, 1) −→ R+ , lim→0 d() = +∞, with the following property: Let H ∈ OP W (M ) satisfy the bound kσH kL∞ (R2 ) ≤ µ, fix a not necessarily bounded set S ⊆ e via its symbol I1 × I2 in R2 , choose ϕ and w e as in Theorem 2.8, and define the operator H σ e(x, ξ) =

L−1 LT X −2πi(xnj Ω+ξkj T ) 2πinj Ωkj T e e β1 β2 j=0

X

(j)

σ em,` Vφ r(x−

(mLT /β1 ,`LΩ/β2 )∈S

mL `L +kj T, ξ− +nj Ω β1 β2

2For example, we can choose r = χ [0,T ) ∗ϕδ , where ϕδ is an approximate identity, that is, a non-negative function

R with ϕδ ∈ S(R), supp ϕδ ⊆ [−δ/2, δ/2], and ϕδ = 1. 3Then the Gabor systems {r b k,l = TkT M`/β2 T r}k,`∈Z , {TnΩ Mm/β1 Ω φ}m,n∈Z , and {Φm,−n,l,−k = T(mT L/β1 ,`LΩ) M(nΩ,/β2 ,kT ) }m,n,k,`∈Z are tight Gabor frames with A = β2 /T , A = β1 /Ω, and A = β1 β2 /(T Ω) = β1 β2 L, respectively, whenever β2 ≥ 1 + 2δ/T and β1 ≥ 1 + 2δ/Ω. 7

where (j)

σ em,` =

X

B jq φ (−q − kj − mL/β1 )T hH w, e TqT M`ΩL/β2 ri.

q∈Z

e satisfies Then H e kL2 (R) ≤ C ε µ kf kL2 (R) kHf − Hf for all f ∈ L2 (R) which are –time-frequency localized on S − B(D()) with respect to (g, aZ × bZ) in the sense of Definition 2.6. The discrete representations introduced in Theorems 2.9 and 2.10 resolve a fundamental conceptual difference between classical sampling and operator sampling. In contrast to classical sampling, which yields a set of separate function values, the contributions of the different Dirac-deltas in the operator sampling formula are combined in a single function and cannot easily be separated. Hence, while choosing a higher sampling rate in the function case yields more information, in the operator case, this additional information is mixed in an inseparable way. These aliasing effects [15] make it impossible to obtain redundant representations merely by oversampling in Theorem 2.2 or Theorem 2.4. In reconstruction formula (2.13), however, the oversampling parameters βi can be chosen arbitrarily, allowing for representations of arbitrarily large redundancy. The interplay of large redundancy and good local representation properties of the discrete coefficients allows for coarse quantization methods to be applied to operators. In the mathematical literature, the most common scenarios for such methods deal with frames in Rn [3, 4, 17] or the space of bounded bandlimited functions on R [6, 12, 7]. The underlying idea is to sample at a high rate and use the resulting redundancy to lower the number of bits needed to represent each sample. The following establishes a corresponding result for bandlimited operators. Theorem 2.11. Fix a compact set M with µ(M ) < 1 and the associated parameters L, Ω, and T as in Theorem 2.9. Then there are constants c, C such that for each H ∈ OP W (M ) satisfying (j) kσH kL∞ (R2 ) ≤ c and all oversampling rates β1 , β2 > 1, one can compute quantized values qm,` ∈ P {±1, ±3} directly from Hw with w = k ck δkT via a recursive procedure. These quantized values e with symbol give rise to an approximate reconstruction of H through the operator H σ ˜ (x, ξ) =

L−1 mL `L LT X −2πi(xnj Ω+ξkj T ) 2πinj Ωkj T X (j) e e qm,` Vφ r(x − + kj T, ξ − + nj Ω , β1 β2 j=0 β1 β2 m,`∈Z

which satisfies e kL2 (R) ≤ kHf − Hf

C kf kL2 (R) β 1 β2

for all f ∈ L2 (R). Combining all these results, we obtain a result about local reconstruction from local quantized values resulting from a localized identifier. Corollary 2.12. Fix a compact set M with µ(M ) < 1 and the associated parameters L, Ω, and T as in Theorem 2.9. Then there are constants c, C, and a strictly monotone function d : (0, 1) −→ R+ , lim→0 d() = +∞ such that for each H ∈ OP W (M ) satisfying kσH kL∞ (R2 ) ≤ c and all (j)

oversampling rates β1 , β2 > 1, one can compute quantized values qm,` ∈ {±1, ±3} from H w, e with w e as in Theorem 2.8, via a recursive procedure. Then using the finitely many quantized values with indices (m, `) satisfying (mLT /β1 , `LΩ/β2 ) ∈ S, one obtains a local approximate reconstruction of e with symbol H on S − d() through the operator H σ ˜ (x, ξ) =

L−1 LT X −2πi(xnj Ω+ξkj T ) 2πinj Ωkj T e e β1 β2 j=0

X

(j)

qm,` Vφ r(x−

(mLT /β1 ,`LΩ/β2 )∈S 8

mL `L +kj T, ξ− +nj Ω . β1 β2

This operator satisfies e kL2 (R) ≤ kHf − Hf

C kf kL2 (R) + C ε µkf kL2 (R) β1 β 2

for all f ∈ L2 (R) which are –time-frequency localized on S − B(D()) with respect to (g, aZ × bZ) in the sense of Definition 2.6. 3. Bandlimited pseudodifferential operators and operator Paley-Wiener spaces It is well known that every bounded linear operator H : S(Rd ) → S 0 (Rd ) is of the form Z Hf (x) = κ(x, t)f (t)dt for some κ ∈ S 0 (R2d ), where S(Rd ) is the Schwartz space, and S 0 (Rd ) is its dual, the space of tempered distributions [10]. This integral representation is understood in the weak sense, that is, hHf, gi = hκ, f ⊗gi for all f, g ∈ S(Rd ), where f ⊗g(x, y) = f (x)g(y) and h · , · i is the sesquilinear pairing between S and S 0 functions. Each such operator has a spreading function representation ZZ (3.1) Hf = η(t, γ)Mγ Tt f dt dγ, a time-varying impulse response representation Z Hf (x) = h(x, t)f (x − t)dt, and a Kohn-Nirenberg symbol representation Z Hf (x) = σ(x, ξ)fb(ξ) e2πixξ dξ. We write Hσ and σH , ηH , κH when it is necessary to emphasize the correspondence between H and σ, η, κ. The symbols σ and η are related via the symplectic Fourier transform Fs defined in (2.2), that is, σ = Fs η. For convenience, we use the symbol η H (t, γ) = e2πiγt ηH (t, γ), and denote its symplectic Fourier transform by ZZ σ H (x, ξ) = η H (t, γ) e2πi(xγ−tξ) dt dγ. The relationship between σH and σ H is given by (3.2)

σH ∗ = σ H ,

where H ∗ denotes the adjoint of H. Indeed, we have for Schwartz functions f, g ZZ ∗ hH f, gi = hf, Hgi = hf (x), ηH (t, ν)Mν Tt g(x) dt dνi ZZ = ηH (t, ν)hf, Mν Tt gi dt dν ZZ = ηH (t, ν)hT−t M−ν f, gi dt dν ZZ =h ηH (−t, −ν)Tt Mν f dt dν, gi ZZ =h ηH (−t, −ν)e−2πitν Mν Tt f dt dν, gi 9

and conclude using a density argument that ηH ∗ (t, ν) = ηH (−t, −ν) e−2πitν . Hence, ZZ σH ∗ (x, ξ) = Fs ηH ∗ (x, ξ) = ηH (−t, −ν)e−2πitν e−2πitξ e2πixν dt dν ZZ ηH (−t, −ν)e2πitν e2πitξ e−2πixν dt dν = ZZ = ηH (t, ν)e2πitν e−2πitξ e2πixν dt dν = Fs η H (x, ξ) = σ H (x, ξ). To prove our results, we shall frequently transition from σ to σ. This does not cause a problem in our analysis since (2.3) combined with kHkL(L2 (R)) = kH ∗ kL(L2 (R)) shows that for M ⊆ R2 compact exist A, B > 0 with AkσkL∞ (R2 ) ≤ kσkL∞ (R2 ) ≤ BkσkL∞ (R2 )

(3.3) for all Hσ ∈ OP W (M ).

4. Local approximation of bandlimited operators In this section we show that a local approximation of an operator’s symbol always yield a local approximation of the operator in the sense of Definition 2.6. The given results are of general interest and will be stated in more general terms than other results in this paper. This does not increase the difficulty of proof, but necessitates to recall additional terminology from time-frequency analysis. For that, recall that for any full rank lattice Λ = AZ2d ⊆ R2d , det A 6= 0, `p (Λ) denotes the set of sequences (cλ )λ∈Λ for which X 1/p |cλ |p < ∞. kck`p (Λ) = λ∈Λ

A time-frequency shift by λ = (t, ν) ∈ Λ is denoted by π(λ) = Mν Tt and in the following we will consider Gabor systems of the form (g, Λ) = {π(λ)g}λ∈Λ . Among the many equivalent definitions of modulation spaces, we choose the following. Let g0 (x) = e−kxk and 1 ≤ p ≤ ∞. Then (4.1)

M p (Rd ) = {f ∈ S 0 (Rd ) : kf kM p (Rd ) = k(hf, π(λ)g0 i)λ k`p ( 21 Z2d ) < ∞ }

(see, for example, [11, 8]). In the following we shall use the fact that whenever (g, Λ) is a tight L2 -Gabor frame (see below for a precise definition) with g ∈ M 1 (Rd ) then replacing the L2 -Gabor frame (g0 , 12 Z2d ) in (4.1) with (g, Λ) leads to an equivalent norm on M p (Rd ) [11]. That is, there exist positive constants A and B with X (4.2) Akf kpM p (Rd ) ≤ |hf, π(λ)gi|p ≤ Bkf kpM p (Rd ) , f ∈ M p (Rd ) λ∈Λ

if 1 ≤ p < ∞ and Akf kM ∞ (Rd ) ≤ sup |hf, π(λ)gi| ≤ Bkf kM ∞ (Rd ) ,

f ∈ M ∞ (Rd )

λ∈Λ

if p = ∞. In either case, we call (g, Λ) an `p -frame with lower frame bound A and upper frame bound B. If we can choose A = B in case of p = 2 then we call (g, Λ) a tight Gabor frame. The norm equivalence (2.3) follows from the following result since M 2 (R) = L2 (R). Theorem 4.1. Let 1 ≤ p ≤ ∞ and M compact. Then exist positiv constants A = A(M, p) and B = B(M, p) with A kσH kL∞ (R2 ) ≤ kHkL(M p (R)) ≤ B kσH kL∞ (R2 ) , 10

H ∈ OP W (M ).

Proof. Theorem 2.7 in [22] (see for example the proof of Theorem 3.3 in [22]) provides C = C(M, p) with kHf kM p (R) ≤ C kσH kL∞ (R2 ) kf kM p (R) for all H ∈ OP W (M ). This establishes the existence of B = B(M, p) above. In addition, we shall use the following facts. In [9, 11] it is shown that the operator norm of an operator mapping the modulation space M 1 (R) into its dual M ∞ (R) is equivalent to the M ∞ (R2 ) norm of its kernel κ, which can easily shown to be equivalent to the M ∞ (R2 ) norm of the time-varying impulse response h. Moreover, we use the fact that M ∞ (R2 ) is invariant under Fourier transforms (in some or all variables) and that the M ∞ (R2 ) norm can be replaced by the L∞ (R2 ) norm if we restrict ourselves to functions bandlimited to a fixed set M [19, 22]. Last but not least, we use that the identity map embedding M p (R) into M q (R), p ≤ q, is bounded. Writing . to express that A ≤ CB for some constant C depending only on the support M and A B to denote equivalence in norms, i.e., A . B and B . A, we obtain for all H ∈ OP W (M ) kσH kL∞ (R2 ) kσH kM ∞ (R2 ) khH kM ∞ (R2 ) kκH kM ∞ (R2 ) kHkL(M 1 (R),M ∞ (R)) . kHkL(M p (R)) and the result follows.

We proceed to prove the following generalization of Theorem 2.7. Indeed, the earlier stated result follows again from the fact that L2 (R) = M 2 (R). We focus on the case of arbitrary domains; a simpler proof for rectangular domains can be obtained using Theorem 2.2 instead of Theorem 2.4. Theorem 4.2. Fix M compact and p ∈ [1, ∞]. Let (g, Λ), g ∈ M 1 (R), be a tight frame for L2 (R) with frame constant 1.Then exists a constant C and a strictly monotone function d : (0, 1) −→ R+ , lim→0 d() = +∞, with the property that if H ∈ OP W (M ) satisfies kσH kL∞ (R2 ) ≤ µ

and

kσH kL∞ (S) ≤ µ,

then kHf kM p (R) ≤ C µ kf kM p (R) c for all f ∈ M p (R) time-frequency localized on S − B d() = S c + B d() in the sense that, for p < ∞, X X |hf, π(λ)gi|p ≥ (1 − p ) |hf, π(λ)gi|p , λ∈Λ

λ∈Λ∩(S−B(d()))

or, for p = ∞, sup |hf, π(λ)gi|, λ ∈ Λ ∩ S − B(d()) } ≥ (1 − ) sup |hf, π(λ)gi|, λ ∈ Λ . Proof. Step 1. Preliminary observations and choice of auxiliary objects. R φ ∈ S(R2 ) with φ(x) dx = 1 and supp φ ⊆ [− 21 , 12 ]2 . Recall that

Choose a nonnegative

Λ⊥ = {µ ∈ R2 : e2πihµ,λi = 1 for all λ ∈ Λ} e be a lattice containing Λ with the property that is called dual lattice of the lattice Λ in R2 . Let Λ e ⊥ which contains M + [− 1 , 1 ]2 . Set there exists a compact and convex fundamental domain D of Λ 2 2

σP = kχD ∗ φk−1 L2 (R2 ) F(χD ∗ φ) and, using the sampling theorem for lattices in Rn [21, 11], we obtain for all H ∈ OP W (M ) X σH = σH (λ) Tλ σP e λ∈Λ

and hence (4.3)

H=

X

σH (λ) π(λ)P π(λ)∗ .

e λ∈Λ 11

As explained above, the fact that (g, Λ) is a Gabor frame in L2 (R) with g ∈ M 1 (R), implies that it is also an `p -frame for M p (R) and there exists C1 , C2 > 0 with (4.4)

kf kM p (R) ≤ C1 k{hf, π(λ)gi}λ∈Λ k`p (Λ) ≤ C1 C2 kf kM p (R) ,

f ∈ M p (R).

As the synthesis map is the adjoint of the analysis map, we also have

X

(4.5) cλ π(λ)g M p (R) ≤ C2 k{cλ }λ∈Λ k`p (Λ) . λ∈Λ

e = Sn (Λ+µ` ), where n depends only on M and (g, Λ). It is For some µ1 , µ2 , . . . , µn , we have Λ `=1 easily seen that (g, Λ+µ` ), ` = 1, . . . , n, also satisfies (4.4) and (4.5). Setting ge = n−1/2 g ∈ M 1 (R), e is a tight frame for L2 (R) with frame bounds equal 1 we conclude that the Gabor system (e g , Λ) p p and an ` -frame with for M (R) with 1

1

1

1

e g i}e e k p e kf kM p (R) ≤ C1 n 2 − p k{hf, π(λ)e λ∈Λ ` (Λ) 1

1

≤ C1 n 2 − p C2 n p − 2 kf kM p (R) = C1 C2 kf kM p (R) ,

f ∈ M p (R).

We claim that n o e g i ∈ l 1 (Λ e × Λ) e . hP π(λ)e g , π(λ)e To see this, recall that σP ∈ S(R2 ) ⊆ M 1 (R2 ), and, hence, σ eP given by σP (x, ξ) e2πixξ is in M 1 (R2 ) 2πixξ as e is a Fourier multiplier and hence also a time multiplier for M 1 (R2 ) (Theorem 11, [5]). A e = (e direct computation implies that for λ = (t, ν) and λ t, νe) we have ZZ \ |hP π(t, ν)e g , π(e t, νe)e g i| = σP (x, ξ) e2πixξ M e(ξ)MνeTet ge(x) dξ dx ν Tt g ZZ σP (x, ξ) e2πixξ M−t Tν b = ge(ξ)MνeTet ge(x) dξ dx = he σP , M(eν ,t) T(et,ν) ge⊗b gei . 1 2 b e Λ) e e Λ) e since σf e, Λ× Equation (4.2) implies that the right hand side is in `1 (Λ× P ∈ M (R ) and (g⊗g is a Gabor frame with window ge⊗b ge in M 1 (R2 ). Fix > 0 and choose d() > 0 so that X X e g i < . g , π(λ)e hP π(λ)e c e Λ e e λ∈Λ∩B(d()) λ∈

e = hP π(λ)e e g i if λ ∈ Λ e ∩ B(d())c and 0 else. Now, set A(λ, λ) g , π(λ)e Step 2. Decomposing Hf as Hf = Hin fin + Hout fin + Hfout . Set e in = Λ e ∩ S − B(d()) , Λout = Λ \ Λin , Λin = Λ ∩ S − B(d()) , Λ

e out = Λ e \Λ e in . Λ

Let fin =

X

hf, π(λ)gi π(λ)g =

λ∈Λin

where cλ =

√

X

cλ π(λ)e g,

fout = f − fin ,

e in λ∈Λ

n hf, π(λ)gi if λ ∈ Λ and 0 else. Similarly, inspired by (4.3), we set for H ∈ OP W (M ) X Hin = σH (λ) π(λ)P π(λ)∗ , Hout = H − Hin e λ∈Λ∩S

and note that Hin , Hout ∈ OP W (D + [− 21 , 12 ]2 ). 12

e in and Λ e ∩ S c by d() to compute Step 3. Bounding kHout fin kM p (R) . We use the separation of Λ X X e g i e g i| = h σH (ν)π(ν)P π(ν)∗ cλ π(λ)e g , π(λ)e |hHout fin , π(λ)e c e ν∈Λ∩S

X

≤

e λ∈Λ

|σH (ν)|

c e ν∈Λ∩S

X

≤

X

|σH (ν)|

X

in e g i |cλ | hπ(ν)P π(ν)∗ π(λ)e g , π(λ)e

e in λ∈Λ

c e ν∈Λ∩S

≤

X

e − ν)e g , π(λ g i |cλ | hP π(λ − ν)e

e in λ∈Λ

X

|σH (ν)|

c e ν∈Λ∩S

e − ν) |cλ | A(λ − ν, λ

e in λ∈Λ

≤ kσH kL∞ (R2 )

X X

e − ν). |cλ | A(λ − ν, λ

e λ∈Λ e ν∈Λ

e 1/p + 1/q = 1, we conclude For every sequence {dλ } ∈ `q (Λ), e g i}e e , {de }e e i h{hHout fin , π(λ)e λ∈Λ λ λ∈Λ XX X e − ν) |de | |cλ | A(λ − ν, λ ≤ kσH kL∞ (R2 ) λ e Λ e λ∈Λ e e ν∈Λ λ∈

XXX

= kσH kL∞ (R2 )

e |de | |cλ+ν | A(λ, λ) λ+ν

e λ∈ e Λ e e ν∈Λ λ∈Λ

≤ kσH kL∞ (R2 ) k{cλ }k`p (Λ) e k{dλ }k`q (Λ) e

XX

e A(λ, λ)

e λ∈ e Λ e λ∈Λ

and 1 1 e g i|}k p e kHout fin kM p (R) ≤ n 2 − p C1 k{|hHout fin , π(λ)e ` (Λ) XX 1 1 −p e 2 C1 kσH kL∞ (R2 ) k{cλ }k`p (Λ) A(λ, λ) ≤n e

e λ∈ e Λ e λ∈Λ

≤n ≤n

1 1 2−p 1 1− p

1 2

C1 µ k{n hf, π(λ)gi}k`p (Λ) C1 C2 µ kf kM p (R) .

Step 4. Bounding kHfout kM p (R) . By Proposition 4.1 we have kHfout kM p (R) ≤ B(M, p) kσH kL∞ (R2 ) kfout kM p (R) . By hypothesis, for p < ∞ we have X

kfout kpM p (R) = k

hf, π(λ)giπ(λ)gkpM p (R)

λ∈Λout

≤ C2p

X

|hf, π(λ)gi|p

λ∈Λout

X

≤

C2p p

≤

λ∈Λ 2p p C2 kf kpM p (R) 13

|hf, π(λ)gi|p ,

and for p = ∞ we have kfout kM ∞ (R) = k

X

hf, π(λ)giπ(λ)gkM ∞ (R)

λ∈Λout

≤ C2 khf, π(λ)gik`∞ (Λout ) ≤ C2 khf, π(λ)gik`∞ (Λ) ≤ C22 kf kM ∞ (R) . We conclude kHfout kM p (R) ≤ B(M, p) C2 kσH kL∞ kf kM p (R) ≤ B(M, p) C2 µ kf kM p (R) .

Step 5. Bounding kHin fin kM p (R) .

Since σP ∈ S(R2 ), the operator

`∞ (Λ) → L∞ (R2 ),

{cλ } 7→

X

cλ Tλ σP

e λ∈Λ

is bounded, say with operator norm bound C3 . Then, Proposition 4.1 implies kHin fin kM p (R) ≤ B(D+[− 21 , 12 ]2 , p) kσHin kL∞ (R2 ) kfin kM p (R) ≤ B(D+[− 12 , 12 ]2 , p) C3 k{σH (λ)}k`∞ (Λ∩S) (1 + )kf kM p (R) e ≤ 2 B(D+[− 21 , 12 ]2 , p) C3 µ kf kM p (R) . Since all constants are independent of , µ, H, and f , we summarize kHf kM p (R) = kHin fin + Hout fin + Hfout kM p (R) ≤ C µ kf kM p (R) .

5. Operator identification using localized identifiers This section analyzes identifiers that are localized in time and frequency. Theorem 2.5 shows that such functions cannot serve as an identifier for the complete operator Paley-Wiener space as a whole. Proof of Theorem 2.5. Let r 6= 0 be a Schwartz function with supp r ⊆ [0, T ] and φ 6= 0 be a Schwartz function with supp φb ⊆ [−Ω/2, Ω/2]. Let Hn be defined via its kernel κn (x, y) = R b φ(x − n)r(x − y), so hn (x, t) = φ(x − n)r(t) and ηn (t, ν) = hn (x, t)e−2πixν dx = r(t)e2πinν φ(ν), so Hn ∈ OP W ([0, T ]×[−Ω/2, Ω/2]) with kσHn kL∞ (R2 ) = kb rkL∞ (R) kφkL∞ (R) . If w identifies OP W ([0, T ]×[−Ω/2, Ω/2]), then by definition Hn w ∈ L2 (R). Then Z Z |Hn w(x)|2 dx = |hκn (x, y), w(y)iy |2 dx Z = |φ(x − n)|2 |hr(x − y), w(y) iy |2 dx. x→±∞

n→±∞

Clearly, hr(x − y), w(y) iy −→ 0 would imply kHn wkL2 (R) −→ 0 and contradict identifiability (2.4) since by (2.3) we have kHn kL(L2 (R)) ≥ AkσHn kL∞ (R2 ) = Akb rkL∞ (R) kφkL∞ (R) for all n ∈ Z. To show that an identifier w cannot decay in frequency, we choose Hn ∈ OP W ([0, T ]×[− Ω2 , Ω2 ]) −2πitν b to have spreading functions ηn (t, ν) = r(t)e2πint φ(ν)e . Let g be a Schwartz function and compute using Fubini’s Theorem and, for notational simplicity, using bilinear pairings in place of 14

sesquilinear ones,

hHn w(x), g(x)ix = ηn (t, ν), he2πixν w(x − t), g(x)ix t,ν

b = r(t)e2πint φ(ν), he2πi(x−t)ν w(x − t), g(x)ix t,ν

b = r(t)e2πint w(x − t) g(x), hφ(ν), e2πi(x−t)ν iν t,x

= hr(t)e2πint , w(x − t) φ(x − t)it , g(x) x

b = hb r(ξ − n), e−2πixξ w b ∗ φ(ξ)i ξ , g(x) x

b = rb(ξ − n) w b ∗ φ(ξ), gb(ξ) ξ . Hence, 2 [ kHn wk2L2 (R) = kH n wkL2 (R) =

Z

2 b |b r(ξ − n)|2 |hw(ξ b − ν), φ(ν)i ν | dξ ,

and we can conclude as above. We proceed by showing that local identification of operators is possible with identifiers localized both in time and frequency, Theorem 2.8. Proof of Theorem 2.8. The proof proceeds in two steps. First we show that replacing each Dirac-delta by a suitable smoothed out version locally introduces only a small error and identification using the resulting smooth identifier can be interpreted as sampling a modified bandlimited operator. Second we show that reducing to a finite number of samples also locally yields only a small error. Applying this to the modified operator arising in the first part proves that both reductions together also yield only a small error. For the first part, choose ϕ ∈ S with supp ϕ ⊆ [−δ, δ], kϕk b L∞ (R) = 1, and |ϕ(ξ) b − 1| ≤ for ξ ∈ I2 . Define Cϕ : f 7→ f ∗ ϕ and set HC = H ◦ Cϕ . Observe that ZZ HC f (x) = ηH (t, ν)e2πixν f ∗ ϕ(x − t) dt dν ZZZ = ηH (t, ν)e2πixν f (x − t − y)ϕ(y) dy dt dν ZZZ = ηH (t − y, ν)e2πixν f (x − t)ϕ(y) dy dt dν ZZ Z = ηH (t − y, ν)ϕ(y) dy e2πixν f (x − t)dt dν , that is, ηHC (t, ν) = ηH (·, ν) ∗ ϕ(t) and supp ηHC ⊆ supp ηH + [−δ, δ]×{0}. We can apply Theorem 2.4 for the operator HC with M1 := M + [−δ, δ]×{0} in place of M . As by assumption M1 + [−δ, δ]2 still has measure less than one, this can be done with δ, r and φ as given in the theorem. Defining w1 := ϕ ∗ w, we obtain (5.1) κHC (x + t, x) = LT

L−1 X j=0

X r(t − kj T ) bjq Hw1 (t − (kj − q)T )φ(x + (kj − q)T ) e2πinj Ωx q∈Z

Observe that σHC (x, ξ) = Fs ηHC (x, ξ) = σH (x, ξ) ϕ(ξ), b and, by hypothesis, we have kσHC kL∞ (R2 ) ≤ kσH kL∞ (R2 ) ≤ µ and kσH − σHC kL∞ (S) ≤ µ. e so this establishes the Note that for I1 = R, (5.1) agrees with (2.11) and we have HC = H, result. For the second part, let us assume S ⊆ I1 × R and M1 ⊂ [c, d] × R. Let ψ ∈ S(R) be P nonnegative and satisfy n ψ(x − nT ) = 1 and supp ψb ⊂ [−1/T, 1/T ]. Such a function can be 15

obtained by choosing an arbitrary bandlimited, nonnegative ψ0 ∈ S with kψ0 kL1 = 1 and defining ψ = χ[0,T ] ∗ ψ0 . P Set PA (x) = nT ∈A ψ(x−nT ), so P[−N,N ] → 1 and P[−N,N ]c → 0 uniformly on compact subsets as N → ∞. Moreover, |PA (x)| ≤ 1 for all A. Choose N () so that | PI1 +[−N (),N ()] (x) − 1| ≤ for x ∈ I1 + [c, d] and choose R() with

X

(5.2)

kP[I1 +[−N (),N ()] (x) Vφ∗ r(x − q, ξ)kL1 (R2 ) < (1 − )D.

qT ∈I / 1 +[−R(),R()]

where the nature of D is derived by the computations below. The existence of such R() follows from the fact that PI1 +[−N (),N ()] (x) and Vφ∗ r decay faster than any polynomial. P e Let w2 = kT ∈I1 +[−R(),R()]+[−δ,T +δ] ck δkT and observe that H as defined in the theorem satisfies

hHe (x + t, t) = κHe (x + t, x) = LT

L−1 X

X r(t − kj T ) bjq HC w2 (t − (kj − q)T )φ(x + (kj − q)T ) e2πinj Ωx .

j=0

q∈Z

Since M1 ⊂ [c, d] × R, we have supp HC δy ⊆ [c + y, d + y], and therefore,

HC w(x) = HC

X

X

ck δkT (x) = HC

ck δkT (x) = HC w2 (x),

kT ∈I1 +[−R(),R()]+[−T −δ,δ]+[c,d]

k∈Z

x ∈ K ≡I1 + [−R(), R()] + [−T − δ, δ] .

e ∈ OP W (M2 ), where M2 = M1 + [−δ, δ]2 (for details, see, for example, [24]). As Note that H 2 M2 + [−δ, δ] still has measure less than one, this implies that we can apply Theorem 2.4 again with the same δ. We obtain

hHC (x + t, t) − hHe (x + t, t) =LT

L−1 X

X r(t − kj T ) bjq HC w − w2 (t − (kj − q)T ) φ(x + (kj − q)T ) e2πinj Ωx

j=0

=LT

L−1 X j=0

=LT

L−1 X j=0

q∈Z

r(t − kj T )

X

bjq HC w − w2 (t − (kj − q)T ) φ(x + (kj − q)T ) e2πinj Ωx

qT ∈K−(t−k / jT )

r(t − kj T )

X

bjq HC w − w2 (t − (kj − q)T ) φ(x + (kj − q)T ) e2πinj Ωx .

qT ∈I / 1 +[−R(),R()] 16

e = K c + [−δ, T + δ] and using that (σH (x, ξ) − σ e (x, ξ)) PI +[−N (),N ()] (x) is banSetting K C 1 H dlimited to M + {0}×[−1/T, 1/T ]), we compute kσHC − σHe kL∞ (S) ≤ 1/(1 − ) k(σHC (x, ξ) − σHe (x, ξ)) PI1 +[−N (),N ()] (x)kL∞ (R2 ) 1/(1 − ) k(σHC (x, ξ) − σHe (x, ξ)) PI1 +[−N (),N ()] (x)kM ∞ (R2 ) 1/(1 − ) k(hHC (x, t) − hHe (x, t)) PI1 +[−N (),N ()] (x)kM ∞ (R2 ) L−1

X

1/(1 − ) LT PI1 +[−N (),N ()] (x) r(t − kj T ) e2πinj Ω(x−t) j=0

bjq HC (w − w2 )(t − (kj − q)T ) φ(x − t + (kj − q)T )

X

M ∞ (R2 )

qT ∈I / 1 +[−R(),R()]

≤ LT /(1 − )

L−1 X

PI1 +[−N (),N ()] (x) r(t − kj T ) e2πinj Ω(x−t)

X

j=0 qT ∈I / 1 +[−R(),R()]

bjq HC (w − w2 )(t − (kj − q)T ) φ(x − t + (kj − q)T )

M ∞ (R2 )

≤ LT /(1 − )

L−1 X

HC (w − w2 )(t − (kj − q)T )

X

M ∞ (R2 )

j=0 qT ∈I / 1 +[−R(),R()]

PI1 +[−N (),N ()] (x) r(t − kj T ) e2πinj Ω(x−t) bjq φ(x − t + (kj − q)T )

M 1 (R2 )

LT ≤ kHC kL(M ∞ (R)) kw − w2 kM ∞ (R) 1− L−1

X X

|bjq | PI1 +[−N (),N ()] (x) r(t) φ(x − t − qT )

M 1 (R2 )

j=0 qT ∈I / 1 +[−R(),R()]

,

where we used the invariance of the M ∞ and M 1 norm under translation and modulation and, for the last inequality, Theorem 4.1 – noting that, for functions constant in one of the coordinate directions, the M ∞ (R) and M ∞ (R2 ) norms agree. The second to last inequality is based on M 1 (R2 ) being a Banach algebra, namely on kg1 g2 kM 1 (R2 ) ≤ kg1 kM 1 (R2 ) kg2 kM 1 (R2 ) for g1 , g2 ∈ M 1 (R2 ). Indeed, for f ∈ M ∞ (R2 ) and g ∈ M 1 (R2 ), we have kf gkM ∞ (R2 ) =

sup kfekM 1 (R2 ) =1

≤

sup

|hf g, fei| =

|hf, fegi| ≤

sup kfekM 1 (R2 ) =1

sup

kf kM ∞ (R) kfegkM 1 (R2 )

kfekM 1 (R2 ) =1

kf kM ∞ (R) kfekM 1 (R2 ) kgkM 1 (R2 ) = kf kM ∞ (R2 ) kgkM 1 (R2 ) .

kfekM 1 (R2 ) =1

Note that with φ∗ (t) = φ(−t), we have Z r(t)φ(x − t)e−2πitξ dt = Vφ∗ r(x, ξ), which is a bandlimited function since ZZ Z Vφ∗ r(x, ξ)e2πitξ−xν dx dξ = r(t)φ(x − t)e−2πixν dx = r(t)ϕ(ν) b e−2πitν . Using that the M 1 -norm is invariant under partial Fourier transforms and the equivalence between the M 1 and L1 norms which is implied by the bandlimitation of PI1 +[−N (),N ()] (x + q) Vφ∗ r(x, ξ) to (−1/T, 1/T )×{0} + (−δ, Ω + δ)×(−δ, T + δ), we obtain

PI1 +[−N (),N ()] (x + q) r(t) φ(x − t) 1 2 PI1 +[−N (),N ()] (x + q) Vφ∗ r(x, ξ) 1 2 M (R ) M (R )

PI1 +[−N (),N ()] (x + q) Vφ∗ r(x, ξ) 1 2 . L (R )

17

Fix g ∈ S(R) and observe that kVg f kLp (R2 ) defines a norm on M p (R) equivalent to the M p (R) norm given in (4.1) [11]. For any A ⊂ R we obtain the uniform bound X X X k cn δnT kM ∞ (R) kVg cn δnT kL∞ (R) = k cn g(nT − t)e2πiνnT kL∞ (R) nT ∈A

nT ∈A

X

≤k

nT ∈A

|cn | |g(nT − t)|kL∞ (R) ≤ k

nT ∈A

X

|cn | |g(nT − t)|kL∞ (R) < ∞.

n∈Z

The first norm inequality stems from the fact that for all g ∈ M 1 (R), kVg f kLp (R2 ) defines a norm on M p (R) equivalent to the M p (R) norm given in (4.1). Combining this upper bound on kw − w2 kM ∞ (R) with the above estimate for kσHC − σHe kL∞ (S) and (5.2), we conclude kσHC − σHe kL∞ (S) . DkHC kL(M ∞ (R))

L2 T kbjq k`∞ 1−

X

PI1 +[−N (),N ()] (x + q) Vφ∗ r(x, ξ)

qT ∈I / 1 +[−R(),R()]

L1 (R2 )

≤ DkHC kL(M ∞ (R)) DkσHC kL∞ (R2 ) ≤ DkσH kL∞ (R2 ) ≤ Dµ. Choosing R() above large to yield D small enough to compensate all the multiplicative constants, we obtain kσHC − σHe kL∞ (S) ≤ µ. As a meaningful statement is only obtained for < 1, this bound directly implies that kσHe kL∞ (R2 ) ≤ 2µ. Combining this with the bound kσH − σHe kL∞ (R2 ) ≤ kσH − σHC kL∞ (R2 ) + kσHC − σHe kL∞ (R2 ) ≤ 2µ, Theorem 2.7 directly yields the result with a constant of twice the size as in Theorem 2.7.

6. Reconstruction of bandlimited operators from discrete measurements This section concerns the discrete representation given in Theorem 2.9. First, we prove this theorem, hence establishing that indeed this representation is globally exact. Proof of Theorem 2.9: The proof is similar to the proof of Theorem 2.4 given in [24]. The main idea is to use a Jordan domain argument to cover a fixed compact set M of size less than one by shifts of a rectangle that still have combined area less than one and then to combine identifiability results for each of them to obtain identifiability for the whole set. Indeed, there exist L prime and T, Ω > 0 with T Ω = L1 such that supp(η) ⊆

L−1 [

R + (kj T, nj Ω) ⊆ [−(L − 1)T /2, (L + 1)T /2] × [−LΩ/2, LΩ/2]

j=0

= [−1/(2Ω) + T /2, 1/(2Ω) + T /2] × [−1/(2T ), 1/(2T )] where R = [0, T )×[−Ω/2, Ω/2), and the sequence (kj , nj ) ∈ Z2 consists of distinct pairs. For δ > 0 small enough (and possibly slightly smaller T, Ω, and a larger prime L), one can even achieve Mδ ⊆

L−1 [

R + (kj T, nj Ω) ⊆ [−(L − 1)T /2, (L + 1)T /2] × [−LΩ/2, LΩ/2]

j=0

where Mδ is the δ-neighborhood of M . Fix such δ and let r, φ ∈ S(R) satisfy (2.14) and (2.15) for this δ. Clearly, b −nΩ) = 0, (6.1) (k, n) 6= (kj , nj ) for all j implies Sδ ∩ R+(kT, nΩ) = ∅ and η(t, γ)r(t−kT )φ(γ a fact that we shall use below. 18

P Define the identifier w = n∈Z cn δnT , where {cn } is L-periodic and observe that ZZ Hw(x) = η(t, γ) e2πiγx w(x − t) dt dγ ZZ X = η(t, γ) e2πiγ(x−t) ck δkT (x − t) dt dγ k∈Z

=

X

Z

η(x − kT, γ) e2πiγkT dγ

ck

k∈Z

=

X L−1 X

Z ck+p

η(x − (mL + k + p)T, γ) e2πiγ(mL+k+p)T dγ

m∈Z k=0

for any p ∈ Z. We shall use the non-normalized Zak transform ZLT : L2 (R) −→ L2 [0, LT ) × [−Ω/2, Ω/2) defined by X f (t − nLT ) e2πinLT γ . ZLT f (t, γ) = n∈Z

We compute using the Poisson summation formula and the fact that Ω = 1/LT (ZLT ◦ H)w(t, ν) X = Hw(t − nLT ) e2πinLT ν n∈Z

=

X

e2πiT nLν

m,n∈Z

=

L−1 X

=

=

X

ck+p

e2πiT nLν

η(t − (nL + mL + k + p)T, γ) e2πiγ(mL+k+p)T dγ

Z

η(t − (mL + k + p)T, γ) e2πiγT ((m−n)L+k+p) dγ

m,n∈Z

ck+p

XZ

k=0

m∈Z

L−1 X

XZ

ck+p

η(t − (mL + k + p)T, γ) e2πiγ(mL+k+p)T

L−1 X

ck+p

k=0

X

e2πinL(ν−γ)T dγ

n∈Z

η(t − (mL + k + p)T, γ) e2πiγ(mL+k+p)T

1 X δn/LT (ν − γ)dγ LT n∈Z

m∈Z

k=0

=Ω

Z ck+p

k=0

k=0 L−1 X

L−1 X

X

η(t − (mL + k + p)T, ν + nΩ) e2πi(ν+Ωn)(mL+k+p)T

m,n∈Z

By (6.1) we get for p = 0, . . . , L − 1, (6.2)

b r(t)φ(ν)(Z LT ◦ H)w(t + pT, ν) =Ω

L−1 X

ck+p

=Ω

b r(t)φ(ν)η(t − (mL + k)T, ν + nΩ)e2πiT (ν+nΩ)(mL+k+p)

m,n∈Z

k=0 L−1 X

X

b cp+kj r(t)φ(ν)η(t + kj T, ν + nj Ω) e2πi(ν+nj Ω)T (p+kj ) .

j=0

= Ωe2πiνpT

L−1 X

b η(t + kj T, ν + nj Ω) , (T kj M nj c)p e2πiνkj T r(t)φ(ν)

j=0

where here and in the following, T : (c0 , c1 , . . . , cL−2 , cL−1 ) 7→ (cL−1 , c0 , . . . , cL−3 , cL−2 ) and M : (c0 , c1 , . . . , cL−2 , cL−1 ) 7→ (e2πi0/L c0 , e2πi1/L c1 , . . . , e2πi(L−2)/L cL−2 , e2πi(L−1)/L cL−1 ), that 19

is, (T kj M nj c)p = e2πi

nj (p+kj ) L

cp+kj . Equivalently, we obtain the matrix equation

L−1 b [e−2πiνpT r(t)φ(ν)(Z LT ◦ H)w(t + pT, ν)]p=0

(6.3)

L−1 b = ΩA[e2πiνkj T r(t)φ(ν)η(t + kj T, ν + nj Ω)]j=0

where A is a L × L matrix, whose jth column is T kj M nj c ∈ CL . A is a submatrix of the L × L2 marix G, whose columns are {T k M l c}L−1 k,l=0 . It was shown in [16] that if L is prime, then we can choose c ∈ CL such that every L × L submatrix of G is invertible. In fact, the set of such c ∈ CL is a dense open subset of CL [16]. Hence we can apply the matrix A−1 =: [bjp ]L j,p=1 on both sides of Equation (6.3) to obtain b e2πiνkj T r(t)φ(ν)η(t + kj T, ν + nj Ω)

(6.4)

= LT

L−1 X

b bjp e−2πiνpT r(t)φ(ν)(Z LT ◦ H)w(t + pT, ν)

p=0

for every j = 0, 1, . . . , L − 1. In fact, until this point the proof agrees with the proof of (2.8) in Theorem 2.4. Indeed, if we extend {bjp }p to a L-periodic sequence by setting bj,p+mL = bjp , replace the so far unused property (2.15) by (2.9) then further computations [22] give h(x, t) = LT

L−1 X

X bjq Hw(t − (kj + q)T )φ(x − t + (kj + q)T ) e2πinj Ω(x−t) . r(t − kj T )

j=0

q∈Z ΩL β2 Z) = {TkT M`LΩ/β2 r}k,`∈Z β2 1 ΩL Z× T Z) = (r, β2 T Z×ΩLZ) is

Observe that (2.15) implies that (r, T Z ×

is a tight Gabor frame

whenever β2 ≥ 1+2δ/T as, in this case, (r, an orthogonal sequence b ΩZ × LT Z) is a and the Ron-Shen criterion applies [11, 26]. The same arguments imply that (φ, β1 tight Gabor frame. Using a simple tensor argument, we obtain that {Ψm,n,l,k }m,n,l,k∈Z forms a tight Gabor frame where b ν) Ψm,n,l,k (t, ν) = T(kT,nΩ) ML(`Ωβ2 T,T /β1 ) r⊗φ(t, = e2πiL(

mT (ν−nΩ) `Ω(t−kT ) + ) β1 β2

b − nΩ) . r(t − kT ) φ(ν

The frame bound is T ΩL2 T Ω/(β1 β2 ) = 1/(β1 β2 ). We set Φm,−n,l,−k = Fs Ψm,n,l,k . Clearly, as Fs is unitary, we have that {Φm,n,l,k }m,n,l,k∈Z forms a tight frame with frame bound 1/(β1 β2 ), in fact, a tight Gabor frame as Φm,n,l,k (x, ξ) = Fs Ψm,−n,l,−k (x, ξ) b = (FT−kT M`LΩ/β2 r)(ξ) (F −1 T−nΩ MmT L/β1 φ)(x) = (MkT T`LΩ/β2 rb)(ξ) (MnΩ TmT L/β1 φ)(x) = e2πi(nm+kl)/λ (T`LΩ/β2 MkT rb)(ξ) (TmT L/β1 MnΩ φ)(x). Note that (6.1) together with the fact that the symplectic Fourier transform is unitary implies that the coefficients in the Gabor frame expansion of σ satisfy hσ, Φm,−nj ,l,−kj i = hη, Ψm,n,l,k i = 0 unless (n, k) = (nj , kj ) for some j. (j)

Hence we need to estimate σm,` = hσ, Φm,−nj ,l,−kj i for j = 0, 1, . . . , L − 1. We obtain by (6.4) 20

(j)

σm,` = hσ, Φm,−nj ,l,−kj i = hη, Ψm,nj ,l,kj i ZZ `Ω(t−kj T ) T m(ν−nj Ω) + ) b − nj Ω)dtdν β1 β2 r(t − kj T ) φ(ν = η(t, ν)e−2πiL( ZZ `tΩ mνT b = r(t) φ(ν)η(t + kj T, ν + nj Ω)e2πiνkj T e−2πi(L( β1 + β2 )+νkj T dtdν ZZ =

LT

L−1 X

−2πi(L( b bjp e−2πiνpT r(t)φ(ν)(Z LT ◦ H)w(t + pT, ν) e

mνT β1

+ `tΩ β )+νkj T ) 2

dtdν

p=0

= LT

L−1 X

ZZ bjp

b r(t)φ(ν) e−2πiνpT (ZLT ◦ H)w(t + pT, ν) e−2πi(L(

mνT β1

+ `tΩ β )+νkj T ) 2

dtdν

p=0

= LT

L−1 X

ZZ bjp

b r(t)φ(ν) e−2πiνpT

p=0

= LT

L−1 X p=0

= LT

X

= LT

X

= LT

X

X

Hw(t + pT − qLT ) e2πiνqLT e−2πi(L(

mνT β1

+ `tΩ β )+νkj T

bjp

XZ

dtdν

Z `tΩ b r(t)Hw(t + pT − qLT )e−2πiL β2 dt φ(ν) e2πiνT (qL−p−kj −mL/β1 ) dν

q∈Z

B jq

Z

Z `tΩ b r(t)Hw(t + qT )e−2πiL β2 dt φ(ν) e2πiνT (−q−kj −mL/β1 ) dν

q∈Z

B jq φ(T (−q − kj − mL/β1 ))

Z

Hw(t)e−2πiL

`Ω(t−qT ) β2

r(t − qT )dt

q∈Z

B jq φ(T (−q − kj − mL/β1 )) hHw, TqT M`LΩ/β2 ri,

q∈Z

where B jq = bjq0 for q = mL + q 0 with q 0 = 0, 1, . . . , L − 1. Set Cq,l (Hw) = hHw, TqT M`LΩ/β2 ri. In sum,

σ(x, ξ) =

L−1 1 X X hσ, Φm,−nj ,l,−kj iΦm,−nj ,l,−kj (x, ξ) β1 β2 j=0 m,`∈Z

(6.5)

2

q∈Z

=

LT β1 β 2

L−1 X

e−2πi(xnj Ω+ξkj T )

j=0

X m,`∈Z

(j)

σm,` rb ξ −

`LΩ mT L φ x− , β2 β1

where (j)

σm,` =

X

B jq φ(a(−q − kj − mL/β1 )) Cq,l (Hw).

q∈Z 21

Applying the symplectic Fourier transform to (6.5) yields η(t, ν) = e−2πiνt η(t, ν) = e−2πiνt

L−1 LT X X (j) σm,` Fs M(−nj Ω,−kj T ) T( mT L , `LΩ ) φ⊗b r (t, ν) β1 β2 β1 β2 j=0 m,`∈Z

= e−2πiνt

LT β1 β2

L−1 X

X

(j)

σm,` T(kj T,−nj Ω) M( `LΩ ,− mT L ) r⊗φb (t, ν) β2

j=0 m,`∈Z

β1

L−1 LT X X (j) = σm,` T(kj T,−nj Ω) M( `LΩ ,− mT L ) r⊗φb (t, ν) e−2πi(ν+nj Ω)(t−kj T ) , β2 β1 β1 β2 j=0 m,`∈Z

=

LT β1 β2

L−1 X

(j) σm,` e2πinj Ωkj T T(kj T,−nj Ω) M( `LΩ −nj Ω, kj T − mT L ) r⊗φb (t, ν) e−2πiνt .

X

β2

j=0 m,`∈Z

β1

For U (t, ν) = r⊗φb (t, ν) e−2πiνt , we have ZZ −2πiνt −2πi(ξt−νx) b Fs U (x, ξ) = r(t)φ(ν)e e dν dt Z Z = r(t)φ(x − t)e−2πiξt dt = r(t)φ(t − x)e−2πiξt dt = Vφ r(x, ξ), where we used that φb real valued implies φ(y) = φ(−y). Now, we compute σ(x, ξ) = Fs η (x, ξ) =

L−1 LT X X (j) 2πinj Ωkj T σm,` e Fs T(kj T,−nj Ω) M( `LΩ −nj Ω, kj T − mT L ) U (x, ξ), β2 β1 β1 β2 j=0 m,`∈Z

=

(6.6)

=

LT β1 β2

L−1 X

X

(j)

σm,` e2πinj Ωkj T M(−nj Ω,−kj T ) T( mT L −kj T, `LΩ −nj Ω) Vφ r (x, ξ), β1

j=0 m,`∈Z

β2

L−1 mT L `LΩ + nj Ω LT X −2πi(xnj Ω+ξkj T ) 2πinj Ωkj T X (j) e e σm,` Vφ r(x − . + kj T, ξ − β1 β2 j=0 β1 β2 m,`∈Z

The convergence in (6.5) and (6.6) is defined in the weak sense, but can be shown to converge absolutely and uniformly on compact subsets. Next we prove Theorem 2.10, that is, the direct local correspondence between the discretization values and the operator action. Proof of Theorem 2.10. We intend to apply Theorems 2.7 and 2.9. We assume that the set M as well as its enclosing rectangular grid are fixed, hence also the parameters T , Ω, and L. The dependence of the constants, auxiliary functions, etc., in the following derivations on these parameters will be suppressed for notational convenience; this should be seen as analogue to the one-dimensional scenario where the arising constants also depend on the shape and not just the size of the frequency support. Furthermore, set Q = max(LT, LΩ). We can bound using (3.3) (j)

|σm,` | = |hσ, Φm,−nj ,`,−kj i| (6.7)

≤ kσk∞ kΦm,−nj ,`,−kj k1 ≤ Bkσk∞ kˆ r ⊗ φk1 ˜ ≤ Bµ

For the second inequality, we used that the L1 -norm is invariant under translations and modulations. 22

2 Furthermore, note that Vφ r∈ S(R ), so there is a decreasing positive function ρ ∈ S([0, ∞)) 1 such that for ρe(x, ξ) = ρ |x| ρ |ξ| one has |Vφ r| ≤ 8CT e pointwise. ˜ ρ Now observe that, as ρ is decreasing, ∞ X

αρ(αj) ≤ ρ(0) +

j=0

Zαj ∞ X

ρ(t)dt = kρk1 + kρk∞ .

j=1 α(j−1)

We use this estimate to bound for arbitrary (x, ξ) |˜ σ (x, ξ)| LT L−1 X X `L mL (j) = + kj T, ξ − + nj Ω e−2πi(xnj Ω+ξkj T ) e2πinj Ωkj T σm,` Vφ r(x − β1 β2 j=0 β1 β2 (mLT /β1 ,`LΩ/β2 )∈S X mL LT `L (j) |σm,` | Vφ r(x − + kj T, ξ − + nj Ω ≤ β 1 β2 β1 β2 (mLT /β1 ,`LΩ/β2 )∈S

≤

LT β 1 β2

L−1 X

X

j=0 (mLT /β1 ,`LΩ/β2 )∈S

˜ mL Cµ `L ρ x − + kj T ρ ξ − + nj Ω ˜ β1 β2 8CT

L−1 ∞ µ X X LT mL LΩ `L 4 ρ T ρ Ω 8LΩT j=0 β1 β1 β2 β2 m,`=0 2 µ ≤ kρk1 + kρk∞ 2 and hence

≤

2 µ kρk1 + kρk∞ =: C1 µ. 2 By the definition of S, for every δ > 0, there is a constant C(δ) such that for any fixed 0 ≤ j < L, kσ − σ ˜ k∞ ≤ kσk∞ + k˜ σ k∞ ≤ µ +

δ ≥8kρkL1 (R+ ) kρkL1 [C(δ)−2Q,∞) ≥ 8ke ρkL1 (([−C(δ)+Q,C(δ)−Q]2 )c ) ,

(6.8)

and hence, for (x, ξ) ∈ S − B(C(δ)), (6.9)

δ≥

L2 β1 β2

mL `L ρ x − + kj T ρ ξ − + n j Ω . β1 β2

X

`,m∈Z

2 `L x− mL / β T,ξ− β Ω ∈[−C(δ),C(δ)] 1

2

To obtain (6.9) from (6.8), the boundary term in the discretization of the integral and the shifts by kj and nj , respectively, are each compensated by increasing the dimensions of the integration/summation domain by LT and LΩ in time and frequency, respectively, both of which are bounded by Q. Note furthermore that, as (x, ξ) ∈ S − B(C(δ)), a necessary condition for `L mL T, ξ − Ω ∈ / [−C(δ), C(δ)]2 x− β1 β2 is that (mLT /β1 , `LΩ/β2 ) ∈ / S. Thus, using (6.7) and the triangle inequality, we can bound (6.9) from below obtaining L2 T mL `L X (j) (6.10) δµ ≥ σm,` Vφ r x − + kj T, ξ − + nj Ω . β1 β2 β1 β2 (mLT /β1 ,`LΩ/β2 )∈S /

23

Hence forming a weighted average (with complex weighting factors of modulus one) of Equation (6.10) over the L choices of j, we obtain LT L−1 X X δµ ≥ e−2πi(xnj Ω+ξkj T ) e2πinj Ωkj T β1 β2 j=0 mLT `LΩ (

β1

,

β2

mL `L (j) σm,` Vφ r x − +kj T, ξ − +nj Ω β1 β2

)∈S /

=|σ(x, ξ) − σ ˜ (x, ξ)|. This yields kσ − σ ˜ kL∞ (S−B(C(δ))) ≤ δµ. Hence by Theorem 2.7, we conclude that ˜ k2 ≤ C δ µ kHf − Hf C1 for all functions f which are Cδ1 -time-frequency-localized to S − B(C(δ)) − B(d()). The result follows by choosing δ = min CC1 , C1 and D() = C(δ) + d(). 7. Quantization of bandlimited operators The underlying idea of the quantization schemes in Theorem 2.11 and Corollary 2.12 is based on Σ∆ modulation. At the core of this arguably most influential coarse quantization paradigm is the observation that the reconstruction formula in Theorem 2.1 directly corresponds to the application of a low-pass filter. The key idea is then that, while, due to the coarseness of the alphabet, the sequence yn − qn cannot be made uniformly small, it can be chosen to be approximately high-pass, that is, close to the kernel of the low-pass operator; hence it almost vanishes in the reconstruction P procedure. In other words, fe(t) = T n∈Z qn φ(t − nT ) is a good approximation for f . The main goal of this section is to show, in general terms, the possibility of combining the discretization procedure presented above with coarse quantization schemes. Hence, we will restrict ourselves to the simplest possible Σ∆ modulator, a so-called first order Σ∆ modulator. Usually, with so-called higher order modulators [6, 12, 7] considerably better error decay rates can be achieved. There are no specific obstacles that would prevent the direct application of such higher order modulators in the operator context. However, the estimates would be considerably more complicated without providing much additional insight, which is why we we refrain from presenting them here. To define one-bit Σ∆ modulator, we first fix a kernel φ such that φ ∈ S and h a first-order i supp φ ⊂ − λ20 , λ20 for some λ0 > Ω. Then the minimal sampling step is T0 = λ10 . So sampling at step size T corresponds to an oversampling ratio of λ = TT0 . As we are interested in the approximation behavior when the redundancy of the dictionary increases, we do not want to consider multiple quantization schemes. Hence we fix an underlying quantization alphabet with an associated quantization rule. Again, we will focus on the simplest possible scenario, namely one-bit quantization, where the quantization alphabet just has two elements. Using this quantization rule, a first order Σ∆ modulator computes a sequence of quantized values {qn } by means of the iterative scheme (7.1)

un = un−1 + f (nT ) − qn qn = sign(f (nT ) + un−1 ),

with an initial condition u0 . One can show (see for example [6]) that if |f (t)| ≤ 1 and |u0 | ≤ 1, then the state variable u in (7.1) satisfies ∀n ∈ Z, |un | ≤ 1. This entails a bound for the quantization error of kf − fekL∞ (R) ≤ 24

1 kφ0 kL1 (R) . 2Ωλ

Proof of Theorem 2.11. By Theorem 2.9, σ = LT σ (j) (x, ξ) =

PL

j=1

ωj σ (j) , where ωj are phase factors and

1 X (j) mL `L σm,` Vφ r(x − + kj T, ξ − + nj Ω . β1 β2 β1 β2 m,`∈Z

(j)

We will quantize the doubly-indexed coefficient sequence {σm,` } of the expansion in (7.2), separately for each j = 0, 1, . . . , L − 1. For simplicity, we will drop the superscript (j) for now, since most of the proof proceeds independently for each j. We also drop the nj and kj , as for fixed j, they can be absorbed in x and ξ. Hence we seek to quantize an expansion of the form mL `L 1 X σm,` Vφ r x − T, ξ − Ω . σ(x, ξ) = β1 β2 β1 β2 m,`∈Z

We proceed by applying first order Sigma-Delta modulators subsequently in both time and frequency. Such an approach has been successfully applied by Yılmaz [27] to devise a coarse quantization scheme for Gabor expansions. More specifically, we recursively define doubly indexed sequences u, p, v, r as follows. un1 ,n2 = un1 −1,n2 + σn1 ,n2 − pn1 ,n2 , (7.2)

pn1 ,n2 =

sign(un1 −1,n2 + σn1 ,n2 ).

vn1 ,n2 =

vn1 ,n2 −1 + un1 ,n2 − rn1 ,n2 ,

rn1 ,n2 =

sign(vn1 ,n2 −1 + un1 ,n2 ).

For a bivariate sequence a, we denote (∆1 a)n1 ,n2 = an1 ,n2 − an1 −1,n2 and (∆2 a)n1 ,n2 = an1 ,n2 − an1 ,n2 −1 . Then we have (7.3)

(∆1 ∆2 v)n1 ,n2 = σn1 ,n2 − qn1 ,n2 , qn1 ,n2 = pn1 ,n2 + (∆1 r)n1 ,n2 .

The resulting sequence qm,` ∈ {±1, ±3} of quantized values will be used for reconstructing σ, these equation hence define the bivariate Sigma-Delta modulator. Note that to properly define the Sigma-Delta modulator, one also needs to set an initial condition for some finite index. However, the above recurrence relations are reversible in the sense that they allow to apply the quantization procedure backwards in time and frequency, hence allowing for the acquisition of quantized values for all (m, `) ∈ Z2 . A more realistic scenario with only finitely many quantized values used for the approximation is obtained by combining this result with the localisation result given in Theorem 2.10. As the Sigma-Delta scheme given by (7.2) and (7.3) is a combination of two first order SigmaDelta schemes, its stability is a direct consequence of the stability of (7.1), see also [27]. We summarize this observation in the following proposition. Proposition 7.1. Suppose that |σn1 ,n2 | ≤ 1. If |u0,n2 | ≤ 1 for each n2 ∈ Z, and |vn1 ,0 | ≤ 1 for each n1 ∈ Z, then |un1 ,n2 | ≤ 1, |vn1 ,n2 | ≤ 1 for every n1 , n2 ∈ Z. Stability is a crucial ingredient for the error analysis of the bivariate Sigma-Delta scheme. Proposition 7.2. Let σ : R2 → C be as in (7.2) and qm,` , m, ` ∈ Z be the quantized values resulting from the bivariate Sigma-Delta quantization given by (7.2) and (7.3) with σm,` ≤ 1 for all m, ` ∈ Z2 . Then the symbol reconstructed according to the formula mL `L 1 X qm,` Vφ r x − T, ξ − Ω . σ ˜ (x, ξ) = β1 β2 β1 β2 m,`∈Z

satisfies kσ − σ ekL∞ ≤

1

∂2

Vφ r 1 .

β1 β2 ∂y ∂x L 25

Proof. We compute β1 β2 |σ(x, ξ) − σ e(x, ξ)| X m ` = (σm,` − qm,` ) Vφ x − LT, ξ − LΩ β1 β2 m,`∈Z X m ` = (∆1 ∆2 v)m,` Vφ x − LT, ξ − LΩ β1 β2 m,`∈Z Z X (m+1)LT /β1 ∂ ` ∂ `+1 = vm,` Vφ r x − u, ξ − LΩ − Vφ r x − u, ξ − LΩ du ∂x β2 ∂x β2 mLT /β1 m,`∈Z Z (m+1)LT /β1 Z (`+1)LΩ/β2 X ∂2 = vm,` Vφ r x − u, ξ − v dvdu ∂ξ ∂x `LΩ/β2 mLT /β1 m,`∈Z Z Z (`+1)LΩ/β2 (m+1)LT /β 1 X ∂2 ≤ |vm,` | Vφ r(x − u, ξ − v) dvdu ∂ξ ∂x `LΩ/β mLT /β 2 1 m,`∈Z ZZ 2 ∂ ≤ Vφ r(u, v) dvdu . ∂y ∂x In the last step we used the stability of the scheme, as established by Proposition 7.1. To complete the proof of Theorem 2.11, we recall from (6.7) that there exists a constant C˜ such (j) ˜ ˜ that, for all m, ` ∈ Z, |σm,` | ≤ Ckσk ∞ ≤ cC by assumption of the Theorem. Hence choosing 1 ˜, C

(j)

we ensure that |σm,` | ≤ 1 and we can apply Proposition 7.2 to conclude that

1

∂2 kσ (j) − σ ˜ (j) kL∞ ≤ Vφ r .

β1 β2 ∂ξ ∂x L1 The right hand side is bounded by an absolute constant, so by Proposition 4.1 for p = 2, we conclude ˜ (j) f kL2 ≤ C˜ kf kL2 , where H (j) and H ˜ (j) are the operators with symbol σ (j) that kH (j) f − H β1 β2 (j) 2 ˜ and σ ˜ , respectively. Choosing C = L T C, the theorem follows via the triangle inequality. c=

Sketch of proof of Corollary 2.12. We quantize H as described above. Due to the recursive nature, we need only finitely many of the σm,` to compute any given finite set of qm,` ’s. This gives rise to the first summand in the error bound. The step from the quantized representation to the quantized representation with only finitely summands proceeds completely analogously to the proof of Theorem 2.10; in particular, the error arising in this step can be bounded by the second summand of the error bound. Due to the completely analogous nature of the proof, we will not repeat the details here. Acknowledgments. The authors thank Onur Oktay, who participated in initial discussions on the project. Part of this research was carried out during a sabbatical of G.E.P. and a stay of F.K. at the Department of Mathematics and the Research Laboratory for Electronics at the Massachusetts Institute of Technology. Both are grateful for the support and the stimulating research environment. F.K. acknowledges support by the Hausdorff Center for Mathematics, Bonn. G.E.P. acknowledges funding by the German Science Foundation (DFG) under Grant 50292 DFG PF-4, Sampling Operators. References 1. W.U. Bajwa, K. Gedalyahu, and Y.C. Eldar, Identification of parametric underspread linear systems and superresolution radar, IEEE Trans. Signal Process. 59 (2011), no. 6, 2548–2561. 2. P.A. Bello, Measurement of random time-variant linear channels, IEEE Trans. Comm. 15 (1969), 469–475. ¨ Yılmaz, Sigma-Delta quantization and finite frames, IEEE Trans. Inform. 3. J. J. Benedetto, A. M. Powell, and O. Theory 52 (2006), 1990–2005. 26

4. J.J. Benedetto and O. Oktay, Pointwise comparison of PCM and Σ∆ quantization, Constr. Approx. 32 (2010), no. 1, 131–158. ´ ad B´ 5. Arp´ enyi, Karlheinz Gr¨ ochenig, Kasso A. Okoudjou, and Luke G. Rogers, Unimodular Fourier multipliers for modulation spaces, J. Funct. Anal. 246 (2007), no. 2, 366–384. 6. I. Daubechies and R. DeVore, Reconstructing a bandlimited function from very coarsely quantized data: A family of stable sigma-delta modulators of arbitrary order, Ann. Math. 158 (2003), 679–710. 7. P. Deift, C. S. G¨ unt¨ urk, and F. Krahmer, An optimal family of exponentially accurate one-bit sigma-delta quantization schemes, Comm. Pure Appl. Math. 64 (2011), no. 7, 883–919. 8. H.G. Feichtinger, Atomic characterizations of modulation spaces through Gabor-type representations, Rocky Mountain J. Math. 19 (1989), 113–126. 9. H.G. Feichtinger and K. Gr¨ ochenig, Gabor wavelets and the Heisenberg group: Gabor expansions and short time Fourier transform from the group theoretical point of view, Wavelets, Wavelet Anal. Appl., vol. 2, Academic Press, Boston, MA, 1992, pp. 359–397. 10. G.B. Folland, Harmonic analysis in phase space, Annals of mathematics studies, vol. 122, Princeton University Press, 1989. 11. K. Gr¨ ochenig, Foundations of Time-Frequency Analysis, Birkh¨ auser, Boston, 2001. 12. C. S. G¨ unt¨ urk, One-bit Sigma-Delta quantization with exponential accuracy, Comm. Pure Appl. Math. 56 (2003), 1608–1630. 13. R. Heckel and H. Boelcskei, Identification of sparse linear operators, preprint. 14. T. Kailath, Measurements on time-variant communication channels., IEEE Trans. Inform. Theory 8 (1962), no. 5, 229– 236. 15. W. Kozek and G.E. Pfander, Identification of operators with bandlimited symbols, SIAM J. Math. Anal. 37 (2005), no. 3, 867–888. 16. F. Krahmer, G.E. Pfander, and P. Rashkov, Uncertainty principles for timefrequency representations on finite Abelian groups, Appl. Comput. Harmon. Anal. 25 (2008), 209–225. ¨ Yılmaz, Alternative dual frames for digital-to-analog conversion in sigma17. M. Lammers, A.M. Powell, and O delta quantization, Adv. Comput. Math. 32 (2010), no. 1, 73–102. 18. J. Lawrence, G.E. Pfander, and D. Walnut, Linear independence of Gabor systems in finite dimensional vector spaces, J. Fourier Anal. Appl. 11 (2005), no. 6, 715–726. 19. K.A. Okoudjou, A Beurling-Helson type theorem for modulation spaces, J. Funct. Spaces Appl. 7 (2009), no. 1, 33–41. 20. A.V. Oppenheim, R.W. Schafer, and J.R. Buck, Discrete-time signal processing, 2nd ed., Prentice-Hall signal processing, Prentice-Hall, Upper Saddle River, NJ, 1999. 21. Daniel P. Petersen and David Middleton, Sampling and reconstruction of wave-number-limited functions in N -dimensional Euclidean spaces, Information and Control 5 (1962), 279–323. MR 0151331 (27 #1317) 22. G.E. Pfander, Sampling of operators, to appear in J. Four. Anal. Appl. 23. , Measurement of time–varying Multiple–Input Multiple–Output channels, Appl. Comp. Harm. Anal. 24 (2008), 393–401. 24. G.E. Pfander and D. Walnut, Sampling and reconstruction of operators, preprint. 25. G.E. Pfander and D.F. Walnut, Measurement of time-variant linear channels, IEEE Trans. Inform. Theory 52 (2006), no. 11, 4808–4820. 26. A. Ron and Z. Shen, Frames and stable bases for shift–invariant subspaces L2 (Rd ), Canadian Journal of Mathematics 47 (1995), no. 5, 1051–1094. ¨ Yılmaz, Coarse quantization of highly redundant time-frequency representations of square-integrable func27. O. tions, Appl. Comput. Harmon. Anal. 14 (2003), 107–132.

27

LOCAL SAMPLING AND APPROXIMATION OF

LOCAL SAMPLING AND APPROXIMATION OF

Suggest Documents

Provably Good Surface Sampling and Approximation - Stanford ...

Local approximation of superharmonic and superparabolic functions ...

Order of approximation for nonlinear sampling

Local approximation using Hermite functions

Local Approximation in N-dimensions

Local approximation using Hermite functions

Sampling approach to sparse approximation ... - Semantic Scholar

Approximation of rejective sampling inclusion probabilities and ... - arXiv

Sampling-Based Dimension Reduction for Subspace Approximation

Submodular Approximation: Sampling-based ... - Google Sites

Approximation by Nonlinear Multivariate Sampling Kantorovich Type

Submodular Approximation: Sampling-based Algorithms ... - CiteSeerX

Gaussian Approximation of Local Empirical Processes ... - CiteSeerX

Local finite-time Lyapunov exponent, local sampling and probabilistic ...

Local Polynomial Approximation for Unsupervised Segmentation of ...

Non-Local Compressive Sampling Recovery

Provably Good Surface Sampling and Approximation - EECS at UC ...

Neural network approximation of sampling yield-effort curves of rice ...

A Combination of Downward Continuation and Local Approximation ...

Local Adaption for Approximation and Minimization of Univariate ...

On local symbolic approximation and resolution of ODEs using Implicit

Appendix S1: Approximation of the sampling rate distribution - PLOS

Time-Dependent Superfluid Local Density Approximation

Parallel local approximation MCMC for expensive models