1
On the Complexity Distribution of Sphere Decoding Dominik Seethaler, Joakim Jald´en, Member, IEEE, Christoph Studer, Member, IEEE, and Helmut B¨olcskei, Fellow, IEEE Dedicated to the memory of Alex B. Gershman (1962-2011) Abstract—We analyze the (computational) complexity distribution of sphere decoding (SD) for random infinite lattices. In particular, we show that under fairly general assumptions on the statistics of the lattice basis matrix, the tail behavior of the SD complexity distribution is fully determined by the inverse volume of the fundamental regions of the underlying lattice. Particularizing this result to N × M , N ≥ M , i.i.d. circularly symmetric complex Gaussian lattice basis matrices, we find that the corresponding complexity distribution is of Pareto-type with tail exponent given by N −M +1. A more refined analysis reveals that the corresponding average complexity of SD is infinite for N = M and finite for N > M . Finally, for i.i.d. circularly symmetric complex Gaussian lattice basis matrices, we analyze SD preprocessing techniques based on lattice-reduction (such as the LLL algorithm or layer-sorting according to the V-BLAST algorithm) and regularization. In particular, we show that latticereduction does not improve the tail exponent of the complexity distribution while regularization results in a SD complexity distribution with tails that decrease faster than polynomial.
and understanding the complexity distribution is important, for example, when SD is used under practically relevant runtime constraints (see, e.g., [8]). In this paper, we make a first attempt in this direction by analyzing the tail behavior of the SD complexity distribution in terms of corresponding tail exponents (i.e., polynomial decay rates). This approach allows us, among others, to characterize the impact of lattice basis matrix preprocessing–e.g., regularization or lattice-reduction– on the tail behavior of the SD complexity distribution. The main contributions of this paper can be summarized as follows: •
Index Terms—Closest lattice point problem, complexity distribution, MIMO wireless, random lattices, sphere decoding.
I. I NTRODUCTION Finding the closest lattice point in an infinite lattice is commonly referred to as the closest lattice point (CLP) problem (see, e.g., [1]). The sphere decoding (SD) algorithm [1]–[8] is a promising approach for solving the CLP problem efficiently. The (computational) complexity of SD, as measured in terms of the number of lattice points searched by the algorithm, depends strongly on the lattice basis matrix and is, in general, difficult to characterize analytically. However, if the lattice basis matrix is assumed random, the complexity of SD is random as well and one can resort to a characterization of the complexity distribution of SD. This approach is similar in spirit to the line of work conducted in the 1960’s aimed at characterizing the complexity distribution of sequential decoding of convolutional codes [9], [10], where the randomness of the complexity is a consequence of random additive noise. Previous work on the complexity of SD focused on characterizing the mean and the variance of SD complexity for i.i.d. Gaussian lattice basis matrices [11]–[14]. Characterizing This work was supported in part by the STREP project No. IST-026905 (MASCOT) within the Sixth Framework Programme of the European Commission. This paper was presented in part at the IEEE International Symposium on Information Theory, Seoul, South Korea, June 2009. D. Seethaler is with RobArt, 4020 Linz, Austria (e-mail:
[email protected]). J. Jald´en is with the ACCESS Linnaeus Center, Signal Processing Lab, KTH Royal Institute of Technology, 100 44 Stockholm, Sweden (email:
[email protected]). C. Studer is with the Department of Electrical and Computer Engineering, Rice University, Houston, TX 77005 USA (e-mail:
[email protected]). H. B¨olcskei is with the Department of Information Technology and Electrical Engineering, ETH Zurich, 8092 Zurich, Switzerland (e-mail:
[email protected]).
•
•
Under fairly general assumptions on the statistics of the lattice basis matrix and for a large class of preprocessing methods, we prove that the tail exponent of the SD complexity distribution is given by the tail exponent of the distribution of the inverse volume of the fundamental regions of the underlying lattice. These results comprise, for example, lattice basis matrices with non-zero mean correlated complex Gaussian distributed entries as well as preprocessing through lattice-reduction (LR) (see, e.g., [1]). Specializing our main result to the case of N × M , N ≥ M , i.i.d. circularly symmetric complex Gaussian lattice basis matrices, we find that the complexity distribution of SD is of Pareto-type (i.e., the corresponding tails decrease polynomially) with tail exponent given by N − M + 1. We show that this tail exponent cannot be improved (i.e., increased) by LR including layer-sorting (LS) (e.g., according to the V-BLAST algorithm [15]) as a special case. We find, however, that regularization of the lattice basis matrix results in a complexity distribution with faster-than-polynomially decreasing tails. Here it is important to note that solving the CLP on a regularized lattice basis matrix does, in general, not yield the solution to the original CLP. For i.i.d. circularly symmetric complex Gaussian lattice basis matrices, we show that the average (w.r.t. the lattice basis matrix) complexity of SD is infinite for N = M and finite for N > M ; this complements results derived in [11], [12].
Notation: We write Ai,j for the entry in the ith row and jth column of the matrix A and xi for the ith entry of the vector x. Slightly abusing common terminology, we call an N × M , N ≥ M , matrix A unitary if it satisfies AH A = I, where H denotes conjugate transposition, i.e., transposition T followed by element-wise complex conjugation ∗ , and I is the identity matrix of appropriate size. The inverses of A and AH are referred to as A−1 and A−H , respectively. For a N × M matrix A, vec(A) = [aT1 . . . aTM ]T , where
2
ai , i = 1, . . . , M , is the ith column of A. The ordered eigenvalues of a positive semidefinite M × M matrix A are referred to as λi (A), i = 1, . . . , M , with ordering such that 0 ≤ λ1 (A) ≤ · · · ≤ λM the corresponding determinant Q(A); M is given by det(A) = i=1 λi (A). The Euclidean- and the Frobenius norm are denoted by k · k and k · kF , respectively, and |X | refers to the cardinality of the set X . The ceil-function is designated as d·e. Furthermore,√CZ stands for the set of Gaussian integers, i.e., CZ = Z + −1 Z, where Z is the set of integers. The lattice with N × M , N ≥ M , full-rank basis matrix A is defined as L(A) = Ad : d ∈ (CZ)M ⊂ CN . For N = M , the corresponding covering radius is given by the largest distance from a point in CM to its closest lattice point in L(A) [16], i.e., by µ(A) = max
min
x∈CM d∈(CZ)M
kx − Adk.
(1)
E{·} stands for the expectation operator. We write x ∼ χa if the random variable (RV) x is χ-distributed with a > 0 degrees of freedom and normalized such that E x2 = a. The probability density function (pdf) of the RV x ∼ χa is then given by [17] 21−a/2 a−1 −t2 /2 t e , t≥0 (2) Γ(a/2) R∞ and fx (t) = 0, t < 0, where Γ(a) = 0 y a−1 e−y dy refers to the Gamma function. For the corresponding cumulative distri bution function (cdf) we have P x ≤ t = γa/2 (t2 /2), where γa (t) denotes the (regularized) lower incomplete Gamma function. If x ∼ χa , the RV y = x2 is χ2 -distributed with a > 0 degrees of freedom and cdf P y ≤ t = γa/2 (t/2). A complex Gaussian RV x with mean µx and variance σx2 , i.e., x − µx is circularly symmetric Gaussian distributed, is denoted as x ∼ CN (µx , σx2 ). The “little o” notation g(x) = o(f (x)), x → x0 , stands for limx→x0 (g(x)/f (x)) = 0. We d write = for equality in distribution. The natural logarithm a is referred to as log(·). By g(x) ∼ f (x), x → x0 , we . mean that limx→x0 (g(x)/f (x)) = 1. We write g(x) = f (x), x → x0 , to mean that limx→x0 (log g(x)/log(x)) = limx→x0 (log f (x)/log(x)), assuming, of. course,. that the corresponding limits exist. The symbols ≤ and ≥ are defined . . analogously. We write g(x) = x±∞ , x → x0 , and g(x) = x0 , x → x0 , respectively, for limx→x0 (log g(x)/log(x)) = ±∞ and limx→x0 (log g(x)/log(x)) = 0. fx (t) =
A. The Closest Lattice Point Problem The CLP problem (or integer least squares problem) refers to computing b = arg min kr − Hdk2 d
(3)
d∈(CZ)M
for a given vector r ∈ C and a given full-rank matrix H ∈ CN ×M , N ≥ M . In words, solving (3) amounts to finding the point in the lattice L(H) that is closest (in Euclidean sense) to r. In communications, (3) is known as the N
maximum-likelihood (ML) detection problem for detecting a transmitted vector d0 ∈ (CZ)M based on the linear model r = Hd0 + w
(4)
with H known at the receiver and noise w having i.i.d. circularly symmetric complex Gaussian components. For example, in the case of ML detection in multiple-input multiple-output (MIMO) wireless systems with spatial multiplexing, H is the channel matrix, which is typically assumed to consist of i.i.d. circularly symmetric complex Gaussian components. For examples of further ML detection problems, which are of the form (3), we refer to [6], [12], [18]. We note that in the practically relevant case of d0 in (4) being drawn from a finite subset A of (CZ)M , ML detection corresponds to (3) only if A is relaxed to (CZ)M . Such a relaxation step induces a performance loss (see, e.g., [19]) but is necessary if, e.g., SD in combination with LLL-based preprocessing is used. B. Sphere Decoding A prominent approach for solving (3) is the SD algorithm [1]–[8]. In the following, we consider Fincke-Pohst SD [2] without radius reduction and restarting as done, e.g., in [11]– [14]. The algorithm starts by computing the (unique) QRdecomposition (QRD) H = QR, where Q is unitary of dimension N ×M and R is an M ×M upper triangular matrix with positive real-valued elements on its main diagonal. We note that (3) can equivalently be written as b = arg min ky − Rdk2 d
(5)
d∈(CZ)M
where y = QH r. Next, (5) is solved subject to a sphere constraint (SC), which amounts to considering only those lattice points Rd that lie within a hypersphere of radius ρ around y, i.e., all d that satisfy ky − Rdk2 ≤ ρ2 .
(6)
Here, the sphere radius ρ has to be chosen sufficiently large for the search sphere to contain at least one lattice point Rd. Note, however, that if ρ is chosen too large, too many points will satisfy the SC and the complexity of SD will be high. As detailed next, triangularization and the corresponding SC (6) enable an efficient recursive solution of the CLP problem in (3). Consider the length-k subvectors dk ∈ (CZ)k of d defined as dk = (dM −k+1 · · · dM )T , k = 1, . . . , M , where k is called the layer index. The metric ky − Rdk2 = kyM − RM dM k2 can be computed recursively according to kyk − Rk dk k2 = kyk−1 − Rk−1 dk−1 k2 + |∆k (dk )|2 (7) where 2 M X |∆k (dk )| = yM −k+1 − RM −k+1,i di 2
(8)
i=M −k+1
denotes the metric update for layer k, Rk refers to the k × k bottom right (upper triangular) submatrix of R associated with dk , and yk = (yM −k+1 · · · yM )T . Thus, with (7), a necessary
3
condition for d to satisfy the SC is that any associated dk satisfy the partial SC kyk − Rk dk k2 ≤ ρ2 .
(9)
This formulation now enables the following approach for finding all integer vectors d that satisfy (6) in an efficient (recursive) manner. Starting with layer k = 1, the condition (9) reduces to |yM − RM,M dM |2 ≤ ρ2
(10)
which states that the M th component dM of all d ∈ (CZ)M satisfying (6) must be located inside a circle of radius ρ/RM,M and center point yM /RM,M . In the next step, for every dM ∈ CZ that satisfies (10), one finds all dM −1 ∈ CZ such that (9) is satisfied for k = 2. This procedure is repeated until k = M . Among the lattice points delivered by the algorithm, the one with minimum ky − Rdk2 constitutes the solution of (5). II. C OMPLEXITY D ISTRIBUTION OF SD We define the computational complexity of SD as the number of lattice points searched by the algorithm, i.e., the total number of vectors dk ∈ (CZ)k , k = 1, . . . , M , that satisfy the partial SCs in (9) (cf. [11], [20]). Specifically, we define the kth layer complexity of SD as (11) Sk = dk ∈ (CZ)k : kyk − Rk dk k2 ≤ ρ2 with the resulting total complexity S=
M X
(12)
Sk .
k=1
It was shown in [7] that S is proportional (at least for the finite-lattice case) to the VLSI-implementation complexity of SD. A. Complexity Distribution and Tail Exponents The quantities Sk , k = 1, . . . , M , and S, defined above, are functions of H, r, and ρ. In the following, we let H and r be random (potentially statistically dependent) and we let ρ be a deterministic, strictly positive, and finite scalar. For example, in the case of ML detection of the transmitted data vector d0 in MIMO wireless systems with spatial multiplexing, r is given by (4), where the entries of H (the channel matrix) and w (the noise vector) are typically assumed i.i.d. circularly symmetric complex Gaussian. Furthermore, ρ is typically chosen as a function of the noise variance such that the SD algorithm finds d0 with a certain (high) probability (cf. [4], [11], [21]). Since H and r are random, Sk and S are random as well and can be characterized through their respective (complementary) distributions: The layer-wise complexity distributions P[Sk ≥ L], k = 1, . . . , M , and the total complexity distribution P[S ≥ L]. While these distributions are hard to come by analytically, it turns out that the corresponding tail exponents ξk , k = 1, . . . , M , and ξ, defined by .
P[Sk ≥ L] = L−ξk , and
.
P[S ≥ L] = L−ξ ,
L→∞ L→∞
are amenable to an analytical characterization. We note that PM S = k=1 Sk implies ξ = min{ξ1 , . . . , ξM }.
(13)
The tail exponents characterize the tail behavior of the corresponding complexity distributions in terms of polynomial decay rates in L for L → ∞. We note that the tail exponents are non-negative by definition. If the tail exponent is infinity or zero, the corresponding complexity distribution decreases faster or slower than polynomial in L, respectively. In particular, for finite non-zero tail exponents, the corresponding complexity distributions are of Pareto-type meaning that they decay polynomially in L for large L. The tail exponents can be used to draw general conclusions about the corresponding complexity distributions as follows. If the complexity distributions P[S (1) ≥ L] and P[S (2) ≥ L] have tail exponents ξ (1) and ξ (2) with ξ (1) > ξ (2) , we have P[S (1) ≥ L] < P[S (2) ≥ L] for sufficiently large L. Similarly, if the complexity distribution P[S ≥ L] has tail exponent ξ, we have L−(ξ+δ) ≤ P[S ≥ L] ≤ L−(ξ−δ)
(14)
for any δ > 0 and sufficiently large L. Larger tail exponents are, in general, desirable as they imply that the probability of the complexity being atypically large is smaller. This, for example, is advantageous in the context of MIMO detection under practically relevant run-time constraints, i.e., when limits on the number of lattice points that can be searched are imposed (see, e.g., [8]). We conclude this section by emphasizing that the complexity tail exponent as defined above characterizes one specific characteristic of the corresponding complexity distribution and does by no means yield a full picture. For example, if P[S ≥ L] = c L−1 , with some constant c > 0 not depending on L, the corresponding tail exponent equals 1 irrespective of the multiplicative constant c. Furthermore, if the tail exponents of P[S (1) ≥ L] and P[S (2) ≥ L] are equal, i.e., ξ (1) = ξ (2) = ξ, no order relation between P[S (1) ≥ L] and P[S (2) ≥ L] can be inferred from ξ.
B. Main Result Preliminaries: The complexity of SD can often be reduced by employing additional channel matrix preprocessing techniques such as lattice-reduction (LR) (see, e.g., [1]) or regularization (see, e.g., [8], [22]). For LR, which includes layer-sorting (LS) as a special case, SD is applied to the triangularized form of the CLP problem (5) obtained by applying the QRD to HT instead of H, where T is a unimodular matrix depending solely on H (see Section III-C for more details). In the remainder of the paper, any type of preprocessing is incorporated into the mapping from r to y and the mapping from H to R. In particular, we assume that y is a general function of r and H and that R is a general function of H only, where both functions depend on
4
the specific preprocessing1 . We note that this implies that the submatrices Rk , k = 1, . . . , M , of R (cf. (9)) are general functions of H only. Theorem 1: Consider SD with fixed ρ (0 < ρ < ∞) and let H and r be random (potentially statistically dependent). The corresponding kth layer complexity Sk , defined in (11), satisfies 1 . ≥L , L→∞ (15) P[Sk ≥ L] = P det(RH k Rk ) provided all of the following conditions are met: • Statistics of H: There exists a constant β ∈ R, β > 0, such that the probability density function (pdf) f (H) of H satisfies the scaling property f (H) ≥ βf (aH)
(16)
for all H ∈ C and all a ∈ R, a > 1. Statistics of H and preprocessing: The covering radius µ(R) of L(R) satisfies N ×M
•
.
P[µ(R) ≥ L] = L−∞ , •
L → ∞.
(17)
det(RH k Rk )
Preprocessing: Let = gk (H) and µ(R) = gµ (H). There exist constants αk , α ∈ R, αk > 0, α > 0, such that the functions gk (H) and gµ (H) satisfy, respectively, the scaling properties and
gk (bH) = bαk gk (H)
(18)
gµ (bH) = bα gµ (H)
(19)
for all H ∈ C and all b ∈ R, b > 0. Proof: See Appendix A. Basic Proof Approach: We now briefly state the basic proof approach for Theorem 1. The proof is based on separately . establishing the exponential lower bound P[Sk ≥ L] ≥ P 1/det(RH k Rk ) ≥. L , L → ∞, and the exponential upper bound P[Sk ≥ L] ≤ P 1/det(RH Rk ) ≥ L , L → ∞, which . k then combine to P[Sk ≥ L] = P 1/det(RH k Rk ) ≥ L . The starting points for establishing these bounds are the following upper and lower bounds on the layer complexity Sk (see [23, Ch. 3.2, Eq. (3.3)]): Vk (ρ) − µ(Rk )Ak (ρ) Vk (ρ + µ(Rk )) ≤ Sk ≤ (20) H det(Rk Rk ) det(RH k Rk ) N ×M
where Vk (ρ) and Ak (ρ) denote the volume and the surface area of a hypersphere with radius ρ in k complex-valued dimensions, i.e., π k ρ2k Vk (ρ) = (21) k! and 2π k ρ2k−1 Ak (ρ) = . (22) (k − 1)! We note that our proof uses all conditions of Theorem 1, i.e., (16) – (19), to establish the exponential lower bound, while the exponential upper bound only needs condition (17). 1 For
example, consider the case of the triangularized form of the CLP problem being obtained by applying the QRD directly to H, i.e., H = QR, as described in Section I-B. The resulting preprocessing based on direct QRD is captured in the mappings from r to y and H to R as y = QH r and R = QH H, respectively.
Discussion of the Theorem: Theorem 1 states that the tail exponent of P[Sk ≥ L] is fully characterized by the tail exponent of P[1/det(RH k Rk ) ≥ L] provided that conditions (16) – (19) are satisfied. It is immediate that the tail exponent of P[Sk ≥ L] then depends only on the statistics of the lattice basis matrix H and neither on the statistics of r nor on the particular choice of ρ as long as 0 < ρ < ∞. The conditions (16) – (19) constitute fairly general requirements on the statistics of the lattice basis matrix H and on the preprocessing method. For example, in Appendix B, it is shown that the conditions (16) – (19) are satisfied for direct QRD if the entries of H are jointly Gaussian with arbitrary non-singular covariance matrix and arbitrary finite mean, i.e., for H being a Rayleigh- or Ricean-fading MIMO channel with non-singular covariance matrix. In Section III-C, it is furthermore demonstrated that Theorem 1 also comprises preprocessing techniques based on LR such as V-BLAST LS or LR according to the LLL algorithm. It is interesting to note that det(RH k Rk ) is the volume of the fundamental regions of L(Rk ) [16]. A well-known approximation for Sk is given by [23] Sbk =
Vk (ρ) det(RH k Rk )
(23)
where Vk (ρ) was defined in (21). This approximation simply counts the number of fundamental regions (each occupied by exactly one lattice point) that fit into the k-dimensional search sphere and becomes exact if averaging of Sk is performed over yk assumed uniformly distributed over L(Rk ) [23]. Motivated by this insight, Sbk was used in [1] and [20] to assess the complexity of different SD variants. It follows immediately that (15) can equivalently be written as h i . P[Sk ≥ L] = P Sbk ≥ L , L → ∞ (24) which holds for arbitrary statistics of r (or, equivalently, yk ) and no averaging over yk is required. Finally, we emphasize that Theorem 1 does not depend on the specific shape of the search region, a k-dimensional hypersphere in the SD case. Indeed, the theorem continues to hold if the search sphere is replaced by a general bounded search region with a non-empty interior (see Appendix C). This, for example, includes a SD variant based on the l∞ norm [14] where the induced search regions are hypercubes. III. TAIL E XPONENTS FOR I . I . D . G AUSSIAN ZERO - MEAN H In the remainder of the paper, we assume lattice basis matrices H whose entries are i.i.d. Gaussian with zero-mean and variance σH2 (this model is often used in the context of MIMO detection). Specializing Theorem 1 to this case is shown below to lead to particularly simple and interesting results. A. Tail Exponents for Direct QRD As shown in Appendix B, all the conditions of Theorem 1 are met for direct QRD and for the entries of H being jointly Gaussian distributed with arbitrary non-singular covariance matrix and arbitrary finite mean. This evidently includes the
5
i.i.d. Gaussian zero-mean case considered here and, hence, the relation 1 . ≥ L , L→∞ (25) P[Sk ≥ L] = P det(RH k Rk ) holds for k = 1, . . . , M . Using results from [24] on the nearzero behavior of the eigenvalues of complex Wishart matrices, it is shown in Appendix D that .
N −M +1 P[det(RH , k Rk ) ≤ ] =
→0
(26)
for k = 1, . . . , M . For SD with direct QRD and i.i.d. Gaussian zero-mean lattice basis matrices, the result in (26) together with (25) now establishes that .
P[Sk ≥ L] = L−(N −M +1) ,
L → ∞, k = 1, . . . , M. (27)
Evidently, the corresponding total complexity then satisfies (see (13)) .
P[S ≥ L] = L−(N −M +1) ,
L → ∞.
(28)
We can now draw the following conclusions: • The distributions of the individual layer and total complexities are of Pareto-type with tail exponents ξk = ξ = N − M + 1, k = 1, . . . , M . • Increasing N (i.e., the number of receive antennas in a MIMO detection context) for fixed M (i.e., the number of transmit antennas) results in improved (i.e., larger) complexity tail exponents. Relation to Diversity Order: The complexity tail exponent N − M + 1 is reminiscent of the SNR exponent (or diversity order) of conventional suboptimum MIMO detection schemes such as linear and successive interference cancellation (SIC) schemes. Specifically, the error probability Pe (SNR) as a function of the signal-to-noise ratio (SNR) of such schemes satis. fies Pe (SNR) = SNR−(N −M +1) , SNR → ∞ (in contrast to . optimum detection where one would get Pe (SNR) = SNR−N , SNR → ∞). In fact, this equivalence is not a coincidence as the same statistical properties of H are responsible for the diversity order of linear or SIC detectors and for the complexity tail exponent of SD. In particular, we note that the typical error event analysis in [25, Chapter 3] reveals that (see also [26]) 1 . 2 Pe (SNR) = P RM,M < , SNR → ∞ (29) SNR for the error probability of the first detected data symbol in SIC detection (which determines the overall diversity order). On the other hand, for the first layer complexity distribution of SD, it follows from Theorem 1 that " # 1 . P[S1 ≥ L] = P 2 ≥L , L→∞ RM,M which has the same structure as the right hand side (RHS) of (29). Consequently, we obtain the same exponents for the diversity order and the first layer complexity distribution. For the other layer complexity distributions no such direct link can be established.
B. Refined Analysis for Direct QRD As already mentioned, a shortcoming of the tail exponent analysis is that the impact of multiplicative constants is not captured. As a result, for example, the tail exponent analysis does not capture the impact of the sphere radius ρ on the complexity distribution of SD. In the following, we provide a more refined analysis of the first layer complexity distribution P[S1 ≥ L] for direct QRD on i.i.d. Gaussian zero-mean lattice basis matrices H. This characterization reveals, among other factors, the influence of the sphere radius ρ on P[S1 ≥ L] and, furthermore, allows us to derive a lower bound on the total complexity distribution P[S ≥ L], which we use to draw interesting conclusions on the average (w.r.t. to H and r) total complexity of SD complementing results derived in [11], [12]. From√(20) for k = 1 together with µ(R1 ) = µ(RM,M ) = RM,M / 2 ≤ RM,M , we obtain πρ2 − 2πRM,M ρ π(ρ + RM,M )2 ≤ S1 ≤ 2 2 RM,M RM,M which can equivalently be written as 2 2 ρ ρ π − 1 − 1 ≤ S1 ≤ π +1 . RM,M RM,M Consequently, we obtain # " # " ρ2 ρ2 0 00 ≥ L ≤ P[S1 ≥ L] ≤ P 2 ≥L P 2 RM,M RM,M q q 2 2 L L 00 and L = . Furtherwhere L0 = + 1 + 1 − 1 π π √
more, since σH2 RM,M ∼ χ2(N −M +1) [27, Lemma 2.1], we arrive at 2 2 ρ ρ γN −M +1 2 0 ≤ P[S1 ≥ L] ≤ γN −M +1 2 00 . (30) σH L σH L These upper and lower bounds together with L0 ∼ L00 ∼ L/π, L → ∞, and V1 (ρ) = πρ2 yield V1 (ρ) a , L → ∞. (31) P[S1 ≥ L] ∼ γN −M +1 σH2 L a
a
Next, noting that γa (y) =
1 a y (1 + o(1)), a!
y→0
we obtain 1 P[S1 ≥ L] ∼ (N − M + 1)! a
−(N −M +1) σH2 L V1 (ρ) .
(32)
for L → ∞. Compared to P[S1 ≥ L] = L−(N −M +1) , L → ∞, we have thus obtained a finer picture of the first layer complexity distribution since multiplicative constants are also captured. More specifically, (32) quantifies the impact of the sphere radius ρ and shows, as expected, that P[S1 ≥ L] increases with increasing ρ for large L. Next, we note that (32) establishes a stronger version h of (24) i for the h first layer.i In particular, (31) together with P Sb1 ≥ L = P RV21 (ρ) ≥ L = M,M γN −M +1 Vσ12(ρ) implies that HL h i a P[S1 ≥ L] ∼ P Sb1 ≥ L , L → ∞.
6
We can now trivially lower-bound the distribution P[S ≥ L] of the total complexity according to P[S ≥ L] ≥ P[S 1 ≥ L]. 2 Together with the lower bound P[S1 ≥ L] ≥ γN −M +1 σ2ρ L0 H (see (30)) and using L0 ≥ 4 + L, γa (x) ≥ e−x xa /a! (this is a direct consequence of the series expansion of the incom2 2 plete Gamma function [28, Sec. 6.5]), and e−ρ /(σH (L+4)) ≥ 2 2 e−ρ /(4 σH ) , we obtain P[S ≥ L] ≥ C (L + 4)−(N −M +1)
(33)
bounded for N > M while it is unbounded for N = M . Along these lines, we note that the Pareto lower and upper bounds on P[S ≥ L] according to (33) and (36), respectively, can be used to show directly that all moments E{S a } with a < N − M + 1 are finite while all corresponding higher order moments with a ≥ N − M + 1 are infinite. For example, for N = M +1, E{S} is finite (as shown above) while the second moment E{S 2 } (and, hence, the variance of S) is infinite. C. Tail Exponents for LR-Based Preprocessing
where C=
1 (N − M + 1)!
ρ2 σH2
N −M +1 e
−
ρ2 4 σ2 H
.
The total complexity distribution can therefore be lowerbounded by a Pareto distribution with exponent N − M + 1. We note that this result is similar in spirit to results obtained in the context of sequential decoding of convolutional codes over additive white Gaussian noise channels, where the randomness results from random additive noise. Specifically, in [9], [10] it was shown that the complexity distribution of sequential decoding is lower-bounded by a Pareto distribution; the corresponding exponent was found to depend on the code rate and on the noise variance. The Pareto lower bound (33) can be used to draw a number of interesting conclusions on the expected total complexity E{S}, where the expectation is taken over H (assumed i.i.d. Gaussian and zero-mean) and r (arbitrary statistics). Starting from Z ∞ E{S} = P[S ≥ L] dL (34) 0
the lower bound in (33) yields Z ∞ E{S} ≥ C (L + 4)−(N −M +1) dL.
(35)
0
The integral on the RHS of (35) does not converge for N = M as the integrand behaves as L−1 in this case. Consequently, E{S} is infinite for N = M . This complements results on the average complexity of SD derived in [11], [12]. Specifically, [12, Corollary 1, Eq. (5)] provides an expression for E{S} for the special case (with respect to our assumptions) of r given by (4) in terms of an infinite sum over incomplete Gamma functions. Based on this expression it seems hard to draw general conclusions on E{S}. The lower bound in (35), however, immediately shows that this sum does not converge for N = M . Moreover, for N > M , E{S} is finite and, consequently, [12, Corollary 1, Eq. (5)] converges in this case. To see this, we start from (34) together with the upper bound ( 1, for L ≤ L0 P[S ≥ L] ≤ (36) L−(N −M +1−δ0 ) , for L > L0 for sufficiently large L0 and δ0 < 1 (cf. the upper bound in (14)). Inserting (36) into (34), we get Z L0 Z ∞ E{S} ≤ dL + L−(N −M +1−δ0 ) dL 0
L0
which implies E{S} < ∞ for every N > M . We can therefore conclude that the average total complexity E{S} of SD is
Next, we analyze the tail behavior of the complexity distribution induced by LR-based preprocessing. We define LRbased preprocessing (see, e.g., [1]) as applying, prior to the QRD, the transformation G = HT, where T is an M × M unimodular matrix, i.e., Ti,j ∈ CZ, ∀i, j, and |det(T)| = 1. The matrix T can, in general, depend on H and is obtained, for example, through the LLL algorithm [29], which finds a basis matrix G of the lattice L(H) that is closer to an orthogonal matrix than H. Another important preprocessing technique is LS (e.g., according to the V-BLAST algorithm [15]), which is just a special case of LR obtained by restricting T to be a permutation matrix (i.e., exactly one entry in each row and in each column of T is equal to one and all other entries are equal to zero). The triangularized form of the CLP problem, when LRbased preprocessing is applied, is given by (5) with R and e and y e H r, respectively, where Q e and e =Q y replaced by R e e e R are the QR-factors of G, i.e., G = QR. Consequently, e If we denote the R in Theorem 1 is now replaced by R. e corresponding solution of (5) as d, the final solution of (3) is b = Td. e d 1) Verifying the Conditions of Theorem 1 for LR-Based Preprocessing: Let us first verify the conditions of Theorem 1 for LR-based preprocessing of an i.i.d. Gaussian zero-mean lattice basis matrix H. Condition (16) just depends on the statistics of H and was already verified in Appendix B-A for the more general class of lattice basis matrices that follow a correlated Ricean-fading distribution. To verify the remaining conditions, we start by noting that e R. e QRT = Q
(37)
Furthermore, let us write RT = Q0 R0 , where Q0 and R0 are e R. e the QR-factors of RT. With (37), we obtain QQ0 R0 = Q 0 Since QQ is unitary and the QR-factors are unique, it follows e and, hence, R0 = R, e which results in that QQ0 = Q e −1 . R = Q0 RT
(38)
Using (38) along with the facts that T−1 is unimodular (since T is unimodular) and Q0 is unitary, it can be shown that and
e µ(R) = µ(R)
(39)
e H R). e det(RH R) = det(R
(40)
Hence, LR preserves the covering radius as well as the volume of the fundamental regions associated with R. Since (17) and (19) hold for direct QRD (see Appendix B-B and Appendix B-C, respectively), it follows from (39)
7
that conditions (17) and (19) are also satisfied for LR-based preprocessing. As condition (18) is satisfied for direct QRD (see Appendix B-C), it follows from (40) that condition (18) is satisfied for LR-based preprocessing for layer k = M . Hence, all the conditions of Theorem 1 are satisfied for LR-based preprocessing for k = M . As shown in Section III-C2 below, this will be sufficient to draw very general conclusions about the tail behavior of the total complexity distribution incurred by LR-based preprocessing. For the layers k < M , condition (18) cannot be guaranteed if we allow T to be an arbitrary function of H. It turns out, however, that all LR algorithms delivering a unimodular transformation matrix T that is invariant to a positive scaling of H, i.e., H and bH result in the same T for all b ∈ R, b > 0, satisfy condition (18) also for k = 1, . . . , M − 1. In e denotes the lattice-reduced R-factor of H, the this case, if R e as a consequence of the lattice-reduced R-factor of bH is bR, e HR e k) = uniqueness of the QRD. Hence, gk (bH) = det(b2 R k He 2k 2k e b det(Rk Rk ) = b gk (H) and condition (18) is satisfied with αk = 2k for k = 1, . . . , M (as is the case for direct QRD, see Appendix B-C). An LR algorithm with this property satisfies Theorem 1 for all k = 1, . . . , M and is referred to as a standard LR algorithm. All previously proposed LR algorithms in the literature that we are aware of are standard. Specifically, it can be shown that the LLL algorithm [29], the Seysen algorithm [30], LS according to the V-BLAST algorithm [15], and the sorted-QRD in [31] are all standard LR algorithms. 2) Tail Exponents for General LR-Based Preprocessing: As stated above, for general (i.e., not necessarily standard) LR-based preprocessing and k = M , the equivalence in (40) implies 1 . ≥ L , L → ∞. P[SM ≥ L] = P det(RH M RM ) Combined with (27) this allows us to conclude that .
P[SM ≥ L] = L−(N −M +1) ,
L→∞
(41)
or, equivalently, ξM = N − M + 1 for general LR-based preprocessing. Consequently, (13) implies that the tail exponent of the total complexity distribution for general LR-based preprocessing satisfies ξ ≤ N − M + 1. We can therefore conclude that general LR-based preprocessing does not improve the tail exponent of the total complexity distribution as compared to that obtained for direct QRD (cf. (28)). It is, however, important to note that LR-based preprocessing techniques typically reduce the computational complexity of SD, an aspect not reflected by our tail exponent result. In Section III-C4, we will see that the tail exponents associated with the layers k < M can be improved by LRbased preprocessing using the LLL algorithm. 3) Tail Exponents with Layer-Sorting: Let us next specialize LR-based preprocessing to the case of standard LS2 , 2 LS according to the V-BLAST algorithm [15] and the sorted-QRD algorithm [31] fall into the category of standard LS algorithms.
where T is a (potentially H-dependent) permutation matrix that is invariant to positive scaling of H. In this case, we can characterize the tail exponents of the corresponding total complexity and layer-wise complexity distributions precisely. As shown in Appendix E, for any LS strategy, we obtain h i . N −M +1 e HR ek ≤ = P det R , → 0, k = 1, . . . , M k which is identical to the near-zero behavior obtained for direct QRD (see (26)). For standard LS strategies, Theorem 1 holds (see Section III-C1) and implies .
P [Sk ≥ L] = L−(N −M +1) , L → ∞, k = 1, . . . , M. (42) Hence, ξk = ξ = N − M + 1, and we can conclude that standard LS does not improve the layer complexity tail exponents as compared to direct QRD (cf. (27)). 4) Tail Exponents with LLL: For LR based on the LLL algorithm, Theorem 1 implies that 1 . ≥L , L→∞ (43) P[Sk ≥ L] = P e HR e k) det(R k for k = 1, . . . , M . Based on (43), it is shown in Appendix F that .
N
P[Sk ≥ L] ≤ L− k ,
L → ∞,
k = 1, . . . , M
(44)
or, equivalently, ξk ≥ N/k. Comparing with ξk = N − M + 1 for direct QRD (cf. (27)) or standard LS (cf. (42)), we can conclude that LLL preprocessing improves the tail exponents at least up to layer k ≤ dN/(N − M + 1)e − 1. Hence, as compared to direct QRD or standard LS, LLL improves the tail exponents of the lower layers. However, we again note, as shown in Section III-C2, that the tail exponents of the total complexity distribution cannot be improved by LLL preprocessing. In the following, consider N = M . We have ξk = M/k > 1, k = 1, . . . , M − 1, and ξM = 1 (see (43) or (41)), which, together with (13), establishes that the tail behavior of the total complexity distribution of SD with LLL preprocessing is dominated by the tail behavior of the M th layer complexity distribution; in particular, we have ξ = ξM = 1, as is the case with direct QRD and standard LS. D. Tail Exponents with Regularization Next, we analyze the tail behavior of the regularized CLP problem d = arg min kz − Fdk2 (45) d∈(CZ)M
with the (N + M ) × M regularized lattice basis matrix H F= (46) κI where κ ∈ R, κ > 0, is the regularization parameter (see, e.g., [8]) and z is the length-(N + M ) vector z = [rT 0T ]T . A regularization with a specific κ is, for example, obtained by minimum-mean square error (MMSE) based preprocessing (see, e.g., [6], [8], [22] or MMSE generalized decisionfeedback equalization in [32]). We emphasize that the solution of the regularized CLP problem (45) will, in general, not equal the solution of the original CLP problem. It turns out,
8
for k = 1, . . . , M . Following the steps leading from (60) to (61), we get P µ R ≥ L ≤ P[kFkF ≥ L] . From (46), we have kFk2F = kHk2F + κ2 M , which implies P µ R ≥ L ≤ P kHk2F + κ2 M ≥ L2 . = P kHkF ≥ L , L → ∞. (48) As shown in .Appendix B-B (cf. (61) – (65)), we have P kHkF ≥ L ≤ L−∞ , L → ∞, for H correlated Gaussian distributed with . arbitrary finite mean. Hence, (48) implies P µ R ≥ L ≤ L−∞ and condition (17) of Theorem 1 is satisfied for regularization-based preprocessing and i.i.d. Gaussian zero-mean H, which proves (47). Let us next analyze the RHS of (47). The interlacing theorem for bordered matrices [33, Theorem 4.3.8] implies that (cf. (76)) H H λi Rk Rk ≥ λi R R , i = 1, . . . , k. (49) H Furthermore, using (46), we can write λi R R = H H 2 λi F F = λi H H + κ , which, together with (49), H shows that λi Rk Rk ≥ κ2 and, consequently, H det Rk Rk ≥ κ2k
(50)
H i.e., det Rk Rk is strictly positive for any regularization parameter κ > 0. Inserting (50) into (47) therefore yields .
P[Sk ≥ L] = L−∞ ,
L → ∞,
100
P[S ≥ L]
however, that for suitably chosen κ, regularization induces only a small performance loss while the resulting reduction in SD complexity can be significant [6], [8]. The triangularized form of the regularized CLP problem (45) is given by (5) with R and y replaced by R and y = H Q z, respectively, where Q and R are the QR-factors of F, i.e., F = Q R. We again consider H to be zero-mean i.i.d. Gaussian and verify condition (17) for R. Condition (17), as shown in Appendix A-B, is sufficient to state the exponential upper bound # " . 1 (47) P[Sk ≥ L] ≤ P ≥L , L→∞ H det Rk Rk
5×4 −2
10
10−3 100
direct QRD V-BLAST LS LLL
101
L
102
103
Fig. 1. Distribution of total complexity P[S ≥ L] of SD with direct QRD, V-BLAST LS, and LLL preprocessing for 4 × 4 and 5 × 4 MIMO systems.
IV. N UMERICAL R ESULTS We consider SD for data detection in N ×M MIMO wireless systems with spatial multiplexing, where r = Hd0 + w (cf. (4)) with the entries of H and w assumed i.i.d. CN (0, 1/M ) and i.i.d. CN (0, σ 2 ), respectively, and with the transmitted vector d0 ∈ (CZ)M . We note that the complexity of SD is random in H and w and does not depend on the particular d0 , which is due to the fact that any sums/differences of valid lattice points are again valid lattice points. Following the approach proposed in [4], [11], [21], the sphere radius ρ in (6) is chosen such that the transmitted data vector d0 is found by the SD algorithm with probability 0.99. This is accomplished by setting 2 ρ P kwk2 ≤ ρ = γN 2 = 0.99 σ q −1 (0.99). For 1/σ 2 we and solving for ρ, i.e., ρ = σ γN assume a value of 15 dB, which results in ρ ≈ 0.5636 for N = 4 and ρ ≈ 0.6058 for N = 5. Fig. 1 shows the total complexity distribution P[S ≥ L] in double log-scale for SD with direct QRD, V-BLAST LS [15], and with LLL preprocessing [34, with parameter δ = 3/4] for 4 × 4 and 5 × 4 MIMO systems. We can draw the following conclusions: •
k = 1, . . . , M.
We can now conclude that the distributions of the individual layer and total complexities of SD with regularization decrease faster than polynomial in L. This is in stark contrast to direct QRD and LR preprocessing, which (as shown in the previous sections) have total complexity distributions that are of Paretotype with tail exponent N −M +1. We conclude by noting that the performance degradation induced by regularization can be very small while the complexity improvements (as indicated by the faster-than-polynomial tail behavior) can be significant.
4×4
10−1
•
•
For direct QRD in the 4 × 4 case, the total complexity distribution in Fig. 1 exhibits a large-L behavior of L−1 as predicted by (28) for N = M . Adding one receive antenna improves the tail behavior significantly and leads to a large-L behavior of L−2 as predicted by (28). LLL preprocessing and V-BLAST LS reduce the complexity, as compared to direct QRD, but do not change the tail exponent of the total complexity distribution (see Section III-C).
For the same setup as in Fig. 1 for the 4 × 4 case, Fig. 2 shows the layer-wise complexity distributions P[Sk ≥ L], k = 1, 2, 3, 4, and the total complexity distribution P[S ≥ L]
9
P[S ≥ L]
(a)
100
10−1
direct QRD 4×4 S S2
10−2
S3 S1
10−3
P[S ≥ L]
(b)
S4 101
102
L
100
103
LLL 4×4
10−1
S
10−2
10−3
S1 101
S3
S2
L
102
S4 103
Fig. 2. Distributions of individual layer complexities P[Sk ≥ L], k = 1, 2, 3, 4, and of total complexity P[S ≥ L] of SD for a 4 × 4 MIMO system with (a) direct QRD and (b) LR-based preprocessing using the LLL algorithm.
with (a) direct QRD and (b) LLL preprocessing, respectively. We can draw the following conclusions: • For direct QRD, all layer-wise complexity distributions (see Fig. 2(a)) exhibit a large-L behavior of L−1 as predicted by (27) for N = M . • With LLL preprocessing, the tail exponents of the layerwise complexity distributions at the lower layers are improved as compared to that of direct QRD (compare Fig. 2(b) with Fig. 2(a)). However, the last layer (in this case layer 4) exhibits a large-L behavior of L−1 (see Fig. 2(b)) and, hence, dominates the tail behavior of the total complexity distribution (see (12) and (13)). V. C ONCLUSIONS We analyzed the tail behavior of the (computational) complexity distribution of the sphere decoding (SD) algorithm in random infinite lattices. Our results complement and extend previous work that characterized the mean and the variance of SD complexity. In particular, we characterized the tail behavior of the complexity distribution in terms of corresponding tail exponents (i.e., polynomial decay rates). We found that the tail exponent of the SD complexity distribution is given by
the tail exponent of the distribution of the inverse volume of the fundamental regions of the underlying lattice. This result was shown to hold under fairly general assumptions on SD preprocessing and on the statistics of the lattice basis matrix including, e.g., preprocessing based on lattice-reduction (LR) and the case of the lattice basis matrix being correlated Ricean distributed. For N ×M i.i.d. circularly symmetric complex Gaussian lattice basis matrices, we found that the complexity distribution of SD is of Pareto-type with tail exponent given by N −M +1. This shows that increasing N (e.g., the number of receive antennas in the context of MIMO wireless communications) for given M (e.g., the number of transmit antennas) results in larger tail exponents. By means of a more refined analysis of the complexity distribution of SD, we also showed that the average complexity of SD is infinite for N = M and finite for N > M , which complements average complexity results derived in the literature. We finally found that the tail exponent of N − M + 1 cannot be increased by preprocessing based on lattice-reduction including layer-sorting and the LLL algorithm as special cases while regularization results in a SD complexity distribution with tails that decrease faster than polynomial. We note, however, that lattice-reduction based preprocessing typically reduces the complexity of SD (although this is not reflected in the tail exponents of the complexity distributions) and that regularization-based preprocessing can be applied prior to lattice-reduction for further complexity reduction. Throughout the paper, we considered the Fincke-Pohst variant of the SD algorithm with a fixed and lattice-independent choice of the sphere radius. The tools developed in this paper could turn out useful in analyzing the complexity distribution of more advanced SD approaches. Specifically, it would be interesting to understand the impact of a lattice-dependent choice of the sphere radius (as used in Schnorr-Euchner variants of the SD algorithm) on the tail behavior of the complexity distribution. Furthermore, most of our results do not capture multiplicative and additive constants that appear in the complexity distribution. A more refined analysis would clearly be desirable. A PPENDIX A P ROOF OF T HEOREM 1 A. Exponential Lower Bound For establishing the exponential lower bound, we use the lower bound on Sk given in (20), which, together with µ(Rk ) ≤ µ(R), k = 1, . . . , M , yields3 Vk (ρ) − µ(R)Ak (ρ) ≥ L . P[Sk ≥ L] ≥ P det(RH k Rk ) Next, consider a constant c ∈ R, c > 0, such that Vk (ρ) − cAk (ρ) > 0 and define c0 = Vk (ρ) − cAk (ρ) > 0. We then have P[Sk ≥ L] ≥ P [H ∈ B] (51) 3 Note that condition (17) implies full-rank R k with probability one. Consequently, det(RH k Rk ) > 0 and µ(Rk ) < ∞ with probability one. However, it is straightforward to show that Theorem 1 also holds in the case where Rk is rank-deficient with non-zero where we would have ˜ . probability, . ˆ 0 P[Sk ≥ L] = P 1/det(RH k Rk ) ≥ L = L , L → ∞.
10
where B=
H:
0
c gk (H)
≥ L ∩ (gµ (H) ≤ c)
with det(RH k Rk ) = gk (H) and µ(R) = gµ (H). With property (16), we further obtain Z Z P[H ∈ B] = f (H)dH ≥ β f (Lδ H)dH H∈B
H∈B
for all δ > 0, L > 1, and some β > 0. Performing the change of variables H0 = Lδ H and invoking conditions (18) and (19) yields P[H ∈ B] ≥ β L−2M N δ P[H ∈ B 0 ] (52) where B0 =
H:
c0 ≥ L1−δαk gk (H)
∩ (gµ (H) ≤ cLδα ) .
Next, noting that for two events A1 and A2 , by the inclusionexclusion principle, P[A1 ∩ A2 ] ≥ P[A1 ] − P[A¯2 ], where A¯2 denotes the complementary event of A2 , we get c0 ≥ L1−δαk − P gµ (H) > cLδα . P[H ∈ B 0 ] ≥ P gk (H) Now (17) with µ(R) = gµ (H) and δ, α > 0 implies . P gµ (H) > cLδα = L−∞ , L → ∞, which, together with (51) and (52), yields . c0 1−δαk −2M N δ ≥L P[Sk ≥ L] ≥ L P , L → ∞. gk (H) .
Let us write P[1/gk (H) ≥ L] = L−a , L. → ∞, for some constant a ≥ 0. We then have P[Sk ≥ L] ≥ L−2M N δ−(1−δαk )a , L → ∞. As this result holds for arbitrarily small values of δ, . . we can conclude that P[Sk ≥ L] ≥ L−a = P[1/gk (H) ≥ L], L → ∞, which establishes the exponential lower bound. B. Exponential Upper Bound To establish the exponential upper bound, we start from the upper bound on Sk in (20), which yields Vk (ρ + µ(R)) ≥L (53) P[Sk ≥ L] ≤ P det(RH k Rk ) where, again, we used µ(Rk ) ≤ µ(R), k = 1, . . . ,M . Note that P[xy ≥ L] = P (xy ≥L) ∩ (y < Lδ ) + P (xy ≥ L) ∩ (y ≥ Lδ ) ≤ P x ≥ L1−δ + P y ≥ Lδ for any two RVs x, y ∈ R and any constant δ ∈ R, 0 < δ < 1. Applying this to (53) with x = 1/det(RH k Rk ) and y = Vk (ρ + µ(R)), we get 1 1−δ P[Sk ≥ L] ≤ P ≥ L (54) det(RH k Rk ) + P Vk (ρ + µ(R)) ≥ Lδ . With (21) and the binomial theorem, we can write 2k πk X 2k 2k−i Vk ρ + µ(R) = ρ µ(R)i . k! i=0 i
(55)
Next, applying the general property "M # M X X L P xi ≥ L ≤ P xi ≥ M i=1 i=1
(56)
which holds for any set of RVs xi , i = 1, . . . , M , together with (55) yields 2k . X P Vk (ρ + µ(R)) ≥ Lδ ≤ P µ(R)i ≥ Lδ ,
L → ∞.
i=0
Property (17) (for the terms corresponding to i > 0) and . P[c00 ≥ L] = L−∞ , L → ∞, for any constant c00 ≥ 0 (for to i = 0) now directly imply the term corresponding . P Vk (ρ + µ(R)) ≥ Lδ = L−∞ , L → ∞, and, hence, by (54) . 1 1−δ ≥ L , L → ∞. P[Sk ≥ L] ≤ P det(RH k Rk ) . −a As before, writing P 1/det(RH L , L → ∞, k Rk ) ≥ L = . for some constant a ≥ 0, we get P[Sk ≥ L] ≤ L−(1−δ)a , L → ∞. As this result holds for arbitrarily small . values of δ, we can . conclude that P[Sk ≥ L] ≤ L−a = P 1/det(RH k Rk ) ≥ L , L → ∞, which establishes the exponential upper bound. A PPENDIX B T HE C ONDITIONS OF T HEOREM 1 H OLD FOR C ORRELATED R ICEAN FADING AND DIRECT QRD In the following, we show that conditions (16) – (19) are satisfied for direct QRD of a jointly complex Gaussian distributed lattice basis matrix H with arbitrary non-singular covariance matrix and arbitrary finite mean. With h = vec(H) ∈ CN M , the pdf f (h) of h is given by [35] f (h) = c1 e−(h−µ)
H
C−1 (h−µ)
(57)
where µ = E{h}, C = E{(h − µ)(h − µ) }, and c1 = 1/(π N M det(C)). H
A. Condition (16) We start by noting that condition (16) of Theorem 1 can equivalently be written as f (h) ≥ βf (ah)
(58)
for all h ∈ C and all a ∈ R, a > 1, with some constant β ∈ R, β > 0. Inserting (57) into (58) reveals that it suffices to show that NM
(h − µ)H C−1 (h − µ) ≤ (ah − µ)H C−1 (ah − µ) + β 0 (59) for β 0 = −log(β). Reformulating (59) gives H a+1 0 β ≥ 0. (a+1)h−µ C−1 (a+1)h−µ −µHC−1µ+ a−1 Since the first term on the left hand side is always positive, it remains to show that there exists a constant β 0 (equivalently, 0 a constant β = e−β ) such that a+1 0 β ≥ µH C−1 µ a−1 for all a ∈ R, a > 1. Indeed, since (a + 1)/(a − 1) ≥ 1, H −1 any β 0 ≥ µH C−1 µ, or, equivalently, any β ≤ e−µ C µ establishes condition (58), which concludes the proof.
11
B. Condition (17)
C. Conditions (18) and (19)
We first note that the squared covering radius µ2 (R) can be upper-bounded according to (see Appendix G and [20, Prop. 1] for real-valued lattices)
We start by noting that the R-factor of bH with b ∈ R, b > 0, is given by bR as a direct consequence of the uniqueness of the QRD. We therefore get gk (bH) = det(b2 RH k Rk ) = 2k b2k det(RH k Rk ) = b gk (H), which shows that condition (18) is satisfied with αk = 2k. Condition (19) can be verified as follows. By the definition of the covering radius (1), we can write
M
µ2 (R) ≤
1X 2 R . 2 i=1 i,i
(60)
PM 2 Further upper-bounding the RHS of (60) by i=1 Ri,i and 2 2 2 noting that Ri,i ≤ khi k for direct QRD, we obtain µ (R) ≤ kHk2F and, consequently, P [µ(R) ≥ L] ≤ P kHkF ≥ L .
(61)
Next, property (56) applied to the RHS of (61) results in N X M X L P [µ(R) ≥ L] ≤ . P |Hi,j | ≥ √ NM i=1 j=1
(62)
Noting that in the case of Ricean fading, as assumed here, d e i,j , i = 1, . . . , N , j = 1, . . . , M , where µi,j = Hi,j = µi,j + H e i,j ∼ CN (0, σ 2 ), the terms on the RHS of E{Hi,j } and H i,j (62) can be upper-bounded as L e i,j | ≥ √ L = P |µi,j + H P |Hi,j | ≥ √ NM NM e i,j | ≥ √ L . ≤ P |µi,j | + |H NM Again employing property (56), we get L L P |Hi,j | ≥ √ ≤ P |µi,j | ≥ √ (63) NM 2 N M e i,j | ≥ √L . + P |H 2 NM e i,j | is χ2 -distributed, we obtain Noting that |H L2 − L 2 e P |Hi,j | ≥ √ = e 4N M σi,j . 2 NM
(64)
Next, we define µmax = maxi,j |µi,j | (which, according to our 2 2 finite-mean assumption, is finite) and σmax = maxi,j σi,j . For √ L > 2 N M µmax , the first term on the RHS of (63) is zero, which together with (64) implies L2 L − 2 P |Hi,j | ≥ √ ≤ e 4N M σmax NM for sufficiently large L. Hence, using (62), we obtain 2
P [µ(R) ≥ L] ≤ N M e
L − 4N M σ2
max
(65)
for sufficiently large L. Finally, condition (17) is verified by 2
noting that e
L − 4N M σ2
max
.
= L−∞ , L → ∞.
µ(bR) = max
min
kx − bRdk
x
= b max min − Rd b x∈CM d∈(CZ)M = b µ(R) x∈CM d∈(CZ)M
implying that gµ (bH) = µ(bR) = b µ(R) = b gµ (H). This shows that condition (19) is satisfied with α = 1. A PPENDIX C T HEOREM 1 FOR G ENERAL S EARCH R EGIONS In the following, we show that Theorem 1 continues to hold if the search sphere around yk is replaced by a general bounded search region Rk ⊂ Ck with non-empty interior. Following (11), the complexity Sek of SD with search region Rk is given by Sek = dk ∈ (CZ)k : (yk − Rk dk ) ∈ Rk . The proof is based on separately establishing the exponential upper bound . 1 e P Sk ≥ L ≤ P ≥L , L→∞ (66) det(RH k Rk ) and the exponential lower bound . 1 e P Sk ≥ L ≥ P ≥L , det(RH k Rk )
L→∞
(67)
through, respectively, circumscribing and inscribing Rk by properly chosen hyperspheres. We start by proving (66). Since Rk is bounded, it can be circumscribed by a hypersphere with finite radius ρ(1) implying that kyk −Rk dk k ≤ ρ(1) for all (yk −Rk dk ) ∈ Rk . Consequently, if we denote by Sk (ρ(1) , yk ) the complexity of conventional SD according to (11) with radius ρ = ρ(1) and center yk , we have Sek ≤ Sk (ρ(1) , yk ) and P Sek ≥ L ≤ P Sk (ρ(1) , yk ) ≥ L . (68) Next, we note that Theorem 1 does not depend on the particular choice of the sphere radius ρ(1) . Hence, (15) (if the conditions of Theorem 1 are met) can be applied directly to the RHS of (68), which establishes the exponential upper bound (66). It remains to establish the corresponding exponential lower bound (67). Since Rk has non-empty interior, it can be inscribed by a hypersphere with strictly positive radius ρ(2) and any center m ∈ Ck such that yk − Rk dk ∈ Rk whenever kyk − m − Rk dk k ≤ ρ(2) . Consequently, Sek ≥ Sk (ρ(2) , yk − m), which results in P Sek ≥ L ≥ P Sk (ρ(2) , yk − m) ≥ L . (69)
12
Theorem 1 depends neither on the radius ρ nor on the center y_k − m of the search sphere and, hence, (15) (if the conditions of Theorem 1 on the statistics of the lattice basis matrix and on preprocessing are satisfied) can be applied directly to the RHS of (69), which establishes the exponential lower bound (67). The proof is concluded by combining (66) and (67).

APPENDIX D
NEAR-ZERO BEHAVIOR OF det(R_k^H R_k)

In the following, we show (26) for direct QRD and lattice basis matrices H whose entries are i.i.d. CN(0, σ_H²). We first note that the nonzero entries of R are statistically independent with √2 σ_H^{-1} R_{i,i} ∼ χ_{2(N−i+1)} and R_{i,j} ∼ CN(0, σ_H²), for i = 1, ..., M, j > i [27, Lemma 2.1]. Hence, the submatrix R_k has the same statistics as the matrix R'_k that would be obtained by QRD of a (k + ∆) × k, ∆ = N − M, matrix H_k having i.i.d. CN(0, σ_H²) entries. This implies that the eigenvalues of R_k^H R_k and the eigenvalues of H_k^H H_k = R'_k^H R'_k are equal in distribution, i.e., λ_i(R_k^H R_k) and λ_i(H_k^H H_k) have the same statistics, i = 1, ..., k. We thus have

    P[det(R_k^H R_k) ≤ ε] = P[ ∏_{i=1}^{k} λ_i(H_k^H H_k) ≤ ε ]    (70)

so that the analysis of the near-zero behavior of det(R_k^H R_k) can be based on results in [24], which establish the near-zero behavior of the eigenvalues λ_i(H_k^H H_k) for i.i.d. Gaussian zero-mean matrices H_k.

Following [24], we start by performing the variable transformations λ_i(H_k^H H_k) = ε^{α_i}, i = 1, ..., k, and we assume ε < 1 (note that this implies α_1 ≥ ··· ≥ α_k). Setting α = (α_1 ··· α_k)^T, we can rewrite (70) as

    P[det(R_k^H R_k) ≤ ε] = P[α ∈ B]    (71)

where B = {α : α_1 + ··· + α_k ≥ 1}. From [24, p. 1079], it follows that

    P[det(R_k^H R_k) ≤ ε] ≐ P[α ∈ B'],  ε → 0    (72)

where

    B' = {α : α_1 + ··· + α_k ≥ 1, α_1 ≥ ··· ≥ α_k ≥ 0}.

Furthermore, from [24, p. 1080], we have

    P[α ∈ B'] ≐ ε^{ min_{α∈B'} ∑_{i=1}^{k} (∆+2i−1)α_i },  ε → 0.    (73)

The minimum of ∑_{i=1}^{k} (∆+2i−1)α_i over α ∈ B' is achieved by setting α_1 = 1, α_2 = 0, ..., α_k = 0, which results in

    min_{α∈B'} ∑_{i=1}^{k} (∆ + 2i − 1)α_i = ∆ + 1.

Combining this result with (72) and (73), and using ∆ = N − M, we obtain

    P[det(R_k^H R_k) ≤ ε] ≐ ε^{N−M+1},  ε → 0    (74)

which concludes the proof.
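For readers who wish to sanity-check (74) numerically, the following Python/NumPy sketch estimates P[det(R_k^H R_k) ≤ ε] by Monte Carlo simulation; the sizes N, M, k and the number of trials are illustrative choices of ours, not values taken from the paper. On a log-log scale the estimates should decay with slope roughly N − M + 1 as ε → 0.

import numpy as np

# Monte Carlo estimate of P[det(R_k^H R_k) <= eps] for direct QRD of an
# i.i.d. CN(0,1) basis matrix H (sigma_H^2 = 1); sizes below are illustrative.
rng = np.random.default_rng(0)
N, M, k = 3, 3, 2            # predicted tail exponent: N - M + 1 = 1
trials = 100_000

dets = np.empty(trials)
for t in range(trials):
    H = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2)
    R = np.linalg.qr(H)[1]                   # H = QR, R upper triangular
    Rk = R[M - k:, M - k:]                   # k x k bottom-right submatrix
    dets[t] = np.linalg.det(Rk.conj().T @ Rk).real

for eps in (1e-1, 1e-2, 1e-3):
    print(f"eps = {eps:.0e}   P[det <= eps] ~ {np.mean(dets <= eps):.2e}")
# Successive estimates should shrink by roughly one order of magnitude per
# decade of eps, i.e., slope ~ N - M + 1 = 1 (a rough, finite-sample check).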
APPENDIX E
NEAR-ZERO BEHAVIOR OF det(R̃_k^H R̃_k) FOR LS

In the following, we show that

    P[det(R̃_k^H R̃_k) ≤ ε] ≐ ε^{N−M+1},  ε → 0,  k = 1, ..., M    (75)

is satisfied for any LS strategy, i.e., if T is a permutation matrix, and for i.i.d. zero-mean Gaussian H. Recall that R̃_k refers to the k × k bottom-right (upper triangular) submatrix of the R-factor R̃ obtained by QRD of HT, i.e., HT = Q̃R̃, where T is the (in general, H-dependent) permutation matrix obtained by the LS algorithm. The proof of (75) will be accomplished by separately establishing the exponential upper bound P[det(R̃_k^H R̃_k) ≤ ε] ≤̇ ε^{N−M+1}, ε → 0, and the exponential lower bound P[det(R̃_k^H R̃_k) ≤ ε] ≥̇ ε^{N−M+1}, ε → 0, which then combine to (75). Both bounds are obtained by relating the near-zero behavior of det(R̃_k^H R̃_k) to the near-zero behavior of the smallest eigenvalue of H^H H.

A. Exponential Upper Bound

We first note that R̃_k R̃_k^H is a principal submatrix of R̃R̃^H due to the upper triangular structure of R̃. Furthermore, we have that λ_i(R̃_k R̃_k^H) = λ_i(R̃_k^H R̃_k) and λ_i(R̃R̃^H) = λ_i(R̃^H R̃), which, together with the interlacing theorem for bordered matrices [33, Theorem 4.3.8], implies that

    λ_i(R̃_k^H R̃_k) ≥ λ_i(R̃^H R̃),  i = 1, ..., k.    (76)

Due to R̃^H R̃ = T^H H^H H T (recall that HT = Q̃R̃) with T being a permutation matrix and hence unitary, λ_i(R̃^H R̃) = λ_i(H^H H), i = 1, ..., M. Together with (76), we obtain λ_i(R̃_k^H R̃_k) ≥ λ_i(H^H H), i = 1, ..., k, which results in

    det(R̃_k^H R̃_k) ≥ ∏_{i=1}^{k} λ_i(H^H H).

We can therefore conclude that

    P[det(R̃_k^H R̃_k) ≤ ε] ≤ P[ ∏_{i=1}^{k} λ_i(H^H H) ≤ ε ].    (77)

Similar to Appendix D (cf. (71)–(74)), the near-zero behavior of the RHS can be analyzed by means of the results in [24]. Specifically, we obtain

    P[ ∏_{i=1}^{k} λ_i(H^H H) ≤ ε ] ≐ ε^{N−M+1},  ε → 0

which, together with (77), establishes the desired exponential upper bound.

B. Exponential Lower Bound

Evidently, we have

    P[det(R̃_k^H R̃_k) ≤ ε] ≥ P[ λ_1(R̃_k^H R̃_k) ≤ ε ∩ λ_k(R̃_k^H R̃_k) ≤ 1 ].    (78)

We next derive a sufficient condition for the event

    λ_1(R̃_k^H R̃_k) ≤ ε ∩ λ_k(R̃_k^H R̃_k) ≤ 1    (79)

which will imply a lower bound on the RHS of (78). This lower bound will not depend on the particular LS strategy employed, since it will be given in terms of the eigenvalues λ_i(H^H H), i = 1, ..., M, and the corresponding eigenvectors u_i of H^H H. From the decomposition

    (H^H H)^{-1} = ∑_{i=1}^{M} (1/λ_i(H^H H)) u_i u_i^H

we get

    [(H^H H)^{-1}]_{m,m} ≥ |u_{1,m}|² / λ_1(H^H H)

where |u_{1,m}|² = [u_1 u_1^H]_{m,m}. Let us now consider the event

    λ_1(H^H H) ≤ εν  and  |u_{1,m}|² > ν,  m = 1, ..., M    (80)

for some constant ν < 1/M. We obtain

    [(H^H H)^{-1}]_{m,m} ≥ 1/ε,  m = 1, ..., M.    (81)

The diagonal elements of (H^H H)^{-1} can be related to the diagonal elements of (R̃_k^H R̃_k)^{-1} as follows. Let us first write R̃ as

    R̃ = [ R̃_k̄   A
           0     R̃_k ]

where R̃_k̄ is an (M − k) × (M − k) upper triangular matrix and A is an (M − k) × k full matrix. For the inverse of this partitioned matrix we obtain [33, Section 0.7.3]

    R̃^{-1} = [ R̃_k̄^{-1}   −R̃_k̄^{-1} A R̃_k^{-1}
                0          R̃_k^{-1} ]    (82)

which implies that (R̃_k^H R̃_k)^{-1} is a principal submatrix of (R̃^H R̃)^{-1}, obtained by deleting the first M − k rows and columns of (R̃^H R̃)^{-1}. Since (T^H H^H H T)^{-1} = (R̃^H R̃)^{-1}, (R̃_k^H R̃_k)^{-1} is also a principal submatrix of T^{-1}(H^H H)^{-1} T^{-H}. In particular, this results in

    [(R̃_k^H R̃_k)^{-1}]_{i,i} = [T^{-1}(H^H H)^{-1} T^{-H}]_{M−k+i, M−k+i}    (83)

for i = 1, ..., k. Furthermore, since T is a permutation matrix, the matrix T^{-1}(H^H H)^{-1} T^{-H} has the same diagonal elements (just at different positions) as the matrix (H^H H)^{-1}. Therefore, (83) together with (81) implies that

    [(R̃_k^H R̃_k)^{-1}]_{i,i} ≥ 1/ε,  i = 1, ..., k

which results in

    λ_1(R̃_k^H R̃_k) ≤ ε    (84)

upon using

    1/λ_1(R̃_k^H R̃_k) ≥ [(R̃_k^H R̃_k)^{-1}]_{i,i},  i = 1, ..., k.

Consequently, the events (80) imply (84), which corresponds to the first event in (79). It now remains to establish a sufficient event for the second event in (79), i.e., for λ_k(R̃_k^H R̃_k) ≤ 1. From the interlacing theorem for bordered matrices [33, Theorem 4.3.8] applied to H^H H and R̃_k^H R̃_k (see Section E-A), we have λ_M(H^H H) ≥ λ_k(R̃_k^H R̃_k). Consequently, λ_M(H^H H) ≤ 1 implies λ_k(R̃_k^H R̃_k) ≤ 1. With (78) and all the established sufficient events for (79), we obtain

    P[det(R̃_k^H R̃_k) ≤ ε] ≥ P[ λ_1(H^H H) ≤ εν ∩ λ_M(H^H H) ≤ 1 ] P[ |u_{1,m}|² > ν, ∀m ]

where we used the fact that the eigenvectors u_i, i = 1, ..., M, are statistically independent of the eigenvalues λ_i(H^H H), i = 1, ..., M [27, Lemma 2.6]. Since u_1 is uniformly distributed on the unit sphere [27, Lemma 2.6], we have P[ |u_{1,m}|² > ν, ∀m ] > 0 for any ν < 1/M. This finally implies

    P[det(R̃_k^H R̃_k) ≤ ε] ≥̇ P[ λ_1(H^H H) ≤ ε ∩ λ_M(H^H H) ≤ 1 ] ≐ ε^{N−M+1},  ε → 0,

where the exponential equality again follows directly from [24, p. 1080].
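The key inequality behind the upper bound, λ_i(R̃_k^H R̃_k) ≥ λ_i(H^H H), i = 1, ..., k, and hence (77), can be checked numerically for any column permutation T. The following Python/NumPy sketch is a minimal check and not part of the proof; the sizes and the norm-based column ordering are illustrative stand-ins for a generic LS strategy.

import numpy as np

rng = np.random.default_rng(1)
N, M, k = 6, 4, 2                               # illustrative sizes
H = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2)

# Any permutation matrix T will do; here the columns are sorted by norm as a
# simple stand-in for a layer-sorting (LS) strategy.
perm = np.argsort(np.linalg.norm(H, axis=0))
T = np.eye(M)[:, perm]
R = np.linalg.qr(H @ T)[1]                      # HT = Q~ R~
Rk = R[M - k:, M - k:]                          # bottom-right k x k block R~_k

lam_full = np.linalg.eigvalsh(H.conj().T @ H)   # eigenvalues of H^H H, ascending
lam_sub = np.linalg.eigvalsh(Rk.conj().T @ Rk)  # eigenvalues of R~_k^H R~_k, ascending

# Interlacing-based bound (76) and the determinant bound leading to (77)
assert np.all(lam_sub >= lam_full[:k] - 1e-10)
assert np.linalg.det(Rk.conj().T @ Rk).real >= np.prod(lam_full[:k]) - 1e-10
print("eigenvalue and determinant bounds hold for this realization")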
APPENDIX F
EXPONENTIAL UPPER BOUND FOR LLL

In the following, starting from

    P[S_k ≥ L] ≐ P[ 1/det(R̃_k^H R̃_k) ≥ L ],  L → ∞    (85)

we prove (44) for LR-based preprocessing using the LLL algorithm. In this case, R̃ is obtained by QRD of G, i.e., G = Q̃R̃, with G denoting the LLL-reduced basis associated with H. The diagonal elements of R̃ satisfy the well-known LLL conditions [29], [34]

    R̃_{m,m}² ≥ c R̃_{m−1,m−1}²,  m = 2, ..., M    (86)

for some constant c > 0 (more precisely, 0 < c < 0.5), which gives

    R̃_{m,m}² ≥ c^{m−1} R̃_{1,1}²,  m = 1, ..., M.    (87)

Based on (87), we can now lower-bound

    det(R̃_k^H R̃_k) = ∏_{i=1}^{k} R̃_{M−i+1, M−i+1}²

according to

    det(R̃_k^H R̃_k) ≥ c' R̃_{1,1}^{2k}    (88)

where c' = c^a with a = ∑_{i=1}^{k} (M − i). Furthermore, since R̃ is upper triangular, R̃_{1,1} is the Euclidean norm of a basis vector associated with L(R̃). Hence, R̃_{1,1} ≥ γ(R̃), where γ(R̃) denotes the length of the shortest nonzero vector in L(R̃), i.e., γ(R̃) = min_{d∈(CZ)^M, d≠0} ||R̃d||. From (38) we can conclude that γ(R̃) = γ(R). With γ(R) = γ(H), we obtain R̃_{1,1} ≥ γ(H), which, upon using (88), results in

    det(R̃_k^H R̃_k) ≥ c' γ^{2k}(H).    (89)

Inserting (89) into the RHS of (85) yields

    P[S_k ≥ L] ≤̇ P[ γ^{2k}(H) ≤ L^{-1} ],  L → ∞.    (90)
From [36, Lemma 3], we have

    P[γ(H) ≤ 1/L] ≤ c̃ L^{−2N},                         for M < N
    P[γ(H) ≤ 1/L] ≤ c̃ L^{−2N} max{(ln L)^{N+1}, 1},     for M = N

with some constant c̃, which, together with (90), establishes the final result (44).
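As an aside, the diagonal-domination property (86) is easy to observe numerically. The sketch below uses a textbook real-valued LLL reduction as a simplified stand-in for the complex LLL of [34]; the function and parameter names are ours, and the constant checked is δ − 1/4 = 0.5 from the real-valued Lovász condition rather than the value of c quoted above.

import numpy as np

def lll_reduce(B, delta=0.75):
    """Textbook real-valued LLL reduction of the columns of B; returns the
    reduced basis. Simplified (Gram-Schmidt recomputed after every update)."""
    B = B.astype(float).copy()
    M = B.shape[1]

    def gso(B):
        # Gram-Schmidt orthogonalization of the columns of B
        Bs = B.astype(float).copy()
        mu = np.zeros((M, M))
        for i in range(M):
            for j in range(i):
                mu[i, j] = B[:, i] @ Bs[:, j] / (Bs[:, j] @ Bs[:, j])
                Bs[:, i] -= mu[i, j] * Bs[:, j]
        return Bs, mu

    Bs, mu = gso(B)
    m = 1
    while m < M:
        for j in range(m - 1, -1, -1):           # size reduction of column m
            q = round(mu[m, j])
            if q != 0:
                B[:, m] -= q * B[:, j]
                Bs, mu = gso(B)
        lovasz_ok = Bs[:, m] @ Bs[:, m] >= (delta - mu[m, m - 1] ** 2) * (Bs[:, m - 1] @ Bs[:, m - 1])
        if lovasz_ok:
            m += 1
        else:
            B[:, [m - 1, m]] = B[:, [m, m - 1]]  # swap columns and backtrack
            Bs, mu = gso(B)
            m = max(m - 1, 1)
    return B

rng = np.random.default_rng(2)
M = 4
Hreal = rng.standard_normal((M, M))              # illustrative real-valued basis
G = lll_reduce(Hreal)                            # LLL-reduced basis
R = np.linalg.qr(G)[1]
d = np.abs(np.diag(R))
c = 0.75 - 0.25                                  # delta - 1/4 for the real-valued case
assert np.all(d[1:] ** 2 >= c * d[:-1] ** 2 - 1e-9)   # condition of type (86)
print("diagonal of R after LLL:", np.round(d, 3))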
APPENDIX G
UPPER BOUND (60) ON µ²(R)

We first define d̂_SIC as the detection result obtained by successive interference cancelation (SIC) (see, e.g., [31]). Noting that d_k = [d_{M−k+1} d_{k−1}^T]^T, the components of d̂_SIC are obtained by solving M scalar minimization problems according to

    d̂_{SIC, M−k+1} = arg min_{d_{M−k+1} ∈ CZ} |∆_k([d_{M−k+1} d̂_{SIC, k−1}^T]^T)|²    (91)

for k = 1, ..., M, starting with k = 1, where the metric |∆_k(d_k)|² is defined in (8). Since d̂_SIC is a suboptimum solution to the CLP problem (5), we have

    µ²(R) = max_{y∈C^M} min_{d∈(CZ)^M} ||y − Rd||² ≤ max_{y∈C^M} ||y − R d̂_SIC||².    (92)

The distance ||y − R d̂_SIC||² achieved by the SIC detector can be further upper-bounded as follows. Since (91) corresponds to a simple scalar minimization problem over Gaussian integers that are scaled by R_{M−k+1, M−k+1}, we obtain

    |∆_k(d̂_SIC,k)|² ≤ (1/2) R_{M−k+1, M−k+1}²,  k = 1, ..., M

for any y. Consequently, we have

    ||y − R d̂_SIC||² = ∑_{k=1}^{M} |∆_k(d̂_SIC,k)|² ≤ ∑_{k=1}^{M} (1/2) R_{M−k+1, M−k+1}²

which, together with (92), establishes (60) and hence concludes the proof.
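The SIC-based bound is straightforward to verify numerically. The following Python/NumPy sketch implements SIC detection in the spirit of (91) for an upper triangular R over Gaussian integers and checks that the resulting distance never exceeds (1/2) ∑_m R_{m,m}², in line with (60); the matrix size and the number of test points are illustrative choices of ours.

import numpy as np

def sic_detect(R, y):
    """SIC over Gaussian integers for upper triangular R (cf. (91)):
    detect the last layer first and cancel its contribution."""
    M = R.shape[0]
    d = np.zeros(M, dtype=complex)
    for m in range(M - 1, -1, -1):
        z = (y[m] - R[m, m + 1:] @ d[m + 1:]) / R[m, m]
        d[m] = np.round(z.real) + 1j * np.round(z.imag)   # nearest Gaussian integer
    return d

rng = np.random.default_rng(3)
M = 4                                              # illustrative size
H = (rng.standard_normal((M, M)) + 1j * rng.standard_normal((M, M))) / np.sqrt(2)
R = np.linalg.qr(H)[1]

bound = 0.5 * np.sum(np.abs(np.diag(R)) ** 2)      # (1/2) sum_m |R_{m,m}|^2, cf. (60)
for _ in range(1000):
    y = rng.standard_normal(M) + 1j * rng.standard_normal(M)   # arbitrary test point
    d = sic_detect(R, y)
    assert np.linalg.norm(y - R @ d) ** 2 <= bound + 1e-9
print("SIC distance bound", bound, "holds for all tested points")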
ACKNOWLEDGMENT

The authors would like to thank C. Akçaba and P. Coronel for helpful discussions.

REFERENCES

[1] E. Agrell, T. Eriksson, A. Vardy, and K. Zeger, "Closest point search in lattices," IEEE Trans. Inf. Theory, vol. 48, no. 8, pp. 2201–2214, Aug. 2002.
[2] U. Fincke and M. Pohst, "Improved methods for calculating vectors of short length in a lattice, including a complexity analysis," Math. Comp., vol. 44, pp. 463–471, Apr. 1985.
[3] W. H. Mow, "Maximum likelihood sequence estimation from the lattice viewpoint," in Proc. ICCS/ISITA 1992, vol. 1, Singapore, Nov. 1992, pp. 127–131.
[4] E. Viterbo and E. Biglieri, "A universal decoding algorithm for lattice codes," in GRETSI 14-ème Colloq., Juan-les-Pins, France, Sept. 1993, pp. 611–614.
[5] E. Viterbo and J. Boutros, "A universal lattice code decoder for fading channels," IEEE Trans. Inf. Theory, vol. 45, pp. 1639–1642, July 1999.
[6] M. O. Damen, H. El Gamal, and G. Caire, "On maximum-likelihood detection and the search for the closest lattice point," IEEE Trans. Inf. Theory, vol. 49, no. 10, pp. 2389–2402, Oct. 2003.
[7] A. Burg, M. Borgmann, M. Wenk, M. Zellweger, W. Fichtner, and H. Bölcskei, "VLSI implementation of MIMO detection using the sphere decoding algorithm," IEEE J. Solid-State Circuits, vol. 40, no. 7, pp. 1566–1577, July 2005.
[8] C. Studer, A. Burg, and H. Bölcskei, "Soft-output sphere decoding: Algorithms and VLSI implementation," IEEE J. Sel. Areas Commun., vol. 26, no. 2, pp. 290–300, Feb. 2008.
[9] J. Savage, "The distribution of the sequential decoding computation time," IEEE Trans. Inf. Theory, vol. 12, no. 2, pp. 143–147, Apr. 1966.
[10] I. M. Jacobs and E. R. Berlekamp, "A lower bound to the distribution of computation for sequential decoding," IEEE Trans. Inf. Theory, vol. 13, no. 2, pp. 167–174, Apr. 1967.
[11] B. Hassibi and H. Vikalo, "On the sphere decoding algorithm I. Expected complexity," IEEE Trans. Signal Processing, vol. 53, no. 8, pp. 2806–2818, Aug. 2005.
[12] H. Vikalo and B. Hassibi, "On the sphere decoding algorithm II. Generalizations, second-order statistics, and applications to communications," IEEE Trans. Signal Processing, vol. 53, no. 8, pp. 2819–2834, Aug. 2005.
[13] J. Jaldén and B. Ottersten, "On the complexity of sphere decoding in digital communications," IEEE Trans. Signal Processing, vol. 53, no. 4, pp. 1474–1484, Apr. 2005.
[14] D. Seethaler and H. Bölcskei, "Performance and complexity analysis of infinity-norm sphere-decoding," IEEE Trans. Inf. Theory, vol. 56, no. 3, pp. 1085–1105, Mar. 2010.
[15] G. D. Golden, G. J. Foschini, R. A. Valenzuela, and P. W. Wolniansky, "Detection algorithm and initial laboratory results using V-BLAST space-time communication architecture," Electronics Letters, vol. 35, pp. 14–16, Jan. 1999.
[16] J. H. Conway and N. J. A. Sloane, Sphere Packings, Lattices and Groups. Berlin, Heidelberg, New York: Springer, 1988.
[17] A. Papoulis, Probability, Random Variables, and Stochastic Processes, 3rd ed. New York: McGraw-Hill, 1991.
[18] A. D. Murugan, H. El Gamal, M. O. Damen, and G. Caire, "A unified framework for tree search decoding: Rediscovering the sequential decoder," IEEE Trans. Inf. Theory, vol. 52, no. 3, pp. 933–953, Mar. 2006.
[19] C. Studer, D. Seethaler, and H. Bölcskei, "Finite lattice-size effects in MIMO detection," in Proc. 42nd Asilomar Conf. Signals, Systems, and Computers, Pacific Grove, CA, Oct. 2008, pp. 2032–2037.
[20] A. H. Banihashemi and A. K. Khandani, "On the complexity of decoding lattices using the Korkin-Zolotarev reduced basis," IEEE Trans. Inf. Theory, vol. 44, no. 1, pp. 162–171, Jan. 1998.
[21] B. M. Hochwald and S. ten Brink, "Achieving near-capacity on a multiple-antenna channel," IEEE Trans. Commun., vol. 51, no. 3, pp. 389–399, Mar. 2003.
[22] R. Böhnke, D. Wübben, V. Kühn, and K. D. Kammeyer, "Reduced complexity MMSE detection for BLAST architectures," in Proc. IEEE Globecom 2003, vol. 4, San Francisco, CA, Dec. 2003, pp. 2258–2262.
[23] P. M. Gruber and J. M. Wills, Eds., Handbook of Convex Geometry, vol. B. Amsterdam: North Holland/Elsevier, 1993.
[24] L. Zheng and D. Tse, "Diversity and multiplexing: A fundamental tradeoff in multiple antenna channels," IEEE Trans. Inf. Theory, vol. 49, no. 5, pp. 1073–1096, May 2003.
[25] D. Tse and P. Viswanath, Fundamentals of Wireless Communication. Cambridge, U.K.: Cambridge University Press, 2005.
[26] Y. Jiang, X. Zheng, and J. Li, "Asymptotic performance analysis of V-BLAST," in Proc. IEEE Globecom 2005, vol. 6, St. Louis, MO, Nov. 2005, pp. 3882–3886.
[27] A. M. Tulino and S. Verdú, Random Matrix Theory and Wireless Communications. Hanover, MA: Now Publishers, 2004.
[28] M. Abramowitz and I. Stegun, Handbook of Mathematical Functions. New York: Dover, 1965.
[29] A. K. Lenstra, H. W. Lenstra, Jr., and L. Lovász, "Factoring polynomials with rational coefficients," Math. Ann., vol. 261, pp. 515–534, 1982.
[30] M. Seysen, "Simultaneous reduction of a lattice basis and its reciprocal basis," Combinatorica, vol. 13, pp. 363–376, 1993.
[31] D. Wübben, R. Böhnke, J. Rinas, V. Kühn, and K. D. Kammeyer, "Efficient algorithm for decoding layered space-time codes," Electronics Letters, vol. 37, no. 22, pp. 1348–1350, Oct. 2001.
[32] H. El Gamal, G. Caire, and M. O. Damen, "Lattice coding and decoding achieve the optimal diversity-multiplexing tradeoff of MIMO channels," IEEE Trans. Inf. Theory, vol. 50, no. 6, pp. 968–985, Jun. 2004.
[33] R. A. Horn and C. R. Johnson, Matrix Analysis. Cambridge (UK): Cambridge Univ. Press, 1993. [34] Y. H. Gan, C. Ling, and W. H. Mow, “Complex lattice reduction algorithm for low-complexity MIMO detection,” IEEE Trans. Signal Processing, vol. 57, no. 7, pp. 2701–2710, July 2009. [35] R. J. Muirhead, Aspects of Multivariate Statistical Theory. Hoboken, New Jersey: Wiley, 2005. [36] M. Taherzadeh, A. Mobasher, and A. K. Khandani, “Communication over MIMO broadcast channels using lattice-basis reduction,” IEEE Trans. Inf. Theory, vol. 53, no. 12, pp. 4567–4582, Dec. 2007.
Dominik Seethaler received the Dipl.-Ing. and Dr. techn. degrees in electrical/communication engineering from Vienna University of Technology, Vienna, Austria, in 2002 and 2006, respectively. From 2002 to 2007 he was a Research and Teaching Assistant at the Institute of Communications and Radio Frequency Engineering, Vienna University of Technology. From September 2007 to August 2009 he was a postdoctoral researcher at the Communication Technology Laboratory, ETH Zurich, Switzerland. He is now with RobArt, Linz, Austria.
Joakim Jaldén (S'03–M'08) received the M.Sc. and Ph.D. degrees in electrical engineering from the KTH Royal Institute of Technology, Stockholm, Sweden, in 2002 and 2007, respectively. From July 2007 to June 2009 he held a post-doctoral research position at the Vienna University of Technology, Vienna, Austria. He also studied at Stanford University, CA, USA, from September 2000 to May 2002, and worked at ETH Zürich, Switzerland, as a visiting researcher from August to September 2008. In July 2009 he joined the Signal Processing Lab within the School of Electrical Engineering at KTH, Stockholm, Sweden, as an Assistant Professor. For his work on MIMO communications, he has been awarded the IEEE Signal Processing Society's 2006 Young Author Best Paper Award and the first prize in the Student Paper Contest at the 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). He is also a recipient of the Ingvar Carlsson Award issued in 2009 by the Swedish Foundation for Strategic Research.
Christoph Studer (S'06–M'10) was born in Solothurn, Switzerland, on December 25, 1979. He received the M.Sc. and Dr. sc. degrees in electrical engineering from ETH Zürich, Switzerland, in 2005 and 2009, respectively. In 2005, he was a Visiting Researcher with the Smart Antennas Research Group, Information Systems Laboratory, Stanford University, Stanford, CA, USA. From 2006 to 2009, he was a Research Assistant with the Integrated Systems Laboratory (IIS) at ETH Zurich. In 2008, he was a consultant for Celestrius, an ETH spin-off specialized in the field of MIMO wireless communication. Between 2008 and 2010, he was a postdoctoral researcher at the Communication Technology Laboratory, ETH Zurich, Switzerland. Since 2011, he has been a postdoctoral researcher at the Department of Electrical and Computer Engineering, Rice University, Houston, TX. His research interests include signal processing and the design of VLSI circuits and systems. Dr. Studer was the recipient of an ETH Medal in 2005 and 2011 for his Master's and his Ph.D. thesis, respectively. He received the Best Student Paper Award of the 2007 Asilomar Conf. on Signals, Systems, and Computers and of the 2008 IEEE Int. Symp. on Circuits and Systems. In 2010, he received (jointly with Dr. Seethaler and Mr. Fateh) the Swisscom and ICTnet Innovations Award.
Helmut Bölcskei (S'94–M'98–SM'02–F'09) was born in Mödling, Austria, on May 29, 1970, and received the Dipl.-Ing. and Dr. techn. degrees in electrical engineering from Vienna University of Technology, Vienna, Austria, in 1994 and 1997, respectively. In 1998 he was with Vienna University of Technology. From 1999 to 2001, he was a postdoctoral researcher in the Information Systems Laboratory, Department of Electrical Engineering, and in the Department of Statistics, Stanford University, Stanford, CA. He was in the founding team of Iospan Wireless Inc., a Silicon Valley-based startup company (acquired by Intel Corporation in 2002) specialized in multiple-input multiple-output (MIMO) wireless systems for high-speed Internet access. From 2001 to 2002 he was an Assistant Professor of Electrical Engineering at the University of Illinois at Urbana-Champaign. He has been with ETH Zurich since 2002, where he is Professor of Electrical Engineering. He was a visiting researcher at Philips Research Laboratories Eindhoven, The Netherlands, ENST Paris, France, and the Heinrich Hertz Institute Berlin, Germany. His research interests are in information theory, applied and computational harmonic analysis, and mathematical signal processing. He received the 2001 IEEE Signal Processing Society Young Author Best Paper Award, the 2006 IEEE Communications Society Leonard G. Abraham Best Paper Award, the 2010 Vodafone Innovations Award, and the ETH Golden Owl Teaching Award; he is a Fellow of the IEEE, a EURASIP Fellow, and was an Erwin Schrödinger Fellow of the Austrian National Science Foundation. He was a plenary speaker at several IEEE conferences and served as an associate editor of the IEEE Transactions on Signal Processing, the IEEE Transactions on Wireless Communications, and the EURASIP Journal on Applied Signal Processing. He is currently on the editorial board of Foundations and Trends in Networking, serves as Editor-in-Chief of the IEEE Transactions on Information Theory, was TPC co-chair of the 2008 IEEE International Symposium on Information Theory, and serves on the board of governors of the IEEE Information Theory Society.