Scaling Laws of Multiple Antenna Group ... - Semantic Scholar

3 downloads 154 Views 259KB Size Report
A. Dana is with Qualcomm Corporation, San Diego, CA. B. Hassibi is with ...... Angeles in 1999, National Semiconductor, Santa Clara, CA, in 2001 and 2002,.
5030

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 7, NO. 12, DECEMBER 2008

Scaling Laws of Multiple Antenna Group-Broadcast Channels Tareq Y. Al-Naffouri, Amir F. Dana, and Babak Hassibi Abstract—Broadcast (or point to multipoint) communication has attracted a lot of research recently. In this paper, we consider the group broadcast channel where the users’ pool is divided into groups, each of which is interested in common information. Such a situation occurs for example in digital audio and video broadcast where the users are divided into various groups according to the shows they are interested in. The paper obtains upper and lower bounds for the sum rate capacity in the large number of users regime and quantifies the effect of spatial correlation on the system capacity. The paper also studies the scaling of the system capacity when the number of users and antennas grow simultaneously. It is shown that in order to achieve a constant rate per user, the number of transmit antennas should scale at least logarithmically in the number of users. Index Terms—MIMO systems, transmit diversity, broadband mobile communication systems, wireless personal communication systems.

I. I NTRODUCTION

F

UTURE breakthroughs in wireless communications will be mostly driven by applications that require high data rates [1]. While increasing the link budget and/or bandwidth can accommodate this increase in data rate, such a solution would not be economical. A more cost effective solution is to exploit the spacial dimension by employing multiple antennas at the transmitter and receiver. Multiple input multiple output (MIMO) communication has thus been the focus of a lot of research [1], [2], [3] which basically demonstrated that the capacity of a point to point MIMO link increases linearly with the number of transmit and receive antennas (an excellent overview of the research on this problem can be found in [4]). Research focus has shifted recently to the role of multiple antennas in multiuser systems, especially broadcast scenarios (i.e., point to multipoint communication) as downlink scheduling is the major bottleneck for future broadband wireless networks. The broadcast channel resembles downlink communication in a cellular system, where the base station is to transmit to a group of users. In these and other broadcast scenarios, one is usually interested in 1) quantifying the system capacity or the maximum possible sum rate to all users, 2) quantifying the scaling behavior of the sum rate for large Manuscript received August 14, 2007; revised February 17, 2008; accepted June 2, 2008. The associate editor coordinating the review of this paper and approving it for publication was I. Lee. T. Y. Al-Naffouri is with the Electrical Engineering Department, King Fahd University of Petroleum and Minerals, Saudi Arabia (e-mail: [email protected]). The work of T. Y. Al-Naffouri was supported by Project IN070373, Deanship of Scientific Research at King Fahd University of Petroleum and Minerals. A. Dana is with Qualcomm Corporation, San Diego, CA. B. Hassibi is with California Institute of Technology (e-mail: [email protected]). Digital Object Identifier 10.1109/T-WC.2008.070902

number of users, and 3) devising computationally efficient algorithms for capturing most of the sum rate in the large number of users regime. In this paper, we distinguish between two types of broadcast scenarios depending on whether the users are interested in common information or not. A. Broadcasting to Independent Users In this problem, users are interested in independent information. Much work has been devoted to answer the three questions raised above for this problem. The capacity region question was recently settled by a technique similar to writing on dirty paper and hence bearing the name dirty paper coding (DPC). Specifically, [5] and [6] have shown that DPC is capable of achieving the maximum possible sum-rate capacity. Subsequently, [7] showed that DPC is able to achieve any point in the capacity region. While DPC solves the broadcast problem optimally, it is computationally expensive and requires a great deal of feedback as the transmitter needs perfect channel state information of all users [8]. Thus, there has been increased interest to match the DPC capacity for large number of users using simple techniques. In [9] and [10], Sharif and Hassibi showed that the sum rate capacity behaves like M log n log n for large n (where n is the total number of users and M is the number of transmit antennas). They also showed that opportunistic beam forming matches this limit. Other promising techniques for capturing most of the DPC capacity were proposed in [7], [10], [11], [12]. They all rely on multiuser diversity to match the DPC for large number of users. Here, each user experiences a different channel and therefore the transmitter can exploit this diversity and choose the set of users that have the best channel conditions. B. Broadcasting to Groups of Users The broadcast scenario considered above assumes that the various users are interested in independent streams of data. More common is the situation that one group of users would be interested in one stream of data, another group with another stream, and so on. An example where this might occur is digital audio and video broadcast where there is a limited number of shows and users are classified according to the shows they are interested in [13], [14], [15]. Here, similar questions to the (independent user) broadcast problem would be relevant. To make the discussion more rigorous, assume that there are n users each equipped with a single antenna. The users are partitioned into K groups where each group is interested in the same stream of data. The transmitter, which is equipped with M antennas, is to schedule transmission to these groups

c 2008 IEEE 1536-1276/08$25.00 

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on January 16, 2009 at 12:39 from IEEE Xplore. Restrictions apply.

AL-NAFFOURI et al.: SCALING LAWS OF MULTIPLE ANTENNA GROUP-BROADCAST CHANNELS

so as to maximize the sum-rate capacity of the system. If the transmitter had one antenna only, this would be a trivial problem. For in this case, all channels involved would be single input single output. Thus, to transmit to any group of users, one simply needs to take care of the user with the weakest link (i.e., the one with the least channel gain). Such ordering of users, however, is not possible in the multiple antenna case and the problem becomes more challenging. The group broadcast problem includes the (independent users) broadcast problem as a special case. Specifically, the independent users broadcast is a group broadcast problem in which each group consists of one user only. The other extreme is when all users belong to one group and are all interested in the same information. This is known as a multicast problem and has started to attract some attention recently. In [16], Khitsi et. al. characterized the system capacity for the two user binary multicast problem. When multicasting to more than 2 users or for Gaussian multicast, [16] was only able to obtain upper and lower bounds. In [17], Steinberg and Shamai considered a two user situation with a hybrid of broadcast and multicast in which the two users can have common as well as independent messages. By examining the techniques used in [16] and [17] and the results arrived at, one can be convinced that finding the exact capacity region for the multicast problem (let alone the more general group broadcast problem) can be quite challenging. As such, several authors have resorted instead to evaluate the capacity asymptotes. In his Masters Thesis [18], Khitsi considered the multicast problem where there is exactly one group of n users interested in a common message transmitted from a base station with M antennas. He showed that for 1 large n, the capacity decreases in proportion to n− M . In a recent paper [19], Jindal and Luo built on the work of Khisti and obtained the scaling order of various techniques when used in a multicast scenario. These techniques include transmit beamforming and group broadcast using spatially white or orthogonal signaling. In this paper, we consider the multiple group broadcast in the large number of users and antennas regime. In contrast to the scaling analysis of [18] and [19], we don’t just obtain order relationships but rather obtain upper and lower bounds that more tightly characterize the system sum-rate capacity. The paper is organized as follows. In Section III, we present a general theory for the scaling behavior of the minimum of a number of iid random variables. The rest of the paper is divided into two parts. In the first part, we consider the large number of users (n) case and obtain upper bounds (in Section IV) and lower bounds (in Section V). In the second part, we consider the scaling for the large number of antennas (M ) regime. We do so for fixed n (Section VI), for n = βM (Section VII) and for n = eM (Section VIII). We set the stage, however, by introducing the system model. II. S YSTEM M ODEL AND F ORMAL D EFINITION OF C APACITY A. System Model Consider a broadcast channel with a base station equipped with M antennas and n users each equipped with a single receive antenna. The n users are partitioned into K groups

5031

each of which is interested in the same stream of data. Specifically, we assume there are αk n users in the kth group K k = 1, 2, . . . , K where k=1 αi = 1, αk > 0. We also let α denote mink {αk } since many of our results will depend on this minimum. Moreover, to get more insights, we will often 1 . times specialize our findings to the case where α = αk = K The received signal at the ik th user is given by yik = h∗ik s + νik where s is the transmitted signal and is subject to an average power constraint E[ss∗ ] ≤ P I and νik ∼ CN (0, 1) is the additive white complex Gaussian noise. The vector hik is the channel matrix between the base station and user ik . The users’ channels are independent and identically distributed as CN (0, IM ). At times, we will assume that hik exhibits some spatial correlation so that hik ∼ CN (0, R) where R is full rank with Tr(R) = M. B. Group Sum-Capacity Expression In this paper we are interested in the sum-capacity of the group broadcast systems. Let Wi be the information stream of rate Ri intended for users in group i for each i = 1, . . . , K. Then (R1 , . . . , RK ) is an achievable rate vector if each user in each group can decode its intended information stream with arbitrary small probability of error using long enough block codes. We define the sum-capacity of the system as the supremum of sum of rates among all achievable rate vectors. Consider a multicast system (i.e., a group broadcast with only one group). Given a total power budget P, the base station can transmit to one user at the following maximum rate   max log det 1 + h2B Coneuser = E B≥0 Tr(B)≤P

Δ

where hi 2B = h∗i Bhi and B is the correlation of the input. Since each user in the group is interested in the same data, we have to appeal to all users, particularly the weakest one, by minimizing the log det over all users, i.e.   (1) Csingle = E max min log det 1 + hi 2B B≥0 Tr(B)≤P

i

The above expression is the sum-capacity of a multicast system. However the sum-capacity of group broadcast problems in general is not known. If the transmitter had one antenna only, this would be a trivial problem. For in this case, all channels involved would be single input single output. Thus, to transmit to the kth group of users, one simply needs to take care of the user with the weakest link, i.e. the link for which |hik | ≤ |hjk | for all j. Such ordering of users, however, is not possible in the multiple antenna case and the problem becomes more challenging. Hence, we resort in this paper to bounding the capacity in the large number of users and antennas regime. III. S CALING LAW FOR THE MINIMUM OF A NUMBER OF RANDOM VARIABLES

Group broadcast is intuitively limited by the worst of otherwise identical users. As such, we present in this section a theorem for finding the minimum of a large number of random variables. To this end, let x1 , x2 , · · · , xn be iid nonnegative random variables with pdf f (x), CDF F (x), and

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on January 16, 2009 at 12:39 from IEEE Xplore. Restrictions apply.

5032

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 7, NO. 12, DECEMBER 2008

characteristic function φ(s). We would like to find the scaling law of the minimum of these random variables, xmin (n) = min {x1 , x2 , · · · , xn } . The CDF of the minimum is given by Fmin (x) = 1 − (1 − F (x))n . Let’s first illustrate the direct method of finding the scaling of the minimum through the following example. a) Example 1: Beamforming for multicast: In beamforming [10], the transmitter sends M orthonormal beams φ1 , φ2 , . . . , φM , and asks each user to feedback the SINR associated with each beam. For example the SINR associated with beam φ1 for user i is given by SINRi =

1 ρ

|h∗i φ1 |2 M + m=2 |h∗i φm |2

where ρ = M P is the signal to noise ratio. Since the base station has to appeal to all users in the group, we are constrained by the worst user. For a given beam, the SINRi ’s are iid with CDF [10] x

e− ρ FSINR (x) = 1 − (1 + x)M−1

x≥0

So the CDF of the minimum of n such SINR’s is given by Fmin SINR (x)

= 1 − (1 − FSINR (x))n nx e− ρ = 1− x≥0 (1 + x)n(M−1)

Note that limn→∞ Fmin SINR ( nx ) = 1 shows that for large n, mini SINRi

− e−x(1/ρ+M−1) . This 1 scales as ( 1 +M−1)n . ρ The method of Example 1 might not apply all the time as it is difficult to find the CDF in closed form sometimes. The following theorem provides a more general method for finding the scaling of the minimum. The proof is omitted due to space limitations. Theorem 1: Let x1 , x2 , · · · , xn be iid nonnegative random variables with CDF F (x), and characteristic function φ(s). Let xmin (n) denote the minimum of these random variables min {x1 , x2 , · · · , xn } . Then an xmin (n) converges in distribu  tion to a random variable y with CDF Fy (y) = 1−exp −y i0 where i0 is the first non-zero derivative of F (x) at zero, i.e., F (i0 ) (0) = 0 and F (j) (0) = 0 for all j < i0 and where 1

an =

i0 ! i0

1 1

F (i0 ) (0) i0

Furthermore, we can find F theorem

(i0 )

1

n i0

(0) using the initial value

limx→0 F (i0 ) (x) = lims→∞ si0 φ(s) b) Example 2: Scaling of spatially correlated channel norms: Let’s find the scaling law for minhi hi 2 when hi are iid CN (0, R). The pdf and CDF of hi 2 will both have different forms depending on whether some of the eigenvalues λl of R are the same or different, and so the direct method for scaling can be quite challenging. On the other hand, the characteristic function takes one form and is given by φ(s) =  M 1 l=1 1+λl s . From this, it is easy to see that  0 for i < M i (i) lim s φ(s) = F (0) = 1 1 for i = M M λ s→∞ l l=1

We thus conclude that 1

1

min hi 2 scales as CM det(R) M i

1

nM

1

where CM = (M !) M . IV. U PPER B OUNDS A. An Upper Bound Using the MAC-BC Duality Consider the group broadcast problem with K groups. To obtain an upper bound, we consider all possible broadcast systems with exactly one user chosen from each group. This is a typical broadcast scenario for which the sum-rate capacity is given by dirty paper coding. Using the MAC-BC duality [20], the sum-capacity is given by CK users =

K  max hik bk h∗ik ) log det(I + k=1 bk ≥ 0 bk = P

(2)

Now, sweeping over all the possible selections of the K users (one user from each group) we can write Cu = min · · · min hi1

hiK

K  max hik bk h∗ik ) (3) log det(I + k=1 bk ≥ 0 bk = P

This is an upper bound on the sum-rate because the bk ’s are optimized for each set of K users when in the group broadcast problem the bk ’s should be the same for all user groups. To get rid of the determinant in (3), we use the arithmetic-mean geometric-mean (AM-GM) inequality  M tr(A) det(A) ≤ M to write Cu ≤ M log(1 + min · · · min max hi1

hiK

bk

K 1  bk hik 2 ) (4) M k=1

P min · · · min max{hi1 2 , · · · , hiK 2 }) (5) = M log(1 + hiK k M hi1 P max min · · · min{hi1 2 , · · · , hiK 2 }) (6) = M log(1 + hiK M k hi1 i.e. Cu ≤ M log(1 +

1 P CM max 1 1 ) k M αM nM

(7)

k

where the third line follows from exchanging the order of minimum and maximum. Thus, we can write  1 P Cu ≤ M log 1 + CM 1 1 (8) M αM nM 1 , and the upper In the equal number of users case, αk = K bound becomes

1 P KM CM 1 (9) Cu ≤ M log 1 + M nM

An alternative upper bound can be obtained as follows. Starting with the MAC-BC inequality (3), let

√ √ Hi = b1 hi1 · · · bK hiK

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on January 16, 2009 at 12:39 from IEEE Xplore. Restrictions apply.

AL-NAFFOURI et al.: SCALING LAWS OF MULTIPLE ANTENNA GROUP-BROADCAST CHANNELS

 ∗ ∗ then, K k=1 hik bk hik = Hi Hi and the upper bound can be equivalently expressed as Cu = min · · · min hi1

hiK

log det (IK + Hi∗ Hi )

max

bk ≥ 0 bk = P

Note that the matrix inside the determinant is now of size K. So applying the AM-GM inequality this time yields  P 1 Cu ≤ K log 1 + CM 1 1 (10) K αM nM Combing (8) and (10) yields for the equal number of users case

1 KM P CM 1 (11) Cu ≤ min{M, K} log 1 + min{M, K} nM Using the expansion that for small x, log(1+x) = x+O(x2 ), accurate for small x, we can write 1

Cu ≤ P CM

KM n

1 M

+ O(

1 2

nM

(12)

)

In the above, we assumed the hik ’s to be white. In the correlated case, the minimization in (6) is done over hi ’s with 1 autocorrelation matrix R and that results in a hit det(R) M on the upper bound 1 1

Cu, corr ≤ P CM det(R) M

1

KM n

1 M

+ O(

1 2

nM

(13)

)

B. How Loose is the Upper Bound? The upper bounds obtained above come from two contributions: the BC-MAC duality and the AM-GM inequality. It is thus interesting to see how loose the upper bound becomes due to the use of the AM-GM inequality. To this end, note that for a positive semidefinite matrix A, det(I + A) ≥ 1 + tr(A) which when applied to (3) yields the following lower bound on Cu

K  ∗ Cu ≥ min · · · min max log 1 + tr hi bk hi (14) hi,1

hi,K

bk

k=1

Following the steps (4)-(7), we obtain (assuming equal number of users case for simplicity)

1 KM Cu ≥ log 1 + P CM 1 (15) nM or using the log(1 + x) = x + O(x2 ) approximation 1

Cu ≥ P CM

KM 1

nM

+ O(

1 2

nM

(16)

)

which coincides with (12). So we don’t loose anything by applying the AM-GM inequality for large number of users and we thus conclude 1

Cu = P CM K M + O( 1 nM

1 By

1

2

nM

(17)

) 1

the AM-GM inequality, we know that det(R) M ≤

1 M

tr(R) = 1.

5033

V. L OWER B OUNDS Having obtained an upper bound on the sum-rate, we now quantify how various methods for scheduling (or resource allocation) behave for large number of users. This gives us an idea about the achievable rates and also provides lower bounds on the group broadcast problem. Specifically, we consider the following scheduling schemes 1) Opportunistic random beamforming 2) Time sharing and 3) Scheduling by treating interference as noise A. Opportunistic Random Beamforming Opportunistic beamforming is a broadcast technique proposed in [9] that captures most of the capacity guaranteed by DPC for large number of users while requiring much less feedback and complexity. Here, the transmitter attempts to choose the best M out of K users to transmit to. To do so, the transmitter uses its M antennas to send M random beams. Each user calculates the M SINR’s (signal to interference and noise ratio), one SINR for each beam, and feeds back the maximum SINR along with its index. The transmitter in turn ranks the K users according to their SINR’s and transmits to the M best ones. Not only does this method require much less feedback than the DPC approach, but it also asymptotically (i.e., in the presence of large number of users) achieves the same performance [9]. To be more specific, the transmitter chooses M random orthonormal beam vectors φm (of size M × 1) generated according to an isotropic distribution. Now these beams are used to transmit the symbols s1 (t), s2 (t), . . . , sM (t) by constructing the transmit vector s(t) =

M 

φm (t)sm (t),

t = 1, . . . , T

(18)

m=1

After T channel uses, the transmitter independently chooses another set of orthonormal vectors {φm } and reconstructs the signal vector s(t) again (according to (18)) and so on. From now on and for simplicity, we will drop the time index t. The signal yk received by the k’th user is given by yk

= =

h∗k s + nk M  h∗k φm sm + nk , k = 1, . . . , K m=1

Here E(ss∗ ) = P I since the sm ’s are assumed to be identically distributed and independently assigned to different users. The k’th receiver estimates the effective channel gain h∗k φm , something that can be arranged by training, to calculate M SINR’s, one for each transmitted beam SINRk,m =

1 ρ

|hk φm |2  + j=m |hk φj |2

m = 1, . . . , M

where ρ = M P is the signal to noise ratio. Each receiver then feeds back its maximum SINR, i.e. max SINRk,m , 1≤m≤M

along with the maximizing index m. Thereafter, the transmitter assigns sm to the user with the highest corresponding SINR, i.e. max SINRk,m . If we perform the above scheduling, the 1≤k≤K

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on January 16, 2009 at 12:39 from IEEE Xplore. Restrictions apply.

5034

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 7, NO. 12, DECEMBER 2008

throughput for large n can be written as [10] 2  RRBF = M E log 1 + max SINRk,1 + o(1) 1≤k≤K

(19)

where the term o(1) accounts for the small probability that user k may be the strongest user for more than one beam φm [10]. In the group broadcast scenario, we replace each user with a group of users of size αk n (the group of users interested in the same data). In this case a beam is assigned to each group. In order to maximize the throughput, each beam m is assigned to the group with the best equivalent SINR. The equivalent SINR of each group is the minimum of SINR’s of decoding the particular beam among users in the group. Therefore, in the group broadcast problem, random beamforming results in the following max min problem

C. Treating Interference as Noise The other extreme would be to allow all groups to be served simultaneously. Each group would then ignore signals that are meant for the other groups, treating them as additive noise. The rate that the 1st group achieves with this strategy would be

h∗i1 B1 hi1 R1 = min log K i1 1 + h∗i1 k=2 Bk hi1 By choosing the Bk ’s properly, we can maximize the sum rate  k Rk . Now, relax the problem further by assuming equal 1 P isotropic covariances for all user groups, i.e. set Bk = K M I, then we can show that 1 P 2 K M hi1  R1 = log min i1 1 + K−1 P hi 2 1 K M 1

KM 1 P 1 RRBF = CM 1 + O( 2 ) =    KM nM nM M E log 1 + max min SINRi1 , . . . , min SINRiK + o(1) Thus, the sum rate for K such user groups is upper bounded i1 iK k according to In Example 1, we proved that the minimum SINR scales as 1 KM P 1 1 1 C ≥ + O( 2 ) R Interf M 1 min SINRi = 1 M M M n n i n ρ +M −1 1 M From that, we conclude that the for large number of users, the Correlation will again introduce a hit det(R) on the lower bound sum rate is given by 1

1 1 P KM M RInterf, corr ≥ det(R) CM 1 + O( 2 ) 1 1 M RRBF = M log 1 + 1 M n nM ( ρ + M − 1) αn We summarize the findings of sections IV and V by the or for the equal number of users per group case following theorem.

Theorem 2: Consider the following group broadcast sceK 1 nario: A base station with M antennas is to broadcast to K RRBF = M log 1 + 1 + o(1) n ( ρ + M − 1) n groups of K users each, under a total power constraint P. Assume also that the users’ channels are iid Gaussian. Then B. Time Sharing the sum-rate capacity behaves like Time sharing can achieve better performance even if we 1 force the groups to have an equal time share C = γP CM K M + O( 12 ) 1 nM nM K   1  2 1 RTS ≥ max min log det 1 + hik Bk where M ≤ γ ≤ 1. When the users’ channels exhibit K Tr(Bk )=P Bk ≥0 hik correlation R, the capacity scales as k=1  K 1  1 1 KM 1 M PC = log 1 + max min hik 2Bk (20) = γ det(R) + O( 2 ) C corr M 1 h K Tr(Bk )=P ik nM nM k=1 These are unfortunate results as they show that the sumP We now relax the problem further by setting Bk = M I, from rate decreases with the number of users. To counter this, we which we conclude that  increase the resources (i.e., number of antennas M ). In the 1 P rest of this paper, we study the scaling of group broadcast RTS ≥ log 1 + CM 1 M (αn) M capacity with the number of antennas when 1) n is fixed, M or using the approximation log(1 + x) = x + O(x2 ), and 2) n = β and 3) M = log n. To simplify the exposition, we will only consider the equal number of users case, i.e. we set assuming equal number of users 1 αk = K ∀k. 1 KM P 1 CM 1 + O( 2 ) RTS ≥ M VI. S CALING WITH M , n FIXED nM nM 1 In this section, we study the scaling of the group broadcast Just like the upper bound, correlation results in a hit det(R) M capacity for large number of antennas M and fixed number of on the lower bound 1 users n. We take the upper bound to be K times the maximum 1 KM P 1 CM det(R) M 1 + O( 2 ) RTS, corr ≥ rate that a single group achieves M nM nM C ≤ K max log(1 + min hi 2B ) (21) 2 It turns out that all the beams behave equally. So the sum-rate is simply M times the rate guaranteed by φ1 or any beam for that matter. As such, we drop the beam dependence for notational convenience.

i

B≥0 Tr(B)≤P



2

K log(1 + min P hi  ) i

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on January 16, 2009 at 12:39 from IEEE Xplore. Restrictions apply.

(22)

AL-NAFFOURI et al.: SCALING LAWS OF MULTIPLE ANTENNA GROUP-BROADCAST CHANNELS

where the second line follows from the fact that B ≤ P I. 2 By the law of large numbers we have that limM→∞ hMi  = 1 w.p.1. Since the minimization is taken over a fixed number 2 of users, we conclude that limM→∞ mini hMi  = 1 w.p.1 and C ≤ K log(1 + P M ) w.p.1 To obtain a lower bound, we use time sharing C ≥ log(1 +

max

B≥0 Tr(B)≤P

min hi 2B )

B≥0 Tr(B)≤P

hi 2B ≥

P 2 n hi  min{M, K }

(23)

from which we conclude that

K M ) w.p.1 (24) n We thus have the following theorem. Theorem 3: Consider the setting of Theorem 2 and assume that the total number of users n is kept fixed. Then as the number of antennas grows to infinity, the group broadcast capacity satisfies the following bounds with probability one C ≥ log(1 + P

≤ C ≤ K log(1 + P M )

VII. S CALING WITH M AND n, M n =β Here we consider the scaling of the upper and lower bounds when both the number of users and antennas grow to infinity while their ratio remains constant M n = β. We rely on time sharing to obtain a lower bound 1 C ≥ K max log(1 + min h∗i Bhi ) i B≥0 Tr(B)≤P K  2 hi  C ≥ log 1 + P min (25) i M P I. We where we have relaxed the problem by setting B = M hi 2 now need to lower bound mini M as M, n → ∞. To this end, define the matrix

Hi = [ h 1

h2

···

h Kn ]



K log(1 +

max

B≥0 Tr(B)≤P

(26)

min hi 2B )

(29)

i

We now turn our attention to upper bound the log argument (the bound maxB≥0 Tr(B)≤P mini h∗i Bhi ≤ P mini hi 2 is too loose). To do so, we will use the dual of the max-min problem derived in Appendix B  max min h∗i Bhi = min P λmax ( βi hi h∗i )  i

βi ≥0,



Now, for fixed n and large M , we have by the law of 2 large numbers limM→∞ mini hMi  = 1 and we can write maxB≥0 Tr(B)≤P mini hi 2B ≥ P K n M. This yields the following lower bound on the sum rate

K n PM)

C

B≥0 Tr(B)≤P

P max min hi 2B ≥ hi 2 n min B≥0 Tr(B)≤P i min{M, K } i

log(1 +

√ √ range [(1 − Kβ)2 , (1 + Kβ)2 ] as shown in [21], [22]. We can thus write   hi 2 ≤ (1 + Kβ)2 (1 − Kβ)2 ≤ lim min n,M→∞ i M Combining this with the time sharing bound (25) yields the desired lower bound on the sum-rate capacity   √ C ≥ log 1 + P (1 − Kβ)2 (28) To obtain an upper bound, we start with the bound

i

To proceed, we need to find a lower bound for P hi 2 maxB≥0 Tr(B)≤P mini hi 2B (the lower bound mini M is too loose). In Appendix A, we show that for every hi in the set {h1 , h2 , . . . , h Kn } max

5035

βi =1

 K P λmax ( hi h∗i ) n i

i

K P λmax (Hi∗ Hi ) n where Hi is as defined in (26). Now, as n, M → ∞ with

(30) (31)

=

Hi Hi∗ n 2

M n K

=

will be confined to the range Kβ, the eigenvalues of 1 1 [(1 − √Kβ )2 , (1 + √Kβ ) ]. We can thus obtain the following upper bound on capacity C ≤ K log(1 + P (1 +

√1 )2 ) β

(32)

From (28) and (32), we see that if we allow the number of antennas to grow linearly with the number of users, we can guarantee a constant sum rate and we can state the following theorem. Theorem 4: Consider the setting of Theorem 2 and assume that the number of antennas M and the number of users n are made large while maintaining a constant ratio M n = β. Then the sum-rate group broadcast capacity will be constant with probability one. More precisely, we have   √ log 1 + P (1 − Kβ)2 ≤ C ≤ K log(1 + P (1 + √1β )2 ) (33) But is it still possible to do so without straining the resources as much? We answer this question in the next section. VIII. S CALING WITH M AND n, M = log n In the first part of the paper, we proved that the group 1 P capacity scales as γ M CM K M 1 . Now for large M, it is easy to nM

prove that CM M and the sum-rate scales as γ P1 . Thus, nM to guarantee a constant rate, we need to set M = log n. To and note that 2

prove this rigorously, we first study the behavior of mini hMi  ∗ 2 2 2 T n  h  · · · h  h diag(Hi Hi ) = 1 2 K for large n (and M ). We use the Chernoff bound in Appendix C to show that Note also that  ∗ 2 ∗ 2 hi 2 λmin (Hi Hi ) ≤ min hi  ≤ λmax (Hi Hi ) ≤ max hi  (27) ∈ [1 − lim P min , 1] =1 (34) l i i i M=log n,n→∞ M M Moreover as n, M → ∞ with M where l .8414, i.e. for M = log n and for large n, n = β (so that n/K = Kβ) the eigenvalues of

Hi∗ Hi M

become uniformly distributed in the

mini

hi 2 M

is a constant.

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on January 16, 2009 at 12:39 from IEEE Xplore. Restrictions apply.

5036

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 7, NO. 12, DECEMBER 2008

A. Lower Bound for Sum-rate Capacity (M = log n)

Note that Hs for different s are i.i.d. Hence, we have bounded X by the minimum of S = αn M i.i.d random variables. In order for the sum-capacity to scale at least like a constant with n, the right side of (37) should scale like a constant as well. Let us denote the cumulative distribution function (CDF) hi 2 of λ ∗ 2 max (HH ) by FW (·), where H is a Gaussian M × M ) log(1 + min hi B ) ≥ log(1 + P min max i i M B≥0 Tr(B)≤P random matrix. Then HH ∗ is a Wishart Matrix [22], and we P I. Or with M = log n and can write which follows by setting B = M 1 M as n → ∞ Pr(X ≥ ) ≤ (1 − FW ( ))S . P log M log M C ≥ log(1 + P H) (35) If X behaves like constant then the above probability should where H ∈ [1 − l , 1]. This lower bound shows that a growth approach one as n tends to infinity. Therefore we should have of M = log n will guarantee a constant capacity because for M (1 − FW ( ))S → 1 (38) M = βn the sum rate is upper bounded by a constant. This log M is summarized in the following theorem. result of [22] we know that FW (t) Theorem 5: Consider the setting of Theorem 2 and assume Using the large deviation 4M−t 2 that the number of antennas M and the number of users n behaves like exp(−M Φ− ( M ; 1)) for t = O(M ), where are made large while maintaining the relationship M = log n. Φ− (x; 1) is defined by We are now ready to derive the lower bound for the sum rate capacity which we obtain through time sharing. Specifically, we have

Then the sum-capacity in such a system scales like a constant and therefore, a constant capacity per user (stream) is achievable in the system. Theorem 5 suggests that if the number of antennas scale logarithmically with the number of user, i.e. M = log n, constant capacity per user is achievable. In the following theorem we show that to obtain a constant rate per user in the system the number of antennas should at least behave like 1 (log n) 2 −(n) for a vanishing function (n). Theorem 6: Consider the setting of Theorem 2 and assume that the number of users n are made large. Then in order to have a sum-capacity that scales like a constant and therefore, a constant capacity per user (stream) is guaranteed, the number of transmit antennas, M , should grow faster than 1 (log n) 2 −(n) , where (n) = logloglogloglogn n . Proof: To obtain a constant capacity per user, the multicast capacity of each group should be at least constant. Consider one of the groups with αn users. Then the multicast capacity is given by log(1 +

max

B≥0,TrB≤P

min h∗i Bhi ) i

Therefore, X = maxB≥0,TrB≤P mini hi B should be constant as we increase n. In Appendix B, we upper bound the above max-min problem by  X ≤ P λmax ( βi hi h∗i ) (36) i

 for any choice of βi ≥ 0 such that i βi = 1. Now suppose we partition the αn users into sets of size M . Then we have S = αn M sets of users. We can obtain an upper bound by 1 for all the users in a considering a uniform choice βi = M particular set s and set βi to zero for users in the remaining S − 1 sets. For this choice of βi , (36) is valid, and we have P λmax (Hs Hs∗ ) M where Hs is the channel matrix between the transmitter and the users in the set s of our partition. Since this is true for each of the S sets, we can write X≤

X≤

P min(λmax (H1 H1∗ ), . . . , λmax (HS HS∗ )) M

(37)

x x2 − )u(x) 4 32 and where u(x) is the step function. Applying the above large deviation bound to our case yields Φ− (x; 1) = (2 log 2 − log(4 − x) −

FW (

M ) ≈ exp(−M 2 log log M ) log M

With this in mind, we conclude that for (38) to hold, we should have that for large n, FW ( logMM ) · S → 0. Alternatively, we require that log S − M 2 log log M ≤ 0. Substituting S =

αn M

in the above inequality we get

log n ≤ (M 2 + 1) log log M (1 + o(1)) This gives an upper bound on the maximum number of supportable users, n. It can be verified that for M scaling like 1 (log n)( 2 −(n)) where (n) = logloglogloglogn n this inequality is not satisfied. Therefore, the number of antennas should grow faster 1 than (log n)( 2 −(n)) with n to guarantee a constant capacity per stream. IX. C ONCLUSION In this paper we considered the problem of multi-group broadcast channels. As its name suggests, in group broadcast there are different groups of user in the system, each group of which is interested in common information. This problem can be a good model for digital multimedia broadcasting in wireless networks that is receiving increasing attention in recent years. We specifically studied the scaling of multigroup broadcast for large number of users. We obtained upper and lower bounds for the sum-rate capacity in the large number of users regime. We showed that the sum rate capacity scales as γP CM 1 . We also quantified the effect of spatial nM correlation on the sum-rate capacity which manifests itself as 1 a hit of det(R) M on the SNR. This is an unfortunate result as it shows that the capacity decreases with the number of users. To go around this, we studied the scaling of the group broadcast capacity with the number of users and antennas. Specifically, two such regimes were studied 1) when the number of antennas is linear in the number of users and 2)

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on January 16, 2009 at 12:39 from IEEE Xplore. Restrictions apply.

AL-NAFFOURI et al.: SCALING LAWS OF MULTIPLE ANTENNA GROUP-BROADCAST CHANNELS

when the number of antennas is logarithmic in the number of users. Both of these regimes guarantee a constant rate per user but the logarithmic rate is more tight as we also proved that the 1 number of antennas should at least behave like (log n)( 2 −) for a constant rate to be maintained. Some extensions of this work would be to evaluate scaling of group broadcast in the presence of multiple antennas at the users’ side and to study the scaling in the wideband case. Another extension would be to study the scaling in the large SNR regime to evaluate the group-broadcast multiplexing gain.

XII. A PPENDIX C

B≥0 Tr(B)≤P

Now for B =

P M I,

P hi 2 min{M, n}

(39)

we have

max

B≥0 Tr(B)≤P

h∗i Bhi

g(Y ) ≤ e−ν(Y −(1−)) which yields the Chernoff bound E[g(Y )] = P (Y ≤ 1 − ) ≤ eν(1−) E[e−νY ]

≥ h∗i B hi

(40)

P hi 2 (41) M which proves part of (39). Now for n ≤ M, let A = {a1 , a2 , . . . , al } be an orthonormal basis for the space spanned by the set G. Then l ≤ n ≤ M. Define the matrix B this time as P B = AA∗ n where

A = a1 a2 · · · al OM×M−l

P(

then B ≥ 0 and Tr(B ) = lP n ≤ P. Moreover, since each hi is spanned by A, it is easy to see that P hi 2 n

max

h∗i Bhi ≥

Now we can tighten the upper bound by choosing the optimum ν, which, upon setting the first derivative to zero, turns out to be ν=M >0 1− and the bound reads P(

P hi 2 n

P (min i

=

max

B ≥ 0, t : Tr(B) ≤ P, t ≤ hi Bh∗ i ∀i

min

 γ, βi ≥ 0 1, i βi = : ∗ γI ≥ i βi hi hi

min 

βi ≥0:

i

βi =1

eM(+log(1−))

hi 2 ≤ 1 − ) = M ≤ =

hi 2 M



hi 2 ≤ 1 − ))n M 1 − (1 − eM(+log(1−)) )n 1 − (1 − n+log(1−) )n 1 − (1 − P (

where the last line follows from the fact that M = log n. For the above probability to vanish as n grows, we require that

lim P (min

n→∞

i

hi 2 ≥ 1 − l ) = 1 M

(44) 2

Now let’s obtain an upper bound for mini hMi  . Employing Chernoff bound again, it is easy to show that for ν ≥ 0

To obtain the dual form of the max-min problem of (29), we introduce the dual variables and write the optimization problem in the following form

=

eM (1 − )M

We can use this to bound the probability P (mini 1 − )

(42)

XI. A PPENDIX B

=

hi 2 ≤ 1 − ) ≤ M =

Let l be the infimum of the set { : + log(1 − ) < −1}, (i.e. l satisfies l + log(1 − l ) = −1 or l .8414). Then,

Combining (40) and (42) yields the desired result.

max min h∗i Bhi B≥0,Tr(B)≤P i

(43)

+ log(1 − ) < −1

so that B≥0 Tr(B)≤P

hi 2 1 ≤ 1 − ) = eν(1−)  M M 1+ ν M

=

h∗i B hi =

and define g(Y ) by  1 if Y ≤ 1 − g(Y ) = 0 if Y > 1 −

Then for ν ≥ 0

Consider the set G = {h1 , h2 , . . . , hn }. We would like to To prove that for every hi ∈ G h∗i Bhi ≥

hi 2 M ,

Let Y =

or

X. A PPENDIX A

max

5037

t

P(

hi 2 ≥ 1 + ) ≤ M =

e−ν(1+) E[eν e−ν(1+)

P(

 λmax ( βi hi h∗i ) i

where the third line follows by first minimizing the cost  ∗ β h over γ. Since γI ≥ i i hi , it readily follows that i  ∗ ), i.e. γ is greater equal to the maximum γ ≥ λmax ( i βi hi hi  eigenvalue of matrix i βi hi h∗i .

hi 2 ≥ 1 + ) ≤ M =

or P (min hi

]

1 ν M M)  M 1+

(1 −

Moreover, the upper bound is tightest for ν = have

γP

hi 2 M

e−M (1 + )M eM(−+log(1+))

hi 2 ≥ 1 + ) ≤ (n(−+log(1+)) )n M

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on January 16, 2009 at 12:39 from IEEE Xplore. Restrictions apply.

(45) (46) We thus

5038

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, VOL. 7, NO. 12, DECEMBER 2008

where we used the fact that n = log M. This probability vanishes provided that − + log(1 + ) < 0 and the infimum for which this is true is u = 0. We can thus write  hi 2 ≤1 =1 (47) lim P min n→∞ i M From (44) and (47), we see that   2 limn→∞ P mini hMi  ∈ [1 − l , 1] = 1

(48)

R EFERENCES [1] B. Hochwald, T. Marzetta, and V. Tarokh, “Multi-antenna channelhardening and its implications for rate feedback and scheduling,” IEEE Trans. Inform., vol. 50, no. 9, pp. 1893–1909, Sept. 2004. [2] E. Telatar, “Capacity of multi-antenna Gaussian channel,” European Trans. Telecommun., vol. 10, pp. 585–595, Nov. 1999. [3] G. J. Foschini and M. J. Gans, “On limits of wireless communications in a fading environment when using multiple antennas,” Wireless Personal Commun., vol. 6, pp. 311–335, Mar. 1998. [4] A. Goldsmith, S. A. Jafar, N. Jindal, and S. Vishwanath, “Capacity limits of MIMO channels,” IEEE J. Select. Areas. Commun., vol. 21, no. 5, pp. 684–702, June 2003. [5] S. Vishwanath, N. Jindal, and A. Goldsmith, “Duality, achievable rates and sum rate capacity of Gaussian MIMO broadcast channel,” IEEE Trans. Inform. Theory, vol. 49, no. 10, pp. 2658–2668, 2003. [6] P. Viswanath and D. N. Tse, “Sum capacity of the vector Gaussian broadcast channel and downlink-uplink duality,” IEEE Trans. Inform. Theory, vol. 49, no. 8, pp. 1912–1921, Aug. 2003. [7] H. Weingarten, Y. Steinberg, and S. Shamai, “The capacity region of the gaussian MIMO broadcast channel,” in Proc. International Symposium on Information Theory (ISIT), 2004. [8] Q. H. Spencer, C. B. Peel, A. L. Swindlehurst, and M. Haardt, “An introduction to the multi-user MIMO downlink,” IEEE Commun. Mag., vol. 42, no. 10, pp. 60–67, Oct. 2004. [9] M. Sharif and B. Hassibi, “Scaling laws of sum rate using timesharing, DPC, and beamforming for MIMO broadcast channels,” in Proc. International Symposium on Information Theory (ISIT), 2004. [10] M. Sharif and B. Hassibi “On the capacity of MIMO broadcast channel with partial side information,” IEEE Trans. Inform. Theory, vol. 51, no. 2, Feb. 2005. [11] P. Viswanath, D. N. Tse, and R. Laroia, “Opportunistic beamforming using dump antennas,” IEEE Trans. Inform. Theory, vol. 48, no. 6, pp. 1277–1294, June 2002. [12] T. Yoo and A. Goldsmith, “On the Optimality of multi-antenna broadcast scheduling using zero-forcing beamforming,” IEEE J. Select. Areas. Commun., vol. 24, no. 3, pp. 528–541, Mar. 2006. [13] G. A. Jones, J. M. Defilippis, H. Hoffmann, and E. A. Williams, “Digital television station and network implementation,” Proc. IEEE, vol. 94, no. 1, pp. 22–36, Jan. 2006. [14] U. Reimers, “Digital video broadcasting,” IEEE Commun. Mag., vol. 36, no. 6, pp. 104–110, June 1998. [15] S. Pekowsky and A. Andorfer, “Multimedia data broadcasting strategies,” IEEE Commun. Mag., vol. 39, no. 4, pp. 138–145, Apr. 2001. [16] A. Khisti, U. Erez, A. Lapidoth, G. Wornell, “Carbon copying onto dirty paper,” submitted to IEEE Trans. Inform. Theory. [17] Y. Steinberg and S. Shamai, “Achievable rates for the broadcast channel with states known at the transmitter,” in Proc. International Symposium on Information Theory (ISIT), pp. 2184–2188, Sept. 2005. [18] A. Khitsi, “Coding techniques for multicasting,” Master’s thesis, MIT, Cambridge, MA, 2004. [19] N. Jindal and Z.-Q. Luo, “Capacity limits of multiple antenna multicast,” in Proc. International Symposium on Information Theory (ISIT), Seattle, USA, July 2006. [20] S. Vishwanath, N. Jindal, and A. Goldsmith, “Duality, achievable rates and sum rate capacity of Gaussian MIMO broadcast channel,” IEEE Trans. Inform. Theory, vol. 49, no. 10, pp. 26582668, 2003.

[21] V. A. Marcenko and L. A. Pastur, “Distribution of eigenvalues for some sets of random matrices,” Math USSR Sbornik, vol. 1, pp. 457-483, 1967. [22] P. Vivo, S. N. Majumdar and O. Bohigas, “Large deviations of the maximum eigenvalue in Wishart random matrices,” J. Phys. A: Math. Theor., vol. 40, pp. 4317–4337, 2007. Tareq Y. Al-Naffouri obtained his M.S. degree in electrical engineering from Georgia Institute of Technology, Atlanta, in 1998, and his Ph.D. in Electrical Engineering from Stanford University, CA in 2004. He was a visiting scholar at California Institute of Technology, Pasadena, CA, in 2005 and during the summer of 2006, and a Fulbright scholar at the University of Southern California in 2008. He has held internship positions at NEC Research Labs, Tokyo, Japan in 1998, Adaptive Systems Lab, University of California at Los Angeles in 1999, National Semiconductor, Santa Clara, CA, in 2001 and 2002, and Beceem Communications Santa Clara, CA, in 2004. Dr Al- Naffouri is currently an Assistant Professor at the Electrical Engineering Department, King Fahd University of Petroleum and Minerals, Saudi Arabia. His research interests lie in the areas of adaptive and statistical signal processing and their applications to wireless communications and in multiuser information theory. He has over 50 publications in Journal and Conference Proceedings. Dr AlNaffouri is the recipient of a 2001 best student paper award at the IEEEEURASIP Workshop on Nonlinear Signal and Image Processing (NSIP) 2001 for his work on adaptive filtering analysis.

Amir F. Dana (S ’98) was born in Tehran, Iran, in 1979. He received the B.S. degree in electrical engineering from the Sharif University of Technology, Tehran, Iran, in 2001, and the M.S. and Ph.D. degrees in electrical engineering from the California Institute of Technology, Pasadena, in 2002 and 2006, respectively. He is currently with Qualcomm, Inc., San Diego, CA. His research interests are in the areas of information theory, and wireless communications. In particular, he has been working on power efficiency of sensor and ad hoc networks, network coding for wireless networks, and capacity of broadcast channels.

Babak Hassibi was born in Tehran, Iran, in 1967. He received the B.S. degree from the University of Tehran in 1989, and the M.S. and Ph.D. degrees from Stanford University in 1993 and 1996, respectively, all in electrical engineering. From October 1996 to October 1998 he was a research associate at the Information Systems Laboratory, Stanford University, and from November 1998 to December 2000 he was a Member of the Technical Staff in the Mathematical Sciences Research Center at Bell Laboratories, Murray Hill, NJ. Since January 2001 he has been with the department of electrical engineering at the California Institute of Technology, Pasadena, CA., where he is currently an associate professor. He has also held shorttem appointments at Ricoh California Research Center, the Indian Institute of Science, and Linkoping University, Sweden. His research interests include wireless communications, robust estimation and control, adaptive signal processing and linear algebra. He is the coauthor of the books Indefinite Quadratic Estimation and Control: A Unified Approach to H2 and H1 Theories (New York: SIAM, 1999) and Linear Estimation (Englewood Cliffs, NJ: Prentice Hall, 2000). He is a recipient of an Alborz Foundation Fellowship, the 1999 O. Hugo Schuck best paper award of the American Automatic Control Council, the 2002 National Science Foundation Career Award, the 2002 Okawa Foundation Research Grant for Information and Telecommunications, the 2003 David and Lucille Packard Fellowship for Science and Engineering and the 2003 Presidential Early Career Award for Scientists and Engineers (PECASE). He has been a Guest Editor for the IEEE T RANSACTIONS ON I NFORMATION T HEORY special issue on space-time transmission, reception, coding and signal processing was an Associate Editor for Communications of the IEEE T RANSACTIONS ON I NFORMATION T HEORY during 2004-2006, and is currently an Editor for the journal F OUNDATIONS AND T RENDS IN I NFORMATION AND C OMMUNICATION.

Authorized licensed use limited to: CALIFORNIA INSTITUTE OF TECHNOLOGY. Downloaded on January 16, 2009 at 12:39 from IEEE Xplore. Restrictions apply.

Suggest Documents