Hindawi Publishing Corporation EURASIP Journal on Advances in Signal Processing Volume 2011, Article ID 575706, 12 pages doi:10.1155/2011/575706
Research Article Frequency-Domain Block Signal Detection with QRM-MLD for Training Sequence-Aided Single-Carrier Transmission Tetsuya Yamamoto, Kazuki Takeda, and Fumiyuki Adachi Department of Electrical and Communication Engineering, Graduate School of Engineering, Tohoku University, 6-6-05 Aza-Aoba, Aramaki, Aoba-ku, Sendai 980-8579, Japan Correspondence should be addressed to Tetsuya Yamamoto,
[email protected] Received 15 April 2010; Revised 7 July 2010; Accepted 18 August 2010 Academic Editor: D. D. Falconer Copyright © 2011 Tetsuya Yamamoto et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. A frequency-domain block signal detection (FDBD) using QR decomposition with M-algorithm maximum likelihood detection (QRM-MLD) can significantly improve the bit error rate (BER) performance of the cyclic prefix inserted single-carrier (CP-SC) block transmission in a frequency-selective fading channel. However, the use of a fairly large number of the surviving paths is required in the M-algorithm, leading to high computational complexity. In this paper, we propose the use of the training sequenceaided SC (TA-SC) block transmission instead of CP-SC block transmission. We show that TA-SC using FDBD with QRM-MLD can achieve the BER performance close to the matched-filter (MF) bound while reducing the computational complexity compared to CP-SC.
1. Introduction In next-generation mobile communication systems, broadband data services are demanded. Since the mobile wireless channel is composed of many propagation paths with different time delays, the channel becomes severely frequency selective as the transmission data rate increases. When the single-carrier (SC) transmission without any equalization technique is used, the bit error rate (BER) performance significantly degrades due to strong intersymbol interference (ISI) [1]. The computational complexity of the maximum likelihood- (ML-) based equalization, that is, ML sequence estimation (MLSE), depends on the number of propagation paths and becomes extremely high in a severely frequencyselective channel [2]. Therefore, several suboptimal linear detection schemes, such as time-domain and frequencydomain linear equalization schemes, have been proposed to reduce the computational complexity [3–5]. A simple onetap frequency-domain equalization based on the minimum mean square error criterion (MMSE-FDE) can significantly improve the BER performance of cyclic prefix inserted SC (CP-SC) block transmission in a frequency-selective fading
channel. However, a big performance gap from the matchedfilter (MF) bound still exists due to the presence of residual ISI after FDE. To narrow the performance gap, an MMSEFDE combined with iterative ISI cancellation was proposed [6–8]. However, the achievable BER performance is still a few dB away from the MF bound, particularly when high-level data modulation (e.g., 16QAM and 64QAM) is used. Near ML-based reduced complexity time-domain equalization schemes have been proposed in [9, 10]. Recently, we proposed a near ML-based reduced complexity frequency-domain equalization scheme, which is called frequency-domain block signal detection (FDBD) using QR decomposition with M-algorithm ML detection (QRM-MLD), for the reception of CP-SC signals transmitted over a frequency-selective channel [11]. QRM-MLD was originally proposed as a signal detection scheme for the multi-input multi-output (MIMO) spatial multiplexing in [12]. In FDBD with QRM-MLD, QR decomposition is applied to a concatenation of the propagation channel and discrete Fourier transform (DFT). We showed [11] that FDBD with QRM-MLD can significantly improve the BER performance when compared to the MMSE-FDE and achieve
N c symbols
TS
Data symbols(1)
Ng symbols
Coded data
DFT block
d
y
s
Y
FDBD with QRM-MLD
Data symbols(0)
Data modulation
TS
Nc + Ng -point DFT
EURASIP Journal on Advances in Signal Processing
+TS
2
To decoder
(a) TA-SC. CP(0) Ng symbols
Data symbols(0)
CP(1)
Figure 2: TA-SC transmission system model.
Data symbols(1)
N c symbols DFT block (b) CP-SC.
Figure 1: Block structure.
the BER performance close to the MF bound even if high level data modulation is used. However, the use of a fairly large number M of surviving paths in the M-algorithm is required, leading to high computational complexity. If smaller M is used, the achievable BER performance degrades because of increased probability of removing the correct path at early stages. This probability greatly affects the achievable BER performance of FDBD with QRM-MLD. In this paper, we will show that the use of training sequence-aided SC (TA-SC) block transmission [13, 14] instead of CP-SC block transmission can significantly reduce the probability of removing the correct path at early stages in QRM-MLD and hence improve the achievable BER performance of FDBD with QRM-MLD. In TA-SC, CP is replaced by a known training sequence (TS), which is a part of DFT block at the receiver, and TS in the previous block acts as CP in the present block. When TA-SC is used, since the symbols to be detected at early stages belong to the known TS, the achievable BER performance of FDBD with QRMMLD can be improved. The performance improvement of TA-SC over CP-SC when using FDBD with QRM-MLD is confirmed by computer simulation. The remainder of this paper is organized as follows. In Section 2, TA-SC using FDBD with QRM-MLD is presented. In Section 3, we will show by computer simulation that TASC transmission using FDBD with QRM-MLD can achieve BER performance close to the MF bound while reducing the number of surviving paths when compared to CPSC. We will also discuss the computational complexity of FDBD with QRM-MLD and show that TA-SC can reduce the overall complexity of FDBD with QRM-MLD to achieve almost the same performance as CP-SC. Section 4 offers some concluding remarks.
2. TA-SC Using FDBD with QRM-MLD 2.1. TA-SC versus CP-SC. The TA-SC block structure is illustrated and compared to CP-SC transmission in Figure 1. CP is replaced by TS. In order to let TS to play the role of CP, DFT size at the receiver must be the sum of number of useful data symbols and the TS length. In the case of CPSC, the data symbol block length and the CP length are, respectively, denoted by Nc and Ng . For TA-SC, to keep the
u(1) u(0)
0
d(3) d(2) d(1) d(0)
1
Stage 0 Stage 1 Stage 2 Stage 3 Stage 4 Stage 5
Surviving path Path having the smallest path metric at the last stage
Figure 3: An example of QRM-MLD (M = 3) with BPSK when Nc = 4 and Ng = 2.
same data rate as CP-SC, the data symbol block length and the TS length need to be set to Nc and Ng , respectively. The difference between TA-SC and CP-SC is the size of DFT to be used at the receiver; the DFT size is Nc + Ng symbols for TA-SC while it is Nc symbols for CP-SC. 2.2. TA-SC Signal Transmission Model. The TA-SC transmission model using FDBD with QRM-MLD is illustrated in Figure 2. Throughout the paper, the symbol-spaced discrete time representation is used. At the transmitter, a binary information sequence to be transmitted is data-modulated, and then the data-modulated symbol sequence is divided into a sequence of symbol blocks of Nc symbols each. The data symbol block can be expressed using the vector form as d = [d(0), . . . , d(n), . . . , d(Nc − 1)]T . Before the transmission, the TS of length Ng symbols is appended at the end of each block. The block s to be transmitted is expressed using the vector form as
T
s = s(0), . . . , s Nc + Ng − 1
T = d(0), . . . , d(Nc − 1), u(0), . . . , u Ng − 1 =
(1)
d , u
where u = [u(0), . . . , u(n), . . . , u(Ng − 1)]T denotes the TS vector which is identical for all blocks. We assume a symbol-spaced frequency-selective fading channel composed of L propagation paths with different time delays. The channel impulse response h(τ) is given by h(τ) =
L −1
hl δ(τ − τl ),
(2)
l=0
where hl and τ l are, respectively, the complex-valued path gain with E[ Ll=−01 |hl |2 ] = 1 and the time delay of the lth
EURASIP Journal on Advances in Signal Processing
3
path. The lth path time delay is assumed to be l symbols, that is, τl = l. The received signal block y(TA) = [y (TA) (0), . . . , y (TA) (t), . . . , y (TA) (Nc + Ng − 1)]T can be expressed using the vector form as ⎡
y(TA)
⎢ ⎢ ⎢
2Es ⎢ ⎢ ⎢ = Ts ⎢ ⎢ ⎢ ⎣
hL−1 · · ·
h1 .. .
hL−1
⎤
h0 h1
h0
..
h1 .. .
hL−1 0
⎡
.
0 .. ..
. ..
. . hL−1 · · · h1 h0
⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦
y
=
2Es (TA) h s + n(TA) , Ts
h
(TA)
h0
⎢ ⎢ ⎢ h1 h0 ⎢ ⎢ . ⎢ .. h1 h0 ⎢ ⎢ .. ⎢ . . h1 . . =⎢ ⎢hL−1 ⎢ .. ⎢ ⎢ h . L−1 ⎢ ⎢ ⎢ hL−1 ⎢ ⎣ .
..
0
(4)
⎤
hL−1 ..
···
1
e− j2π(1×1/J) .. .
···
e− j2π(1×(J −1)/J) .. .
− j2π((J −1)×1/J)
..
.
⎤ ⎥ ⎥ ⎥ ⎥. ⎥ ⎦
(7)
· · · e− j2π((J −1)×(J −1)/J)
(7) Due to the circulant property of h(TA) , we have [15] H
.
⎥ ⎥ ⎥. ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦
h0 h1 .. .
..
(TA)
Y
=
.
(9)
2Es (TA) H s + N(TA) , Ts
(TA)
2.3. FDBD with QRM-MLD. The conditional joint probability density function (pdf), p(Y(TA) | s), of Y(TA) for the given s can be given, from (9), as
=
1 2πσ 2
(Nc +Ng /2)
⎛ ⎞ (TA) 2 (TA) ⎟ Y − 2E /T H s s s ⎜ exp⎝− ⎠, 2
2σ
(10) where σ 2 = N0 /Ts . The MLD is represented, from (10), as
h0
2Es (Nc +Ng ) (TA) F h s + F(Nc +Ng ) n(TA) , Ts
2Es (TA) (Nc +Ng ) H F s + N(TA) Ts
= H(TA) F(Nc +Ng ) and N(TA) = [N (TA) (0), . . . , where H N (TA) (k), . . . , N (TA) (Nc + Ng − 1)]T are, respectively, the equivalent channel matrix and the frequency-domain noise vector.
(5)
=
⎥ ⎥
hL−1 ⎥ ⎥
0
where H (TA) (k) = Ll=−01 hl exp(− j2πkτl /(Nc + Ng )), k = 0, 1, . . . Nc +Ng −1, and (·)H is the Hermitian transpose. Using (8), (6) can be rewritten as
p Y(TA) | s
(TA)
d
Y(TA) = F(Nc +Ng ) y(TA)
1 e
h1 .. ⎥ ⎥ . ⎥
At the receiver, (Nc + Ng )-point DFT is applied to transform the received signal block into the frequency-domain signal vector Y(TA) = [Y (TA) (0), . . . , Y (TA) (k), . . . , Y (TA) (Nc + Ng − 1)]T . Y(TA) is expressed as
=
⎣
1
(8)
where h(TA) is the (Nc + Ng ) × (Nc + Ng ) channel impulse response matrix, given as ⎡
1 ⎢ ⎢ 1 ⎢1 = ⎢. J ⎢ ..
≡ H(TA) ,
where Es and Ts are, respectively, the symbol energy and duration and n(TA) = [n(TA) (0), . . . , n(TA) (t), . . . , n(TA) (Nc + Ng − 1)]T is the noise vector. The tth element, n(TA) (t), of n(TA) is the zero-mean additive white Gaussian noise (AWGN) having the variance 2N 0 /Ts with N0 being the onesided noise power spectrum density. Since the identical TS is used for all blocks, the received signal block can be rewritten, similar to CP-SC transmission, as
⎡
= diag H (TA) (0), . . . , H (TA) (k), . . . , H (TA) Nc + Ng − 1
⎥ ⎥ ⎥ ⎥ + n(TA) , ⎥ ⎥ ⎦
(3)
(TA)
F(J)
F(Nc +Ng ) h(TA) F(Nc +Ng )
⎤
u Ng − L + 1 ⎢ ⎢ .. ⎢ . ×⎢ ⎢ ⎢ u Ng − 1 ⎣ s
where F(J) is the DFT matrix of size J × J, given as
(6)
2 2Es (TA) d (TA) , = arg min Y − H u Ts d∈X Nc
(11)
where d is the symbol-candidate vector. MLD requires a prohibitively high computational complexity. QRM-MLD [12], which was proposed for the signal detection for MIMO multiplexing, can achieve the BER performance near MLD with quite reduced complexity. In this paper, we apply QRMMLD to TA-SC. QRM-MLD consists of two steps; QR decomposition and M-algorithm. In the case of SC transmissions, the signal-tointerference plus noise power ratio (SINR) is identical for
4
EURASIP Journal on Advances in Signal Processing
all symbols in a block, and hence no ordering is necessary in the QR decomposition. First, the QR decomposition is (TA) applied to the equivalent channel matrix H to obtain (TA) (TA) (TA) (TA) H =Q R , where Q is an (Nc + Ng ) × (Nc + Ng ) matrix satisfying Q(TA)H Q(TA) = I (I is the identity matrix) and R(TA) is an (Nc + Ng ) × (Nc + Ng ) upper triangular matrix. The transformed frequency-domain received signal T (TA) = [Y (TA) (0), . . . , Y (TA) (m), . . . , Y (TA) (Nc + Ng − 1)] is Y obtained as (TA) Y
(TA) H
=Q
= ⎡
(TA)
Y
=
⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ×⎢ ⎢ ⎢ ⎢ ⎢ ⎢ ⎣
⎡
2Es (TA) H R s + Q(TA) N(TA) Ts
(TA) (TA) × RNc −2,Nc −2 d(Nc − 2) + RNc −2,Nc −1 d(Nc − 1) 2
R(TA) 0,Nc −1
···
..
.. .
.
R(TA) 0,Nc .. .
···
..
.
(TA) R(TA) Nc −1,Nc −1 RNc −1,Nc · · ·
R(TA) Nc ,Nc
..
0 d(0) .. .
···
⎤
.
.. .
R(TA) Nc −1,Nc +Ng −1 R(TA) Nc ,Nc +Ng −1 .. . R(TA) Nc +Ng −1,Nc +Ng −1
⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦
Ng −1 2Es (TA) RNc −2,Nc +i u(i) Ts i=0
−
⎤
R(TA) 0,Nc +Ng −1
⎥ ⎢ ⎥ ⎢ ⎥ ⎢ ⎥ ⎢ ⎢ d(Nc − 1) ⎥ H ⎥ ⎢ × ⎢ u(0) ⎥ + Q(TA) N(TA) . ⎥ ⎢ ⎥ ⎢ .. ⎥ ⎢ ⎥ ⎢ ⎣ . ⎦ u Ng − 1
(14)
2Es (TA) (TA) + Y (Nc − 1) − R d(Nc − 1) Ts Nc −1,Nc −1
−
2
Ng −1 2Es (TA) RNc −1,Nc +i u(i) . Ts i=0
Similar to the n = Ng th stage, M surviving paths are selected from M · X paths. This procedure is repeated until the last stage (n = Nc + Ng − 1). The path metric at the nth stage (n = Ng , Ng + 1, . . . , Nc + Ng − 1) is calculated using en =
n−Ng
Y (TA) (Nc − 1 − n )
n =0
(12) (TA) can be obtained by From (12), the ML solution d searching for the best path having the minimum Euclidean distance in the tree diagram composed of Nc + Ng stages. However, in TA-SC, the Nc , Nc + 1, . . . , (Nc + Ng − 1)th (TA) contain the training symbols only, and elements of Y therefore only one path exists at the n = 0, 1, . . . , (Ng − 1)th stages and the M-algorithm [16] can be started from the n = Ng stage. An example of the QRM-MLD is shown in Figure 3 assuming Nc = 4 and N g = 2, binary phase shift keying (BPSK) modulation, and M = 3. In the n = Ng th stage, all possible symbol-candidates for the last symbol d(Nc − 1) in a data symbol block are generated (the number of all possible symbol-candidates is X for X-QAM). The path metric based on the squared Euclidean distance between Y (TA) (Nc − 1) and each symbol-candidate is calculated as
en=Ng
en=Ng +1
2Es (TA) (Nc − 2) − = Y Ts
2Es Ts
R(TA) 0,0
where d(Nc − 1) is the symbol-candidate for d(Nc − 1). Next, M (M ≤ X) paths having the smallest path metric are selected as surviving paths. In the next stage (n = Ng + 1), there are a total of X branches for d(Nc − 2) leaving from each selected surviving path. Therefore, there are totally M ·X possible paths for the two symbol sequence of d(Nc − 1) and d(Nc − 2). The path metrics are calculated for all possible M · X paths using
2Es (TA) (TA) (Nc − 1) − = Y R d(Nc − 1) Ts Nc −1,Nc −1 2 (13)
Ng −1 2Es (TA) − R u(i) , Ts i=0 Nc −1,Nc +i
−
−
n 2Es (TA) R d(Nc − 1 − i) Ts i=0 Nc −1−n ,Nc −1−i
(15)
2
Ng −1 2Es (TA) RNc −1−n ,Nc +i u(i) . Ts i=0
The most possible transmitted symbol sequence is found by tracing back the path with the smallest path metric at the last stage (n = Nc + Ng − 1). QRM-MLD requires X {1 + M(Nc − 1)} times squared Euclidean distance calculation, which significantly smaller than the original MLD that requires X Nc times squared Euclidean distance calculation. 2.4. Advantage of TA-SC over CP-SC. The received signal power associated with the symbol d(Nc −1−i) at the nth stage (n − Ng ≥ i, n = Ng , Ng + 1, . . . , Nc + Ng − 1) is the sum of the squared values of the (Nc − 1), (Nc − 2), . . . , (Nc − 1 − i)th elements in the (Nc − 1 − i)th column of R. In the case of SC transmission, the channel impulse response matrix is circulant, and therefore the magnitude of a lower right element of R drops with large probability [17]. Therefore, the probability of removing the correct path is greater at early stages.
EURASIP Journal on Advances in Signal Processing
5
In the case of CP-SC transmission, the transformed (CP) = [Y (CP) (0), frequency-domain received signal vector Y T (CP) ...,Y (Nc − 1)] is obtained as [11]
(CP) = Y
=
R(CP) ⎢ 0,0
⎢ 2Es ⎢ ⎢ ⎢ Ts ⎢ ⎢ ⎣
⎡ ⎢ ⎢ ⎢ ×⎢ ⎢ ⎣
d(0) d(1) .. .
R(CP) 0,1
Channel code Transmitter
2Es (CP) H R d + Q(CP) N(CP) Ts ⎡
Table 1: Computer simulation condition.
···
R(CP) 0,Nc −1
R(CP) ··· 1,1 .. .
R(CP) 1,Nc −1 .. . R(CP) Nc −1,Nc −1
Data modulation Data symbol block length TS and CP lengths
⎤ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎥ ⎦
Fading type
(16)
Channel Power delay profile
⎤ ⎥ ⎥ H ⎥ ⎥ + Q(CP) N(CP) . ⎥ ⎦
d(Nc − 1) The lower right elements of R(CP) are relevant to the selection of the surviving path. Since the received signal power is lower at early stages, the probability of removing the correct path at early stages may increase when smaller M is used. The probability of removing the correct path at early stages affects significantly the achievable BER performance of FDBD with QRM-MLD. A fairy large M must be used to achieve the BER performance close to the MF bound. For example, M = 256 is necessary for the case of Nc = 64 and 16QAM data modulation [11]. The use of larger M increases the computational complexity. In the case of TA-SC, it can be understood from (12) that the lower right elements of R(TA) are associated with TS, and therefore they are not relevant to the selection of the surviving path. The M-algorithm can start from the n = Ng th stage and therefore, the probability of removing the correct path at early stages can be significantly reduced even if small M is used. This suggests that smaller M can be used for TASC than CP-SC.
3. Computer Simulation The simulation condition is summarized in Table 1. The data symbol block length is Nc = 64 for both TA- and CP-SC and the TS length of TA-SC is Ng = 16 which is equal to the CP length of CP-SC. A partial sequence taken from a PN sequence with a repetition period of 4095 bits is used as TS. The same data modulation is used for TS and useful data. The channel is assumed to be a frequency-selective block Rayleigh fading channel having symbol-spaced L-path uniform power delay profile. Ideal channel estimation is assumed. 3.1. Average BER Performance. The BER performance of TASC using FDBD with QRM-MLD is plotted in Figure 4 as a function of average received bit energy-to-noise power spectrum density ratio Eb /N0 (= (Es /N0 )(1 + Ng /Nc ) /log2 X) for M = 1, 4, and 16. For comparison, the BER performance of CP-SC [11] and the MF bound [18] are also plotted. It can be seen form Figure 4 that when small M is used,
Time delay Channel estimation
Receiver
Turbo code (R = 1/2, 3/4, 8/9, 1) QPSK, 16QAM, 64QAM Nc = 64 Ng =16 Frequency-selective block Rayleigh L = 2 ∼16 path uniformpower delay profile τ l =l (l=0 ∼ L−1) Ideal
the achievable BER performance of CP-SC degrades. On the other hand, TA-SC can achieve better BER performance even if small M is used. The required value of M in TA-SC is 16, 16, and 4 for QPSK, 16QAM, and 64QAM, respectively, to achieve the BER performance similar to CP-SC using M = 256. The reason for this is discussed in the following. Figure 5 shows the pdf of the received signal power PNc −1,n associated with the symbol d(Nc − 1) at the nth stage, where PNc −1,n is given by ⎧n−N 2 ⎪ g ⎪ (TA) ⎪ ⎪ RNc −1−i,Nc −1 ⎪ ⎨
PNc −1,n = ⎪
i=0 n
⎪ ⎪ ⎪ ⎪ ⎩
i=0
2 (CP) RNc −1−i,Nc −1
for TA-SC (17) for CP-SC.
It is seen from Figure 5(a) that when CP-SC is used, the probability that the received signal power drops is high at early stages. Therefore, the probability of removing the correct path at early stages increases when smaller M is used. This is shown in Figure 6 which plots the probability of removing the correct path at the nth stage (n = Ng , Ng +1, Ng +2 for TA-SC and n = 0, 1, 2 for CP-SC) when Eb /N0 = 10 dB and 16QAM is used. The use of larger M can reduce the probability of removing the correct path and hence improve the achievable BER performance; however, the computational complexity increases. The computational complexity of FDBD with QRM-MLD will be discussed in the next subsection. In the case of TA-SC, the lower right elements of R are not used in QRM-MLD. Therefore, the probability that the received signal power at early stages drops is very low (see Figure 5(b)). As a consequence, the probability of removing the correct path at early stages is reduced. This is clearly seen in Figure 6. Figure 7 plots the required Eb /N0 for achieving BER = 10−4 as a function of M. For comparison, the required Eb /N0 for the MF bound is also plotted. In the case of CP-SC, the required value of M to achieve the BER performance close to the MF bound is 64 for QPSK and 256 for 16QAM and 64QAM. However, in the case of TA-SC, much smaller M is required, that is, M = 8 for QPSK and 16 for 16QAM and
EURASIP Journal on Advances in Signal Processing 10−1
10−1
10−2
10−2
Average BER
Average BER
6
10−3
MF bound
10−4
10−5
10−4
QPSK Nc = 64 Ng = 16 L = 16-path 0
5
10−3
10 15 Average received Eb /N0 (dB)
20
10−5
M=4 M = 16 M = 256 (CP-SC only)
TA-SC CP-SC M=1
MF bound 16QAM Nc = 64 Ng = 16 L = 16-path 5
10
15 20 Average received Eb /N0 (dB) M=4 M = 16 M = 256 (CP-SC only)
TA-SC CP-SC M=1
(a) QPSK (X = 4)
25
(b) 16QAM (X = 16)
10−1
Average BER
10−2
10−3
10−4
10−5
MF bound 64QAM Nc = 64 Ng = 16 L = 16-path 10
15 20 25 Average received Eb /N0 (dB)
30
M=4 M = 16 M = 256 (CP-SC only)
TA-SC CP-SC M=1
(c) 64QAM (X = 64)
Figure 4: Average BER performance (uncoded).
64QAM. The performance gap of 1 dB from the MF bound is owing to the insertion of TS and CP. Figure 8 shows the influence of the number L of propagation paths on the required M to reduce the Eb /N0 gap from the MF bound for achieving BER = 10−4 to 1.5, 2.5,
and 3.0 dB for QPSK, 16QAM, and 64QAM, respectively. It can be seen from Figure 8 that the required M increases with L in the case of CP-SC. This is because the number of elements (whose magnitudes likely drop) of R in the lower right positions increases with L [17], and therefore the
EURASIP Journal on Advances in Signal Processing
7
3
3 n=0 n=1
CP-SC transmission with QRM-MLD Nc = 64 Ng = 16 L = 16-path uniform
TA-SC transmission with QRM-MLD Nc = 64 Ng = 16 L = 16-path uniform
n=2
2 Probability
Probability
2
n = 16 n = 32 n = 63 1
0
n = Ng ∼ Ng + 2 n = Ng + 16, Ng + 32, Ng + 63
1
0
1
2
0
0
1 PNc −1,n
PNc −1,n (a) CP-SC
2
(b) TA-SC
Figure 5: Pdf of the received signal power associated with the symbol d(Nc − 1) at the nth stage.
42 Nc = 64 Ng = 16 L = 16-path uniform
0.3
0.2
0.1
Required Eb /N0 for achieving BER= 10−4
Probability of removing correct symbol candidates
0.4
Nc = 64 Ng = 16 L = 16–path uniform
37
32
27 64QAM
22 17
16QAM
12
0
7 M=1
M = 4 M = 16 M = 1
M = 4 M = 16 M = 256
TA-SC TA-SC n = Ng n = Ng + 1 n = Ng + 2
QPSK 1
2
4
8
16 M
32
64
128
265
CP-SC CP-SC n=0 n=1 n=2
CP-SC TA-SC MF bound
Figure 7: Required Eb /N0 for achieving BER = 10−4 .
Figure 6: Probability of removing correct path at nth stage (n = 0 ∼ 2). 16QAM.
probability of removing the correct path at early stages also increases. However, in the case of TA-SC, required M does not almost depend on the number of L.
Below, we examine the transmission performances of coded CP-SC and TA-SC systems. 16QAM is assumed as the data modulation scheme. We employ a rate 1/3 turbo encoder using two (13, 15) recursive systematic convolutional (RSC) component encoders. The two parity sequences from the turbo encoder are punctured to obtain rate-1/2, 3/4,
8
EURASIP Journal on Advances in Signal Processing Table 2: Number of multiplications (uncoded with Nc = 64 and L = 16).
150 Nc = 64 Ng = 16
QPSK 16QAM 64QAM 384 DFT [20] 266240 QR decomposition CP-SC Multiplication of 4096 QH 876214 3505779 Squared Euclidian 218271 distance calculations (M = 256) (M = 256) (M = 256) Total 2453432 9032864 35328512 800 DFT [20] 518400 QR decomposition TA-SC Multiplication of 6400 QH 137120 274304 Squared Euclidian 68504 (M = 4) (M = 2) distance calculations (M = 8) Total 594104 662720 799904
Required M
100
50
M=4 0
2
4
6
8
10
12
14
16
L CP-SC TA-SC QPSK
16QAM 64QAM
Figure 8: Required M as a function of the number L of propagation paths.
and 8/9 turbo codes. Log-MAP decoding with 6 iterations is assumed. The packet length is set to 8 blocks (8Nc symbols) in all simulations. The log likelihood ratio (LLR) is used as the soft-input in the turbo decoder. When FDBD with QRM-MLD is used, however, the LLR values cannot be directly computed, since surviving paths at the last stage do not necessarily contain both 1 and 0 for every coded bit. Therefore, how to estimate reliable LLR values is an important issue for FDBD with QRM-MLD. In our paper, we applied the LLR estimation scheme proposed in [19]. The BER performance of turbo coded TA-SC using FDBD with QRM-MLD is plotted in Figure 9 as a function of average received Eb /N0 (= R(Es /N0 ) (1 + Ng /Nc )/log2 X) for M = 1, 4, and 16. For comparison, the BER performance of CPSC is also plotted. It can be seen form Figure 9 that when small M is used, the achievable BER performance of CP-SC degrades. On the other hand, TA-SC can achieve better BER performance even if small M is used. The required value of M in TA-SC is 1, 16, and 16 for R = 1/2, 3/4, and 8/9, respectively, to achieve the BER performance similar to CPSC using M = 256. 3.2. Complexity. The computational complexities of FDBD with QRM-MLD required for TA-SC and CP-SC are discussed. The complexity here is defined as the number of complex multiply operations. The required number of multiplications is shown in Table 2. First, we discuss the number of multiplications required for the squared Euclidean distance calculations. In FDBD with QRM-MLD, the number of multiplications required for the squared Euclidian distance
calculations is 2X + XM Nn=c −1 1 (n + 2), when M ≤ X. When M > X, it is a bit different from the case of M ≤ X. For example, when M = X 2 , the number of multiplications is (n + 2)X + (n + 3)X 2 + MX Nn=c −2 1 (n + 2). It can be seen from Figure 7 that the required value of M in TA-SC is 8, 4, and 2 for QPSK, 16QAM, and 64QAM, respectively, to achieve the BER performance similar to CP-SC with M = 256 when L = 16 (uncoded case). Therefore, the computational complexity required for the squared Euclidean distance calculations in TA-SC is reduced to about 3.1, 1.6, and 0.8% of that of in CP-SC. Next, we discuss the overall computational complexity, which is the sum of the complexity required for DFT, QR decomposition, multiplication of QH , and the squared Euclidean distance calculation. When the DFT size at a receiver is J, the number of complex multiplications is J 2 for DFT in general (There are also efficient algorithms for DFT [20]), J 3 + J 2 for QR decomposition, and J 2 for the multiplication of QH . In TA-SC, CP is replaced by a known TS, which is a part of DFT block at the receiver, and TS in the previous block acts as CP in the present block as shown in Figure 1. In order to let TS to play the role of CP, DFT size at the receiver must be the sum of data symbol block length and the TS length. In this paper, for TA-SC to keep the same data rate as CP-SC, we have set the data symbol block length and the TS length to be Nc and Ng , respectively. Therefore, DFT requires (Nc + Ng )2 multiplications for the TA-SC case. Furthermore, it also requires large size of equivalent channel matrix H than that of CP-SC (resulting in higher complexity for QR decomposition and multiplication of QH ). However, TASC can reduce significantly the computational complexity required for the squared Euclidean distance calculations as mentioned above. As a result, the overall computational complexity for TA-SC is smaller than that of CP-SC. The overall computational complexity in TA-SC is about 24, 7.4, and 2.3% of that in CP-SC for QPSK, 16QAM, and 64QAM, respectively, when L = 16 (uncoded case).
9
100
100
10−1
10−1
10−2
10−2
Average BER
Average BER
EURASIP Journal on Advances in Signal Processing
10−3
16QAM R = 1/2 turbo coded Nc = 64 Ng = 16 L = 16-path
10−4
10−5 5
10−3
16QAM R = 3/4 turbo coded Nc = 64 Ng = 16 L = 16-path
10−4
10−5
10
15
20
5
10
Average received Eb /N0 (dB) M=4 M = 16 M = 256 (CP-SC only)
TA-SC CP-SC M=1
15
20
Average received Eb /N0 (dB) M=4 M = 16 M = 256 (CP-SC only)
TA-SC CP-SC M=1 (b) R=3/4
(a) R = 1/2 100
Average BER
10−1
10−2
10−3
16QAM R = 8/9 turbo coded Nc = 64 Ng = 16 L = 16-path
10−4
10−5 5
10
15
20
Average received Eb /N0 (dB) M=4 M = 16 M = 256 (CP-SC only)
TA-SC CP-SC M=1 (c) R=8/9
Figure 9: Average BER performance (turbo coded).
3.3. BER Performance Comparison between FDBD with QRMMLD, MMSE-FDE, and FDISIC. Figure 10 compares the BER performances achieved by FDBD with QRM-MLD, MMSE-FDE, and frequency-domain iterative ISI cancellation (FDISIC) [6] when uncoded TA-SC is used. For FDISIC,
the use of three iterations is sufficient (i.e., i = 3) and therefore, only the BER performance curve with i = 3 is plotted. It can be seen from Figure 10 that when 16QAM is used, FDBD with QRM-MLD using M ≥ 2 provides better BER performance than FDISIC using i = 3. When
EURASIP Journal on Advances in Signal Processing 10−1
10−1
10−2
10−2
Average BER
Average BER
10
10−3
10−4
10−5
10−4
MF bound 16QAM Nc = 64 Ng = 16 L = 16-path 5
10
15 20 Average received Eb /N0 (dB)
MMSE-FDE FDISIC (i = 3) FDBD with QRM-MLD M=1
10−3
25
M=2 M=4 M=8 M = 16
10−5
64QAM MF bound Nc = 64 Ng = 16 L = 16-path 10
15
20 25 Average received Eb /N0 (dB)
MMSE-FDE FDISIC (i = 3) FDBD with QRM-MLD M=1
(a) 16QAM (X = 16)
30
M=2 M=4 M=8 M = 16
(b) 64QAM (X = 64)
Figure 10: BER performance comparison between FDBD with QRM-MLD, MMSE-FDE, and FDISIC in uncoded TA-SC.
64QAM is used, FDBD with QRM-MLD can achieve better BER than FDISIC even if M = 1 is used. When 16 (64) QAM is used, FDBD with QRM-MLD using M = 16 can reduce the required Eb /N0 for an average BER = 10−4 by about 2.8(6.8) dB compared to FDISIC using i = 3. FDBD with QRM-MLD requires about 20(80) times higher computational complexity than FDISIC for Nc = 64 and 16(64) QAM. FDBD with QRM-MLD can improve the BER performance at the cost of increased complexity.FDBD with QRM-MLD significantly reduces the complexity compared to the MLD. However, the computational complexity of FDBD with QRM-MLD is still much higher than MMSEFDE and MMSE-FDE with iterative ISI cancellation. This is because QR decomposition and path selection using Malgorithm require high computational complexity. Therefore, further complexity reduction is necessary. This is left as an interesting future research topic. In the case of path selection using M-algorithm, the complexity can be reduced by using adaptive M algorithm [21], which adapts the value of M for each stage based on the respective channel condition. Quadrant detection scheme [22, 23] also can reduce the complexity required for the M-algorithm. In Figure 11, we compare the BER performances of turbo-coded TA-SC using FDBD with QRM-MLD and also using MMSE-FDE. Turbo decoding with 6 iterations is performed after FDBD with QRM-MLD and also after MMSE-FDE. It can be seen form Figure 11 that when R = 3/4 and 8/9, FDBD with QRM-MLD provides much better BER performance than MMSE-FDE. When R = 3/4(8/9) ,
FDBD with QRM-MLD using M = 16 can reduce the required Eb /N0 for an average BER = 10−4 by about 2.5(4.8) dB when compared to MMSE-FDE. However, when R = 1/2, a fairy large M(M ≥ 512) must be used to achieve better BER performance than MMSE-FDE even if TA-SC is used. When smaller M than 256 is used, the LLR estimation error increases and hence, the achievable BER performance of FDBD with QRM-MLD is inferior to that of MMSE-FDE. Joint channel decoding and QRM-MLD can be performed in an iterative fashion (called FDBD with iterative QRM-MLD in this paper) to improve the BER performance of low-rate turbo-coded TA-SC system. However, this paper is intended to show that when using FDBD with QRM-MLD, TA-SC system is superior to the well-known CP-SC system. FDBD with iterative QRM-MLD for coded TA-SC system is left as an interesting future study.
4. Conclusion In this paper, we presented the application of FDBD with QRM-MLD to TA-SC, in which the known TS in the previous block acts as CP in the present block. The known TS is exploited in the M-algorithm to reduce the probability of removing the correct path at an early stages. We showed by computer simulation that the required number of surviving paths in the M-algorithm is greatly reduced in TA-SC. Therefore, the computational complexity required for FDBD with QRM-MLD is greatly reduced. The overall complexity required for FDBD with QRM-MLD in TA-SC is reduced
11
100
100
10−1
10−1
10−2
10−2
Average BER
Average BER
EURASIP Journal on Advances in Signal Processing
10−3
16QAM R = 1/2 turbo coded Nc = 64 Ng = 16 L = 16-path
10−4
10−5 5
10−3
16QAM R = 3/4 turbo coded Nc = 64 Ng = 16 L = 16-path
10−4
10−5
10
15
20
5
10
Average received Eb /N0 (dB) M = 16 M = 64 M = 256 M = 512
MMSE-FDE FDBD with QRM-MLD M=1 M=4
15
20
Average received Eb /N0 (dB) MMSE-FDE FDBD with QRM-MLD M=1 M=4
(a) R = 1/2
M = 16 M = 64 M = 256 M = 512
(b) R = 3/4 100
Average BER
10−1
10−2
10−3
16QAM R = 8/9 turbo coded Nc = 64 Ng = 16 L = 16-path
10−4
10−5 5
10
15
20
Average received Eb /N0 (dB) MMSE-FDE FDBD with QRM-MLD M=1 M=4
M = 16 M = 64 M = 256 M = 512
(c) R = 8/9
Figure 11: BER performance comparison between FDBD with QRM-MLD and MMSE-FDE in turbo-coded TA-SC.
to about 24%, 7.4%, and 2.3% of that in CP-SC for QPSK, 16QAM, and 64QAM, respectively, when Nc = 64 and L = 16 (uncoded case). We also showed that FDBD with QRM-MLD provides better BER performance than FDISIC
when uncoded TA-SC is used, but, at the cost of increased complexity. We showed that when high-rate (R ≥ 3/4) turbocode is used, FDBD with QRM-MLD provides better BER
12 performance than MMSE-FDE. However, when low-rate (R = 1/2) turbo-code is used, a fairy large M (M ≥ 512) must be used to achieve better BER performance than MMSE-FDE even if TA-SC is used. FDBD with iterative QRM-MLD may significantly improve the achievable BER performance. FDBD with iterative QRM-MLD for low-rate turbo-coded TA-SC system is left as an interesting future study. The use of TA-SC can reduce the computational complexity required for the M-algorithm, but still requires high computational complexity in the QR decomposition of the equivalent channel matrix. Another important future study is the further complexity reduction of FDBD with QRM-MLD.
References [1] J. G. Proakis and M. Salehi, Digital Communications, McGrawHill, New York, NY, USA, 5th edition, 2008. [2] G. D. Forney Jr., “Maximum likelihood sequence estimation of digital sequence in the presence of intersymbol interference,” IEEE Transactions on Information Theory, vol. 18, no. 3, pp. 363–378, 1972. [3] N. Al-Dhahir and A. H. H. Sayed, “The finite-length multiinput multi-output MMSE-DFE,” IEEE Transactions on Signal Processing, vol. 48, no. 10, pp. 2921–2936, 2000. [4] D. Falconer, S. L. Ariyavisitakul, A. Benyamin-Seeyar, and B. Eidson, “Frequency domain equalization for single-carrier broadband wireless systems,” IEEE Communications Magazine, vol. 40, no. 4, pp. 58–66, 2002. [5] K. Takeda, T. Itagaki, and F. Adachi, “Joint use of frequencydomain equalization and transmit/receive antenna diversity for single-carrier transmissions,” IEICE Transactions on Communications, vol. E87-B, no. 7, pp. 1946–1953, 2004. [6] K. Takeda, K. Ishihara, and F. Adachi, “Frequency-domain ICI cancellation with MMSE equalization for DS-CDMA downlink,” IEICE Transactions on Communications, vol. E89B, no. 12, pp. 3335–3343, 2006. [7] N. Benvenuto and S. Tomasin, “Block iterative DFE for single carrier modulation,” Electronics Letters, vol. 38, no. 19, pp. 1144–1145, 2002. [8] N. Benvenuto and S. Tomasin, “Iterative design and detection of a DFE in the frequency domain,” IEEE Transactions on Communications, vol. 53, no. 11, pp. 1867–1875, 2005. [9] A. Duel and C. Heegard, “Delayed decision-feedback sequence estimation,” IEEE Transactions on Communications, vol. 37, no. 5, pp. 428–436, 1989. [10] H. C. Myburgh and J. C. Olivier, “Low complexity iterative MLSE equalization in highly spread underwater acoustic channels,” in Proceedings of Oceans ’09 IEEE Bremen; Balancing Technology with Future Needs, Bremen, Germany, May 2009. [11] T. Yamamoto, K. Takeda, and F. Adachi, “Single-carrier transmission using QRM-MLD with antenna diversity,” in Proceedings of the 12th International Symposium on Wireless Personal Multimedia Communications (WPMC ’09), Sendai, Japan, September 2009. [12] K. J. Kim and J. Yue, “Joint channel estimation and data detection algorithms for MIMO-OFDM systems,” in Proceedings of the 36th Asilomar Conference on Signals, System and Computers, vol. 2, pp. 1857–1861, Pacific Grove, Calif, USA, November 2002.
EURASIP Journal on Advances in Signal Processing [13] L. Deneire, B. Gyselinckx, and M. Engels, “Training sequence versus cyclic prefix—a new look on single carrier communication,” IEEE Communications Letters, vol. 5, no. 7, pp. 292–294, 2001. [14] J. Coon, M. Sandell, M. Beach, and J. McGeehan, “Channel and noise variance estimation and tracking algorithms for unique-word based single-carrier systems,” IEEE Transactions on Wireless Communications, vol. 5, no. 6, pp. 1488–1496, 2006. [15] G. H. Golub and C. F. van Loan, Matrix Computations, Johns Hopkins University Press, Baltimore, Md, USA, 3rd edition, 1996. [16] J. B. Anderson and S. Mohan, “Sequential coding algorithms: a survey and cost analysis,” IEEE Transactions on Communications, vol. 32, no. 2, pp. 169–176, 1984. [17] K. Takeda, H. Tomeba, and F. Adachi, “Joint TomlinsonHarashima precoding and frequency-domain equalization for broadband single-carrier transmission,” IEICE Transactions on Communications, vol. E91-B, no. 1, pp. 258–266, 2008. [18] F. Adachi and K. Takeda, “Bit error rate analysis of DS-CDMA with joint frequency-domain equalization and antenna diversity combining,” IEICE Transactions on Communications, vol. E87-B, no. 10, pp. 2991–3002, 2004. [19] W. Shin, H. Kim, M.-H. Son, and H. Park, “An improved LLR computation for QRM-MLD in coded MIMO systems,” in Proceedings of the 66th IEEE Vehicular Technology Conference (VTC ’07), pp. 447–451, Baltimore, Md, USA, SeptemberOctober 2007. [20] D. P. Kolba and T. W. Parks, “A prime factor FFT algorithm using high-speed convolution,” IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. 25, no. 4, pp. 281–294, 1977. [21] H. Kawai, K. Higuchi, N. Maeda, and M. Sawahashi, “Adaptive control of surviving symbol replica candidates in QRM-MLD for OFDM MIMO multiplexing,” IEEE Journal on Selected Areas in Communications, vol. 24, no. 6, pp. 1130–1140, 2006. [22] K. Higuchi, H. Kawai, H. Taoka, N. Maeda, and M. Sawahashi, “Adaptive selection of surviving symbol replica can-didates for quasi-maximum likelihood detection using M-Algorithm with QR-decomposition for OFDM MIMO mul-tiplexing,” IEICE Transactions on Communications, vol. E92-B, no. 4, pp. 1258–1271, 2009. [23] K. Lai and L. Lin, “Low-complexity adaptive tree search algorithm for MIMO detection,” IEEE Transactions on Wireless Communications, vol. 8, no. 7, pp. 3716–3726, 2009.
Photographȱ©ȱTurismeȱdeȱBarcelonaȱ/ȱJ.ȱTrullàs
Preliminaryȱcallȱforȱpapers
OrganizingȱCommittee
The 2011 European Signal Processing Conference (EUSIPCOȬ2011) is the nineteenth in a series of conferences promoted by the European Association for Signal Processing (EURASIP, www.eurasip.org). This year edition will take place in Barcelona, capital city of Catalonia (Spain), and will be jointly organized by the Centre Tecnològic de Telecomunicacions de Catalunya (CTTC) and the Universitat Politècnica de Catalunya (UPC). EUSIPCOȬ2011 will focus on key aspects of signal processing theory and applications li ti as listed li t d below. b l A Acceptance t off submissions b i i will ill be b based b d on quality, lit relevance and originality. Accepted papers will be published in the EUSIPCO proceedings and presented during the conference. Paper submissions, proposals for tutorials and proposals for special sessions are invited in, but not limited to, the following areas of interest.
Areas of Interest • Audio and electroȬacoustics. • Design, implementation, and applications of signal processing systems. • Multimedia l d signall processing and d coding. d • Image and multidimensional signal processing. • Signal detection and estimation. • Sensor array and multiȬchannel signal processing. • Sensor fusion in networked systems. • Signal processing for communications. • Medical imaging and image analysis. • NonȬstationary, nonȬlinear and nonȬGaussian signal processing.
Submissions Procedures to submit a paper and proposals for special sessions and tutorials will be detailed at www.eusipco2011.org. Submitted papers must be cameraȬready, no more than 5 pages long, and conforming to the standard specified on the EUSIPCO 2011 web site. First authors who are registered students can participate in the best student paper competition.
ImportantȱDeadlines: P Proposalsȱforȱspecialȱsessionsȱ l f i l i
15 D 2010 15ȱDecȱ2010
Proposalsȱforȱtutorials
18ȱFeb 2011
Electronicȱsubmissionȱofȱfullȱpapers
21ȱFeb 2011
Notificationȱofȱacceptance SubmissionȱofȱcameraȬreadyȱpapers Webpage:ȱwww.eusipco2011.org
23ȱMay 2011 6ȱJun 2011
HonoraryȱChair MiguelȱA.ȱLagunasȱ(CTTC) GeneralȱChair AnaȱI.ȱPérezȬNeiraȱ(UPC) GeneralȱViceȬChair CarlesȱAntónȬHaroȱ(CTTC) TechnicalȱProgramȱChair XavierȱMestreȱ(CTTC) TechnicalȱProgramȱCo Technical Program CoȬChairs Chairs JavierȱHernandoȱ(UPC) MontserratȱPardàsȱ(UPC) PlenaryȱTalks FerranȱMarquésȱ(UPC) YoninaȱEldarȱ(Technion) SpecialȱSessions IgnacioȱSantamaríaȱ(Unversidadȱ deȱCantabria) MatsȱBengtssonȱ(KTH) Finances MontserratȱNájarȱ(UPC) Montserrat Nájar (UPC) Tutorials DanielȱP.ȱPalomarȱ (HongȱKongȱUST) BeatriceȱPesquetȬPopescuȱ(ENST) Publicityȱ StephanȱPfletschingerȱ(CTTC) MònicaȱNavarroȱ(CTTC) Publications AntonioȱPascualȱ(UPC) CarlesȱFernándezȱ(CTTC) IIndustrialȱLiaisonȱ&ȱExhibits d i l Li i & E hibi AngelikiȱAlexiouȱȱ (UniversityȱofȱPiraeus) AlbertȱSitjàȱ(CTTC) InternationalȱLiaison JuȱLiuȱ(ShandongȱUniversityȬChina) JinhongȱYuanȱ(UNSWȬAustralia) TamasȱSziranyiȱ(SZTAKIȱȬHungary) RichȱSternȱ(CMUȬUSA) RicardoȱL.ȱdeȱQueirozȱȱ(UNBȬBrazil)