Turbo Processing for Joint Channel Estimation, Synchronization, and Decoding in Coded MIMO-OFDM Systems

This paper proposes a turbo joint channel estimation, synchronization, and decoding scheme for coded multiple-input multiple-output orthogonal frequency division multiplexing (MIMO-OFDM) systems. The effects of carrier frequency offset (CFO), sampling frequency offset (SFO), and channel impulse responses (CIRs) on the received samples are analyzed and explored to develop the turbo decoding process and a vector recursive least squares (RLS) algorithm for joint CIR, CFO, and SFO tracking. For burst transmission, with initial estimates derived from the preamble, the proposed scheme can operate without the need for pilot tones during the data segment. Simulation results show that the proposed turbo joint channel estimation, synchronization, and decoding scheme offers fast convergence and low mean squared error (MSE) performance over quasistatic Rayleigh multipath fading channels. The proposed scheme can be used in a coded MIMO-OFDM transceiver in the presence of multipath fading, carrier frequency offset, and sampling frequency offset to provide a bit error rate (BER) performance comparable to that in an ideal case of perfect synchronization and channel estimation over a wide range of SFO values.


Introduction
Coded multiple-input multiple-output orthogonal frequency division multiplexing (MIMO-OFDM) has been intensively explored for broadband communications over multipath-rich, time-invariant frequency-selective channels [1]. Turbo processing has been considered for coded MIMO and MIMO-OFDM systems for performance enhancement [2-5]. In particular, iterative detection and decoding issues in MIMO systems to achieve near-Shannon capacity limit [2] and performance gain [5] were investigated under the assumption of perfect channel estimation and synchronization. Taking into account the effects of imperfect channel knowledge on the system performance, [4] developed a combined iterative detection/decoding and channel estimation scheme to improve the overall performance of MIMO-OFDM systems with perfect synchronization.
Under imperfect synchronization conditions, multicarrier transmissions such as OFDM and MIMO-OFDM are highly susceptible to synchronization errors such as carrier frequency offset (CFO) and sampling frequency offset (SFO) [6-11], especially when high-performance coded systems operate in low signal-to-noise ratio (SNR) regimes. Therefore, estimation of frequency offsets (CFO and SFO) and channel impulse responses (CIRs) is of crucial importance in (coded) MIMO-OFDM systems using coherent detection. So far, most studies on the issue have focused on separate and sequential CFO/SFO and channel estimation [7, 11-14]. More specifically, channel estimation is performed by assuming that perfect synchronization has been established [12-14], even though channel estimation would be degraded by imperfect synchronization and vice versa. In most practical systems (e.g., WiFi, WiMAX), data is transmitted in bursts, and each burst starts with a preamble that contains known training sequences to facilitate the initial synchronization and channel estimation. However, the initial CFO, SFO, and channel estimates are not accurate enough, and the parameters vary over time, so known pilot tones must still be inserted in the data segment of the burst to update and refine the CFO, SFO, and channel estimates. This maintains high system performance at the cost of reduced transmission/bandwidth efficiency. For example, in IEEE 802.11a [15], 4 pilot tones are inserted for every block of 48 data tones, representing an overhead of 8.33%.
Since synchronization and channel estimation are mutually related, joint channel estimation and synchronization would provide better performance [10]. Recently, a few algorithms [8, 16-19] have been proposed for the estimation of CIRs and CFO in uncoded MIMO-OFDM systems, but these algorithms neglect the SFO effect. However, even a very small SFO is likely to degrade the OFDM receiver performance significantly, even when perfect CIR and CFO knowledge is available [20]. Specifically, the SFO induces a sampling delay that drifts linearly in time over an OFDM symbol [21]. Without any SFO compensation, this delay hampers the OFDM receiver as soon as the product of the relative SFO and the number of subcarriers becomes comparable to one [9]. Consequently, OFDM receivers become more vulnerable to the SFO effect as the FFT size increases. For instance, an SFO of 40 ppm can cause a window shift of up to six samples [21] in a burst of 1000 OFDM symbols used in multiband OFDM systems [22]. As another example, with a sampling clock offset of 1 ppm in the DVB-T 2K mode [23], the FFT window moves by one sample roughly every 400 symbols [10].
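The two drift figures quoted above can be reproduced with simple arithmetic. The following is a minimal sketch, assuming that the window drift accumulates linearly as (relative SFO) x (samples per OFDM symbol) x (number of symbols). The symbol lengths are assumptions that are not stated in the text: 165 samples for multiband OFDM (128-point FFT plus 37 prefix/guard samples) and 2560 samples for the DVB-T 2K mode (2048-point FFT with a 1/4 guard interval). The function names are hypothetical.

```python
def sfo_drift_samples(sfo_ppm: float, samples_per_symbol: int, num_symbols: int) -> float:
    """Accumulated sampling-window drift, in samples, after num_symbols symbols."""
    return sfo_ppm * 1e-6 * samples_per_symbol * num_symbols


def symbols_per_sample_slip(sfo_ppm: float, samples_per_symbol: int) -> float:
    """Number of OFDM symbols after which the FFT window has moved by one sample."""
    return 1.0 / (sfo_ppm * 1e-6 * samples_per_symbol)


# Assumed multiband OFDM symbol: 128-point FFT plus 37 prefix/guard samples.
MB_OFDM_SAMPLES = 165
# Assumed DVB-T 2K symbol: 2048-point FFT with a 1/4 guard interval.
DVBT_2K_SAMPLES = 2048 + 512

drift = sfo_drift_samples(40.0, MB_OFDM_SAMPLES, 1000)
slip = symbols_per_sample_slip(1.0, DVBT_2K_SAMPLES)
print(f"MB-OFDM drift over 1000 symbols: {drift:.1f} samples")
print(f"DVB-T 2K: one-sample slip every {slip:.0f} symbols")
```

Under these assumed symbol lengths, the drift lands between six and seven samples and the slip interval just under 400 symbols, in line with the figures cited from [21] and [10].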
Various SFO, CFO, and channel estimation schemes have been investigated. In [24], a correlation-based SFO estimation scheme for MIMO-OFDM systems in the absence of CFO was proposed. Under the assumption of perfect channel estimation, decision-directed (DD) techniques were proposed for joint CFO/SFO estimation and tracking [21] and for phase noise and residual frequency offset compensation [25] in OFDM systems. Unlike [21, 25], which assume perfect channel estimation, maximum likelihood (ML)-based joint CFO and channel estimation schemes using pilot signals in multiuser MIMO-OFDM systems were considered in [18, 19]. An overview of CFO/SFO estimation and compensation schemes using pre-FFT non-data-aided (NDA) acquisition, post-FFT data-aided (DA) acquisition, and post-FFT DA tracking can be found in [6, 26]. However, existing joint channel estimation and synchronization algorithms for coded MIMO-OFDM systems have omitted the SFO in their investigations despite its detrimental effect [9, 10, 20, 21, 24].
In this paper, we propose a joint synchronization, channel estimation, and decoding turbo processing scheme for coded MIMO-OFDM systems in the presence of quasistatic multipath channels, CFO, and SFO. By analyzing the nonlinear interrelation between CFO, SFO, channel responses, and received subcarriers, we develop an iterative vector recursive least-squares (RLS)-based joint CIR, CFO, and SFO tracking scheme that can be incorporated in the turbo processing between the MIMO-demapper and the soft-input soft-output (SISO) decoder of the coded MIMO-OFDM receiver. Conceptually, more accurate estimates of CFO, SFO, and CIR can be obtained by using more reliably detected data, and these estimates in turn enhance the MIMO-demapper output reliability, which improves the performance of the SISO decoder in the next iteration of the turbo process. Furthermore, the use of soft estimates alleviates the detrimental effect of error propagation that usually occurs when hard decisions are used in a feedback tracking loop or in decision-directed modes. As a result, better accuracy in CFO/SFO/CIR estimation and tracking can be achieved without the need for overhead pilot tones, which removes a significant transmission efficiency loss and enhances the spectral efficiency. As the initial values of the CFO, SFO, and CIR play an important role in the convergence of the joint synchronization, channel estimation, and decoding turbo processing, we also develop a coarse CFO, SFO, and CIR estimation scheme (which was not studied in [27]). It is applied to the preamble of the burst and is based on the combined CFO-SFO perturbation in order to provide accurately estimated initial values of the CFO, SFO, and CIR.
The rest of the paper is organized as follows. Section 2 describes the coded MIMO-OFDM signal model. Section 3 analyzes the effects of CFO, SFO, and channel responses on the received samples. These interrelations are further explored to develop the turbo joint channel estimation, synchronization, and decoding scheme in Section 4, and the vector RLS-based joint CIR, CFO, and SFO tracking algorithm is delineated in Section 5. Section 6 presents the coarse estimation of the CFO, SFO, and CIR. Simulation results for various scenarios are discussed in Section 7. Finally, Section 8 summarizes the paper.

System Model
Figure 1 shows a simplified block diagram of a convolutional-coded MIMO-OFDM transmitter using $N_t$ transmit antennas and $M$-ary quadrature amplitude modulation ($M$-QAM). This transmitter architecture is similar to the space-time (ST) bit-interleaved coded modulation (BICM) in [28]. Using a serial-to-parallel (S/P) converter, the input convolutional-encoded bitstream is first split into $N_t$ parallel sequences. Each sequence is further bit-interleaved and then organized as a sequence of $Q$-bit tuples, $\{d^u_{m,k}\}$, where  (DAC), the transmitted baseband signal at the $u$th transmit antenna can be written as where $T$ is the sampling period at the output of the IFFT, $N_g$ denotes the number of CP samples, $T_g = N_g T$, $T_s = (N + N_g)T$ is the OFDM symbol length after CP insertion, $u(t)$ is the unit step function, and $U(t) = u(t) - u(t - T_s)$. Practically, the colocated DACs are driven by a common sampling clock with a frequency of $1/T$. The multiple coded OFDM signals are transmitted over a frequency-selective, multipath fading channel. We assume fading conditions are unchanged within an OFDM burst interval, so that the quasistatic channel response between the $u$th transmit antenna and the $v$th receive antenna can be represented by where $h_{u,v,l}$ and $\tau_l$ are the complex gain and delay of the $l$th path, respectively, and $L$ is the total number of resolvable (effective) paths.

Effects of CFO, SFO, and Channel Responses on Received Samples
Frequency discrepancies between the oscillators used in the radio transmitters and receivers, together with channel-induced Doppler shifts, cause a net carrier frequency offset (CFO) of $\Delta f$ in the received signal, where $f$ is the operating radio carrier frequency. Practically, it is reasonable to assume that all pairs of transmit-receive antennas experience the same CFO [8], and the received signal at the $v$th receive antenna element can be written as The impinging signals at all receive antennas are then sampled for analog-to-digital conversion (ADC) by the common receive clock at rate $1/T'$. Since $T' \neq T$, the time alignment of received samples is also affected by the sampling frequency offset (SFO). After sampling and CP removal, the sample of the $m$th OFDM symbol of the received signal $r_v(t)$ at time instant $t_n = nT'$ is given by where The complex-valued Gaussian noise sample, $w_{v,m,n}$, has zero mean and a variance of , $H_{u,v}(k) = \sum_{l=0}^{L-1} h_{u,v,l}\, e^{-j(2\pi k/N)l}$ is the channel frequency response (CFR) at the $k$th subcarrier for the pair of the $u$th transmit antenna and the $v$th receive antenna, and T is the corresponding effective time-domain channel impulse response (CIR). The SFO and CFO terms are represented in terms of the transmit sampling period $T$ as $\eta = \Delta T/T$, $\Delta T = T' - T$, and As observed in (4), the CFO and SFO induce a time-domain phase rotation that translates into intercarrier interference (ICI), attenuation, and phase rotation in the frequency domain, as shown in the following derivations.
After FFT, the received FD sample at the $v$th receive antenna is where $(\varepsilon_i + i - k)$ , and $\mathrm{sinc}(x) = \sin(\pi x)/(\pi x)$. It is noted that the frequency-domain expression of the received samples in [6, Equation (37)] corresponds to an approximation of (5) for the case of the single-input single-output configuration ($N_t = 1$, $N_r = 1$). In the first summation in (5), the term $i = k$ corresponds to the subcarrier of interest, while the other terms with $i \neq k$ represent ICI. As can be observed from the above expression for $\rho_{i,k}$, the term $\varepsilon_i = i\eta + \varepsilon_\eta$ needs to be removed in order to suppress ICI. Obviously, in an ideal case with zero SFO and CFO, , and perfect orthogonality among subcarriers is preserved at the receiver. In addition, the coefficient $(\varepsilon_i + i - k)$  quantifies the CFO-SFO-induced attenuation and phase rotation of received subcarriers. Thus, to mitigate ICI and attenuation, the effects of CFO and SFO on the received samples have to be compensated. Hence, the estimates of CFO and SFO are needed to compensate for the detrimental effects (phase rotation) of synchronization errors, while the channel estimates are required for the MIMO demapping as illustrated in Figure 2. More specifically, the CFO and SFO compensations will be performed in the time domain (before the FFT at the receiver) as described in the following derivations.
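The leakage of a subcarrier onto its neighbours under a fractional frequency shift can be checked numerically. The sketch below is a minimal illustration, not the paper's exact $\rho_{i,k}$: it assumes the standard closed form of a length-$N$ geometric sum of complex exponentials (a Dirichlet kernel), with `offset` playing the role of $\varepsilon_i + i - k$. The function names are hypothetical, and the phase convention may differ from the one used in (5).

```python
import cmath
import math


def leakage_numeric(offset: float, n_fft: int) -> complex:
    """(1/N) * sum_n exp(j*2*pi*offset*n/N): gain from a tone onto a bin `offset` bins away."""
    return sum(cmath.exp(2j * math.pi * offset * n / n_fft) for n in range(n_fft)) / n_fft


def leakage_closed_form(offset: float, n_fft: int) -> complex:
    """Dirichlet-kernel closed form of the same sum."""
    if abs(math.sin(math.pi * offset / n_fft)) < 1e-15:
        return 1.0 + 0.0j  # offset is a multiple of N: every term of the sum equals 1
    mag = math.sin(math.pi * offset) / (n_fft * math.sin(math.pi * offset / n_fft))
    return mag * cmath.exp(1j * math.pi * offset * (n_fft - 1) / n_fft)


N_FFT = 64
for offset in (0.0, 0.1, 1.1):
    assert abs(leakage_numeric(offset, N_FFT) - leakage_closed_form(offset, N_FFT)) < 1e-9

desired_gain = abs(leakage_closed_form(0.1, N_FFT))    # tone of interest, shifted by 0.1 bin
neighbour_leak = abs(leakage_closed_form(1.1, N_FFT))  # same shift seen from the adjacent bin
print(f"gain on desired bin: {desired_gain:.4f}, leakage on adjacent bin: {neighbour_leak:.4f}")
```

With zero offset the desired bin has unit gain and integer-spaced bins see no leakage, which is the orthogonality condition described above; a fractional offset both attenuates the desired bin and spills energy into the others.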
Following the same approach as in [20], the received time-domain sample in (4) can be multiplied by $\exp[-j2\pi \varepsilon^c_\eta n/N]$ prior to FFT to mitigate ICI as shown in Figure 2, that is, where $\varepsilon^c_\eta = (1 + \eta^c)\varepsilon^c$, and $\varepsilon^c$ and $\eta^c$ are the estimates of CFO and SFO, respectively.
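The effect of this pre-FFT derotation can be illustrated on a toy symbol. The following is a minimal sketch under simplifying assumptions that are not in the paper: a single antenna, no channel, no noise, no SFO (so the rotation reduces to $\exp[\pm j2\pi\varepsilon n/N]$), and a perfect CFO estimate. The helper names `dft`, `idft`, and `rotate` are hypothetical.

```python
import cmath
import math


def dft(x):
    """Plain O(N^2) DFT, adequate for a small demonstration."""
    n_fft = len(x)
    return [sum(x[n] * cmath.exp(-2j * math.pi * k * n / n_fft) for n in range(n_fft))
            for k in range(n_fft)]


def idft(big_x):
    """Inverse of dft(), including the 1/N scaling."""
    n_fft = len(big_x)
    return [sum(big_x[k] * cmath.exp(2j * math.pi * k * n / n_fft) for k in range(n_fft)) / n_fft
            for n in range(n_fft)]


def rotate(x, eps):
    """Multiply sample n by exp(j*2*pi*eps*n/N); eps is in units of the subcarrier spacing."""
    n_fft = len(x)
    return [x[n] * cmath.exp(2j * math.pi * eps * n / n_fft) for n in range(n_fft)]


N = 16
tx = [0j] * N
tx[3] = 1 + 0j          # two active subcarriers, all others empty
tx[7] = -1 + 0j
eps_true = 0.2          # assumed CFO, in subcarrier spacings

time_rx = rotate(idft(tx), eps_true)            # CFO applied in the time domain
uncompensated = dft(time_rx)
compensated = dft(rotate(time_rx, -eps_true))   # derotation before the FFT

empty_bins = [k for k in range(N) if k not in (3, 7)]
ici_before = max(abs(uncompensated[k]) for k in empty_bins)
ici_after = max(abs(compensated[k]) for k in empty_bins)
print(f"largest ICI term before: {ici_before:.3f}, after: {ici_after:.2e}")
```

In the scheme described above, the estimate used for derotation is refined at every turbo iteration, so the residual ICI shrinks as the CFO and SFO estimates improve.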
After FFT, the resulting subcarriers at the $v$th receive antenna are After some manipulation, (7) can be rewritten as where Based on (8), the vector representation of the frequency-domain (FD) received samples at all receive antennas can be expressed by where the $(u, v)$th entry of T , and each of the complex elements in $W^c_m(k)$ has a variance of $N_0$. Equation (10) provides insight into the nonlinear interrelation between CFO, SFO, channel responses, and received subcarriers. It indicates that the estimation of the CFO ($\varepsilon^c$), the SFO ($\eta^c$), and the channel responses requires knowledge of the subcarrier data $X_m(k)$, while the decoding of the subcarrier data $X_m(k)$ also requires knowledge of the CFO, SFO, and channel responses in addition to the binary convolutional coding structure in $X_m(k)$. This interrelation can be exploited to develop a high-performance turbo joint channel estimation, synchronization, and decoding scheme that can mutually enhance the estimation accuracy and decoding reliability in an iterative manner. To reduce the number of estimated parameters for the MIMO channel, it is desirable to estimate the channel impulse response $\{h_{u,v,0}, h_{u,v,1}, \ldots, h_{u,v,L-1}\}$ instead of the channel frequency response $H_{u,v}(k)$, since $H_{u,v}(k)$ can be derived from the channel impulse response by a simple Fourier transform. The CFO, SFO, and CIR estimation needs to deal with the nonlinear relation shown in (10) and will be discussed in Section 5. The development of the turbo processing will be addressed in Section 4.
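The relation between the CIR and the CFR can be sketched in a few lines. This is a minimal illustration that assumes the CFR definition $H(k) = \sum_{l} h_l e^{-j(2\pi k/N)l}$ for one transmit-receive antenna pair; the 3-tap channel and the function name `cir_to_cfr` are hypothetical.

```python
import cmath
import math


def cir_to_cfr(cir, n_fft):
    """H(k) = sum_l h_l * exp(-j*2*pi*k*l/N) for k = 0..N-1."""
    return [sum(h * cmath.exp(-2j * math.pi * k * l / n_fft) for l, h in enumerate(cir))
            for k in range(n_fft)]


cir = [1.0 + 0j, 0.5j, -0.25 + 0j]   # hypothetical 3-tap channel
cfr = cir_to_cfr(cir, 64)
print(f"{len(cir)} CIR taps determine all {len(cfr)} CFR values")
```

Estimating the $L$ taps per antenna pair rather than the $N$ subcarrier gains is what keeps the parameter vector small when $L \ll N$.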

Turbo Joint Channel Estimation, Synchronization, and Decoding
The binary convolutional coding structure in $X_m(k)$ is used to develop the constituent soft-input soft-output (SISO) decoder (shown in Figure 2) to provide more reliable soft estimates of the coded bits, $P(c; O)$, based on the extrinsic soft-bit information received from the MIMO-demapper, $P(c; I)$, using the computations presented in [29]. $P(c; O)$ are then split into $N_t$ streams and interleaved to form $N_t$ soft-bit estimate streams $P(d^u_{m,k,q}; I)$ that are used as extrinsic information for MIMO demapping and CIR, CFO, and SFO estimation as follows.
The purpose of the MIMO-demapper is to compute the extrinsic soft bit information: where $b \in \{0, 1\}$, and the letters $I$ and $O$ refer to, respectively, the input and output of the MIMO-demapper. Based on (10), the term where $\mathcal{X}^{(b)}_{u,m,k,q}$ is the set of the vectors where $\mathcal{X}_m$ is the set of all possible values of $X_m(k)$, $P(X_m(k) = x) = \prod_u \prod_q P(d^u_{m,k,q} = d^u_{m,k,q}(x); I)$ due to the use of interleaving, and $d^u_{m,k,q}(x)$ denotes the value of the corresponding bit $d^u_{m,k,q}$ in the vector $x$. The above equations, (11) and (12), indicate that unlike the cases of perfect channel estimation and synchronization in [2] and perfect synchronization in [4], the MIMO demapper herein employs the estimates of the channel responses $H(k)$, the CFO $\varepsilon$, and the SFO $\eta$ to derive the extrinsic soft bit information.
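The idea of extrinsic demapping can be illustrated for a single transmit and a single receive antenna. The sketch below is not the paper's (11) and (12): it assumes the common BICM demapper form in which the likelihood of each candidate symbol is weighted by the a priori probabilities of all bits except the one being evaluated, a known channel gain `h`, complex Gaussian noise of variance `n0`, and a hypothetical Gray-mapped QPSK constellation.

```python
import math

# Hypothetical Gray-mapped, unit-energy QPSK: bit pair -> constellation point.
QPSK = {
    (0, 0): complex(1, 1) / 2 ** 0.5,
    (0, 1): complex(-1, 1) / 2 ** 0.5,
    (1, 1): complex(-1, -1) / 2 ** 0.5,
    (1, 0): complex(1, -1) / 2 ** 0.5,
}


def extrinsic_prob_one(y, h, n0, p_one, q):
    """Extrinsic P(bit q = 1) from y = h*x + noise, excluding bit q's own prior.

    p_one[j] is the a priori probability that bit j equals 1.
    """
    like = [0.0, 0.0]
    for bits, x in QPSK.items():
        prior_others = 1.0
        for j, bit in enumerate(bits):
            if j != q:
                prior_others *= p_one[j] if bit == 1 else 1.0 - p_one[j]
        like[bits[q]] += math.exp(-abs(y - h * x) ** 2 / n0) * prior_others
    return like[1] / (like[0] + like[1])


h = 1.0 + 0j
y = h * QPSK[(1, 0)]           # noise-free observation of the symbol carrying bits (1, 0)
priors = [0.5, 0.5]            # first iteration: no a priori knowledge
p_bit0 = extrinsic_prob_one(y, h, 0.5, priors, 0)
p_bit1 = extrinsic_prob_one(y, h, 0.5, priors, 1)
print(f"P(bit0 = 1) = {p_bit0:.3f}, P(bit1 = 1) = {p_bit1:.3f}")
```

In the MIMO case the sum runs over all candidate transmit vectors and `h * x` is replaced by the estimated channel matrix applied to the vector, including the CFO-SFO-induced phase and attenuation terms.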
The estimation of the channel responses $H(k)$, the CFO $\varepsilon$, and the SFO $\eta$ is also based on (10) and hence needs knowledge of the subcarrier data $X_m(k)$. For this, based on the computed $P(X_m(k) = x)$, the soft mapper (shown in Figure 2) generates the soft estimate of $X_m(k)$ as its mean, that is, Due to the close interaction between the CIR, CFO, and SFO estimates and the MIMO-demapper, the proposed turbo processing is performed in a joint detection-estimation manner (as described above) instead of a serial fashion (i.e., updating the estimates of $H(k)$, $\varepsilon$, and $\eta$ only after a few iterations for simplicity).
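The soft mapper's mean-symbol computation can be sketched for one transmit antenna as follows. This is a minimal illustration assuming independent bits (as provided by interleaving) and a hypothetical Gray-mapped QPSK constellation; `soft_symbol` is not a name from the paper.

```python
from itertools import product

# Hypothetical Gray-mapped, unit-energy QPSK: bit pair -> constellation point.
QPSK = {
    (0, 0): complex(1, 1) / 2 ** 0.5,
    (0, 1): complex(-1, 1) / 2 ** 0.5,
    (1, 1): complex(-1, -1) / 2 ** 0.5,
    (1, 0): complex(1, -1) / 2 ** 0.5,
}


def soft_symbol(p_one):
    """Mean symbol given p_one[q] = P(bit q = 1), with the bits assumed independent."""
    mean = 0j
    for bits in product((0, 1), repeat=len(p_one)):
        prob = 1.0
        for bit, p1 in zip(bits, p_one):
            prob *= p1 if bit == 1 else 1.0 - p1
        mean += prob * QPSK[bits]
    return mean


confident = soft_symbol([0.9, 0.1])   # decoder fairly sure the bits are (1, 0)
unsure = soft_symbol([0.5, 0.5])      # no information: the mean collapses to zero
print(f"|soft symbol| confident: {abs(confident):.2f}, unsure: {abs(unsure):.2f}")
```

An unreliable symbol therefore contributes a small-magnitude regressor to the estimator, which is how soft feedback limits error propagation compared with hard decisions.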
As shown in Section 7, convergence to good performance can be achieved with only 2 or 3 iterations. The $N_t$ extrinsic soft bit information streams, $P(d^u_{k,q}; O)$, $u = 1, \ldots, N_t$, are then deinterleaved and parallel-to-serial converted to form the extrinsic soft bitstream $P(c; I)$ for the constituent soft-input soft-output (SISO) decoder, which provides more reliable soft estimates of the coded bits, $P(c; O)$, for the next iteration. At any iteration, a hard decision can be applied to $P(u; O)$ to produce the decoded data bits. The information flow graph of the proposed turbo joint channel estimation, synchronization, and decoding scheme, shown in Figure 3, illustrates the iterative exchange of the extrinsic information between the constituent functional blocks in the receiver. By using the known training sequence $X_m(k)$ in the preamble segment of a burst, initial estimates of CFO and SFO can be accurately obtained by using the conjugate delay correlation property and then used to establish the initial CIR estimates by the vector RLS algorithm as discussed in Section 5.

Vector RLS-Based Joint Tracking of CIR, CFO, and SFO
Due to the nonlinear effects of CFO and SFO on the received samples as shown by (10) in both time and frequency domains, the joint estimation of CIR, CFO, and SFO would require highly complex nonlinear estimation techniques.
To avoid such complexity, the paper uses a first-order Taylor series expansion to approximately linearize the nonlinear estimation problem.
In addition, under the assumption that all transmit-receive antenna pairs experience common CFO and SFO values [7, 8, 11], we can develop a fast-convergence, vector RLS-based joint CIR, CFO, and SFO estimation and tracking algorithm suitable for MIMO-OFDM receivers as follows.
As previously discussed, to reduce the number of estimated channel parameters, we consider Using the least squares (LS) criterion, our aim is to iteratively estimate the $(2LN_tN_r + 2) \times 1$ parameter vector T at iteration $i$ to minimize the following weighted squared error sum: where $\lambda$ is the forgetting factor, and $p = 1, \ldots, i$ denotes the $p$th tone index in the set of $i$ tone indices used in this adaptive estimation. The elements of $\omega_i$ are with $u = 1, \ldots, N_t$, $v = 1, \ldots, N_r$, $l = 0, \ldots, L - 1$. From (10), we obtain It is noted that $X_{u,m_p}(k_p)$ denotes the soft estimate of the $p$th data tone at subcarrier $k_p$ of the $m_p$th OFDM symbol from the $u$th transmit antenna. It is clear that $f_v(X_{u,m_p}(k_p), \omega_i)$ is a nonlinear function of $\omega_{i,2LN_tN_r} = \varepsilon^{(i)}$ and $\omega_{i,2LN_tN_r+1} = \eta^{(i)}$. For a sufficiently small error $e_{i,p,v}$, $f_v(X_{u,m_p}(k_p), \omega_i)$ can be approximately represented by the linear terms of its Taylor series, that is, an approximately linear estimation error can be determined by The gradient vector of $f_v(X_{u,m_p}(k_p), \omega_{i-1})$ corresponding to the $v$th receive antenna is determined by where Note that for $\rho = 1, \ldots, N_r$ and $\rho \neq v$, $\partial f_v / \partial \omega_{i,\,l+L+2L(u-1)+2LN_t(\rho-1)} = 0$. Subsequently, the vector RLS algorithm [30] can be used to formulate the following vector RLS-based joint CIR, CFO, and SFO tracking scheme.
, where $\gamma$ is the regularization parameter. (The use of a scaled identity matrix for initialization is mainly for convenience, and a random initialization matrix can also be employed. Since convergence will invariably be attained, but the final converged position depends on many environmental factors and is unknown, the difference between the two types of initialization matrices is in general not significant. However, due to its randomness, a random matrix may give rise to problems with matrix inversion or other similar matrix operations under certain conditions. As a result, most adaptive algorithms make use of the more deterministic scaled identity matrix for initialization purposes.) Iterative Procedure. At the $i$th iteration with a forgetting factor $\lambda$, update Under the above implementation of the vector RLS-based CIR, CFO, and SFO tracking algorithm, the resulting computational complexity is $O(L^3 N_t^3 N_r^3 N_d)$ per turbo iteration, where $L$ denotes the channel length, $N_t$ stands for the number of transmit antennas, $N_r$ is the number of receive antennas, and $N_d$ is the number of subcarriers used in each turbo iteration for the vector RLS tracking.
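The structure of a recursive least-squares update with a forgetting factor can be sketched on a small real-valued linear model. This is the textbook scalar-output RLS recursion, shown only to illustrate the gain, error, and inverse-correlation updates; it is not the paper's vector algorithm, in which the regressor is the gradient vector of the linearized $f_v$ and all $N_r$ receive antennas are processed together. The initial matrix is a scaled identity whose scale is passed directly as the hypothetical parameter `p0_scale`, since its exact relation to $\gamma$ is not assumed here.

```python
import math


def rls_update(w, p, x, d, lam=1.0):
    """One exponentially weighted RLS step for the linear model d ~ w . x.

    w: parameter estimate (list), p: inverse-correlation matrix (list of lists),
    x: regressor (list), d: observed scalar, lam: forgetting factor.
    """
    n = len(w)
    px = [sum(p[r][c] * x[c] for c in range(n)) for r in range(n)]
    denom = lam + sum(x[r] * px[r] for r in range(n))
    gain = [v / denom for v in px]
    err = d - sum(w[r] * x[r] for r in range(n))   # a priori estimation error
    w_new = [w[r] + gain[r] * err for r in range(n)]
    # P <- (P - gain * x^T P) / lam; x^T P equals px^T because P is symmetric.
    p_new = [[(p[r][c] - gain[r] * px[c]) / lam for c in range(n)] for r in range(n)]
    return w_new, p_new


def rls_fit(samples, n_params, lam=1.0, p0_scale=1000.0):
    """Run RLS over (x, d) pairs, starting from w = 0 and P = p0_scale * I."""
    w = [0.0] * n_params
    p = [[(p0_scale if r == c else 0.0) for c in range(n_params)] for r in range(n_params)]
    for x, d in samples:
        w, p = rls_update(w, p, x, d, lam)
    return w


TRUE_W = [2.0, -3.0]               # hypothetical parameters to recover
samples = []
for n in range(50):
    x = [math.cos(0.3 * n), math.sin(0.7 * n)]
    samples.append((x, TRUE_W[0] * x[0] + TRUE_W[1] * x[1]))
w_hat = rls_fit(samples, n_params=2, lam=1.0, p0_scale=1000.0)
print("estimated parameters:", [round(v, 3) for v in w_hat])
```

Each update costs on the order of the squared parameter count, and the parameter vector here has $2LN_tN_r + 2$ real entries, so the per-tone cost grows quickly with the channel length and the antenna counts.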

Coarse CIR, CFO, and SFO Estimation for Initial Values
For a stationary environment and time-invariant parameter vector, the RLS algorithm is stable regardless of the eigenvalue spread of the input vector correlation matrix [31] as shown in [32]. Due to the use of the first-order Taylor series approximation, the stability of the vector RLS-based CFO, SFO, and CIR tracking scheme requires sufficiently small initial errors between the initial guesses and the true values of CIR, CFO, and SFO. Accurate yet simple coarse estimation of CFO and SFO can be based on the conjugate delay correlation of the two identical and known training sequences in the preamble of the burst (as shown in Figure 3), that is, based on (4), we can obtain the following approximation: where $m_1$ and $m_2 = m_1 + 1$ denote the indices of the 1st and 2nd training sequences. Therefore, the combined CFO-SFO perturbation can be estimated by where Under the assumption of $\eta \ll 1$ (e.g., for a typical SFO value of around 50 ppm, or $5 \times 10^{-5}$, in practice) and the use of the two identical long training sequences in the preamble of a burst, the coarse (initial) CFO and SFO estimates can be determined separately by where Φ[ ,n . The above coarse CFO and SFO estimates are then used in the coarse CIR estimation that employs the vector RLS algorithm with the known $X_m(k)$'s during the preamble.
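The conjugate delay correlation idea can be sketched as follows. This is a minimal illustration under assumptions that are not in the paper: a single antenna, no noise, no multipath, no SFO (so the estimated quantity is the CFO itself rather than the combined CFO-SFO perturbation), and two identical training blocks sent back to back. The training sequence and the function name `coarse_cfo` are hypothetical.

```python
import cmath
import math


def coarse_cfo(rx, n_fft, delay):
    """Estimate CFO (in subcarrier spacings) from two identical blocks `delay` samples apart.

    The phase of sum_n conj(rx[n]) * rx[n + delay] equals 2*pi*eps*delay/N,
    which is unambiguous only while |eps| < N / (2 * delay).
    """
    corr = sum(rx[n].conjugate() * rx[n + delay] for n in range(n_fft))
    return n_fft * cmath.phase(corr) / (2.0 * math.pi * delay)


N = 64
EPS_TRUE = 0.12                  # assumed CFO, in units of the subcarrier spacing
training = [cmath.exp(1j * math.pi * n * n / N) for n in range(N)]  # hypothetical unit-modulus sequence
tx = training + training         # two identical blocks back to back, so delay = N
rx = [s * cmath.exp(2j * math.pi * EPS_TRUE * n / N) for n, s in enumerate(tx)]
eps_hat = coarse_cfo(rx, N, delay=N)
print(f"true CFO: {EPS_TRUE}, coarse estimate: {eps_hat:.4f}")
```

With a nonzero SFO the same correlation phase carries the combined perturbation, which is why the text separates the CFO and SFO afterwards under the assumption $\eta \ll 1$.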

Simulation Results and Discussions
Computer simulation has been conducted to evaluate the performance of the proposed turbo joint channel estimation, synchronization, and decoding scheme for a convolutional-coded MIMO-OFDM system. In the investigation, the OFDM-related parameters are set to be similar to those given by the IEEE 802.11a standard [15]. QPSK is employed for the data OFDM symbols, each of which carries 52 data tones. Note that in [15], 4 of the 52 used tones are reserved as known pilot tones to facilitate the CIR, CFO, and SFO tracking, which represents an overhead of 8.33% relative to the remaining 48 data tones. For the proposed turbo joint channel estimation, synchronization, and decoding scheme, the entire OFDM symbol can be used for data tones to eliminate this overhead of 8.33%. As illustrated in Figure 3, a burst format of two identical long training symbols and 225 data OFDM symbols was used in the simulation. The two identical long training symbols in the preamble of a burst are used to perform a correlation-based coarse CFO-SFO estimation to establish their initial values for the turbo joint tracking of CIR, CFO, and SFO. The coarse CIR estimation is performed by using the vector RLS algorithm and the first long training symbol, with the available CFO and SFO initial estimates, and with the initial guesses of the CIRs and the gradient components in (19) corresponding to the CFO-SFO variables set to zero. A rate-1/2 nonrecursive systematic convolutional code with length covering 2 OFDM symbols is employed for encoding at the transmitter. At the receiver, the SISO decoder is used as discussed in Section 4. For each transmit-receive antenna pair, we consider an exponentially decaying Rayleigh fading channel with a channel length of 5 and an RMS delay spread of 50 nanoseconds. In the simulation, the channel impulse responses and frequency offsets are assumed to be unchanged over the duration of a burst of 227 OFDM symbols (including the two training OFDM symbols of the preamble).
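One way to draw such a channel realization is sketched below. The text does not give the exact power delay profile, so the sketch assumes tap powers proportional to $\exp(-lT/\tau_{\mathrm{rms}})$ with the decay constant set equal to the quoted RMS delay spread (a common simplification), a sampling period of 50 ns as in a 20 MHz IEEE 802.11a system, and independent zero-mean complex Gaussian taps normalized to unit total power. The function names are hypothetical.

```python
import math
import random


def exp_decay_profile(num_taps, rms_delay_s, sample_period_s):
    """Normalized tap powers proportional to exp(-l*T/tau_rms), l = 0..num_taps-1."""
    raw = [math.exp(-l * sample_period_s / rms_delay_s) for l in range(num_taps)]
    total = sum(raw)
    return [p / total for p in raw]


def draw_rayleigh_cir(profile, rng):
    """One quasi-static realization: independent zero-mean complex Gaussian taps."""
    return [complex(rng.gauss(0.0, math.sqrt(p / 2.0)), rng.gauss(0.0, math.sqrt(p / 2.0)))
            for p in profile]


profile = exp_decay_profile(num_taps=5, rms_delay_s=50e-9, sample_period_s=50e-9)
rng = random.Random(0)             # fixed seed so the draw is repeatable
cir = draw_rayleigh_cir(profile, rng)
print("tap powers:", [round(p, 3) for p in profile])
```

In a quasistatic simulation, one such draw per transmit-receive antenna pair is held fixed for the whole burst of 227 OFDM symbols and redrawn for the next burst.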
Figure 4 shows the measured mean squared errors (MSEs) of the CIR estimate and the relevant Cramér-Rao lower bounds (CRLBs). The numerical results demonstrate that the proposed estimation algorithm provides fast convergence and the best MSE performance with forgetting factor $\lambda = 1$ and regularization parameter $\gamma = 10$. For comparison, the CRLB values of the CIR estimates obtained by using any unbiased pilot-aided estimation approach with 4 known pilot tones (as in the IEEE 802.11a standard [15]) and with all 52 known tones (i.e., the ideal but unrealistic case) in each data OFDM symbol are also plotted in Figure 4. As can be seen in Figure 4, the MSE values of the CIR estimates obtained by the proposed scheme with just one iteration are even smaller than the CRLB obtained by any unbiased pilot-aided joint CIR, CFO, and SFO estimation approach using 4 pilots in each OFDM symbol. Furthermore, after just 3 iterations, the proposed scheme converges to its best MSE performance, close to the CRLB of the ideal but unrealistic case of all 52 known tones. In the same manner, Figures 5 and 6 show the MSE results and relevant CRLBs of the CFO and SFO estimates, respectively. Figure 7 shows the MSE performance and CRLB values of the proposed turbo scheme with 3 iterations of turbo processing versus SNR for QPSK (a) and 16-QAM (b). As can be seen in Figure 7(a), the proposed joint CIR/CFO/SFO estimation scheme provides more accurate CFO estimates than the existing ML-based CFO and SFO tracking algorithm [11] that requires the use of perfect channel knowledge. For the same SNR, the gap between the MSE and the corresponding CRLB for QPSK is smaller than that for 16-QAM. Figure 8 shows the BER performance of the proposed turbo scheme with different numbers of iterations. For reference, the ideal BER performance (curve E) in the case of perfect channel estimation and synchronization (i.e., zero CFO and SFO, using 3 iterations between the MIMO-demapper and the SISO decoder) is also plotted. The results show that the performance of the proposed turbo scheme improves with the number of iterations and can approach that of the case of perfect channel estimation and synchronization after 3 iterations (curve D). Without turbo processing, the resulting worst-case BER performance (curve A), corresponding to the case of using only the preamble for the vector RLS-based joint channel estimation and synchronization, is plotted in Figure 8. As shown, without the use of the turbo principle, the vector RLS-based joint channel estimation and synchronization scheme using only the preamble (curve A) provides an unacceptable receiver performance (BER values around 0.5), while the proposed turbo scheme offers a remarkable improvement in BER performance even after just one iteration (curve B).
To investigate the effect of CFO and SFO on the performance of the proposed turbo scheme, Figures 9 and 10 show the BER performance of the proposed turbo algorithm under various CFO and SFO values, respectively. For reference, the ideal BER performance in the case of perfect channel estimation and synchronization (i.e., zero CFO and SFO, using 3 iterations between the MIMO-demapper and the SISO decoder) is also plotted. As shown, the proposed turbo estimation scheme is highly robust against a wide range of SFO values.

Conclusions
In this paper, a received signal model in the presence of CFO, SFO, and channel distortions was examined and explored to develop a turbo joint channel estimation, synchronization, and decoding scheme and a vector RLS-based joint CFO, SFO, and CIR tracking algorithm for coded MIMO-OFDM systems over quasistatic Rayleigh multipath fading channels. Turbo processing enables the proposed joint channel estimation, synchronization, and decoding scheme to provide a near-ideal BER performance over a wide range of SFO values without the need for known pilot tones inserted in the data segment of a burst. Simulation results show that the joint CIR, CFO, and SFO estimation with the turbo principle offers fast convergence and low MSE performance over quasistatic Rayleigh multipath fading channels.

Appendices

A. Cramér-Rao Lower Bound for Pilot-Based Estimates of CIR, CFO, and SFO
Based on (5), the received subcarrier $k_i$ in the frequency domain at the $v$th receive antenna can be expressed by

$$Y_{v,m}(k_i) = e^{j(2\pi/N) N_{m_i} \varepsilon_{k_i}}\, \rho_{k_i,k_i} \sum_{u=1}^{N_t} X_{u,m}(k_i)\, H_{u,v}(k_i) + W_{v,m}(k_i). \tag{A.1}$$

Note that the ICI components in (A.1) can be assumed to be additive and Gaussian distributed and included in $W_{v,m}(k_i)$ [20]. By collecting $K$ subcarriers at each receive antenna, the resulting $KN_r$ subcarriers from the $N_r$ receive antennas can be represented in vector form as follows:
where  and $P_N = \sigma^2$. Assume that the coefficients of the CIR, $\{h_{u,v,0}, h_{u,v,1}, \ldots, h_{u,v,L-1}\}$, are independent zero-mean complex random variables with common variances $\{\sigma_0^2, \sigma_1^2, \ldots, \sigma_{L-1}^2\}$ for all pairs of transmit-receive antennas, and all receive antennas experience the same AWGN power. After some manipulation, it can be shown that the SNR values at all receive antennas are equal to where $E_s = E\{|X_{u,m}(k)|^2\}$ is the average energy of the $M$-QAM symbols.

Figure 3: Turbo processing for joint channel estimation, synchronization, and decoding.

Figure 8: BER performance of the proposed turbo joint channel estimation, synchronization, and decoding scheme.

Figure 9: BER performance of the proposed turbo joint channel estimation, synchronization, and decoding scheme under various SFO values.