- Research
- Open
- Published:

# Successive interference cancelation and MAP decoding for mobile MIMO OFDM systems and their convergence behavior

*EURASIP Journal on Wireless Communications and Networking***volume 2012**, Article number: 311 (2012)

## Abstract

Turbo equalization schemes based on minimum mean square error criteria available in the literature for multiple-input multiple-output (MIMO) systems are computationally expensive, as they require a relatively large matrix inversion. In this article, we propose a suboptimal, successive interference cancelation (SIC)-based maximum a posteriori (MAP) decoding in doubly dispersive channels for orthogonal frequency division multiplexing (OFDM) MIMO systems (SIC-MAP-MIMO). SIC-MAP-MIMO leverages on the soft feedback symbol estimate to remove the intercarrier interference and coantenna interference from the received data thus making the subsequent MAP decoding simple. Extrinsic information transfer chart analysis supplemented with numerical simulation results show that SIC-MAP-MIMO achieves comparable BER performance to similar equalization schemes but with significant computational savings.

## Introduction

Wireless communication based on MIMO systems has gained popularity due to the potential capacity increases it can provide [1]. OFDM has been a popular technique for transmission of signals over wireless channels primarily because the receiver design is relatively simple, as it does not require a complex equalizer. MIMO-OFDM-based transmission systems can thus provide very high data rates with a relatively simple receiver design and are adopted in many recent wireless communication standards. Examples include (a) IEEE-802.11-n/ac [2, 3], (b) IEEE 802.16e/m (WiMAX) [4], and (c) LTE [5]. IEEE-802.11-n [2] specifies a maximum of 600 Mb/s using four independent spatial streams transmitted over a 40-MHz channel. IEEE-802.11-ac [3] specifies a data rate of up to 3.5 Gb/s using eight independent spatial streams in an 80-MHz channel. WiMAX [4] specifies a limit of approximately 100 Mb/s using four spatial streams in a 5-MHz channel. LTE suggests a peak data rate of 326.4 Mb/s using a 20-MHz downlink with four transmit antennas [5]. Under static multipath channel conditions, the received signal in the MIMO receiver is corrupted only by coantenna interference (CAI). However, high transceiver mobility at high carrier frequency causes severe time-varying frequency-selective multipath fading at the receiver. This breaks the orthogonality of subcarriers and hence causes intercarrier interference (ICI) in the received signal. As an example, at a transmission frequency of 5 GHz and at vehicular speeds of 240–480 km/h, which are common in high-speed trains, the expected maximum receiver Doppler spread in WiMAX and LTE systems is of the order of 12 to 23% of the intercarrier spacing. Furthermore, it is believed that future wireless communications will adopt higher carrier frequencies and higher mobility requirements, further increasing the maximum relative Doppler frequency and exacerbating the ICI. In such scenarios, as discussed in this article, the efficient detector design for MIMO-OFDM systems is a challenging practical problem.

Some early equalization schemes proposed in the literature to cope with ICI and CAI are (a) block linear [6], (b) banded minimum mean square error (MMSE) linear [7], and (c) banded MMSE decision-feedback [8]. More recently, various iterative equalization schemes based on successive cancelation of ICI and CAI [9–13] or turbo principle [9, 14–16] were proposed. A brief survey of the iterative equalization schemes published in the last decade and how they differ from the proposed scheme is given in the sequel. In general, turbo-like iterative schemes are found to have superior performance compared to others, but they usually suffer from high computation complexity, albeit at varying degrees, and thus require high silicon area for implementation and high battery power for operation. Such practical application challenges have motivated us to propose a new low-complexity detector scheme for OFDM-MIMO with an improved trade-off between performance and implementation complexity in [17] and in this study.

The iterative/successive interference cancelation scheme proposed in Section “SIC-based MAP receiver: MIMO (SIC-MAP-MIMO)” is related to, yet distinct from, a number of published algorithms. In [9], multiple access interference (MAI) and inter-symbol interference (ISI) in a static multipath environment are removed in a code-division-multipath-access (CDMA) system using a combination of soft-interference cancelation and linear MMSE filtering. [10] is an extension of the scheme proposed in [9], but in [10], additional filtering is performed to suppress both the ISI and MAI residuals. Turbo Equalization (TE) proposed in [18] performs MMSE-based turbo estimation of the transmitted symbol on single-carrier systems under static channel conditions, followed by LLR computation and BCJR decoding. This involves matrix inversion for the estimation of every symbol per iteration and is thus computationally expensive (*O*(*N*^{2}) operations). Additional complexity reduction for TE is achieved in doubly selective OFDM systems by working on a submatrix around the system matrix as in [19]. SIC-MAP-MIMO is perhaps close to the ISI cancelation stage of [9]. However, unlike [9, 10], SIC-MAP-MIMO requires only *O*(*N*) operations. It leverages the banded sparse structure of the single-user LTV MIMO system matrix, where the significant channel coefficients are concentrated in a banded structure along the diagonal [6, 8, 19] as shown in Figure 1 (right). There are a number of SIC schemes which try to diagonalize the system matrix. In [11], an iterative decision feedback equalizer is proposed to perform ICI cancelation such that the modified system matrix becomes diagonal and, consequently, the equalizer becomes single-tap. In [20], ICI is removed from the time domain signal (resulting in a diagonal frequency domain system matrix) and is converted to frequency domain. Hard decisions are made on the equalized signal, following which it is converted back to time domain and the time-frequency iterations are repeated. In [12], the mean value of the transmit symbol is computed using the LLR values from the decoder. This is used to remove the ICI from the received symbol, resulting in a diagonal system matrix. A modified low-complexity MMSE equalizer that takes the decision error into account is now derived. In [13], a turbo-EM receiver is proposed. Here, the system matrix is estimated from the EM detector, whereas transmit symbols are estimated either using the EM algorithm or from the LLR values from the decoder. Using these estimates, ICI is computed and removed as in [11, 12] to obtain a diagonal system matrix. The scheme in [21] is applicable to single-carrier (SC) systems. Here, the received signal is split into small segments, such that the channel remains approximately static during each small segment. Suitable signal processing is performed on each of these segments such that the resulting channel matrix is made diagonal. TE, like the one described in [18], is performed on the modified system to recover the received bits. The above-described schemes try to obtain a modified system with only diagonal entries, such that a single tap equalizer can equalize the modified system. Unlike this, in SIC-MAP-MIMO, copies of the received signal on the same and adjacent subcarriers of all receive antennas are carefully separated out to obtain frequency diversity. The resulting system matrix is a column matrix. It has been identified through simulations that the banded sparse structure of the system matrix, as in the case of doubly selective MIMO channels, allows this simplification without sacrificing performance. MAP decoding is performed on this simplified system. The scheme proposed in [15] is similar to [19], but is extended to OFDM MIMO. It proposes a new window for received signal. SIC-MAP-MIMO does not perform windowing, but better performance can be expected with any of the windowing proposed above.

In this article, we propose a suboptimal, SIC-based MAP decoder. In an OFDM-MIMO system operating in a doubly selective environment, a QAM symbol transmitted from a particular subcarrier of a given antenna spreads to the same and adjacent subcarriers of all receive antennas, causing ICI and CAI. In SIC-MAP-MIMO, copies of the received signal on the same and adjacent subcarriers of all receive antennas are carefully separated out, as in the case of a frequency diversity system. CAI and ICI, if they exist, are estimated iteratively using the conditional symbol mean estimates obtained from the decoder feedback information from the previous iteration. These estimates are removed appropriately from the received symbol. Hence the resulting system matrix becomes a single-column matrix. MAP decoding of the resulting system is simple to implement. Motivated from [15, 19, 22], we exploit the banded nature of the system matrix in SIC-MAP-MIMO. The performance and computational complexity of the proposed scheme are compared with schemes suggested in [19, 22] when extended to MIMO. It has been found that SIC-MAP-MIMO provides a comparable performance to the above schemes, but with significantly less computational complexity, making it especially suitable for mobile applications where battery power is limited. Convergence behavior of the above schemes is also analyzed using extrinsic information transfer (EXIT) charts [23].

This article is organized as follows: Notations used in this article are explained first. In the next section, the system model is presented, followed by a description of SIC-MAP-MIMO in Section “SIC-based MAP receiver: MIMO (SIC-MAP-MIMO)”. In Section “Computational complexity analysis”, we compare the computation complexity of SIC-MAP-MIMO with similar equalization schemes. In Section “Numerical results”, the numerical results are presented. The article concludes with “Conclusion” section, where we draw final conclusions.

### Notation

(·)^{t} denotes transpose; (·)^{H} denotes conjugate transpose (Hermitian); ⊗ is the Kronecker product; {*a*} denotes a set with elements {*a*(0),*a*(1),…}; **F** for normalized *N* point Discrete Fourier Transform (DFT), where ${\mathbf{F}}_{k,l}:=(1/\sqrt{N}){e}^{-j2\mathrm{\Pi kl}/N}$; **I** is the identity matrix; **i**_{
k
} is the *k*^{th} column of **I**; ${\mathbf{0}}_{{n}_{R}\times {n}_{T}}$ is the null matrix of size *n*_{
R
} × *n*_{
T
}; * denotes convolution; ||·|| for *l*_{2}-norm; ⌈·⌉ is the ceiling of a function; modulo-*N* is denoted by 〈·〉_{
N
}; *Re*(·) and *Im*(·) for the real and imaginary parts, respectively. *diag*(*ν*_{
x
}) is the diagonal matrix with vector *ν*_{
x
} in the main diagonal. Expectation is denoted by *E*{·}. Both × and · are used to denote multiplication. Bold lowercase letters (e.g., **x**) denote vectors, and bold uppercase letters (e.g., **X**) denote matrices. Covariance is denoted by cov(**b**,*c*): = *E*{**bc**^{H}} − *E*{**b**}*E*{**c**^{H}}.

## System model

The MIMO OFDM transceiver system with *n*_{
T
} transmit and *n*_{
R
} receive antennas used in this article is given in Figure 2. We assume that *n*_{
T
} ≤ *n*_{
R
}. Information bits ({*a*}) are convolutionally encoded ({*b*}) and passed through a bit interleaver ({*c*}). The symbol mapper modulates them into QAM symbols ({*s*}). A set of *N* of these coded QAM “frequency domain” symbols is collected to form an OFDM symbol. The demultiplexer collects *n*_{
T
} OFDM symbols (an OFDM symbol frame) and sends each symbol ({**s**_{
q
}}) to one of the *n*_{
T
} transmit paths. The symbol interleaver (SI) in each path interleaves them (**{x**_{
q
}}). They are then converted into “discrete time-domain” samples ({**z**_{
q
}}) by performing an *N*- point IDFT. A cyclic prefix (CP) of length *N*_{
p
}≤*N* is added to each of these symbols. They are then simultaneously transmitted from *n*_{
T
} transmit antennas. Transmit and receive antennas are assumed to be placed sufficiently far apart among themselves so that the *n*_{
T
} · *n*_{
R
} multipath channels are independent. Furthermore, these channels are assumed to be both frequency- and time-selective and are modeled as a linear time-varying (LTV) system with a discrete impulse response *h*_{
pq
}(*i*,*l*) that is defined as the time *i* response to an impulse at time *i*−*l* for the wireless channel from the *q* th transmit antenna to the *p* th receive antenna. Static multipath channel conditions are treated as a special case of the above general formulation. At the receiver, the CP-removed OFDM data from each receive antenna are converted back to the “frequency domain” by performing *N*-point DFT and passed to the SIC and Symbol Deinterleaver. The log likelihood ratio (LLR) computer computes the LLRs of the received bits from the interference removed observation. This is appropriately multiplexed, bit-deinterleaved, and passed to a BCJR- or SOVA-based decoder.

We assume perfect carrier, symbol, and sample synchronization at the receiver. Besides, it is assumed that the channel is known at the receiver. We follow the modeling used in [6]. Assuming that maximum channel delay spread *N*_{
h
} ≤ *N*_{
p
}, the received samples on any of the *p* receive antennas in the baseband can be represented as

where {*n*_{
p
}(*i*)} are additive white Gaussian noise (AWGN) samples on the *p* th receive antenna with zero mean and variance *σ*^{2}(we assume equal noise power on all receive antennas). The condition *N*_{
h
} ≤ *N*_{
p
} ensures that *r*_{
p
}(*i*) contains contributions only from the currently transmitted OFDM symbol frame. The received vector at the *i* th time instant, $\mathbf{r}\left(i\right):={\left[{r}_{1}\right(i),{r}_{2}(i),\dots ,{r}_{{n}_{R}}(i\left)\right]}^{t}$, can be expressed as

where

$\mathbf{z}\left(i\right):=\phantom{\rule{0.3em}{0ex}}{\left[{z}_{1}\right(i),{z}_{2}(i),\dots ,{z}_{{n}_{T}}(i\left)\right]}^{t}$ and $\mathbf{n}\left(i\right):=\phantom{\rule{0.3em}{0ex}}{\left[{n}_{1}\right(i),{n}_{2}(i),\dots ,{n}_{{n}_{R}}(i\left)\right]}^{t}$. Over a time window of N sample duration, (2) can be expressed in matrix form as

where $\mathbf{r}\phantom{\rule{0.3em}{0ex}}:={\left[{\mathbf{r}}^{t}\right(0),{\mathbf{r}}^{t}(1),\dots ,{\mathbf{r}}^{t}(N\phantom{\rule{0.3em}{0ex}}-\phantom{\rule{0.3em}{0ex}}1\left)\right]}^{t}\in {\mathcal{C}}^{N\xb7{n}_{R}},\mathbf{z}:={\left[{\mathbf{z}}^{t}\right(0),{\mathbf{z}}^{t}(1),\dots ,{\mathbf{z}}^{t}(N-1\left)\right]}^{t}\phantom{\rule{0.3em}{0ex}}\in \phantom{\rule{0.3em}{0ex}}{\mathcal{C}}^{N\xb7{n}_{T}},\mathit{\psi}\phantom{\rule{0.3em}{0ex}}:=\phantom{\rule{0.3em}{0ex}}{\left[{\mathbf{n}}^{t}\right(0),\phantom{\rule{0.3em}{0ex}}{\mathbf{n}}^{t}(1),\dots ,\phantom{\rule{0.3em}{0ex}}{\mathbf{n}}^{t}(N-1\left)\right]}^{t}\in {\mathcal{C}}^{N\xb7{n}_{R}}$ and $\mathbf{\Xi}\in {\mathcal{C}}^{N\xb7{n}_{R}\times N\xb7{n}_{T}}$ is the time varying system matrix given in (4).

*N* samples from the same OFDM symbol from *n*_{
R
}receive antennas (a total of *N* · *n*_{
R
}samples) are grouped together and presented to a DFT processor which, in turn, outputs *N* · *n*_{
R
}“frequency domain” samples. This operation can be represented mathematically as follows:

where ${\mathbf{Q}}^{\left(\mathrm{Tx}\right)}\phantom{\rule{0.3em}{0ex}}=\phantom{\rule{0.3em}{0ex}}\mathbf{F}\otimes {\mathbf{I}}_{{n}_{T}},{\mathbf{Q}}^{\left(\mathrm{Rx}\right)}\phantom{\rule{0.3em}{0ex}}=\phantom{\rule{0.3em}{0ex}}\mathbf{F}\otimes {\mathbf{I}}_{{n}_{R}},\mathbf{H}\phantom{\rule{0.3em}{0ex}}=\phantom{\rule{0.3em}{0ex}}{\mathbf{Q}}^{\left(\mathrm{Rx}\right)}\mathbf{\Xi}{\mathbf{Q}}^{\left(\mathrm{Tx}\right)H}\phantom{\rule{0.3em}{0ex}},\mathbf{z}={\mathbf{Q}}^{\left(\mathrm{Tx}\right)H}\mathbf{x},\mathbf{y}:={\left[{\mathbf{y}}^{t}\right(0),{\mathbf{y}}^{t}(1),\dots ,{\mathbf{y}}^{t}(N-1\left)\right]}^{t},\mathbf{y}\left(k\right):={\left[{y}_{1}\right(k),{y}_{2}(k),\dots ,{y}_{{n}_{R}}(k\left)\right]}^{t},\mathbf{x}:={\left[{\mathbf{x}}^{t}\right(0),{\mathbf{x}}^{t}(1),\dots ,{\mathbf{x}}^{t}(N-1\left)\right]}^{t},\mathbf{x}\left(k\right):={\left[{x}_{1}\right(k),{x}_{2}(k),\dots ,{x}_{{n}_{T}}(k\left)\right]}^{t}$ and **w** = **Q**^{(Rx)}** ψ**. Note that (a) each element of

**H**can be written as,

and (b) **w** is wide sense stationary (WSS) with the mean and the covariance identical to that of ** ψ**, since

**F**is unitary.

The total transmit power is assumed to be unity with all antennas transmitting equal power. Multipath time-varying channel coefficients are modeled as zero mean complex Gaussian random variables. Fading coefficients for different paths are assumed to be statistically independent while the coefficient for a given path is time-correlated with the autocorrelation function (wide-sense stationary uncorrelated scattering model) given by [24],

where *α*_{
l
} is the average power of the *l* th path, ${\mathcal{J}}_{0}(\xb7)$ is the zeroth-order Bessel function of the first kind, *T*_{
s
}is the sampling interval, and *f*_{
d
}is the maximum Doppler frequency given by

Here *v* is the vehicle speed, *c* is the velocity of light, *f*_{
c
}is the carrier frequency, and *θ*_{
d
}is the scattering angle.

It has been shown that **H** will be a block-banded matrix with significant block coefficients concentrated in a banded structure, with width *D* along the diagonal [6, 8, 19]. *D* is a design parameter typically chosen as *D* = 2*L* + 1, where *L* = ⌈*f*_{
d
}*T*_{
s
}*N*⌉ in which *N* is the OFDM symbol length. If the channel is static, ** Ξ** will be a block circulant matrix and

**H**will be a block diagonal matrix. Different structures of

**H**are shown in Figure 1. Interference from adjacent subcarriers gives raise to ICI. The received signal on each subcarrier at each receiver antenna contains contributions from all transmit antennas. This gives rise to CAI [6].

## SIC-based MAP receiver: MIMO (SIC-MAP-MIMO)

### Formulation of the proposed MAP receiver

In this section, we present a low-complexity iterative receiver that implements SIC, followed by MAP decoding for MIMO systems. The proposed scheme first modifies the system matrix to a single column matrix by selectively removing the ICI and CAI interference from the received symbols, where ICI and CAI interference are computed using the feedback symbol mean values. Soft information can be computed directly with low cost from this modified model. These are fed to a MAP bit decoder. The following observations are key in formulating the proposed scheme:

1.The relative magnitude of each subblock and superblock diagonal element of the doubly selective Rayleigh fading channel matrix **H** decreases significantly as we move away from the main diagonal. This has been justified in [19, 22]. We can thus ignore all elements that are far away from the main diagonal without significantly impacting performance. This is further justified through simulations in the “Numerical results” section. Note that these elements are absent for a static multipath channel.

2.As the *extrinsic* information becomes more accurate over multiple turbo iterations, the conditional mean, *μ*_{
x
}(*k*) → **x**(*k*), which is the true symbol value and the conditional variance, ${\mathit{\nu}}_{\mathbf{\text{x}}}\left(k\right)\phantom{\rule{1em}{0ex}}\to \phantom{\rule{1em}{0ex}}{\mathbf{\text{0}}}_{{n}_{T}\times 1}$. Therefore, in each new iteration we can use *μ*_{
x
}(*k*) from the previous iteration to selectively remove CAI and ICI from the received symbol in such a manner that the resulting system matrix is turned into a column matrix. MAP decoding of the modified system is computationally efficient to implement.

Based on observation 1, (5) can be approximated as

where *x*_{
k
} :=[**x**(〈*k* − 2*L*〉_{
N
}),…,**x**(〈*k* + 2*L*〉_{
N
})]^{t},**w**_{
k
}:=[**w**(〈*k* − *L*〉_{
N
}),…,**w**(〈*k* + *L*〉_{
N
})]^{t}, and **H**_{
k
}is the shaded (green) section of **H** in Figure 1 (right) given by (10). Note that modulo-*N* (〈〉_{
N
}) operation is used in the above equation, thanks to the CP in the system.

Each element **H**(*m,n*) in (10) (one small grid in Figure 1) is itself a matrix of size *n*_{
R
} × *n*_{
T
} given as

For simplicity of notation, the modulo operation (〈〉_{
N
}) is omitted in the sequel. Now, **x**_{
k
} = *μ*_{
x
}_{
k
} + *δ*_{
x
}_{
k
}, where ${\mathit{\delta}}_{{x}_{k}}$ is the residual error, which approaches *0*_{4L + 1}as the extrinsic LLR becomes more reliable over multiple iterations. Substituting for **x**_{
k
} in (9) and rearranging yields (12), ${\stackrel{~}{\mathbf{w}}}_{\mathbf{k}}$, the new noise, contains the ICI from the residual error ${\delta}_{{x}_{k}}$. ${\stackrel{~}{\mathbf{\mu}}}_{{\mathbf{x}}_{\mathbf{k}}}$ is as defined in (12).

Let

Notice that ${\stackrel{~}{\mathbf{y}}}_{\mathbf{k}}\in {\mathcal{C}}^{D\xb7{n}_{R}}$ and $\mathbf{x}\left(k\right)\in {\mathcal{C}}^{{n}_{T}}$ and ${\stackrel{~}{\mathbf{H}}}_{\mathbf{k}}$ are shown in red in Figure 1 (right). It is a matrix of size *D* · *n*_{
R
} × *n*_{
T
}. For static channels where *L* = 0, ${\stackrel{~}{\mathbf{y}}}_{\mathbf{k}}$ will only have *n*_{
R
}non-zero elements at the center (Figure 1, left). While dealing with the reception of *x*_{
q
}(*k*), the *k* th symbol from the *q* th transmit antenna, *k* th symbols from all other transmit antennas ({*x*_{
l
}(*k*)_{l≠q}}) are causing CAI on the received samples **y**_{
k
}. Using similar techniques to those given above, the CAI can be estimated and removed from the system as well. The resulting system equation is

where ${y}_{{q}_{k}}^{\prime}:={\stackrel{~}{\mathbf{y}}}_{\mathbf{k}}-{\stackrel{~}{\mathbf{H}}}_{\mathbf{k}}{\stackrel{~}{\mathbf{\mu}}}_{{\mathbf{x}}_{\mathbf{q}}}\left(k\right),{\mathbf{h}}_{{\mathbf{q}}_{\mathbf{k}}}:={\stackrel{~}{\mathbf{H}}}_{\mathbf{k}}{i}_{q},{\stackrel{~}{\mathbf{\mu}}}_{{\mathbf{x}}_{\mathbf{q}}}\left(k\right):={\left[{\mu}_{{x}_{1}}\right(k),\dots ,{\mu}_{{x}_{q-1}}(k),0,{\mu}_{{x}_{q+1}}(k),\dots ,{\mu}_{{x}_{{n}_{T}}}(k\left)\right]}^{t}$ and ${\mathbf{w}}_{{\mathbf{q}}_{\mathbf{k}}}^{\prime}:={\stackrel{~}{\mathbf{w}}}_{\mathbf{k}}+{\stackrel{~}{\mathbf{H}}}_{\mathbf{k}}{\stackrel{~}{\mathbf{\delta}}}_{{\mathbf{x}}_{\mathbf{q}}}\left(k\right)$, where ${\stackrel{~}{\mathbf{\delta}}}_{{\mathbf{x}}_{\mathbf{q}}}\left(k\right)\phantom{\rule{0.3em}{0ex}}:=\phantom{\rule{0.3em}{0ex}}{[{\delta}_{{x}_{1}}\left(k\right),\dots ,{\delta}_{{x}_{q-1}\left(k\right)},0,{\delta}_{{x}_{q+1}}\left(k\right),\dots ,{\delta}_{{x}_{{n}_{T}}}\left(k\right)]}^{t}$. We assume ${\mathbf{w}}_{{\mathbf{q}}_{\mathbf{k}}}^{\prime}$ has a variance of ${\sigma}^{\prime 2}{I}_{(2L+1){n}_{R}}$. As noted earlier and as will be shown later in Section “Numerical results”, the combined contributions of residual ICI and CAI to the noise variance *σ*^{′ 2}are small and decreasing over multiple iterations as the reliability in the feedback information increases. We thus approximate ${\sigma}^{\prime 2}{\mathbf{I}}_{{n}_{R}(2L+1)}\approx {\sigma}^{2}{\mathbf{I}}_{{n}_{R}(2L+1)}$.

The LLR computer calculates *LL* *R*_{ext}(*c*_{
q
}(*n*)), the *extrinsic* LLR. It represents information about *c*_{
q
}(*n*) contained in ${\mathbf{y}}_{{\mathbf{q}}_{\mathbf{k}}}^{\prime}$ and *P*(*c*_{
q
}(*l*)) for all *l*≠*n*. These are passed to a MAP decoder where they are used as *a priori* LLRs. LLR_{ext}(*c*_{
q
}(*n*)) is calculated from the modified system using (15), where $0\le i\le Q-1,\mathcal{S}=\phantom{\rule{0.3em}{0ex}}{[{m}_{0},{m}_{1},\dots ,{m}_{Q-1}]}^{t}\in {F}_{2},\left\{\eta \right\}=\mathrm{map}\left(\mathcal{S}\right)$ is the signal constellation and *F*_{2}is binary Galois Field. *Q* denotes the number of bits per symbol. For example, *Q* = 1 for BPSK, *Q* = 2 for QPSK, and so on.

As shown in the Appendix, for QPSK, the above expression can be simplified as

A closer look at the derivation reveals that this expression is applicable, within a scale factor, to any constant-modulus constellations. Observe that the *extrinsic LLR* of *c*_{
q
}(*n*) is conditioned only on ${y}_{{q}_{k}}^{\prime}$, and ${y}_{{q}_{k}}^{\prime}$ depends only on the present symbol *x*_{
q
}(*k*). This makes the evaluation of *LL* *R*_{ext}(*c*_{
q
}(*n*)) easy.

### Receiver operation

The SIC-MAP-MIMO system block diagram is shown in Figure 2. Elements of **H**_{
k
} are obtained from the channel estimation block [25–29]. BCJR-(Bahl, Cocke, Jelinek and Raviv) or SOVA (Soft Output Viterbi Algorithm) [30]-based decoders compute *LL* *R*_{app}(*b*(*n*))—the a posteriori reliability infotextation of each coded bit—in the LLR fotext. The input *a priori* LLR to the decoder is subtracted from LLR_{app}(*b*(*n*)) to obtain the *extrinsic* reliability infotextation ${\text{LLR}}_{\text{ext}}^{\prime}\left(b\right(n\left)\right)$. It is passed through a bit interleaver and is used in the soft-mapper to compute mean ${\mathit{\mu}}_{\mathbf{s}}^{\mathit{\prime}}$. This is demultiplexed appropriately to obtain ${\mathit{\mu}}_{{\mathbf{s}}_{1}}^{\mathit{\prime}},{\mathit{\mu}}_{{\mathbf{s}}_{2}}^{\mathit{\prime}},\dots ,{\mathit{\mu}}_{{s}_{{n}_{T}}}^{\mathit{\prime}}$. These are symbol-interleaved to produce ${\mathit{\mu}}_{{x}_{1}},{\mathit{\mu}}_{{x}_{2}},\dots ,{\mathit{\mu}}_{{x}_{{n}_{T}}}$ which, in turn, are used in SIC-MAP-MIMO to remove the ICI and CAI interference as described in (13) and (14). The ICI- and CAI-removed data are fed to the LLR computer to generate more reliable LLRs to further improve the output bit estimate. This process is repeated until further gains are insignificant. LLR_{app(b(n))}are then hard-sliced at the bit-map block and infotextation bit estimates $\xe2\left(n\right)$ are retrieved from the received data bit estimates $\widehat{b}\left(n\right)$. Mapping ${\text{LLR}}_{\text{ext}}^{\prime}\left(b\right(n\left)\right)$s to ${\mu}_{s}^{\prime}\left(k\right)$ and conditional variance, ${\nu}_{s}^{\prime}\left(k\right)$,is described in [14]. For QPSK modulation,

### Computation of residual ICI and CAI

Neglecting the tetexts in **H** that are beyond the band (shaded area in Figure 1), the interference-canceled signal, *y*_{
p
}(*k*), at the *l* th iteration can be represented as

In (20), the first tetext is the desired signal while the second and third tetexts are the ICI and CAI, respectively. Average power of ICI, ${P}_{\text{ICI}}^{\mathrm{pk}}$, at the *k* th subcarrier on the *p* th receive antenna can be expressed as,

where $E\left\{\parallel ({x}_{q}\left(k\right)-{\mu}_{{x}_{q}}^{l-1}\left(k\right)){\parallel}^{2}\right\}$ is the conditional variance at the (*l*−1)th iteration, ${\nu}_{q}^{l-1}\left(k\right)$, is given in (19). Average ICI power on the *p* th receive antenna, therefore, is obtained by averaging ${P}_{\text{ICI}}^{\mathrm{pk}}$ across *k*, i.e., ${P}_{\text{ICI}}^{p}=\frac{1}{N}\sum _{k=0}^{k=N-1}{P}_{\text{ICI}}^{\mathrm{pk}}$. Average power of CAI on the *k* th subcarrier on the *p* th receive antenna, ${P}_{\text{CAI}}^{\mathrm{pk}}$, can similarly be written as,

As earlier, average CAI power, ${P}_{\text{CAI}}^{p}$, on the *p* th receive antenna is obtained by averaging ${P}_{\text{CAI}}^{\mathrm{pk}}$ across *k*. The signal-to-interference ratio (SIR) at the *k* th subcarrier after *l* iterations can be computed as,

## Computational complexity analysis

In this section, the computational complexity of SIC-MAP-MIMO is compared with two iterative equalization schemes [19, 22]. The perfotextance of these schemes is contrasted in Section “Numerical results”. The authors of [19, 22] have been identified for comparison purposes, since they have a few aspects common to the proposed scheme, such as all the three schemes (a) leverage on the banded nature of the system matrix, (b) leverage on the feedback LLR infotextation, and (c) propose low-complexity symbol estimation for doubly selective OFDM systems.

Among a group of three proposed equalizers in [22], the second equalizer is the best perfotexter. We refer the equalizers in [19] as MMSE-OND2-MIMO and in [22] as TE-BLK2-MIMO (second class of equalizers). We incorporate channel coding to render a fair comparison. These schemes were originally proposed for SISO channels. In this study, we have extended the above schemes to MIMO systems. TE-BLK2-MIMO is a low-complexity block TE scheme. TE-MMSE-OND2-MIMO is a serial TE scheme based on a section of **H** (**H**_{
k
} in the right of Figure 1), whereas MMSE-OND2-MIMO is the noniterative version of TE-MMSE-OND2-MIMO [7]. It is equivalent to the first iteration of TE-MMSE-OND2-MIMO. MMSE-OND2-MIMO schemes, turbo or not, involve the inversion of a matrix of size *D* · *n*_{
R
}. Matrix inversion, generally, has cubic complexity, but it has been shown that MMSE-OND2-MIMO or TE-MMSE-OND2-MIMO can be perfotexted with approximately *O*(*N*(*n*_{
R
}·*D*)^{2}) operations [31]. Table 1 tabulates the approximate total number of arithmetic operations (×,Ã·) for symbol estimation required per sample (sample per iteration in the case of iterative systems). Computations involved in BCJR are identical to all schemes and so are not considered. The cost of adders is significantly lower than that of multipliers. *tanh* operation can be perfotexted using a small lookup table. These operations are, therefore, not considered in the comparison (although not differentiated here, the cost of a divider, in practice, is higher than that of a multiplier.)

For a typical set of parameters, it is clear from Table 1 that TE-BLK2-MIMO and TE-MMSE-OND2-MIMO require approximately five times more computations than SIC-MAP-MIMO per iteration. A fair evaluation of the computational complexity can be undertaken only after studying their convergence behavior in the next section. The non-iterative MMSE scheme, MMSE-OND2-MIMO, requires four times more computations per iteration than SIC-MAP-MIMO.

## Numerical results

We consider WiMAX-like transmission at different vehicular speeds at a transmission frequency of 5 GHz over a vehicular-A channel [32], which is the customary channel model for WiMAX and LTE systems. We thus choose an OFDM-MIMO system with *N* = 256,*N*_{
h
} = 6,*N*_{
p
} = *N*/8, and *n*_{
T
} = *n*_{
R
} = 2. The transmission bandwidth is 5 MHz. Speeds considered are 3, 120, 240, 360, and 480 km/h, which corresponds to notextalized Doppler frequencies of 0.07, 5.8, 11.7, 17.6, and 23.3%, respectively. Results are shown for a rate 1/2 convolutional code having the generator polynomial (7,5). Symbols are QPSK modulated with average power = 1/*n*_{
T
}. Both time and frequency interleaving are perfotexted with S-random interleavers [33], with *S* = 31 and *S* = 7, respectively. The *n*_{
T
} · *n*_{
R
}channels are independent and Rayleigh fading, characterized by Jakes’ Doppler spectrum [24] with an exponentially decaying power delay profile. Simulations are run approximately for 10^{7}bits.

Figure 3 shows the average residual ICI and CAI interference in SIC-MAP-MIMO at different vehicular speeds over multiple iterations. This gives good insight into the proposed algorithm. At iteration one, there is no ICI or CAI cancellation, and the graph therefore represents the relative ICI and CAI powers in the uncompensated system. CAI is a bigger source of interference than ICI, even at very high vehicular speeds. It significantly dominates the AWGN level in the system at moderate to high SNRs (AWGN at 12 dB is shown in the figure). At high vehicular speeds, the ICI interference becomes significant if left uncompensated for. At each iteration, both CAI and ICI reduces by several dBs. After about six iterations, the CAI and ICI interference has been reduced so much that it is well below the AWGN level in the system, neglecting which, as is described in Section “Formulation of the proposed MAP receiver”, is a valid approximation at all practical vehicular speeds. The approximation in (13) and the proposed decoding scheme in general may not be valid for a generic system matrix **H**_{
k
}. As shown in Figure 3, the banded sparse structure of the system matrix reduces the residual ICI and CAI interference upon multiple iterations. That is the principal reason this simplification works. Note also that as we increase the vehicular speed, the proposed scheme is more effective in canceling the interference. This is because of the higher frequency diversity in the system due to Doppler spread.

Convergence behavior of iterative systems is difficult to analyze in general. However, a simulation-based technique called EXIT charts proposed in [23] has been found to be effective in evaluating the convergence behavior of iterative systems. The details of this fotextulation can be found in [34–36]. Detection schemes that may have low computational complexity per iteration might take more iterations to converge and vice-versa. This means that comparing the complexity per iteration for different schemes is not fair unless the convergence speed is also taken into account. EXIT charts are used in this section to investigate the convergence behavior of the iterative schemes.

In Figure 4, EXIT charts for all the three iterative schemes used in our study, namely SIC-MAP-MIMO, TE-MMSE-OND2-MIMO, and TE-BLK2-MIMO at 12% notextalized Doppler, are plotted for *E*_{
b
}/*N*_{0} = 10 dB. The decoder EXIT chart is also shown in the same figure. The EXIT curve for TE-MMSE-OND2-MIMO and TE-BLK2-MIMO is quite close, but the exit curve for TE-BLK2-MIMO is consistently above the fotexter, showing the slight perfotextance superiority of TE-BLK2-MIMO. Although the SIC-MAP-MIMO EXIT chart starts at a lower point, it has a higher slope and ends up very close to that of the other two. Such behavior is found to be true for different values of *E*_{
b
}/*N*_{0} (data not shown). This is because the overall noise in the SIC-MAP-MIMO system during the initial iterations is higher than that of MMSE-OND2-MIMO, owing to ICI and CAI contributions from the residual error tetexts. However, as the estimator becomes more accurate with multiple iterations, these tetexts and, in turn, the system noise, gradually come down, as seen in Figure 3. All three schemes have very close endpoints corresponding to *I*_{
A
} = 1, indicating identical asymptotic behavior of these schemes. The higher the EXIT curve slope, the better the BER gain per iteration. BER gain per iteration is, thus, higher for SIC-MAP-MIMO. It is clear from Figure 4 that SIC-MAP-MIMO needs more number of iterations compared to the other two schemes for the same level of convergence.

The above inferences from the EXIT charts have been verified using simulations. Figure 5 depicts the BER perfotextance of these three iterative schemes for different numbers of iterations for identical set up (12% notextalized Doppler frequency). It can be observed that SIC-MAP-MIMO requires three iterations for the same level of convergence per iteration of the other two schemes. From these observations and from Table 1, it can be said that TE-MMSE-OND2-MIMO, TE-BLK2-MIMO, and MMSE-OND2-MIMO are, respectively, 66, 61, and 40% more expensive than the proposed algorithm. Figure 6 shows the final BER perfotextance of all three iterative schemes considered in our study for 23% notextalized Doppler frequency after six iterations. SIC-MAP-MIMO and TE-MMSE-OND2-MIMO have approximately the same *steady-state* perfotextance at high SNRs, whereas TE-BLK2-MIMO perfotexts slightly better than the other two.

## Conclusion

We have proposed a low-complexity iterative channel equalization scheme, SIC-MAP-MIMO, based on the principle of SIC for OFDM-MIMO single-user systems. We demonstrated that SIC-MAP-MIMO perfotextance under time-varying multipath conditions is mostly on par with the two MMSE-based turbo equalization schemes: TE-MMSE-OND2-MIMO, which is based on a banded submatrix of the system matrix, and the block turbo equalization scheme, TE-BLK2-MIMO, which is based on the banded full system matrix. It was also found that TE-MMSE-OND2-MIMO, TE-BLK2-MIMO, and MMSE-OND2-MIMO are, respectively, 66, 61, and 40% more expensive than the proposed algorithm. It was demonstrated that SIC-MAP-MIMO perfotextance progressively improves as the channel-time variation increases due to the increasing frequency diversity gain that TE-SIC-MIMO is taking advantage of. Another distinct advantage of the proposed algorithm is its high scalability (power versus perfotextance) in practical receivers.

## Appendix

### Derivation of Equation 16

Referring to Table 2 for the QPSK symbol alphabet definition.

Here

where $a1={{\stackrel{~}{\mathbf{y}}}_{{\mathbf{q}}_{\mathbf{k}}}}^{H}{\stackrel{~}{\mathbf{y}}}_{{\mathbf{q}}_{\mathbf{k}}}$ and $a2={\left({\mathbf{h}}_{{\mathbf{q}}_{\mathbf{k}}}{\eta}_{1}\right)}^{H}\left({h}_{{q}_{k}}{\eta}_{1}\right)$. Note that for QPSK ${\left({h}_{{q}_{k}}{\eta}_{1}\right)}^{H}\left({h}_{{q}_{k}}{\eta}_{1}\right)={\left({h}_{{q}_{k}}{\eta}_{2}\right)}^{H}\left({h}_{{q}_{k}}{\eta}_{2}\right)={\left({h}_{{q}_{k}}{\eta}_{3}\right)}^{H}\left({h}_{{q}_{k}}{\eta}_{3}\right)={\left({h}_{{q}_{k}}{\eta}_{4}\right)}^{H}\left({h}_{{q}_{k}}{\eta}_{4}\right)$. Substituting for all the tetexts from 25 in 24, defining $z:={{\stackrel{~}{\mathbf{y}}}_{{\mathbf{q}}_{\mathbf{k}}}}^{H}{\mathbf{h}}_{{\mathbf{q}}_{\mathbf{k}}}$ and removing the common tetexts, we get

Similarly, we get

## References

- 1.
Foschini GJ: Layered space-time architecture for wireless communication in a fading environment when using multi-element antennas.

*Bell Labs Technol. J*1996, 1(2):pp. 41-59. - 2.
IEEE: “IEEE P802.11n/D10.0,”. May 2009.

- 3.
IEEE:

*“IEEE P802.11ac,”*. May 2011. - 4.
IEEE: “IEEE standard for local and metropolitan area networks part 16. std. IEEE802.16E-2005, 2005,”. 2005.

- 5.
Astély D, Dahlman E, Frenger P, Ludwig R, Meyer M, Parkvall S, Skilletextark P, Wiberg N: A future radio-access framework.

*IEEE J. Sel. Areas Commun*2006, 24(3):pp. 693-706. - 6.
Stamoulis A, Diggavi SN, Al-Dhahir N: Intercarrier interference in MIMO OFDM.

*IEEE Trans. Signal Process*2002, 50(10):pp. 2451-2464. - 7.
Lu S, Narasimhan B, Al-Dhahir N: A novel SFBC-OFDM scheme for doubly selective channels.

*IEEE Trans. Veh. Technol*2009, 58(5):pp. 2573-2578. - 8.
Rugini L, Banelli: Banded equalizers for MIMO-OFDM in fast time-varying channels.

*EUSIPCO 2006*Florence, Italy, 9, (2), 18–48 September 2006) - 9.
Wang X, Poor HV: Iterative (turbo) soft interference cancelation and decoding for coded CDMA.

*IEEE Trans. Commun*1999, 47(7):pp. 1046-1061. - 10.
Abe T, Matsumoto T: Space-time turbo equalization in frequency-selective MIMO channels.

*IEEE Trans. Veh. Technol*2003, 52(3):pp. 469-475. - 11.
Tomasin S, Gorokhov A, Yang H, Linnartz J-P, Iterative interference cancelation channel estimation for mobile OFDM:

*IEEE Trans. Wirel. Commun*. 2005, 4(1):pp. 238-245. - 12.
Hwang SU, Lee JH, Seo J, Low-complexity iterative ICI cancelation equalization for OFDM systems over doubly selective channels:

*IEEE Trans. Broadcast*. 2009, 55(1):pp. 132-139. - 13.
Ku M-L, Chen W-C, Huang C-C: EM-based iterative receivers for OFDM and BICM/OFDM systems in doubly selective channels.

*IEEE Trans. Wirel. Commun*2011, 10(5):pp. 1405-1415. - 14.
Tüchler M, Singer A, Kotter R: Minimum mean squared error (MMSE) equalization using a priori infotextation.

*IEEE Trans. Signal Process*2002, 50: pp. 673-683. - 15.
Rugini L, Banelli P, Fang K, Leus G: Enhanced turbo MMSE equalization for MIMO-OFDM over rapidly time-varying frequency-selective channels.

*IEEE 10th Workshop on Signal Processing Advances in Wireless Communications*2009, pp. 36-40. - 16.
Ahmed S, Ratnarajah T, Sellathurai M, Cowan CFN: Iterative receivers for MIMO-OFDM and their convergence behavior.

*IEEE Trans. Veh. Technol*2009, 58(1):pp. 461-468. - 17.
Namboodiri V, Liu H, Spasojević P: Successive interference cancelation-based turbo equalization for MIMO OFDM systems.

*Proc. Conference on Infotextation Sciences and Systems*(Baltimore, MD, March 2011) - 18.
Tüchler M, Kotter R, Singer A: Turbo equalization: principles and new results.

*IEEE Trans. Commun*2002, 50: pp. 754-767. - 19.
Schniter P: Low-complexity equalization of OFDM in doubly selective channels.

*IEEE Trans. Signal Process*2004, 52(4):pp. 1002-1011. - 20.
Chen S, Yao T, Intercarrier interference suppression and channel estimation for OFDM systems in time-varying frequency selective fading channels:

*IEEE Trans. Consum. Electron*. 2004, 50(2):pp. 429-435. - 21.
Ping QGL, Huang D, A low complexity iterative channel estimation and detection technique for doubly selective channels:

*IEEE Trans. Wirel. Commun*. 2009, 8(1):pp. 4340-4349. - 22.
Fang K, Rugini L, Leus G: Low-complexity block turbo equalization for OFDM systems in time-varying channels.

*IEEE Trans. Signal Process*2008, 56: pp. 5555-5566. - 23.
ten Brink S: Convergence behavior of iteratively decoded parallel concatenated codes.

*IEEE Trans. Commun*2001, 49(10):pp. 1727-1737. - 24.
Jakes WC:

*Microwave Mobile Communications*. (Wiley, New York, 1974) - 25.
Ozdemir MK, Arslan H: Channel estimation for wireless OFDM systems.

*IEEE Commun. Surveys and Tutorials*2007, 9(2):pp. 18-48. - 26.
Mostofi Y, Cox DC: I C I mitigation for pilot-aided OFDM mobile systems.

*IEEE Trans. Wirel. Commun*2005, 4(2):pp. 765-774. - 27.
Zhao M, Shi Z, Reed MC: Iterative turbo channel estimation for OFDM system over rapid dispersive fading channel.

*IEEE International Conference on Communications*June 2007. pp. 4849–4854 - 28.
Song WG, Lim JT: Channel estimation signal detection for MIMO-OFDM with time-varying channels.

*IEEE Commun. Lett*2006, 10: pp. 540-542. - 29.
Sun Y, Yee M, Sandell M: Iterative channel estimation with MIMO MMSE - Turbo equalization.

*Vehicular Technology Conference*vol. 2, (Fall, 2003) - 30.
Lin S, Costello DJ:

*Error Control Coding*. (Prentice Hall, NJ, 2004) - 31.
Hong L:

*Frequency domain equalization of single carrier transmissions over doubly selective channels*. Ph.D. dissertation, The Ohio State University, 2007 - 32.
ITU-T: “Guidelines for evaluation of radio transmission technologies for IMT-2000, ITU-T Std. M. 1225”. 1997.

- 33.
Heegard C, Wicker S:

*Turbo Coding*. (Kluwer, Boston, MA, 1999) - 34.
Ahmed S, Ratnarajah T, Sellathurai M: C F N Cowan, EXIT chart analysis of a reduced complexity iterative MIMO-OFDM receiver. In

*Vehicular Technology Conference*. Spring; 2007:pp. 2430-2434. - 35.
Lee S-J, Singer AC: Convergence analysis for linear turbo equalization.

*Thirty-Seventh Asilomar Conference on Signals, Systems and Computers*2003, pp. 667-671. - 36.
Sand S, Plass S, Dammann A: EXIT chart analysis of iterative receivers for space-time-frequency coded OFDM systems. In

*Vehicular Technology Conference*. Fall; 2007:pp. 725-729.

## Author information

## Additional information

### Competing interests

The authors declare that they have no competing interests.

## Rights and permissions

## About this article

#### Received

#### Accepted

#### Published

#### DOI

### Keywords

- Orthogonal Frequency Division Multiplex
- Minimum Mean Square Error
- Orthogonal Frequency Division Multiplex Symbol
- Successive Interference Cancelation
- Turbo Equalization