Training design for precoded BICM-MIMO systems in block-fading channels

Andalibi, Zohreh; Nguyen, Ha H; Salt, Joseph E

doi:10.1186/1687-1499-2012-80

Research
Open access
Published: 04 March 2012

Training design for precoded BICM-MIMO systems in block-fading channels

Zohreh Andalibi^1,2,
Ha H Nguyen^1,2 &
Joseph E Salt^1,2

EURASIP Journal on Wireless Communications and Networking volume 2012, Article number: 80 (2012) Cite this article

2342 Accesses
1 Citations
Metrics details

Abstract

In order to improve bandwidth efficiency and error performance, a new training scheme is proposed for bit-interleaved-coded modulation in multiple-input multiple-output (BICM-MIMO) systems. Typically, in a block-fading channel, the training overhead used for obtaining channel knowledge is proportional to a power of 2 of the number of transmit antennas. However, this overhead can be reduced by embedding pilot symbols within data symbols before precoding. The values, positions, and the number of pilot symbols are found by minimizing the Cramer-Rao bound on the channel estimation error. Computer simulations are presented to demonstrate the advantage of the proposed scheme over other training methods, in terms of both the mean-square-error of the channel estimation and the system's frame-error-rate.

1 Introduction

The pioneering work on multiple-input multiple-output (MIMO) systems [1] shows that a MIMO system can provide a multiplexing gain and accordingly high spectral efficiency over slow fading channels. On the other hand, to achieve a high diversity order, space-time transmission techniques can be implemented at the transmitter [2, 3]. To achieve both high diversity order and coding gain in coded modulation systems, the concept of space-time transmission has also been applied [4, 5]. In such systems, space-time transmission is typically implemented using a linear space-time matrix, or equivalently a linear precoder, so that a single modulation symbol is efficiently transmitted across multiple transmit antennas. Among many research works on precoder design for coded modulation systems with multiple antennas, the design that considers all the relevant components of the transmitter, namely precoding, modulation, and interleaver, can be found in [5–7]. Specifically, a full-rate precoder with any size and for any number of transmit antennas is designed in [6] to maximize the achievable diversity order and coding gain in MIMO block-fading channels.

It is shown in [6] that the maximum achievable diversity order can be realized by an iterative receiver that employs a soft-input soft-output detector [5] and under the assumption of having the perfect channel state information (CSI) at the receiver. In practice, however, CSI has to be estimated using a channel estimator and it is never perfect. Two types of channel estimators have been used for MIMO block-fading channels in coded modulation systems, i.e., training-based and semi-blind channel estimators [8, 9]. In both types of channel estimators, known signals are used to estimate the CSI at the first iteration of the iterative receiver.

Conventionally, for block-fading channels, known signals or the training sequence is included at the beginning of each data block, which is called time-multiplexed training or pilot symbol-assisted modulation (PSAM) scheme [10]. This scheme however reduces bandwidth efficiency of MIMO systems, since the amount of training overhead needed is at least a power of 2 of the number of transmit antennas [11] to ensure the identifiability of the MIMO channel. A straightforward application of the PSAM scheme to a BICM-MIMO system would be time-multiplex data information with the training information after the precoder.

As an alternative to the above conventional PSAM scheme, a potential benefit can be sought by time-multiplexing data information with the training information before the precoder in the transmitter. This new approach shall reduce the required training overhead compared to the conventional PSAM, since the transmitted training symbols are spread over more time periods; thanks to the precoder. This approach shall be referred to as precoded PSAM (PPSAM). Investigating power and time allocations of the training symbols in PPSAM scheme is the main objective of this article.

Moreover, by multiplexing the training sequence before precoder, training symbols can be exploited in both the initialization and iteration phases of the iterative channel estimation process. This is different from a conventional iterative channel estimator using PSAM scheme, in which training sequence is only used at the initialization phase. A natural question is whether the optimal training design for the initialization phase using PPSAM scheme is still optimal for subsequent iterations of an iterative channel estimator. On the one hand, the channel estimation error at the initialization phase translates to an SNR shift in the BER performance [8]. On the other hand, the channel estimation error from the last iteration of the iterative estimator has a strong impact on the error floor of the BER performance [12]. Therefore, optimal training sequence should be designed carefully that considers both initialization and iteration phases.

One of different criteria that have been used to design training sequences is the minimization of the Cramer-Rao bound (CRB) of the channel estimation error [10]. This criterion shall be used in this article due to two main reasons. First, it is directly related to the channel estimation error. Second, since the CRB is a lower bound on the mean-squared-error (MSE) of any unbiased estimator, designing training sequences using this criterion would be applicable to many estimation algorithms. Other design criteria, such as maximizing the channel capacity [8] and minimizing the outage probability [13], are based on some specific channel estimation algorithms.

The article is organized as follows. The system model of BICM-MIMO is presented in Section 2. In Section 3 a lower bound on the MSE of the channel estimator is obtained and the training sequence is designed by minimizing this bound. Section 4 provides numerical results and comparisons. Section 5 concludes the article.

2 System model

Figure 1 shows the block diagram of a BICM-MIMO system under consideration. At the transmitter, a channel encoder with a rate-r error-correcting code converts the vector of information bits b into a codeword c. The coded bits are then interleaved by a random interleaver as described in [6] to produce the interleaved codeword $\tilde{c}$ . The interleaved codewords are segmented into groups of (Nn_t - N_p) × m bits, where N is the spreading factor of the precoder, n_t is the number of transmit antennas, N_p is the number of pilot symbols in Nn_t precoded symbols and m is the number of bits carried by one symbol of a QAM constellation whose size is |Ω| = 2 ^m . Next, the coded bits are mapped to (Nn_t - N_p) QAM constellation points. In this step, N_p known pilot symbols are inserted in every segmented group of (Nn_t - N_p) data symbols to produce N super-symbols. Here, each super-symbol refers to a group of n_t consecutive symbols. Investigating the positions and the number of pilot symbols (i.e., N_p) to be used in each Nn_t symbols is the main objective of this article.

Every group of N super-symbols is then spread over N time periods using a linear precoder G. The Nn_t × Nn_t matrix G multiplies a vector of Nn_t QAM symbols at the precoder input, and generates Nn_t symbols to be transmitted over n_t antennas, over N time periods.

This is illustrated in Figure 2. Let $x_{k} = [x_{(k - 1) N n_{t} + 1}, x_{(k - 1) N n_{t} + 2}, \dots, x_{(k - 1) N n_{t} + N n_{t}}]$ be the k th vector to be precoded. Then, x _k G gives the precoded symbols. Here, x_i 's are complex data or pilot symbols belonging to the 2 ^m -QAM constellation Ω. It is assumed that the data symbols x_i 's are i.i.d with variance $σ_{x}^{2}$ . After precoding, precoded symbols are transmitted through n_t transmit antennas over a block-fading channel.

With n_t transmit antennas and n_r receive antennas, the channel is modeled by an n_t × n_r matrix. For frequency-flat Rayleigh fading, coefficients of the channel matrix are i.i.d. zero-mean circularly symmetric complex Gaussian random variables with variance $σ_{h}^{2}$ . The channel is assumed to be block fading with n_c different channel realizations during each codeword. For the k th symbol to be precoded, x _k, the Nn_t × Nn_r extended channel matrix, H _k, can be written as

H_{k} = diag \{\underset{N / n_{s}}{\underset{⏟}{H_{k}^{[1]}, \dots, H_{k}^{[1]},}} H_{k}^{[2]}, \dots, H_{k}^{[2]}, \dots, H_{k}^{[n_{s}]}, \dots, H_{k}^{[n_{s}]}\},

(1)

where n_s is the number of distinct channel realizations during N time periods of each codeword. To simplify the notation it is also assumed^(a) that n_s divides N. For example, if the length of a codeword is 64 and n_c = 32, then choosing N = 2 would make n_s = 1, whereas choosing N = 4 gives n_s = 2. Notation $H_{k}^{[t]}$ refers to the n_t × n_r complex matrix k that defines the t th channel realization included in n_s channel realizations. The extended channel input/output relationship is expressed by

y_{k} = x_{k} G H_{k} + w_{k}

(2)

where $y_{k} = [y_{(k - 1) N n_{r} + 1}, y_{(k - 1) N n_{r} + 2}, \dots, y_{(k - 1) N n_{r} + N n_{r}}]$ is the received vector at the k th precoding time period and w_k is the noise vector with size 1 × Nn_r whose components are i.i.d zero-mean circularly symmetric Gaussian random variables with variance N₀. It is noted from (2) that although both data and pilot symbols are precoded, the part of the precoder that multiplies the pilot symbols depends on the positions of the pilot symbols in x _k. Equivalently, the design of the pilot symbols is governed by the properties of the precoder used. Since this study adopts the transmission framework and precoder design in [6], it is useful to review the properties of the precoder proposed in [6].

In general, the properties of the precoder in [6] are established by the maximum-likelihood decoding analysis and an assumption of ideal channel interleaving. Specifically, this linear precoder which achieves full diversity order and maximum coding gain satisfies the following two conditions:

A genie condition, which guarantees orthogonal and equal norm sub-rows in the linear precoding matrix. Each sub-row has size n_t in a precoding matrix with size Nn_t× Nn_t.
Dispersive nucleo algebraic (DNA) condition, which is based on Proposition 2 in [6], forces null and orthogonal nucleotides with size s' = N/n_s. Nucleotides refer to subparts of sub-rows with size s'.

A linear precoder that satisfies the above two sets of conditions is called DNA-cyclo precoder and has the best performance in terms of achieving diversity and coding gains with low complexity receiver when N ≤ n_t. It is suggested in [6] that to generate one class of such a precoder, a Ns' × Ns' cyclotomic rotator, denoted by Φ, that satisfies the genie condition is first selected. Then the orthogonal nucleotides are placed inside an Nn_t× Nn_t matrix and they are separated with null nucleotides. Therefore, the DNA-cyclo precoder matrix can be expressed by subparts of a cyclotomic rotator as follows:

G = [\begin{matrix} I_{n_{t} / s^{'}} \otimes Φ^{[1] [1]} & \dots & I_{n_{t} / s^{'}} \otimes Φ^{[N] [1]} \\ I_{n_{t} / s^{'}} \otimes Φ^{[1] [2]} & \dots & I_{n_{t} / s^{'}} \otimes Φ^{[N] [2]} \\ ⋮ & ⋱ & ⋮ \\ I_{n_{t} / s^{'}} \otimes Φ^{[1] [N s^{'}]} & \dots & I_{n_{t} / s^{'}} \otimes Φ^{[N] [N s^{'}]} \end{matrix}]

(3)

where Φ^{[i ] [j]}is the i th sub-row of the j th row of Φ with size 1 × s', I_n is an identity matrix with size n × n and ⊗ denotes the Kronecker product.

The properties that shall be useful for the problem considered in this article, which are implied directly from the genie and DNA conditions, are ΦΦ^H = I_{Ns '} and $Φ^{[i] [t]} {(Φ^{[j] [t]})}^{H} = \frac{1}{N} δ (i - j)$ . It is also useful to point out that each component of Φ has an exponential form with a scaling factor of $\frac{1}{\sqrt{N s^{'}}}$ .

The iterative receiver is also shown in Figure 1. The channel estimator produces an estimate of the channel using the minimum MSE (MMSE) criterion based on the training sequence. Details about channel estimation with the proposed method of inserting training sequence shall be given in Section 3. After channel estimation is performed using the training signal, the soft-input soft-output demodulator uses the MMSE criterion to demodulate the data. The soft-output MMSE demodulator computes the extrinsic information for the interleaved bits, ${Λ_{ext}^{({\tilde{c}}_{l})}}$ , from the received symbols. To obtain Λ-values, the demodulator exploits the a priori information of the coded bits coming from the decoder, ${Λ_{ap}^{({\tilde{c}}_{l})}}$ , and the channel estimate ${\hat{H}}_{k}$ . In the first iteration, the demodulator assumes that the a priori Λ-values are zero, except for the pilot symbols. For the corresponding bits of the pilot symbols, the demodulator uses a large number, say ± 100 as their a priori Λ-values. The de-interleaved outputs, i.e., ${Λ_{ap}^{(c_{l})}}$ , become the a priori Λ-values used in the channel decoder shown in Figure 1 after removing the information of pilot symbols. The channel decoder uses the maximum a posteriori probability (MAP) algorithm to compute the extrinsic Λ-values ${Λ_{ext}^{(c_{l})}}$ . for all coded bits, which are used again in the next iteration in the demodulator. In subsequent iterations, soft information from the decoder is used to improve the performance of the channel estimator. The detailed operation of the iterative channel estimator is discussed in the following sections.

3 Training design and channel estimator

As discussed before, the criterion used for training design in this article is the CRB on the channel estimation error. The bound states that the MSE of any unbiased estimator is lower bounded by the trace of inverse of complex Fisher information matrix (FIM) [14]. To derive FIM, the relation between the channel input and channel output during one block-length, i.e., N/n_s time periods, whose corresponding channel matrix is $H_{k}^{[t]}$ , is of interest. In the following, index k is omitted, since it suffices to consider the transmission of a single precoded symbol for the purpose of channel estimation. With the previously described structure of the precoder, the channel output during one super-symbol time is given by

y^{[i, t]} = (I_{n_{r}} \otimes (\sum_{τ = 1}^{N s^{'}} x^{[τ]} \otimes Φ^{[i, t] [τ]})) h^{[t]} + w^{[i, t]}; t = 1, \dots, n_{s}, i = 1, \dots, s^{'}

(4)

where y^[i,t]= y^{[(t- 1)s'+i]}represents the ((t - 1)s' + i)th received symbol during N time periods, with size n_r × 1. Moreover, h^[t]is the column vector formed by vertically stacking the columns of an n_t × n_r channel realization matrix H^[t]and x^[τ]'s are constructed by splitting x in Ns' sub-vectors with size 1 × n_t/s'. In the following, we call these sub-vectors x^[τ]'s nucleo symbols.

It is quite obvious from (4) that, to have all the received super-symbols, y^[i,t], contain training information, there should be at least one pilot nucleo (i.e., n_t/s' pilot symbols) in each group of Ns' nucleos to be precoded.

With the above structure of the proposed training sequence, the number of pilot symbols in Nn_t transmitted symbols would be N_p = n_p× n_t/s', where n_p nucleo symbols in a symbol to be precoded are assigned to training sequence. Therefore, (4) can be rewritten as

y^{[i, t]} = (I_{n_{r}} \otimes (\sum_{τ \in I_{d}} x_{d}^{[τ]} \otimes Φ_{d}^{[i, t] [τ]} + \sum_{τ \in I_{p}} x_{p}^{[τ]} \otimes Φ_{p}^{[i, t] [τ]})) h^{[t]} + w^{[i, t]},

(5)

where $I_{d}$ and $I_{p}$ are sets of indexes from {1, . . . , Ns'}, that are assigned to data and pilot nucleos, respectively, and $| I_{d} | + | I_{p} | = (N s^{'} - n_{p}) + n_{p} = N s^{'}$ . Note that the subscripts "d" and "p" are used to differentiate between data and pilot nucleos. For convenience, the notations $Φ_{p}^{[i, t] [τ]}$ and $Φ_{d}^{[i, t] [τ]}$ are used to refer to sub-rows of Φ that are multiplied by pilot and data nucloes, i.e., $x_{p}^{[τ]}$ and $x_{d}^{[τ]}$ , respectively. Furthermore, in the following the notation T^[i,t]is used for $I_{n_{r}} \otimes (\sum_{τ \in I_{p}} x_{p}^{[τ]} \otimes Φ_{p}^{[i, t] [τ]})$ .

The derivation of FIM is given in the next section. Pilot symbols are exploited at the initialization phase and in subsequent iterations considering the special structure of the training sequence. In general, training design can be investigated for these two phases separately. However, for the precoder adopted in this article, the optimal training design obtained for the initialization phase turns out to also be optimal for the iteration phase. Nevertheless, the optimal numbers of pilot nucleos in these two phases of channel estimation are not the same.

3.1 Fisher information matrix

The key steps in deriving the FIM in the initialization phase are now given. Without loss of generality we drop superscript t in (5) and perform all the derivations for the first block period (i.e., t = 1). Collecting all the observations during the first block period of length s' in a vector φ, the FIM for the channel estimation problem at the initialization phase is defined and computed as

\begin{aligned} FI M^{init} (n_{p}, x_{p}, I_{p}) & = E_{φ, h} \{[\frac{\partial ln p (φ, h)}{\partial h^{*}}] {[\frac{\partial ln p (φ, h)}{\partial h^{*}}]}^{H}\} \\ = E_{h} \{E_{φ} \{[\frac{\partial ln p (φ | h)}{\partial h^{*}}] {[\frac{\partial ln p (φ | h)}{\partial h^{*}}]}^{H}| h\}\} \\ + E_{h} \{[\frac{\partial ln p (h)}{\partial h^{*}}] {[\frac{\partial ln p (h)}{\partial h^{*}}]}^{H}\} \end{aligned}

(6)

where $F I M^{i n i t} (n_{p}, x_{p}, ℐ_{p})$ shows the dependency of FIM on those parameters of interest. Using the i.i.d. assumption on noise and data, p(φ|h) can be approximated as a complex normal distribution with mean $μ = {[μ_{1}^{T}, \dots, μ_{s^{'}}^{T}]}^{T}$ and covariance R_φ= diag[R₁, . . . , R _{s '}]. Moreover, it follows from (5) that μ _i = E_φ {y^[i]|h} = T^[i]h and

R_{i} = H (σ_{x}^{2} I_{n_{t} / s^{'}} \otimes ({(Φ_{d}^{[i]})}^{T} {(Φ_{d}^{[i]})}^{*})) H^{H} + N_{0} I_{n_{r}}

(7)

where $H = {(H^{[1]})}^{T}$ and $Φ_{d}^{[i]}$ is the i th sub-matrix of Φ with size (N s' - n_p) × s' that is assigned to data symbols.

The i.i.d. assumptions on noise and data make the FIM additive. Specifically, $FI M^{init} (n_{p}, x_{p}, I_{p}) = \sum_{i = 1}^{s^{'}} {FIM}_{i}^{init}$ . The quantity ${FIM}_{i}^{init}$ is obtained as follows:

{FIM}_{i}^{init} = E_{h} \{E_{y} \{\frac{\partial ln p (y | h)}{\partial h^{*}} {(\frac{\partial ln p (y | h)}{\partial h^{*}})}^{H}| h\}\} + σ_{h}^{- 2} I_{n_{t} n_{r}} .

We know that

ln p (y | h) = Constant - ln | R_{i} | - {(y - μ_{i})}^{H} R_{i}^{- 1} (y - μ_{i}) .

(8)

and $\frac{\partial ln | R_{i} |}{\partial h_{l}^{*}} = trace (R_{i}^{- 1} \frac{\partial R_{i}}{\partial h_{l}^{*}})$ . Therefore,

\frac{\partial R_{i}}{\partial h_{l}^{*}} = H (σ_{x}^{2} I_{n_{t} / s^{'}} \otimes ({(Φ_{d}^{[i]})}^{T} {(Φ_{d}^{[i]})}^{*})) Σ_{l}^{T}

(9)

where ∑ _l is an n_r× n_t null matrix with only a single element of 1 at position $(⌊\frac{l - 1}{n_{t}}⌋ + 1, (l - 1 mod n_{t}) + 1)$ . The derivative of the third term in (8) is

\frac{\partial {(y - μ_{i})}^{H} R_{i}^{- 1} (y - μ_{i})}{\partial h_{l}^{*}} = - \frac{\partial μ_{i}^{H}}{\partial h_{l}^{*}} R_{i}^{- 1} (y - μ_{i}) + {(y - μ_{i})}^{H} \frac{\partial R_{i}^{- 1}}{\partial h_{l}^{*}} (y - μ_{i})

where $\frac{\partial R_{i}^{- 1}}{\partial h_{l}^{*}} = - R_{i}^{- 1} \frac{\partial R_{i}}{\partial h_{l}^{*}} R_{i}^{- 1}$ and $\frac{\partial R_{i}}{\partial h_{l}^{*}}$ is given by (9). In addition,

\frac{\partial μ_{i}^{H}}{\partial h_{l}^{*}} = \frac{\partial h^{H}}{\partial h_{l}^{*}} {(T^{[i]})}^{H} = e_{l}^{T} {(T^{[i]})}^{H}

where e_l is an n_tn_r× 1 null vector with a single element 1 at position l.

Using all the above equations and after some manipulations, one has

\begin{aligned} {({FIM}_{i}^{init})}_{l, j} = E_{h} {e_{l}^{T} {(T^{[i]})}^{H} R_{i}^{- 1} T^{[i]} e_{j} \\ + tr (R_{i}^{- 1} H A^{[i]} Σ_{l}^{T} R_{i}^{- 1} Σ_{j} {(A^{[i]})}^{H} H^{H})} + σ_{h}^{- 2} δ (l - j), \end{aligned}

where $A^{[i]} \equiv (σ_{x}^{2} I_{n_{t} / s^{'}} \otimes ({(Φ_{d}^{[i]})}^{T} {(Φ_{d}^{[i]})}^{*}))$ .

Using the fact that tr (ABC) = tr (CAB) and summing over s' quantities ${FIM}_{i}^{init}$ , the total FIM is given by,

FI M^{init} (n_{p}, x_{p}, I_{p}) = E_{h} \{\sum_{i = 1}^{s^{'}} R_{i}^{- 1} \otimes ({(X_{p}^{[i]})}^{H} X_{p}^{[i]}) + R_{i}^{- 1} \otimes Q_{i}\} + s^{'} σ_{h}^{- 2} I_{n_{t} n_{r}}

(10)

where $X_{p}^{[i]} = \sum_{τ \in I_{p}} x_{p}^{[τ]} \otimes Φ_{p}^{[i] [τ]}$ , and $Q_{i} = {(A^{[i]})}^{T} H^{T} R_{i}^{- 1} H^{*} {(A^{[i]})}^{*}$ .

For designing training sequence, (10) can be simplified further using numerical calculation. Using numerical calculation, it is observed that for a Rayleigh-distributed channel, the matrix $E_{h} {R_{i}^{- 1}}$ in (10) is approximately a diagonal matrix^(b), $α I_{n_{r}}$ . This observation means that E_h {Q _i} can be approximated by $n_{r} α σ_{h}^{2} {(A^{[i]})}^{T} {(A^{[i]})}^{*}$ . Then, by performing the expectation operation and using the factorization property of the Kronecker product, (10) can be represented as

\begin{gathered} E_{h} \{\sum_{i = 1}^{s^{'}} α I_{n_{r}} \otimes ({(X_{p}^{[i]})}^{H} X_{p}^{[i]}) + α I_{n_{r}} \otimes n_{r} α σ_{h}^{2} {(A^{[i]})}^{τ} {(A^{[i]})}^{*}\} + s^{'} σ_{h}^{- 2} I_{n_{r}} \otimes I_{n_{t}} = \\ I_{n_{r}} \otimes (\sum_{i = 1}^{s^{'}} α ({(X_{p}^{[i]})}^{H} X_{p}^{[i]}) + n_{r} α^{2} σ_{h}^{2} {(A^{[i]})}^{τ} {(A^{[i]})}^{*} + s^{'} σ_{h}^{- 2} I_{n_{t}}) \end{gathered}

Moreover, using the property of the Kronecker product (A ⊗ B)(C ⊗ D) = (AC) ⊗ (BD), it follows that ${(X_{p}^{[i]})}^{H} X_{p}^{[i]} = \sum_{τ \in I_{p}} \sum_{τ^{'} \in I_{p}} ({(x_{p}^{[τ]})}^{H} x_{p}^{[τ^{'}]}) \otimes ({(Φ_{p}^{[i] [τ]})}^{H} Φ_{p}^{[i] [τ^{'}]})$ . Therefore (10) can be further simplified to

\begin{gathered} FI M^{init} (n_{p}, x_{p}, I_{p}) = I_{n_{r}} \otimes \\ (α \sum_{τ \in I_{p}} \sum_{τ^{'} \in I_{p}} {(x_{p}^{[τ]})}^{H} x_{p}^{[τ^{'}]} \otimes \sum_{i = 1}^{s^{'}} {(Φ_{p}^{[i] [τ]})}^{H} Φ_{p}^{[i] [τ^{'}]} + n_{r} α^{2} σ_{h}^{2} \sum_{i = 1}^{s^{'}} {(A^{[i]})}^{τ} {(A^{[i]})}^{*} + s^{'} σ_{h}^{- 2} I_{n_{t}}) \end{gathered}

(11)

In general, the second term in (11) depends on $I_{p}$ , but not on the training symbols, whereas the first term depends on both x_p and $I_{p}$ . Although both terms depend on n_p, how FIM^init depends on n_p is determined by $I_{p}$ . Therefore, in the following $I_{p}$ and x_p are first optimized. Then n_p is determined for the optimized $I_{p}$ .

For the iteration phase, specifically the last iteration, estimation and detection are implemented using information about the data symbols as well as the pilot symbols. Thus, the parameter of interest in deriving FIM is $θ = {[h^{T} x_{d}]}^{T}$ . Moreover, $μ_{i} = E_{φ} {y^{[i]} | θ} = (I_{n_{r}} \otimes (\sum_{τ} x^{[τ]} \otimes Φ^{[i] [τ]})) h$ and $R_{i} = N_{0} I_{n_{r}}$ . By replacing θ in (6) for h and after some manipulations, the FIM for channel estimation in the iteration phase is given by

\begin{gathered} FI M^{iter} (n_{p}, x_{p}, I_{p}) = N_{0}^{- 1} I_{n_{r}} \otimes \\ (\frac{N s^{'} - n_{p}}{N} σ_{x}^{2} I_{n_{t}} + \sum_{τ \in I_{p}} \sum_{τ^{'} \in I_{p}} {(x_{p}^{[τ]})}^{H} x_{p}^{[τ^{'}]} \otimes \sum_{i = 1}^{s^{'}} {(Φ_{p}^{[i] [τ]})}^{H} Φ_{p}^{[i] [τ^{'}]}) + s^{'} σ_{h}^{- 2} I_{n_{t} n_{r}} . \end{gathered}

(12)

3.2 Optimization of training symbols and their positions

This section is first concerned with minimizing the CRB expression for the initialization phase. The minimization is under a constraint on the power budget for the training sequence. Such a constraint is expressed as

\sum_{τ \in I_{p}} (x_{p}^{[τ]} \otimes Φ_{p}^{[i] [τ]}) ({(x_{p}^{[τ]})}^{H} \otimes {(Φ_{p}^{[i] [τ]})}^{H}) \leq P_{t} .

(13)

Using the properties of the precoder employed in this study, the above constraint can be simplified to $\frac{s^{'}}{N} \sum_{τ^{'} \in I_{p}} x_{p}^{[τ]} {(x_{p}^{[τ]})}^{H} \leq P_{t}$ . The other obvious constraint is that the training symbols should be selected from QAM constellation Ω. Then, the training symbols, x_p's and their positions, specified by $I_{p}$ , are obtained by solving the following constrained optimization problem:

\begin{gathered} min_{x_{p}, I_{p}} CR B^{init} (n_{p}, x_{p}, I_{p}) = min_{x_{p}, I_{p}} tr (FI M^{init} {(n_{p}, x_{p}, I_{p})}^{- 1}) \\ s . t . \{\begin{gathered} \frac{s^{'}}{N} \sum_{τ \in I_{p}} x_{p}^{[τ]} {(x_{p}^{[τ]})}^{H} \leq P_{t} \\ {(x_{p}^{[τ]})}_{j} \in Ω, j = 1, \dots, n_{t} / s^{'}, τ \in I_{p} \end{gathered} \end{gathered}

(14)

where ${(x_{p}^{[τ]})}_{j}$ is the j th pilot symbol in the τ th pilot nucleo and the FIM is given in (11).

To proceed, lets consider two separate cases for problem (14): n_p = 1 and n_p ≥ 2. Case 1 (n_p = 1): In this case the FIM is simplified to

\begin{gathered} I_{n_{r}} \otimes (α ({(x_{p}^{[τ]})}^{H} x_{p}^{[τ]}) \otimes \sum_{i = 1}^{s^{'}} ({(Φ_{p}^{[i] [τ]})}^{H} Φ_{p}^{[i] [τ]}) \\ + n_{r} α^{2} σ_{h}^{2} \sum_{i = 1}^{s^{'}} {(A^{[i]})}^{τ} {(A^{[i]})}^{*} + s^{'} σ_{h}^{- 2} I_{n_{t}}), \end{gathered}

(15)

Because of the shift-invariant property of (15) with respect to τ, τ can be any value in the set {1, 2, . . . , Ns'}. For simplicity, set τ = 1 and the superscript τ is omitted. Using the fact that if X > 0 then tr (X^-1) ≥ ∑ _i 1/(X)_i,_i, the original optimization problem is simplified by minimizing the lower bound of the objective function.

On the other hand, $\sum_{i = 1}^{s^{'}} ({(Φ_{p}^{[i] [τ]})}^{H} Φ_{p}^{[i] [τ]}) = \frac{1}{N} I_{s^{'}}$ , $\sum_{i = 1}^{s^{'}} {(A^{[i]})}^{T} {(A^{[i]})}^{*} = \frac{σ_{x}^{4}}{s^{'}} ({(\frac{N s^{'} - 1}{N})}^{2} + {(\frac{1}{N})}^{2}) I_{n_{t}}$ and the constraint is $\frac{s^{'}}{N} x_{p} x_{p}^{H} = \frac{s^{'}}{N} \sum_{j = 1}^{n_{t} / s^{'}} | {(x_{p})}_{j} |^{2}$ . Therefore, it is not hard to see that the solution of the simplified optimization problem is $| {(x_{p})}_{1} |^{2} = | {(x_{p})}_{2} |^{2} = \dots = | {(x_{p})}_{n_{t} / s^{'}} |^{2} = \frac{N P_{t}}{n_{t}}$ . It means that all pilot symbols should have the same power. For example, one can select corner points of the QAM constellations for the training symbols.

Case 2 (n_p ≥ 2): In this case there are two options for the placements of pilot nucleos. The first option is to group all pilot nucleos in one single cluster and the second option is to spread pilot nucleos. It can be shown that the CRB is invariant with respect to a shift of the placements of pilot nucleos in both options. Therefore, it suffices to select one cluster or one spread placement. However, the precoder has been designed such that the soft-output demodulator works with uncorrelated inputs and putting pilot nucleos between data nucleos may violate this condition. That condition is satisfied when A^[i]has a diagonal form. The implication of this property is to place pilot nucloes equi-spaced in x_k and $I_{p} = {i_{0} + k n; k = 0, \dots, n_{p} - 1}$ , where n = Ns'/n_p and i₀ ∈ {1, . . . , n}, which leads to $A^{[i]} = σ_{x}^{2} \frac{N s^{'} - n_{p}}{N s^{'}} I_{s^{'}}$ . In this selection it is supposed that n_p is divisible by Ns'.

Then the FIM in (11) can be represented by

I_{n_{r}} \otimes (\frac{1}{N} α \sum_{τ \in I_{p}} {(x_{p}^{[τ]})}^{H} x_{p}^{[τ]} \otimes I_{s^{'}} + n_{r} α^{2} σ_{x}^{4} σ_{h}^{2} \frac{1}{s^{'}} {(\frac{N s^{'} - n_{p}}{N})}^{2} I_{n_{t}} + s^{'} σ_{h}^{- 2} I_{n_{t}})

(16)

To obtain the above expression of the objective function, the following property has been used:

{(\sum_{i = 1}^{s^{'}} {(Φ_{p}^{[i] [τ]})}^{H} Φ_{p}^{[i] [τ^{'}]})}_{n, l} = \{\begin{matrix} \frac{1}{N}, & τ = τ^{'}; n = l \\ 0, & otherwise \end{matrix}

(17)

Moreover, the only term that depends on the training symbols is $\sum_{τ \in I_{p}} {(x_{p}^{[τ]})}^{H} x_{p}^{[τ]}$ in (16). Finally, using the constraint on training power, which can be written as

\frac{s^{'}}{N} \sum_{τ \in I_{p}} \sum_{j = 1}^{n t / s^{'}} | {(x_{p}^{[τ]})}_{j} |^{2} \leq P_{t},

(18)

the solution is given by $\sum_{τ} | {(x_{p}^{[τ]})}_{j} |^{2} = \frac{N P_{t}}{n_{t}}; j = 1, \dots, n_{t} / s^{'}$ .

Now consider the training design for the iteration phase. Observe that all the terms in (12) have diagonal forms with equal diagonal elements, except $\sum_{τ \in I_{p}} \sum_{τ^{'} \in I_{p}} {(x_{p}^{[τ]})}^{H} x_{p}^{[τ^{'}]} \otimes \sum_{i = 1}^{s^{'}} {(Φ_{p}^{[i] [τ]})}^{H} Φ_{p}^{[i] [τ^{'}]}$ . This means that the solution of problem (14), but with $FI M^{init} (n_{p}, x_{p}, I_{p})$ replaced by $FI M^{iter} (n_{p}, x_{p}, I_{p})$ , is to choose equal diagonal elements for this term. Therefore, the training sequence designed for the initialization is also optimal for the iteration phase.

In summary, by selecting pilot nucleos such that the sum of the powers of their corresponding pilot symbols with the same indexes are equal, the bound on CRB is minimized. The above condition can give different selections for pilot symbols from a two-dimensional constellation. It should be pointed out, however, that not all selections guarantee that pilot symbols belong to standard QAM constellations.

3.3 Determination of the number of the training symbols

For block-fading channels, the number of pilot nucleos, i.e., n_p, should be as small as possible that meets the power constraint. Using a larger value for n_p wastes bandwidth and does not change the system performance.

The optimum numbers of the training symbols in the initialization phase and iteration phase are not the same. This is explained as follows. At the initialization, by looking at (7), it is observed that the first term in (11) is an increasing function of n_p. However, the second term is a decreasing function of n_p that is multiplied by n_r. Therefore, n_p that minimizes the CRB are determined by the summation of these two terms, which is also determined by the value of n_r. Table 1 gives several examples of optimal n_p for different sets of n_t, n_r and N. For the iteration phase, the expression in (12) means that the CRB in the iteration phase always increases by increasing n_p. Since it is assumed that there is perfect information about the data symbols in the iteration phase, which is not the case in reality, it is most appropriate to select n_p considering only the initialization phase.

Table 1 Optimum n_p for several sets of parameters {n_t, n_r, N}

Full size table

To demonstrate the optimal training design, Figure 3 shows a graphical structure for a simple example, where $P_{t} = 4 σ_{x}^{2}, n_{p} = 2,$ N = 2, n_t = 4 and n_r = 2. In this example, n_s = 1. Then the size of pilot nucleos should be n_t/s' = 2, where s' = N/n_s = 2.

3.4 Channel estimation

For the channel estimation task, one can view the received vector during one block length as $φ^{[t]} = {[{(y^{[1, t]})}^{T}, {(y^{[2, t]})}^{T}, \dots, {(y^{[s^{'}, t]})}^{T}]}^{T}$ .

At the initialization, the mean and covariance matrix of this vector are given in Section 3.1. By treating the data symbols as nuisance parameters, the MMSE channel estimate can be found as [14]

{\hat{h}}^{[t]} = σ_{h}^{2} T^{H} (σ_{h}^{2} T T^{H} + R_{φ^{[t]}}) φ^{[t]}

(19)

where T = [(T¹) ^T , . . . , (T^[s']) ^T ] ^T .

In the subsequent iterations, soft information from the decoder is used to improve the performance of the channel estimator. The channel estimator uses such information to compute new estimates of the channel coefficients using expected values of the data symbols. Therefore, the interleaved ${Λ_{ext}^{(c_{l})}}$ from the decoder are fed back to the estimator to calculate the expected values of the data symbols, i.e., E{x_d}. The entries of E{x_d} are calculated using ${Λ_{ap}^{({\tilde{c}}_{l})}}$ at each iteration by E{(x_d)_i} = ∑_x_∈Ωx · p((x_d) _i = x). The detailed derivations of the probability p((x_d) _i = x) from Λ-values are given in [15] (note that the calculation depends on the mapping rule in Ω).

To verify the results obtained in this section, Section 4 compares numerically the MSE performance of the above channel estimator obtained with the optimal and suboptimal training sequences.

4 Illustrative results

In this section, the frame-error-rate (FER) and MSE performances of BICM-MIMO systems using a MMSE iterative channel estimator are presented. The space-time precoder is the DNA-cyclo precoder that satisfies the properties outlined in Section 2. We consider quadrature phase-shift keying (QPSK) modulation with Gray mapping.

The MSE performance of a BICM-MIMO for a codeword length of 4 × 1024 bits is shown for a 4 × 2 block-fading MIMO channel in Figure 4, when n_c = 2. In this figure, E_b is the energy per information bit. The code used is the 16-state convolutional code with generator polynomials (23, 35) in octal form. In Figure 4, the MSE curves are obtained after 1 and 5 iterations of the iterative channel estimation/demodulation/decoding, with the following cyclotomic rotator [16]:

Φ = \frac{1}{2} [\begin{matrix} 1 & 1 & e^{j 6 π / 15} & - e^{j 6 π / 15} \\ e^{j 2 π / 15} & j e^{j 2 π / 15} & - e^{j 8 π / 15} & j e^{j 8 π / 15} \\ e^{j 4 π / 15} & - e^{j 4 π / 15} & e^{j 10 π / 15} & e^{j 10 π / 15} \\ e^{j 6 π / 15} & - j e^{j 6 π / 15} & - e^{j 12 π / 15} & - j e^{j 12 π / 15} \end{matrix}]

and when the setting for N, n_s, n_p and P_t in Figure 3 are used. The channel is generated randomly and is assumed to be Rayleigh distributed. For the purpose of comparison, the results for MSE performances of the optimal PPSAM, denoted by O-PPSAM and the suboptimal PPSAM, denoted by SO-PPSAM as well as the CRB are shown in Figure 4. For SO-PPSAM, two pilot nucleos are inserted as one cluster in front of data nucleos in a symbol to be precoded. In contrast, in the case of O-PPSAM, the optimized training sequence embeds the pilot nucleos at the first and third positions of Ns' = 4 positions for nucleos. The MSE curves show that the performance of the optimal scheme is better than the sub-optimum scheme for the first iteration (i.e., initialization). In fact the MSE performance of the proposed scheme closely approaches the CRB at high E_b /N₀ after 5 iterations.

In Figure 5, the FER performance of the system with the PPSAM schemes is compared with the conventional PSAM training scheme for the same system parameters as in Figure 4. The top curve is the FER performance of the system with the conventional PSAM training scheme. Note that for a fair comparison, the training scheme in PSAM also meets the training power constraint as trace $(X_{p} X_{p}^{H}) = P_{t}$ , where X_p is the training matrix placed at the beginning of each block of the precoded symbols. The optimal option for PSAM scheme in terms of minimizing the FER as proposed in [11] is to select X_p to have orthogonal columns. The simplest option is $\sqrt{2 \times σ_{x}^{2} / n_{t}} I_{n_{t}} = \sqrt{σ_{x}^{2}} I_{n_{t}}$ , which results in the same power budget as that of the proposed scheme.

As can be seen from Figure 5, the O-PPSAM scheme offers 0.5 dB performance gain as compared to the SO-PPSAM scheme at FER = 10^-2. In comparison with PSAM, the performance of the PSAM scheme is about 0.5-1.5 dB worse than the proposed scheme depending on E_b /N₀ after 5 iterations. This is expected because the pilot information is embedded in the precoded symbols for the proposed scheme and not for the PSAM scheme. In this way, the demodulator can also make use of this information. Note, however, that for the first iteration, since there is no information about data, PSAM works the best. More importantly, while the proposed scheme uses a little bandwidth for training information (for the system considered in this figure the training overhead is n_p× n_t/s' = 4), the training overhead of PSAM scheme is n_t× n_t = 16, which is quadruple. To investigate the effect of the number of transmit antennas, two different systems, one with 2 × 2 channel and one with 4 × 2 MIMO channel, are compared in Figures 6 and 7 in terms of MSE and FER, respectively. For both channels, n_p = 2 and the optimum scheme are used when N = 2, while other system parameters are the same as those used for Figure 4. As can be seen from Figure 6, the MSE of the channel estimation increases when increasing the number of transmit antennas. This is expected because there are more channels to be estimated for the same amount of training information and power as done in the comparison. Nevertheless, the gain in diversity by using more antennas can still improve the overall FER performance as seen in Figure 7.

5 Conclusion

In this article, a new training design for a BICM-MIMO system over a block-fading channel has been proposed. The design inserts pilot symbols into the data symbols before precoding. The new training sequence improves bandwidth efficiency as compared to the conventional PSAM scheme and can also be used by the demodulator in the receiver. In order to design the optimal training symbols and their positions, the CRB on the channel estimations at the initialization and at the iteration phases are minimized. Compared to PSAM, performance improvement achieved with the proposed training is about 1.5 dB at a FER level of 10^-2.

Endnotes

^aIn practice, since n_s is typically an approximated value over some range and since N can be selected, such an assumption can be fulfilled. ^bUsing the matrix inversion lemma, one has $R_{i}^{- 1} = {(H A^{[i]} H^{H} + N_{0} I_{n_{r}})}^{- 1} = N_{0}^{- 1} I_{n_{r}} + N_{0}^{- 2} H A^{[i]} H^{H} {(I_{n_{r}} + N_{0}^{- 1} H A^{[i]} H^{H})}^{- 1}$ . Therefore, for high SNR, $E {R_{i}^{- 1}}$ can be approximated by $N_{0}^{- 1} I_{n_{r}}$ .

References

Caire G, Shamai S: On the achievable throughput of a multiantenna Gaussian broadcast channel. IEEE Trans Inf Theory 2003, 49(7):1691-1706. 10.1109/TIT.2003.813523
Article MathSciNet MATH Google Scholar
Alamouti SM: A simple transmit diversity technique for wireless communications. IEEE J Sel Areas Commun 1998, 16(8):1451-1458. 10.1109/49.730453
Article Google Scholar
Tarokh V, Seshadri N, Calderbank AR: Space-time codes for high data rate wireless communication: performance criterion and code construction. IEEE Trans Inf Theory 1998, 44(2):744-765. 10.1109/18.661517
Article MathSciNet MATH Google Scholar
Boutros J, Viterbo E: Signal space diversity: a power and bandwidth eficient diversity technique for the Rayleigh fading channel. IEEE Trans Inf Theory 1998, 44(4):1453-1467. 10.1109/18.681321
Article MathSciNet MATH Google Scholar
Boutros J, Gresset N, Brunel L: Turbo coding and decoding for multiple antenna channels. In International Symposium on Turbo Codes and Related Topics. Brest, France; 2003:1-8.
Google Scholar
Gresset N, Brunel L, Boutros J: Space-time coding techniques with bit-interleaved coded modulations for MIMO block-fading channels. IEEE Trans Inf Theory 2008, 54(5):2156-2178.
Article MathSciNet MATH Google Scholar
Gresset N, Boutros JJ, Brunel L: Optimal linear precoding for BICM over MIMO channels. In ISIT, 66. Chicago, IL; 2004.
Google Scholar
Coldrey M, Bohlin P: Training-based MIMO systems, Part I: performance comparison. IEEE Trans Signal Process 2007, 55(11):5464-5476.
Article MathSciNet Google Scholar
Nicoli M, Ferrara S, Spagnolini U: Soft-iterative channel estimation: methods and performance analysis. IEEE Trans Signal Process 2007, 55(6):2993-3006.
Article MathSciNet Google Scholar
Dong M, Tong L, Sadler BM: Optimal insertion of pilot symbols for transmissions over time-varying flat fading channels. IEEE Trans Signal Process 2004, 52(5):1403-1418. 10.1109/TSP.2004.826182
Article MathSciNet Google Scholar
Taricco G, Biglieri E: Space-time decoding with imperfect channel estimation. IEEE Trans Wirel Commun 2005, 4(4):1874-1888.
Article Google Scholar
Huang Y, Ritcey JA: Joint iterative channel estimation and decoding for bit-interleaved coded modulation over correlated fading channels. IEEE Trans Wirel Commun 2005, 4(5):2549-2558.
Article MathSciNet Google Scholar
Piantanida P, Sadough SM: On the outage capacity of a practical decoder accounting for channel estimation inaccuracies. IEEE Trans Commun 2009, 57(5):1341-1350.
Article Google Scholar
Kay SM: Fundamentals of Statistical Signal Processing: Estimation Theory. Prentice-Hall PTR, New Jersey; 1993.
MATH Google Scholar
Khalighi MA, Boutros JJ: Semi-blind channel estimation using the EM algorithm in iterative MIMO APP detectors. IEEE Trans Wirel Commun 2006, 5(11):3165-3173.
Article Google Scholar
Kraidy GM, Rossi P: Full-diversity iterative MMSE receivers with space-time precoders over block-fading MIMO channels. In Proc IEEE Int Conf Wireless Commun and Signal Processing. Suzhou; 2010:1-5.
Google Scholar

Download references

Author information

Authors and Affiliations

TRLabs, Saskatoon, Canada
Zohreh Andalibi, Ha H Nguyen & Joseph E Salt
Department of Electrical and Computer Engineering, University of Saskatchewan, 57 Campus Drive, Saskatoon, SK, S7N5A9, Canada
Zohreh Andalibi, Ha H Nguyen & Joseph E Salt

Authors

Zohreh Andalibi
View author publications
You can also search for this author in PubMed Google Scholar
Ha H Nguyen
View author publications
You can also search for this author in PubMed Google Scholar
Joseph E Salt
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zohreh Andalibi.

Additional information

Competing interests

Zohreh Andalibi has received funding from TRLabs of Saskatchewan. This organization partially is financing this manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Andalibi, Z., Nguyen, H.H. & Salt, J.E. Training design for precoded BICM-MIMO systems in block-fading channels. J Wireless Com Network 2012, 80 (2012). https://doi.org/10.1186/1687-1499-2012-80

Download citation

Received: 07 August 2011
Accepted: 04 March 2012
Published: 04 March 2012
DOI: https://doi.org/10.1186/1687-1499-2012-80

Training design for precoded BICM-MIMO systems in block-fading channels

Abstract

1 Introduction

2 System model

3 Training design and channel estimator

3.1 Fisher information matrix

3.2 Optimization of training symbols and their positions

3.3 Determination of the number of the training symbols

3.4 Channel estimation

4 Illustrative results

5 Conclusion

Endnotes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Rights and permissions

About this article

Cite this article

Share this article

Keywords