Sparse channel estimation of MIMO-OFDM systems with unconstrained smoothed l0-norm-regularized least squares compressed sensing

Ye, Xinrong; Zhu, Wei-Ping; Zhang, Aiqing; Yan, Jun

doi:10.1186/1687-1499-2013-282

Research
Open access
Published: 10 December 2013

Sparse channel estimation of MIMO-OFDM systems with unconstrained smoothed l₀-norm-regularized least squares compressed sensing

Xinrong Ye^1,2,
Wei-Ping Zhu^1,3,
Aiqing Zhang^1,2 &
…
Jun Yan¹

EURASIP Journal on Wireless Communications and Networking volume 2013, Article number: 282 (2013) Cite this article

4209 Accesses
11 Citations
Metrics details

Abstract

This paper investigates the sparse channel estimation issue of multiple-input multiple-output orthogonal frequency division multiplexing (MIMO-OFDM) systems. Beginning with the formulation of least squares (LS) solution to sparse MIMO-OFDM channel estimation, a compressed channel sensing (CCS) framework based on the new smoothed l₀-norm-regularized least squares (l₂-Sl₀) algorithm is proposed. Three methods, namely quasi-Newton, conjugate gradient, and optimization in the null and complement spaces of the measurement matrix, are then proposed to solve the l₂-Sl₀ unconstrained optimization problem. Moreover, the two former are also applied to solve the l₂-Sl₀ channel estimation. A number of computer simulation-based experiments are conducted showing a better reconstruction accuracy of the l₂-Sl₀ algorithm as compared with the smoothed l₀-norm (Sl₀) algorithm in the presence of noise. The proposed CCS approach can save nearly 25% pilot signals to maintain the same mean square error (MSE) and bit error rate (BER) performances as given by the conventional LS method.

1. Introduction

Coherent detection and equalization in multiple input multiple output orthogonal frequency division multiplexing (MIMO-OFDM [1]) systems require channel state information (CSI) at the receiver. In real wireless environments, however, the CSI is not known. Therefore, channel estimation is of crucial importance to MIMO-OFDM systems. In various wireless propagation environments, the channel may consist of only a few dominant propagation (non-zero) paths, even though it has a large propagation delay. Thus, the channel impulse response has a sparse nature [2–4]. However, conventional methods, such as least squares (LS), ignore this prior information about the unknown channel leading to lower spectral efficiency. Recently, sparse channel estimation with an objective of decreasing the training sequence to improve spectral efficiency is becoming a hot research topic.

Previously reported approaches for sparse channel estimation can broadly be categorized into two types, namely the most significant tap (MST) detection and compressed channel sensing (CCS). The MST detection methods [4–6] used a measure to determine if a channel tap was non-zero (‘active’). The disadvantage of this type of methods is that a large number of pilots are needed to render an accurate MST detection and effective channel estimation. The CCS methods are based on the compressed sensing (CS [7]) technology. In [8], the authors formalized the notion of multipath sparsity and proposed the CCS approach. In [9], the orthogonal matching pursuit (OMP) and basis pursuit (BP) algorithms were applied to estimate underwater acoustic channels with large Doppler spread. In [10], the authors proposed an overcomplete basis for doubly selective channels and a metric called localized coherence for selecting training signals to ensure good estimation performance. In [11], a CCS approach for doubly selective channels and a sparsity-enhancing basis expansion with a method for optimizing it were proposed. In [12], two criteria as guiding principles to optimize the pilot pattern for CCS in OFDM systems were proposed. Methods of this type utilize the prior sparse information of the unknown channel and the advantage of CS and thus can improve the spectral efficiency by reducing the number of pilot symbols to be transmitted.

Different from literatures [9–12] that used the existing sparse reconstruction algorithms for CCS in OFDM or single carrier systems, we aim to exploit a novel reconstruction algorithm for CCS in MIMO-OFDM systems. The proposed smoothed l₀-norm-regularized least squares reconstruction algorithm is named l₂-Sl₀ in this paper, which differs from the smoothed l₀-norm reconstruction algorithm (Sl₀[13]) in two aspects. First, Sl₀ is a constrained optimization problem which is solved in [13] using the steepest descent approach. However, l₂-Sl₀ is an unconstrained minimization problem which is to be solved in this paper by using three methods, namely quasi-Newton approach, conjugate gradient approach, and optimization in the null and complement spaces of the measurement matrix. Second, unlike the Sl₀ using a fixed step size to control the decrease of the parameter σ, which determines the degree of smoothness and the approximation accuracy of l₀-norm, l₂-Sl₀ uses a variable one. Simulation results show that the proposed l₂-Sl₀ reconstruction approach outperforms the Sl₀ approach in the presence of noise, and at the cost of slightly more computational time, the CCS approach using l₂-Sl₀ in conjunction with conjugate gradient yields a performance slightly better than that of the CCS method using fast iterative shrinkage-thresholding algorithm (FISTA [14]) or orthogonal matching pursuit (OMP [15]) algorithm.

The remainder of the paper is organized as follows: Section 2 formulates the sparse channel estimation problem of MIMO-OFDM systems based on LS, ideal-LS, and compressed sensing. Section 3 presents three sparse reconstruction algorithms using the proposed l₂-Sl₀ objective function, based on which a new CS-based sparse channel estimation approach is developed. Section 4 comprises a number of experiments showing a better reconstruction accuracy of the l₂-Sl₀-based method as compared with the Sl₀ algorithm, and a higher spectrum efficiency of the sparse channel estimation employing l₂-Sl₀ than that using the LS method. Section 5 concludes this paper by highlighting some of the contributions presented.

2. The sparse channel estimation problem of MIMO-OFDM systems

Consider a similar MIMO-OFDM system as described in [16] with N_T transmit and N_R receive antennas. The MIMO channel can be characterized by an array of L-tap finite impulse response (FIR) filters given by a number of N_R × N_T matrices H(n), (n = 0,1,…,L − 1), whose (i_R,i_T)-th element $h_{i_{R}, i_{T}} (l), (0 \leq l \leq L - 1)$ represents the l- th tap of the channel response between the i_R-th receive antenna and the i_T-th transmit antenna. In the case of uniform sampling, a wireless channel can often be modeled as a sparse channel [17–19], i.e., only a few elements are nonzero in $[h_{i_{R}, i_{T}} (0), h_{i_{R}, i_{T}} (1), \dots, h_{i_{R}, i_{T}} (L - 1)]$ . If the length of the cyclic prefix (CP) is not less than the channel length L, the received pilot signal in i_R-th receiver antenna can be written as

\begin{array}{l} Y_{i_{R}, pilot} = & [diag (X_{1, pilot}) F_{pilot}, \dots, diag (X_{N_{T}, pilot}) F_{pilot}] h_{i_{R}} \\ + n_{i_{R}, pilot}, \end{array}

(1)

where $X_{i_{T}, pilot} = {[X_{i_{T}} (k_{1}), \dots, X_{i_{T}} (k_{p})]}^{T}$ and $Y_{i_{R}, pilot} = {[Y_{i_{R}} (k_{1}), \dots, Y_{i_{R}} (k_{p})]}^{T}$ are the pilot signals in the i_T-th transmit antenna and i_R-th receive antenna, diag(X_1,pilot) is a diagonal matrix with X_l,pilot as the main diagonal elements, $h_{i_{R}} = {[h_{i_{R}, 1}^{T}, \dots, h_{i_{R}, i_{T}}^{T}, \dots, h_{i_{R}, N_{T}}^{T}]}^{T}$ with $h_{i_{R}, i_{T}} = {[h_{i_{R}, i_{T}} (0), \dots, h_{i_{R}, i_{T}} (L - 1)]}^{T}$ , and n_{iR,
pilot} represents the frequency domain noise. Let F_L be a K × L matrix formed by the first L columns of a K × K DFT matrix F, then F_pilot can be formed by taking only the rows of F_L associated with the K_P pilot sub-carriers.

By letting $A = I_{N_{R}} \otimes [diag (X_{1, pilot}) F_{pilot}, \dots, diag (X_{N_{T}, pilot}) F_{pilot}]$ , $Y_{pilot} = {[Y_{1, pilot}^{T}, \dots, Y_{N_{R}, pilot}^{T}]}^{T}$ , $h = {[h_{1}^{T}, \dots, h_{N_{R}}^{T}]}^{T}$ , and $n_{pilot} = {[n_{1, pilot}^{T}, \dots, n_{N_{R}, pilot}^{T}]}^{T}$ , where ⊗ represents Kronecker product, we can get

Y_{pilot} = Ah + n_{pilot},

(2)

which can be solved by the conventional LS method, giving $\hat{h} = A^{†} Y_{pilot}$ , where † represents the pseudoinverse.

Assuming the positions l_d (d = 0,1,…,D-1, and l₀ < l₁ < … < l_D-1) of the MST are correctly estimated, Equation 1 can be rewritten as

\begin{array}{l} Y_{i_{_{R}}, pilot} = & [diag (X_{1, pilot}) W_{pilot}, \dots, diag (X_{N_{T}, pilot}) W_{pilot}] z_{i_{R}} \\ + n_{i_{R}, pilot}, \end{array}

(3)

where $z_{i_{R}} = {[z_{i_{R}, 1}^{T}, \dots, z_{i_{R}, i_{T}}^{T}, \dots, z_{i_{R}, N_{T}}^{T}]}^{T}$ with $z_{i_{R}, i_{T}} = {[z_{i_{R}, i_{T}} (0), \dots, z_{i_{R}, i_{T}} (D - 1)]}^{T}$ , D is the number of nonzero taps, and W_pilot can be formed by taking only the D columns of F_pilot associated with the nonzero tap positions l_d. Let $\tilde{A} = I_{N_{R}} \otimes [diag (X_{1, pilot}) W_{pilot}, \dots, diag (X_{N_{T}, pilot}) W_{pilot}]$ and $z = {[z_{1}^{T}, \dots, z_{N_{R}}^{T}]}^{T}$ . We can obtain

Y_{pilot} = \tilde{A} z + n_{pilot} .

(4)

When n_pilot is white noise and the positions l_d of MST are correctly estimated, we can obtain the estimate of the MST as $\hat{z} = {\tilde{A}}^{†} Y_{pilot}$ . We can also obtain the Cramer-Rao bound of the sparse channel estimate $\hat{h}$ through setting the elements of the positions l_d equal to $\hat{z}$ and other elements equal to zero [4]. The above method to obtain the Cramer-Rao bound of $\hat{h}$ is named as ideal-LS for comparison in this paper.

Note that the dimension of Y_pilot is proportional to the number of pilot subcarriers, and Equation 2 is an underdetermined problem when the dimension of Y_pilot is smaller than that of h. Therefore, the sparse channel estimation in MIMO-OFDM systems can be viewed as solving an underdetermined linear inverse problem with sparsity constraint, i.e.,

\min_{h} | | h | |_{0} s . t . Y_{pilot} = Ah + n_{pilot},

(5)

where || · ||₀ represents the number of nonzero components named as l₀-norm.

3. Sparse channel estimation using l₂-Sl₀reconstruction algorithm

The sparse signal reconstruction problem in CS is to estimate a sparse vector x ∈ ℂ^N from an observed vector y ∈ ℂ^M based on the linear model

y = Φx + w,

(6)

where w ∈ ℂ^M is unknown noise and Φ ∈ ℂ^M × N is a known measurement matrix, typically with M ≪ N. This means that the signal x is ‘sensed’ by a reduced or ‘compressed’ number of measurements. Therefore, the signal reconstruction problem can be described as the following constrained minimization problem,

\min_{x} | | x | |_{0} s . t . | | Φx - y | |_{2} \leq ϵ,

(7)

where the bound ϵ ≥ 0 is used to allow certain error tolerance. In general, ϵ is related to the variance of noise w. Unfortunately, the problem in Equation 7 is a NP-hard combinatorial problem, whose computational complexity grows exponentially with the increase of the signal size and becomes prohibitive even for signals of moderate sizes. Consequently, several techniques have been proposed to tackle this difficult problem. One of the approaches is the convex relaxation, such as BP [20], which replaces ||x||₀ with ||x||₁ to make the problem easier to solve. Another approach, such as matching pursuit (MP [21]) or OMP, is much faster than BP but is a greedy algorithm and does not have provable reconstruction quality at the level of BP method [22]. Different from the above techniques, the smoothed l₀-norm approach [13] is to approximate the discontinuous l₀-norm by a suitable continuous one and then minimize it by an optimization algorithm dedicated to continuous functions. For example, the following continuous function

\begin{array}{l} F_{σ} (x) & = \sum_{i = 1}^{N} f_{σ} (x_{i}) with f_{σ} (x_{i}) \\ = 1 - exp (\frac{- x_{i}^{2}}{2 σ^{2}}), \end{array}

(8)

where σ is a small value, has been proposed to approximate ||x||₀ in [13]. In other words, the minimum l₀-norm solution is then found by minimizing F_σ(x) for a very small value of σ. The parameter σ determines how smooth the function F_σ(x) would be and the accuracy of the approximation. Generally speaking, for larger values of σ, F_σ(x) is smoother and contains less local minima, but the approximation to l₀-norm is worse. On the other hand, for smaller values of σ, a highly nonsmooth F_σ(x) results, which gives a better approximation to l₀-norm but a difficult minimization problem. Consequently, the Sl₀ approach used a ‘decreasing’ sequence for σ.

The Sl₀ reconstruction algorithm is typically 2 to 3 orders of magnitude faster than BP, while resulting in the same or better accuracy [13]. However, in the presence of noise, the accuracy of Sl₀ algorithm needs to be improved. Therefore, in the next section, we will propose several improved Sl₀ reconstruction algorithms.

3.1 The l₂-Sl₀-BFGS reconstruction algorithm for channel estimation

Like l₁-regularized l₂ approach (l₂-l₁[14, 23, 24]) and l_p-regularized l₂ algorithm [25], we use a parameter λ > 0 to balance the twin objectives of minimizing both error and sparsity, giving the following unconstrained optimization problem:

min_{x} F (x) = \frac{1}{2} | | Φx - y | |_{2}^{2} + λ \sum_{i = 1}^{N} [1 - exp (\frac{- x_{i}^{2}}{2 σ^{2}})] .

(9)

The objective function in Equation 9 remains differentiable, and its gradient can be obtained as

\nabla F (x) = Φ^{T} (Φx - y) + g,

(10)

where g = [g₁, g₂, …, g_N]^T with g_i being given by

g_{i} = λ (x_{i} / σ^{2}) e^{- x_{i}^{2} / 2 σ^{2}} .

(11)

For a fixed value of σ, the problem in Equation 9 is now solved using a quasi-Newton algorithm where an approximation of the inverse of the Hessian is obtained using the Broyden-Fletcher-Goldfarb-Shanno (BFGS) update formula [26–28]. As such, the algorithm is referred to as the l₂-Sl₀-BFGS algorithm.

The quadratic (l₂-norm) error term $\frac{1}{2} | | Φx - y | |_{2}^{2}$ in Equation 9 is a convex function, but the convex region of the approximate l₀-norm term $F_{σ} (x) = \sum_{i = 1}^{N} [1 - exp (\frac{- x_{i}^{2}}{2 σ^{2}})]$ depends on parameter is σ. In general, the greater the value of σ, the larger the convex region is. To see this, we compute the gradient of F_σ(x), denoted as $g^{'} = {[g_{1}^{'}, g_{2}^{'}, \dots, g_{N}^{'}]}^{T}$ , whose element is given by

g_{i}^{'} = (x_{i} / σ^{2}) e^{- x_{i}^{2} / 2 σ^{2}} .

(12)

Also, the Hessian of F_σ(x) is a diagonal matrix as given by

\begin{array}{l} \nabla^{2} F_{σ} (x) & = diag \{h_{11}, h_{22}, \dots, h_{NN}\} with h_{ii} \\ = (\frac{1}{σ^{2}} - \frac{x_{i}^{2}}{σ^{4}}) e^{\frac{- x_{i}^{2}}{2 σ^{2}}} \end{array}

(13)

Therefore, F_σ(x) is convex if and only if

| x_{i} | \leq σ, 1 \leq i \leq N .

(14)

Since Equation 14 defines an N-dimensional hypercube whose volume is (2σ)^N, the size of the convex region in the x-space is proportional to σ. On the other hand, in order to better approximate the l₀-norm, σ must be sufficiently small. Consequently, to avoid getting trapped into local minima, we gradually decrease the value of σ, as in the Sl₀ approach. More specifically, for minimum F(x) at σ_i, the initial point is x_*(σ_i-1) obtained in the previous iteration, which is near the global optimal solution.

Since a broadband wireless channel response h usually consists of a few dominant propagation paths and Equation 2 has a similar form as Equation 6, the estimation of h can be viewed as a sparse signal reconstruction in compressed sensing. Thus, we refer to this kind of sparse channel estimation method as CCS. Using Equations 2, 6, and 9, we can obtain the objective function of CCS based on the l₂-Sl₀ reconstruction algorithm,

\begin{array}{l} min_{h} F (h) = & \frac{1}{2} | | Ah - Y_{pilot} | |_{2}^{2} \\ + λ \sum_{i = 1}^{N} [1 - exp (\frac{- h_{i}^{2}}{2 σ^{2}})] . \end{array}

(15)

From the above analysis, the proposed CCS using the l₂-Sl₀-BFGS algorithm can be implemented by the pseudo-code in Algorithm 1.

Algorithm 1 CCS using the l₂-Sl₀-BFGS algorithm

Note that in Algorithm 1, the values of δ_r and r_J are chosen such that 0 < δ_r < 0.1 and 0.5 < r_J < 1. The method of l₂-Sl₀ uses a variable factor r_i = r_i − 1 + δ_r to control the decrease of the parameter σ. Our idea is to use an ‘increasing’ step size corresponding to the decreasing values of σ.

3.2 The l₂-Sl₀-CG reconstruction algorithm for channel estimation

The Hessian matrix of the objective function F(x) in Equation 9 can be computed as

\nabla^{2} F (x) = Φ^{T} Φ + λ \nabla^{2} F_{σ} (x),

(16)

where ∇²F_σ(x) is computed using Equation 13. Since the gradient and Hessian matrix of F(x) can be efficiently evaluated using the closed-form formula in Equations 10 and 16, it is convenient to apply the conjugate gradient method to solve the l₂-Sl₀ optimization problem. The algorithm is thus referred to as the l₂-Sl₀-CG algorithm.

In the k-th iteration of the conjugate gradient technique, x_k is updated as

x_{k + 1} = x_{k} + α_{k} d_{k}, (k = 0, 1, \dots, L - 1) .

(17)

The conjugate direction d_k is computed as

\begin{array}{l} d_{k} = \{\begin{array}{c} \begin{array}{c} - g_{0}, & k = 0 \end{array} \\ \begin{array}{c} - g_{k} + β_{k - 1} d_{k - 1}, & k = 1, 2, \dots, L - 1 \end{array} \end{array} with \\ β_{k - 1} = \frac{g_{k}^{T} g_{k}}{g_{k - 1}^{T} g_{k - 1}}, \end{array}

(18)

and the k-th step size α_k is computed using

α_{k} = \frac{g_{k}^{T} g_{k}}{d_{k}^{T} H_{k} d_{k}},

(19)

Where g_k is the gradient vector computed using Equation 10 and H_k is the Hessian matrix obtained using Equation 16 at x = x_k, respectively. The proposed CCS using l₂-Sl₀- CG algorithm can be implemented by the pseudo-code in Algorithm 2.

Algorithm 2 CCS using l₂-Sl₀- CG algorithm

3.3 Signal reconstruction via optimization in null and complement spaces of Φ

Let Φ = UΣV^T be the singular value decomposition (SVD) of Φ where U_M×M and V_N×N are unitary matrices, and Σ = [S, 0]_M × N with S = diag(s₁, …, s_M) being a diagonal matrix composed by the singular values of Φ. Let V=[V_r,V_n], where the columns of V_n span the null space of Φ and the columns of V_r span the orthogonal complement of the null space. Using V_n and V_r, a signal x of length N can be expressed as

x = V_{r} α + V_{n} β,

(20)

Where α and β are vectors of length M and N − M, respectively. Applying the SVD of Φ, the l₂-norm term in Equation 9 can be simplified as [25]

\frac{1}{2} | | Φx - y | |_{2}^{2} = \frac{1}{2} | | Σα - \tilde{y} | |_{2}^{2} = \frac{1}{2} \sum_{i = 1}^{M} (s_{i} α_{i} - {\tilde{y}}_{i})^{2},

(21)

Where S_i is the i-th singular value of Φ, α_i and ${\tilde{y}}_{i}$ are the i-th component of α and $\tilde{y} = U^{T} y$ , respectively. Using Equations 20 and 21, the optimization problem in Equation 9 can be recast as

\begin{array}{l} min_{α, β} F (α, β) = & \frac{1}{2} \sum_{i = 1}^{M} (s_{i} α_{i} - {\tilde{y}}_{i})^{2} \\ + λ \sum_{j = 1}^{N} [1 - exp (\frac{- {(V_{r, j} α + V_{n, j} β)}^{2}}{2 σ^{2}})], \end{array}

(22)

Where V_r,j and V_n,j are the j-th row of V_r and that of V_n, respectively.

An iterative algorithm to solve the optimization problem in Equation 22 is proposed as follows. In the k-th iteration of the optimization process, signal x^(k) is updated as

x^{(k + 1)} = x^{(k)} + μ^{(k)} d^{(k)},

(23)

where

\begin{array}{l} x^{(k)} = V_{r} α^{(k)} + V_{n} β^{(k)}, \\ d^{(k)} = V_{r} d_{r}^{(k)} + V_{n} d_{n}^{(k)} \end{array}

(24)

and the step size μ^(k) > 0 is determined by the inexact line search method of Roger Fletcher [26]. Assuming that the updating vectors $d_{r}^{(k)}$ and $d_{n}^{(k)}$ are written as

\begin{array}{l} d_{r}^{(k)} = {[d_{r, 1}^{(k)}, d_{r, 2}^{(k)}, \dots, d_{r, M}^{(k)}]}^{T}, \\ d_{n}^{(k)} = {[d_{n, 1}^{(k)}, d_{n, 2}^{(k)}, .., d_{n, N - M}^{(k)}]}^{T} \end{array}

(25)

which are determined by minimizing F(α,β) along each of the directions defined by the column vectors of [V_r,V_n]. Therefore, $d_{r}^{(k)}$ and $d_{n}^{(k)}$ become descent directions of F(α,β), and in the case of real Φ and x, $d_{r, i}^{(k)}$ can be calculated via iteration as

\begin{array}{l} {(d_{r, i}^{(k)})}^{(p)} = & \frac{s_{i} {\tilde{y}}_{i} - s_{i}^{2} α_{i} - \frac{λ}{σ^{2}} \sum_{j = 1}^{N} [exp (\frac{- {(x_{j} + v_{r}^{(j, i)} {(d_{r, i}^{(k)})}^{(p - 1)})}^{2}}{2 σ^{2}}) x_{j} v_{r}^{(j, i)}]}{s_{i}^{2} + \frac{λ}{σ^{2}} \sum_{j = 1}^{N} [exp (\frac{- {(x_{j} + v_{r}^{(j, i)} {(d_{r, i}^{(k)})}^{(p - 1)})}^{2}}{2 σ^{2}}) {(v_{r}^{(j, i)})}^{2}]}, \\ (1 \leq i \leq M), \end{array}

(26)

Where x_j is the j-th component of vector x^(k), $v_{r}^{(j, i)}$ is the (j,i)-th component of matrix V_r, α_i is the i-th component of vector α^(k), and ${(d_{r, i}^{(k)})}^{(p)}$ is the p-th iteration value of $d_{r, i}^{(k)}$ with the initialization value ${(d_{r, i}^{(k)})}^{(0)} = 0$ . Similarly, $d_{n, i}^{(k)}$ in Equation 25 is given by

{(d_{n, i}^{(k)})}^{(q)} = \frac{- \sum_{j = 1}^{N} [exp (\frac{- {(x_{j} + v_{n}^{(j, i)} {(d_{n, i}^{(k)})}^{(q - 1)})}^{2}}{2 σ^{2}}) x_{j} v_{n}^{(j, i)}]}{\sum_{j = 1}^{N} [exp (\frac{- {(x_{j} + v_{n}^{(j, i)} {(d_{n, i}^{(k)})}^{(q - 1)})}^{2}}{2 σ^{2}}) {(v_{n}^{(j, i)})}^{2}]}, (1 \leq i \leq N - M),

(27)

where $v_{n}^{(j, i)}$ is the (j,i)-th component of matrix V_n, and ${(d_{n, i}^{(k)})}^{(q)}$ is the q-th iteration value of $d_{n, i}^{(k)}$ with the initialization value ${(d_{n, i}^{(k)})}^{(0)} = 0$ . The derivation of Equations 26 and 27 is given in the Appendix. In addition, the computation of $d_{r, i}^{(k)}$ using Equation 26 requires vector α^(k) to be computed first as

V_{r}^{T} x^{(k)} = V_{r}^{T} (V_{r} α^{(k)} + V_{n} β^{(k)}) = α^{(k)} .

(28)

The reconstruction algorithm via optimization in null and complement spaces of measurement matrix Φ is referred to hereafter as the l₂-Sl₀- NC algorithm, which can be implemented by the pseudo-code in Algorithm 3.

Algorithm 3 The l₂-Sl₀- NC reconstruction algorithm

4. Simulation results

In this section, the reconstruction performance of the proposed approach (l₂-Sl₀) is evaluated by computer simulations. The spectral efficiency of the CCS using l₂-Sl₀ algorithm is also discussed. More specifically, the l₂-Sl₀ algorithm includes l₂-Sl₀-BFGS, l₂-Sl₀- CG, and l₂-Sl₀- NC in the scenario where y, Φ ,x are real-valued. While in complex-valued scenarios, l₂-Sl₀ only means l₂-Sl₀- BFGS and l₂-Sl₀- CG. Note that Equations 26 and 27 are obtained only in the case where y, Φ, x are real-valued. Namely, the l₂-Sl₀- NC is not suitable to reconstruct complex signals. In all the experiments, the initial value of r is set to 0.5 for both Sl₀ and l₂-Sl₀ algorithms. The values of δ_r and r_J required by the l₂-Sl₀ algorithm are chosen as 0.05 and 0.7, respectively.

In experiments 1 and 2, the signal length and the number of measurements are set to N = 1,000 and M = 400, respectively. A K-sparse source x was artificially created as follows: (1) set x to a zero vector of length N, (2) generate a vector z of length K assuming that each element z_i is a random value drawn from the normal distribution N(0,1) in the real-valued scenario or from N(0,1/2)+jN(0,1/2) in the complex-valued scenario, and (3) randomly select K components of x and set them to z. Each element of the measurement matrix Φ is randomly generated using the normal distribution N(0,1) or N(0,1)+jN(0,1), and each row is normalized to unity. Then, the mixtures are generated using the noisy model y=Φx+w, where w is an additive white Gaussian noise with covariance matrix σ_wΙ_M (I_M stands for the M × M identity matrix). To evaluate the estimation accuracy, the signal-to-noise ratio (SNR) defined as $20 log (| | x | |_{2} / | | x - \hat{x} | |_{2})$ is used, where x and $\hat{x}$ denote the true value and its estimate, respectively.

In experiment 1, we compare the reconstruction performance of l₂-Sl₀ with that of Sl₀. Figures 1 and 2 show the reconstruction SNR at different powers of noise σ_w in real and complex signal scenarios, respectively. For each value of σ_w, the reconstruction SNR is averaged over 100 runs. It is seen that l₂-Sl₀ produces a better SNR than Sl₀, which shows the robustness of l₂-Sl₀ against noise. The objective function of l₂-Sl₀ algorithm in Equation 15 comprises the quadratic error term $\frac{1}{2} | | Ah - Y_{pilot} | |_{2}^{2}$ which permits a small perturbation. Therefore, the l₂-Sl₀ algorithm has a larger capability to reconstruct sparse signal in the presence of noise than Sl₀. For smaller values of σ, F_σ(x) contains more local minima. Therefore, the decrease of σ should not be too quick in the Sl₀ and l₂-Sl₀ algorithms. Moreover, unlike Sl₀ using a fixed step size to control the decrease of the parameter σ, l₂-Sl₀ uses a variable one, and the step size δ_r slightly increasing with the reduction of σ may also help the l₂-Sl₀ to improve its estimation accuracy.

In experiment 2, the l₂-Sl₀ algorithms are tested using N = 1,000, M = 400, and various K sparse signals with σ_ω = 0.01, to examine the algorithms' performance for signals of different sparsity levels. The results obtained are plotted in Figures 3 and 4 with y, Φ, x being real and complex values, respectively. It is observed that the performance of the l₂-Sl₀ algorithm is better than the Sl₀ algorithm in most cases. In real-valued scenario, the l₂-Sl₀- BFGS, l₂-Sl₀- CG, and l₂-Sl₀- NC are comparable for K smaller than 130, but l₂-Sl₀- BFGS performs better for K between 130 and 210. In addition, when K is smaller than 90, the final SNR of the Sl₀ algorithm increases with the rise of sparsity K. This is because the initial estimate ${\hat{x}}_{0}$ is set to the minimum l₂-norm solution of y = Φx + w, which has few zero elements and is far away from the actual signal with many zero elements, and ${\hat{x}}_{0}$ is gradually close to the actual signal with the rise of sparsity K. However, this phenomena is not obvious in l₂-Sl₀ algorithm, since the initial estimate ${\hat{x}}_{0}$ is set to zeros in l₂-Sl₀-CG and l₂-Sl₀-NC, which is near the actual solution for a small value of K. Because of ${\hat{x}}_{0}$ being set to zeros and the thresholds of δ₁ and δ₂ being not small enough for the value sparsity K above 230, the l₂-Sl₀-NC performs the worst among the algorithms tested.

Next, we investigate the accuracy and the spectral efficiency of CCS method using l₂-Sl₀. We consider a MIMO-OFDM system with two transmit and two receive antennas (N_T = N_R = 2). The number of subcarriers is 512, and the QPSK modulation is used. The length of cyclic prefix is 20, which equals the length of wireless channel impulse response. In experiment 3, a Rayleigh channel modeled by a 4-tap MIMO-FIR filter is assumed, in which each tap corresponds to a 2 × 2 random matrix whose elements are i.i.d. complex Gaussian variables with zero mean and unit variance, and the position l_d of MSTs is {2, 6, 13, 19}. The estimation performance is evaluated in terms of the bit error rate (BER) and mean square error (MSE) given by MSE(Δh) = $\frac{\sum_{i = 1}^{M} | | {\hat{h}}_{i} - h_{i} | |_{2}^{2}}{\sum_{i = 1}^{M} | | h_{i} | |_{2}^{2}}$ , where M represents the number of simulations and h_i and ${\hat{h}}_{i}$ represent the actual and the estimated channels from the i-th simulation, respectively.

In experiment 3, we investigate the performance and required computational time of the CCS using l₂-Sl₀-BFGS, l₂-Sl₀-CG, and Sl₀ reconstruction algorithms with 30 pilot signals in each transmit antenna. The simulation consists of 2,000 Monte Carlo runs. Moreover, their performance is compared with those of the CCS using OMP and FISTA. OMP is the most popular one in the type of greedy reconstruction algorithm, and FISTA is the most fast one in the type of l₂-l₁ reconstruction algorithm. Figures 5 and 6 show the MSE and BER plots resulting from the above five CCS methods and the conventional LS method, respectively. As can be seen, the CCS method using l₂-Sl₀-CG only needs 30 pilot signals to obtain the approximate performance of the LS method using 40 pilot signals which implies that the CCS using l₂-Sl₀-CG can save nearly 25% pilot signals. This merit of CCS is due to the prior sparse information of the wireless channel utilized and the efficient reconstruction of sparse signals from a very limited number of measurements allowed by CS. In addition, the CCS applying l₂-Sl₀-CG or l₂-Sl₀-BFGS outperforms the CCS using Sl₀ more obviously than that in experiment 1, which shows that the l₂-Sl₀ has a larger capability to reconstruct sparse signal than Sl₀ in the case when each row of measurement matrix is not normalized to unity. Since the l₂-Sl₀ algorithm is halted after a fixed number of iterations, furthermore, the fixed number does not depend on the sparsity of the signal directly; it is convenient to set the number in practical applications.

We use the CPU time as a measure of complexity. The simulations are performed in MATLAB R2009b environment using an Intel Core i3, 2.53-GHz processor with 2 GB of memory, and under Microsoft Windows XP operating system. The results shown in Figure 7 indicate that the CCS using l₂-Sl₀ requires more computational time than that using other algorithms tested. The l₂-Sl₀ algorithm needs an iterative process to find the optimal solution at each value of σ; therefore, the running time of l₂-Sl₀ is longer than that of others tested. However, at the cost of slightly more computational time, the CCS using l₂-Sl₀-CG yields slightly better performance than the CCS using OMP or FISTA, and the threshold value for termination iteration in the l₂-Sl₀ algorithm is easier to be set. More specifically, it is shown in Algorithm 2 that l₂-Sl₀-CG applies a constant value L to stop the iteration, and the constant value is independent of the sparsity of signal and the power of noise. However, the valid threshold values for termination iteration in the OMP and FISTA algorithms always depend on the power of noise or the sparsity of signal, which are both quite difficult to estimate beforehand in practical applications.

In experiment 4, we investigate the BER of the CCS using 30 pilot signals in each transmit antenna under different channel sparsities, namely for different numbers of MSTs. Moreover, the position l_d of MST is randomly selected in each Monte Carlo simulation. Figure 8 shows the BER plots of CCS using l₂-Sl₀-CG and l₂-Sl₀-BFGS algorithms. The figure shows that a better BER performance can be expected in general for less number of MSTs. In addition, when the length of channel response is 20, the CCS using l₂-Sl₀-CG and that using l₂-Sl₀-BFGS are found to yield acceptable BERs for up to 8 and 4 MSTs, respectively.

5. Conclusion

In this paper, a new approach for sparse channel estimation of MIMO-OFDM systems based on compressed sensing has been presented. The new approach uses a smoothed l₀-norm-regularized least squares (l₂-Sl₀) objective function and solves the optimization problem by three reconstruction algorithms: quasi-Newton, conjugate gradient (CG), and optimization in the null and complement spaces of measurement matrix (ONCS). The better reconstruction accuracy of the l₂-Sl₀ as compared with the Sl₀ algorithm and the higher spectrum efficiency of the CCS using l₂-Sl₀-CG or l₂-Sl₀-BFGS as compared with the conventional LS method have been shown by computer simulations.

Appendix

Derivation of Equations 26 and 27

Suppose that e_i is the i-th column of an M × M identity matrix, and the vectors α and β in Equation 22 are fixed. At point α, a line search along direction e_i is carried out by solving the one-dimensional optimization problem

\begin{array}{l} min_{d_{r, i}} F (α + d_{r, i} e_{i}, β) = \frac{1}{2} | | s (α + d_{r, i} e_{i}) - \tilde{y} | |_{2}^{2} \\ + λ \sum_{j = 1}^{N} [1 - exp (- \frac{{(x_{j} + d_{r, i} v_{r}^{(j, i)})}^{2}}{2 σ^{2}})] \\ = \frac{1}{2} (s_{i} (α_{i} + d_{r, i}) - {\tilde{y}}_{i})^{2} + \frac{1}{2} \sum_{\begin{array}{l} k = 1 \\ k \neq i \end{array}}^{M} {(s_{k} α_{k} - {\tilde{y}}_{k})}^{2} \\ + λ \sum_{j = 1}^{N} [1 - exp (- \frac{{(x_{j} + d_{r, i} v_{r}^{(j, i)})}^{2}}{2 σ^{2}})], \end{array}

(29)

Where x_j is the j-th component of vector x, and $v_{r}^{(j, i)}$ is the (j,i)-th component of matrix V_r. By equating the derivative ∂F(α + d_r,ie_i, β)/∂d_r,i to zero, for real Φ and x, we can obtain

d_{r, i} = \frac{s_{i} {\tilde{y}}_{i} - s_{i}^{2} α_{i} - \frac{λ}{σ^{2}} \sum_{j = 1}^{N} [exp (\frac{- {(x_{j} + v_{r}^{(j, i)} d_{r, i})}^{2}}{2 σ^{2}}) x_{j} v_{r}^{(j, i)}]}{s_{i}^{2} + \frac{λ}{σ^{2}} \sum_{j = 1}^{N} [exp (\frac{- {(x_{j} + v_{r}^{(j, i)} d_{r, i})}^{2}}{2 σ^{2}}) {(v_{r}^{(j, i)})}^{2}]} .

(30)

Note that d_r,i can be solved via iterations with the initial value of d_r,i being set to zero in the right side of (30). In a similar manner, d_n,i can be obtained as

d_{n, i} = \frac{- \sum_{j = 1}^{N} [exp (\frac{- {(x_{j} + v_{n}^{(j, i)} d_{n, i})}^{2}}{2 σ^{2}}) x_{j} v_{n}^{(j, i)}]}{\sum_{j = 1}^{N} [exp (\frac{- {(x_{j} + v_{n}^{(j, i)} d_{n, i})}^{2}}{2 σ^{2}}) {(v_{n}^{(j, i)})}^{2}]},

(31)

where $v_{n}^{(j, i)}$ is the (j,i)-th component of matrix V_n.

References

Stuber G, Barry JR, Mclaughlin SW, Li Y, Ingram MA, Pratt TG: Broadband MIMO-OFDM wireless communications. Proc IEEE 2004, 92(2):241-294.
Article Google Scholar
Raghavendra MR, Giridhar K: Improving channel estimation in OFDM systems for sparse multipath channels. IEEE Signal Process Lett 2005, 12(1):52-55.
Article Google Scholar
Hwang JK, Chung RL, Tsai MF, Deng JH: Highly efficient sparse multipath channel estimator with Chu-sequence preamble for frequency-domain MIMO DFE receiver. IEICE Trans Commun 2007, E90B(8):2103-2110.
Article Google Scholar
Carbonelli C, Vedantam S, Mitra U: Sparse channel estimation with zero-tap detection. IEEE Trans Wireless Commun 2007, 6(5):1743-1763.
Article Google Scholar
Wan F, Zhu W-P, Swamy MNS: Semiblind most significant tap detection for sparse channel estimation of OFDM systems. IEEE Trans Circuits Syst I: Reg Papers 2010, 57(3):703-713.
Article MathSciNet Google Scholar
Wan F, Zhu W-P, Swamy MNS: Semiblind sparse channel estimation for MIMO-OFDM systems. IEEE Trans Vehicular Technol 2011, 60(6):2569-2582.
Article Google Scholar
Donoho D: Compressed sensing. IEEE Trans Inf Theory 2006, 52(4):1289-1306.
Article MathSciNet MATH Google Scholar
Bajwa WU, Haupt J, Sayeed AM, Nowak R: Compressed channel sensing: a new approach to estimating sparse multipath channels. IEEE Trans on Signal Processing 2010, 98(6):1058-1076.
Google Scholar
Berger CR, Zhou S, Preisig JC, Willet P: Sparse channel estimation for multicarrier underwater acoustic communication: from subspace methods to compressed sensing. IEEE Trans Signal Process 2010, 58(3):1708-1721.
Article MathSciNet Google Scholar
Sharp M, Scaglione A: A useful performance metric for compressed channel sensing. IEEE Trans Signal Process 2011, 59(6):2982-2988.
Article MathSciNet Google Scholar
Taubock G, Hlawatsch F, Eiwen D, Rauhut H: Compressive estimation of doubly selective channels in multicarrier systems: leakage effects and sparsity-enhancing processing. IEEE J Sel Top Signal Process 2010, 4(2):255-271.
Article Google Scholar
He X, Song R, Zhu W: Optimal pilot pattern design for compressed sensing-based sparse channel estimation in OFDM systems. Circuits Syst Signal Process 2012, 31: 1379-1395. 10.1007/s00034-011-9378-6
Article MathSciNet Google Scholar
Mohimani H, Babaie-Zadeh M, Jutten C: A fast approach for overcomplete sparse decomposition based on smoothed l₀ norm. IEEE Trans Signal Process 2009, 57(1):289-301.
Article MathSciNet MATH Google Scholar
Beck A, Teboulle M: A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J Imaging Sci 2009, 2(1):183-202. 10.1137/080716542
Article MathSciNet MATH Google Scholar
Tropp JA, Gilbert AC: Signal recovery from random measurements via orthogonal matching pursuit. IEEE Trans Inf Theory 2007, 53(12):4655-4666.
Article MathSciNet MATH Google Scholar
Ye X, Zhu W, Zhang A, Meng Q: Sparse channel estimation in MIMO-OFDM systems based on an improved sparse reconstruction by separable approximation algorithm. Journal of Information and Computational Science 2013, 10(2):609-619.
Google Scholar
Bajwa WU, Sayeed A, Nowak R: Sparse multipath channels: modeling and estimation. In Proceedings of the 13th IEEE Digital Signal Processing Workshop. Marco Island; 4–7 Jan 2009
Gui G, Adachi F: Improved least mean square algorithm with application to adaptive sparse channel estimation. EURASIP J Wirel Commun Netw 2013, 2013: 204. 10.1186/1687-1499-2013-204
Article Google Scholar
Gui G, Mehbodniya A, Adachi F: Bayesian sparse channel estimation and data detection for OFDM communication systems. In 2013 IEEE 78th Vehicular Technology Conference (VTC2013-Fall). Las Vegas; 2–5 Sept 2013
Chen SS, Donoho DL, Saunders MA: Atomic decomposition by basis pursuit. SIAM J Scientif Comput 1999, 20(1):33-61.
Article MathSciNet MATH Google Scholar
Mallat S, Zhang Z: Matching pursuits with time-frequency dictionaries. IEEE Trans Signal Process 1993, 41(12):3397-3415. 10.1109/78.258082
Article MATH Google Scholar
Dai W, Milenkovic O: Subspace pursuit for compressive sensing signal reconstruction. IEEE Trans Inf Theory 2009, 55(5):2230-2249.
Article MathSciNet Google Scholar
Zibulevsky M, Elad M: L1-L2 optimization in signal and image processing. IEEE Signal Process Mag 2010, 27(5):76-88.
Article Google Scholar
Wright SJ, Nowak RD, Figueiredo MAT: Sparse reconstruction by separable approximation. IEEE Trans Signal Process 2009, 57(7):2479-2493.
Article MathSciNet Google Scholar
Pant JK, Lu W-S, Antoniou A: Recovery of sparse signals from noisy measurements using an l_p regularized least-squares algorithm. In IEEE Pacific Rim Conference on communications, computers and signal processing. Canada: University of Victoria; 48-53. 23–26 Aug 2011
Antoniou A, Lu W-S: Practical Optimization: Algorithms and Engineering Applications. New York: Springer; 2007.
MATH Google Scholar
Pant JK, Lu W-S, Antoniou A: Reconstruction of sparse signals by minimizing a re-weighted approximate l₀-norm in the null space of the measurement matrix. In Proceedings of the Midwest Symposium on Circuits and Systems. Seattle; 430-433. 1–4 Aug 2010
Pant JK, Lu W-S, Antoniou A: Unconstrained regularized l_p norm based algorithm for the reconstruction of sparse signals. In Proceedings of the IEEE International Symposium on Circuits and Systems. Brazil; 1740-1743. 15–18 May 2011

Download references

Acknowledgements

We express our thanks to the anonymous reviewers for their valuable comments to improve the quality and the presentation of this paper. This work is supported by the National Natural Science Foundation of China under grant nos. 61372122 and 61302104 and the Basic Research Program of Jiangsu Province under grant no. BK2011756.

Author information

Authors and Affiliations

Institute of Signal Processing and Transmission, Nanjing University of Posts and Telecommunications, Nanjing, 210003, China
Xinrong Ye, Wei-Ping Zhu, Aiqing Zhang & Jun Yan
College of Physics and Electronic Information, Anhui Normal University, Wuhu, 241000, China
Xinrong Ye & Aiqing Zhang
Department of Electrical and Computer Engineering, Concordia University, Montreal, QCH3G1M8, Canada
Wei-Ping Zhu

Authors

Xinrong Ye
View author publications
You can also search for this author in PubMed Google Scholar
Wei-Ping Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Aiqing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jun Yan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xinrong Ye.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Ye, X., Zhu, WP., Zhang, A. et al. Sparse channel estimation of MIMO-OFDM systems with unconstrained smoothed l₀-norm-regularized least squares compressed sensing. J Wireless Com Network 2013, 282 (2013). https://doi.org/10.1186/1687-1499-2013-282

Download citation

Received: 19 July 2013
Accepted: 21 November 2013
Published: 10 December 2013
DOI: https://doi.org/10.1186/1687-1499-2013-282