Research  Open  Published:
Efficient Bayesian compressed sensingbased channel estimation techniques for massive MIMOOFDM systems
EURASIP Journal on Wireless Communications and Networkingvolume 2017, Article number: 38 (2017)
Abstract
Efficient and highly accurate channel state information (CSI) at the base station (BS) is essential to achieve the potential benefits of massive multiple input multiple output (MIMO) systems. However, the achievable accuracy that is attainable is limited in practice due to the problem of pilot contamination. It has recently been shown that compressed sensing (CS) techniques can address the pilot contamination problem. However, CSbased channel estimation requires prior knowledge of channel sparsity to achieve optimum performance, also the conventional CS techniques show poor recovery performance for low signal to noise ratio (SNR). To overcome these shortages, in this paper, an efficient channel estimation approach is proposed for massive MIMO systems using Bayesian compressed sensing (BCS) based on prior knowledge of statistical information regarding channel sparsity. Furthermore, by utilizing the common sparsity feature inherent in the massive MIMO system channel, we extend the proposed Bayesian algorithm to a multitask (MT) version, so the developed MTBCS can obtain better performance results than the single task version. Several computer simulation based experiments are performed to confirm that the proposed methods can reconstruct the original channel coefficient more effectively when compared to the conventional channel estimator in terms of estimation accuracy.
Introduction
The main activity of recent research has identified that the major targets for the next generation of mobile communications, the socalled fifth generation of mobile communications, are to achieve 1000 times the system capacity and 10 times the spectral efficiency, energy efficiency and data rate, and 25 times the average cell throughput [1]. From a highlevel perspective, there is a promising technology that enables reaching higher fifth generation targets, called a massive multiple input multiple output (MIMO). A massive MIMO can be defined as a system using a large number of antennas at the base station; accordingly, a significant beamforming can be achieved and the system capacity can serve a large number of users [2].
When comparing massive MIMO to the conventional MIMO systems, massive MIMO shows several advantageous aspects. Firstly, as the number of the antennas at the base station goes to high values, the simplest coherent combiner and linear precoder turn out to be optimal. Secondly, by exploiting the features of the channel reciprocity, additional antennas increase the network capacity significantly without the need for additional feedback overhead. Thirdly, enabling the power reduction in the uplink and in the downlink can provide the potential for smallcell size shrinking [3].
The major limiting factor in massive MIMO is the availability of accurate, instantaneous channel state information (CSI) at the base station. The CSI is typically acquired by transmitting predefined pilot signals and estimating the channel coefficients from the received signals by applying an appropriate estimation algorithm [1–3].
Channel estimation accuracy depends on having perfect orthogonal pilots allocated to the users; however, to achieve high spectral efficiency, the same carrier frequency should be used in the neighbouring cells by following a specific reuse pattern. This leads to the creation of a spatially correlated intercell interference, known as pilot contamination, which reduces the estimation performance and spectral efficiency [1–3].
The pilot contamination problem was analyzed in [4] and it has shown that the precoding downlink signal of the base station in the serving cell contaminated the received signal of the users roaming in other cells. The authors of [5] analyzed the pilot contamination problems in multicell massive MIMO systems relying on a large antennas at the base station, and demonstrated that the pilot contamination problem persisted in largescale MIMO [6].
However, pilot contamination could be reduced by reducing the number of pilots. A multiuser scenario therefore needs to reduce the number of pilots without affecting the channel impulse response (CIR) quality. Hence, the development of efficient channel estimation techniques for massive MIMO that are computationally less complex and require a fewer number of pilots is a challenge that should be thoroughly addressed [7].
Recently, compressed sensing (CS) techniques have received attention since they can recover the unknown signals from only a small number of measurements, thus using significantly far fewer samples than is possible via the conventional Nyquist rate, which is the signal recovery scheme developed for CS to exploit the sparse nature of signals (that is, only a small number of components in a signal vector are nonzero). CS allows for accurate system parameter estimation with fewer pilots; thereby, addressing the pilot contamination problem and improving the bandwidth efficiency [8, 9]. However, classical CS algorithms require prior knowledge of channel sparsity, which is usually unknown in practical scenarios. In addition, to apply CS algorithms, the sampling matrix must satisfy the restricted isometry property (RIP) for guaranteeing reliable estimators. Such a condition cannot be easily verified because it results computational demanding [10, 11].
To overcome the scarcity of CSbased channel estimation in massive MIMO systems, in this paper, we propose an improved channel estimation scheme based on the theory of Bayesian CS (BCS) that introduces relevance vector machines (RVM) and statistical learning information (SLI) into standard CS; whereby, probabilistic a priori information regarding the channel sparsity can be exploited for more reliable channel recovery to mitigate the pilot contamination problem. Also, the sampling matrix condition is efficiently overcome based on probabilistic formulation [12–14].
Compared with the classical based scheme, our simulation results indicate that the proposed channel estimation methods provide improved estimation accuracy and can address the pilot contamination problem.
Furthermore, by exploiting the common statistical sparsity inherent in different multipath signals, we extend the BCS algorithms to a multitask version for simultaneously reconstructing multiple signals, thus leading to MTBCS [15, 16].
The main contributions of this paper are summarised as follows:

The BCSbased channel estimation algorithm has been proposed for massive MIMO to address the pilot contamination problem.

We have also proposed to enhance the performance of the BCSbased estimator through the principle of thresholding to select the most significant taps to improve the channel estimation accuracy.

In addition, we have exploited the common statistical sparsity distribution to enhance the estimation accuracy performance through the proposed MTBCSbased estimator.

To provide the benchmark for the minimum performance error of the BSC and MTBCS, the Cramer Rao bound (CRB) has been drawn for BCS and it has been derived and drawn for MTBCS.
The remainder of this paper is organized as follows. The multicell massive MIMO system model is presented in Section 2. The BSCbased and the MTBSC based channel estimation details are reviewed in Sections 3 and 4, respectively. In section 5, we provide the CramerRao bound analysis. Section 6 presents the simulation results. Finally, the final conclusions are drawn in Section 7.
The following notation is adopted throughout the paper: $\mathbb {C}$ denotes the complex number field. For ${A} \in \mathbb {C}$, we have A=A _{ R }+j A _{ I }, where $j=\sqrt {1}$, while A _{ R } and A _{ I } are the real and imaginary parts of A, respectively. For any matrix A, A _{ i,j } denotes the (i,j)th element. The transpose, inverse and Hermitian transpose operators are denoted by (.)^{T}, (.)^{−1}, and (.)^{H}, respectively. Upper bold font are used to denote matrices while lower light font are used to denote vectors, lower and upper case represents the time domain and frequency domain, respectively. The I denotes an identity matrix, $diag\{\underline {\mathbf {X}}\}$ denotes the diagonal matrix with the diagonal entries equal to the elements of X and $\hat {X}$ represents the estimate of $\hat {X}$. The Frobenius and spectral norms of a matrix x are denoted by ∥x∥_{ F } and ∥x∥_{2} respectively. E{.} has been employed to denote expectation with regard to all random variables within the brackets. A Gaussian stochastic variable o is the denoted by o∼N(r,q), where r is the mean and q is the variance. Also, a random vector x having the prober complex Gaussian distribution of mean μ and covariance Σ is indicated by x∼C N(x;μ,Σ), where, $ N(\mathbf {x};\boldsymbol {\mu },\boldsymbol {\Sigma })=\frac {1}{det(\pi \boldsymbol {\Sigma })} e^{(\mathbf {x}\boldsymbol {\mu })\boldsymbol {\Sigma }^{1}(\mathbf {x}\boldsymbol {\mu })}$, for simplicity we refer to C N(x;μ,Σ) as x∼C N(μ,Σ).
Massive MIMO system model
We consider a time division duplexing (TDD) multicell massive MIMO system with C cells as shown in Fig. 1. Each cell comprises of M antennas at the BS and N single antenna users. To improve the spectral efficiency, orthogonal frequency division multiplexing (OFDM) is adopted [17, 18].
At the beginning of the transmission, all mobile stations in all cells synchronously transmit OFDM pilot symbols to their serving base stations. Let the OFDM pilot symbol of user n in the cth cell be denoted by $\mathbf {x}^{n}_{c}=[{X}^{n}_{c}[1]\ {X}^{n}_{c}[2] \cdots {X}^{n}_{c}[K]]^{T}$, where K is the number of subcarriers. The OFDM transmission partition the multipath channel between the user and each antenna of the BS into K parallel independent additive white Gaussian noise (AWGN) subchannels in the frequency domain. Each subchannel is associated with a subcarrier. Let ${H}^{n}_{c^{*},c,i}[k]$ denote the kth subchannel coefficient between the nth user in the cth cell and the ith antenna of the BS of cell c ^{∗} in the uplink.
The received signal $\phantom {\dot {i}\!}{Y}_{c^{*},i}$ by the ith antenna element of the cell c ^{∗} at the kth subcarrier can be expressed as
for all 1≤i≤M and 1≤c≤C, where ${V}_{c^{*},i}[k]\phantom {\dot {i}\!}$ is the AWGN at the ith antenna of the BS in cell c ^{∗} at the kth subcarrier. Letting $\phantom {\dot {i}\!}\mathbf {y}_{c^{*},i}=[Y_{c*,i}[1]\cdots Y_{c*,i}[K]]^{T}$, we can write (1) for all subcarriers at the ith antenna of the BS in cell c ^{∗} in the compact form as
where $\mathbf {X}^{n}_{c^{*}}=\text {diag}\{\mathbf {x}^{n}_{c^{*}}\}$, $\mathbf {h}^{n}_{c^{*},c,i}=[{H}^{n}_{c^{*},c,i}[1]\cdots {H}^{n}_{c^{*},c,i}[K]]^{T}$ and $\mathbf {v}_{c^{*},i}=[{V}_{c^{*},i}[1]\cdots {V}_{c^{*},i}[K]]^{T} \sim CN(0,{\sigma }_{v}^{2})$. Let $\mathbf {g}^{n}_{c^{*},c,i}=[g^{n}_{c^{*},c,i}[1] \cdots g^{n}_{c^{*},c,i}[\ell ] \cdots g^{n}_{c^{*},c,i}[L]]^{T}$ collect the samples of the sampled multipath CIR between the nth user of the cth cell and the ith antenna of the BS in cell c ^{∗}, where L is the number of the channel taps and $g^{n}_{c^{*},c,i}[\ell ]$ corresponds to the ℓth channel tap. The K frequency domain channel coefficients, i.e., $\mathbf {h}^{n}_{c^{*},c,i}$, can be calculated as the Kpoint DFT of the CIR samples, i.e., $\mathbf {g}^{n}_{c^{*},c,i} \in \mathbb {C}^{L \times 1}$, e.g., [18].
Hence,
where $\mathbf {F} \in \mathbb {C}^{K \times K}$ represents the discrete Fourier transform (DFT) matrix, whose element in row s and column r is given by $[\frac {1}{\sqrt {K}}e^{{j2 \pi *(Kr)(Ks)}/{K}}]$, 1≤r≤K and 1≤s≤K and $\mathbf {g}^{\prime n}_{c^{*},c,i}\in \mathbb {C}^{K \times 1}$ is $\mathbf {g}^{n}_{c^{*},c,i}\in \mathbb {C}^{L \times 1}$ augmented with K−L zeros. Using (3) in (2), we get
The channel coefficient is modelled as $g^{n}_{c^{*},c,i}[\ell ]=\sqrt {{\phi }_{c^{*},c,i}}[\ell ] {\psi }_{c^{*},c,i}[\ell ]$ for 1≤ℓ≤L, where ${\phi }_{c^{*},c,i}\phantom {\dot {i}\!}$ model the pathloss and shadowing (largescale fading), while the term $\phantom {\dot {i}\!}{\psi }_{c^{*},c,i}$ is assumed to be independent identical distribution (i.i.d) of unknown random variables with C N(0,1) (smallscale fading) [3].
The received signal of (4) can be rewritten as
where the term $\mathbf {z}_{c^{*},i}= \sum _{c=1, c\neq {c^{*}}}^{C}\sum _{n=1}^{N}\mathbf {X}^{n}_{c} \mathbf {F} \mathbf {g}^{\prime n}_{c^{*},c,i}+\mathbf {v}_{c^{*},i}$ in (5) represents the net sum of intercell interference plus the receiver noise, the variance interference ${{\sigma }_{I}^{2}}$ of the intercell interference term caused during pilot transmission can be expressed as
We define the measurement matrix $\mathbf {A}^{n}_{c^{*}}= \mathbf {X}^{n}_{c^{*}}\mathbf {F}$, then (5) can be rewritten as
Based on the physical properties of outdoor electromagnetic propagation, the CIR in wireless communications usually contain a few significant channel taps as can be shown in Fig. 2, i.e., the CIR are sparse; hence, the number of nonzero taps of the channel is much smaller than the channel length, then the CS techniques can be applied for sparse channel estimation. This sparse property can be exploited to reduce the necessary channel parameters to be estimated. In this case, we can address the pilot contamination problem by using fewer pilots than the unknown channel coefficients [7, 19, 20].
BCSbased channel estimation
In common literature, channel estimation methods are classified into parametric and Bayesian approaches. A standard parametric approach is the best linear unbiased estimator, which is often referred to as least squares channel estimation. In contrast to parametric methods, the Bayesian approach treats the desired parameters as random variable with apriori known statistics. Clearly, the a priori probability density function (PDF) of the channel is assumed to be perfectly known at the receiver [21, 22]. Based on the Bayesian channel estimation philosophy, the estimation of unknown parameters is the expectation of the posterior probabilistic distribution that is proportional to the prior probability and the likelihood of the unknown parameters.
In this section, BCSbased channel estimation is presented in the context of massive MIMO channel estimation. Following the general procedure of BCS in [23] and [24], the full posterior distribution over unknown parameters of interest for the problem at hand can be given as
where β represents the hyperparameters that control the sparsity of the channel while σ ^{2} is the net sum of the noise variance and interference variance.
However, the probability of the observation vector, $\phantom {\dot {i}\!}P(\mathbf {y}_{c^{*},i})$, is defined by the following equation
cannot be computed analytically. So, the posterior distribution can be decomposed as
The first term of (10), $P\left (\mathbf {g}^{\prime n}_{c^{*},c^{*},i}\mathbf {y}_{c^{*},i},\boldsymbol {\beta },\mathbf {\sigma }^{2}\right)$, the posterior distribution over the channel coefficient can be expressed based on Bayes’ rule as
The posterior distribution given above is Gaussian distribution with mean $\boldsymbol {\mu }^{n}_{c^{*},c^{*},i}$ and the variance $\boldsymbol {\Sigma }^{n}_{c^{*},c^{*},i}$ are given by
where ζ=d i a g{β _{1},β _{2},…,β _{ K }}.
The estimated channel based on Bayesian estimation approaches to minimize the mean square error (MSE) is the expectation of $P\left (\mathbf {g}^{\prime n}_{c^{*},c^{*},i}\mathbf {y}_{c^{*},i},\boldsymbol {\beta },{\sigma }^{2}\right)$, so the estimated channel can be expressed as
Now, to obtain the estimated channel $\hat {\mathbf {g}}^{\prime n}_{c^{*},c^{*},i}$, we need to find the heyparmarpater σ ^{2} and β that can be obtained from the second term on the righthand side of (10) by applying a type −I I maximum likelihood procedure by operating a RVM.
Based on Bayes’ theorem, the posterior distribution $P\left (\boldsymbol {\beta },{\sigma }^{2}\mathbf {y}_{c^{*},i}\right)$ is proportional $P\left (\mathbf {y}_{c^{*},i}\boldsymbol {\beta },{\sigma }^{2}\right)$ [23], Then, the type −I I maximum likelihood is applied to the log marginal likelihood as follows
Based on the assumption of the RVM approach in [23], the term $P(\mathbf {g}^{\prime n}_{c^{*},c^{*},i}\boldsymbol {\beta })$ follows zeromean Gaussian distribution and can be expressed as
while the Gaussian likelihood function of $\phantom {\dot {i}\!}\mathbf {y}_{c^{*},i}$ according to the probability theory, can be written as
By substituting (16) and (17) into (15), marginal likelihood $P(\mathbf {y}_{c^{*},i}\boldsymbol {\beta },{\sigma }^{2})$ can be expressed as
β can be obtained by differentiating the log marginal likelihood with regard to σ ^{2}, and equating it to zero and it can be given as
While σ ^{2} is obtained by differentiating (19) with regard to β and set these derivations to zero and can be expressed as
The β _{ k } and $\sigma _{k}^{2}$ which maximize the log marginal likelihood are then found iteratively by setting β and σ ^{2} to initial values and then finding values for $\boldsymbol {\mu }^{n}_{c^{*},c^{*},i}$ and $\boldsymbol {\Sigma }^{n}_{c^{*},c^{*},i}$ from (12) and (13). These values are then repeatedly used to calculate a new estimate for β _{ k } and σ ^{2} and until a convergence criteria is met.
Further details of the BCS algorithm can be found in [23, 24]. The procedure for implementation of the proposed technique is summarized in Algorithm 1.
In contrast to the conventional BCSbased estimator, it can also improve the performance of the BCS estimator based on the principle of thresholding, which can be applied to keep the most significant taps. The proposed algorithm applies a threshold approach by retaining the channel taps that have energy above a threshold value of ϱ and set the other taps to zero. The value of ϱ is the energy of the channel impulse response.
Multitask BCS based channel estimation
With a high probability of user movements, the massive MIMO system channel may vary. Consequently, the channels at different time instants/locations are different but share the same common statistical property. As a result, to estimate the current channel, we can exploit the previous compressive vectors in addition to the current compressive vector [15].
Given the system model in II, the received signals of (7) can have the following formulation
for j=1,2,…J where J is the number of the task, $\mathbf {A}^{n}_{c^{*},j}, \mathbf {g}^{\prime n}_{c^{*},c^{*},i,j}\phantom {\dot {i}\!}$ and $\phantom {\dot {i}\!}\mathbf {z}_{c^{*},i,j}$ represents the jth measurement matrices,channel vector and the noise vector, respectively [15].
The main target is to estimate the channel $\mathbf {g}^{\prime n}_{c^{*},c^{*},i,j}$ which can be computed based on Bayesian channel estimation philosophy as the mean of the channel posterior distribution that can be represented as
where ξ _{0} represents the inverse of the net sum of the noise variance and interference variance, while Ξ _{ j } represent the hyperparameters that control the sparsity of the channel. Based on Bayes’ rule the posterior distribution can be given as
the mean and covariance can be given by
where ψ=d i a g(ψ _{0},ψ _{1},ψ _{2},…,ψ _{ K }).
The likelihood function for the parameter $\mathbf {g}^{\prime n}_{c^{*},c^{*},i,j}$ and ξ _{0} based on the received signal $\mathbf {y}_{c^{*},i,j}\phantom {\dot {i}\!}$ and can be expressed as
The channel coefficients $\mathbf {g}^{\prime n}_{c^{*},c^{*},i,j}$ are assumed to be drawn from a product of zeromean Gaussian distributions that are shared by all tasks as follow
To obtain the estimated channel, we need to estimate Ξ _{ j } and ξ _{0} by applying the same procedure in Section 3 to the posterior distribution $P\left (\mathbf {y}_{c^{*},i,j},\boldsymbol {\Xi }_{j},{\xi }_{0}\right)$ that can be inference as [16]
Now, by maximizing the log marginal likelihood and then differentiating with respect to Ξ _{ j } and ξ _{0} and setting to zero yields
Further information on MTBCS can be found in [16].
CRB for BCSbased estimator
In this section, we analyse the CRB for the proposed BCS and MTBCS based channel estimation techniques to provide a benchmark for the minimum estimation error that can be achieved by the proposed algorithm. The CRB on the covariance of any estimator $\hat {\boldsymbol \theta }$ can be given as
where J(θ) is the Fisher information matrix (FIM) corresponding to the observation f, and can be given as
where l(θ,f) is the likelihood function corresponding to the observation f, parameterized by θ [25].
Therefore, given the system model in 2, the closed form expression of the Bayesian CRB (BCRB) for the proposed BCS can be given as
Theorem 1
Given (28), the closed form expression of the BCRB for the proposed MTBCS can be given as
Proof
See Appendix 1. □
Simulation results
To verify the accuracy of our analytical results, the simulation parameters can be summarized as follows: the number of antennas is 100, the number of users is 100, the number of the channel taps is 500, the number of subcarrier K is 4096 and the convergence δ is 10^{−6}. The simulation results are obtained by averaging over 1000 realizations.
To compare the accuracy of the channel estimation techniques, the normalized (MSE) is used for performance evaluation and is computed as
Figure 3 shows the MSE performance comparison among a BCSbased channel estimation of three scenarios under small pilot contamination ($\phantom {\dot {i}\!}{\phi _{c^{*},c^{*},i}}=1$ and $\phantom {\dot {i}\!}{\phi _{c^{*},c,i}}=0.1$), strong pilot contamination (${\phi _{c^{*},c^{*},i}}=1\phantom {\dot {i}\!}$ and $\phantom {\dot {i}\!}{\phi _{c^{*},c,i}}=0.5$), very strong pilot contamination ($\phantom {\dot {i}\!}{\phi _{c^{*},c^{*},i}}=1$ and $\phantom {\dot {i}\!}{\phi _{c^{*},c,i}}=0.9$), regularized least square (RLS)based estimator with no pilot contamination as a benchmark and the BCRB for BCS as a reference line. The results have shown significant improvement in estimation accuracy and addressing the pilot contamination problem for SNR values of −40 to 40 dB for the proposed technique compared with RLS. This is a result of exploiting the prior statistical of channel sparsity. Furthermore, the results still show enhanced estimation performance for high SNR.
Figure 4 shows the (MSE) performance versus SNR with a different value of setting to the number of subcarrier K={100,200 and 300}, so the compression ratio (CR) (i.e., L/K) is to be C R={0.2,0.1and 0.06}, while the experiment is run under small pilot contamination ($\phantom {\dot {i}\!}{\phi _{c^{*},c^{*},i}}=1$ and $\phantom {\dot {i}\!}{\phi _{c^{*},c,i}}=0.1$). The results prove that the estimation accuracy is better performed by decreasing the values of the number of subcarriers, accordingly with increasing CR.
Figure 5 demonstrates the MSE of the BSCbased channel estimation versus SNR for three scenarios of different settings to the number of antennas at the base station M={100,200, and 300}, the system under strong pilot contamination ($\phantom {\dot {i}\!}{\phi _{c^{*},c^{*},i}}=1$ and $\phantom {\dot {i}\!}{\phi _{c^{*},c,i}}=0.7$). The results show that the estimation accuracy of the proposed algorithm is enhanced by increasing the number of antennas. Thus, according to the law of large numbers, more coordinated BS antennas could provide more accurate support estimation.
Figure 6 shows the (MSE) performance versus SNR for BCS with different values for the number of pilots: 1000, 500, 100, 50, and 25, where the number of subcarrier K is 4096. The number of the CIR path is 500 while the experiments run under strong pilot contamination. For cases of the number of the pilots is greater than the number of channel taps (i.e., 1000 and 500), the BCS provides inefficient estimation accuracy, while for the other cases of the number of the pilot of (100, 50, and 25), which is less than 500, the estimation accuracy is enhanced significantly. In addition, there is no significant improvement for the cases of the number of the pilots 100, 50, and 25. In these cases, we can address pilot contamination by employing small values for the number of the pilot, i.e., 25.
Figure 7 compares the (MSE) performance versus SNR among BCS, thresholdBSC, MTBCS, LS, OMP and the Bilinear Approximate Message Passing (BiAMP) [26]. The number of subcarrier K is 1024 and the number of the CIR path is 100. Results show the proposed MTBCS enjoys significant performance improvement over all the other estimators as a result of exploiting the statistical prior information on a large scale. However, this advantage is at the expense of a relatively high complexity of BCS and MTBCS over other estimators as depicted in Table 1, which compares the computational complexity BiAMP [26], BCS [23], OMP [27], LS [28], and the MTBCS [16]. Also, the results showed that the thresholding approach enhances the estimation accuracy of the conventional BCS, as the CIR contains so many taps with no significant energy. By setting the threshold and neglecting these taps, a huge part of the noise and interference from pilot contamination will be eliminated.
Conclusions
To address the pilot contamination problem in massive MIMO systems, we proposed a BCSbased channel estimation algorithm for the multicell multiuser massive MIMO. The simulation results have revealed that the BCSbased channel estimation algorithm has tremendous improvement over conventionalbased channel estimation algorithms and can address the pilot contamination problem. Furthermore, the proposed technique can be enhanced by thresholding the CIR to a certain value and also by exploiting the common sparsity feature inherent in the system channel. In addition, the number of antennas and the compression ratio should be selected wisely to achieve optimum estimation accuracy.
Appendix 1: Proof of Theorem 1
Following Section 5, we can write the FIM as
Based on Bayes’ rule in (32), the FIM can be decomposed into two terms
using (28), the first term can be computed as follow
By applying the same procedure in (38 and 39) to the second term of (37) gives
References
 1
CX Wang, et al, Cellular architecture and key technologies for 5G wireless communication Networks. IEEE Comm. Mag. 52(2), 122–130 (2014).
 2
V Jungnickel, K Manolakis, W Zirwas, B Panzner, V Braun, M Lossow, M Sternad, R Apelfrojd, T Svensson, The role of small cells, coordinated multipoint, and massive MIMO in 5G. IEEE Commun. Mag. 52(5), 44–51 (2014).
 3
H Zhang, S Gao, D Li, H Chen, L Yang, On superimposed pilot for channel estimation in multicell multiuser MIMO uplink: large system analysis. IEEE Trans. Vehicular Technol. 65:, 99 (2015).
 4
J Jose, A Ashikhmin, TL Marzetta, et al, Pilot contamination problem in multicell TDD systems. Proc. IEEE Int.Symp. Inf. Theory, Seoul. 28:, 2184–2188 (2009).
 5
J Jose, A Ashikhmin, TL Marzetta, et al, Pilot contamination and precoding in multicell TDD systems. IEEE Trans. Wireless Commun. 10(8), 2640–2651 (2011).
 6
J Zhang, B Zhang, S Chen, X Mu, M ElHajjar, L Hanzo, Pilot contamination elimination for largescale multipleantenna aided OFDM systems. IEEE J. Selected Topics Signal Process. 8(5), 759–772 (2014).
 7
M Masood, L Afify, TY AlNaffouri, Efficient coordinated recovery of sparse channels in massive MIMO. IEEE Trans. Signal Process. 63(1), 104–118 (2015).
 8
N Sinh. Compressive sensing for multichannel and large scale MIMO networks. PhD. thesis, Dept. of Elect. And Comp. Eng., Concordia Univ. (Montreal, 2014), p. 13.
 9
C Qi, G Yue, L Wu, Y Huang, A Nallanathan, Pilot design schemes for sparse channel estimation in OFDM systems. IEEE Trans. Veh. Technol. 64(4), 1493–1505 (2015).
 10
M Carlin, P Rocca, G Oliveri, F Viani, A Massa, Directionsofarrival estimation through Bayesian compressive sensing strategies. IEEE Trans. Antennas Propagat. 61(7), 3828–3838 (2013).
 11
RG Baraniuk, Compressive sampling. IEEE Signal Process. Mag. 24(4), 118–124 (2007).
 12
Z Fan, Z Lu, Y Han, in 2014 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB). Accurate channel estimation based on Bayesian compressive sensing for nextgeneration wireless broadcasting systems (Beijing, 2014), pp. 1–5.
 13
Z Fan, Z Lu, Y Han, Reliable channel estimation based on Bayesian compressive sensing for TDSOFDM systems, (Macau, 2014).
 14
X Cheng, YL Guan, G Yue, S Li, in Proc. IEEE GLOBECOM. Enhanced Bayesian compressive sensing for ultrawideband channel estimation (California, 2012), pp. 4065–4070.
 15
X Cheng, M Wang, Y Guan, Ultra wideband channel estimation: a Bayesian compressive sensing strategy based on statistical sparsity. IEEE Trans. Vehicular Technol. 64(5), 1819–1832 (2015).
 16
S Ji, D Dunson, L Carin, Multitask compressive sensing. IEEE Trans. Signal Process. 57(1), 92–106 (2009).
 17
X Guo, S Chen, J Zhang, X Mu, L Hanzo, Optimal pilot design for pilot contamination elimination/reduction in largescale multipleantenna aided OFDM systems. IEEE Trans. Wireless Commun. 15(11), 7229–7243 (2016).
 18
MR Nakhai, Multicarrier transmission. IET Signal Process. 2(1), 114 (2008).
 19
W Ding, F Yang, W Dai, J Song, Time–frequency joint sparse channel estimation for MIMOOFDM systems. IEEE Commun. Lett. 19(1), 58–61 (2015).
 20
C Qi, G Yue, L Wu, Y Huang, A Nallanathan, Pilot design schemes for sparse channel estimation in OFDM systems. IEEE Trans. Veh. Technol. 64(4), 1493–1505 (2015).
 21
A Scherb, K Kammeyer, in Proc. IEEE Workshop Smart Antennas. Bayesian channel estimation for doubly correlated MIMO systems (Vienna, 2007).
 22
SM Kay, Fundamentals of Statistical Signal Processing: Estimation Theory (PTR PrenticeHall, Englewood Cliffs, 1993).
 23
S Ji, Y Xue, L Carin, Bayesian compressive sensing. IEEE Trans. Signal Process. 56(6), 2346–2356 (2008).
 24
ME Tipping, Sparse Bayesian learning and the relevance vector machine. J. Mach. Learn. Res. 1:, 211–244 (2001).
 25
AK Jagannatham, BD Rao, Whiteningrotationbased semiblind MIMO channel estimation. IEEE Trans. Signal Process. 54(3), 861–869 (2006).
 26
JT Parker, P Schniter, V Cevher, Bilinear generalized approximate message passing. IEEE Trans. Signal Process. 62(22), 5839–5853 (2014).
 27
JA Tropp, AC Gilbert, Signal recovery from partial information via orthogonal matching pursuit. IEEE Trans. Inf. Theory. 53(12), 4655–4666 (2007).
 28
OO Oyerinde, Reweighted regularised variable step size normalised least mean squarebased iterative channel estimation for multicarrierinterleave division multiple access systems. IET Signal Process. 10(8), 947–954 (2016).
Acknowledgements
This work is supported by the Iraqi Higher Committee of Educational Development (HCED). The authors would like to acknowledge its financial support.
Competing interests
The authors declare that they have no competing interests.
Author information
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Received
Revised
Accepted
Published
DOI
Keywords
 Massive multiple input multiple output (MIMO)
 Channel estimation
 Bayesian compressed sensing (BCS)
 Pilot contamination
 Channel state information (CSI)
 Multitask Bayesian compressed sensing (MTBCS)