Channel estimation in OFDM systems operating under high mobility using Wiener filter combined basis expansion model

In this article, we first thoroughly analyze Wiener filter combined least squares based channel estimation (WF-LS) and then illustrate its limitation in high-speed mobile environments. Based on the analysis, we propose to combine WF with basis expansion model (BEM) based channel estimation to deal with channel estimation in various mobile environments, especially in high-speed cases. The expression for Wiener filter combined BEM based channel estimation (WF-BEM) is derived and the result explicitly considers the effect of intercarrier interference (ICI) that occurs in orthogonal frequency division multiplexing (OFDM) systems when operating under high mobility. The simulation results demonstrate that our proposed WF-BEM is better than WF-LS in time-varying channels, and the performance improvement is significant especially in fast time-varying channels.


Introduction
Due to its high data rate transmission capability and its robustness to multipath delay spread, orthogonal frequency division multiplexing (OFDM) has been adopted in most parts of modern wireless communication sytstems, such as wireless local area networks (WLAN) [1], digital audio and video broadcasting [2], and so as OFDM being standardized for the future wireless communication systems, such as wireless metropolitan area networks (WiMAX) [3] and 3GPP long term evolution (LTE) [4]. On the other hand, with high-speed railway construction worldwide, the moving speed of high-speed train has been reported to be able to operate as high as more than 400 km/h, or, even higher on-board highspeed vehicles such as aircraft. Consequently, mobility support is widely regarded as one of the key features in the above-mentioned OFDM systems. Meanwhile, for coherent detection in a wireless communication system, channel state information is indispensable. Therefore, to ensure effective communication of OFDM systems operating in high speed scenarios or alternatively very fast time-varying wireless channels, channel estimation method applicable to such high-speed environments has remained largely an open issue.
Although blind channel estimation methods can save bandwidth by avoiding the trans-mission of any pilot or training symbols, they must exploit the statistic of the received signals, either explicitly or implicitly, which requires the wireless channel is time invariant during several OFDM symbols (e.g., cyclic prefix (CP) [5] or virtual carriers [6] based blind channel estimation method, etc.). Therefore, blind channel estimation methods are only capable of tracking slow channel variations. In fast fading channels, they will suffer severe performance degradation. For channel estimation in wireless communication systems operating under high mobility, pilot symbols (i.e., symbols with known pilot tones) must be inserted periodically into the transmission frame to track fast channel variations (also called pilotassisted transmission). Channel estimation in fading channels with very high mobilities usually consists of two steps. As shown in Figure 1, channel estimation at pilot symbols is the first step in completing channel estimation in communication systems. The next step, we need to perform interpolation between the pilot symbols to obtain the channel estimate of data symbols (i.e., symbols only with unknown data subcarriers), which are transmitted between these pilot symbols. On one hand, traditional training symbols or pilots aided channel estimation methods adopted to obtain the channel estimate of pilot symbols assume that the wireless channel is static during an OFDM symbol period, which means that the channel frequency response matrix is a diagonal matrix [7]. This assumption holds when the users are in indoor environments or is approximately valid in slow speed moving environments. Channel estimation problems in this situation have been well studied and solved in the literature. For example, least squares (LS) or minimum mean square error (MMSE) based channel estimation methods [8] can be used to estimate the diagonal of the channel frequency response matrix, or we can further proceed to eliminate noise in transformdomain [9]. On the other hand, Considering that Wiener filter (WF) is the best interpolation method at the time dimension in terms of minimizing MSE [10], Dong et al. [11] and Zheng and Xiao [12] derive the expression for Wiener filter combined LS based channel estimation (hereafter called WF-LS), combining LS based channel estimation with WF. The result is based on the assumption that the channel coefficients within an OFDM symbol period are constants and thus the effect of intercarrier interference (ICI) is completely neglected. Although the proposed WF-LS method can deal with slow to moderate fading channels, however, in time-varying channels, especially in fast time-varying cases, the latter assumption may be often violated in practice and the channel coefficients can vary in an OFDM symbol period, and thus ICI occurs, where the off-diagonal elements in the channel frequency response matrix represent ICI [7]. Therefore, the performance of WF-LS will degrade significantly or even lose its efficiency in this situation.
Basis expansion model (BEM) based channel estimation method can accurately estimate both slow and fast time-varying coefficients of the wireless channel in an OFDM symbol period using the coefficients of BEM that are far less than the length of wireless channel [13,14]. Combining BEM based channel estimation with WF, in this article we derive the expression for Wiener filter combined BEM based channel estimation (hereafter called WF-BEM), which explicitly considers the effect of ICI. The simulation results show that WF-BEM is better than WF-LS in time-varying channels, especially in fast time-varying cases.
The organization of this article is as follows. Section 2 describes the OFDM system model. Section 3 gives further analysis of WF-LS and then illustrates its t  e  p  1  :  C  h  a  n  n  e  l  e  s  t  i  m  a  t  i  o  n  a  t  p  i  l  o  t  s  y  m  b  o  l   S  t  e  p  2  :  P  e  r  f  o  r  m  i  n  t  e  r  p  o  l  a  t  i  o  n  t  o  o  b  t  a  i  n  t  h  e  c  h  a  n  n  e  l  e  s  t  i  m  a  t  e  o  f  d  a  t  a  s  y  m  b o l limitation in time-varying channels. Section 4 derives the expression for WF-BEM based channel estimation. Section 5 gives some simulation results that demonstrate the effectiveness of the proposed algorithm. Finally, conclusions are drawn in the final Section 6. Notation: Matrices and vectors are denoted by boldface letters. A(m, n) is the (m, n)-th entry of the matrix A. x(m) denotes the m-th element of the vector x. A hat over a variable indicates an estimate of the variable (e.g.,Ĥ ). E{·} denotes the expected-value operator. Superscripts [·] T , [·] -1 , [·] H , and [·]* denote the transpose, the matrix inversion, the Hermitian and the complex conjugate operations, respectively. I is the identity matrix.

System description
T be a column vector that represents the estimate of channel frequency response at the k-th subcarrier of pilot symbols in an OFDM system, k = 0, 1, ..., N p -1 represents the pilot subcarrier index, which is evenly located in pilot symbols, N p is the number of pilot subcarriers in one pilot symbol, p i , i = 0, 1, ..., M -1 represents the position of the p i -th pilot symbol at the time dimension, M represents the pilot symbol numbers (see Figure 1 for illus- ] T , and the pilot symbol insertion rate r at the time dimension satisfies r ≥ 2f D T, f D T is the normalized Doppler frequency. Zadoff-Chu sequence [15] is adopted at the p i -th pilot sym- ] T in this article, so |X p i (k)| 2 = 1. The channel estimateĤ d (k) at the data symbol position d can be estimated by interpolatingP k .
Let the interpolation row vector be C d,k , andĤ d (k) can be obtained byĤ d (k) = C d,kPk . By minimizing the esti- the optimal interpolation vector, i.e., the WF can be obtained as [10] Dong et al. [11] and Zheng and Xiao [12] uses LS based channel estimation to obtainP , which completely neglects the effect of ICI. Therefore, the expression for WF-LS is expressed asĤ In this article, Channel and noise are assumed to be independent from each other and noise is complex Gaussian distribution with zero mean and s 2 variance.

Further analysis of WF-LS based channel estimation
Least squares based channel estimate is given bŷ where (with the subscript m -O representing WF-LS with orders m) can be expressed aŝ is the auto-correlation function of the channel frequency response at the k-th subcarrier of different OFDM symbols [16], where J 0 (·) is the zero-th order Bessel function of the first kind and Γ(·) is the gamma function. Meanwhile, the coefficients of linear interpolation is given bŷ In the following, we will first show that linear interpolation is in fact a special case of WF-LS with orders 2, or put it another way, WF-LS with orders 2 will "degrade" into linear interpolation asymptotically. Then, we will analyze the asymptotical relationships among WF-LS with different orders. Finally we will point out the limitation of WF-LS. Let l = 1 in (5) and substitute the resulting (5) into (4a) and (4b), after some manipulations we can obtain the expanded expression for Notice that (7a) and (7b) hold true when the wireless channel is static or in very slow moving environments. When SNR ∞, that is, when s 2 0 (i.e., in the absence of noise), and inserting (6a) and (6b) into (7a) and (7b), respectively, then the relationship between the coefficients of WF-LS with orders 2 and that of linear interpolation can be obtained aŝ can be established, as shown in Figure 2. It is worth noting that a totally different looking Figure 2 will be obtained if we choose another set of p 0 , p 1 and p, but the asymptotical behavior as f D T 0 will definitely be the same, as proved and explained in (8). Therefore, it can be inferred that WF-LS with orders 2 implicitly exists certain linearity which results in its failure in high speed communication environments, in which case the approximate linearity of the channel frequency responseĤ p 0 (k) andĤ p 1 (k) between p 0 and p 1 disappears because of fast fading channels. On the other hand, because the channel estimation at pilot symbols is fixed to LS based channel estimation, if we want to improve the performance of WF-LS, one has to resort to WF-LS with higher orders (i.e., choosing M >2 in (2)) and hopefully that by adopting WF-LS with higher orders we can improve the estimation accuracy through using more pilot symbols, i.e., the MSE of channel estimation may be decreased by collecting more pilot symbols [17]. However, without properly adopting WF-LS with higher orders, there will be no performance improvement. In what follows we will investigate the asymptotical behavior of WF-LS with higher orders and show that WF-LS with higher orders can indeed be "degraded" into WF-LS with orders 2 if not carefully designed.
Substituting (3) and (5) into (2) and letting M = 3 we can obtainĈ (2). Then, substituting (4a) and (4b) into the resultingĈ (1), respectively, after a few tedious, but otherwise straightforward, algebraic manipulations, we can obtain the relationships between It is observed from (10) that as s 2 0 and by fixing p, p 0 and p 1 (it is assumed here that we already have or know p, p 0 and p 1 for WF-LS with orders 2), what really matters are f D T and the distance between p 1 and p 2 . By using (10) and putting different values of f D T and the distance between p 1 and p 2 , we will obtain some interesting results as shown in Figure 3. It can be seen from Figure 3 that as p 2 increases,Ĉ converge, and so will beĈ (2) will converge to zero as p 2 increases. On the other hand, this convergence will change with f D T , as can also be observed from Figure 3. In this case, WF-LS with orders 3 will "degrade" into WF-LS with orders 2, which means that even more pilot symbols are adopted to estimateĤ WF−LS d (k) , there will be no performance gains at all. Also note here that as in Figure 2, if we choose another set of p 0 , p 1 , p 2 and p we will obtain a different looking Figure 3, but the asymptotical convergence behavior will be the same. This asymptotical convergence behavior can be explained by investigating (10), which shows that B(Δ p ) will become small and close to zero as f D T (p 2p i ), i = 0, 1 grows large and tends to infinity. In conclusion we have the following results in the absence of noise When noise is present, the convergence behavior can be obtained through similar analysis by investigating (10) and the result is shown in Figure 4. This idea of analyzing the relationship between WF-LS with orders 3 and WF-LS with orders 2 can be further extended to analyze the relationships among WF-LS with different orders, which can be inferred directly from the above analysis that when p m is sufficiently large (for m >2) Therefore we conclude that we should carefully avoid these "convergence points" when we are trying to improve the performance of WF-LS with low orders by using WF-LS with higher orders, otherwise this goodwill will prove to be in vain. Of course, we can choose p m arbitrarily close to p m-1 , but this will cause transmitting more pilot symbols, which will significantly reduce the spectral efficiency. However, we can not choose p m arbitrarily large to increase the spectral efficiency, as have been proved that this will result in no performance improvement. Meanwhile the choose of p m is related to f D T. For example, we observe that when SNR = 30 dB and f D T = 0.01, in order forĈ (1) and 0, respectively, p 2 should 0 0 .
(1) Figure 2 The relationship between the coefficients of Wiener filter with orders 2 and that of linear interpolation. In this figure we let s 2 0, p 0 = 1, p 1 = 4 and p = 2.  (1) and 0, respectively, p 2 should be bigger than 9. The above observations are also approximately true for other SNR values. Therefore, from the above observations and analysis we may conclude that the combined parameter f D T (p m -p m-1 ) should be smaller than about 0.5 to avoid the degradation of WF with higher orders. This result is coincide with the sampling theorem in the time domain [18], which is, f D T (p m -p m-1 ) <0.5. Till this point we can say that we have just proved the sampling theorem in the time domain, from a new perspective, i.e., from the coefficients of WF-LS point of view. The optimal value of f D T (p m -p m-1 ) that will simultaneously increase the performance while maintaining the spectral efficiency is dependent on the specific problem at hand and related to the pilot symbol and system design problems, which is beyond the scope of this article, but will be an interesting and a meaningful topic for the future work.
Although the performance of WF-LS can be improved by adopting more pilot symbols provided that the above conditions are satisfied, WF-LS has an intrinsic weakness that will lead to its decreased performance in high speed mobile environments. As will be seen in Section 5, the performance of WF-LS is acceptable in slow to moderate mobile environments, but decreases significantly in high speed environments. This degradation of WF-LS is due mainly to the fatal weakness of the underlining assumption of LS based channel estimation that the channel coefficients within one OFDM symbol period are constants. This assumption will generally not hold true in time-varying channels, especially in fast time-varying cases. In light of the above analysis and considering that BEM based channel estimation method [13,14] can accurately estimate the time-varying channel coefficients of an OFDM symbol, we propose to combine BEM based channel estimation with WF to deal with channel estimation in time-varying channels.

The expression for the proposed WF-BEM based channel estimation
Basis expansion model based channel impulse response of the p i -th pilot symbol can be expressed as [13,14] where h p i (n, l) represents the channel impulse response of the l-th path at time n within the p i -th pilot symbol period and is assumed to be modeled as a wide sense stationary (WSS) complex Gaussian process with a statistically independent path, n = 0, 1, ..., N -1, N is the symbol length (N p ≤ N), l = 0, 1, ..., L -1, L is the length of wireless channel, h q,l (p i ) is the coefficient of BEM, b n,q is the base that captures channel time variations, and Q is the number of BEM bases. BEM is motivated by the observation that the temporal (n) variation of h(n, l) is usually rather smooth due to the channel's limited Doppler spread, and therefore, {b n,q } Q q=0 can be chosen as a small set (i.e., Q ≪ N) of smooth functions.
For the pilot subcarriers in the p i -th pilot symbol, an equation consisting of h q,l (p i ) can be expressed as [14] where ] T is the received signal at the pilot subcarriers of the p i -th pilot symbol in the frequency domain, represents the coefficients of BEM to be estimated, P consists of the base, the pilot subcarriers and the Fourier transform matrix at the positions of according pilot subcarriers [14,Eq. (14)], and Z p i is the interference from data subcarriers and noise. In this article, the regularized LS method [19] is adopted to estimate η p i aŝ (0), 0.01 (1) (1), 0.1 (1) (1), 0.01 (2), 0.1 (2), 0.01 where small value of a is to ensure the matrix P H P + aI a matrix with full rank. Let us define a matrix A ≜ (P H P + aI) -1 P H in (15) and A can be further divided into Q + 1 sub-matrices, each of which is of size L × N p . More specifically, A can be expressed as Combing (15) and (16),ηq,l(pi) can be expressed aŝ As shown in [20], in various mobile environments, the received signal Y p i (k) at the k-th subcarrier is expressed as where W p i (k) is an additive complex Gaussian noise in the frequency domain, represents the channel frequency response corresponding to the desired subcarrier k and is the ICI coefficient, which is the the off-diagonal element in the channel frequency response matrix. Since the ICI term consists of a large number of random interferences and is based on the assumption that h p i (n, l) is WSS complex Gaussian process, we can model the term as additive white Gaussian noise according to the central limit theorem [21].
To derive the expression for WF-BEM that considers the effect of ICI, the following steps are proposed: • Step (1) Substitute (17) into (13). We can get the estimate of BEM based channel impulse response aŝ • Step (2) . ð23Þ • Step (4) Use (20) to derive the correlation function Through the proposed Steps (1)-(4) and by using (22) and (24), E{Ĥ BEM p i (k)(Ĥ BEM p j (k)) * } can be derived as follows: if and if p i = p j , where we have used the fact that and its corresponding function in the frequency domain, σ 2 l is the power of the l-th path and without loss of generality we assume L−1 l=0 σ 2 l = 1, T s is the sampling period. It is noted that based on the assumption that h p i (n, l) within a single OFDM symbol period are constants, the effect of ICI is neglected (i.e., in (20) H ICI p i (k, z) ≡ 0), the authors Dong et al. [11] and Zheng and Xiao [12] only consider the noise variance s 2 . However, we consider the variation of h p i (n, l) within a single OFDM symbol period in time-varying channels and our derived result explicitly considers both the noise variance and the effect of ICI, which can provide us with details about channel variations in various mobile environments. • Step (5) Substitute (23), (25) and (26) into (2). Then, the interpolation vector of WF-BEM can be obtained aŝ Comparing (27)  Step (5). Finally, the expression for WF-BEM is given bŷ The proposed algorithm is summarized as follows. First, we use expression (22) to obtain the channel estimate of pilot symbols (corresponding to the Step 1 in Figure 1); then, we use expression (28) to obtain the channel estimate of data symbols (corresponding to the Step 2 in Figure 1).
In this section the interpolation vector of WF-BEM at the frequency dimension is not considered for the reason that by using BEM based channel estimation, the channel impulse response at a pilot symbol can be obtained by expression (21). Then, through transferring the channel impulse response into frequency domain by expression (22), the estimate of channel frequency response at required subcarriers of the pilot symbol can be obtained.

Simulation results and discussions
To evaluate the performance of the proposed WF-BEM algorithm, extensive computer simulations are carried out. LTE uplink system [15,22] is considered in the simulation. The simulation parameters are listed in Table 1 and the generalized complex exponential BEM (GCE-BEM) [14] is adopted.

MSE comparison
The MSE curve of BEM based channel estimation with LS based channel estimation only at pilot symbols is shown in Figure 5. From Figure 5 it can be seen that in time-varying channels, BEM based channel estimation performs consistently better than LS based channel estimation. The estimation accuracy of BEM based channel estimation is much better than that of LS based channel estimation especially in high-speed mobile environment, as can be seen from Figure 5. This result proves our analysis in Section 3 that in time-varying channels, as the assumption that the channel coefficients within one OFDM symbol period are constants no longer holds true, BEM based channel estimation will outperform LS based channel estimation. After interpolation along the time dimension by using WF, the MSE curve of WF-BEM with WF-LS is shown in Figure 6. It is seen from Figure 6 that due to the precondition that BEM based channel estimation is better than LS based channel estimation in time-varying channels, the estimation accuracy of the proposed WF-BEM is better than WF-LS.
As stated in Section 1, the channel estimation accuracy at data symbols is determined by both the channel estimation accuracy at pilot symbols and the accuracy of the interpolation method. WF is the best interpolation method at the time dimension to estimate channel at data symbols in terms of minimizing MSE. The optimality of WF is independent of the channel estimation method adopted at pilot symbols. Therefore we can   infer that the reason that WF-LS underperforms is because the channel estimation accuracy of LS based channel estimation is worse than that of BEM based channel estimation at pilot symbols in time-varying channels, which have been proved by Figure 5. This is also the main reason for our motivation to combine BEM based channel estimation with WF and derive the expression for WF-BEM based channel estimation to deal with channel estimation in time-varying channels.
Besides BEM based channel estimation can more accurately estimate fast time-varying channels than LS based channel estimation does, another reason that a channel estimator with higher accuracy results in improved performance in time-varying channels is illustrated as follows. In OFDM systems, Doppler effects and instabilities of the transmitter and receiver carrier frequency oscillators will cause a loss of orthogonality between the subcarriers, resulting in ICI. The carrier frequency offset caused by Doppler effects in time-varying channels is a fraction of the subcarrier spacing (i.e., in most practical cases the normalized Doppler frequency is less than 0.2). If we want to compensate this effect, we are dealing with the problem of fine-frequency adjustment. One effect of carrier frequency offset is the detrimental effect of a rotation of the subcarriers [23,24]. This effect will be recognized by a channel estimator, which does not distinguish between phase offsets caused by the channel and those caused by a frequency offset. Thus, a channel equalizer appears also to have fine frequency synchronization capabilities. Therefore it can be inferred that a channel estimator with higher accuracy can more accurately compensate this rotation caused by Doppler effects and thus will perform better in time-varying channels.  Figure 7 illustrates that since the wireless channel can be regarded as nearly invariant during one OFDM symbol period at 30 km/h and also because the Doppler frequency is small in this situation, the performance of WF-LS is close to that of WF-BEM, there is less than 1 dB SNR gain for WF-BEM in this situation. The actual values of these estimates (i.e., ideal channel) are also given to facilitate comparison with the estimates. We can see that the proposed WF-BEM is very close to the ideal channel case, there is only less than 0.5 dB loss compared to the ideal one. Figure 8 illustrates that since the wireless channel changes not that fast enough during an OFDM symbol period at 120 km/h compared with that at 350 and 480 km/h, the performance gap between WF-BEM and WF-LS is still not that obvious, there is only about 1 dB SNR gain for WF-BEM compared with WF-LS in this situation (for both 16 and 64QAM). And still, there is only less than 1 dB gap between the proposed WF-BEM and the ideal channel case.

BLER comparison
However, in high-speed environments (350 and 480 km/h), the wireless channel changes quickly and drastically. Hence, in these cases the wireless channel can not be regarded as nearly static and even slow changing during one OFDM symbol period anymore. Therefore, at mobile speed of 350 km/h, compared with WF-LS, about 2.5 and 3.5 dB SNR gain can be obtained in 16 and 64QAM modulation mode respectively through WF-BEM, as can be seen from Figure 9. It is observed from Figure 9 that even at 350 km/h, compared to the ideal channel case, there is only about 1.5 dB loss of the proposed WF-BEM in 16QAM modulation mode and about less than 3 dB loss in 64QAM mode. From Figure  9 we can infer that the use of efficient multilevel modulations, which make use of multiple signal phase and amplitude levels to carry multiple bits per symbol, may demand more precise channel estimation to demodulate the received signal, especially in high-speed environments.
Finally, the performance of WF-BEM with WF-LS at extremely high speed 480 km/h is presented in Figure  10. As can be observed from Figure 10, the performance of both WF-BEM and WF-LS rapidly degrade for an increasing speed. However, our proposed WF-BEM is still much better than WF-LS in 16QAM modulation mode, there is more than 3 dB SNR gain. There are two reasons for this degradation. Firstly, the wireless channel is changing so fast that by merely interpolating to obtain the channel estimate of data symbols becomes difficult  and inaccurate. In this case we may need to insert pilot tones in data symbols (e.g., comb-type pilot pattern) to actually estimate the channel at all OFDM symbols. Secondly, the ICI is so evident that in this case it can not be just "equalized out" as explained above. In this case, we may have to turn to ICI-cancelation schemes (e.g., self ICI cancelation [25], the use of windowing [26] or the use of pilot tones [27,28]) to mitigate or suppress the ICI. Nevertheless, to change the pilot pattern at the transmitter or to implement ICI-cancelation scheme at the receiver will necessitate altering the transceiver's structure, increasing the computational complexity significantly, and moreover, the use of more pilot tones will inevitably decrease the capacity and/or throughput of the overall system. Hence, considering that the sensitivity of M-QAM-OFDM signals to Doppler frequency increases significantly with the alphabet size M, we recommend that in order to strike a compromise between complexity and performance as well as data transmission rate, low level modulation such as 8PSK, QPSK or BPSK modulation schemes should be adopted at 480 km/h. When operating under (including) 350 km/h the use of high level modulation 16 or 64QAM can be adopted to increase data transmission rate while at the same time maintain good performance.

ICI analysis
In Figure 11, we show the effect of ICI that is included in the derived result (see expression (26)). It is observed from Figure 11 that when the Normalized Doppler frequency is small, the effect of ICI is also small, which may indeed be ignored just as the authors of [11,12] do. However, as the moving speed increases, the Normalized Doppler frequency will increase as well, and thus the effect of ICI can not be ignored anymore in this situation. In conclusion, the effect of ICI and the noise variance will contribute together to influence the performance of OFDM systems in time-varying channels, especially in high-speed cases.

Conclusions
In this article, we have further analyzed WF-LS, including the asymptotical relationship between WF-LS with orders 2 and linear interpolation, the asymptotical    T h e e f f e c t o f I C I Figure 11 The effect of ICI. The effect of ICI is plotted as the absolute value versus the normalized Doppler frequency. relationships among WF-LS with different orders. Considering the limitation of WF-LS is that LS based channel estimation assumes the channel coefficients within one OFDM symbol period are constants, we propose to combine BEM based channel estimation, which can accurately estimate the time-varying channel coefficients during one OFDM symbol period, with WF to deal with the challenging problem of channel estimation in OFDM systems operating under various mobile environments, especially in high-speed cases. The expression for WF-BEM based channel estimation has been derived, and under the hypothesis of approximating the ICI term as additive white Gaussian noise, the derived result explicitly considers the effect of ICI. The simulation results showed that the proposed WF-BEM provides a substantial gain, in terms of estimation accuracy in time-varying channels, with respect to WF-LS, especially in fast time-varying channels.