Open Access

High-order approximation algorithm using Gram-Schmidt QR transformations coupled with an iterative correction process for tracking frequency drift in OFDM systems

EURASIP Journal on Wireless Communications and Networking20142014:16

Received: 18 July 2013

Accepted: 6 January 2014

Published: 27 January 2014


To jointly track frequency drift (or fine frequency offset) and channel state variation for orthogonal frequency division multiplexing (OFDM) communications over mobile wireless channels, the maximum-likelihood (ML) estimation techniques are commonly adopted. A major difficulty arises from the highly nonlinear nature of the log-likelihood function which renders local extrema or multiple solutions. The use of mathematical approximation coupled with an adaptive iteration algorithm is a viable approach to ease problem solving. The approximation methods used in existing works are usually of the first-order level. In this work, we devise a high-order approximation algorithm to improve the tracking performance. As shall be seen, the problem amounts to finding the roots of an approximate high-degree polynomial equation derived from the log-likelihood function. To this end, we triangularize the companion matrix constructed from the high-degree polynomial using iterative Gram-Schmidt QR transformations, an eigenvalue problem in matrix computation, coupled with another iterative correction process to produce a frequency offset estimate to good accuracy. It is found that a high-order approximation algorithm formulated can achieve much better tracking performance in terms of estimation accuracy, tracking range, and error rate results than various other tracking algorithms appearing in the literature.


Frequency trackingChannel estimationMaximum-likelihood estimationQR transformationOrthogonal frequency division multiplexing

1. Introduction

To operate an orthogonal frequency division multiplexing (OFDM) system in a mobile wireless environment, initial acquisition, sometimes referred to as the coarse synchronization, is required for both time and frequency synchronization[16]. The usual approach for acquisition uses some correlation techniques (utilizing the property of correlation between repeated symbol blocks[2], correlation between data symbols and the redundant symbol copies within cyclic prefix[3], or correlation between first and second half of the training symbol block[4]) which require no knowledge of the channel state information. After acquisition has been completed, because of Doppler spread, multipath fading, as well as instability of the local oscillators, small carrier frequency drift, or fine carrier frequency offset (CFO), and time variation of the channel impulse response (CIR) or channel frequency response (CFR) will continue to exist and hence must be tracked periodically or frequently (depending on the channel fading rate, tracking period or frequency may vary in different applications)[1]. Frequency tracking is sometimes referred to as fine frequency synchronization[1] which is the theme of this paper. In the literature, numerous frequency tracking algorithms have been proposed for OFDM communications[1, 713]. Unlike the coarse frequency synchronization in acquisition, the fine frequency synchronization or frequency tracking usually requires channel knowledge. Also, tracking channel state variation usually requires frequency to be synchronized[1420]. That is to say, either perfect channel estimation is assumed in deriving frequency offset algorithms or vice versa, perfect frequency synchronization is assumed in deriving channel estimation algorithms[10]. As a result, many researchers propose joint frequency tracking and channel estimation algorithms for OFDM communications[1, 9, 10, 1214, 2125]. All the tracking schemes mentioned above adopt the maximum-likelihood (ML) estimation (MLE) technique. The major task in MLE is to maximize a log-likelihood function with respect to the parameters to be estimated. For the joint tracking problem at hand, two parameters, viz., CFO and CIR (or CFR), are to be estimated. To find the maximum or the peak, we set partial differentiations of the log-likelihood function to zero with respect to these two parameters resulting in two simultaneous equations in terms of the two parameters. Solving the simultaneous equations gives the solutions of the two sought-after estimators. However, the log-likelihood function is usually highly nonlinear; its derivatives will also be highly nonlinear. Therefore, there exist multiple peaks (often infinitely many) or multiple solutions. The highest peak is called the global solution and is the sought-after solution. The rest of the peaks are called local maxima or more generally local extrema as some applications may seek a minimum or a valley. A local extremum is not the desired solution. To design an algorithm to accurately locate the desired global solution may be no easy task. To achieve the purpose, many algorithm processes are impractically complicated[9, 13]. A straightforward approach is to use the gradient method[10]. However, for the gradient method to converge to the desired global solution, it is critical to properly choose the starting point as well as the adaptation step size. Improper choices of the starting point and/or step size may lead to an undesired local extremum solution or even result in divergence. The starting point and step size may vary when the channel characteristic or the environment changes. For most practical applications, hardly an analysis can be provided for the choices of the starting point and step size. These choices then must be made by trial and error which may often take much time or even render failure. In general, when the hyperboloid surface represented by the log-likelihood function is highly irregular in shape with densely distributed local extrema, the search by gradient method becomes very difficult or even impossible. Thus, though theoretically workable, a gradient algorithm may not be practically implementable. At any rate, simulation results given by existing algorithms cited previously are mostly given for small frequency drifts. We have tested some of these algorithms, though not all of them, and found that, when the frequency drift (true CFO value) becomes large (say, exceeding 0.25 of the carrier spacing), the algorithms are not able to deliver a convincing performance. The culprit is the local extrema obstacle. To overcome the local extrema difficulty, a viable approach is to use mathematical approximation as done in the expectation-maximization (EM)-based algorithm of[12, 24, 25] and the joint maximum-likelihood channel and frequency estimation (JML-CFE) algorithm of[12]. The EM-based algorithm was first proposed by[12]. Then, the authors in[24] extended the idea to MIMO applications and[25] extended it to fast fading channels. Another approximation approach is called the recursive least squares (RLS)-based method. This approach requires linearization of the estimated parameters. The basic idea of approximation is to expand the cost function into its Taylor series form and then truncate the terms of high power orders as an approximation. The approximation methods used in[12, 24, 25] and[14] are all the first-order approximation methods (the derivative of the truncated Taylor series is a polynomial of degree one). However, the first order is too low an order to achieve tracking/estimation results with sufficient accuracy, especially when the frequency drift is large. Therefore, again, simulation results in[12] and[14] are given only for small frequency drifts.

In this work, we are inspired to extend the first-order approximation method to a high-order approximation method in an attempt to improve the tracking/estimation performance. As will be shown, the problem finally amounts to finding the roots of a high-degree real coefficient polynomial equation resulting from the high-order Taylor series truncation (the derivative of the log-likelihood function is expanded into a Taylor series). To this end, we triangularize the companion matrix constructed from the high-degree polynomial using iterative QR transformations by way of the Gram-Schmidt orthogonalization process (to be called the Gram-Schmidt QR transformations)[2628]. However, due to the Taylor series truncation, the triangularized matrix result obtained from the Gram-Schmidt QR transformations is only approximate. Then, to complete our entire high-order approximation algorithm, a correction process, also iterative, is introduced to work in conjunction with the Gram-Schmidt QR transformations to eventually produce a final CFO estimate to good accuracy. We will show that such a high-order approximation algorithm, in terms of tracking range, estimation accuracies, and symbol error rate (SER), will gain much improved performance over other algorithms appearing in the literature for OFDM communications.

We note here that the development of the fine CFO tracking with wide tracking range while keeping high estimation accuracy and low computational complexity is beneficial to the design of the synchronization algorithms, viewing the fact that different coarse synchronization algorithms lead to different magnitudes of the fine residual frequency offsets. Therefore, if the followed fine synchronization algorithm has wide estimation range, the number of times of performing coarse synchronization in a certain time period can be reduced because only the fine synchronization is needed to be performed without doing coarse synchronization in advance each time, which reduces the computational loads. The reason explains why we make efforts on improving the fine synchronization with wide tracking range, high estimation accuracy, and low computational complexity.

2. Signal and system model

Define X = diag{X0, X1, …, XN - 1} as the diagonal matrix with {X k , k = 0, 1, …, N - 1} as the set of transmitted baseband frequency domain data symbols over an OFDM symbol block of length N (in symbol units), r = [r0, r1, …, rN - 1] T as the time domain baseband received signal vector, and w = [w0, w1, …, wN - 1] T as the time domain baseband noise vector where {w n } are independent, identically distributed (i.i.d.) complex Gaussian random variables (RVs) with zero mean and variance σ w 2 and can usually be denoted by w n N 0 , σ w 2 for convenience, where T denotes transpose. For a frequency-selective channel of dispersion length v, let h = [h0, h1, …, hv - 1] T be the CIR vector with {h m , m = 0, 1, …, v - 1} being spatially uncorrelated and H = [H0, H1, …, HN - 1] T be the CFR vector[29, 30]. Here,
H = F v h ,
F ν = 1 1 1 1 e - j 2 π / N e - j 2 π ν - 1 / N 1 e - j 4 π / N e - j 4 π ν - 1 / N 1 e - j 2 π N - 1 / N e - j 2 π N - 1 ν - 1 / N
is an N × ν discrete Fourier transform (DFT) matrix. Assuming the frequency offset normalized to subcarrier spacing is δ. Then, adopting unitary DFT for signal data, after demodulation and discarding the cyclic prefix at the receiver, the complex baseband sample at the n th time slot in the received time domain OFDM block is given by[2, 9, 10]
r n = e j 2 π n δ / N y n + w n , n = 0 , 1 , , N - 1 ,
where y n is the offset-free noiseless data sample given as
y n = 1 N k = 0 N - 1 H k X k e j 2 π nk / N .

We note here that removal of cyclic prefix symbols requires accurate time synchronization which, as stated previously, has been completed in acquisition. In fact, timing error can be divided into integer part and fractional part. The coarse timing acquisition deals with the correction of the integer part of the timing error, while the fractional part of timing error is absorbed into the estimation of channel impulse response[31].

It is readily seen that (3) can be put in vector form as
r = 1 N D δ F N H XH + w = 1 N D δ F N H X F v h + w ,
where H denotes Hermitian transpose and
D δ = diag 1 , e j 2 π δ / N , , e j 2 π N - 1 δ / N .

3. The high-order approximation algorithm

The novelty of our proposed algorithm lies in the use of high-order approximation for a nonlinear derivative of the log-likelihood function and the use of an innovative iterative correction process to refine the approximate solution obtained via QR transformation adopted from a matrix computation theory. The most important is such an approach indeed yields very satisfying results. We shall now present our scheme in detail.

3.1 Formulation of the maximum-likelihood estimation

From (5), a log-likelihood function can be derived as
ln Λ = - N ln π σ w 2 - 1 σ w 2 r - 1 N D δ F N H X F v h 2 = - N ln π σ w 2 - 1 σ w 2 r H r - 2 N Re r H D δ F N H X F v h + 1 N h H F v H X H F N D δ H D δ F N H X F v h = - N ln π σ w 2 - 1 σ w 2 r H r - 2 N Re r H D δ F N H X F v h + h H F v H X H X F v h ,
where Re{} means real part. Now, by setting ∂ ln Λ/∂h = 0, we can obtain a solution for h that will render a maximum ln Λ for a fixed δ. This is just an ML estimate of h at a fixed δ given by[10]
h ^ = 1 N F v H X H X F v - 1 F v H X H F N D δ H r .
Constant modulus training sequence has been proven optimal for channel estimation[32]. Chu sequence[33], for example, falls onto this category. We shall use a Chu sequence given by X k = e j π m k 2 / N m being any integer relatively prime to N}. This results in X H X = I N . Then, (8) can be simplified to
h ^ = 1 N N F v H X H F N D δ H r .
Next, setting ln Λ δ = 0 leads to
Im r H Q D δ F N H X F v h = 0 ,
where Q = diag{0, 1, …, N - 1} and Im{} means imaginary part. Replacing the h in (10) by the h ^ of (9), we find
Im r H Q D δ F N H X F v F v H X H F N D δ H r = Im r H D δ G D δ H r = m = 0 N - 1 n = 0 N - 1 Im r m * r n g m , n e j 2 π δ m - n / N = 0 ,
where gm,n is the (m + 1, n + 1)th element of an N × N matrix G given by
G = Q F N H X F v F v H X H F N = g m , n = 0 0 l = 0 v - 1 k = 0 N - 1 X k e - j 2 π l - 1 k / N k ' = 0 N - 1 X k ' e j 2 π lk ' / N l = 0 v - 1 k = 0 N - 1 X k e - j 2 π l - 1 k / N k ' = 0 N - 1 X k ' e j 2 π l - 1 k ' / N N - 1 l = 0 v - 1 k = 0 N - 1 X k e - j 2 π l - N + 1 k / N k ' = 0 N - 1 X k ' e j 2 π lk ' / N N - 1 l = 0 v - 1 k = 0 N - 1 X k e - j 2 π l - N + 1 k / N k ' = 0 N - 1 X k ' e j 2 π l - 1 k ' / N 0 l = 0 v - 1 k = 0 N - 1 X k e - j 2 π l - 1 k / N k ' = 0 N - 1 X k ' e j 2 π l - N + 1 k ' / N N - 1 l = 0 v - 1 k = 0 N - 1 X k e - j 2 π l - N + 1 k / N k ' = 0 N - 1 X k ' e j 2 π l - N + 1 k ' / N .

Since (11) is now channel independent, we have decoupled δ from h and (11) can thus be solved for δ alone. However, (11) is highly nonlinear in δ and contains infinite number of solutions. We only desire the one solution that yields the global maximum of ln Λ. The task is not possible by analytical means. However, we can resort to the numerical method.

3.2 The approximation approach

We now expand the exponential term ej 2π(m - n)δ/N in (11) into an infinite series (Taylor series expansion) and then truncate this infinite series beyond terms with power order higher than K to obtain
e j 2 π m - n δ / N = 1 + j 2 π m - n N δ - 1 2 ! 2 π m - n N 2 δ 2 - + 1 K ! j 2 π m - n N K δ K .
Substituting this approximate expression of (13) for the exponential term into (11), we will get a K degree polynomial of δ with real coefficients. Therefore, solving (11) becomes equivalent to finding the roots of a real polynomial of degree K. This is an eigenvalue problem in matrix computations[26]. Express the K degree polynomial as
p δ = a 0 + a 1 δ + + a K - 1 δ K - 1 + δ K = 0 .
Notice that we have normalized the polynomial such that the coefficient a K is unity. This can be easily done just by dividing the original polynomial equation by the original nonzero a K . To make it distinguishable, denote the original nonzero a K by a different symbol a ˜ K . It then can be readily verified that the coefficients {a k } in (14) are given by
a k = 1 a ˜ K m = 0 N - 1 n = 0 N - 1 Im r m * r n g m , n k ! j 2 π m - n N k = 1 a ˜ K k ! 2 π N k Im j k n = 0 k - 1 n C n k r H Q k - n G Q n r , k = 0 , 1 , , K

where C n m = m ! m - n ! n ! , m ≥ n.

A word is in order here. We define our approximation order as K when the polynomial degree in (14) is K. However, note that (14) is the approximation of (11) which is a derivative of the log-likelihood function of (7). Thus, here the Taylor series truncation is performed after differentiation of the log-likelihood function, while in[12] and[14], the Taylor series truncation is performed directly on the log-likelihood function. According to our definition,[12] and[14] are actually using first-order approximations. Note that when K = 1, we also have a first-order approximation algorithm.

3.3 QR transformations

Now, from (13), we construct a K × K square matrix called the companion matrix as[27]
A = 0 0 0 - a 0 1 0 0 - a 1 0 1 0 0 0 0 1 - a K - 1 .

It has been known that a square matrix can be triangularized by iterative QR transformations[2628], where Q is an orthogonal matrix and R is an upper triangular matrix. When all the roots of the polynomial of (14) are real, these roots will constitute the diagonal elements of the eventual triangularized A matrix that are also the eigenvalues of A[2628]. In case the polynomial equation of (14) has complex roots in conjugate pairs, the iterative QR transformations will lead to a Hessenberg matrix[26] which is quasi-triangular. That is, the subdiagonal immediately below the main diagonal will contain nonzero elements. However, on the main diagonal, each real root will appear as an element while each complex root will not appear but is replaced by a certain unpredictable number, either real or complex, as an element[26]. There is no way of telling which elements are real roots and which are the numbers replacing complex roots. However, it can be certain that the one real root is there to be the desired global solution for the CFO estimate which must be real. We shall detail later how to find this global solution by our algorithm design.

The process of iterative QR transformations called the Gram-Schmidt QR transformations involves two operation phases alternatively performed. The first operation phase is called the Gram-Schmidt QR decomposition. The QR decomposition is carried out by the Gram-Schmidt orthogonalization process[26, 34]. The second operation phase is an iterative transformation process (or a triangularization process). However, when the polynomial degree K gets too high, the triangularization process will begin to produce less accurate results[34]. Fortunately, for the OFDM tracking problem at hand, we do not have to use a very high order of K and only need a crude result out of the triangularization process since a complementing correction process coupled with the triangularization process will carry the burden and take care the rest of the matter to eventually bring a final CFO estimate result to great accuracy. As a result, we do not need to execute a great many iterations of QR transformations. It can be demonstrated that just a couple of iterations of QR transformations will suffice. For fine frequency synchronization, the frequency drift is less than half the carrier spacing (|δ| < 0.5)[9, 10, 12, 14]. In this case, computer experiments show that our order 2 algorithm can produce results with good accuracy. When a wider tracking range is desired (|δ| > 0.5), we will need at least an order 4 algorithm.

The iterative QR transformations are carried out as follows:

Let the square matrix A of (13) be denoted as
A = a 1 a 2 a K ,

where {a k , k = 1, 2, …, K} are K × 1 column vectors of A. We then carry out the Gram-Schmidt orthogonalization process as follows:

Define the projection of a vector a on a unit vector e as
proj e a = a T e e 2 e ,
where T denotes the transpose and e = e T e is the Euclidean norm of e. Then, let
u 1 = a 1 , u 2 = a 2 - proj e 1 a 2 , u 3 = a 3 - proj e 1 a 3 - proj e 2 a 3 , u K = a K - k = 1 K - 1 p roj e k a K , with e k = u k u k .
The orthogonal matrix Q is now formed as
Q = e 1 e 2 e K .
The upper triangular matrix R is given by
R = Q T A .
This completes the Gram-Schmidt QR decomposition process. Next, we start the following iterations:
A 1 = RQ .

Construct Q1 from A1 using the Gram-Schmidt orthogonal process as done above.

Construct R 1 = Q 1 T A 1 .
A 2 = R 1 Q 1 A L = R L - 1 Q L - 1 .

When L is sufficiently large, we will find A L to be upper triangular with diagonal elements equal to the roots of the polynomial of (14). Now, the Gram-Schmidt QR transformations are completed. Note that, as mentioned earlier, in actual operations when coupled with a complementing correction process, we need not to execute many iterations of the QR transformations (L needs not be large). In our computer simulations, we have used L = 2 and up to K = 6.

Of the K roots obtained, only one root will be desired. Assume the K roots are δ1, δ2, …, δ K . Substitute each root into (9), then into (7), with the Chu sequence chosen for X earlier. The one root that maximizes (7) is the sought-after solution δ ^ 0 , i.e.,
δ ^ 0 = arg max δ k ln Λ δ k , k = 1 , 2 , , K .

For the special case of the first-order approximation when K = 1, (14) directly leads to δ ^ 0 = - a 0 / a 1 , and hence, no QR transformation needs to be executed. Furthermore, in case (14) possesses complex roots, A L will be in the Hessenberg form (quasi-triangular) as stated earlier. However, it will not matter. Exactly as in the all real roots case, we simply substitute each diagonal element into the log-likelihood function of (7). The one that yields the maximum log-likelihood function is the global solution just as given by (23).

3.4 The iterative correction process

We now must note that the global solution δ ^ 0 obtained as described above was via an approximation method truncating a Taylor series expansion beyond terms with power order higher than K, i.e., the approximate expression of (13) was used. Therefore, the solution δ ^ 0 is still an approximate solution (with L = 2 as is to be used, this solution will be even cruder). We can further refine this solution by an iterative correction process as follows.

With the initial estimate δ ^ 0 (this is why we have used the subscript 0 to begin with), we can correct the received signal vector as
r 1 = D δ ^ 0 H r 0 ,
where r0 = r is just the original received signal vector. After this CFO correction, r1 is expected to be cleaner than r0 (= r). We thus use this corrected signal or replace r with r1 in (9) to come up with a better log-likelihood function from which a new polynomial of degree K is then generated, i.e., a new (9) is generated. Carrying out the iterative Gram-Schmidt QR transformations again as above, we get a new and better CFO estimate δ ^ 1 . Continuing this way, eventually at a certain M th iteration, we should have δ ^ M 0 . Then, our final or overall CFO estimate can be obtained as
δ ^ = m = 0 M δ ^ m δ .

To avoid confusion, we shall, from here on, call δ ^ the final or overall CFO estimate, define the i th interim CFO estimate as δ ^ i = m = 0 i δ ^ m , i = 0, 1, …, M, (notice that δ ^ M = δ ^ ), and refer to δ as the true CFO (the very beginning CFO to be estimated).

Now, replacing the final CFO estimate of (25) for the δ in (9), we can get the CIR estimator h ^ immediately.

We will summarize the iterative adaptive QR transformation algorithm consisting of two phases alternately. To be clear, the matrices Q l , R l , and A l mentioned above at the first phase are all added with an extra subscript m as Ql,m, Rl,m, and Al,m, respectively, corresponding to the m th run of the second phase. The proposed iterative adaptive algorithm is listed as below:

Initial condition: r0 = r and A0,0 = A.

For m = 0, 1, …, M

For l = 0, 1, …, L - 1

Construct orthogonal matrix Ql,m from companion matrix Al,m using Gram-Schmidt orthogonal process.

Construct R l , m = Q l , m T A l , m .

Construct Al+ 1,m = Rl,mQl,m.


Choose the K diagonal elements of AL,m to be δ k , k = 1, 2, …, K.

Calculate δ ^ m = arg max δ k ln Λ δ k .

Construct r m + 1 = D δ ^ m H r m .

Construct companion matrix A0,m + 1 from the K degree polynomial that approximates lm r m + 1 H D δ - δ ^ m G D δ - δ ^ m H r m + 1 = 0 .


The final or overall CFO estimate is δ ^ = m = 0 M δ ^ m δ , and the CIR estimate is h ^ = 1 N N F v H X H F N D δ ^ H r .

Apparently, when a linear approximation (first-order approximation) is used for a high-degree polynomial, the precision becomes less as compared to a quadratic approximation (second-order approximation). The computational load for a high-order approximation algorithm would be heavier than that of a first-order approximation algorithm since more terms are involved in computations. However, what is important is that we are rewarded with tremendous improvements in CFO tracking range, estimation accuracy, and SER performance. These improvements will be demonstrated with performance comparisons by computer simulations in Section 4.

4. Simulation results and discussions

Consider an OFDM system with DFT size N = 64 operated over a frequency-selective channel of dispersion length v = 9 having an exponential power profile αe-πm/10, m = 0, 1, …, v - 1, with unit power, where α is used to normalized the channel power[30]. In all simulations for the triangularization process with |δ| < 0.5, only two successive Gram-Schmidt QR transformations will be executed (L = 2) for orders up to K = 6. For notations, denote Var δ ^ = E δ - δ ^ 2 = E [ ( δ - m = 0 M δ ^ m ) 2 ] and Var δ ^ i = E δ - δ ^ i 2 = E [ ( δ - m = 0 i δ ^ m ) 2 ] as the variance or MSE of the final CFO estimator and the i th interim CFO estimator, respectively. In Figure 1, assuming a small true CFO of δ = 0.18 with a 30-dB signal-to-noise ratio (SNR), we present the learning curves of the MSE of the interim CFO estimator (i.e., Var δ ^ i vs. i) for three approximation orders K = 1, 2, and 4. Note that no Gram-Schmidt QR transformation is executed for the first-order algorithm. From the figure, we see that the first-order algorithm takes one iteration step or one correction cycle (i = 1) to reach the steady-state value, while the second and fourth algorithms need two correction cycles (i = 2). This may be because the heavy computation load required by higher degree polynomials will encumber the correction process. We have mentioned earlier that when the polynomial degree K is too high, the triangularization process will begin to give less accurate results[34]. Then, Figure 2 shows the results of repeated experiments using a larger δ = 0.48. The result of the sixth-order algorithm is now added. This time, the first-order algorithm fails to deliver an acceptable result. Again, for the similar reason of computational encumbrance by high-order process, the second-order algorithm has a smoother path to reach the steady state needing two correction iterations (i = 2), while both the fourth- and sixth-order algorithms require one more iterations (i = 3) and struggle slightly on way to convergence (the learning curves experience small fluctuations). However, if we increase δ further beyond half the carrier spacing (|δ| > 0.5), we shall begin to see the advantage of using higher orders. In Figure 3, the curves using δ = 0.6 are presented. Now, lower order algorithms (K ≤ 2) start to diverge. However, the high-order algorithms (fourth and sixth) can still deliver good performance with the higher order (sixth order) doing better.
Figure 1

Learning curves of MSE of interim CFO estimator for approximation algorithms of various orders, δ= 0.18, SNR = 30 dB, L= 2.

Figure 2

Learning curves of MSE of interim CFO estimator for approximation algorithms of various orders, δ= 0.48, SNR = 30 dB, L= 2.

Figure 3

Learning curves of MSE of interim CFO estimator for approximation algorithms of various orders, δ= 0.6, SNR = 30 dB, L= 2.

From Figures 1,2 and3, the minimum required correction cycles ensuring convergence for carrier frequency offsets of 0.18, 0.48, and 0.6 are 2, 3, and 4, respectively, for the proposed method. Accordingly, we choose the correction cycle parameter M = 4 for all the subsequent simulations for our proposed method.

Next, over the same channel system with the small frequency drift of δ = 0.18, Figures 4 and5 respectively give the performance curves of the final CFO estimator MSE (i.e., Var δ ^ or Var δ ^ m ) vs. SNR and CIR estimator MSE Var h ^ vs. SNR for various algorithms including those of the EM-based algorithm and JML-CFE algorithm given by[12, 14]. For all the estimator MSE performance curves, results are averaged over 10,000 runs. Also incorporated in the figure is the Cramer-Rao bound (CRB) vs. SNR curve for performance comparison. The derivations for CRB are presented in the Appendix for clarity. For this small frequency drift, we see that our algorithms of order K = 1, 2, and 4 all yield comparable performance with estimator MSE curves adhering closely to the CRB curve, and they all outperform the EM-based and the JML-CFE algorithms. In fact, it can be apparently seen that the EM-based algorithm performance is unacceptable. Simulation results show that the EM-based algorithm can work well only with δ < 0.05. The reason that the EM-based algorithm works poorly may be due to the fact that the expectation step uses MMSE estimate rather than ML estimate for the CIR in addition to the poor use of low first-order approximation. If we increase δ to 0.48, we will find that all the first-order algorithms including ours will all fail to work, while the three high-order algorithms (K = 2, 4, 6) still perform comparably with estimator MSE curves very closely adhering to the CRB (results not shown for space saving). For δ values beyond half the carrier spacing, Figures 6 and7 respectively plot the curves of final CFO estimator MSE and the CIR estimator MSE against SNR along with the CRB curves for δ = 0.6. Orders 2, 4, and 6 are given. We can now see that the sixth-order algorithm clearly prevails as both the CFO and CIR estimator MSEs will begin to adhere closely to the CRBs from SNR = 20 dB, while for the fourth-order algorithm, the SNR must be greater than 30 dB for estimator MSEs to closely adhere to the CRBs. For orders K ≤ 2, the algorithm will fail to perform at δ = 0.6 just as in Figure 3.
Figure 4

MSE of final CFO estimator vs. SNR curves for various algorithms with δ= 0.18.

Figure 5

MSE of CIR estimator vs. SNR curves for various algorithms with δ= 0.18.

Figure 6

MSE of final CFO estimator vs. SNR curves for various algorithms with δ= 0.6.

Figure 7

MSE of CIR estimator vs. SNR curves for various algorithms with δ= 0.6.

Although the estimator MSE is an important and very useful performance metric for parameter estimations, the ultimate performance measure in communication systems belongs to error rate. We therefore should compare the error rate performances of various algorithms. It is well known that OFDM error rate performance relies on the accuracies of frequency and channel estimation[30]. A good tracking scheme should achieve better CFO and CIR estimations hence lower error rate. Employing 16-ary square QAM transmission over the same channel system, with small frequency drift δ = 0.18, Figure 8 compares SER vs. SNR curves for all the first-order schemes as well as the second-order algorithm. Clearly, both the schemes of[12] and[14] fail to perform satisfactorily, while our first- and second-order algorithms perform comparably well with SER values just slightly higher than an ideal case (perfect CFO synchronization and channel estimation). Then, we assume a larger CFO δ = 0.6. Now as shown in Figure 9, both our first- and second-order algorithms fail to work. The fourth- and sixth-order algorithms perform comparably well with SER curve slightly higher than the ideal result.
Figure 8

Curves of 16-QAM SER vs. SNR for low-order approximation algorithms with δ= 0.18.

Figure 9

Curves of 16-QAM SER vs. SNR for approximation algorithms of various orders with δ= 0.6.

Figures 10 and11 show the MSE of final CFO estimate vs. CFO δ and the MSE of CIR estimator vs. CFO δ, respectively, at fixed SNR. Both the figures show that the sixth-order algorithm works well for CFO up to 0.6, while the second-order algorithm performs comparably well with the sixth order for CFO up to 0.55 but fails when CFO approaches 0.6. For the fourth order, when the SNR is greater than 30 dB, it performs comparably well with the sixth order. The results show agreement with what we get from Figures 4,5,6 and7, in which only two specific CFO values, 0.18 and 0.6, are considered. Figure 12 shows the curves of SER vs. CFO δ. It shows agreement with what we get from Figures 8 and9 that the fourth and the sixth have the comparable performance in terms of SER up to CFO of 0.6. Notice that seeing from Figures 4,5,6,7,10 and11, the fourth order has poor MSEs for both the CFO and CIR estimators working on CFO approaching 0.6 at low SNR, i.e., SNR = 20 dB. However, seeing from Figures 8,9 and12, the SER curves of the fourth order are closely attached to those of the sixth order for CFO up to 0.6 for all values of SNR. This means that, at low SNR which corresponds to high SER, the SER performance makes no difference for the fourth- and sixth-order algorithms even though they have big differences in MSE values at low SNR. Since the ultimate performance depends on SER, instead of MSE, we conclude that the fourth order will suffice for covering the CFO exceeding half the subcarrier spacing.
Figure 10

MSE of final CFO estimator vs. CFO curves for fixed SNR.

Figure 11

MSE of CIR estimator vs. CFO curves for fixed SNR.

Figure 12

Curves of 16-QAM SER vs. CFO for fixed SNR.

By observing various simulation results, we can conclude that, for a tracking range up to |δ| < 0.6, the fourth-order algorithm should suffice. To cover a wider tracking range, one can use orders higher than 4. However, using orders higher than 4 is not recommended since it will only increase the computation load without much gain except for a wider tracking range which is rarely necessary for fine frequency tracking. In fact, for fine frequency tracking, the frequency drift ought to be less than half the carrier spacing or |δ| < 0.5. The |δ| < 0.5 range is already a much wider range than those covered by all the existing tracking algorithms cited previously. Our second-order algorithm can easily cover this range with adequate performance. Thus, our proposed algorithm design indeed provides a significant contribution to the OFDM tracking study.

The significant performance improvement provided by our high-order approximation algorithm is not without price. It is conceivable that the higher the order, the higher is the computation complexity and hence more complicated and costly it will be for practical implementation. Depending on the application, a trade-off between complexity and performance may be made. Table 1 evaluates and compares the computation complexities of various algorithms discussed in this paper. It is seen that the complexity of an algorithm is related to channel dispersion length v, OFDM symbol block length N, order of approximation K, as well as iteration number M in a rather complicated way. Generally, higher approximation order will give higher complexity. Usually, N is much larger than v, K, or M and hence has the dominant impact on the computation complexity. Thus, from Table 1, we can get a rough estimate that, among the first-order approximation algorithms, the EM-based algorithm should have the lowest complexity, but its performance is poorest. Our first-order algorithm may have higher complexity than the EM-based and the JML-CFE algorithms, but not much higher. Therefore, in light of all the gaining in tracking range, estimation accuracy, and error rate performance, it should be well worthwhile to adopt our algorithm.
Table 1

Computation complexity comparison between various tracking algorithms


Real multiplication

Real addition

Real division

Square roots


(M + 1)(8vN + 13N - 1)

2(M + 1)(2vN + N - v - 2)

M + 1



2N(13N + 2v + 8)

12N2 + 2vN - 3N - 2v - 6



The first-order algorithm

M + 1 ( 8 N 2 + 13 N - 1 ) + 4 vN

M + 1 ( 4 N 2 - 3 ) + 2 v N - 1 - 1

M + 1


The K th-order algorithm (K ≥ 2)

M + 1 { ( 20 K + 8 ) N 2 + 5 K + 4 N + K [ L 3 K 2 - 1 - 1 ] } + 4 vN

M + 1 { 2 ( 2 K + 1 ) N 2 - 2 K + 1 N + L K - 1 ( 3 K 2 + K / 2 - 1 ) } + M + 2 v N - 1

M + 1 [ LK 3 K - 1 / 2 + K ]

(M + 1)LK

5. Conclusions

To track frequency drift and channel state variation in OFDM systems over mobile wireless channels, we propose a high-order approximation algorithm using the MLE scheme which outperforms existing OFDM tracking algorithms in terms of tracking range, estimation accuracy, as well as SER performance. Computer simulation results show that a fourth-order algorithm will suffice to cover a wide tracking range exceeding half the carrier spacing while producing results with good accuracy. A second-order algorithm will suffice if the tracking range is less than half the carrier spacing.


Cramer-Rao bounds

Define the following real vectors by stacking the real and imaginary parts of corresponding complex vectors as
r ¯ = Re r T , Im r T T ,
m ¯ = 1 N Re D δ F N H X F v h T , Im D δ F N H X F v h T T = m ¯ 1 T , m ¯ 2 T T ,
θ ¯ = Re h T , Im h T , δ T .
Then, the log-likelihood function of (7) can be rewritten as
ln Λ = - N ln π σ w 2 - 1 σ w 2 r ¯ - m ¯ 2 .
The fisher information matrix for the real vector θ ¯ is given by[35]
J θ ¯ = E ln Λ θ ¯ ln Λ θ ¯ T ,
from which the (l, l′)th element is given by
J l , l = 2 σ w 2 m ¯ T θ ¯ l m ¯ θ ¯ l = 2 σ w 2 Re { m ¯ 1 + j m ¯ 2 H θ ¯ l m ¯ 1 + j m ¯ 2 θ ¯ l } = 2 N σ w 2 Re D δ F N H X F v h H θ ¯ l D δ F N H X F v h θ ¯ l ,
Where θ ¯ L denotes the l th element of θ ¯ . Alternatively, using (A6), we can write
J θ ¯ = 2 N σ w 2 Re D δ F N H X F v h H θ ¯ D δ F N H X F v h θ ¯ T .
Now, using the following facts
D δ F N H X F v h Re h T = D δ F N H X F v ,
D δ F N H X F v h Im h T = j D δ F N H X F v ,
D δ F N H X F v h δ = j 2 π N Q D δ F N H X F v h ,
the Fisher information of (32) can be expressed as
J θ ¯ = 2 σ w 2 Re F v H X H X F v j F v H X H X F v j 2 π N 2 F v H X H F N Q F N H X F v h - j F v H X H X F v F v H X H X F v 2 π N 2 F v H X H F N Q F N H X F v h j 2 π N 2 h H F v H X H F N Q F N H X F v - 2 π N 2 h H F v H X H F N Q F N H X F v 4 π 2 N 3 h H F v H X H F N Q 2 F N H X F v h .

The Cramer-Rao bounds (CRBs) can then be obtained from the diagonal elements of J-1. Note that the CRBs obtained here are jointly for the ML estimates of CIR h and CFO δ (i.e., h ^ and δ ^ ).



carrier frequency offset


channel frequency response


channel impulse response


discrete Fourier transform




independent, identically distributed


joint maximum-likelihood channel and frequency estimation




ML estimation


orthogonal frequency division multiplexing


recursive least squares


random variable


symbol error rate.


Authors’ Affiliations

Department of Electrical Engineering, Tamkang University
Department of Electrical Engineering, Fu Jen Catholic University


  1. Keller T, Piazzo L, Mandarini P, Hanzo L: Orthogonal frequency division multiplex synchronization techniques for frequency-selective fading channels. IEEE. J. Seclected Areas Commun. 2001, 19(6):999-1008. 10.1109/49.926356View ArticleGoogle Scholar
  2. Moose PH: A technique for orthogonal frequency division multiplexing frequency offset correction. IEEE Trans. Commun. 2001, 42(10):2908-2914.View ArticleGoogle Scholar
  3. van de Beek JJ, Sandell M, Borjesson PO: ML estimation of time and frequency offset in OFDM systems. IEEE. Trans Signal Proces. 1997, 45(7):1800-1805. 10.1109/78.599949View ArticleGoogle Scholar
  4. Schmidl TM, Cox DC: Robust frequency and timing synchronization for OFDM. IEEE Trans. Commun. 1997, 45(12):1613-1621. 10.1109/26.650240View ArticleGoogle Scholar
  5. Kim YH, Lee JH: Joint maximum likelihood estimation of carrier and sampling frequency offsets for OFDM systems. IEEE Trans. Broadcast. 2011, 57(2):277-283.View ArticleGoogle Scholar
  6. Chin WL: ML estimation of timing and frequency offsets using distinctive correlation characteristics of OFDM signals over dispersive fading channels. IEEE. Trans. Vehicular Technol. 2011, 60(2):444-456.View ArticleGoogle Scholar
  7. Choi YS, Voltz PJ, Cassara FA: ML estimation of carrier frequency offset for multicarrier signals in Rayleigh fading channels. IEEE. Trans. Vehicular Technol. 2001, 50(2):644-655. 10.1109/25.923075View ArticleGoogle Scholar
  8. Salberg BB, Swami A: Doppler and frequency-offset synchronization in wideband OFDM. IEEE. Trans. Wireless Commun. 2005, 4(6):2870-2881.View ArticleGoogle Scholar
  9. Morelli M, Mengali U: Carrier-frequency estimation for transmissions over selective channels. IEEE Trans. Commun. 2000, 48(9):1580-1589. 10.1109/26.870025View ArticleGoogle Scholar
  10. Ma X, Kobayashi H, Schwartz SC: Joint frequency offset and channel estimation for OFDM. Proceedings of the IEEE GLOBECOM 2003, vol. 1. San Francisco: IEEE, Piscataway; 2003:15-19.Google Scholar
  11. Zheng L, Cheng W, Hu J, Yuan D: The ML frequency tracking algorithm for OFDM systems based on pilot symbols and decision data. Proceedings of the IEEE International Symposium on Microwave, Antenna, Propagation, and EMC Technologies for Wireless Communications, vol. 2. Beijing: IEEE, Piscataway; 2005:1611-1614.Google Scholar
  12. Lee JH, Han JC, Kim SC: Joint carrier frequency synchronization and channel estimation for OFDM systems via the EM algorithm. IEEE. Trans. Vehicular Technol. 2006, 55(1):167-172. 10.1109/TVT.2005.861212View ArticleGoogle Scholar
  13. Pun MO, Morelli M, Kuo CCJ: Maximum-likelihood synchronization and channel estimation for OFDMA uplink transmissions. IEEE Trans. Commun. 2006, 54(4):726-736.View ArticleGoogle Scholar
  14. Merli FZ, Vitetta GM: Iterative ML-based estimation of carrier frequency offset, channel impulse response and data in OFDM transmissions. IEEE Trans. Commun. 2008, 56(3):497-506.View ArticleGoogle Scholar
  15. van de Beek JJ, Edfors O, Sandell M, Wilson SK, Borjesson PO: On channel estimation in OFDM systems. Proceedings of the IEEE 45th Vehicular Technology Conference. Chicago: IEEE, Piscataway; 1995:815-819.Google Scholar
  16. Edfors O, Sandell M, van de Beek JJ, Wilson SK, Borjesson PO: OFDM channel estimation by singular value decomposition. IEEE Trans. Commun. 1998, 46(7):931-939. 10.1109/26.701321View ArticleGoogle Scholar
  17. Li Y: Pilot-symbol-aided channel estimation for OFDM in wireless systems. IEEE. Trans. Vehicular Technol. 2000, 49(4):1207-1215. 10.1109/25.875230View ArticleGoogle Scholar
  18. Li Y: Simplified channel estimation for OFDM systems with multiple transmit antennas. IEEE. Trans. Wireless Commun. 2002, 1(1):67-75. 10.1109/7693.975446View ArticleGoogle Scholar
  19. Li Y, Winters JH, Sollenberger NR: MIMO-OFDM for wireless communications: signal detection with enhanced channel estimation. IEEE Trans. Commun. 2002, 50(9):1471-1477. 10.1109/TCOMM.2002.802566View ArticleGoogle Scholar
  20. Coleri S, Ergen M, Puri A, Bahai A: Channel estimation techniques based on pilot arrangement in OFDM systems. IEEE Trans. Broadcast. 2002, 48(3):223-229. 10.1109/TBC.2002.804034View ArticleGoogle Scholar
  21. Nguyen-Le H, Le-Ngoc T, Ko CC: RLS-based joint estimation and tracking of channel response, sampling, and carrier frequency offsets for OFDM. IEEE Trans. Broadcast. 2009, 55(1):84-94.View ArticleGoogle Scholar
  22. Jamalabdollahi M, Salari S, Zahedi B: Efficient joint frequency synchronization and channel estimation in OFDM systems. Proceedings of the IEEE 19th Conference on Signal Processing and Communications Applications. Antalya: IEEE, Piscataway; 2011:206-209.Google Scholar
  23. Liu Y, Tan Z, Wang H, Kwak KS: Joint estimation of channel impulse response and carrier frequency offset for OFDM Systems. IEEE. Trans. Vehicular Technol. 2011, 60(9):4645-4650.View ArticleGoogle Scholar
  24. Salari S, Kandovan RS: Joint frequency and channel estimation for MIMO-OFDM systems via the EM algorithm. Proceedings of the IEEE Wireless and Optical Communications Networks. Colombo: IEEE, Piscataway; 2010:1-5.Google Scholar
  25. Simon EP, Ros L, Hijazi H, Ghogho M: Joint carrier frequency offset and channel estimation for OFDM systems via the EM algorithm in the presence of very high mobility. IEEE. Trans. Signal Proces. 2012, 60(2):754-765.MathSciNetView ArticleGoogle Scholar
  26. Sewell G: Computational Methods of Linear Algebra. 2nd edition. New York: Wiley; 2005.View ArticleGoogle Scholar
  27. Golub GH, Van Loan CF: Matrix Computations. 3rd edition. Maryland: The Johns Hopkins University Press; 1996.Google Scholar
  28. Francis JFG: The QR transformation, a unitary analogue to the LR transformation-part 1. Comp. J. 1961, 4(3):265-271. 10.1093/comjnl/4.3.265MathSciNetView ArticleGoogle Scholar
  29. Proakis JG, Salehi M: Digital Communications. 5th edition. New York: McGraw-Hill; 2008.Google Scholar
  30. Yen RY, Liu HY, Tsai WK: QAM symbol error rate in OFDM systems over frequency-selective fast Ricean fading channels. IEEE. Trans. Vehicular Technol. 2008, 57(2):1322-1325.View ArticleGoogle Scholar
  31. Morelli M, Kuo CJ, Pun MO: Synchronization techniques for orthogonal frequency division multiple access (OFDMA): a tutorial review. Proc. IEEE 2007, 95(7):1394-1427.View ArticleGoogle Scholar
  32. Stoica P, Besson O: Training sequence design for frequency offset and frequency-selective channel estimation. IEEE Trans. Commun. 2003, 51(11):1910-1917. 10.1109/TCOMM.2003.819199MathSciNetView ArticleGoogle Scholar
  33. Chu DC: Polyphase codes with good periodic correlation properties. IEEE. Trans. Info. Theory. 1972, 18(4):531-532. 10.1109/TIT.1972.1054840View ArticleGoogle Scholar
  34. Haykin S: Adaptive Filter Theory. 4th edition. New Jersey: Prentice Hall; 2002.Google Scholar
  35. Kay SM: Fundamentals of Statistical Signal Processing, Volume I: Estimation Theory. New Jersey: Prentice Hall; 1993.Google Scholar


© Yen et al.; licensee Springer. 2014

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.