
# On massive MIMO performance with semi-orthogonal pilot-assisted channel estimation

*EURASIP Journal on Wireless Communications and Networking*
**volume 2014**, Article number: 220 (2014)

## Abstract

With the rapidly increasing demand for high-speed data transmission and a growing number of terminals, massive multiple-input multiple-output (MIMO) has shown promise in meeting these challenges owing to its high spectral efficiency. Although massive MIMO can efficiently improve system performance, the use of orthogonal pilots with a growing number of terminals consumes substantial resources, especially when the coherence interval is short. This paper proposes a semi-orthogonal pilot design with simultaneous data and pilot transmission. The proposed technique exploits the asymptotic channel orthogonality of massive MIMO systems, with which successive interference cancellation (SIC)-based channel estimation is applied to mitigate the mutual interference between data and pilots. We derive theoretical expressions for the achievable rates in massive MIMO systems with the proposed pilot design. Further discussion of the performance verifies the superiority of the proposed pilot design at high or low signal-to-noise ratios (SNRs) for any coherence interval length. Simulation results show that the proposed pilot design achieves a significant performance improvement with reduced pilot resource consumption compared with conventional orthogonal pilots.

## 1 Introduction

With a rapidly increasing demand for high data rates, as well as the growing number of serving users, massive multiple-input multiple-output (MIMO) is emerging as a promising technology to meet the challenge by providing a significant increment in reliability and data rate for wireless communications[1–3].

For MIMO systems, channel state information (CSI) is crucial for achieving multi-antenna gains. Acquiring it becomes more challenging in massive MIMO systems due to the numerous antennas at the base station (BS). Massive MIMO requires a large number of pilots if frequency-division duplexing (FDD) is used, since the burden of downlink pilots is proportional to the number of BS antennas, while for time-division duplexing (TDD)[3, 4], uplink training is an effective method to obtain CSI by exploiting channel reciprocity. Generally, orthogonal pilot patterns are widely used for multi-channel estimation. It is well understood that the length of orthogonal pilots must be at least the number of users in a cell, which is in general much smaller than the number of BS antennas. However, even in TDD, the required resource for orthogonal pilots increases dramatically in a multi-cell massive MIMO system. Moreover, under the restriction of the coherence interval duration and increasing user numbers, the same set of orthogonal pilots is reused in adjacent cells, and thus pilot contamination[5–7] occurs in a multi-cell MIMO system. When the BS estimates the channel for a particular user, it may obtain a channel estimate contaminated by adjacent-cell users that share the same pilot.

It has been revealed in[2, 8] that pilot contamination becomes a bottleneck that limits the performance benefits of massive MIMO. To solve this problem, recent studies[9–11] proposed various approaches to tackle pilot contamination. Although they try to alleviate the pilot contamination between multiple cells, they still use orthogonal pilots within a single cell, which implies large pilot resource consumption, especially for short coherence intervals.

Considering the pilot resource consumption as well as the importance of channel estimation accuracy, an efficient pilot design is essential for achieving the full potential of massive MIMO systems. However, as far as we know, little attention has been paid to pilot design in massive MIMO systems. An exception[12] studied pilot sequence design but paid little attention to pilot resource consumption. Therefore, in this work, we study the problem of efficient pilot design by exploiting the asymptotic channel orthogonality[13] together with successive interference cancellation (SIC) in massive MIMO systems.

The technical contributions of this work are summarized as follows: We present a novel pilot design with low resource consumption. In the proposed technique, we allow simultaneous data and pilot transmission and insert shifted pilot locations in slots, i.e., different users transmit pilots in different slots. It takes advantage of the asymptotic channel orthogonality for massive MIMO. Hence the mutual interference between data and pilot due to a semi-orthogonal pilot design can be mitigated by SIC. Numerical results show that the proposed pilot design outperforms the conventional orthogonal pilots. In particular, for low or high signal-to-noise-ratios (SNRs), we also theoretically prove the superiority of our proposed pilot design.

The paper is organized as follows. In Section 2, we describe the system model and transmission protocol of conventional massive MIMO systems. Section 3 addresses the transmission scheme of massive MIMO systems with the proposed pilot design. We analyze the achievable rates of both the uplink and downlink in Section 4. In Section 5, we deduce the asymptotic achievable rate as the SNR tends to infinity and zero. In Section 6, numerical results show that the proposed pilot design increases data transmission rates in various scenarios. Section 7 contains our conclusions.

## 2 System model with orthogonal pilots

We consider a cellular system composed of one BS with *M* antennas and *K* (*K* ≪ *M*) single-antenna users. Let *ρ*_{p}, *ρ*_{u}, and *ρ*_{d} be the pilot SNR, the uplink SNR, and the downlink SNR, respectively. Denote ${\mathbf{g}}_{k}=\sqrt{{\beta}_{k}}{\mathbf{h}}_{k}$ as the channel vector between the BS and the *k* th user, where **h**_{k}, the corresponding small-scale fading vector, is i.i.d. $\mathcal{C}\mathcal{N}\left(\mathbf{0},{\mathbf{I}}_{M}\right)$ and $\sqrt{{\beta}_{k}}$ models the geometric attenuation as well as shadowing effects, which are assumed to be constant and known *a priori*. We assume that the channel obeys reciprocity in TDD, i.e., the channel factors are the same for both the uplink and downlink, and that **h**_{k} remains constant during a coherence interval of length *T*.

The transmission scheme of a conventional massive MIMO system with orthogonal pilots is shown in Figure 1, where *τ*_{op} (*τ*_{op} ≥ *K*) and *D*_{op} respectively represent the lengths of the pilot and the downlink data. In what follows, the orthogonal pilot scheme is also referred to as the conventional pilot design and serves as a benchmark. Since **h**_{k} changes over coherence intervals, without loss of generality, we take the communication between the BS and users in a single coherence interval as an example for deeper analysis. As shown in Figure 1, a coherence interval is organized in three phases: the uplink channel estimation period, the uplink data transmission period, and the downlink data transmission period.

The conventional pilot design in massive MIMO systems utilizing orthogonal pilots can prevent pilot contamination within one cell and obtain relatively accurate channel estimates[2]. However, the required pilot overhead is$\frac{{\tau}_{\mathit{\text{op}}}}{T}({\tau}_{\mathit{\text{op}}}\ge K)$ for each user in a cell, which is too large especially when *T* is small and *K* is large in massive MIMO systems. In the next section, we will propose a possible pilot design which can keep a balance between the efficiency of data transmission and performance of the system.

## 3 Semi-orthogonal pilot design

In this section, we propose a semi-orthogonal pilot design with shifted locations, which reduces the pilot overhead while guaranteeing the system performance due to the merit of asymptotic channel orthogonality in massive MIMO systems.

The transmission scheme with the proposed pilot design is shown in Figure 2, where *τ*_{pp} and *D*_{pp} respectively denote the lengths of the pilot and the downlink data, and ${q}_{1}^{2}[1]$, ${q}_{1}^{3}[1]$, and *q*_{1}[1] refer to the definitions in Table 1. In the first coherence interval, when the first user transmits a pilot, the other users are mute so that the BS can estimate the first user’s channel without contamination from other users. When the second user transmits a pilot, the first user transmits data while the other users still remain quiet, and so forth. In brief, when one user transmits a pilot, other users whose channels have already been estimated can transmit uplink data. The BS adopts SIC to remove the interference from other users by exploiting the estimated channel information and thereby obtains all channel estimates. The channel estimates can be used in the subsequent uplink data reception and downlink data transmission. In the second coherence interval, all users except the one transmitting a pilot can transmit uplink data simultaneously during the channel estimation stage. The BS can utilize the known channel information estimated in either the previous or the present coherence interval to remove the contamination from other users.
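The slot pattern described above can be sketched as a small scheduling function. This is only an illustration of the pilot phase as described in the text; the function name `pilot_phase_schedule` and the role labels are hypothetical and do not come from the paper.

```python
def pilot_phase_schedule(num_users, interval_index):
    """Return, for each pilot slot, the role of every user.

    Slot s (1-based) is the slot in which user s sends its pilot.
    interval_index is 1 for the first coherence interval and > 1 afterwards.
    Roles are 'pilot', 'data', or 'mute'.
    """
    schedule = []
    for slot in range(1, num_users + 1):
        roles = []
        for user in range(1, num_users + 1):
            if user == slot:
                roles.append("pilot")
            elif interval_index > 1 or user < slot:
                # A channel estimate already exists (from this interval or the
                # previous one), so the user may send uplink data.
                roles.append("data")
            else:
                # First interval: channel not yet estimated, so stay silent.
                roles.append("mute")
        schedule.append(roles)
    return schedule


first = pilot_phase_schedule(3, 1)
later = pilot_phase_schedule(3, 2)
for slot, roles in enumerate(first, start=1):
    print("interval 1, slot", slot, roles)
for slot, roles in enumerate(later, start=1):
    print("interval 2, slot", slot, roles)
```

In the first interval the schedule is lower-triangular in data transmissions, while from the second interval onward every non-pilot user transmits data in every pilot slot.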

*Remark* 1. Note from Figure 2 that the pilot overhead for each user in the *i* th (*i* > 1) coherence interval of the proposed pilot design is $\frac{1}{T}$, and $\frac{1}{T}\ll \frac{K}{T}$ for large *K*, a saving that matters most when *T* is small. As for the first coherence interval of the proposed pilot design, the pilot overhead is $\frac{K+1}{2T}$, which is also smaller than the $\frac{K}{T}$ of conventional orthogonal pilots.
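The overhead figures in Remark 1 can be checked with exact fractions. The sketch below assumes the minimum orthogonal pilot length *τ*_{op} = *K*, and it reads the first-interval figure as an average over users in which user *k* cannot send data during slots 1 to *k* (mute, then pilot). That reading is an interpretation of Figure 2, not a statement from the paper, and the function names are illustrative.

```python
from fractions import Fraction


def orthogonal_overhead(num_users, coherence_len):
    """Overhead tau_op / T, assuming the minimum pilot length tau_op = K."""
    return Fraction(num_users, coherence_len)


def semi_orthogonal_overhead(num_users, coherence_len, first_interval):
    """Average per-user overhead of the shifted-pilot scheme."""
    if not first_interval:
        return Fraction(1, coherence_len)  # one pilot slot per user
    # First interval (assumed reading): user k loses slots 1..k.
    lost_slots = sum(range(1, num_users + 1))  # 1 + 2 + ... + K
    return Fraction(lost_slots, num_users * coherence_len)


K, T = 10, 40
print(orthogonal_overhead(K, T))              # K / T
print(semi_orthogonal_overhead(K, T, True))   # (K + 1) / (2T)
print(semi_orthogonal_overhead(K, T, False))  # 1 / T
```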

Due to the difference between the communications in the first and the *i* th (*i* > 1) coherence intervals, the uplink and downlink data transmissions as well as the channel estimation are elaborated in detail in the following subsections. Before the elaboration, we first show notational definitions in Table 1, where parameter *i* represents the *i* th coherence interval. Besides, we replace **g**_{k} with **g**_{k}[*i*] to signify the channel vector between the BS and the *k* th user.

### 3.1 Communication in the first coherence interval

#### 3.1.1 Uplink of the first user

Before data transmission, the first symbol of the first coherence interval is reserved for uplink channel estimation. The first user transmits its pilot while the other users are mute. The received signal at the BS is

Then the BS gets a minimum mean squared error (MMSE)[14, 15] estimate:

Generally, the channel can be decomposed as ${\mathbf{g}}_{1}[1]={\stackrel{~}{\mathbf{g}}}_{1}[1]+{\widehat{\mathbf{g}}}_{1}[1]$. From the properties of MMSE estimation, ${\widehat{\mathbf{g}}}_{1}[1]\sim \mathcal{C}\mathcal{N}\left(\mathbf{0},{\sigma}_{1,1}^{2}{\mathbf{I}}_{M}\right)$ and ${\stackrel{~}{\mathbf{g}}}_{1}[1]\sim \mathcal{C}\mathcal{N}\left(\mathbf{0},{\epsilon}_{1,1}^{2}{\mathbf{I}}_{M}\right)$ is the independent estimation error, where ${\epsilon}_{1,1}^{2}={\beta}_{1}-{\sigma}_{1,1}^{2}$ and ${\sigma}_{1,1}^{2}=\frac{{\rho}_{p}{\beta}_{1}^{2}}{1+{\rho}_{p}{\beta}_{1}}$.
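The stated variance ${\sigma}_{1,1}^{2}$ can be checked per antenna with a short Monte Carlo run. The sketch assumes a unit-modulus pilot symbol, unit-variance noise, and the standard scalar MMSE scaling $\sqrt{\rho_p}\beta_1/(1+\rho_p\beta_1)$ applied to the de-rotated observation; these forms are assumptions consistent with the variance above, since the display equations are not reproduced here. All function names are illustrative.

```python
import math
import random


def complex_gaussian(rng, variance):
    """Draw one CN(0, variance) sample."""
    s = math.sqrt(variance / 2.0)
    return complex(rng.gauss(0.0, s), rng.gauss(0.0, s))


def mmse_variances(rho_p, beta):
    """Closed-form estimate variance sigma^2 and error variance eps^2."""
    sigma2 = rho_p * beta ** 2 / (1.0 + rho_p * beta)
    return sigma2, beta - sigma2


def empirical_estimate_power(rho_p, beta, trials, seed=0):
    """Average |g_hat|^2 over many independent channel draws."""
    rng = random.Random(seed)
    gain = math.sqrt(rho_p) * beta / (1.0 + rho_p * beta)  # assumed MMSE scaling
    total = 0.0
    for _ in range(trials):
        g = complex_gaussian(rng, beta)
        w = complex_gaussian(rng, 1.0)
        y = math.sqrt(rho_p) * g + w  # observation after removing the pilot phase
        total += abs(gain * y) ** 2
    return total / trials


sigma2, eps2 = mmse_variances(10.0, 1.0)
print(sigma2, eps2, empirical_estimate_power(10.0, 1.0, 20000))
```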

Once the BS gets the first user’s channel estimate, the first user starts uplink data transmission. By exploiting the merit of massive MIMO, simultaneous pilot and data transmission of other users has little impact on the first user’s uplink data detection.

Concerning the first user’s uplink data detection when the *k* th(*k* > 1) user transmits pilot, the received signal at the BS is

The BS applies maximum-ratio combining (MRC) since it is viable and common in massive MIMO uplink data reception[2, 16]. The uplink data detection of the first user when the *k* th user transmits pilot equals

where we divide both the denominators and numerators in $\stackrel{\text{(a)}}{=}$ by *M* and apply Lemma 1, because ${\widehat{\mathbf{g}}}_{1}[1]$ is independent of **g**_{t}[1] (*t* ≠ 1), ${\stackrel{~}{\mathbf{g}}}_{1}[1]$, and **w**^{k}[1] from (1) and (2) according to the nature of MMSE estimation. Note that in a massive MIMO system, we assume that *M* is large enough to meet Lemma 1.

**Lemma 1.** [16] *Let* **p** *and* **q** *be two mutually independent L* × 1 *vectors whose elements are i.i.d.* $\mathcal{C}\mathcal{N}(0,1)$ *random variables. Then* $\underset{L\to \infty}{\text{lim}}{\mathbf{p}}^{H}\mathbf{p}/L\stackrel{\mathrm{a.s.}}{=}1$ *and* $\underset{L\to \infty}{\text{lim}}{\mathbf{p}}^{H}\mathbf{q}/L\stackrel{\mathrm{a.s.}}{=}0$, *where* ‘$\stackrel{\mathrm{a.s.}}{=}$’ *denotes almost sure convergence.*
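Lemma 1 can be illustrated numerically: as the vector length grows, the normalized self inner product approaches 1 and the normalized cross inner product approaches 0. The following is a minimal sketch using only the standard library; the helper names are illustrative.

```python
import math
import random


def random_cn_vector(rng, length):
    """Vector of i.i.d. CN(0, 1) entries."""
    s = math.sqrt(0.5)
    return [complex(rng.gauss(0.0, s), rng.gauss(0.0, s)) for _ in range(length)]


def normalized_inner(p, q):
    """Return p^H q / L."""
    return sum(a.conjugate() * b for a, b in zip(p, q)) / len(p)


rng = random.Random(1)
for length in (10, 100, 1000, 10000):
    p = random_cn_vector(rng, length)
    q = random_cn_vector(rng, length)
    # The first value should drift toward 1 and the second toward 0.
    print(length, abs(normalized_inner(p, p)), abs(normalized_inner(p, q)))
```

The fluctuation of both quantities shrinks roughly as $1/\sqrt{L}$, which is why the paper's asymptotic statements are reasonable approximations only when *M* is large.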

Then we consider the uplink data detection of the first user after the BS has obtained all users’ channel estimates. The received signal at the BS when all users transmit uplink data is

where$\mathbf{z}[1]\in {\mathbb{C}}^{M\times 1}$ is the additive white Gaussian noise. Following the steps in (4), the BS detects${\widehat{q}}_{1}[1]$ as follows:

Equations (4) and (6) show that the uplink data for the first user in the first coherence interval can be exactly detected in a massive MIMO system with the proposed pilot design.

#### 3.1.2 Uplink of the *k* th (*k* > 1) user

As for the other users in the first coherence interval, the main difference lies in the channel estimation period.

We first consider the *k* th user’s channel estimate. Given **y**^{k}[1] in (3), the BS removes the data interference caused by the *t* th user from **y**^{k}[1] by using${\widehat{\mathbf{g}}}_{t}[1]$ and$\widehat{{q}_{t}^{k}}[1]$ (as revealed in (4),$\widehat{{q}_{t}^{k}}[1]$ is assumed to be accurate and equal to${q}_{t}^{k}[1]$, *k* > *t* ≥ 1), which are obtained before the present period. The processed received signal is

where${\mathbf{n}}^{k}[1]=\left(\sqrt{{\rho}_{u}}\sum _{t=1}^{k-1}{\stackrel{~}{\mathbf{g}}}_{t}[1]{q}_{t}^{k}[1]+{\mathbf{w}}^{k}[1]\right){\phi}_{k}^{\ast}$ means the residual interference and noise during the *k* th user’s channel estimation in the first coherence interval. Based on the assumption that$\{{q}_{t}^{k}[1]\}$ is an independent Gaussian sequence, assuming${\stackrel{~}{\mathbf{g}}}_{t}[1]\sim \mathcal{C}\mathcal{N}\left(\mathbf{0},{\epsilon}_{t,1}^{2}{\mathbf{I}}_{M}\right)$, we have

Obviously, **n**^{k}[1] is independent of **g**_{l}[1] for any *l* (1 ≤ *l* ≤ *K*). Given **r**^{k}[1] in (7), the MMSE estimate of **g**_{k}[1] is

where **A**_{k}[1] is defined as

We exploit the independence between **g**_{k}[1] and **n**^{k}[1] in $\stackrel{\text{(b)}}{=}$ and also apply it in $\stackrel{\text{(c)}}{=}$ to obtain **Q**_{k}[1].

By substituting (10) and (11) into (9), we obtain

Again, the channel can be decomposed as${\mathbf{g}}_{k}[1]={\stackrel{~}{\mathbf{g}}}_{k}[1]+{\widehat{\mathbf{g}}}_{k}[1]$. From the properties of MMSE estimation,${\widehat{\mathbf{g}}}_{k}[1]\sim \mathcal{C}\mathcal{N}\left(\mathbf{0},{\sigma}_{k,1}^{2}{\mathbf{I}}_{M}\right)$ and${\stackrel{~}{\mathbf{g}}}_{k}[1]\sim \mathcal{C}\mathcal{N}\left(\mathbf{0},{\epsilon}_{k,1}^{2}{\mathbf{I}}_{M}\right)$ is the independent estimation error, where${\epsilon}_{k,1}^{2}={\beta}_{k}-{\sigma}_{k,1}^{2}$ and

Next we consider the uplink data detection of the *k* th user when the *k*_{1}th (*k*_{1} > *k*) user transmits its pilot. From (7) and (12), ${\widehat{\mathbf{g}}}_{k}[1]$ and **g**_{t}[1] (*t* ≠ *k*) are independent due to the fact that **g**_{t}[1] is independent of **g**_{k}[1] and **n**^{k}[1]. Following the steps in (4), the uplink data detection for *k*_{1} > *k* equals

As for the uplink data detection of the *k* th user when all users transmit uplink data, we can achieve a similar result to (6):

From (14) and (15), the uplink data for the *k* th user in the first coherence interval can also be precisely detected in a massive MIMO system with the proposed pilot design.
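The SIC-based estimation of Section 3.1.2 can be sketched for two users. The signal model below is an assumption reconstructed from the text rather than the paper's equations (3) and (7): user 2 sends a unit-modulus pilot while user 1 sends one data symbol, the BS detects that symbol by MRC with ${\widehat{\mathbf{g}}}_{1}$, subtracts the reconstructed data signal, and scales the residual. For simplicity the sketch uses a fixed QPSK symbol (the paper assumes Gaussian data), *β*_1 = *β*_2 = 1, and least-squares scaling instead of the MMSE filter. All names are illustrative.

```python
import cmath
import math
import random


def cn_vector(rng, length, variance=1.0):
    s = math.sqrt(variance / 2.0)
    return [complex(rng.gauss(0.0, s), rng.gauss(0.0, s)) for _ in range(length)]


def inner(a, b):
    """Return a^H b."""
    return sum(x.conjugate() * y for x, y in zip(a, b))


def nmse(estimate, truth):
    err = sum(abs(e - t) ** 2 for e, t in zip(estimate, truth))
    return err / sum(abs(t) ** 2 for t in truth)


def two_user_sic_demo(M=256, rho_p=10.0, rho_u=10.0, seed=0):
    rng = random.Random(seed)
    g1, g2 = cn_vector(rng, M), cn_vector(rng, M)
    # Slot 1: only user 1 sends its pilot; MMSE estimate of g1 (beta_1 = 1).
    y1 = [math.sqrt(rho_p) * g + w for g, w in zip(g1, cn_vector(rng, M))]
    g1_hat = [math.sqrt(rho_p) / (1.0 + rho_p) * y for y in y1]
    # Slot 2: user 2 sends its pilot while user 1 sends the data symbol q1.
    q1 = cmath.exp(1j * math.pi / 4)  # unit-power QPSK point (assumption)
    y2 = [math.sqrt(rho_p) * b + math.sqrt(rho_u) * a * q1 + w
          for a, b, w in zip(g1, g2, cn_vector(rng, M))]
    # MRC detection of q1, then subtraction of the reconstructed data signal.
    q1_hat = inner(g1_hat, y2) / (math.sqrt(rho_u) * inner(g1_hat, g1_hat).real)
    residual = [y - math.sqrt(rho_u) * g * q1_hat for y, g in zip(y2, g1_hat)]
    g2_sic = [r / math.sqrt(rho_p) for r in residual]  # with cancellation
    g2_raw = [y / math.sqrt(rho_p) for y in y2]        # without cancellation
    return nmse(g2_sic, g2), nmse(g2_raw, g2)


nmse_sic, nmse_raw = two_user_sic_demo()
print("NMSE with SIC:", nmse_sic, " without SIC:", nmse_raw)
```

Under these assumptions the residual error after cancellation is dominated by the estimation error of user 1's channel and the thermal noise, whereas without cancellation the full data signal of user 1 contaminates the estimate.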

#### 3.1.3 Downlink of all users

After all users complete the uplink pilot and data transmission, the BS starts downlink data transmission, which is the same for all users. The BS applies matched filter (MF)-based beamforming since it works well enough in massive MIMO downlink data transmission[17]. Without loss of generality, we consider the downlink data transmission of the *k* th user. The *k* th user receives

where ${y}_{k}^{d}[1]\in \mathbb{C}$ is the received signal and ${\upsilon}_{k}[1]\in \mathbb{C}$ is the unit-variance AWGN. **p**_{j}[1] is the beamforming vector for the *j* th user and is defined as

Because ${\widehat{\mathbf{g}}}_{k}[1]$ is independent of ${\stackrel{~}{\mathbf{g}}}_{k}[1]$ and **g**_{k}[1] is independent of ${\widehat{\mathbf{g}}}_{j}[1]$ (*j* ≠ *k*), we apply these facts to equality $\stackrel{\text{(d)}}{=}$ in the following and obtain the downlink data estimate of the *k* th user:

where we divide both the denominator and numerator of the last term in$\stackrel{\text{(d)}}{=}$ by and it vanishes as *M* → *∞*. Equation (18) shows that the downlink data can also be accurately detected.

### 3.2 Communication in the *i* th(*i* > 1) coherence interval

Note that, different from the first coherence interval, all users show the same transmission pattern in the *i* th(*i* > 1) coherence interval as shown in Figure2. Hence, without loss of generality, we take the *k* th user for instance. Its channel estimation is contaminated by all the other users’ uplink data. As for the uplink and downlink data transmissions, they are similar to the procedures elaborated for the first coherence interval in the above subsection and we omit the detailed description due to space limitation. To be concise, we here briefly introduce the processing procedure of channel estimation in the following.

The received signal at the BS when the *k* th user transmits a pilot is

Then the BS gets the uplink data estimate $\widehat{{q}_{t}^{k}}[i]$ of the *t* th (*t* ≠ *k*) user:

which uses the known channel estimate from the present coherence interval when 1 ≤ *t* ≤ *k* - 1 and uses that from the previous one when *k* ≤ *t* ≤ *K*. Hence (20) is separately expressed based on two conditions.

Here, we assume that the variation of channel vectors during a coherence interval is negligible. Any two channel vectors between the same user and the BS can be treated as approximately equal if they are separated by no more than *T*. Take the channel vector from the second user to the BS for example. Let **g**_{A} and **g**_{B} respectively denote the channel vectors at time slots ‘A’ and ‘B’ in Figure 2. Since the distance between ‘A’ and ‘B’ is no larger than *T*, it is reasonable to approximately treat **g**_{B} as **g**_{A}, i.e., **g**_{B} ≈ **g**_{A}. Therefore, as shown in (20) for the *k* ≤ *t* ≤ *K* case, the BS utilizes the estimate of **g**_{A} to detect the second user’s uplink data at time slot ‘B’. This is different from the uplink data detection in (14), which is expressed based on only one condition. The subsequent channel estimation process is performed in a similar way to that in Section 3.1. In this way, the *k* ≤ *t* ≤ *K* case for (20) constitutes the main difference between the analysis for the *i* th (*i* > 1) coherence interval and that of Section 3.1.

Equation (20) can also be extended similarly to the form in (4) and then we arrive at the conclusion that$\widehat{{q}_{t}^{k}}[i]\stackrel{\mathrm{a.s.}}{=}{q}_{t}^{k}[i]$.

Similar to the procedure in (7), the BS first removes the data contamination from the received signal and generates

where **n**^{k}[*i*] means the residual interference and noise during the *k* th user’s channel estimation in the *i* th(*i* > 1) coherence interval. Following the steps in (8), (9), (10), and (11), the BS computes

and

Then the BS gets an MMSE estimate:

Again, the channel can be decomposed as${\mathbf{g}}_{k}[i]={\stackrel{~}{\mathbf{g}}}_{k}[i]+{\widehat{\mathbf{g}}}_{k}[i]$. From the properties of MMSE estimation,${\widehat{\mathbf{g}}}_{k}[i]\sim \mathcal{C}\mathcal{N}\left(\mathbf{0},{\sigma}_{k,i}^{2}{\mathbf{I}}_{M}\right)$,${\stackrel{~}{\mathbf{g}}}_{k}[i]\sim \mathcal{C}\mathcal{N}\left(\mathbf{0},{\epsilon}_{k,i}^{2}{\mathbf{I}}_{M}\right)$ is the independent estimation error, where${\epsilon}_{k,i}^{2}={\beta}_{k}-{\sigma}_{k,i}^{2}$ and

Equation (26) is the unified form of${\sigma}_{k,i}^{2}$ for any *i* th(*i* ≥ 1) coherence interval by comparing (13) with (26). Analysis in the next section shows that${\sigma}_{k,i}^{2}$ will be useful in the performance analysis of our proposed pilot design.

## 4 Performance analysis

Given the processing procedure elaborated in the above section, we are now ready to conduct theoretical performance analysis of our proposed scheme. From Figure2, the proposed pilot design saves more resource for data transmission compared with the conventional orthogonal one when *K* grows large along with *M*. However, due to the semi-orthogonal pilot pattern, SIC-based channel estimation is adopted, leading to a larger estimation error than the conventional one. In this section, performance analysis is presented to show explicitly the potential benefits that can be achieved by our proposed pilot design.

Specifically, we analyze the performance of massive MIMO systems in terms of both the uplink and downlink achievable rates. The approximation of the *k* th user’s achievable rate[18] is defined as

where *γ*_{k} is the associated signal-to-interference-plus-noise ratio (SINR); *S*, *I*, and *N* stand for the power of the signal, interference, and noise, respectively. Next, the achievable rate is calculated by evaluating the power of these items term by term.
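The rate definition itself is not reproduced in this copy. A common form that matches the description above, and which is assumed here rather than quoted from the paper, is $R_k=\log_2(1+\gamma_k)$ with $\gamma_k = S/(I+N)$ evaluated on average powers. A minimal sketch under that assumption:

```python
import math


def achievable_rate(signal_power, interference_power, noise_power):
    """Return log2(1 + S / (I + N)) in bits/s/Hz (assumed form of the rate)."""
    denominator = interference_power + noise_power
    if denominator <= 0:
        raise ValueError("interference plus noise power must be positive")
    sinr = signal_power / denominator
    return math.log2(1.0 + sinr)


# Illustrative numbers only: S = 15, I = 2, N = 1 gives an SINR of 5.
print(achievable_rate(15.0, 2.0, 1.0))
```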

### 4.1 Downlink analysis

Here we first conduct downlink performance analysis since it is much simpler than the uplink one. Assume that the *k* th user is of interest. Recall the downlink communication in (16) and replace **p**_{j}[1] with **p**_{j}[*i*] (*i* ≥ 1) to formulate the downlink communication for any *i* th coherence interval:

where the signal, interference, and noise terms are marked with *S*, *I*, and *N*, and${\mathbf{g}}_{k}^{H}[i]{\mathbf{p}}_{k}[i]$ is assumed to be accurately estimated at the *k* th user.

First, we derive the power of the intended signal in (28), denoted by${S}_{k,i}^{d}$. By applying Khintchine’s law of large numbers[19] and with some basic manipulations, we have the following equality in$\stackrel{\text{(e)}}{=}$ with high probability for large *M*. Thus we arrive at

where E[·] means the expectation operator and **p**_{k}[*i*] is the beamforming vector:

The scalar ${\alpha}_{k,i}=\frac{\Vert {\widehat{\mathbf{g}}}_{k}[i]\Vert}{\sqrt{M}}$ is a normalization factor[10]. By using Lemma 1, we can get the value of *α*_{k,i} as *M* → *∞*:

Besides,$\stackrel{\text{(f)}}{=}$ exploits Lemma 2 in the following and$\stackrel{\text{(h)}}{=}$ utilizes the fact that${\sigma}_{k,i}^{2}+{\epsilon}_{k,i}^{2}={\beta}_{k}$.

**Lemma 2.** *Let* **x** *and* **y** *be two mutually independent L* × 1 *vectors whose elements are i.i.d.* $\mathcal{C}\mathcal{N}(0,\sigma^{2})$ *random variables. Then* E[|**x**^{H}**x**|^{2}] = (*L*^{2} + *L*)*σ*^{4} *and* E[|**x**^{H}**y**|^{2}] = *L* *σ*^{4}.

*Proof.* The result follows directly by expanding the inner products element-wise and taking expectations.
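The two moments in Lemma 2 can be sanity-checked by simulation. The first follows from $\mathrm{E}[|x_l|^4]=2\sigma^4$ for a $\mathcal{C}\mathcal{N}(0,\sigma^2)$ entry, which gives $2L\sigma^4+L(L-1)\sigma^4$; the second keeps only the *L* diagonal terms of the double sum. The Monte Carlo sketch below uses illustrative helper names and small sizes to keep the run short.

```python
import math
import random


def cn_vector(rng, length, variance):
    s = math.sqrt(variance / 2.0)
    return [complex(rng.gauss(0.0, s), rng.gauss(0.0, s)) for _ in range(length)]


def lemma2_monte_carlo(length, variance, trials, seed=0):
    """Estimate E|x^H x|^2 and E|x^H y|^2 by simulation."""
    rng = random.Random(seed)
    self_sum = 0.0
    cross_sum = 0.0
    for _ in range(trials):
        x = cn_vector(rng, length, variance)
        y = cn_vector(rng, length, variance)
        self_sum += sum(abs(a) ** 2 for a in x) ** 2
        cross_sum += abs(sum(a.conjugate() * b for a, b in zip(x, y))) ** 2
    return self_sum / trials, cross_sum / trials


L, var = 4, 2.0  # sigma^2 = 2, so sigma^4 = 4
self_mc, cross_mc = lemma2_monte_carlo(L, var, trials=20000)
print(self_mc, (L ** 2 + L) * var ** 2)  # simulated vs. (L^2 + L) sigma^4
print(cross_mc, L * var ** 2)            # simulated vs. L sigma^4
```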

Using a similar technique to (29), we have the power of the interference in (28), denoted by${I}_{k,i}^{d}$, as follows:

where $\stackrel{\text{(i)}}{=}$ is based on the independence between **g**_{k}[*i*] and ${\widehat{\mathbf{g}}}_{j}[i]$ (*k* ≠ *j*) and the fact that the covariance matrix of **g**_{j}[*i*] is *β*_{j}**I**_{M}.

The noise power in (28) can be readily obtained by${N}_{k,i}^{d}=\mathrm{E}[{\upsilon}_{k}^{\ast}[i]{\upsilon}_{k}[i]]=1$. Then the downlink achievable rate of the *k* th user in the *i* th coherence interval is

### 4.2 Uplink analysis

There are three kinds of uplink transmission patterns in massive MIMO systems with the proposed pilot design. The first kind of uplink communication contains only uplink data, i.e., case ① in Figure 2, while the other two kinds of uplink communication, i.e., cases ② and ③ in Figure 2, cover both the uplink data and pilot. Here we take the first kind of uplink communication as an example since the analyses of the other two kinds are similar.

Assume that the *k* th user is of interest. Write down the MRC processed received signal at the BS based on (5) and replace ‘1’ with ‘*i*’ (*i* ≥ 1) to formulate the uplink data transmission for any *i* th coherence interval. It gives

where the signal, interference (composed of two parts) and noise terms are marked with *S*, *I*_{1}, *I*_{2}, and *N*. By applying the similar analytical procedure for the downlink, it is not hard to calculate the power of the desired signal, the interference and the noise in (34) term by term, which are respectively denoted by${S}_{k,i}^{u}$,${I}_{k,i}^{u}$, and${N}_{k,i}^{u}$. The power of the desired signal is

And the power of the interference in (34) is

Then the noise power in (34) is readily achieved:

Substituting the analyzed${S}_{k,i}^{u}$,${I}_{k,i}^{u}$, and${N}_{k,i}^{u}$ into (27), the *k* th user’s uplink achievable rate of case ① in Figure2 is

As for the second kind of uplink communication, we can write down the processed receiving signal for the *k* th user during the *k*_{1}th user’s first channel estimation as follows:

The main difference of (39) from (34) lies in the interference term. Hence we only compute the power of the interference, denoted by${I}_{k,1}^{{u}_{{k}_{1}}}$:

Then the *k* th user’s uplink achievable rate when the *k*_{1}th(*k*_{1} > *k*) user transmits a pilot in the first coherence interval is

Treating the third kind of uplink communication in a similar way, we obtain the uplink rate of the *k* th user when the *k*_{1}th user transmits its pilot in the *i* th (*i* > 1) coherence interval as

where${\sigma}^{2}=\left\{\begin{array}{rl}{\sigma}_{k,i}^{2},& k<{k}_{1}\\ {\sigma}_{k,i-1}^{2},& k>{k}_{1}\end{array}\right.$.

### 4.3 Performance evaluation

Observing (33), (38), (41), and (42), the channel estimation accuracy ${\sigma}_{k,i}^{2}$ strongly affects the uplink and downlink achievable rates. The channel estimates of the conventional design are more accurate than those of the proposed one. However, an inherent merit of the proposed design is that it leaves more resources available for data communication, as implied in (41) and (42), and this changes the performance comparison. Assume that the number of coherence intervals under consideration is *N*_{c}. Then from Figure 2, the uplink achievable rate of the proposed design, in bps/Hz, is

Meanwhile, from (33), the downlink achievable rate (bps/Hz) of the proposed one is

For comparison, we also provide the uplink and downlink achievable rates${\stackrel{~}{R}}_{\mathit{\text{op}}}^{u}$ and${\stackrel{~}{R}}_{\mathit{\text{op}}}^{d}$ (bps/Hz) of the conventional orthogonal one in Figure1 as follows:

where the *k* th user’s uplink and downlink achievable rates of the conventional orthogonal pilots${R}_{\mathit{\text{op}},k}^{u}$ and${R}_{\mathit{\text{op}},k}^{d}$ are

Here, the variance of channel estimate${\sigma}_{\mathit{\text{op}},k}^{2}$ for the conventional orthogonal pilots as shown in Figure1 can be easily obtained and${\sigma}_{\mathit{\text{op}},k}^{2}=\frac{{\rho}_{p}{\tau}_{\mathit{\text{op}}}{\beta}_{k}^{2}}{1+{\rho}_{p}{\tau}_{\mathit{\text{op}}}{\beta}_{k}}$[16].
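The accuracy gap between the two designs can be seen directly from the variance expressions. The sketch below evaluates ${\sigma}_{\mathit{\text{op}},k}^{2}$ from the formula above and compares it with the single-symbol, interference-free value ${\sigma}_{1,1}^{2}$ from Section 3.1.1; the numbers are illustrative and the function names are hypothetical.

```python
import math


def sigma2_orthogonal(rho_p, tau_op, beta):
    """Estimate variance with an orthogonal pilot of length tau_op."""
    return rho_p * tau_op * beta ** 2 / (1.0 + rho_p * tau_op * beta)


def sigma2_single_symbol(rho_p, beta):
    """Estimate variance with one interference-free pilot symbol."""
    return rho_p * beta ** 2 / (1.0 + rho_p * beta)


rho_p, beta, tau_op = 0.1, 1.0, 10  # pilot SNR of -10 dB, K = tau_op = 10
print(sigma2_orthogonal(rho_p, tau_op, beta))
print(sigma2_single_symbol(rho_p, beta))
```

A longer pilot raises the effective pilot SNR by the factor *τ*_{op}, so the orthogonal estimate is closer to *β*_{k}; the proposed design trades part of this accuracy for the *τ*_{op} − 1 symbols it frees for data.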

*Remark* 2. By comparing (43) with (45), the additional uplink data transmission, i.e., cases ② and ③ in Figure 2, has a strong positive effect on the uplink achievable rate of the proposed pilot design for a small *T*. Although this comes at a cost in channel estimation accuracy, by exploiting the merit of massive MIMO, the proposed pilot design can achieve a better tradeoff between pilot resource consumption and channel estimation accuracy. It outperforms the conventional one in many scenarios.

*Remark* 3. Note that from the elaboration in Section 3, compared with the conventional orthogonal pilot design, the main difference of our proposed one lies in the subtraction of data interference from the received signal at the BS, which can be easily completed without much additional complexity.

For a better view of the system performance comparison,${C}_{\mathit{\text{pp}}}^{s}={\stackrel{~}{R}}_{\mathit{\text{pp}}}^{d}+{\stackrel{~}{R}}_{\mathit{\text{pp}}}^{u}$ and${C}_{\mathit{\text{op}}}^{s}={\stackrel{~}{R}}_{\mathit{\text{op}}}^{d}+{\stackrel{~}{R}}_{\mathit{\text{op}}}^{u}$ are introduced to respectively stand for the system overall achievable rates of the proposed pilot design and the conventional one.

## 5 Asymptotic performance analysis

In this section, we look at the performance at asymptotically low and high pilot and data SNRs. For simplicity, assume that $\frac{{\rho}_{u}}{{\rho}_{p}}$ and $\frac{{\rho}_{d}}{{\rho}_{p}}$ are fixed. Hence, *ρ*_{u} → 0 and *ρ*_{d} → 0 as *ρ*_{p} → 0, and likewise *ρ*_{u} → *∞* and *ρ*_{d} → *∞* as *ρ*_{p} → *∞*. We can use *ρ* to stand for *ρ*_{p}, *ρ*_{u}, and *ρ*_{d} when considering asymptotically low and high SNRs. Furthermore, the SNR of our proposed pilot design is defined as ${\rho}_{\mathit{\text{pp}}}=\frac{2T{N}_{c}}{2T{N}_{c}-K+1}{\rho}_{\mathit{\text{op}}}$ to provide an equal overall system power, where *ρ*_{op} denotes the SNR of the conventional orthogonal one.
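The power normalization between the two designs is a simple ratio, evaluated below with exact fractions. The function name and the example parameters are illustrative only.

```python
from fractions import Fraction


def snr_scaling(T, N_c, K):
    """Return rho_pp / rho_op = 2*T*N_c / (2*T*N_c - K + 1)."""
    denominator = 2 * T * N_c - K + 1
    if denominator <= 0:
        raise ValueError("2*T*N_c must exceed K - 1")
    return Fraction(2 * T * N_c, denominator)


# Example: T = 20 symbols, N_c = 5 coherence intervals, K = 10 users.
scale = snr_scaling(20, 5, 10)
print(scale, float(scale))
```

The factor approaches 1 as the number of coherence intervals *N*_{c} grows, so the adjustment matters mainly over short observation windows.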

### 5.1 High SNR analysis

In order to evaluate the performance of two pilot designs at an asymptotically high SNR, we first reformulate the SINR value in (38) by some manipulations as follows:

where${\lambda}_{\mathit{\text{pp}},k}=\frac{{\beta}_{k}^{2}}{\left(\frac{1}{{\rho}_{\mathit{\text{pp}}}}+\sum _{j=1}^{K}{\beta}_{j}\right)\left(\frac{{\sigma}_{{\mathbf{n}}^{k}[i]}^{2}}{{\rho}_{\mathit{\text{pp}}}}+{\beta}_{k}\right)}$ is from (26). Following the steps in (49), the SINR value in (47) is

where${\lambda}_{\mathit{\text{op}},k}=\frac{{\beta}_{k}^{2}}{\left(\frac{1}{{\rho}_{\mathit{\text{op}}}}+\sum _{j=1}^{K}{\beta}_{j}\right)\left(\frac{1}{{\rho}_{\mathit{\text{op}}}}+{\beta}_{k}\right)}$. Furthermore, based on (22) and the fact that$\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{\epsilon}_{t,i}^{2}=0$ for any *i* and *t*, we have

It is easily seen that $\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{\lambda}_{\mathit{\text{pp}},k}=\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{\lambda}_{\mathit{\text{op}},k}=\frac{{\beta}_{k}}{\sum _{j=1}^{K}{\beta}_{j}}$. It follows that $\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{\gamma}_{k,i}^{u}=\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{\gamma}_{\mathit{\text{op}},k}^{u}$. Define ${\bar{R}}_{k,\mathit{\text{op}}}^{u}=\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{R}_{\mathit{\text{op}},k}^{u}$ and ${\bar{R}}_{k,\mathit{\text{pp}}}^{u}=\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{R}_{k,i}^{u}$. We have

Similar manipulations to (49) are applied to ${R}_{k,1}^{{u}_{{k}_{1}}}$ and ${R}_{k,i}^{{u}_{{k}_{1}}}$ in (41) and (42), respectively, which yields $\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{R}_{k,1}^{{u}_{{k}_{1}}}\ge \underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{R}_{k,i}^{{u}_{{k}_{1}}}={\bar{R}}_{k,\mathit{\text{pp}}}^{u}$.
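The high-SNR limit of *λ*_{op,k} stated earlier in this subsection can be checked numerically from its definition. The large-scale fading values below are arbitrary illustrative numbers, and the function name is hypothetical.

```python
def lambda_op(rho, betas, k):
    """lambda_{op,k} = beta_k^2 / ((1/rho + sum_j beta_j) * (1/rho + beta_k))."""
    total = sum(betas)
    return betas[k] ** 2 / ((1.0 / rho + total) * (1.0 / rho + betas[k]))


betas = [1.0, 0.5, 0.25]  # illustrative large-scale fading coefficients
limit = betas[0] / sum(betas)
for rho_db in (0, 20, 40, 60):
    rho = 10.0 ** (rho_db / 10.0)
    # The value increases with SNR and approaches beta_k / sum_j beta_j.
    print(rho_db, lambda_op(rho, betas, 0), limit)
```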

Then considering the value of ${\stackrel{~}{R}}_{\mathit{\text{pp}}}^{d}$ at asymptotically high SNRs, we first compute the downlink SINR ${\gamma}_{k,i}^{d}$ in (33). Following the steps in (49), we obtain

Applying manipulations similar to those in (48), we arrive at

Therefore, based on (51), it is easily seen that $\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{\gamma}_{k,i}^{d}=\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{\gamma}_{\mathit{\text{op}},k}^{d}$. Define ${\bar{R}}_{k,\mathit{\text{pp}}}^{d}=\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{R}_{k,i}^{d}$ and ${\bar{R}}_{k,\mathit{\text{op}}}^{d}=\underset{{\rho}_{\mathit{\text{pp}}}\to \infty}{\text{lim}}{R}_{\mathit{\text{op}},k}^{d}$, then we have ${\bar{R}}_{k,\mathit{\text{pp}}}^{d}={\bar{R}}_{k,\mathit{\text{op}}}^{d}$.

Thus far, the uplink rates of both pilot designs at high SNR are

And the downlink rates of both pilot designs at high SNR are

Some remarks on the high SNR analysis show the superiority of our proposed pilot design over the conventional one.

### 5.2 Low SNR analysis

Next, we consider the performance of the two pilot designs at asymptotically low SNR. Following the steps in (49) and using the fact that $\sigma_{\mathbf{n}^{k}[i]}^{2}\le 1+\rho_{\text{pp}}\sum_{j=1,j\ne k}^{K}\beta_{j}$, we have

where $\underline{\lambda}_{\text{pp},k}=\frac{\beta_{k}^{2}}{\left(\frac{1}{\rho_{\text{pp}}}+\sum_{j=1}^{K}\beta_{j}\right)^{2}}$. Furthermore, based on $\rho_{\text{pp}}=\frac{2TN_{c}}{2TN_{c}-K+1}\rho_{\text{op}}$, we have

where the last term tends to infinity at asymptotically low SNR. Compared with *λ*_{op,k}, we arrive at the result that $\lim_{\rho_{\text{pp}}\to 0}\gamma_{k,i}^{u}\ge\lim_{\rho_{\text{pp}}\to 0}\gamma_{\text{op},k}^{u}$, which also applies to $\gamma_{k,1}^{u_{k_1}}$ and $\gamma_{k,i}^{u_{k_1}}$ in (41) and (42). Define $\underline{R}_{k,\text{op}}^{u}=\lim_{\rho_{\text{pp}}\to 0}R_{\text{op},k}^{u}$ and $\underline{R}_{k,\text{pp}}^{u}=\min\lim_{\rho_{\text{pp}}\to 0}\left\{R_{k,i}^{u},R_{k,1}^{u_{k_1}},R_{k,i}^{u_{k_1}}\right\}$; then we have $\underline{R}_{k,\text{pp}}^{u}\ge\underline{R}_{k,\text{op}}^{u}$.
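To make the low SNR quantities concrete, the following Python sketch evaluates the power relation between ρ_pp and ρ_op and the lower-bound term λ_pp,k (underlined) defined above. The function names and the example values of *T*, *N*_c, and ρ_op are illustrative assumptions and are not taken from the paper; *K* = 5 users with unit large-scale fading matches the setup used for the numerical results.

```python
def rho_pp_from_rho_op(rho_op, T, N_c, K):
    """Map the orthogonal-pilot SNR rho_op to rho_pp via
    rho_pp = 2*T*N_c / (2*T*N_c - K + 1) * rho_op."""
    total = 2 * T * N_c
    if total - K + 1 <= 0:
        raise ValueError("need 2*T*N_c > K - 1")
    return total / (total - K + 1) * rho_op


def lambda_lower_pp(betas, k, rho_pp):
    """Evaluate beta_k^2 / (1/rho_pp + sum_j beta_j)^2 for user index k (0-based)."""
    return betas[k] ** 2 / (1.0 / rho_pp + sum(betas)) ** 2


# Hypothetical example values: T = 10 symbols, N_c = 2 intervals, linear-scale SNRs.
betas = [1.0] * 5  # K = 5 users with unit large-scale fading
for rho_op in (10.0, 1.0, 0.1, 0.01):
    rho_pp = rho_pp_from_rho_op(rho_op, T=10, N_c=2, K=len(betas))
    print(rho_op, rho_pp, lambda_lower_pp(betas, 0, rho_pp))
```

With these example values the scaling factor is 40/36, so ρ_pp is slightly larger than ρ_op; the factor exceeds 1 whenever *K* > 1.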

As for the value of $\tilde{R}_{\text{pp}}^{d}$ at asymptotically low SNR, we arrive at

Define $\underline{R}_{k,\text{pp}}^{d}=\lim_{\rho_{\text{pp}}\to 0}R_{k,i}^{d}$ and $\underline{R}_{k,\text{op}}^{d}=\lim_{\rho_{\text{op}}\to 0}R_{\text{op},k}^{d}$; then clearly we have $\underline{R}_{k,\text{pp}}^{d}\ge\underline{R}_{k,\text{op}}^{d}$.

In summary, the uplink rates of both pilot designs at low SNR are

The downlink rates of both pilot designs at low SNR are

Based on the above analysis, we obtain Theorem 1, which characterizes the advantage of our proposed pilot design.

**Theorem 1.** *The proposed pilot design outperforms the conventional one for both low and high SNRs*.

*Proof*. Assume that *D*_{pp} = *D*_{op}. Based on (55) to (58), we have

and

Therefore, we arrive at

A similar result at asymptotically low SNR can be obtained by using (62) to (65):

The conclusion in Theorem 1 is independent of the coherence interval length *T* and the number of concerned coherence intervals *N*_{c}. It provides a superior pilot design for scenarios of small or large noise and interference.

## 6 Numerical results

In this section, we present some numerical results on the performance of the proposed pilot design. The system tested here consists of *K* = 5 users at the same distance from the BS. Without loss of generality, assume that the large-scale fading coefficients *β*_{k} are all 1. In practice, the users can be scheduled according to their channel conditions. Unless otherwise specified, the number of BS antennas is set to 128, and *ρ*_{u} = *ρ*_{d} = *ρ*_{p}.

First, we compare the system overall achievable rate of our proposed pilot design with that of the conventional one under different numbers of concerned coherence intervals and varying SNR. Figure 3 shows that, with the proposed pilot design, the system overall achievable rates for all tested *N*_{c} surpass the rate of the conventional one when the SNR varies from -1 dB to 40 dB, which verifies Theorem 1. Further, the system overall achievable rate of the proposed design keeps increasing with SNR at high SNR, whereas that of the conventional pilot design tends to saturate. Besides, a larger *N*_{c} generally leads to a smaller system overall achievable rate due to the accumulated CSI estimation error. However, *N*_{c} has little impact on the comparison of system overall achievable rates between the two pilot designs.

Then, we consider the performance of the two pilot designs with various numbers of BS antennas. Figure 4 shows that the system overall achievable rates of both pilot designs improve as the number of BS antennas grows. This results from the fact that the channel vectors tend to be orthogonal as the number of BS antennas increases. Moreover, the system overall achievable rate of the proposed pilot design exceeds that of the conventional one at each number of BS antennas.

Moreover, Figure 5 illustrates that the performance gain of our proposed pilot design over the conventional one becomes more significant as the number of users increases. This again verifies the superiority of our proposed pilot design when the number of users is large.

According to Figures 3, 4, and 5, we conclude that the proposed pilot design provides a performance improvement over the conventional one at most SNRs under the above scenario, owing to the tradeoff between pilot consumption and channel estimation accuracy.
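The statement that channel vectors tend to become orthogonal as the number of BS antennas grows can be checked with a small Monte Carlo sketch. The code below assumes i.i.d. Rayleigh fading, that is, channel entries drawn as independent CN(0, 1) variables; this model, the helper names, the antenna counts, and the number of trials are all illustrative assumptions rather than the paper's simulation setup. It reports the average normalized inner product |h1^H h2| / M between two independent channel vectors.

```python
import random


def complex_gaussian_vector(m, rng):
    """Draw m i.i.d. CN(0, 1) entries (real and imaginary parts each have variance 1/2)."""
    s = 0.5 ** 0.5
    return [complex(rng.gauss(0.0, s), rng.gauss(0.0, s)) for _ in range(m)]


def mean_normalized_correlation(m, trials, rng):
    """Average |h1^H h2| / m over independent pairs of channel vectors."""
    total = 0.0
    for _ in range(trials):
        h1 = complex_gaussian_vector(m, rng)
        h2 = complex_gaussian_vector(m, rng)
        inner = sum(a.conjugate() * b for a, b in zip(h1, h2))
        total += abs(inner) / m
    return total / trials


rng = random.Random(0)  # fixed seed so the run is repeatable
results = {m: mean_normalized_correlation(m, 200, rng) for m in (8, 32, 128, 512)}
for m, value in results.items():
    print(m, round(value, 4))
```

Under this assumed model the normalized correlation decays roughly in proportion to the inverse square root of the number of antennas, which is consistent with the trend described for Figure 4.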

Further, in Figure 6, we compare the two pilot designs with respect to SNR and the coherence interval length *T*. We assume that the length of downlink data per coherence interval is $\left\lfloor\frac{3\left(T-K\right)}{4}\right\rfloor$, where ⌊·⌋ denotes rounding down to the nearest integer. Figure 6 shows that the proposed pilot design outperforms the conventional one at low and high SNRs no matter how long the coherence interval is, which also coincides with Theorem 1. Besides, the shorter the coherence interval, the larger the SNR region in which the proposed pilot design achieves the higher system overall achievable rate. This further supports the motivation and effectiveness of our proposed pilot design.

Generally, massive MIMO in many cases works at low SNRs. We have analyzed the performance at low SNRs both theoretically and numerically. Theorem 1 indicates that our proposed pilot design outperforms the conventional one at low SNRs, which is validated by the simulation results in Figure 6.
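As a quick worked example of the downlink data length assumed for Figure 6, the sketch below evaluates the floor of 3(*T* − *K*)/4 for a few coherence interval lengths. The function name, the guard on *T*, and the chosen values of *T* are hypothetical; *K* = 5 follows the simulation setup.

```python
def downlink_data_length(T, K):
    """Return floor(3 * (T - K) / 4), the assumed downlink data length per coherence interval."""
    if T < K:
        raise ValueError("coherence interval T must be at least K")
    return (3 * (T - K)) // 4


# K = 5 as in the simulation setup; the T values are illustrative only.
lengths = {T: downlink_data_length(T, 5) for T in (10, 20, 40)}
print(lengths)
```

Integer floor division avoids the rounding issues that can arise when flooring a floating-point quotient.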

The high SNR analysis in Theorem 1 and in Figures 3 and 6 is motivated by the consideration that massive MIMO applies not only to future wireless communication systems but also to current long term evolution/long term evolution-advanced (LTE/LTE-A) systems. For systems like LTE/LTE-A, the operating region in terms of SNR varies widely, for example, from -3 dB to 30 dB [20]. In particular, users located in the proximity of the BS experience a relatively high SNR. Moreover, users are more likely to experience high SNR transmissions with the emerging small cell deployment with reduced cell sizes. The combination of small cells and massive MIMO [21] could lead to a high SNR scenario. In Theorem 1, the high SNR analysis validates the application of our proposed pilot design to these scenarios. Finally, the analysis for both low and high SNRs presents a complete performance comparison between the two pilot designs.

## 7 Conclusions

This paper proposes a semi-orthogonal pilot design using SIC in a TDD massive MIMO system, which makes full use of the asymptotic channel orthogonality. The performance of the proposed pilot design is analyzed both theoretically and numerically. Simulation results show that the proposed pilot design outperforms the conventional orthogonal pilots. In particular, for low or high SNRs with any coherence interval length, the superiority of our proposed pilot design is theoretically proven.

## References

1. Larsson EG, Tufvesson F, Edfors O, Marzetta TL: Massive MIMO for next generation wireless systems. *IEEE Commun* 2014, 52(2):186-195.
2. Marzetta TL: Noncooperative cellular wireless with unlimited numbers of base station antennas. *IEEE Trans. Wireless Commun* 2010, 9(11):3590-3600.
3. Hoydis J, ten Brink S, Debbah M: Massive MIMO in the UL/DL of cellular networks: How many antennas do we need? *IEEE J. Select. Areas Commun* 2013, 31(2):160-171.
4. Ngo HQ, Larsson EG, Marzetta TL: Massive MU-MIMO downlink TDD systems with linear precoding and downlink pilots. In *Proc. Annual Allerton Conference on Communication, Control, and Computing (Allerton)*. Monticello, IL; 2013:293-298.
5. Jose J, Ashikhmin A, Marzetta TL, Vishwanath S: Pilot contamination problem in multi-cell TDD systems. In *Proc. IEEE International Symposium on Information Theory (ISIT)*. Seoul, Korea; 2009:2184-2188.
6. Xiang Z, Tao M, Wang X: Massive MIMO multicasting in noncooperative cellular networks. *IEEE J. Select. Areas Commun* 2014, 32(6):1180-1193.
7. Guo K, Guo Y, Fodor G, Ascheid G: Uplink power control with MMSE receiver in multi-cell MU-massive-MIMO systems. In *Proc. IEEE ICC*. Sydney, Australia; 2014.
8. Jose J, Ashikhmin A, Marzetta TL, Vishwanath S: Pilot contamination and precoding in multi-cell TDD systems. *IEEE Trans. Wireless Commun* 2011, 2640-2651.
9. Yin H, Gesbert D, Filippou M, Liu Y: A coordinated approach to channel estimation in large-scale multiple-antenna systems. *IEEE J. Select. Areas Commun* 2013, 31(2):264-273.
10. Fernandes F, Ashikhmin A, Marzetta TL: Inter-cell interference in noncooperative TDD large scale antenna systems. *IEEE J. Select. Areas Commun* 2013, 31(2):192-201.
11. Ashikhmin A, Marzetta T: Pilot contamination precoding in multi-cell large scale antenna systems. In *Proc. IEEE ISIT*. Cambridge, USA; 2012:1137-1141.
12. Hu A, Lv T, Gao H, Lu Y, Liu E: Pilot design for large-scale multi-cell multiuser MIMO systems. In *Proc. IEEE ICC*. Budapest, Hungary; 2013:5381-5385.
13. Rusek F, Persson D, Lau BK, Larsson EG, Marzetta TL, Edfors O, Tufvesson F: Scaling up MIMO: Opportunities and challenges with very large arrays. *IEEE Sig. Proc. Mag* 2013, 30(1):40-60.
14. Liu X, Lu S, Bialkowski ME, Hui HT: MMSE channel estimation for MIMO system with receiver equipped with a circular array antenna. In *Proc. Asia-Pacific Microwave Conference (APMC)*. Bangkok, Thailand; 2007:1-4.
15. Kay SM: *Fundamentals of Statistical Signal Processing: Theory*. Prentice-Hall, Englewood Cliffs, NJ; 2013.
16. Ngo HQ, Larsson EG, Marzetta TL: Energy and spectral efficiency of very large multiuser MIMO systems. *IEEE Trans. Wireless Commun* 2013, 61(4):1436-1449.
17. Yang H, Marzetta TL: Performance of conjugate and zero-forcing beamforming in large-scale antenna systems. *IEEE J. Select. Areas Commun* 2013, 31(2):172-179.
18. Wu X, Xu W: Downlink performance analysis with enhanced multiplexing gain in multi-cell large-scale MIMO under pilot contamination. In *Proc. IEEE WCNC*. Istanbul, Turkey; 2014.
19. Khuri AI: *Advanced Calculus With Applications In Statistics*. John Wiley and Sons, New York; 2003.
20. Evolved universal terrestrial radio access (E-UTRA); user equipment (UE) radio transmission and reception (release 10). *3GPP TS 36.101 V10.3.0* 2011.
21. Emil MK Bjornson, Debbah M: Massive MIMO and small cells: improving energy efficiency by optimal soft-cell coordination. In *Proc. ICT*. Casablanca, Morocco; 2013.
22. Zheng X, Zhang H, Xu W, You X: Semi-orthogonal pilot design for massive MIMO systems using successive interference cancellation. In *Proc. IEEE Globecom*. Austin, USA; 2014.

## Acknowledgements

Part of this paper will be presented at the IEEE Globecom, Austin, USA, Dec. 2014 [22]. This work was supported by the 973 Program under 2013CB329203, the NSFC under 61471114 and 61223001, and the Important National Science & Technology Specific Projects 2012ZX03001038 and 2013ZX03003016.

## Additional information

### Competing interests

The authors declare that they have no competing interests.

### Keywords

- Massive MIMO
- Multiuser
- Semi-orthogonal pilot design
- Interference cancellation