On massive MIMO performance with semi-orthogonal pilot-assisted channel estimation

With the rapidly increasing demand for high-speed data transmission and a growing number of terminals, massive multiple-input multiple-output (MIMO) has been shown promising to meet the challenges owing to its high spectrum efficiency. Although massive MIMO can efficiently improve the system performance, usage of orthogonal pilots and growing terminals causes large resource consumption especially when the coherence interval is short. This paper proposes a semi-orthogonal pilot design with simultaneous data and pilot transmission. In the proposed technique, we exploit the asymptotic channel orthogonality in massive MIMO systems, with which a successive interference cancellation (SIC)-based channel estimation is applied to mitigate the mutual interference between data and pilot. We derived the theoretical expressions of the achievable rates in massive MIMO systems with our proposed pilot design. Further discussion on performance verifies the superiority of our proposed pilot design for high or low signal-to-noise-ratios (SNRs) with any coherence interval length. And simulation results show that the proposed pilot design can achieve a significant performance improvement with reduced pilot resource consumption compared with the conventional orthogonal pilots.


Introduction
With a rapidly increasing demand for high data rates, as well as the growing number of serving users, massive multiple-input multiple-output (MIMO) is emerging as a promising technology to meet the challenge by providing a significant increment in reliability and data rate for wireless communications [1][2][3].
For MIMO systems, channel state information (CSI) is crucial for achieving multi-antenna gains.It becomes more challenging in massive MIMO systems due to numerous antennas at the base station (BS).Massive MIMO requires a large number of pilots if frequencydivision duplexing (FDD) is used since the burden for downlink pilots is proportional to the number of BS antennas, while for time-division duplexing (TDD) [3,4], uplink training is an effective method to obtain CSI by exploiting channel reciprocity.Generally, orthogonal pilot patterns are widely used for multi-channel estimation.It is well understood that the length of orthogonal pilots equals at least the number of users in a cell, which is in general much smaller than the number of massive BS antennas.However, even in TDD, the required resource for orthogonal pilots increases dramatically in a multi-cell massive MIMO system.Moreover, under the restriction of coherence interval duration and increasing user numbers, the same set of orthogonal pilots is reused for adjacent cells, thus pilot contamination [5][6][7] occurs in a muticell MIMO system.When the BS estimates the channel for a particular user, it may obtain a channel estimate contaminated by adjacent cell users that share the same pilot.
It has been revealed in [2,8] that pilot contamination becomes a bottleneck that limits the performance benefits of massive MIMO.To solve this problem, recent studies [9][10][11] proposed various approaches tackling with pilot contamination.Although they tried to alleviate the pilot contamination between multiple cells, they still use orthogonal pilots in a single cell, which implies large pilot resource consumption, especially for short coherence interval.
Considering the pilot resource consumption as well as the importance of channel estimation's accuracy, an efficient pilot design is essential for achieving full potential of massive MIMO systems.However, as far as we know, little attention has been paid to pilot design in a massive MIMO system.An exception [12] studied the pilot sequence design which matters little about pilot resource consumption.Therefore, in this work, we study the problem of an efficient pilot design by exploiting the asymptotic channel orthogonality [13] incorporated with successive interference cancellation (SIC) in massive MIMO systems.
The technical contributions of this work are summarized as follows: We present a novel pilot design with low resource consumption.In the proposed technique, we allow simultaneous data and pilot transmission and insert shifted pilot locations in slots, i.e., different users transmit pilots in different slots.It takes advantage of the asymptotic channel orthogonality for massive MIMO.Hence the mutual interference between data and pilot due to a semi-orthogonal pilot design can be mitigated by SIC.Numerical results show that the proposed pilot design outperforms the conventional orthogonal pilots.In particular, for low or high signal-to-noise-ratios (SNRs), we also theoretically prove the superiority of our proposed pilot design.
The paper is organized as follows.In Section 2, we describe the system model and transmission protocol of conventional massive MIMO systems.Section 3 addresses the transmission scheme of massive MIMO systems with the proposed pilot design.We analyze the achievable rates of both the uplink and downlink in Section 4. In Section 5, we deduce the asymptotic achievable rate as the SNR tends to infinity and zero.In Section 6, numerical results show that the proposed pilot design increases data transmission rates in various scenarios.Section 7 contains our conclusions.

System model with orthogonal pilots
We consider a cellular system composed of one BS with M antennas and K(K M) single-antenna users.Let ρ p , ρ u , and ρ d be the pilot SNR, the uplink SNR, and the downlink SNR, respectively.Denote g k = √ β k h k as the channel vector between the BS and the kth user, where h k , the corresponding small-scale fading vector, is i.i.d CN (0, I M ) and √ β k models the geometric attenuation as well as shadowing effects which is assumed to be constant and known a priori.We assume channel obeys reciprocity in TDD, i.e., the channel factors are the same for both the uplink and downlink, and h k remains constant during a coherence interval of length T.
The transmission scheme of a conventional massive MIMO system with orthogonal pilots is shown in Figure 1, where τ op (τ op ≥ K) and D op respectively represent the length of pilot and downlink data.Following our study, the orthogonal pilots is also referred to as the conventional pilot design and as a benchmark.Since h k changes over coherence intervals, without loss of generality, we take the communication between the BS and users in a coherence interval as an example for deeper analysis.From Figure 1, a coherence interval is organized in three phases: uplink channel estimation period, uplink data transmission period, and downlink data transmission period.
The conventional pilot design in massive MIMO systems utilizing orthogonal pilots can prevent pilot contamination within one cell and obtain relatively accurate channel estimates [2].However, the required pilot overhead is τ op T (τ op ≥ K) for each user in a cell, which is too large especially when T is small and K is large in massive MIMO systems.In the next section, we will propose a possible pilot design which can keep a balance between the efficiency of data transmission and performance of the system.

Semi-orthogonal pilot design
In this section, we propose a semi-orthogonal pilot design with shifted locations, which reduces the pilot overhead while guaranteeing the system performance due to the merit of asymptotic channel orthogonality in massive MIMO systems.
The transmission scheme with the proposed pilot design is shown in Figure 2, where τ pp and D pp respectively denote the length of pilot and downlink data, and q 2  1 [1], q 3 1 [1], q 1 [1] refer to the definitions in Table 1.In the first coherence interval, when the first user transmits a pilot, the other users are mute so that the BS can estimate the first user's channel without contamination from other users.When the second user transmits a pilot, the first user transmits data while the other users still remain quiet, and so forth.In brief, when one user transmits a pilot, other users whose channels have already been estimated can transmit uplink data.And the BS adopts SIC to decontaminate interference from other users by exploiting the estimated channel information and gets all channel estimates.The channel estimates can be used in the following uplink data reception and downlink data transmission.As for the second coherence interval, all users, except the one who transmits a pilot, can transmit uplink data simultaneously during the channel estimation stage.The BS can utilize the known channel information estimated either from the previous or the present time to remove the contamination from other users.
Remark 1.Note that the pilot overhead for each user in the ith(i > 1) coherence interval of the proposed pilot design is 1  T from Figure 2. proposed pilot design, the pilot overhead is K+1 2T , which is also smaller than K T of conventional orthogonal pilots.Due to the difference between the communications in the first and the ith(i > 1) coherence intervals, the uplink and downlink data transmissions as well as the channel estimation are elaborated in detail in the following subsections.Before the elaboration, we first show notational definitions in Table 1, where parameter i represents the ith coherence interval.Besides, we replace g k with g k [i] to signify the channel vector between the BS and the kth user.

Communication in the first coherence interval 3.1.1 Uplink of the first user
Before data transmission, the first symbol of the first coherence interval is reserved for uplink channel estimation.For the first user in the first coherence interval, it transmits pilot while the other users are mute.The received signal at the BS is Then the BS gets an minimum mean squared error (MMSE) [14,15] estimate: Generally, the channel can be decomposed as g 1 [1] = g1 [1] +ĝ 1 [1].From the properties of MMSE estimation, Received signal at the BS when the kth user transmits pilot Unit variance AWGN when the kth user transmits pilot The kth user's pilot The tth user's uplink data signal when the kth user transmits pilot The tth user's uplink data signal after the channel estimation period The tth user's downlink data signal AWGN, additive white Gaussian noise.
Once the BS gets the first user's channel estimate, the first user starts uplink data transmission.By exploiting the merit of massive MIMO, simultaneous pilot and data transmission of other users has little impact on the first user's uplink data detection.
Concerning the first user's uplink data detection when the kth(k > 1) user transmits pilot, the received signal at the BS is The BS applies maximum-ratio combining (MRC) since it is viable and common in massive MIMO uplink data reception [2,16].The uplink data detection of the first user when the kth user transmits pilot equals where we divide both the denominators and numerators in = by M and apply Lemma 1, because ĝ1 [1] is independent of g t [1] (t = 1), g1 [1], and w k [1] from (1) and ( 2) according to the nature of MMSE estimation.Note that in a massive MIMO system, we assume that M is large enough to meet Lemma 1. Lemma 1. [16] Let p and q are two mutually independent L×1 vectors whose elements are i.i.d CN (0,1) random variables.Then lim where ' a.s.

= ' denotes almost sure convergence.
Then we consider the uplink data detection of the first user after the BS has obtained all users' channel estimates.The received signal at the BS when all users transmit uplink data is where z[1] ∈ C M×1 is the additive white Gaussian noise.Following the steps in (4), the BS detects q1 [1] as follows: Equations ( 4) and (6) show that the uplink data for the first user in the first coherence interval can be exactly detected in a massive MIMO system with the proposed pilot design.

Uplink of the kth(k > 1) user
As for the other users in the first coherence interval, the main difference lies in the channel estimation period.
We first consider the kth user's channel estimate.Given y k [1] in (3), the BS removes the data interference caused by the tth user from y k [1] by using ĝt [1] and qk t [1] (as revealed in (4), qk t [1] is assumed to be accurate and equal http://jwcn.eurasipjournals.com/content/2014/1/220 to q k t [1], k > t ≥ 1), which are obtained before the present period.The processed received signal is where the residual interference and noise during the kth user's channel estimation in the first coherence interval.Based on the assumption that {q k t [1] } is an independent Gaussian sequence, assuming gt [1] Obviously, n k [1] is independent of g l [1] for any l(1 ≤ l ≤ K), given r k [1] in (7), the MMSE estimate of g k [1] is where A k [1] is defined as We exploit the independency between g k [1] and = and also apply it in By substituting (10) and ( 11) into ( 9), we obtain Again, the channel can be decomposed as g k [1] = gk [1] +ĝ k [1].From the properties of MMSE estimation, Then we concern the uplink data detection of the kth user when the k 1 th(k 1 > k) user transmits pilot.From ( 7) and (12), ĝk [1] and g t [1] (t = k) are independent due to the fact that g t [1] is independent of g k [1] and n k [1].Following the steps in (4), the uplink data detection for As for the uplink data detection of the k th user when all users transmit uplink data, we can achieve a similar result to (6): From ( 14) and ( 15), the uplink data for the kth user in the first coherence interval can also be precisely detected in a massive MIMO system with the proposed pilot design.

Downlink of all users
After all users complete the uplink pilot and data transmission, the BS starts downlink data transmission, which is the same for all users.The BS applies matched filter (MF)-based beamforming since it works well enough in massive MIMO downlink data transmission [17].Without loss of generality, we consider the downlink data transmission of the kth user.The kth user receives where y d k [1] ∈ C is the receiving signal and υ k [1] ∈ C is the unit AWGN.p j [1] is the beamforming vector for the jth user and is defined as Because ĝk [1] is independent of gk [1] and g k [1] is independent of ĝj [1] (j = k), we apply it to equality = in the following and can obtain the downlink data estimate of the kth user: http://jwcn.eurasipjournals.com/content/2014/1/220 where we divide both the denominator and numerator of the last term in = by √ M and it vanishes as M → ∞.Equation (18) shows that the downlink data can also be accurately detected.

Communication in the ith(i > 1) coherence interval
Note that, different from the first coherence interval, all users show the same transmission pattern in the ith(i > 1) coherence interval as shown in Figure 2. Hence, without loss of generality, we take the kth user for instance.Its channel estimation is contaminated by all the other users' uplink data.As for the uplink and downlink data transmissions, they are similar to the procedures elaborated for the first coherence interval in the above subsection and we omit the detailed description due to space limitation.To be concise, we here briefly introduce the processing procedure of channel estimation in the following.
The received signal at the BS when the kth user transmits a pilot is Then the BS gets the uplink data estimate q k t [i] of the tth(t = k) user: which uses the known channel estimate from the present coherence interval when 1 ≤ t ≤ k − 1 and uses that from the previous one when k ≤ t ≤ K. Hence (20) is separately expressed based on two conditions.
Here, we assume that the variation of channel vectors during a coherence interval is ignorable.Any two channel vectors between the same user and BS can be treated as approximately equal if they locate within a distance of T. Take the channel vector from the second user to BS for example.Let g A and g B respectively denote the channel vectors at time slots ' A' and 'B' in Figure 2. Since the distance between ' A' and 'B' is no larger than T, it is reasonable to approximately treat g B as g A , i.e., g B ≈ g A .Therefore, as shown in (20) for the k ≤ t ≤ K case, the BS utilizes the estimate of g A to detect the second user's uplink data at time slot 'B' .This is different from the uplink data detection in (14), which is expressed based on only one condition.As for the following channel estimation process, it is performed in the similar way to that in Section 3.1.In this way, the k ≤ t ≤ K case for (20) establishes the main difference of the analysis for the ith(i > 1) coherence interval from that of Section 3.1.
Equation (20) can also be extended similarly to the form in (4) and then we arrive at the conclusion that . Similar to the procedure in (7), the BS first removes the data contamination from the received signal and generates where n k [i] means the residual interference and noise during the kth user's channel estimation in the ith(i > 1) coherence interval.Following the steps in ( 8), ( 9), (10), and (11), the BS computes and Then the BS gets an MMSE estimate: Again, the channel can be decomposed as Equation ( 26) is the unified form of σ 2 k,i for any ith(i ≥ 1) coherence interval by comparing ( 13) with (26).Analysis in the next section shows that σ 2 k,i will be useful in the performance analysis of our proposed pilot design.

Performance analysis
Given the processing procedure elaborated in the above section, we are now ready to conduct theoretical performance analysis of our proposed scheme.From Figure 2, the proposed pilot design saves more resource for data transmission compared with the conventional orthogonal one when K grows large along with M.However, due to the semi-orthogonal pilot pattern, SIC-based channel estimation is adopted, leading to a larger estimation error than the conventional one.In this section, performance analysis is presented to show explicitly the potential benefits that can be achieved by our proposed pilot design.
In this section, we analyze the performance of the massive MIMO systems in terms of both the uplink and downlink achievable rates.The approximation of the kth user's achievable rate [18] is defined as where γ k is the associated signal-to-interference-noiseratio (SINR); S, I, and N stand for the power of signal, interference, and noise, respectively.Next the achievable rate is calculated by evaluating the power of these items term by term.

Downlink analysis
Here we first conduct downlink performance analysis since it is much simpler than the uplink one.Assume that the kth user is of interest.Review the process of downlink communication in (16) and replace p j [1] with p j [i] (i ≥ 1) to formulate the downlink communication for any ith coherence intervals: where the signal, interference, and noise terms are marked with S, I, and N, and g H k [i]p k [i] is assumed to be accurately estimated at the kth user.
First, we derive the power of the intended signal in (28), denoted by S d k,i .By applying Khintchine's law of large numbers [19] and with some basic manipulations, we have the following equality in (e) = with high probability for large M. Thus we arrive at where E [•] means the expectation operator and p k [i] is the beamforming vector: The scalar √ M is a normalization factor [10].By using Lemma 1, we can get the value of α k,i as M → ∞: = exploits Lemma 2 in the following and Using a similar technique to (29), we have the power of the interference in (28), denoted by I d k,i , as follows: = is based on the independency between g k [i] and ĝj [i] (k = j) and the fact that the variance of g j [i] is β j I M .
The noise power in (28) can be readily obtained by Then the downlink achievable rate of the kth user in the ith coherence interval is

Uplink analysis
There are three kinds of uplink transmission patterns in massive MIMO systems with the proposed pilot design.The first kind of uplink communication contains only uplink data, i.e., case 1 in Figure 2, while the other two kinds of uplink communications, i.e., cases 2 and 3 in Figure 2, cover both the uplink data and pilot.Here we take the first kind of uplink communication as an example since the respective analysis of the other two kinds of uplink communications are similar.Assume that the kth user is of interest.Write down the MRC processed received signal at the BS based on (5) and replace '1' with 'i' (i ≥ 1) to formulate the uplink data transmission for any ith coherence interval.It gives where the signal, interference (composed of two parts) and noise terms are marked with S, I 1 , I 2 , and N. By applying the similar analytical procedure for the downlink, it is not hard to calculate the power of the desired signal, the interference and the noise in (34) term by term, which are respectively denoted by S u k,i , I u k,i , and N u k,i .The power of the desired signal is And the power of the interference in (34) is Then the noise power in (34) is readily achieved: Substituting the analyzed S u k,i , I u k,i , and N u k,i into (27), the kth user's uplink achievable rate of case 1 in Figure 2 is As for the second kind of uplink communication, we can write down the processed receiving signal for the kth user during the k 1 th user's first channel estimation as follows: The main difference of (39) from (34) lies in the interference term.Hence we only compute the power of the interference, denoted by I u k 1 k,1 : http://jwcn.eurasipjournals.com/content/2014/1/220 Then the kth user's uplink achievable rate when the Further concerning the third kind of uplink communication in a similar way, we can also get the uplink rate of the kth user when the k 1 th user transmits pilot in the ith(i > 1) coherence interval is where

Performance evaluation
Observing (33), (38), (41), and (42), the channel estimation accuracy σ 2 k,i has an influential effect on the uplink and downlink achievable rates.Obviously, The channel estimates of the conventional one are more accurate than those of our proposed one.However, an inherent merit of the proposed one is that it provides more available resources for data communication, which is implied in (41) and (42).It makes a difference in performance evaluation.Assume that the number of concerned coherence intervals is N c .Then from Figure 2, the uplink achievable rate of the proposed one, in terms of bps/Hz, is Meanwhile, from (33), the downlink achievable rate (bps/Hz) of the proposed one is For comparison, we also provide the uplink and downlink achievable rates Ru op and Rd op (bps/Hz) of the conventional orthogonal one in Figure 1 as follows: where the kth user's uplink and downlink achievable rates of the conventional orthogonal pilots R u op,k and R d op,k are Here, the variance of channel estimate σ 2 op,k for the conventional orthogonal pilots as shown in Figure 1 45), the additional uplink data transmission, i.e., cases 2 and 3 in Figure 2, has an influential positive effect on the uplink achievable rate of the proposed pilot design for a small T. Though at a cost of channel estimation accuracy, however, by exploiting the merit of massive MIMO, the proposed pilot design can achieve a better tradeoff between pilot resource consumption and channel estimation accuracy.It outperforms the conventional one in many scenarios.Remark 3. Note that from the elaboration in Section 3, compared with the conventional orthogonal pilot design, the main difference of our proposed one lies in the subtraction of data interference from the received signal at the BS, which can be easily completed without much additional complexity.
For a better view of the system performance comparison, C s pp = Rd pp + Ru pp and C s op = Rd op + Ru op are introduced to respectively stand for the system overall achievable rates of the proposed pilot design and the conventional one.

Asymptotic performance analysis
In this section, we will look at the performance at asymptotically low and high pilot and data SNRs.For simplicity, assume that ρ u ρ p and ρ d ρ p are fixed.Hence, ρ u → 0 and ρ d → 0 as ρ p → 0, and it is likewise as ρ p → ∞.We can use ρ to stand for ρ p , ρ u , and ρ d when considering asymptotically low and high SNRs.Furthermore, the SNR of our proposed pilot design is defined as ρ pp = 2TN c 2TN c −K+1 ρ op to provide an equal overall system power, where ρ op denotes the SNR of the conventional orthogonal one.

High SNR analysis
In order to evaluate the performance of two pilot designs at an asymptotically high SNR, we first reformulate the SINR value in (38) by some manipulations as follows: where λ pp,k = ρpp +β k is from (26).Following the steps in (49), the SINR value in (47) is where λ op,k = . Furthermore, based on (22) and the fact that lim ρ pp →∞ ε 2 t,i = 0 for any i and t, we have lim It is easily seen that lim Further, it achieves that lim The similar manipulations to (49) is applied to 41) and (42), respectively, and it achieves that lim Then considering the value of Rd pp at asymptotically high SNRs, we first compute the downlink rate γ d k,i in (33).Following the steps in (49), it achieves Applying the similar manipulations in (48), it arrives at lim Some remarks on the high SNR analysis show the priority of our proposed pilot design over the conventional one.

Low SNR analysis
In the sequel, consider the performance of two pilot designs at an asymptotically low SNR.Following the steps in (49) and based on the fact that σ we have where λ pp,k = where the last term tends to infinity at asymptotically low SNR.Compared with λ op,k , it arrives at the result that lim γ u op,k , which is also applied to γ As for the value of Rd pp at asymptotically low SNR, it arrives at . Above all, the uplink rates of both pilot designs at low SNR are lim lim And the downlink rates of both pilot designs at low SNR are lim Fortunately, based on the above analysis, we are able to achieve Theorem 1 characterizing the advantage of our proposed pilot design.
Theorem 1.The proposed pilot design outperforms the conventional one for both low and high SNRs.
Proof.Assume that D pp = D op .Based on (55) to (58), we have lim and lim Therefore, it arrives at lim The similar result at asymptotically low SNR can be achieved by using (62) to (65): The conclusion in Theorem 1 is independent of coherence interval length T and number of concerned coherence intervals N c .It provides a superior pilot design for scenarios of small or large noise and interference.

Numerical results
In this section, we present some numerical results about the performance of the proposed pilot design.The system tested here consists of K = 5 users within the same distance from the BS.Without loss of generality, assume that the large-scale fading coefficients β k are all 1.In practice, the users can be scheduled according to their channel conditions.If it is not specified, the number of BS antennas is set to be 128, and ρ u = ρ d = ρ p .
First, we compare the system overall achievable rate of our proposed pilot design with the conventional one under different numbers of concerned coherence intervals with varying SNR. Figure 3 shows that, with the proposed pilot design, the system overall achievable rates http://jwcn.eurasipjournals.com/content/2014/1/220with different N c all surpass the rate of the conventional one when SNR varies from −1 dB to 40 dB, which verifies Theorem 1. Further, the system overall achievable rate keeps increasing along with SNR when SNR is high, whereas that of the conventional pilot design tends to saturate.Besides, a larger N c generally leads to a smaller system overall achievable rate due to the accumulative CSI estimation error.However, N c impacts little on the comparison of system overall achievable rates between two pilot designs.
Then, we consider the performance of two pilot designs with various number of BS antennas.Figure 4 shows that the system overall achievable rates of both pilot designs improve as the number of BS antennas grows.This results from the fact that the channel vectors tend to be orthogonal as the number of BS antennas increases.Moreover, the system overall achievable rates of the proposed pilot design respectively outperform those of the conventional one.
Moreover, Figure 5 illustrates that the system overall achievable rates of our proposed pilot design achieve a more significant performance gain compared with those of the conventional one as the number of users increases.This verifies again the priority of our proposed pilot design when the number of users is large.
According to Figures 3, 4, and 5, we conclude that the proposed pilot design provides a performance improvement compared with the conventional one at most SNR  under the above scenario due to the tradeoff between pilot consumption and channel estimation accuracy.
Further, in Figure 6, we compare two pilot designs considering SNR and the length of coherence interval T. We assume that the length of downlink data per coherence interval is 3(T−K)

4
, where • means rounding down to the closest integer.Figure 6 shows that the proposed pilot design outperforms the conventional one at low and high SNRs no matter how long the coherence interval is, which also coincides with Theorem 1. Besides, the shorter the coherence interval is, the larger SNR in which the system overall achievable rates of the proposed pilot design win.This again strengthens the motivation effectiveness of our proposed pilot design.Generally, massive MIMO in many cases works at low SNRs.We have analyzed the performance at low SNRs in both theoretical and numerical ways.Theorem 1 indicates that our proposed pilot design outperforms the conventional one at low SNRs, which is later validated by simulation results in Figure 6.
Concerning the high SNR analysis in both Theorem 1 and Figures 3 and 6, this is due to the consideration that massive MIMO applies not only for future wireless communication systems but also for current long Figure 6 of system overall achievable rates of two designs considering SNR and c = 2. http://jwcn.eurasipjournals.com/content/2014/1/220term evolution/long term evolution-advanced (LTE/LTE-A) systems.For systems like LTE/LTE-A, the operation region in terms of SNR varies widely, for example, from − 3 dB to 30 dB [20].In particular, for users locating in the proximity of BS, they experience a relatively high quality of SNR.Moreover, users are more likely to experience high SNR transmissions especially for the emerging small cell deployment with reduced cell sizes.The combination of small cells and massive MIMO [21] could lead to a high SNR scenario.In Theorem 1, the high SNR analysis validates the application of our proposed pilot design for these scenarios.Finally, the analysis for both low and high SNRs presents a complete performance comparison between two pilot designs.

Conclusions
This paper proposes a semi-orthogonal pilot design using SIC in a TDD massive MIMO system, which makes full use of the asymptotic channel orthogonality.The performance of the proposed pilot design is elaborated both theoretically and numerically.Simulation results show that the proposed pilot design outperforms the conventional orthogonal pilots.And particularly for low or high SNRs with any coherence interval length, the superiority of our proposed pilot design is theoretically proven.

u k 1 k,1 and γ u k 1 k
,i in (41) and (42).Define R u k,op = lim ρ pp →∞ R u op,k and R u k,pp = min lim

Figure 3
Figure 3 Comparison of system overall rates of two designs versus SNR.We set T = 10 and D op = D pp = 3.

Figure 4
Figure 4 Comparison of system overall achievable rates of two designs versus number of BS antennas.We set T = 20 and N c = 2.

Figure 5
Figure 5 Comparison of system overall achievable rates of two designs versus number of users.We set T = 20 and N c = 2.