Skip to main content

Self-interference suppression with imperfect channel estimation in a shared-antenna full-duplex massive MU-MIMO system

Abstract

In this paper, we consider a shared-antenna full-duplex massive MU-MIMO system and prove that the self-interference (SI) at the base station (BS) can be suppressed, though the direct-path SI exists and the signal/SI channel is imperfectly estimated. The BS is assumed to employ zero-forcing (ZF) or maximal-ratio transmission/maximal-ratio combining (MRT/MRC) method to linearly process signals. We propose a precoded SI channel training scheme by employing orthogonal sequences and downlink precoding, so the SI channel can be estimated like the signal channel with a much lower dimension. Based on the channel estimation, we further analyze the SI suppression by combining the SI removal and the large-scale antenna linear processing (LALP) method. Numerical results show that an additional 36 dB SI can be suppressed by combining the SI removal and the LALP suppression. We derive the tight closed-form lower bounds of the achievable rates for the combined suppression. Finally, we maximize the system spectral efficiency and energy efficiency based on the simulation and closed-form approximations.

1 Introduction

Massive multiple-input multiple-output (MIMO) and in-band full-duplex (IBFD) techniques have attracted intensive research interest because they can be used for broadband green communications [18]. IBFD can realize simultaneous bi-directional (uplink and downlink) data transmissions using the same time-frequency resource. Massive MIMO system exploits a large number of antennas at the transmitter to focus the energy into a narrow beam to improve the signal strength and mitigate interference. Both techniques are promising to meet the ever-growing requirements of wireless data transmissions, but bring some design variations over the physical or higher layers [917].

The massive MIMO technique can significantly improve the spectral efficiency (SE) and energy efficiency (EE) while keeping a lower transmit power for both uplink and downlink [15, 1820] by mitigating the detrimental effects of small-scale fading, noise, and interference [4]. With more antennas equipped, the signal processing becomes much easier than the traditional MIMO system [4, 19]. The massive MIMO system usually operates with time-division duplexing (TDD) rather than frequency-division duplexing (FDD). In the FDD system, channel state information (CSI) should be acquired by the feedback, and much overhead will be introduced with the increase of antenna-array scale [15], thus the antenna grouping and CSI compression techniques were studied to reduce the estimation overhead [16, 17]. While in the TDD system, channel reciprocity can be considered to reduce the overhead, as the downlink CSI can be acquired by the base station (BS) through the uplink training [18, 21].

The full-duplex (FD) technique can potentially double the spectral efficiency, but it faces the problem of self-interference (SI), which refers to transmitted signals that can be directly or indirectly received by its own receivers. Recently, much effort has been made to suppress SI [2225]. Riihonen et al. designed natural isolation, time-domain cancellation, and spatial SI suppression schemes [22]. Everett et al. proposed to exploit the antenna directional isolation and cross-polarization to suppress the SI in the wireless propagation domain [23]. Bharadia et al. proposed analog circuit domain cancellation methods that can keep a copy of the transmitted signal and subtract it from the received signal [24]. An analog-and-digital hybrid SI cancellation method has been developed by Duarte et al. and can provide approximately 85 dB interference suppression [25]. Recently, Ngo et al. proposed the loop interference cancellation technique in a separate-antenna relay system, where the SI can be suppressed with a large-scale antenna linear processing (LALP) method in a random reflected-path SI environment [26].

Sabharwal et al. discussed two methods of interfacing antennas to an IBFD wireless system: (i) separate-antenna FD and (ii) shared-antenna FD [1]. In the shared-antenna model, a bidirection circulator is used to transmit and receive signals without modifying many architecture standards, thus the RF hardwares such as antenna, RF cable and filters can be saved [2]. At the same time, the nonlinearities of the RF hardware, for example the circulators and amplifiers, introduce weak nonlinear distortions in such systems.

In this paper, we consider a shared-antenna FD massive multiuser MIMO (MU-MIMO) system. In our system, a BS is equipped with a single antenna array, and it serves multiple users, each with a single antenna. Like [2], we ignore the nonlinearities introduced by the RF hardware, and focus on the linear processing in this model. A linear processing method, such as zero-forcing (ZF) or maximal-ratio transmission/maximal-ratio combining (MRT/MRC), is adopted for the signal precoding and decoding over the FD links. Unlike [26], we discuss a Rician SI channel rather than Rayleigh channel model, the uplink and downlink channels in our model are completely dependent on each other, because the same antenna array is employed for the signal receive and transmission. The perfect channel reciprocity can be used to reduce the estimation overhead. Unlike [27], we consider the channel estimation for both the signal and SI channels in this paper. A precode SI channel training scheme is adopted to estimate the SI channel using the minimum mean-square-error (MMSE) technique. The main contributions of our work are summarized as:

  • We consider both the non-random direct-path and the random reflected-path SI channels in this model. We prove that the Rician SI channel changes into Rayleigh channel when the downlink channel/SI channel is precoded. We also prove that the SI at BS with imperfect channel estimation can be asymptotically suppressed, though the uplink-downlink channels are totally dependent and the line of sight (LOS) SI exists. It is shown that the capability of LALP suppression is proportional to the square root of the antenna array scale M, which is different from the separate-antenna massive MIMO system. Because of the channel reciprocity, the signal channel estimation of the shared-antenna array is reduced by half compared with the separate-antenna arrays.

  • We estimate the SI through the precoded SI channel with MMSE method, which is different from the traditional SI acquisition method. The estimation overhead and the dimension of pilots can be greatly suppressed compared with the direct estimation of the unprecoded SI channel. With the estimated SI, we can further analyze the SI suppression by combining the SI removal and the LALP method. It is shown that an additional 36 dB suppression capability can be achieved through combining the two methods. We also derive the closed-form lower bounds of the uplink achievable rates with imperfect CSI. The optimal SE and EE are further studied based on the rate lower bounds.

The rest of this paper is organized as follows. The system model is described in Section 2. Section 3 studies the SI suppression and the uplink achievable rates. Section 4 analyzes the SE and EE. Numerical results and conclusions are presented in Section 5 and Section 6, respectively.

Notations: The boldface upper and lower-case letters represent matrices and vectors, respectively. The superscripts [·],[·]T, and [·]H stand for conjugate, transpose, and Hermitian transpose of matrices or vectors, respectively. The mathematical expectation is denoted as \(\mathbb {E}\{\cdot \}\). · and represents the Euclidean norm. The (i,j)th entry of a matrix is denoted as [·] ij . The trace of a square matrix is denoted as tr(·). The \(\overset {a.s.}\longrightarrow \) represents almost sure convergence. u n =o(v n ) and u n =O(v n ) denote that there exists a constant C, such that u n =C v n and u n C v n , n, respectively.

2 System model

As shown in Fig. 1, we consider the uplink of a shared-antenna FD massive MU-MIMO system, where a BS is equipped with M antennas and serves K users, each with a single antenna. Like the separate-antenna full-duplex system [23, 25], physical isolation can be adopted between the BS antennas to avoid front-end saturation at the receiver, and active analog SI cancellation is used to suppress the SI before the baseband signal processing. We assume that M is much larger than K, i.e., MK. Both the BS and the users are working in the FD mode, i.e., they can transmit and receive signals using the same time-frequency resource. Therefore, the M×1 uplink received signal at the BS is given as

$$ {\boldsymbol{y}_{\text{BS}}} = \sqrt {p_{\mathrm{u}}} \boldsymbol{G}{\boldsymbol{x}_{\mathrm{u}}} + \sqrt {p_{\mathrm{d}}} {\boldsymbol{G}_{\mathrm{s}}}\boldsymbol{s} + {\boldsymbol{n}}, $$
(1)
Fig. 1
figure 1

Data transmission and reception with SI at the BS in the shared-antenna FD massive MU-MIMO system

where the first term represents the desired signal, and the second term represents the SI from the downlink transmission. G is the M×K channel matrix between the K users and the BS; x u denotes the symbols transmitted by the K users with unit power \(\mathbb {E}\{{\boldsymbol {x}_{\mathrm {u}}\boldsymbol {x}{_{\mathrm {u}}^{H}}}\}=\boldsymbol {I}_{K}, p_{\mathrm {u}}\) is the transmit power of each user; G s is the M×M SI channel matrix between the BS transceivers; s is the downlink precoded signal transmitted to the K users with unit power \(\mathbb {E}\{{\boldsymbol {s}^{H}\boldsymbol {s}}\}=1; p_{\mathrm {d}}\) is the downlink transmit power; and n represents the additive noise with zero mean and unit variance.

2.1 Channel model

The channel matrix G models the small-scale Rayleigh fading and the large-scale path-loss, so we have

$$\begin{array}{*{20}l} \boldsymbol{G}=\boldsymbol{H} \boldsymbol{D}^{1/2}, \end{array} $$
(2)

where H is the M×K matrix with i.i.d entries [4], representing the small-scale fading between the K users and the BS, and the entry h mk is a circular symmetric complex Gaussian (CSCG) random variable with zero mean and unit variance, i.e., \(h_{mk}\sim \mathcal {CN}(0,1)\); D is a K×K diagonal matrix of the large-scale path-loss with [D] kk =β k .

Since the physical isolation and active analog SI cancellation are performed preliminarily, it is reasonable to assume that the SI can be suppressed greatly [26, 28]. The magnitude of the SI channel can be modeled as Rician distribution with LOS SI [29]. Therefore, we can write the matrix G s as the direct-path plus the reflected-path coefficients, i.e.,

$$\begin{array}{*{20}l} \boldsymbol{G}_{\mathrm{s}}&={\bar{\boldsymbol{G}}}_{\mathrm{s}}+{\tilde{\boldsymbol{G}}}_{\mathrm{s}}, \end{array} $$
(3)

where \({\bar {\boldsymbol {G}}}_{\mathrm {s}}\) is a complex deterministic matrix with \([{\bar {\boldsymbol {G}}}_{\mathrm {s}}]_{mj}=c_{mj}\), which represents the antenna intra-leakage and inter-coupling coefficients related to the direct path, and \({\tilde {\boldsymbol {G}}}_{\mathrm {s}}\) is a complex random matrix that represents the reflection coefficients related to the reflected path. Similar to channel matrix G, random matrix \({\tilde {\boldsymbol {G}}}_{\mathrm {s}}\) can be expressed as

$$\begin{array}{*{20}l} {\tilde{\boldsymbol{G}}}_{\mathrm{s}}={\tilde{\boldsymbol{H}}}_{\mathrm{s}} {\tilde{\boldsymbol{D}}}{_{\mathrm{s}}^{1/2}}, \end{array} $$
(4)

where \({\tilde {\boldsymbol {H}}}_{\mathrm {s}}\) is an M×M matrix with i.i.d. entries representing the small-scale fading. We suppose \([{\tilde {\boldsymbol {H}}}_{\mathrm {s}}]_{mj}={h}_{\mathrm {s},{mj}}\), where h s,m j is a CSCG random variable with zero mean and unit variance, and \({\tilde {\boldsymbol {D}}}_{\mathrm {s}}\) is an M×M diagonal matrix of the large-scale path-loss, where \([{\tilde {\boldsymbol {D}}}_{\mathrm {s}}]_{mm}={\beta }\), namely, all of the SI large-scale path-loss coefficients are assumed to be identical. This assumption is reasonable, as the antenna array size is much smaller than the distance to the scatters. If these coefficients are not identical, we can set all the elements of SI channel matrix \({\tilde {\boldsymbol {G}}}_{\mathrm {s}}\) to be the maximum value β max, which represents the worst case, then \(\tilde {\boldsymbol {G}_{s}}\) becomes a matrix with identical large-scale fading.

For the shared-antenna FD massive MU-MIMO system, if the array scale is large enough and greatly exceeds the number of users, that is MK, the column vectors of the signal channel matrix are mutually orthogonal [4], and so do the vectors of the SI channel matrix. We then have

$$\begin{array}{*{20}l} \quad\quad \; {{\boldsymbol{G}}^{H}{\boldsymbol{G}}\over M}={\boldsymbol{D}}^{1/2}{{\boldsymbol{H}}^{H}{\boldsymbol{H}}\over M}{\boldsymbol{D}}^{1/2} \approx {\boldsymbol{D}}, \end{array} $$
(5)
$$\begin{array}{*{20}l} {\boldsymbol{G}{_{s}^{H}}{\boldsymbol{G}_{s}}\over M}&\approx{\bar{\boldsymbol{G}}{_{s}^{H}}\bar{\boldsymbol{G}_{s}}\over M}+ {\tilde{\boldsymbol{G}}{_{s}^{H}}\tilde{\boldsymbol{G}_{s}}\over M}\approx \bar {\boldsymbol{D}}_{s}+\tilde{\boldsymbol{D}}_{s}, \end{array} $$
(6)
$$\begin{array}{*{20}l} \quad \text{tr} ({\boldsymbol{G}{_{s}^{H}}{\boldsymbol{G}_{s}}})&\approx \text{tr} ({\bar{\boldsymbol{G}}{_{s}^{H}}\bar{\boldsymbol{G}_{s}}})+ \text{tr}({\tilde{\boldsymbol{G}}{_{s}^{H}}\tilde{\boldsymbol{G}_{s}}})= O(M^{2}), \end{array} $$
(7)

where \(\bar {\boldsymbol {D}}_{s}\) is an M×M deterministic matrix and \(\tilde {\boldsymbol {D}}_{s}\) is an M×M diagonal matrix with \([\tilde {\boldsymbol {D}}_{s}]_{mm} =\beta \). The proofs of (6) and (7) are given in Appendix.

2.2 Channel estimation

Channel estimation is a prerequisite for SI suppression. Commonly, a series of orthogonal training sequences is sent within a period 2τ, which is a small part of the channel coherence time T. The channel training/estimation is performed in half-duplex. Therefore, the M×τ received pilot matrix for the uplink channel and SI channel at the BS can be written as

$$ {\boldsymbol{Y}_{\mathrm{u}}} = \sqrt {P_{\text{{u}}}} {\boldsymbol{G}}{\boldsymbol{\Phi}}_{\mathrm{u}} + {\boldsymbol{N}}, $$
(8)

and

$$ {\boldsymbol{Y}_{\mathrm{s}}} = \sqrt {P_{\mathrm{d}}} {\boldsymbol{G}_{\mathrm{s}}}{\boldsymbol{A}}{\boldsymbol{\Phi}}_{\mathrm{d}} + {\boldsymbol{N}_{\mathrm{s}}}, $$
(9)

where P u and P d are the transmit powers of pilot sequences for the uplink and downlink, respectively. Suppose that the power of each pilot symbol equals the average power of data transmission, then we have P u=τ p u and P d=τ p d;Φ u=Φ d are K×τ matrices whose rows are the pilot sequences for the estimation of the signal channel and SI channel. The pilot matrices satisfy \({\boldsymbol {\Phi }}_{\mathrm {u}}{\boldsymbol {\Phi }}{_{\mathrm {u}}^{H}}={\boldsymbol {\Phi }}_{\mathrm {d}}{\boldsymbol {\Phi }}{_{\mathrm {d}}^{H}} ={\boldsymbol {I}}_{K}\) and τK;A is an M×K precoding matrix, which is used for downlink beamforming and updated from the uplink channel estimation; N and N s are M×τ AWGN matrices with i.i.d \(\mathcal {CN} (0,1)\) entries.

2.2.1 Signal channel estimation

The signal channel is estimated in the first training period τ. Given Y u, the MMSE estimate of G can be written as [26]

$$ {\hat {\boldsymbol{G}}}={1 \over {\sqrt {P_{\mathrm{u}}}}}{\boldsymbol{Y}}_{\text{{u}}}{\boldsymbol{\Phi}{_{\mathrm{u}}^{H}}}{\check {\boldsymbol{D}}}=\left(\boldsymbol{G} + {1 \over {\sqrt {\tau p_{\mathrm{u}}}}} {\boldsymbol{N}}_{\mathrm{u}}\right){\check{\boldsymbol{D}}}, $$
(10)

where \(\boldsymbol {N}_{\mathrm {u}} \triangleq {\boldsymbol {N}}{\Phi {_{\mathrm {u}}^{H}}}\) and \(\check {\boldsymbol {D}} \triangleq ({1 \over {{\tau p_{\mathrm {u}}}}}{{\boldsymbol {D}}^{-1}}+{{\boldsymbol {I}}_{K}})^{-1}\). Because \({\boldsymbol {\Phi }}_{\mathrm {u}}{\boldsymbol {\Phi }}{_{\mathrm {u}}^{H}}={\boldsymbol {I}}_{K}\), the entries of N u are i.i.d random variables distributed as \(\mathcal {CN} (0,1)\). Let Δ denote the estimation error of G, then we have

$$ \boldsymbol{G}=\hat {\boldsymbol{G}} + {\boldsymbol{\Delta}}. $$
(11)

According to the property of statistician’s Pythagorean theorem of the MMSE estimation [30], the estimation error Δ is independent of \(\hat {\boldsymbol {G}}\). Therefore, the kth columns of \(\hat {\boldsymbol {G}}\) and Δ are mutually independent, and the elements of the kth vector are distributed as \(\mathcal {CN}(0,{\hat {\beta }_{k}})\) and \(\mathcal {CN}(0,{\tilde {\beta }_{k}})\), respectively, with \(\hat {\beta }_{k} \triangleq {\tau p_{\mathrm {u}}\beta {_{k}^{2}}\over {\tau p_{\mathrm {u}}\beta _{k} + 1}}\) and \(\tilde {\beta }_{k} \triangleq {\beta {_{k}}\over {\tau p_{\mathrm {u}}\beta _{k} + 1}}\).

Note that the shared-antenna array massive MIMO system has a single array, so the transmit and receive CSI are the same during the coherent time. We only need to estimate the uplink channel, and thus the estimation overhead is reduced by half. This is different from the separate-antenna arrays system, where the CSI of the receive and transmit arrays are different, the uplink and downlink CSI have to be estimated separately.

2.2.2 SI channel estimation

The SI channel is estimated in the second training period τ. In fact, it is not necessary to estimate G s, we can estimate the equivalent or precoded SI channel G sa instead, which represents the product of G s and A. The new precoding matrix A can be updated for the SI channel estimation after \(\hat {\boldsymbol {G}} \) is acquired. Because G sa has the same dimension as the uplink channel G, the training pilot length and pilot amount depend on the user number K rather than the number of BS antennas M. Therefore, the dimension of the SI channel estimation can be decreased from M×M to M×K, and thus fewer orthogonal pilot sequences are needed. Note that the proposed training and estimation method is different from the traditional SI acquisition method where the SI is directly estimated with/without pilots [22].

The precoding matrix A is derived from \(\hat {\boldsymbol {G}}\). For a linear precoding scheme such as ZF or MRT, A is given as

$$\begin{array}{@{}rcl@{}} \boldsymbol{A}= \left\{ \begin{aligned} &\alpha_{\text{ZF}}{{\hat{\boldsymbol{G}}}^{*}}(\hat{\boldsymbol{G}}^{T}\hat{\boldsymbol{G}}^{*})^{-1},&&\quad\text{for ZF,}\\ &\alpha_{\text{MRT}}{{\hat {\boldsymbol{G}}}^{*}},&&\quad \text{for MRT,} \end{aligned} \right. \end{array} $$
(12)

where \(\alpha _{\text {ZF}} \triangleq \sqrt {M-K\over \sum _{k=1}^{K}{\hat \beta }{_{k}^{-1}}}\) and \(\alpha _{\text {MRT}} \triangleq \sqrt {1\over {M\sum _{k=1}^{K}{\hat \beta }{_{k}}}}\) are the normalization factors [26] satisfying \(\mathbb {E}\{{\boldsymbol {s}^{H}\boldsymbol {s}}\}=1\).

Proposition 1

In the shared-antenna FD massive MU-MIMO system, the SI equivalent channel G sa=G s A is an M×K complex Gaussian matrix if K is fixed and M. The (m,i)th entry follows the distribution \(\mathcal {CN}\left (0, \rho _{m,i}\right)\), where \(\rho _{m,i}={1\over M^{2}}\alpha _{\text {ZF}}^{2}\hat \beta _{i}^{-\!1}\!\sum _{j=1}^{M}\left (\left |c_{mj}\right |^{2}\,+\,\beta \right)\) for ZF precoding and \(\rho _{m,i}=\alpha _{\text {MRT}}^{2}\hat \beta _{i}\!\sum _{j=1}^{M}\left (\left |c_{mj}\right |^{2}\,+\,\beta \right)\) for MRT precoding, \(\hat \beta _{i} \triangleq {\tau p_{\mathrm {u}}\beta {_{i}^{2}}\over {\tau p_{\mathrm {u}}\beta _{i} + 1}}\), i=1,…,K.

Proof.

Please see Appendix. □

From Proposition 1, we can see that the equivalent SI channel is a zero-mean complex Gaussian channel, i.e., the Rayleigh channel, even though the SI channel is a Rician channel. Therefore, the precoded SI channel can still be treated as Rayleigh channel. Like the signal channel G, G sa can be estimated by MMSE if Y s is given. Let \(\boldsymbol {\mathcal {E}}\) denote the estimation error matrix of G sa, we then have

$$\begin{array}{*{20}l} \boldsymbol{G}_{\text{sa}}={\hat {\boldsymbol{G}}}_{\text{sa}} + \boldsymbol{\mathcal{E}}, \end{array} $$
(13)

the (m,i)th entry of \({\hat {\boldsymbol {G}}}_{\text {sa}}\) and \(\boldsymbol {\mathcal {E}}\) are distributed as \(\mathcal {CN}(0,{\hat {\rho }_{m,i}})\) and \(\mathcal {CN}(0,{\tilde {\rho }_{m,i}})\), respectively, with \(\hat \rho _{m,i} \triangleq {\tau p_{\mathrm {d}}\rho {_{m,i}^{2}}\over {\tau p_{\mathrm {d}}\rho _{m,i} + 1}}\), \(\tilde \rho _{m,i} \triangleq {\rho {_{m,i}}\over {\tau p_{\mathrm {d}}\rho _{m,i} + 1}}\), m=1,...,M, i=1,...,K.

2.3 Uplink decoding under precoded SI

Precoding can be used in the training phase for the SI channel estimation, and it can also be used in the data transmission phase for beamforming, which can help enhance the strength of the desired signal and reduce the interference to others. During the data transmission period, the ZF or MRT scheme is adopted by the BS for the downlink beamforming. Correspondingly, the BS receives the precoded SI from the transmitters, and it uses the ZF or MRC linear processing method for the uplink decoding.

Suppose x d is a K×1 data vector transmitted from the BS to the users with unit power \(\mathbb {E}\{{\boldsymbol {x}_{\mathrm {d}}\boldsymbol {x}{_{\mathrm {d}}^{H}}}\}=\boldsymbol {I}_{K}\), and s=A x d. (1) can be rewritten as

$$\begin{array}{*{20}l} {\boldsymbol{y}_{\text{BS}}} = \sqrt {p_{\mathrm{u}}} \boldsymbol{G}{\boldsymbol{x}_{\mathrm{u}}} + \sqrt {p_{\mathrm{d}}} {\boldsymbol{G}_{\mathrm{s}}}\boldsymbol{A} \boldsymbol{x}_{\mathrm{d}} + {\boldsymbol{n}}. \end{array} $$
(14)

To decode the uplink data of K users, the BS receiver will multiply a linear matrix W T before y BS, which is a function of the signal channel estimation, i.e.,

$$\begin{array}{@{}rcl@{}} \boldsymbol{W}^{T}= \left\{ \begin{aligned} &\left({\hat{\boldsymbol{G}}}^{H} {\hat {\boldsymbol{G}}}\right)^{-1}{\hat {\boldsymbol{G}}}^{H},&&\quad\text{for ZF};\\ &{1\over M}{\hat {\boldsymbol{G}}}^{H}, &&\quad\text{for MRC}. \end{aligned} \right. \end{array} $$
(15)

With the uplink linear receiver at the BS, the user data can be decoded as

$$\begin{array}{*{20}l} {}\boldsymbol{r}=\boldsymbol{W}^{T}{\boldsymbol{y}_{\text{BS}}}=\sqrt {p_{\mathrm{u}}} \boldsymbol{W}^{T}\boldsymbol{G}{\boldsymbol{x}_{\mathrm{u}}} + \sqrt {p_{\mathrm{d}}}\boldsymbol{W}^{T} {\boldsymbol{G}_{\mathrm{s}}}\boldsymbol{A} \boldsymbol{x}_{\mathrm{d}} + \boldsymbol{W}^{T}{\boldsymbol{n}}, \end{array} $$
(16)

where \(\sqrt {p_{\mathrm {u}}} \boldsymbol {W}^{T}{\boldsymbol {G}}{\boldsymbol {x}_{\mathrm {u}}}\) is the desired signal, \(\sqrt {p_{\mathrm {d}}}\boldsymbol {W}^{T} {\boldsymbol {G}_{\mathrm {s}}}\boldsymbol {A} \boldsymbol {x}_{\mathrm {d}}\) is the SI between the BS transceivers, and W T n is the additive noise.

Further, from (11) and (13), we have

$$\begin{array}{*{20}l} \boldsymbol{r}=&\sqrt {p_{\mathrm{u}}} \boldsymbol{W}^{T}\hat{\boldsymbol{G}}{\boldsymbol{x}_{\mathrm{u}}}+\sqrt {p_{\mathrm{u}}} \boldsymbol{W}^{T}{\boldsymbol{\Delta}}{\boldsymbol{x}_{\mathrm{u}}} \\ &+\sqrt {p_{\mathrm{d}}}\boldsymbol{W}^{T} {\hat {\boldsymbol{G}}_{\text{sa}}}\boldsymbol{x}_{\mathrm{d}}+ \sqrt {p_{\mathrm{d}}}\boldsymbol{W}^{T} \boldsymbol{\mathcal{E}}\boldsymbol{x}_{\mathrm{d}}+ \boldsymbol{W}^{T}{\boldsymbol{n}}, \end{array} $$
(17)

where \(\sqrt {p_{\mathrm {u}}} \boldsymbol {W}^{T}\hat {\boldsymbol {G}}{\boldsymbol {x}_{\mathrm {u}}}\) is the estimated signal, \(\sqrt {p_{\mathrm {d}}}\boldsymbol {W}^{T} {\hat {\boldsymbol {G}}_{\text {sa}}}\boldsymbol {x}_{\mathrm {d}}\) is the SI from the estimation, \(\sqrt {p_{\mathrm {u}}} \boldsymbol {W}^{T}{\boldsymbol {\Delta }}{\boldsymbol {x}_{\mathrm {u}}}\) and \(\sqrt {p_{\mathrm {d}}}\boldsymbol {W}^{T} \boldsymbol {\mathcal {E}}\boldsymbol {x}_{\mathrm {d}}\) are the estimation errors of the desired signal and SI, respectively.

3 SI suppression and achievable rate

In the massive MIMO system, the white noise and the inter-user interference can be eliminated by an LALP method [15, 19, 31]. In this section, we consider the shared-antenna FD massive MU-MIMO system and discuss the combined SI suppression based on the estimated channels of signal and SI. We also derive the closed-form lower bounds of the uplink achievable rates, which can be used as a system performance metric.

3.1 SI suppression with LALP

In the TDD half-duplex (HD) system, the interferences are asymptotically orthogonal to the subspace spanned by the desired signals when the antenna-array scale M approaches infinity. The desired signal can be recovered by projecting the received signal in a desired subspace with linear processing [19, 26]. In the shared-antenna FD massive MU-MIMO system, the asymptotic orthogonality property still holds true. Unlike the separate-antenna array system [26], the SI in our model includes the nonrandom direct-path interference, and the uplink channel is totally dependent on the downlink channel, as the same antenna array is employed to transmit and receive signals simultaneously.

Theorem 1

In the shared-antenna FD massive MU-MIMO system, when the signal and SI channels are not perfectly estimated and ZF or MRT/MRC linear processing is adopted, the SI at the BS will vanish if K is fixed and M.

Proof.

We consider the SI term \(\sqrt {p_{\mathrm {d}}}\boldsymbol {W}^{T} {\boldsymbol {G}_{\mathrm {s}}}\boldsymbol {A} \boldsymbol {x}_{\mathrm {d}}\) of (16). From (5) and (7), we have \({\boldsymbol {G}^{H}\boldsymbol {G}\over M}=\boldsymbol {D}\) and \(\text {tr}\left (\boldsymbol {G}_{s}\boldsymbol {G}{_{s}^{H}}\right)=O(M^{2})\) when M. According to the properties of MMSE estimation, we know that the estimated channels \(\hat {\boldsymbol {G}}\) and \(\hat {\boldsymbol {G}}_{\text {sa}}\) are independent on the estimation errors Δ and \(\boldsymbol {\mathcal {E}}\), respectively, and they are zero-mean complex Gaussian matrices as well. Substituting (12) and (15) into W T G s A, we have

$$ {\selectfont{\begin{aligned} \boldsymbol{W}^{T} \!{\boldsymbol{G}_{\mathrm{s}}}\boldsymbol{A}\,=\, \left\{ \begin{array}{ll} \alpha_{\text{ZF}}\left({\hat {\boldsymbol{G}}}^{H} \!{\hat {\boldsymbol{G}}}\right)^{-\!1}\!{\hat {\boldsymbol{G}}}^{H}\!{\boldsymbol{G}_{\mathrm{s}}}\!{\hat {\boldsymbol{G}}}^{*}\left({\hat {\boldsymbol{G}}}^{T}\!{\hat {\boldsymbol{G}}}^{*}\right)^{-\!1},& \text{ for\, ZF};\\ \alpha_{\text{MRT}}{1\over M}{\hat {\boldsymbol{G}}}^{H}{\boldsymbol{G}_{\mathrm{s}}}\hat {\boldsymbol{G}}, & \text{ for\, MRT/MRC}. \end{array}\right. \end{aligned}}} $$
(18)

Note that \(\text {tr}({\hat {\boldsymbol {G}}}^{H} {\hat {\boldsymbol {G}}})=\text {tr}({\hat {\boldsymbol {G}}}^{T} {\hat {\boldsymbol {G}}^{*}})=o(M)\), \(\alpha _{\text {ZF}}=o(\sqrt {M})\) and \(\alpha _{\text {MRT}}=o({1\over \sqrt {M}})\), by using Lemma 13 and 14 in [32], we obtain

$$\begin{array}{*{20}l} {1\over M}{\hat {\boldsymbol{g}_{i}}}^{H}{\boldsymbol{G}_{\mathrm{s}}}\hat {\boldsymbol{g}_{i}}-{1\over M} \text{tr}{\boldsymbol{G}_{\mathrm{s}}}\overset{a.s.}\longrightarrow 0, \quad i = 1,\ldots,K, \end{array} $$
(19)
$$\begin{array}{*{20}l} {1\over M}{\hat {\boldsymbol{g}_{i}}}^{H}{\boldsymbol{G}_{\mathrm{s}}}\hat {\boldsymbol{g}_{j}}\overset{a.s.}\longrightarrow0, \quad i,j = 1,\ldots,K, \quad i\neq j, \end{array} $$
(20)

and

$$\begin{array}{*{20}l} {1\over M^{2/3}}{\hat {\boldsymbol{G}}}^{H}{\boldsymbol{G}_{\mathrm{s}}}\hat {\boldsymbol{G}}\overset{a.s.}\longrightarrow\boldsymbol{0}_{K\times K}, \end{array} $$
(21)

where \(\hat {\boldsymbol {g}_{i}}\) and \(\hat {\boldsymbol {g}_{j}}\) are the ith and jth column of G, respectively. Thus, we have the SI term of (17)

$$\begin{array}{*{20}l} \sqrt {p_{\mathrm{d}}}\boldsymbol{W}^{T} \hat {\boldsymbol{G}}_{\text{sa}}\boldsymbol{x}_{\mathrm{d}}\overset{a.s.}\longrightarrow\boldsymbol{0}_{K\times 1}, \end{array} $$
(22)

and

$$\begin{array}{*{20}l} \sqrt {p_{\mathrm{d}}}\boldsymbol{W}^{T} {\boldsymbol{\mathcal{E}}}\boldsymbol{x}_{\mathrm{d}}\overset{a.s.}\longrightarrow\boldsymbol{0}_{K\times 1}, \end{array} $$
(23)

for ZF and MRT/MRC linear processing schemes. □

Remark: In the shared-antenna FD massive MU-MIMO system, the decoding matrix W T and the precoding matrix A in the SI term are totally dependent because of the channel reciprocity between uplink and downlink. This is different from the FD relay massive MIMO system where the source-relay and relay-destination channels are independent. According to Lemma 13 in [32] and Theorem 3.7 in [33], the LALP suppression capability is O(1/M 1/2), and it is different from [26] where the LALP suppression capability is O(1/M) for the separate-antenna arrays.

Proposition 2

In the shared-antenna FD massive MU-MIMO system with imperfect channel estimation, if K is fixed and M, the decoded uplink signal vector at the BS is

$$\begin{array}{*{20}l} \boldsymbol{r}\overset{a.s.}\longrightarrow\sqrt{p_{\mathrm{u}}}\boldsymbol{x}_{\mathrm{u}}, \end{array} $$
(24)

for the ZF precoding and decoding and

$$\begin{array}{*{20}l} \hat {\boldsymbol{D}}^{-1} {\boldsymbol{r}}\overset{a.s.}\longrightarrow\sqrt{p_{\mathrm{u}}}\boldsymbol{x}_{\mathrm{u}}, \end{array} $$
(25)

for the MRT/MRC precoding and decoding, where \([\hat {\boldsymbol {D}}]_{kk}=\hat \beta _{k}\) and \(\hat \beta _{k} \triangleq {\tau p_{\mathrm {u}}\beta {_{k}^{2}}\over {\tau p_{\mathrm {u}}\beta _{k} + 1}}\), k=1,…,K.

Proof.

According to (5), (11), (16), and Theorem 1, the results can be obtained. □

3.2 Combined SI suppression

Practically, the number of antennas cannot be infinite, the LALP method cannot ideally suppress the SI, especially for the dependent shared-antenna array. As mentioned above, the SI in the FD massive MU-MIMO system can be estimated by the MMSE through a precoded SI channel training process, thus the known SI can be removed before the LALP suppression. After the SI removal, the remaining interference becomes much weaker, and the LALP can be further adopted to suppress the remaining SI. Therefore, the suppression capability is greatly improved by combining the two suppression methods.

As shown in (17), the term \(\sqrt {p_{\mathrm {d}}}\boldsymbol {W}^{T} {\hat {\boldsymbol {G}}_{\text {sa}}}\boldsymbol {x}_{\mathrm {d}}\) is known by the BS. We can subtract it from the received signal r, so that (17) can be rewritten as

$$ {}\begin{aligned} \hat {\boldsymbol{r}} &= \sqrt {p_{\mathrm{u}}} \boldsymbol{W}^{T}\hat {\boldsymbol{G}}{\boldsymbol{x}_{\mathrm{u}}}+\sqrt {p_{\mathrm{u}}} \boldsymbol{W}^{T}\boldsymbol{\Delta}{\boldsymbol{x}_{\mathrm{u}}} + \sqrt {p_{\mathrm{d}}}\boldsymbol{W}^{T} \boldsymbol{\mathcal{E}}\boldsymbol{x}_{\mathrm{d}}+ \boldsymbol{W}^{T}{\boldsymbol{n}}, \end{aligned} $$
(26)

where \(\hat {\boldsymbol {r}} = \boldsymbol {r}- \sqrt {p_{\mathrm {d}}}\boldsymbol {W}^{T} {\hat {\boldsymbol {G}}_{\text {sa}}}\boldsymbol {x}_{\mathrm {d}}\). The kth decoded signal can then be expressed as

$$\begin{array}{*{20}l} \hat r_{k}=&\sqrt{p_{\mathrm{u}}}\boldsymbol{w}{_{k}^{T}}{\hat {\boldsymbol{g}}_{k}}{x_{k}} + \sqrt{p_{\mathrm{u}}}\sum_{i=1,i\neq k}^{K}{\boldsymbol{w}{_{k}^{T}}}{\hat{\boldsymbol{g}}_{i}}{x_{i}}\\ &+ \sqrt{p_{\mathrm{u}}}\sum_{i=1}^{K}{\boldsymbol{w}{_{k}^{T}}}{\boldsymbol{\delta}_{i}}{x_{i}}+\sqrt{p_{\mathrm{d}}}\sum_{i=1}^{K}{\boldsymbol{w}{_{k}^{T}}}\boldsymbol{{\epsilon}}_{i} x{_{i}^{\mathrm{d}}}+\boldsymbol{w}{_{k}^{T}}{\boldsymbol{n}}, \end{array} $$
(27)

where w k , \(\hat {\boldsymbol {g}}_{k}\), δ i , and ε i are the kth or ith column of W, \(\hat {\boldsymbol {G}}\), Δ, and \(\boldsymbol {\mathcal {E}}\), respectively; x k and \(x{_{i}^{\mathrm {d}}}\) are the kth and ith element of x u and x d, respectively; \(\sqrt {p_{\mathrm {u}}}\boldsymbol {w}{_{k}^{T}}{\hat {\boldsymbol {g}}_{k}}{x_{k}}\) is the desired signal from the kth user; \(\sqrt {p_{\mathrm {u}}}\sum _{i=1,i\neq k}^{K}{\boldsymbol {w}{_{k}^{T}}}{\hat {\boldsymbol {g}}_{i}}{x_{i}}\) is the inter-user interference; \(\sqrt {p_{\mathrm {u}}}\sum _{i= 1}^{K}{\boldsymbol {w}{_{k}^{T}}}{\boldsymbol {\delta }_{i}}{x_{i}}\) is the estimation error from the signal channel G; \(\sqrt {p_{\mathrm {d}}}\sum _{i= 1}^{K}{\boldsymbol {w}{_{k}^{T}}}\boldsymbol {{\epsilon }}_{i} x{_{i}^{\mathrm {d}}}\) is the estimation error from the SI equivalent channel G sa and \(\boldsymbol {w}{_{k}^{T}}\boldsymbol {n}\) is the additive noise.

Theoretically, the SI and other interferences and noise can be suppressed as much as possible, and the achievable rate of users can approach infinity with the increase of the antenna array scale. Practically, tens or hundreds of antennas can be a good approximation of infinity [4, 34].

3.3 Achievable rate

In this subsection, we will discuss the uplink achievable rate of the shared-antenna FD massive MIMO system for the LALP and the combined suppression.

3.3.1 Combined method

From (27), the kth user uplink achievable rate is given as

$$\begin{array}{*{20}l} R_{k}=\mathbb{E}\{\text{log}_{2}(1+\gamma_{k})\}, \end{array} $$
(28)

where γ k is given as

$$\begin{array}{*{20}l} {}\gamma_{k}\,=\,{{p_{\mathrm{u}}\left|\boldsymbol{w}{_{k}^{T}}\hat {\boldsymbol{g}}_{k}\right|^{2}}\over {p_{\mathrm{u}}\!\!\!\sum\limits_{i=1,i\neq k}^{K}\!\!\left|{\boldsymbol{w}}{_{k}^{T}}\!\hat {\boldsymbol{g}}_{i}\right|^{2}\!}\!\,+\,{p_{\mathrm{u}}\!\sum\limits_{i=1}^{K}\!\!\left|{\boldsymbol{w}}{_{k}^{T}}\!{\boldsymbol{\delta}_{i}}\right|^{2}}\!\!\,+\,{p_{\mathrm{d}}\!\sum\limits_{i=1}^{K}\!\left|{\boldsymbol{w}}{_{k}^{T}}\!{\boldsymbol{\epsilon}_{i}}\right|^{2}}\!\!\,+\,{\left\|\boldsymbol{w}{_{k}^{T}}\right\|^{2}}}. \end{array} $$
(29)

3.3.2 LALP only

If we estimate the signal channel and suppress the SI by the LALP method only, we have

$$\begin{array}{*{20}l} r'_{k}=&\sqrt{p_{\mathrm{u}}}\boldsymbol{w}{_{k}^{T}}{\hat {\boldsymbol{g}}_{k}}{x_{k}} + \sqrt{p_{\mathrm{u}}}\sum_{i=1,i\neq k}^{K}{\boldsymbol{w}{_{k}^{T}}}{\hat{\boldsymbol{g}}_{i}}{x_{i}}\\ &+\!\! \sqrt{p_{\mathrm{u}}}\sum_{i=1}^{K}{\boldsymbol{w}{_{k}^{T}}}\!{\boldsymbol{\delta}_{i}}{x_{i}}\,+\,\sqrt{p_{\mathrm{d}}}\sum_{i=1}^{K}{\boldsymbol{w}{_{k}^{T}}}\!\boldsymbol{g}_{\text{sa},i} x{_{i}^{\mathrm{d}}}\,+\,\boldsymbol{w}{_{k}^{T}}{\boldsymbol{n}}, \end{array} $$
(30)

where g sa,i is the ith column of G sa, i=1,...,K; \(\sqrt {p_{\mathrm {d}}}\sum _{i= 1}^{K}{\boldsymbol {w}{_{k}^{T}}}\boldsymbol {{g}}_{\text {sa},i} x{_{i}^{\mathrm {d}}}\) is the SI. The kth user achievable rate with imperfect estimation and the LALP method only can be expressed as

$$\begin{array}{*{20}l} R'_{k}=\mathbb{E}\{\text{log}_{2}(1+\gamma'_{k})\}, \end{array} $$
(31)

where \(\gamma ^{\prime }_{k}\) is given as

$$\begin{array}{*{20}l} {}\gamma^{\prime}_{k}=\!{{p_{\mathrm{u}}\left|\boldsymbol{w}{_{k}^{T}}\hat {\boldsymbol{g}}_{k}\right|^{2}}\over {p_{\mathrm{u}}\!\!\!\!\!\sum\limits_{i=1,i\neq k}^{K}\!\!\!\left|{\boldsymbol{w}}{_{k}^{T}}\!\hat {\boldsymbol{g}}_{i}\right|^{2}}\!\,+\,{p_{\mathrm{u}}\!\!\sum\limits_{i=1}^{K}\!\!\left|{\boldsymbol{w}}{_{k}^{T}}\!{\boldsymbol{\delta}_{i}}\right|^{2}}\!\,+\,{p_{\mathrm{d}}\!\!\sum\limits_{i=1}^{K}\!\!\left|{\boldsymbol{w}}{_{k}^{T}}\!\boldsymbol{g}_{\text{sa},i} \right|^{2}}\!\,+\,{\left\|\boldsymbol{w}{_{k}^{T}}\right\|^{2}}}. \end{array} $$
(32)

3.3.3 Perfect channel estimation

We compare the achievable rate of the imperfect case with the perfect case and verify the accuracy of the channel estimation. If the perfect CSI can be obtained, the signal channel estimation error and the SI terms can be removed from (35). By setting the estimation error as zero in (27), the uplink decoded signal of the kth user with perfect estimation can be obtained as

$$\begin{array}{*{20}l} {r}^{\prime\prime}_{k}=&\sqrt{p_{\mathrm{u}}}\boldsymbol{w}{_{k}^{T}}{{\boldsymbol{g}}_{k}}{x_{k}} + \sqrt{p_{\mathrm{u}}}\sum\limits_{i=1,i\neq k}^{K}{\boldsymbol{w}{_{k}^{T}}}{{\boldsymbol{g}}_{i}}{x_{i}}+\boldsymbol{w}{_{k}^{T}}{\boldsymbol{n}}, \end{array} $$
(33)

where \({r}^{\prime \prime }_{k}=r_{k}-\sqrt {p_{\mathrm {d}}}\sum _{i=1}^{K}{\boldsymbol {w}{_{k}^{T}}}\boldsymbol {g}_{\text {sa},i} x{_{i}^{\mathrm {d}}}\), and w k , g k , and g sa,i are the kth or ith column of W, G, and G sa, respectively; \(\sqrt {p_{\mathrm {u}}}\boldsymbol {w}{_{k}^{T}}{{\boldsymbol {g}}_{k}}{x_{k}}\) is the desired signal from the kth user; \(\sqrt {p_{\mathrm {u}}}\sum _{i=1,i\neq k}^{K}{\boldsymbol {w}{_{k}^{T}}}{{\boldsymbol {g}}_{i}}{x_{i}}\) is the inter-user interference; and \(\sqrt {p_{\mathrm {d}}}\sum _{i= 1}^{K}{\boldsymbol {w}{_{k}^{T}}}\boldsymbol {{g}}_{\text {sa},i} x{_{i}^{\mathrm {d}}}\) is the SI. Therefore, the kth user achievable rate with perfect CSI can be given as

$$\begin{array}{*{20}l} R^{\prime\prime}_{k}=\mathbb{E}\left\{\text{log}_{2}\left(1+\gamma^{\prime\prime}_{k}\right)\right\}, \end{array} $$
(34)

where γ k″ is given as

$$\begin{array}{*{20}l} \gamma^{\prime\prime}_{k}={{p_{\mathrm{u}}\left|\boldsymbol{w}{_{k}^{T}} {\boldsymbol{g}}_{k}\right|^{2}}\over {p_{\mathrm{u}}\sum_{i=1,i\neq k}^{K}\left|{\boldsymbol{w}}{_{k}^{T}} {\boldsymbol{g}}_{i}\right|^{2}}+{\left\|\boldsymbol{w}{_{k}^{T}}\right\|^{2}}}. \end{array} $$
(35)

3.4 Achievable rate lower bounds of the combined method

From (28), we can derive the lower bounds of the achievable rates for the combined method with ZF and MRT/MRC linear processing.

Lemma 1

In the shared-antenna FD massive MU-MIMO system, the random variable \(\tilde g_{i}={{\boldsymbol {w}}{_{k}^{T}}{\hat {\boldsymbol {g}}_{i}}\over \left \|{\boldsymbol {w}}{_{k}^{T}}\right \|}\) is a zero-mean complex Gaussian variable with variance \(\hat \beta _{i}\), which is independent on \({\boldsymbol {w}}{_{k}^{T}}\) if K is fixed and M, where \(\hat \beta _{i}={\tau p_{\mathrm {u}}\beta {_{i}^{2}}\over \tau p_{\mathrm {u}}\beta {_{i}}+1}\), i=1,...,K, ik.

Proof.

Please see Appendix A of [19]. □

Lemma 2

In the shared-antenna FD massive MU-MIMO system, the random variable \(\tilde \delta _{i}={{\boldsymbol {w}}{_{k}^{T}}{\boldsymbol {\delta }_{i}}\over \left \|{\boldsymbol {w}}{_{k}^{T}}\right \|}\) is a zero-mean complex Gaussian variable with variance \(\tilde \beta _{i}\), which is independent on \({\boldsymbol {w}}{_{k}^{T}}\) if K is fixed and M, where \(\tilde \beta _{i}={\beta {_{i}}\over \tau p_{\mathrm {u}}\beta {_{i}}+1}\), i=1,...,K.

Proof.

According to the optimum MMSE estimation property, δ i is independent of \(\hat {\boldsymbol {g}}_{i}\) for i=1,...,K. Similarly, as the proof of Lemma 1, we can obtain the result. □

Lemma 3

In the shared-antenna FD massive MU-MIMO system, the random variable \(\tilde \epsilon _{i}={{\boldsymbol {w}}{_{k}^{T}}{\boldsymbol {\epsilon }_{i}}\over \left \|{\boldsymbol {w}}{_{k}^{T}}\right \|}\) is a zero-mean complex Gaussian variable with variance \(\tilde \rho _{i}\), which is independent on \({\boldsymbol {w}}{_{k}^{T}}\) if K is fixed and M, where \(\tilde \rho _{i}={1\over M}\!\sum _{m=1}^{M}{\rho _{m,i}\over {\tau p_{\mathrm {d}}\rho _{m,i}+1}}\), i=1,...,K.

Proof.

Please see Appendix. □

Proposition 3

With the combined scheme of SI removal and ZF large-scale antenna linear processing, the kth uplink achievable rate in the shared-antenna FD massive MU-MIMO system is lower-bounded by

$$\begin{array}{*{20}l} \underline {R}{{_{k}^{\mathrm{Z}}}}&=\log_{2}\Bigg(1+{{p{_{\mathrm{u}}}(M-K)}\hat\beta{_{k}}\over {p_{\mathrm{u}}\sum_{i=1}^{K}{\tilde \beta_{i}}}+{p_{\mathrm{d}}\sum_{i=1}^{K}\tilde \rho_{i}}+1}\Bigg), \end{array} $$
(36)

where \(\hat \beta _{i}={\tau p_{\mathrm {u}}\beta {_{i}^{2}}\over \tau p_{\mathrm {u}}\beta {_{i}}+1}\), \(\tilde \beta _{i}={\beta {_{i}}\over \tau p_{\mathrm {u}}\beta {_{i}}+1}\), and \(\tilde \rho _{i}={1\over M}\!\sum _{m=1}^{M}{\rho _{m,i}\over {\tau p_{\mathrm {d}}\rho _{m,i}+1}}\).

Proof.

From (15), we have \(\boldsymbol {W}^{T}{\hat {\boldsymbol {G}}}=\boldsymbol {I}_{K}\) and \(\boldsymbol {W}^{T}\boldsymbol {W}^{*}=[({\hat {\boldsymbol {G}}^{H}}\hat {\boldsymbol {G}})^{-1}]^{H}\) for the ZF linear processing, so we can get \(\boldsymbol {w}{_{k}^{T}}\hat {\boldsymbol {g}}_{k}=1\), \(\boldsymbol {w}{_{k}^{T}}\hat {\boldsymbol {g}}_{i}=0\), and \({\left \|\boldsymbol {w}{_{k}^{T}}\right \|^{2}}=[(\hat {\boldsymbol {G}}^{H}\hat {\boldsymbol {G}})^{-1}]_{kk}\). Similarly, as [19], using the convexity of φ(x)=log2(1+x −1) and Jensen’s inequality, from (28), we can obtain

$$ {{}\begin{aligned} R_{k}&\!\ge\!\text{log}_{2}\! \left(\!1+{p_{\mathrm{u}}\over {\mathbb{E}\Big\{{p_{\mathrm{u}}\!\!\sum\limits_{i=1}^{K}\!\!\left|{\boldsymbol{w}}{_{k}^{T}}\!{\boldsymbol{\delta}_{i}}\right|^{2}}\,+\,{p_{\mathrm{d}}\sum\limits_{i=1}^{K}\!\left|{\boldsymbol{w}}{_{k}^{T}}\!{\boldsymbol{\epsilon}_{i}}\right|^{2}}\,+\,\left\|{\boldsymbol{w}}{_{k}^{T}}\right\|^{2}}\Big\}}\right)\\ &\,=\,\text{log}_{2}\!\left(\!1+{p_{\mathrm{u}}\over {\mathbb{E}\Big\{\Big({p_{\mathrm{u}}\!\sum\limits_{i=1}^{K}|{\tilde \delta_{i}}|^{2}}\,+\,{p_{\mathrm{d}}\!\sum\limits_{i=1}^{K}\left|{\tilde \epsilon_{i}}\right|^{2}}\,+\,1\Big)\left\|{\boldsymbol{w}}{_{k}^{T}}\right\|^{2}}\Big\}}\right). \end{aligned}} $$
(37)

From Lemma 2 and Lemma 3, we know that the new variables \(\tilde \delta _{i}={{\boldsymbol {w}}{_{k}^{T}}{\boldsymbol {\delta }_{i}}\over \left \|{\boldsymbol {w}}{_{k}^{T}}\right \|}\) and \(\tilde \epsilon _{i}={{\boldsymbol {w}}{_{k}^{T}}{\boldsymbol {\epsilon }_{i}}\over \left \|{\boldsymbol {w}}{_{k}^{T}}\right \|}\) are both zero-mean Gaussian variables with variances of \(\tilde \beta _{i}\) and \(\tilde \rho _{i}\), respectively, so we have

$$\begin{array}{*{20}l} {}R_{k}\ge\text{log}_{2}\left(1+{p_{\mathrm{u}}\over {\Big({p_{\mathrm{u}}\sum\limits_{i=1}^{K}\tilde\beta_{i}}+{p_{\mathrm{d}}\sum\limits_{i=1}^{K}\tilde\rho_{i}}+1\Big)\mathbb{E}\big\{\left\|{\boldsymbol{w}}{_{k}^{T}}\right\|^{2}\big\}}}\right). \end{array} $$
(38)

Then, by using the property of Wishart matrices \(\mathbb {E}\{\text {tr}(\boldsymbol {M}{^{-1}})\}=K/(M-K)\) (Lemma 2.10) in [34], we have

$$\begin{array}{*{20}l} \mathbb{E}\big\{\left\|{\boldsymbol{w}}{_{k}^{T}}\right\|^{2}\big\}&=\mathbb{E}\big\{[(\hat {\boldsymbol{G}}^{H}\hat{\boldsymbol{G}})^{-1}]_{kk}\big\}\\ &=\hat \beta{_{k}^{-1}}\mathbb{E}\big\{[(\hat {\boldsymbol{H}}^{H}\hat {\boldsymbol{H}})^{-1}]{_{kk}}\big\}\\ &={1\over {K \hat\beta_{k}}}\mathbb{E}\big\{\text{tr}((\hat {\boldsymbol{H}}^{H}\hat {\boldsymbol{H}}){^{-1}})\big\}\\ &={1\over(M-K){\hat\beta{_{k}}}}, \quad \hbox {for}\quad M\ge K+1. \end{array} $$
(39)

Substitute (39) into (38), we can obtain the result. □

Proposition 4

With the combined scheme of SI removal and MRT/MRC large-scale antenna linear processing, the kth uplink achievable rate in the shared-antenna FD massive MU-MIMO system is lower-bounded by

$$\begin{array}{*{20}l} {}\underline{R}{{_{k}^{\mathrm{M}}}}&=\log_{2}\left(1+{{p_{\mathrm{u}}(M-1)\hat \beta_{k}}\over {p_{\mathrm{u}}\left(\sum_{i=1}^{K}{\beta_{i}}-\hat \beta_{k}\right)}+{p_{\mathrm{d}}\sum_{i=1}^{K}\tilde\rho_{i}}+1}\right), \end{array} $$
(40)

where \(\hat \beta _{i}={\tau p_{\mathrm {u}}\beta {_{i}^{2}}\over \tau p_{\mathrm {u}}\beta {_{i}}+1}\) and \(\tilde \rho _{i}={1\over M}\!\sum _{m=1}^{M}{\rho _{m,i}\over {\tau p_{\mathrm {d}}\rho _{m,i}+1}}\).

Proof.

From (15), we have \(\boldsymbol {w}{_{k}^{T}}{\hat {\boldsymbol {g}}_{k}}={1\over M}\left \|{\hat {\boldsymbol {g}}_{k}}\right \|^{2}\) and \(\left \|\boldsymbol {w}{_{k}^{T}}\right \|^{2}={1\over M^{2}}\left \|{\hat {\boldsymbol {g}}_{k}}\right \|^{2}\)for the MRT/MRC linear processing. Similarly, from (28) and by using Jensen’s inequality, we have

$$\begin{array}{*{20}l} {}R_{k}\!\ge\!\text{log}_{2}\!\left(\!1\,+\,{p_{\mathrm{u}}\over \mathbb{E}\left\{{p_{\mathrm{u}}\sum\limits_{i=1,i\neq k}^{K}|{\tilde g_{i}}|^{2}}+{p_{\mathrm{u}}\sum\limits_{i=1}^{K}|{\tilde\delta_{i}}|^{2}}+{p_{\mathrm{d}}\sum\limits_{i=1}^{K}\left|{ \tilde\epsilon_{i}}\right|^{2}}+1 \over \left\|\hat{\boldsymbol{g}}{_{k}}\right\|^{2}\right\}}\right). \end{array} $$
(41)

Then, by using the property of the Wishart matrix (Lemma 2.10) in [34], we get

$$\begin{array}{*{20}l} {}\mathbb{E}\left\{\left\|\hat{\boldsymbol{g}}_{k}\right\|^{-2}\right\}&=\mathbb{E}\left\{(\hat {\boldsymbol{g}}{_{k}^{H}}\hat{\boldsymbol{g}}{_{k}})^{-1}\right\} ={1\over(M\,-\,1){\hat \beta{_{k}}}}, \quad \!\!\hbox {for}\!\quad M\!\ge\! 2. \end{array} $$
(42)

From Lemma 1, Lemma 2, and Lemma 3, by substituting (42) into (41), we obtain the result. □

4 Spectral efficiency and energy efficiency

The SE and EE are important performance metrics for next-generation mobile communication systems. In this section, we discuss the SE and EE of a shared-antenna FD massive MU-MIMO system with the combined SI suppression method.

4.1 SE and EE

4.1.1 SE and EE for the FD

Generally, the system SE can be defined as the sum-rate per site or sum-rate per cell. According to (28), the SE of the SI suppression and channel estimation in the shared-antenna FD massive MU-MIMO can be given as

$$\begin{array}{*{20}l} S{_{\mathrm{f}}}={T-2\tau\over T}\sum_{k=1}^{K}R_{k}. \end{array} $$
(43)

The EE is defined as the SE divided by the total transmit power, so we have

$$\begin{array}{*{20}l} E{_{\mathrm{f}}}={S{_{\mathrm{f}}}\over {Kp}_{\mathrm{u}}}={T-2\tau\over {TKp}_{\mathrm{u}}}\sum_{k=1}^{K}R_{k}. \end{array} $$
(44)

4.1.2 SE and EE for the HD

In comparison, the SE and EE of the HD system can be given as

$$\begin{array}{*{20}l} S_{\mathrm{h}}&= \frac{T-\tau}{2T}{\sum_{k=1}^{K}}\mathbb{E}\left\{\text{log}_{2}\left(1+\gamma^{\mathrm{h}}_{k}\right)\right\}, \end{array} $$
(45)
$$\begin{array}{*{20}l} E{_{\mathrm{h}}}&={T-\tau\over K p_{\mathrm{u}}T}\sum_{k=1}^{K}\mathbb{E}\left\{\text{log}_{2}\left(1+\gamma^{\mathrm{h}}_{k}\right)\right\}, \end{array} $$
(46)

where \(\gamma ^{\mathrm {h}}_{k}\) is given as

$$\begin{array}{l} \gamma^{\mathrm{h}}_{k}={{p_{\mathrm{u}}\left|\boldsymbol{w}{_{k}^{T}}\hat {\boldsymbol{g}}_{k}\right|^{2}}\over {p_{\mathrm{u}}\sum_{i=1,i\neq k}^{K}\left|{\boldsymbol{w}}{_{k}^{T}}\hat {\boldsymbol{g}}_{i}\right|^{2}}+{p_{\mathrm{u}}\sum_{i=1}^{K}\left|{\boldsymbol{w}}{_{k}^{T}}{\boldsymbol{\delta}_{i}}\right|^{2}}+{\left\|\boldsymbol{w}{_{k}^{T}}\right\|^{2}}}. \end{array} $$
(47)

We can see that the SE of the HD system is smaller than that of the FD system when the antenna scale is large enough, because it transmits data only half of the time. In contrast, the EE is slightly higher because there is no SI and fewer training overhead in the HD system.

4.1.3 Lower bounds of SE and EE

Replacing R k by \(\underline {R}{{_{k}^{\mathrm {Z}}}}\) and \(\underline {R}{{_{k}^{\mathrm {M}}}}\), we can obtain the lower bounds of the SE and EE for the shared-antenna FD massive MU-MIMO system with imperfect estimation. By substituting (36) and (40) into (43) and (44), we have

$$\begin{array}{*{20}l} \underline {S}{{_{\mathrm{f}}^{A}}}&={T-2\tau\over T}\sum_{k=1}^{K}\underline R{_{k}^{A}}, \end{array} $$
(48)

and

$$\begin{array}{*{20}l} \underline {E}{{_{\mathrm{f}}^{A}}}&={T-2\tau\over {TKp}_{\mathrm{u}}}\sum_{k=1}^{K}\underline R{_{k}^{A}}, \end{array} $$
(49)

where the superscript A is “Z” or “M”, which represents ZF or MRT/MRC. The lower bounds are in the closed form, so they can be easily used to show the system performance.

4.2 Maximizing SE and EE

4.2.1 Maximizing SE

The SE monotonically grows with the antenna array scale M and the uplink transmit power p u. We suppose that M and p u are fixed and M is large. In this case, only the training length τ affects the SE, and the optimization problem can be formulated as

$$ \begin{array}{ll} \underset{\tau}{\max} \;\quad & {S{_{\mathrm{f}}}}(\tau)\\ \text{s.~t.~} \quad & K \le \tau \le T/2,\\ \end{array} $$
(50)

This problem is neither linear nor convex, and it is even difficult to solve using the linear search because of the expectation operation and the large dimensions of the channels. Fortunately, the SE can be approximated by the lower bounds of (48), because they are very close after the further SI suppression with the combined method, which can be seen in the subsequent simulations in next section. Then, the maximization problem can be approximated as

$$ \begin{array}{ll} \underset{\tau}{\max} \;\quad & \underline{S}{{_{\mathrm{f}}^{A}}}(\tau)\\ \text{s.~t.~} \quad & K \le \tau \le T/2.\\ \end{array} $$
(51)

This problem can be numerically solved by the linear search, the simulation in the next section shows that this solution is global optimal when the antenna scale M is large enough.

4.2.2 Maximizing the EE

The EE of the FD massive MU-MIMO system monotonically increases with M, but not with p u. Because it has the same solution as the SE for τ, here we only focus on p u. The antenna-scale M is large and fixed, so the problem of maximizing the EE can be expressed as

$$ \begin{array}{ll} \underset{p_{\mathrm{u}}}{\max} \;\quad & E{_{\mathrm{f}}}(\;p_{\mathrm{u}})\\ \text{s.~t.~} \quad & 0 < p_{\mathrm{u}} \le p_{0}.\\ \end{array} $$
(52)

Similarly, the EE can be approximated by the lower bound in (49) when the antenna-array scale M is large enough. The maximization problem can be modified as

$$ \begin{array}{cc} \underset{p_{\mathrm{u}}}{\max} \;\quad & \underline E{{_{\mathrm{f}}^{A}}}(\;p_{\mathrm{u}})\\ \text{s.~t.~} \quad &0 < p_{\mathrm{u}} \le p_{0}.\\ \end{array} $$
(53)

It can be numerically solved by the linear search, the simulation in the next section shows that this solution is also global optimal.

4.2.3 Tradeoff of EE versus SE

The maximal EE can be obtained at the point \((\tau ^{*},\; p{{\!~\!}_{\mathrm {u}}^{*}})\). However, the SE decreases monotonically with the decreasing p u. To keep a reasonable uplink throughput, the SE is lower constrained. Therefore, the optimal \(p{{\!~\!}_{\mathrm {u}}^{*}}\) should be constrained by the SE requirement. The optimal value \(p{{\!~\!}_{\mathrm {u}}^{*}}\) is adopted if the minimum SE is satisfied; otherwise, the tradeoff value \(p^{\prime }{{\!~\!}_{\mathrm {u}}^{*}}\),which meets the SE requirement, is used.

5 Numerical and simulation results

In this section, we present the performance results of the shared-antenna FD massive MU-MIMO system. All the users work in FD mode with a single antenna, and the insertion loss of the circulators is neglected. The number of users is given as K=4. Unless stated otherwise, the system parameters are set as follows: The coherent time T of the uplink data transmission is 400 (symbol length), and the training length τ of each channel is 8. The normalized uplink power p u and the downlink power p d are set as 10 dB and 13 dB, respectively. Similarly, as [26], the large-scale path-loss β k (k=1,2,3,4) of the signal channel is set as 0.749, 0.246, 0.125, and 0.635, respectively; The large-scale path-loss β of the SI channel is set as 0.6, and the direct-path SI coefficient c mj is set as arbitrary but deterministic values in a circle on the complex plane with radius 0.33.

5.1 Achievable rate and lower bounds

We obtain the numerical results according to (28), (34), (34), (36), and (40). The SI suppression and the achievable rate for the 2nd user are shown in Fig. 2 and Fig. 3, respectively. We can see that both the large-scale antenna linear processing method and the combined method can suppress the SI as well as the inter-user interferences and noise like [4, 15, 19, 26]. Our combined suppression method significantly outperforms the LALP method. As shown in Fig. 2, at the scale of M=300, the SI level is -14 dB by using the ZF linear processing only. In contrast, the SI is -50 dB by using the combined method, providing an additional 36 dB suppression. Hence, the achievable rates for the imperfect estimation are close to the perfect cases. For example, there are small gaps of approximately 1 bits/s/Hz for the ZF and 0.2 bits/s/Hz for the MRT/MRC with M=50 in Fig. 3. Therefore, a higher data rate can be achieved at the same scale of the antenna array, or fewer antennas are required at the same data rate. For instance, the achievable rate increases from 5.3 bits/s/Hz to 8.5 bits/s/Hz at the antenna scale M=300 for the ZF, while the scale M decreases from about 450 to 200 at the achievable rate of 5.1 bits/s/Hz for the MRT/MRC linear processing.

Fig. 2
figure 2

SI at BS vs. antenna array scale M for the 2nd user

Fig. 3
figure 3

Achievable rate of the 2nd user with perfect/imperfect estimation vs. the antenna array scale M

Figure 4 also shows the achievable rates and their lower bounds for the 2nd and 4th users. We can see that the lower bounds are very close to the simulation results when M is large enough. For example, there is a small gap of approximately 0.2 bits/s/Hz when M is 50 for the MRT/MRC, and the gap is even smaller for the ZF (the 2nd user). Therefore, it is reasonable to approximate the SE with the lower bounds.

Fig. 4
figure 4

Achievable rate lower bounds of the 2nd and 4th users with imperfect estimation vs. antenna array scale M

5.2 SE and EE with SI suppression

Figure 5 and Fig. 6 show the SE and EE of the FD and HD systems versus the antenna array scale according to (43), (44), (45), and (46). We can see that the SE gets larger with the increase of the antenna array scale M for both the HD system and the FD system. The SE of the FD system is almost twice as high as that of the HD, as shown in Fig. 5, when employing our combined suppression method.

Fig. 5
figure 5

SE vs. antenna array scale M for the FD and HD systems

Fig. 6
figure 6

EE vs. antenna array scale M for the FD and HD systems

The EE of the FD system is slightly lower than that of the HD system because of the residual SI in the FD system, but the gaps are very small, so it is worth acquiring twice the SE by losing some EE. Furthermore, the EE grows with the increase of the antenna array scale M, so the performance is much better than that of the traditional FD system when M is large enough.

5.3 Maximizing SE and EE

Figures 7 and 8 show the SE and EE of the FD system based on (51) and (53). The scale M is supposed to be fixed and large enough. As shown in Fig. 7 and Fig. 8, both the SE and EE get higher at first and then turn lower. The approximate SE and EE are very close to the real values for large M, for example, M=500. Since it is inefficient and time-consuming to search the optimal values with the Monte Carlo simulations, a good choice is to search the optimal values using the approximation. Simulations show that the SE increases when τ is small (τ<10), and decreases when τ is large, this is because the channels can be trained and estimated more sufficiently with the increase of τ. However, the data transmission time will be shortened if τ is too big. The similar results can be seen for the EE in Fig. 8, because the SE increases quadratically with p u when p u is small, while it increases logarithmically when p u is big. The SE or EE increases with the antenna array scale M, because more SI can be suppressed and less training time/uplink transmit power is required when M is larger.

Fig. 7
figure 7

SE vs. training length τ for the FD system

Fig. 8
figure 8

EE versus uplink transmit power p u for the FD system

6 Conclusion

We introduced and analyzed a full-duplex massive MU-MIMO system with a single shared-antenna array at the BS. Either the ZF or MRT/MRC method can be adopted to linearly process signals, and a novel training scheme via the precoded SI channel is proposed to estimate the SI with a much lower dimension. We proved that the SI can vanish by using the large-scale antenna linear processing method even if the SI/signal channels are not perfectly estimated and the uplink/downlink channels completely depend on each other. By combining the SI removal and the LALP method, we show that the SI can be sufficiently suppressed. We also derived the closed-form lower bounds of the achievable rates and showed that they are very close to the simulation results. With the combination of the SI suppression, the “large enough” number of antennas can be scaled down, and the FD massive MU-MIMO system significantly outperforms the half-duplex system in terms of SE.

7 \thelikesection Appendix

7.1 \thelikesubsection Derivations of (6) and (7)

From (3), we have

$$\begin{array}{*{20}l} {\boldsymbol{G}{_{s}^{H}}{\boldsymbol{G}_{s}}\over M} &= {(\bar {\boldsymbol{G}}{_{s}^{H}} + \tilde {\boldsymbol{G}}{_{s}^{H}})(\bar {\boldsymbol{G}}{_{s}} + \tilde {\boldsymbol{G}}{_{s}})\over M}\\ &= {(\bar {\boldsymbol{G}}{_{s}^{H}}\bar {\boldsymbol{G}}{_{s}} + \tilde {\boldsymbol{G}}{_{s}^{H}}\bar {\boldsymbol{G}}{_{s}} + \bar {\boldsymbol{G}}{_{s}^{H}}\tilde {\boldsymbol{G}}{_{s}} + \tilde {\boldsymbol{G}}{_{s}^{H}}\tilde {\boldsymbol{G}}{_{s}})\over M}. \end{array} $$
(54)

Since the direct-path interference is suppressed by the physical isolation and analog domain cancelation, the direct-path channel coefficients c mj are not infinity. Because all of the entries c mj of \(\bar {\boldsymbol {G}}{_{s}}\) are bounded and all of the entries of \(\tilde {\boldsymbol {G}}{_{s}}\) are i.i.d. Gaussian random variables with zero mean, we obtain \({[{\bar {\boldsymbol {G}}{_{s}^{H}}\tilde {\boldsymbol {G}}{_{s}}}]_{mj}\over M} \approx 0\) when M is large enough, so we have

$$\begin{array}{*{20}l} {\boldsymbol{G}{_{s}^{H}}{\boldsymbol{G}_{s}}\over M} & \approx {(\bar {\boldsymbol{G}}{_{s}^{H}}\bar {\boldsymbol{G}}{_{s}} + \tilde {\boldsymbol{G}}{_{s}^{H}}\tilde {\boldsymbol{G}}{_{s}})\over M} \approx {\bar {\boldsymbol{D}}{_{s}}} + {\tilde {\boldsymbol{D}}{_{s}}}. \end{array} $$
(55)

From (6), we have

$$\begin{array}{*{20}l} {}{1\over M} \text{tr}({\boldsymbol{G}{_{s}^{H}}{\boldsymbol{G}_{s}}}) &\approx \text{tr}({\bar {\boldsymbol{D}}{_{s}}} + {\tilde {\boldsymbol{D}}{_{s}}})={1\over M}\sum_{m=1}^{M}\sum_{j=1}^{M}\left|{c_{mj}}\right|^{2} + \beta M\\ &\le \left|c_{max}\right|^{2}M + \beta M= O (M), \end{array} $$
(56)

so we have

$$\begin{array}{*{20}l} \text{tr}({\boldsymbol{G}{_{s}^{H}}{\boldsymbol{G}_{s}}}) = O (M^{2}). \end{array} $$
(57)

Hence, we can obtain (6) and (7) when M is large enough.

7.2 \thelikesubsection Proof of Proposition 1

The ith column g sa,i of matrix G sa can be written as

$$\begin{array}{*{20}l} \boldsymbol{g}_{\text{sa},i}=\boldsymbol{G}_{\mathrm{s}} \boldsymbol{a}_{i}, \end{array} $$
(58)

where a i is the ith column of A, so the mth entry of g sa,i can be given as

$$\begin{array}{*{20}l} g_{\text{sa},mi}=\boldsymbol{g}_{\mathrm{s},m} \boldsymbol{a}_{i}, \end{array} $$
(59)

where g s,m is the mth row of G s.

For the ZF precoding, \(\boldsymbol {a}_{i}= {1\over M}\alpha _{\text {ZF}}\beta {_{i}^{-\frac {1}{2}}}\boldsymbol {h}_{i}^{*}\), so we have

$$\begin{array}{*{20}l} g_{\text{sa},mi}={1\over M}\alpha_{\text{ZF}}\beta{_{i}^{-\frac{1}{2}}}\sum_{j=1}^{M}\left(c_{mj}+\sqrt{\beta}h_{\mathrm{s},mj}\right)h_{ij}^{*}. \end{array} $$
(60)

Hence, the variance of mth entry of g sa,i can be given as

$$\begin{array}{*{20}l} &\mathbb{E}\left\{\left|g_{\text{sa},mi}\right|^{2}\right\}={1\over M^{2}}\alpha_{\text{ZF}}^{2}\beta{_{i}^{-1}}\\ &\times \mathbb{E}\!\left\{\!\sum_{j=1}^{M}\left(c_{mj}\,+\,\sqrt{\beta}h_{\mathrm{s},mj}\right)h_{ij}^{*}\!\sum_{j=1}^{M}\left(c_{mj}^{*}\,+\,\sqrt{\beta}h_{\mathrm{s},mj}^{*}\right)h_{ij}\!\right\}\\ &={1\over M^{2}}\alpha_{\text{ZF}}^{2}\beta_{i}^{-\!1}\!\sum_{j=1}^{M}\left(\left|c_{mj}\right|^{2}\,+\,\beta\right). \end{array} $$
(61)

Similarly, we can obtain

$$\begin{array}{*{20}l} \mathbb{E}\left\{\left|g_{\text{sa},mi}\right|^{2}\right\}=\alpha_{\text{MRT}}^{2}\beta_{i}\!\sum_{j=1}^{M}\left(\left|c_{mj}\right|^{2}\,+\,\beta\right), \end{array} $$
(62)

for the MRT precoding.

7.3 \thelikesubsection Proof of Lemma 3

The ith column ε i of matrix \(\boldsymbol {\mathcal {E}}\) can be written as

$$\begin{array}{*{20}l} \boldsymbol{\epsilon}_{i} = \boldsymbol{g}_{\text{sa},i}-\hat{\boldsymbol{g}}_{\text{sa},i}. \end{array} $$
(63)

Note that the correlation between w k and ε i is zero because w k and g sa,i are independent for ik, and the correlation disappears for i=k when M. According to the optimum estimation of MMSE, ε i is a Gaussian random vector, and the variance of the mth entry for ε i is \(\rho _{m,i}\over {\tau p_{\mathrm {d}} \rho _{m,i}+1}\). Similarly, from the Lyapunov central limit theorem [35] and Proposition 1, we can derive that the variable \(\tilde \epsilon _{i}={{\boldsymbol {w}_{k}^{T}\boldsymbol {\epsilon }_{i}}\over \left \|\boldsymbol {w}_{k}^{T}\right \|}\) is a zero-mean complex Gaussian variable, and the variance is

$$\begin{array}{*{20}l} \mathbb{E}\left\{\left|\tilde \epsilon_{i}\right|^{2}\right\}={1\over M}\sum_{j=1}^{M}{\rho_{m,i}\over {\tau p_{\mathrm{d}} \rho_{m,i}+1}}. \end{array} $$
(64)

References

  1. A Sabharwal, P Schniter, D Guo, DW Bliss, S Rangarajan, R Wichman, In-band full-dulex wireless: Challenges and opportunities. IEEE J. Sel. Areas Commun. 32(9), 1637–1652 (2014).

    Article  Google Scholar 

  2. S Hong, J Brand, JI Choi, M Jain, J Mehlman, S Katti, P Levis, Applications of self-interference cancellation in 5G and beyond. IEEE Commun. Mag. 52(2), 114–121 (2014).

    Article  Google Scholar 

  3. EG Larsson, O Edfors, F Tufvesson, TL Marzetta, Massive MIMO for next generation wireless systems. IEEE Commun. Mag. 52(2), 186–195 (2014).

    Article  Google Scholar 

  4. F Rusek, D Persson, BK Lau, EG Larsson, TL Marzetta, O Edfors, F Tufvesson, Scaling up MIMO: Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 30(1), 40–60 (2013).

    Article  Google Scholar 

  5. Z Zhang, X Chai, K Long, AV Vasilakos, L Hanzo, Full duplex techniques for 5G networks: Self-interference cancellation, protocol design, and relay selection. IEEE Commun. Mag. 53(5), 128–137 (2015).

    Article  Google Scholar 

  6. X Jia, P Deng, L Yang, H Zhu, Spectrum and energy efficiencies for multiuser pairs massive MIMO systems with full-duplex amplify-and-forward relay. IEEE Access. 3:, 1907–1918 (2015).

    Article  Google Scholar 

  7. TL Marzetta, Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans. Wireless Commun. 9(11), 3590–3600 (2010).

    Article  Google Scholar 

  8. JI Choi, M Jain, K Srinivasan, P Levis, S Katti, in Proc. ACM MobiCom On. Achieving single channel, full duplex wireless communications (Chicago, 2010), pp. 1–12.

  9. S Goyal, P Liu, SS Panwar, RA Difazio, R Yang, E Bala, Full duplex cellular systems: Will doubling interference prevent doubling capacity?IEEE Commun. Mag. 53(5), 121–127 (2015).

    Article  Google Scholar 

  10. KM Thilina, H Tabassum, E Hossain, DI Kim, Medium access control design for full duplex wireless systems: Challenges and approaches. IEEE Commun. Mag. 53(5), 112–120 (2015).

    Article  Google Scholar 

  11. Y Liao, K Bian, L Song, Z Han, Full–duplex MAC protocol design and analysis. IEEE Commun. Lett. 19(7), 1185–1188 (2015).

    Article  Google Scholar 

  12. L Wang, F Tian, T Svensson, D Feng, M Song, S Li, Exploiting full duplex for device-to-device communications in heterogeneous networks. IEEE Commun. Mag. 53(5), 146–152 (2015).

    Article  Google Scholar 

  13. M Heino, D Korpi, T Huusari, E Antonio-Rodriguez, et al, Recent advances in antenna design and interference cancellation algorithms for in-band full duplex relays. IEEE Commun. Mag. 53(5), 91–101 (2015).

    Article  Google Scholar 

  14. L Song, Y Li, Z Han, Resource allocation in full-duplex communicaitons for future wireless networks. IEEE Wireless Commun. 22(4), 88–96 (2015).

    Article  Google Scholar 

  15. J Hoydis, ST Brink, M Debbah, Massive MIMO in the UL/DL of cellular networks: How many antennas do we need?IEEE J. Sel. Areas Commun. 31(2), 160–171 (2013).

    Article  Google Scholar 

  16. X Rao, VKN Lau, Distributed compressive CSIT estimation and feedback for FDD multi-user massive MIMO systems. IEEE Trans. Signal Process. 62(12), 3261–3271 (2014).

    Article  MathSciNet  Google Scholar 

  17. B Lee, J Choi, J Seol, DJ Love, B Shim, Antenna grouping based feedback compression for FDD-based massive MIMO systems. IEEE Trans. Commun. 63(9), 3261–3274 (2015).

    Article  Google Scholar 

  18. L Lu, GY Li, AL Swindlehurst, A Ashikhmin, R Zhang, An overview of massive MIMO: Benefits and challenges. IEEE J. Sel. Topics Signal Process. 8(5), 742–758 (2014).

    Article  Google Scholar 

  19. HQ Ngo, EG Larsson, TL Marzetta, Energy and spectral efficiency of very large multiuser MIMO systems. IEEE Trans. Commun. 61(4), 1436–1449 (2013).

    Article  Google Scholar 

  20. X Pang, W Hong, T Yang, L Li, Design and implementation of an active multibeam antenna system with 64 RF channels and 256 antenna elements for massive MIMO application in 5G wireless communications. China Commun. 11(11), 16–23 (2014).

    Article  Google Scholar 

  21. TL Marzetta, in Proc. IEEE Asilomar Conference on Signals, Systems and Computers (ACSSC06) On. How much training is required for multiuser MIMO? (Pacific Grove, 2006), pp. 359–363.

  22. T Riihonen, S Werner, R Wichman, Mitigation of loopback self-interference in full-duplex MIMO relays. IEEE Trans. Signal Process. 59(12), 5983–5993 (2011).

    Article  MathSciNet  Google Scholar 

  23. E Everett, A Sahai, A Sabharwal, Passive self-interference suppression for full-duplex infrastructure nodes. IEEE Trans. Wireless Commun. 13(2), 680–694 (2014).

    Article  Google Scholar 

  24. D Bharadia, E McMilin, S Katti, in Proc. ACM SIGCOMM On. Full duplex radio (Hong Kong, 2013), pp. 375–386.

  25. M Duarte, A Sabharwal, V Aggarwal, et al, Design and characterization of a full-duplex multiantenna system for WiFi networks. IEEE Trans. Veh. Technol. 63(3), 1160–1177 (2013).

    Article  Google Scholar 

  26. HQ Ngo, HA Suraweera, M Matthaiou, EG Larsson, Multipair full-duplex relaying with massive arrays and linear processing. IEEE J. Sel. Areas Commun. 32(9), 1721–1737 (2014).

    Article  Google Scholar 

  27. P Xing, J Liu, C Zhai, X Wang, L Zheng, Self-interference suppression for the full-duplex wireless communication with large-scale antenna. Trans. Emerg. Telecom. Technol. 27(6), 764–774 (2016).

    Article  Google Scholar 

  28. Z Zhang, Z Chen, M Shen, B Xia, Spectral and energy efficiency of multi-pair two-way full-duplex relay systems with massive MIMO. IEEE J. Sel. Areas Commun. 34(4), 848–863 (2016).

    Article  Google Scholar 

  29. M Duarte, C Dick, A Sabharwal, Experiment-driven characterization of full-duplex wireless systems. IEEE Trans. Wireless Commun. 11(12), 4296–4307 (2012).

    Article  Google Scholar 

  30. S Haykin, Adaptive Filter Theory, 4th edn. (Prentice-Hall, Upper Saddle River, 2002).

    MATH  Google Scholar 

  31. H Yang, TL Marzetta, Performance of conjugate and zero-forcing beamforming in large-scale antenna systems. IEEE J. Sel. Areas Commun. 31(2), 172–179 (2013).

    Article  Google Scholar 

  32. J Hoydis, Random matrix theory for advanced communication systems, Ph.D. dissertation (Supelec Univ., France, 2012).

    Google Scholar 

  33. R Couillet, M Debbah, Random matrix methods for wireless communications, 1st edn. (Cambridge Univ. Press, New York, 2011).

    Book  MATH  Google Scholar 

  34. AM Tulino, S Verdu, Random matrix theroy and wireless communications. Foundations Trends Commun. Inf. Theory. 1(1), 1–182 (2004).

    Article  MATH  Google Scholar 

  35. RJ Serfling, Approximation Theorems of Mathematical Statistics, 2nd edn. (John Wiley & Sons, New York, 1980).

    Book  MATH  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (61371188), Research Fund for the Doctoral Program of Higher Education (20130131110029), China Postdoctoral Science Foundation (2014M560553), Special Funds for Postdoctoral Innovative Projects of Shandong Province (201401013).

Competing interests

The authors declare that they have no competing interests.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ju Liu.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xing, P., Liu, J., Zhai, C. et al. Self-interference suppression with imperfect channel estimation in a shared-antenna full-duplex massive MU-MIMO system. J Wireless Com Network 2017, 18 (2017). https://doi.org/10.1186/s13638-017-0805-7

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13638-017-0805-7

Keywords