 Research
 Open Access
 Published:
Pilot allocation scheme based on coalition game for TDD massive MIMO systems
EURASIP Journal on Wireless Communications and Networking volume 2019, Article number: 60 (2019)
Abstract
Pilot contamination is a major factor that restricts the system performance of time division duplex (TDD) massive multipleinput multipleoutput (MIMO) systems. Reasonable pilot allocation is an effective method to reduce pilot contamination. However, most of the existing pilot allocation methods focus on cells of the same size, a regular shape, an equal number of users per cell, and/or independent fading channels. When they are generalized or modified to actual scenes (in which cells have irregular shapes, users are randomly distributed, and channels are correlated fading channels), they are inflexible and cannot guarantee the target performance requirement. Considering both flexibility of pilot allocation and system performance, this paper introduces the idea of a coalition game into pilot allocation and proposes a pilot allocation scheme based on the coalition game. Different utility functions are analyzed under correlated fading channels; the adjustment principle of coalition structure and the coalition formation algorithm are given. Simulation results show that the proposed scheme can be used in any actual scene as well as guarantee the target performance requirement.
Introduction
The research background of this paper
With the appearance of smartphones and tablets, wireless communication requires increasingly higher communication rates. Therefore, many technologies have been proposed to improve the communication rate of 5G, and massive multipleinput multipleoutput (MIMO) is one of these technologies. Compared with traditional wireless networks, massive MIMO significantly enhances the spectral and power efficiency [1].
However, the excellent performance of massive MIMO depends on the accuracy of channel estimation. For conventional channel estimation in time division duplex (TDD) massive MIMO systems, users transmit pilot signals in the uplink, and base stations (BSs) receive the pilot signal and estimate channel state information (CSI). Then, BSs can use the CSI for uplinking received signal detection and downlinking transmission precoding. Therefore, the accuracy of channel estimation is one of the key factors that affect the system performance of uplink and downlink. In pilotbased channel estimation, the number of pilots is finite and limited by the channel coherence time, so mobile users in different cells might have to reuse the same pilot. Pilot reuse seriously affects the channel estimation accuracy at BSs and further influences the system performance of uplink and downlink, where the influence is known as pilot contamination [2]. Recent studies have shown that pilot contamination is one of the key factors that affect the performance of TDD massive MIMO systems [1,2,3,4]. Therefore, many methods have been proposed to reduce pilot contamination, such as pilot contamination precoding [4], timeshifted pilots [5, 6], spatial domainbased pilot allocation [7], design and transmit power control of pilot sequences [8, 9], and pilot reuse [10, 11].
For instance, reference [4] shows that the performance of TDD massive MIMO system is limited by pilot contamination. Furthermore, reference [4] considers the uplink precoding and downlink precoding (called pilot contamination precoding) based on slow fading coefficients to reduce pilot contamination. Papers [5, 6] discuss the timeshifted pilot method in hexagonal cells, which means cells are divided into different groups, and cells in different groups transmit pilots in different time slots; analyses show that this method can effectively reduce pilot contamination. However, this method assumes that the number of users in each cell is fixed and equal. In addition, it requires strict system synchronization, making implementation complex and difficult. Paper [7] proposes a spatial domain method with the main idea that users with high channel orthogonality can use the same pilot. This method also assumes that cells have a regular shape and that the number of users in each cell is fixed and equal. Furthermore, this method is highly complex because BSs need to know the CSIs between all users and all BSs to carry out the search for highly orthogonal channels. Paper [8] gives a design for nonorthogonal pilot signals, and paper [9] proposes a pilot power allocation method based on cell grouping to alleviate pilot contamination. The methods in [8, 9] both assume that cells have a regular shape and the number of users per cell is fixed. Paper [10] presents a pilot reuse method based on cell grouping and analyzes the system performance of different pilot reuse factors. In paper [11], a fractional pilot reuse method is proposed in which users are classified into cellcenter users and celledge users, where the cellcenter users reuse the same pilot subset and the celledge users use the remaining pilots according to cell grouping.
Studies [12, 13] try to introduce game theory into pilot allocation to reduce pilot contamination. However, the pilot allocation methods in [12, 13] both consider the games between cells. Simply because of the games between cells, the suitable application scene of [12, 13] is still fixed and has an equal number of users in each cell, although reference [13] attempts to discuss irregularly shaped cells.
Reviewing the studies in [4,5,6,7,8,9,10,11,12,13], reasonable pilot allocation can effectively reduce pilot contamination and improve system performance. However, in order to facilitate theoretical analysis and simulation, most of these pilot allocation methods assume that cells have a regular shape (most are hexagons), the number of users per cell is equal, and/or channels are independent. However, in reality, the cell shapes are affected by various factors, such as landforms and buildings, and do not always have a regular shape. Moreover, mobile users are randomly distributed in reality, so it is impossible for the number of users in each cell to be equal or fixed. When the number of base station antennas is very large, it is difficult to ensure the distance between the antennas, and it is difficult to ensure that all channels are independent.
Most of these pilot allocation methods (in references [4,5,6,7,8,9,10,11,12,13]) assume that the number of users per cell is equal or fixed for two main reasons. First, when the number of users in each cell is equal or fixed, it is easy to allocate pilots. For example, in references [4,5,6,7,8,9,10,11,12,13], in order to avoid the use of the same pilot in adjacent cells (pilot contamination is large in this case), these pilot allocation methods [4,5,6,7,8,9,10,11,12,13] always reserve a certain number of pilots for each cell. If the number of users in each cell is equal or fixed, it is convenient to estimate the number of pilots per cell, so that when pilots are allocated, the pilots of adjacent cells can be differentiated. Second, when the number of users in each cell is equal, the dimension of the channel matrix in each cell is equal, which is convenient for merging in mathematical operation in theoretical analysis.
If we modify these methods [4,5,6,7,8,9,10,11,12,13] to the actual scenes in which all users are randomly distributed and the number of users per cell is unequal, the system performance will not be guaranteed. For example, if the number of users is large in a cell, using these methods [4,5,6,7,8,9,10,11,12,13], preallocated pilots are insufficient, and pilot reuse will cause large pilot contamination. On the other hand, if the number of users in a cell is small, using these methods [4,5,6,7,8,9,10,11,12,13], the pilot resource cannot be fully utilized, and this will cause the waste of pilot resources and then affect overall performance. Obviously, this point reflects that these methods [4,5,6,7,8,9,10,11,12,13] lack flexibility.
The assumption of a regular cell shape is also beneficial to pilot allocation. When the cell shape is regular, cells can be grouped first. For example, as shown in Fig. 1a, taking pilot reuse factor 3 as an example, the cells in the picture have three types of background pattern. The cells with the same background pattern are grouped together and reuse the same pilot subset (the total pilot set has been divided into three orthogonal pilot subsets). Cells with different background patterns use different orthogonal pilot subsets. References [9,10,11] use similar cell grouping pilot allocation methods. However, the actual situation is that the cell shape is irregular. The pilot multiplexing in 11 cells shown in Fig. 1b is based on the pilot allocation method in Fig. 1a. It can be seen that in Fig. 1b, because of the irregular cell shape, the same pilot subset is used in adjacent cells, which will cause serious pilot contamination; thus, the system performance cannot be guaranteed.
So we can conclude that if we modify these methods [4,5,6,7,8,9,10,11,12,13] to the actual scenes (in which cells have irregular shapes, users are randomly distributed), the system performance will not be guaranteed.
Papers [14, 15] propose different greedy algorithms to allocate pilots in the scene of correlated fading channels. When compared with the methods in [4,5,6,7,8,9,10,11,12,13], greedy algorithms in papers [14, 15] are flexible and can be easily generalized or modified to the actual scenes, but they still cannot guarantee the target performance requirement. Paper [14] presents a statistical greedy pilot scheduling algorithm, which focuses on users in a single cell and searches users with small channel correlations to use the same pilot. This algorithm does not consider the application scenario of multicells and the correlation of the channels between different users and different BSs, so it cannot reduce the interference among different cells and thus cannot guarantee system performance in multicell scenarios. That is to say, although this algorithm can be generalized to multicell scenarios of arbitrary BS locations and an unequal number of users per cell, it cannot guarantee target performance satisfaction. Thus, it is essentially different from our work; the proposed pilot allocation scheme in our work can be used in arbitrary scenarios as well as guarantee system performance.
Paper [15] proposes a greedy approach; its main idea is selecting one user from each cell (there are P users per cell) to form a user set during each search, and users in the same set use the same pilot. The principle of user selection is to minimize the sum mean square error of channel estimation for the users using the same pilot. The greedy approach assumes the number of users per cell is equal; it cannot be used in the case of different numbers of users per cell, so it is different from our work and we have to make modifications in simulations. In addition, it only considers the summeansquare error of users with the same pilot and does not consider the total performance of all users; furthermore, it is a onetime search for all users and cannot achieve global optimization, whereas the proposed pilot allocation scheme in our work considers the total performance of all users and performs a cyclic search for all users. Therefore, in comparison, the greedy approach in [15] cannot meet the needs of total performance, whereas our scheme can meet overall performance requirements through cyclic search.
The main contents of this paper
In order to find the flexible pilot allocation method that can be used in any actual scenes and provide performance guarantee, we introduce the idea of a coalition game into pilot allocation and propose a pilot allocation scheme based on the coalition game in this paper. We consider the game between users and divide users into different subcoalitions. Because this coalition game is among users, the proposed pilot allocation scheme is more flexible and can be used in any actual scene. Because the purpose of each coalition adjustment is to improve the target performance (average utility function), the target performance can be satisfied through the cyclic search of coalition adjustment. The contributions of this paper are as follows.

1)
To be applied in arbitrary actual scene (with irregular shape cells, unequal number of users per cell, and correlated fading channels) and provide target performance guarantee, a novel pilot allocation scheme based on the coalition game is proposed. For the game between users, the coalition structure is defined, different utility functions are analyzed, and the coalition formation algorithm is given.

2)
The implementation method of the proposed pilot allocation scheme is presented.

3)
Simulations are developed to show that the proposed scheme can be used in any actual scene, effectively reduce pilot contamination, and provide a certain target performance guarantee. Moreover, the comparisons of computational complexity between different schemes are discussed.
The remainder of this paper is organized as follows. The system model for the massive MIMO system is described in Section 2. Different utility functions are analyzed under correlated fading channels in Section 3, including mean square error of channel estimation (MSECE), mean square error of signal detection (MSESD), received signaltointerferenceplusnoise ratio (SINR), and spectrum efficiency (SE). In Section 4, the definition of the coalition game is given, and then the coalition structure, the adjustment principle of coalition structure, and the condition of final stability are defined. After that, the coalition formation algorithm is given, a pilot allocation scheme based on the coalition game is proposed, and the implementation method of the proposed scheme is discussed. In Section 5, simulations are developed with respect to different utility functions, and the proposed pilot allocation scheme is compared with the greedy approach in reference [15] and the random coalition scheme. Section 6 concludes this paper. The proof of theorem is given in the Appendices 1, 2, and 3.
System model
Consider the massive MIMO system as shown in Fig. 2. There are L cells and N users in a fixed area, where we focus on the uplink. Each cell is assigned an index in the set ℒ = {1, 2, ⋯, L}. The kth user in the jth cell is denoted by user (j, k). Assume cell j has K_{j} active users, thus \( \sum \limits_{j=1}^L{K}_j=N \). All users form the set \( \mathcal{N} \) with \( \mathcal{N}=\left\{\left(1,1\right),\cdots \left(1,{K}_1\right),\cdots \left(j,1\right),\cdots \left(j,{K}_j\right),\cdots \left(L,1\right),\cdots \left(L,{K}_L\right)\right\} \). Each cell has one BS, and one cell and its BS use the same index number, i.e., BS j is in the jth cell. Each BS is equipped with an array of M antennas, the shape of antennas at BSs is uniformly spaced linear array (ULA) [15], and each user is equipped with a single antenna. Assume that BSs are subject to a random distribution and that the users are subject to another random distribution, the distributions of BSs and users are independent. Each user selects and accesses its nearest BS according to the strength of the received BS signals, so the number of users in each cell (K_{j}, j ∈ {1, 2, ⋯, L}) is not equal.
Assume \( {z}_{jk}^u\in {\mathrm{R}}^2 \)and \( {z}_l^b\in {\mathrm{R}}^2 \) are the locations of user (j, k) and BS l respectively, and z_{jk} is known at all BSs. d_{l}(z_{jk}) is the distance from user (j, k) to BS l. The channel between user (j, k) and BS l is defined as h_{ljk} ∈ ℂ^{M × 1}, and we consider correlated Rayleigh fading channels [15,16,17]. (For other types of correlated channel models and independent channel models, our pilot allocation scheme is also applicable.) The channel h_{ljk} is the superposition of F arriving paths,
where \( a\left({\theta}_{ljk}^{(f)}\right)\in {\mathrm{\mathbb{C}}}^{M\times 1} \) is the antenna steering vector corresponding to angleofarrivals (AoAs) θ_{ljk} ∈ [0, 2π), f is the path index, and \( {\alpha}_{ljk}^{(f)}\overset{i.i.d.}{\sim}\mathcal{CN}\left(0,\kern0.5em {\beta}_{ljk}\right) \) is the channel coefficient of the fth path, β_{ljk} denotes the largescale fading coefficient and \( {\beta}_{ljk}=\delta {\left\Vert {z}_{jk}^u{z}_l^b\right\Vert}_2^{v} \), where ν is the path loss exponent, δ is a constant of channel coefficient, and we let δ be a constant for arbitrary path f and arbitrary channel h_{ljk} in our simulations.
The expression of a(θ) is
where D is the interval of antenna spacing at the BS, and λ is the signal wavelength.
As an approximation, we consider a ring of radius r_{s} comprising many scatters around the user. In this case [16], θ_{ljk} obeys a uniform distribution on \( \left({\theta}_{ljk}^{\mathrm{min}},{\theta}_{ljk}^{\mathrm{max}}\right) \), where \( {\theta}_{ljk}^{\mathrm{min}}<{\theta}_{ljk}^{\mathrm{max}}\in \left[0,\pi \right) \), and
The channel covariance matrix is \( {\mathbf{R}}_{ljk}=\mathbb{E}\left\{{\boldsymbol{h}}_{ljk}{\boldsymbol{h}}_{ljk}^H\right\}=\frac{1}{F}{\beta}_{ljk}\sum \limits_{f=1}^F{\int}_{\theta_{ljk}^{\mathrm{min}}}^{\theta_{ljk}^{\mathrm{max}}}a\left({\theta}_{ljk}^{(f)}\right){\left[a\left({\theta}_{ljk}^{(f)}\right)\right]}^Hd{\theta}_{ljk}^{(f)} \), where \( \mathbb{E}\left\{\cdot \right\} \) denotes the expectation.
Analysis of different utility functions
The uplink transmission timefrequency resources are divided into frames consisting of T_{c} seconds and B_{c} Hz; thus, each frame contains Q = T_{c}B_{c} transmission symbols (P pilot symbols and QP data symbols, pilot symbols and data symbols are transmitted in different time slots). Assume channels between BSs and users have constant channel response within a frame but vary between frames. Therefore, the number of available orthogonal pilot sequences is P.
Assume that the coalition game divides all users into P subcoalitions (N > P). P pilots are allocated to the P subcoalitions respectively. That is, users in the same subcoalition use the same pilot, users in different subcoalitions use different pilots, and P pilots are mutually orthogonal. The analysis of utility function must be carried out under a certain coalition structure, so we first give the coalition structure definition in definition 1.
Definition 1 (Coalition structure). User set \( \mathcal{N} \) is divided into P disjoint subcoalitions \( {\mathcal{S}}_1,{\mathcal{S}}_2,\cdots, {\mathcal{S}}_P \), and \( {\cup}_{n=1}^P{\mathcal{S}}_n=\mathcal{N} \), \( {\cap}_{n=1}^P{S}_n=\varnothing \). Coalition structure \( \mathcal{S} \) is the coalitional division state, i.e., \( \mathcal{S}=\left\{{\mathcal{S}}_1,{\mathcal{S}}_2,\cdots, {\mathcal{S}}_P\right\} \).
Analysis of MSECE
MSECE means the mean square error of channel estimation. During the uplink pilot transmission, all users transmit pilots at the same time, thus the received pilot signal at the BS l is
where w_{jk} ∈ ℂ^{P × 1} is the pilot sequence used by user (j, k), and \( \mathbb{E}\left\{{\boldsymbol{w}}_{jk}^H{\boldsymbol{w}}_{jk}\right\}=1 \). t_{jk} is the pilot transmit power of user (j, k). In order to avoid the nearfar issue in the uplink, here we consider the transmit power control and let t_{jk} = T/β_{jjk}, where Tis a design parameter of transmit power. \( {\boldsymbol{N}}_l^{\mathrm{pilot}}\in {\mathrm{\mathbb{C}}}^{M\times P} \) is the equivalent additive white Gaussian noise (AWGN) at the receiver with independent and identically distributed (i.i.d.) elements of \( \mathcal{CN}\left(0,{\sigma}^2\right) \), where σ^{2}is the variance of each element in noise .
After decorrelation and power normalization of \( {\boldsymbol{y}}_l^{\mathrm{pilot}} \), BS l can obtain the channel observation of h_{ljk}, which is
where \( {\Lambda}_{\mathcal{S}}\left(j,k\right) \) is one of the subcoalitions in coalition structure \( \mathcal{S} \), and user (j, k) belongs to the subcoalition \( {\Lambda}_{\mathcal{S}}\left(j,k\right) \).
According to the definition of minimum mean square error (MMSE) estimation and expressions (14), (15) in reference [14], the MMSE estimation of channel h_{ljk} at BS l, which is based on the channel observation \( {\overline{\overline{\boldsymbol{h}}}}_{ljk} \), is given by
where \( {\mathbf{C}}_{ljk}=\sum \limits_{\left(m,n\right)\in {\Lambda}_{\mathcal{S}}\left(j,k\right)}\kern0em \frac{t_{mn}}{t_{jk}}{\mathbf{R}}_{lmn}+\frac{\sigma^2}{t_{jk}}{\mathbf{I}}_M \) is the covariance matrix of \( {\overline{\overline{\boldsymbol{h}}}}_{ljk} \).
Similar to expression (7), we can obtain the channel estimation of h_{jjk} (the channel between user (j, k) and its own BS j), which is
with covariance \( {\mathbf{R}}_{{\widehat{h}}_{jjk}}={\mathbf{R}}_{jjk}{\mathbf{C}}_{jjk}^{1}{\mathbf{R}}_{jjk} \). Thus, we can determine that the channel estimation error is \( {\tilde{\boldsymbol{h}}}_{jjk}={\boldsymbol{h}}_{jjk}{\widehat{\boldsymbol{h}}}_{jjk} \). From the orthogonality principle of MMSE estimation [14, 18], channel estimation error \( {\tilde{\boldsymbol{h}}}_{jjk} \) is independent of \( {\widehat{\boldsymbol{h}}}_{jjk} \) and the covariance of \( {\tilde{\boldsymbol{h}}}_{jjk} \) is \( {\mathbf{R}}_{{\tilde{h}}_{jjk}}={\mathbf{R}}_{jjk}{\mathbf{R}}_{{\widehat{h}}_{jjk}}={\mathbf{R}}_{jjk}{\mathbf{R}}_{jjk}{\mathbf{C}}_{jjk}^{1}{\mathbf{R}}_{jjk} \). Note that \( {\widehat{\boldsymbol{h}}}_{jjk} \) and \( {\mathbf{R}}_{{\tilde{h}}_{jjk}} \) are also mean and covariance of h_{jjk} conditioned on \( {\overline{\overline{\boldsymbol{h}}}}_{ljk} \), respectively [18].
The mean square error of channel estimation (MSECE) \( {\widehat{\boldsymbol{h}}}_{jjk} \) means the sum of squares estimation errors for each element of h_{jjk} (you can refer to expression (18) in [14]); so the MSECE of \( {\widehat{\boldsymbol{h}}}_{jjk} \) is \( \mathrm{MSE}\hbox{} {\mathrm{CE}}_{jk}={\varepsilon}_{{\tilde{h}}_{jjk}}^{\mathrm{MSE}\hbox{} \mathrm{CE}}=\mathbb{E}\left\{{\left({\tilde{\boldsymbol{h}}}_{jjk}\right)}^H{\tilde{\boldsymbol{h}}}_{jjk}\right\}=\mathbb{E}\left\{\mathrm{tr}\left\{{\tilde{\boldsymbol{h}}}_{jjk}{\tilde{\boldsymbol{h}}}_{jjk}^H\right\}\right\}=\mathrm{tr}\left\{{\mathbf{R}}_{{\tilde{h}}_{jjk}}\right\} \). We define the sum MSECE of all users as the sum of the estimation mean square error (MSE) of the channels between each user and its own BS. Thus, the average MSECE per user is
Analysis of MSESD and received SINR
MSESD means the mean square error of signal detection. During the uplink data transmission, all users send their data to all BSs at the same time, and the received signal at BS j is
where x_{mn} ∈ ℂ^{1 × 1} represents the symbol transmitted from user (m, n), \( \mathbb{E}\left\{{\left{x}_{mn}\right}^2\right\}=1 \), \( {\boldsymbol{y}}_j^{\mathrm{data}}={\left[{y}_{j1},\cdots, {y}_{jM}\right]}^T\in {\mathrm{\mathbb{C}}}^{M\times 1} \), \( {\boldsymbol{n}}_j^{\mathrm{data}} \) is AWGN, and \( {\boldsymbol{n}}_j^{\mathrm{data}}\sim \mathcal{CN}\left(0,{\sigma}^2{\mathbf{I}}_M\right) \), \( {\boldsymbol{g}}_{jmn}=\sqrt{t_{mn}^d}{\boldsymbol{h}}_{jmn} \).
Let \( {\boldsymbol{H}}_{jm}=\left[{\boldsymbol{h}}_{jm1},{\boldsymbol{h}}_{jm2},\cdots, {\boldsymbol{h}}_{{jm K}_m}\right]\in {\mathrm{\mathbb{C}}}^{M\times {K}_m} \), and \( {\boldsymbol{T}}_m^d=\operatorname{diag}\left(\sqrt{t_{m1}^d},\cdots, \sqrt{t_{mK_m}^d}\right) \), \( {\boldsymbol{x}}_m={\left[{x}_{m1},\cdots, {x}_{mK_m}\right]}^T \), \( {\boldsymbol{G}}_{jm}={\boldsymbol{H}}_{jm}{\boldsymbol{T}}_m^d=\left[{\boldsymbol{g}}_{jm1},\cdots, {\boldsymbol{g}}_{{jm K}_m}\right] \); thus, the received signal at BS j can be rewritten as \( {\boldsymbol{y}}_j^{\mathrm{data}}=\sum \limits_{m=1}^L{\boldsymbol{H}}_{jm}\kern0.5em {\boldsymbol{T}}_m^d{\boldsymbol{x}}_m+{\boldsymbol{n}}_j^{\mathrm{data}}=\sum \limits_{m=1}^L{\boldsymbol{G}}_{jm}{\boldsymbol{x}}_m+{\boldsymbol{n}}_j^{\mathrm{data}} \).
To distinguish the symbol of each user, the BS j selects the combination matrix \( {\boldsymbol{W}}_{jj}^{\mathrm{scheme}}\in {\mathrm{\mathbb{C}}}^{M\times {K}_j} \), which is directly multiplied with the received signal, and the processed data of all K_{j} users in the jth cell can be written as
Considering the maximum ratio combining (MRC), zeroforcing combining (ZFC), and MMSE schemes of the received signal \( {\boldsymbol{y}}_j^{\mathrm{data}} \), thus
where \( {\widehat{\boldsymbol{H}}}_{jj}=\left[{\widehat{\boldsymbol{h}}}_{jj1},{\widehat{\boldsymbol{h}}}_{jj2},\cdots, {\widehat{\boldsymbol{h}}}_{{jj K}_j}\right]\in {\mathrm{\mathbb{C}}}^{M\times {K}_j} \), and \( {\boldsymbol{T}}_j^d=\operatorname{diag}\left(\sqrt{t_{j1}^d},\cdots, \sqrt{t_{jK_j}^d}\right) \), \( {\boldsymbol{D}}_{jj}=\operatorname{diag}\left(\frac{1}{\delta_{jj1}},\cdots, \frac{1}{\delta_{{jj K}_j}}\right) \) is the diagonal matrix parameter that is set for \( \mathbb{E}\left\{{\left({\boldsymbol{w}}_{jjk}^{MRC}\right)}^H{\boldsymbol{h}}_{jjk}\right\}=1 \), and \( {\delta}_{jjk}=\mathrm{tr}\left\{{\mathbf{R}}_{{\widehat{h}}_{jjk}}\right\} \).
Theorem 1 For a given coalition structure \( \mathcal{S} \), the mean square error of signal detection (MSESD) for user (j, k) at BS j for MRC, ZFC, and MMSE schemes are given by the following expressions (13), (14) and (15) respectively,
where \( {\boldsymbol{Q}}_1={\mathbb{E}}_{\boldsymbol{h}}\left\{{\boldsymbol{w}}_{jjk}^{zfc}{\left({\boldsymbol{w}}_{jjk}^{zfc}\right)}^H\right\} \), \( {\boldsymbol{Q}}_2={\mathbb{E}}_{\boldsymbol{h}}\left\{{\widehat{\boldsymbol{h}}}_{jjk}{\left({\boldsymbol{w}}_{jjk}^{zfc}\right)}^H\right\} \), \( {\boldsymbol{Q}}_3={\mathbb{E}}_{\boldsymbol{h}}\left\{{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}\right)}^{1}\right\} \), \( {\boldsymbol{Q}}_4={\mathbb{E}}_{\boldsymbol{h}}\left\{{\sigma}^2{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}+{\sigma}^2{\left({\boldsymbol{T}}_j^d\right)}^{\hbox{} 2}\right)}^{1}\right\} \), \( {\boldsymbol{Q}}_5={\mathbb{E}}_{\boldsymbol{h}}\left\{{\boldsymbol{w}}_{jjk}^{\mathrm{mmse}}{\left({\boldsymbol{w}}_{jjk}^{\mathrm{mmse}}\right)}^H\right\} \), \( {\boldsymbol{Q}}_6={\mathbb{E}}_{\boldsymbol{h}}\left\{{\widehat{\boldsymbol{h}}}_{jjk}{\left({\boldsymbol{w}}_{jjk}^{\mathrm{mmse}}\right)}^H\right\} \), \( {\boldsymbol{Q}}_7={\mathbb{E}}_{\boldsymbol{h}}\left\{{\left({\boldsymbol{W}}_{jj}^{\mathrm{mmse}}\right)}^H{\boldsymbol{W}}_{jj}^{\mathrm{mmse}}\right\} \), \( {\boldsymbol{w}}_{jjk}^{\mathrm{zfc}} \) is the kth column of the matrix \( {\boldsymbol{W}}_{jj}^{\mathrm{zfc}}={\widehat{\boldsymbol{H}}}_{jj}{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}\right)}^{1} \), and \( {\boldsymbol{w}}_{jjk}^{\mathrm{mmse}} \) is the kth column of the matrix \( {\boldsymbol{W}}_{jj}^{\mathrm{mmse}}={\widehat{\boldsymbol{H}}}_{jj}{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}+{\sigma}^2{\left({\boldsymbol{T}}_j^d\right)}^{\hbox{} 2}\right)}^{1} \).
For a given coalition structure \( \mathcal{S} \), the uplink received SINR for user (j, k) at BS j for MRC, ZFC, and MMSE schemes are given by the expressions (17), (18) and (19) respectively.
Proof The proofs of Theorem 1 for MRC, ZFC, and MMSE schemes are given in Appendices 1, 2, and 3, respectively.
Analysis of spectrum efficiency
Based on previous theoretical analyses, according to Shannon capacity and referring to expression (11) in [19], the uplink spectrum efficiency (SE) of user (j,k) is
where the \( {\mathrm{SINR}}_{jk}^{\mathrm{scheme}} \) for MRC, ZFC, and MMSE schemes are given by expressions (17), (18), and (19), respectively.
Pilot allocation scheme based on a coalition game
As mentioned in Section 3, we use the idea of partition form [20] in the coalition game, and all users will be divided into P subcoalitions (N > P). P pilots are allocated to the P subcoalitions respectively. That is, users are players in the coalition game. Users in the same subcoalition use the same pilot and interact with each other in channel estimation. Users in different subcoalitions use different pilots, and they do not affect each other in channel estimation. Therefore, there are mutually influencing and mutually constraining relationships between users, which coincides with the idea of the coalition game. For this reason, we introduce the idea of the coalition game into pilot allocation. The definition of the coalition game is given as follows.
Definition 2 (Coalition game). The coalition game is \( \mathbf{\mathcal{G}}=\left\langle \mathcal{N},{\left\{{u}_{j,k}\right\}}_{\left(j,k\right)\in \mathcal{N}}\right\rangle \), where \( \mathcal{N} \) is the set of game players; that is to say, users are players in the coalition game. u_{j, k}is the utility function of player (j, k) (user (j, k)), and it is also called the payoff function in game theory.
In this coalition game, each player must belong to a subcoalition, and the definition of coalition structure is given in definition 1.
The purpose of the coalition game is to optimize utility function, so different performance metrics can be used as the utility function in the coalition game. In this paper, we use the MSECE (\( {u}_{j,k}\left(\mathcal{S}\right)=\mathrm{MSE}\hbox{} {\mathrm{CE}}_{jk}\left(\mathcal{S}\right) \)), MSESD (\( {u}_{j,k}\left(\mathcal{S}\right)=\mathrm{MSE}\hbox{} {\mathrm{SD}}_{jk}^{\mathrm{scheme}}\left(\mathcal{S}\right) \)), received SINR (\( {u}_{j,k}\left(\mathcal{S}\right)={\mathrm{SINR}}_{jk}^{\mathrm{scheme}}\left(\mathcal{S}\right) \)), and SE (\( {u}_{j,k}\left(\mathcal{S}\right)={\mathrm{SE}}_{jk}^{\mathrm{scheme}}\left(\mathcal{S}\right) \)) as the utility functions in the coalition game when we want to satisfy the corresponding system performance. The optimization of a certain utility function is also the minimization of pilot contamination corresponding to this performance.
According to the partition form definition in the coalition game theory [20], we need to describe three elements: (1) the coalition structure, (2) the adjustment principle of the coalition structure, and (3) the final stability of the coalition structure.
During each adjustment of the coalition structure, one user leaves its subcoalition and joins another subcoalition. This adjustment reduces interference of the original subcoalition of the user and increases the interference of the newly joined subcoalition.
The definition of coalition structure is given in definition 1. To benefit each user, the coalition structure needs to continuously adjust until final stability. For a certain adjustment, assume player (j, k) leaves its subcoalition \( {\Lambda}_{\mathcal{S}}\left(j,k\right) \) and joins another subcoalition \( {\mathcal{S}}_i \), where \( {\mathcal{S}}_i\in \left\{\left\{{\mathcal{S}}_1,{\mathcal{S}}_2,\cdots, {\mathcal{S}}_P\right\}\backslash {\Lambda}_{\mathcal{S}}\Big(j,k\Big)\right\} \). That is, the entire coalition structure changes from the original \( \mathcal{S} \) into a new structure \( {\mathcal{S}}^{\Phi} \). We describe this adjustment as \( \mathcal{S}\overset{\left(j,k\right)}{\to }{\mathcal{S}}^{\Phi} \). According to the individual stability concept in the coalition game [21], the adjustment of a user (j, k) leaving its subcoalition \( {\Lambda}_{\mathcal{S}}\left(j,k\right) \) and joining another subcoalition \( {\mathcal{S}}_i \) is permissible if this adjustment can strictly improve its utility function and does not reduce the average utility function of all users. Therefore, we have definition 3.
Definition 3 (Principle of adjustment). The adjustment \( \mathcal{S}\overset{\left(j,k\right)}{\to }{\mathcal{S}}^{\Phi} \) is permissible if \( {u}_{j,k}\left({\mathcal{S}}^{\Phi}\right) \) is better than \( {u}_{j,k}\left(\mathcal{S}\right) \) and \( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left({\mathcal{S}}^{\Phi}\right) \) is not worse than \( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left(\mathcal{S}\right) \) .
In definition 3, that \( {u}_{j,k}\left({\mathcal{S}}^{\Phi}\right) \) is better than \( {u}_{j,k}\left(\mathcal{S}\right) \) means \( {u}_{j,k}\left({\mathcal{S}}^{\Phi}\right)<{u}_{j,k}\left(\mathcal{S}\right) \) for MSECE and MSESD, and \( {u}_{j,k}\left({\mathcal{S}}^{\Phi}\right)>{u}_{j,k}\left(\mathcal{S}\right) \) for SINR and SE. That \( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left({\mathcal{S}}^{\Phi}\right) \) is not worse than \( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left(\mathcal{S}\right) \) means \( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left({\mathcal{S}}^{\Phi}\right)\le \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left(\mathcal{S}\right) \) for MSECE and MSESD, and \( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left({\mathcal{S}}^{\Phi}\right)\ge \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left(\mathcal{S}\right) \) for SINR and SE.
From definition 3, we can see that any user that wants to join another subcoalition should be beneficial to itself without damaging the average interest of all users. Thus, the average utility function will be optimal at the final stability.
This adjustment comes to an end as defined in definition 4, which is similar to the definition of final stability in [21].
Definition 4 (Final stability). The whole coalition structure \( \mathcal{S} \) is stable if there is no permission of adjustment \( \mathcal{S}\overset{\left(j,k\right)}{\to }{\mathcal{S}}^{\Phi} \) for all users \( \left(j,k\right)\in \mathcal{N} \) and all subcoalitions, or α ≥ τ, or the average utility function (\( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left(\mathcal{S}\right) \)) is not worse than U for a given performance index U (where U ≥ U_{min}for MSECE and MSESD, U ≤ U_{min}for SINR and SE).
In definition 4, assume the average utility function is equal to U_{min} when there is no permission of adjustment \( \mathcal{S}\overset{\left(j,k\right)}{\to }{\mathcal{S}}^{\Phi} \) for all users \( \left(j,k\right)\in \mathcal{N} \) and all subcoalitions. That is to say, U_{min} is the global optimal value of average utility function. In addition, for the given performance index of the average utility function, \( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left(\mathcal{S}\right) \) is not worse than U means \( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left({\mathcal{S}}^{\Phi}\right)\le U \) for MSECE and MSESD, and \( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left({\mathcal{S}}^{\Phi}\right)\ge U \) for SINR and SE.
According to the discussions above, the coalition formation algorithm is given in algorithm 1. In definition 4 and algorithm 1, α is the current number of searches and τ is the limitation of total number of searches, which is used to prevent the coalition formation algorithm from falling into an endless loop.
We can see from algorithm 1 that each adjustment of the coalition structure further optimizes the average utility function. Taking the utility function MSECE as an example, the purpose of every searching in algorithm 1 is to reduce the average MSECE, and every adjustment of the coalition structure can reduce the average MSECE. According to the conditions of final stability in definition 4, algorithm 1 is convergent. Therefore, when the condition of final stability is that \( \frac{1}{N}\sum \limits_{\left(l,m\right)\in \mathcal{N}}{u}_{l.m}\left(\mathcal{S}\right) \) is not worse than U for a given performance index U (where U ≥ U_{min} for MSECE), this algorithm can search continuously until the performance U is satisfied, it is a state of finite convergence. Therefore, this pilot allocation scheme can provide performance guarantee.
We go on to discuss the implementation method of the proposed pilot allocation scheme. The utility function calculations require the distance of each user to each BS. If algorithm 1 operates at BSs, all BSs should cooperate and exchange many messages. Therefore, both the calculations of utility functions and the processes of algorithm 1 can be set at the upper layer of BSs, i.e., completed by the base station controller (BSC). Each BS sends its own location and user locations to BSC. BSC calculates distances, utility functions, and the coalition formation algorithm, and then BSC sends the results of the coalition formation algorithm (i.e., the results of pilot allocation) to each user through BS forwarding. This can greatly reduce the complexity of the whole system. Although the computational complexity of this scheme is still somewhat high, with the general improvement of computing power, processing delays of BSC can be reduced. In addition, BSC needs to send the results of pilot allocation to each user; this will increase the signaling delays of the system. However, this is similar to the case that the receiver transmits the channel estimation back to the sender, only the transmission delay between BSs and BSC is added; however, the transmission delay of this kind of optical fiber line is very small, so we think this signal transmission can be hopefully achievable. Stated thus, the proposed scheme can be hopefully applied in actual situations.
Simulation results and discussions
In this section, simulation results are provided to illustrate the system performance of the proposed pilot allocation scheme based on the coalition game. All experiments are Monte Carlo simulations (based on the results of theoretical analysis of different utility functions) under correlated Rayleigh fading channels. Then, we compare the proposed pilot allocation scheme with the pilot allocation scheme based on the greedy approach in [15] and the random coalition scheme (i.e., randomized pilot allocation) in terms of different performance and computational complexity.
The greedy approach in [15] assumes that there are P users per cell and there are P orthogonal pilots. The greedy approach selects one user from each cell to form a user set during each search, all users are grouped into P sets by P searches. Users in the same set use the same pilot, and users in the different sets use different pilots. The principle of user selection is to minimize the sum mean square error of channel estimation for the users using the same pilot.
The greedy approach in [15] assumes the number of users per cell is equal, and it cannot be used in the case of different number of users per cell, so we have to make modifications to facilitate the comparison. When simulating this greedy approach, \( \eta =\left\lceil \frac{N}{P}\right\rceil \) (η represents the average number of users in each set) users are selected to form a set during each search, regardless of whether the selected users are in different cells. In addition, in all the following simulations, this greedy approach uses the MMSE combining scheme. Taking the MSECE as an example, the performance comparison of the greedy approach before and after the modification is shown in Fig. 3 (when using parameters in Table 1). We can see that the values of MSECE are very close before and after modification. As to other performance (like MSESD, SINR, SE), the results are similar according to our simulations; thus, we omit these results to save space. So, we can compare our scheme with the modified greedy approach, and the performance improvement can be seen in Figs. 4, 5, 6, 7, 8, 9, 10, 11, and 12. The greedy approach in Figs. 4, 5, 6, 7, 8, 9, 10, 11, and 12 is referring to the greedy approach after modification.
Simulation parameters are given in Table 1. We use a fixed area of 2000 m × 2000 m; BSs are subject to a uniform distribution, and the users are subject to a uniform distribution; and the distributions of BSs and users are independent. Each user selects and accesses its nearest BS. About the simulation parameters in Table 1, we refer to the simulation parameters in references [7, 16, 17]. For the setting of the ring radius of scatters, we consider a nonlineofsight (nonLOS) scenario with 50 scattering paths as in references [7, 16, 17].
In the following simulations, for all coalition game pilot allocation schemes, we show the optimal results of the coalition formation algorithm, i.e., the condition of the final stability is the no permission of adjustment \( \mathcal{S}\overset{\left(j,k\right)}{\to }{\mathcal{S}}^{\Phi} \). However, we know that the proposed pilot allocation scheme based on the coalition game can provide a certain performance guarantee through a user cyclic search. For a given performance (average utility function) index U, the proposed scheme can search continuously until the performance index U is satisfied. Taking the utility function of MSECE as an example, Fig. 4 shows the relationship between the average MSECE and the number of searches (α) for a certain location of BSs and users. We can see that, with the increase of α, the average MSE decreases, and whenα ≥ 22, the average MSE reaches its minimum value. To satisfy the average MSECE performance index of 10^{−4}, we should set the total number of search limitation τ to greater than or equal to 14.
Figure 5 shows the average MSECE per user versus the transmit power T with P = 4. The unit of transmission power T is the watt in decibels (i.e., dBW). The “coalition game, MSECE” means the pilot allocation scheme based on the coalition game when utility function is MSECE. The “greedy approach” means the greedy approach in [15]. We can see that the coalition game for MSECE can achieve lower average MSE than the greedy approach and random coalition. Although the object function of the greedy approach in [15] is also MSECE, the greedy approach is worse than the coalition game for MSECE, because the greedy approach in [15] is a onetime search for users, but the coalition game for MSECE is a cyclic search for users. In addition, the greater the transmit power, the more obvious the performance improvements.
Figure 6 shows the average MSECE per user versus the number of available pilots P with T = 20 dBW. We can see that the coalition game for MSECE can achieve lower average MSECE than the greedy approach. The performance of random alliance is the worst because of random allocation of pilots. When the number of pilots becomes larger, the average MSECE of all schemes becomes smaller. This is because the larger the number of pilots, the larger the number of subcoalitions and the smaller the interference between users.
Figure 7 shows the average MSESD per user versus the transmit power T with P = 4. The “coalition game, MSESD, MMSE,” “coalition game, MSESD, ZFC,” and “coalition game, MSESD, MRC” means the pilot allocation scheme based on the coalition game when utility function is MSESD with the MMSE scheme, when utility function is MSESD with the ZFC scheme, and when utility function is MSESD with the MRC scheme, respectively. It can be seen that the coalition game for MSESD with MMSE is better than the coalition game for MSESD with ZFC, and the coalition game for MSESD with ZFC is better than the coalition game for MSESD with MRC. That is because the MRC scheme cannot reduce interference and noise, and the ZFC scheme can reduce interference, but cannot reduce noise, whereas the MMSE scheme can reduce both interference and noise. When the transmit power is small, the performance of the greedy approach is slightly worse than that of the coalition game for MSESD with MMSE and better than that of the coalition game for MSESD with ZFC and the coalition game for MSESD with MRC. Because the purpose of the greedy approach in [15] is to minimize the sum of normalized MSECE of all users, its performance is better than the coalition game for MSESD with ZFC and the coalition game for MSESD with MRC. The greedy approach is worse than the coalition game for MSESD with MMSE because the greedy approach is a onetime search for users, but the coalition game for MSESD with MMSE is a cyclic search for users. With the increase of transmit power T, the advantage of the coalition game for MSESD with MMSE relative to the greedy approach is increasingly obvious. In addition, for each combining scheme (MMSE, ZFC, or MRC), the coalition game is better than random coalition.
Figure 8 shows the average received SINR per user versus the transmit power T with P = 4. The utility functions of all coalition game schemes in Fig. 8 are received SINR. When transmit power T becomes larger, the average received SINR of the coalition game schemes and greedy approach become larger. But random coalition schemes change little with the increase of T, that is because the interference between all users is increasing when transmit power T becomes larger. From Fig. 8, we can see that the coalition game for received SINR with MMSE is better than the greedy approach, and other results are similar to Fig. 7.
Figure 9 shows the average SE per user versus transmit power T with P = 4. The utility functions of all coalition game schemes in Fig. 9 are SE. We can see that when transmit power T becomes larger, the average SE of the coalition game schemes and greedy approach become larger. When comparing these different schemes, the results are similar to Figs. 7 and 8.
Figure 10 shows the average MSESD per user versus the number of available pilots P with T = 20 dBW. The utility functions of all coalition game schemes in Fig. 10 are MSESD. It can be seen that the coalition game for MSESD with MMSE is better than the coalition game for MSESD with ZFC, and the coalition game for MSESD with ZFC is better than the coalition game for MSESD with MRC. All these results are the same as Fig. 7. In addition, the larger the number of available pilots, the smaller the average MSESD of all schemes. This is because when the number of pilots becomes larger, fewer users use the same pilot; thus, the interference between users can be reduced, and the system performance can be improved.
Figure 11 shows the average received SINR per user versus the number of available pilots with T = 20 dBW. The utility functions of all coalition game schemes in Fig. 11 are received SINR. We can get the similar results as in Fig. 8.
Figure 12 shows the average SE per user versus the number of available pilots with T = 20 dBW. The utility functions of all coalition game schemes in Fig. 12 are SE. The results in Fig. 12 are different from those in Fig. 9. When the number of pilots becomes larger, the average SE of all schemes first becomes larger and then becomes smaller. That is because when the number of pilots is very small, the number of subcoalitions is small and the interference between users is large, so the SE is small. On the other hand, when the number of pilots becomes very large, the number of data symbols in each frame becomes small and the coefficient in front of the log in expression (16) becomes small, so the SE is also small. That is to say, there exists the optimal number of pilots to maximize SE. We can see in Fig. 12 that the optimal number of pilots is four for all schemes.
Table 2 shows the complexity (in terms of the multiplication times) during each search for different schemes. It is assumed that calculating the inversion of an Mdimensional matrix needs about M^{3} multiplications, \( \eta =\left\lceil \frac{N}{P}\right\rceil \) represents the average number of users in each subcoalition, and \( \xi =\left\lceil \frac{N}{L}\right\rceil \) represents the average number of users in each cell, where ⌈^{∗}⌉ means the round up number of *. The expressions in Table 2 are obtained according to the multiplication times of Eqs. (9),(13)~(19), and we have assumed that R_{jlm} is known according to largescale channel fading and the locations of users and BSs and that \( {\mathbf{R}}_{{\widehat{h}}_{jlm}} \), Q_{1}, Q_{2}, Q_{3}, Q_{4}, Q_{5}, Q_{6}, and Q_{7} are obtained through statistical average of channel estimation and calculated once during each search. o(NM^{2}) represents higher order infinitesimal of NM^{2}.
Table 3 shows the average number of searches (i.e., \( \overline{\alpha} \)) for each scheme, and also means the calculation times of average utility function in coalition formation algorithm. The values in Table 3 are obtained through Monte Carlo simulations, and they are the average number of searches for 10,000 user and BS locations. For all coalition game pilot allocation schemes in Table 3, the condition of the final stability is the no permission of adjustment \( \mathcal{S}\overset{\left(j,k\right)}{\to }{\mathcal{S}}^{\Phi} \).
In Table 3, the average number of search of random coalition is 0 because random coalition means randomized pilot allocation and there is no adjustment of the coalition structure. The average number of search of the greedy approach in [15] is 12 because the greedy approach is a onetime search for all users and the number of users is 12. The average number of search of the coalition game for different utility functions (MSESD, received SINR, or SE) is very close, and they are all close to 36, which means each user is searched about 3 times on average. The average number of searches of the coalition game for MSECE is 24, which means each user is searched about 2 times on average.
The total complexity of each scheme is the result of the multiplication of the corresponding items in Tables 2 and 3. Combining Tables 2 and 3, when comparing the total multiplication times, we can see that random coalition has the lowest complexity, the greedy approach is higher than random coalition, the coalition game for MSECE is higher than the greedy approach, and the coalition game for MSESD, received SINR, and SE are very close and has the highest complexity. The total multiplication times of the coalition game for different utility functions (MSESD, received SINR, or SE) are also very close, and they are higher than the coalition game for MSECE and the greedy approach, although their average search time does not differ much. That is because the multiplication times during each search of the coalition game for different utility functions (MSESD, received SINR, or SE) are larger than the coalition game for MSECE and the greedy approach. In a word, the total multiplication times of the coalition game for MSECE and the greedy approach is much higher than that of random coalition, but lower than that of the coalition game for MSESD, received SINR, and SE.
Conclusions
A pilot allocation scheme based on the coalition game is proposed for TDD massive MIMO systems. This scheme can be suitable for any actual scene in which cells are irregularly shaped (BSs distribute randomly), numbers of users per cell are not equal (users distribute randomly), and channels are correlated fading channels, and this scheme can provide performance guarantee. The definition of the coalition structure is given, and different utility functions are analyzed. To benefit each user, the adjustment principle of the coalition structure is set, and the coalition formation algorithm is provided. According to the coalition formation algorithm, the pilot allocation scheme based on the coalition game (for different utility functions) is simulated and compared with the greedy approach in [15] and random coalition. And then, the computational complexity in terms of multiplication times for different pilot allocation schemes is compared. Simulation results show that when considering the performance comparison of different pilot allocation schemes, the results for different performance metrics (different utility functions) are similar. The coalition game for each utility function with MMSE achieves the best performance with high computational complexity. For each utility function and for each combining scheme (MMSE, ZFC, or MRC), the coalition game is better than random coalition. When comparing the performance of different combining schemes, with regard to the coalition game for each utility function, MMSE is better than ZFC and ZFC is better than MRC. The performance of the greedy approach in [15] is between the coalition game for each utility function with MMSE and the coalition game for each utility function with ZFC. In addition, we obtain the optimal number of pilots to maximize SE through simulations.
Abbreviations
 AoA:

Angleofarrival
 AWGN:

Additive white Gaussian noise
 BS:

Base station
 BSC:

Base station controller
 CSI:

Channel state information
 MIMO:

Multipleinput multipleoutput
 MMSE:

Minimum mean square error
 MRC:

Maximum ratio combining
 MSE:

Mean square error
 MSECE:

Mean square error of channel estimation
 MSESD:

Mean square error of signal detection
 SE:

Spectrum efficiency
 SINR:

Signaltointerferenceplusnoise ratio
 TDD:

Time division duplex
 ULA:

Uniformly spaced linear array
 ZFC:

Zeroforcing combining
References
 1.
L. Lu, G.Y. Li, A.L. Swindlehurst, et al., An overview of massive MIMO: benefits and challenges. IEEE J. Sel. Top. Sign. Proces. 8(5), 742–758 (2014).
 2.
Z. Gong, C. Li, F. Jiang, Pilot contamination mitigation strategies in massive MIMO systems. IET Commun. 11(16), 2403–2409 (2017).
 3.
T.L. Marzetta, Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans. Wirel. Commun. 9(11), 3590–3600 (2010).
 4.
A. Ashikhmin, T. Marzetta, Pilot Contamination Precoding in MultiCell Large Scale Antenna Systems. Proceedings International Symposium on Information Theory (ISIT) (2012), pp. 1137–1141.
 5.
M.H.C. Garcia, J. Luo, in Proceedings of 2018 IEEE Wireless Communications and Networking Conference (WCNC). Timeshifted pilots multiplexed with uplink data and unequal power allocation (IEEE, Barcelona, 2018), pp. 1–6.
 6.
W. Chang, Y.K. Hua, S.F. Liao, Partial overlapped timeshifted pilots for massive MIMO systems. IEEE Commun. Lett. 21(11), 2480–2483 (2017).
 7.
H. Echigo, T. Ohtsuki, W. Jiang, Y. Takatori, in Proceedings of 2017 IEEE Global Communications Conference (GLOBECOM). Fair pilot assignment based on AOA and pathloss with location information in massive MIMO (IEEE, Singapore, 2017), pp. 1–6.
 8.
H. Wang, W. Zhang, Y. Liu, Q. Xu, P. Pan, On design of nonorthogonal pilot signals for a multicell massive MIMO system. IEEE Wireless Commun. Lett. 4(2), 129–132 (2015).
 9.
P. Liu, S. Jin, T. Jiang, Q. Zhang, M. Matthaiou, Pilot power allocation through user grouping in multicell massive MIMO systems. IEEE Trans. Commun. 65(4), 1561–1574 (2017).
 10.
H. Zhi, Q. Yuan, J. Zhu, Y. Hu, A pilot allocation scheme for TDD massive MIMO system enabled HetNet, 9th IEEE International Conference on Communication Software and Networks (ICCSN, GuangZhou, 2017), pp. 443–447.
 11.
J. Fan, W. Li, Y. Zhang, J. Deng, in Proceedings of IEEE 85th Vehicular Technology Conference (VTC Spring). Fractional pilot reuse with vertical sectorization in massive MIMO systems (IEEE, Sydney, 2017), pp. 1–5.
 12.
H. Ahmadi, A. Farhang, N. Marchetti, A. MacKenzie, A game theoretic approach for pilot contamination avoidance in massive MIMO. IEEE Wireless Commun. Lett. 5(1), 12–15 (2016).
 13.
R. Mochaourab, E. Björnson, M. Bengtsson, Adaptive pilot clustering in heterogeneous massive MIMO networks. IEEE Trans. Wirel. Commun. 15(8), 5555–5568 (2016).
 14.
L. You, X. Gao, X. Xia, N. Ma, Y. Peng, Pilot reuse for massive MIMO transmission over spatially correlated rayleigh fading channels. IEEE Trans. Wirel. Commun. 14(6), 3352–3366 (2015).
 15.
H. Yin, D. Gesbert, M. Filippou, Y. Liu, A coordinated approach to channel estimation in largescale multipleantenna systems. IEEE J. Sel. Areas Commun. 31(2), 264–273 (2013).
 16.
L.S. Muppirisetty, H. Wymeersch, J. Karout, G. Fodor, in Proceedings of 2015 IEEE Global Communications Conference (GLOBECOM). Locationaided pilot contamination elimination for massive MIMO systems (IEEE, San Diego, 2015), pp. 1–5.
 17.
L.S. Muppirisetty, T. Charalambous, J. Karout, G. Fodor, H. Wymeersch, Locationaided pilot contamination avoidance for massive MIMO systems. IEEE Trans. Wirel. Commun. 17(4), 2662–2674 (2018).
 18.
T. Kailath, A.H. Sayed, B. Hassibi, Linear Estimation (PrenticeHall, Upper Saddle River, 2000).
 19.
E. Björnson, E.G. Larsson, M. Debbah, Massive MIMO for maximal spectral efficiency: How many users and pilots should be allocated? IEEE Trans. Wirel. Commun. 15(2), 1293–1308 (2016).
 20.
M.J. Osborne, A. Rubinstein, A Course in Game Theory (MIT Press, Cambridge, 1994).
 21.
A. Bogomolnaia, M.O. Jackson, The stability of hedonic coalition structures. Games Econ. Behavior. 38(2), 201–230 (2002).
Funding
This research is supported by the Natural Science Foundation of Anhui Province, the Startup Research Foundation of Anhui University (0100177010117700011), the College Natural Science Research Project of Anhui Province (KJ2016A042).
Availability of data and materials
Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.
Author information
Affiliations
Contributions
HZ has contributed 70% of the work, and the rest is contributed by XD. Both authors read and approved the final manuscript.
Corresponding author
Correspondence to Hui Zhi.
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1
According to expression (11), the mean square error of signal detection (MSESD) for user (j, k) is
Then, according to reference ([19], Lemma 2), the received SINR for user (j, k) at BS j is
For the MRC scheme, we can obtain \( {\mathbb{E}}_{\boldsymbol{h}}\left\{{\left{\left({\boldsymbol{w}}_{jjk}^{\mathrm{mrc}}\right)}^H{\boldsymbol{h}}_{jjk}1\right}^2\right\}={\mathbb{E}}_{\boldsymbol{h}}\left\{{\left{\left({\boldsymbol{w}}_{jjk}^{\mathrm{mrc}}\right)}^H{\widehat{\boldsymbol{h}}}_{jjk}1\right}^2\right\}+{\mathbb{E}}_{\boldsymbol{h}}\left\{{\left{\left({\boldsymbol{w}}_{jjk}^{\mathrm{mrc}}\right)}^H{\tilde{\boldsymbol{h}}}_{jjk}\right}^2\right\} \). That is the first and the second items of expression (20).
According to the definition of the MRC scheme, we obtain
Therefore, the first item of expression (20) is
When M → ∞, \( {\mathbb{E}}_{\boldsymbol{h}}\left\{{\left{\left({\boldsymbol{w}}_{jjk}^{\mathrm{mrc}}\right)}^H{\boldsymbol{h}}_{jjk}\right}^2\right\}\approx {\left{\mathbb{E}}_{\boldsymbol{h}}\left\{{\left({\boldsymbol{w}}_{jjk}^{\mathrm{mrc}}\right)}^H{\boldsymbol{h}}_{jjk}\right\}\right}^2=1 \). This is the result of the fourth item of expression (20).
Because \( {\tilde{\boldsymbol{h}}}_{jjk} \) is independent of \( {\boldsymbol{w}}_{jjk}^{\mathrm{wrc}} \), the second item of expression (20) is
where A ∘ B is the Hadamard product of A and B.
For the third item of expression (20) and the first item of the denominator in expression (21), two cases (i.e., \( \left(l,m\right)\notin {\Lambda}_{\mathcal{S}}\left(j,k\right) \) and \( \left(l,m\right)\in {\Lambda}_{\mathcal{S}}\left(j,k\right) \)) are discussed.
When \( \left(l,m\right)\notin {\Lambda}_{\mathcal{S}}\left(j,k\right) \),
where the second equation exists because of the independence of \( {\widehat{\boldsymbol{h}}}_{jjk} \) and h_{jlm} for \( \left(l,m\right)\notin {\Lambda}_{\mathcal{S}}\left(j,k\right) \).
When \( \left(l,m\right)\in {\Lambda}_{\mathcal{S}}\left(j,k\right) \),
According to expression (7) and the analysis above, when \( \left(l,m\right)\in {\Lambda}_{\mathcal{S}}\left(j,k\right) \),\( {\widehat{\boldsymbol{h}}}_{jlm}=\sqrt{\frac{t_{lm}^d}{t_{jk}^d}}{\mathbf{R}}_{jlm}{\mathbf{R}}_{jjk}^{1}{\widehat{\boldsymbol{h}}}_{jjk} \). Substituting it into expression (26), we can get (A8) for \( \left(l,m\right)\in {\Lambda}_{\mathcal{S}}\left(j,k\right) \)
where equation (a) exists for \( {\mathbf{R}}_{jlm}{\mathbf{R}}_{jjk}^{1} \) is a Hermitian matrix, and \( {\widehat{\boldsymbol{h}}}_{jjk}^H{\mathbf{R}}_{jlm}{\mathbf{R}}_{jjk}^{1}{\widehat{\boldsymbol{h}}}_{jjk}=\mathrm{sum}\left\{\left({\mathbf{R}}_{jlm}{\mathbf{R}}_{jjk}^{1}\right)\circ {\left({\widehat{\boldsymbol{h}}}_{jjk}{\widehat{\boldsymbol{h}}}_{jjk}^H\right)}^T\right\} \) is a real number. When M → ∞, \( {\mathbb{E}}_{\boldsymbol{h}}\left\{{\left{\widehat{\boldsymbol{h}}}_{jjk}^H{\mathbf{R}}_{jlm}{\mathbf{R}}_{jjk}^{1}{\widehat{\boldsymbol{h}}}_{jjk}\right}^2\right\}\approx {\left{\mathbb{E}}_{\boldsymbol{h}}\left\{{\widehat{\boldsymbol{h}}}_{jjk}^H{\mathbf{R}}_{jlm}{\mathbf{R}}_{jjk}^{1}{\widehat{\boldsymbol{h}}}_{jjk}\right\}\right}^2 \). In addition, due to the mutual independence of \( {\widehat{\boldsymbol{h}}}_{jjk} \) and h_{jlm}, \( {\mathbb{E}}_{\boldsymbol{h}}\left\{{\widehat{\boldsymbol{h}}}_{jjk}^H{\tilde{\boldsymbol{h}}}_{jlm}{\tilde{\boldsymbol{h}}}_{jlm}^H{\widehat{\boldsymbol{h}}}_{jjk}\right\}=\mathrm{sum}\left\{{\mathbf{R}}_{{\tilde{h}}_{jlm}}\circ {\mathbf{R}}_{{\widehat{h}}_{jjk}}^T\right\} \). Therefore, the third item of expression (20) (the first item of the denominator in expression (21)) is
For the last item of expression (20) and the last item of the denominator in expression (21),
Combining expressions (20), (23), (24), (28), and (29), the MSESD for user (j, k) at BS j is
Combining expressions (21), (22), (28), and (29), the received SINR for user (j, k) at BS j is
Appendix 2
We consider the ZFC scheme. For ZFC, the \( {\boldsymbol{w}}_{jjk}^{\mathrm{zfc}} \) is the kth column of the matrix \( {\boldsymbol{W}}_{jj}^{\mathrm{zfc}}={\widehat{\boldsymbol{H}}}_{jj}{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}\right)}^{1} \), and thus, \( {\left({\boldsymbol{W}}_{jj}^{\mathrm{zfc}}\right)}^H{\widehat{\boldsymbol{H}}}_{jj}={\mathbf{I}}_{K_j} \) and \( {\left({\boldsymbol{W}}_{jj}^{\mathrm{zfc}}\right)}^H{\boldsymbol{W}}_{jj}^{\mathrm{zfc}}={\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}\right)}^{1} \). Therefore
Therefore, the first item of expression (20) is
Because \( {\tilde{\boldsymbol{h}}}_{jjk} \) is independent of \( {\boldsymbol{w}}_{jjk}^{\mathrm{zfc}} \), the second item of expression (20) is
Here, we assume that \( {\boldsymbol{Q}}_1={\mathbb{E}}_{\boldsymbol{h}}\left\{{\boldsymbol{w}}_{jjk}^{\mathrm{zfc}}{\left({\boldsymbol{w}}_{jjk}^{\mathrm{zfc}}\right)}^H\right\} \). The fourth item of expression (20) is
Using a similar analysis method as MRC in Appendix 1, we can divide the third item of expression (20) (the first item of the denominator in expression (21)) into two cases.
When \( \left(l,m\right)\notin {\Lambda}_{\mathcal{S}}\left(j,k\right) \),
When \( \left(l,m\right)\in {\Lambda}_{\mathcal{S}}\left(j,k\right) \),
where \( {\boldsymbol{Q}}_2={\mathbb{E}}_{\boldsymbol{h}}\left\{{\widehat{\boldsymbol{h}}}_{jjk}{\left({\boldsymbol{w}}_{jjk}^{\mathrm{zfc}}\right)}^H\right\} \).
In addition, the last item of expression (20) (the last item of the denominator in expression (21)) is
where \( {\boldsymbol{Q}}_3={\mathbb{E}}_h\left\{{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}\right)}^{1}\right\} \) and [Q_{3}]_{k, k} means the kth row and kth column element of matrix Q_{3}.
Combining expressions (20), (33), (34), (35), (36), (37), and (38), the MSESD for user (j, k) at BS j is
Combining expressions (21), (32), (36), (37) and (38), the received SINR for user (j, k) at BS j is
Appendix 3
We consider the MMSE scheme, the \( {\boldsymbol{w}}_{jjk}^{\mathrm{mmse}} \) is the kth column of the matrix \( {\boldsymbol{W}}_{jj}^{\mathrm{mmse}}={\widehat{\boldsymbol{H}}}_{jj}{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}+{\sigma}^2{\left({\boldsymbol{T}}_j^d\right)}^{\hbox{} 2}\right)}^{1} \). Because \( {\left({\boldsymbol{W}}_{jj}^{\mathrm{zfc}}\right)}^H{\widehat{\boldsymbol{H}}}_{jj}+{\sigma}^2{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}+{\sigma}^2{\left({\boldsymbol{T}}_j^d\right)}^{\hbox{} 2}\right)}^{1}={\mathbf{I}}_{K_j} \), \( {\left({\boldsymbol{w}}_{jj k}^{\mathrm{mmse}}\right)}^H{\widehat{\boldsymbol{h}}}_{jj k}={\left[{\left({\boldsymbol{W}}_{jj}^{zfc}\right)}^H{\widehat{\boldsymbol{H}}}_{jj}\right]}_{k,k}=1{\left[{\sigma}^2{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}+{\sigma}^2{\left({\boldsymbol{T}}_j^d\right)}^{\hbox{} 2}\right)}^{1}\right]}_{k,k} \), thus
Therefore, the first item of expression (20) is
where \( {\boldsymbol{Q}}_4={\mathbb{E}}_{\boldsymbol{h}}\left\{{\sigma}^2{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}+{\sigma}^2{\left({\boldsymbol{T}}_j^d\right)}^{\hbox{} 2}\right)}^{1}\right\} \). Because \( {\tilde{\boldsymbol{h}}}_{jjk} \) is independent of \( {\boldsymbol{w}}_{jjk}^{\mathrm{zfc}} \), the second item of expression (20) is
where \( {\boldsymbol{Q}}_5={\mathbb{E}}_{\boldsymbol{h}}\left\{{\boldsymbol{w}}_{jjk}^{\mathrm{mmse}}{\left({\boldsymbol{w}}_{jjk}^{\mathrm{mmse}}\right)}^H\right\} \). The fourth item of expression (20) is
where the last approximate equality exists when M → ∞.
Using a similar analysis method as MRC in Appendix 1, we can divide the third item of expression (20) (the first item of the denominator in expression (21)) into two cases. The results is
where \( {\boldsymbol{Q}}_6={\mathbb{E}}_{\boldsymbol{h}}\left\{{\widehat{\boldsymbol{h}}}_{jjk}{\left({\boldsymbol{w}}_{jjk}^{\mathrm{mmse}}\right)}^H\right\} \).
In addition, the last item of expression (20) (the last item of the denominator in expression (21)) is5
where \( {\boldsymbol{Q}}_7={\mathbb{E}}_{\boldsymbol{h}}\left\{{\left({\boldsymbol{W}}_{jj}^{\mathrm{mmse}}\right)}^H{\boldsymbol{W}}_{jj}^{\mathrm{mmse}}\right\}={\mathbb{E}}_{\boldsymbol{h}}\left\{{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}+{\sigma}^2{\left({\boldsymbol{T}}_j^d\right)}^{\hbox{} 2}\right)}^{1}{\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}{\left({\widehat{\boldsymbol{H}}}_{jj}^H{\widehat{\boldsymbol{H}}}_{jj}+{\sigma}^2{\left({\boldsymbol{T}}_j^d\right)}^{\hbox{} 2}\right)}^{1}\right\} \) .
Combining expressions (20), (42), (43), (44), (45), and (46), the MSESD for user (j, k) at BS j is
Combining expressions (21), (41), (45), and (46), the received SINR for user (j, k) at BS j is
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Zhi, H., Ding, X. Pilot allocation scheme based on coalition game for TDD massive MIMO systems. J Wireless Com Network 2019, 60 (2019) doi:10.1186/s136380191372x
Received
Accepted
Published
DOI
Keywords
 Massive MIMO
 Pilot allocation scheme
 Coalition game