 Research
 Open Access
 Published:
Rotating cluster mechanism for coordinated heterogeneous MIMO cellular networks
EURASIP Journal on Wireless Communications and Networking volume 2018, Article number: 59 (2018)
Abstract
To increase the average achievable rates per user for clusteredge users, a rotating clustering scheme for the downlink of a coordinated multicell multiuser multipleinput multipleoutput system is proposed in this paper and analyzed in two network layouts. In the multicell heterogeneous cellular network, base stations of a cluster cooperate to transmit data signals to the users within the cluster; rotating cluster patterns enable all users to be nearer the cluster center in at least one of the patterns. Considering cellular layouts with three or six macrocells per site, different rotating patterns of clusters are proposed and the system performance with the proposed sets of clustering patterns is investigated using a simulated annealing algorithm for user scheduling and successive zeroforcing dirty paper coding as the precoding method. The rotating clustering scheme is less complex than fully dynamic clustering, and it is primarily designed to improve the throughput of clusteredge users. As an extra secondary benefit, it is also capable of slightly improving the average achievable sum rate of the network overall. The effectiveness of the proposed methods with two different scheduling metrics, namely throughput maximization and proportionally fair scheduling, is of interest in this work. Moreover, the speed of rotation affects the performance of the system; the higher the speed of rotation, the more frequently any specific users will be nearer the cluster center. Our simulations demonstrate the effectiveness of the proposed rotational approach and determine the speed of rotation beyond which any additional performance gains become negligible.
Introduction
Exploding demand for everhigher data throughputs in cellular networks is one of the main drivers behind their current evolution toward the 5th generation [1]. As a promising solution for increased throughput, multiuser multipleinput multipleoutput (MUMIMO) techniques have been introduced [2], in which several single and/or multipleantenna users receive their corresponding signals simultaneously from multipleantenna base stations (BSs). High spectral efficiency gains potentially achievable with MIMO spatial multiplexing are available at high signaltointerferenceplusnoise ratios (SINRs). However, to maximize capacity, cellular networks are normally designed to allow high levels of intercell interference, which prevents high spatial multiplexing gains. Network coordination (also known as coordinated multipoint (CoMP) transmission/reception or network MIMO) is one potential solution to reduce the intercell interference [3–6].
Historically, the key driver behind dramatic increases in area capacity of cellular networks has been reduction of cell sizes and densification of cellular layouts. More recently, this trend has evolved into the development of dense heterogeneous networks (HetNets) [7–9]. Coordinated transmission on the downlink of MUMIMO HetNets is considered in this work. Since coordination of all BSs in a large cellular network is neither practical nor necessary, coordination of transmissions within limitedsize clusters of BSs is considered instead [10, 11]. Although the SINR of most users is improved by coordination of clustered BSs, the intercell interference is now replaced by intercluster interference (ICI). Users located near the edge of the cluster experience much higher levels of ICI than users closer to its center, and they will suffer from poor throughput or even service starvation. A proportionally fair (PF) user scheduler [12–15] will improve the throughput and fairness to these users. To improve the performance of clusteredge users, clusters can change and be formed dynamically [16–21]. This increases the possibility of any given user being near the center of a cluster for at least a portion of the time.
Clusters can be formed in different ways. In the simplest method, i.e., using a fixed pattern, BSs of the cluster are assigned with a predefined pattern that remains static during the system operation [3, 22]. This method has comparatively low complexity and overhead, but clusteredge users still adversely suffer from ICI. In fully dynamic clustering, all features of the cluster such as the size and/or shape of the clusters and the set of BSs forming a cluster can be changed as often as every scheduling interval [16–18, 21, 23–30]. Clustering can be controlled and managed with a centralized processor using channel state information (CSI) of involved users [16, 21, 23–30]. This method is quite complex, but it is able to improve the performance of the system considerably in comparison to that of a fixed pattern. For example, [30] introduced a type of dynamic clustering algorithm based on weighted sum rate maximization for the downlink of a MUMIMO CoMP system. The clusters were formed specifically for the set of scheduled users such that they experience minimum ICI. It was shown that the algorithm improved the system sum rate in comparison to that of static clustering methods.
A simplified version of dynamic clustering is cluster rotation, in which the complexity and overhead burden on the centralized processor are reduced by using one of several predefined patterns of clustering that change periodically. Thus, the clusteredge effect is averaged out and diminished over time since users, who are periodically located either near the border or the center of the clusters as the cluster borders change with each pattern, each receive data primarily during their most favorable pattern(s). In [19], we proposed a method of cluster rotation for the downlink of a multicell MUMIMO HetNet, and its performance was compared with static clustering using both maximum throughput (MT) and PF scheduling metrics. A simulated annealing user scheduling algorithm and successive zeroforcing dirty paper coding (SZFDPC) [31] precoding were used in the system. We demonstrated that instantaneous achievable rates of users and the average achievable peruser rates were improved for the lowest throughput users (i.e., clusteredge users). Also, the average achievable sum rate was improved compared to fixed clustering, especially for PF scheduling. Naturally, enforcing fairness in the system reduced average peruser rates for the highest throughput users under MT scheduling. We also evaluated the effect of different cluster rotation speeds on the system performance under PF scheduling in [20]. The results demonstrated that faster rotation in general performed better than slow rotation. However, there was an upper limit on increasing the rotation speed, and beyond that point, further increases did not result in any notable additional gains in sum rate or peruser rate.
The macro BSs in [19, 20] were each equipped with one out of six sets (per site) of antenna arrays, with each set covering a cell; the hexagonalshaped macro site coverage area thus was divided into six cells^{Footnote 1}. We are interested to know whether the results in [19, 20] can be generalized into other types of network layouts and/or larger sets of possible cluster patterns, and how the rotating cluster patterns can be modified for those other layouts. This generalization aspect is quite important; cluster rotation would not be nearly so useful, if it only worked for a certain type of network layout. Thus, we extend our rotating clusters idea to a layout with three macrocells per site and attempt to find a suitable set of rotating patterns that can improve the performance of the system. The cloverleafshaped cell layout is the most suitable threecell layout to use, and it is recommended by ITUR for IMTAdvanced [32, 33]. Because of the closer resemblance between the cellular contour and the coverage of the cloverleafshaped layout [34], it is also more often used in modeling practical cellular systems than the simple hexagonal layout. Assuming a cloverleafshaped cell layout, we propose a set of five rotating cluster patterns and again compare the system’s performance in terms of the achievable throughput with that of static clusters under both MT and PF scheduling metrics. It should be noted that while we examine only two regular gridlike network layouts in this work, the concept of cluster rotation can be also applied to more general, irregular layouts.
To keep the same general methodology as in [19], we continue to use SZFDPC precoding, which partially nulls the interference between users [31]. Since reducedcomplexity user scheduling is necessary in practical systems with a large number of users, we use our previously proposed simulated annealing user scheduling (SAS) method [35]. It was shown in [35] that this method achieves performance close to that of an exhaustive search with much lower complexity, making its continued use in this context reasonable. The SAS algorithm in [19] was memoryless, while in [20], we added memory to the algorithm to further improve its performance. We have made additional refinements to the SAS algorithm in [35]; it is that version (adapted for weighted sum rates) that we use in this work. We wish to emphasize, though, that the focus of this work is on the cluster rotation and its performance, not on the scheduling algorithm, nor on the precoding method. The cluster rotation methods employed in this work can be generalized and applied both to different user scheduling algorithms and different precoding methods, including (but not limited to) those discussed in [35]. The outcomes of this work are expected to be independent of the user scheduling and/or precoding approaches. Please recall that the cluster rotation scheme addresses the impact of intercluster interference, while precoding and user scheduling attempt to resolve the intracluster interference. Hence, SZFDPC precoding and the SAS algorithm are simply meant to serve as a representative case.
Additionally, compared to [19, 20], we have revised some of the simulation parameters in this paper to correspond to recommendations in [32, 33]. These include the values of the path loss exponent and the standard deviation of shadow fading for macro and pico BSs, along with their transmitted power levels. We also have corrected a minor flaw in our simulations from [19]. There, when two adjacent 60° cells are coordinated, they have erroneously been given a single 120° antenna pattern. We have corrected this misbehavior so the coordinated cells now indeed have two 60° patterns as intended. (This misbehavior was also corrected in [20].) Lastly, we provide a comparison of the performance of cluster rotation with that of fully dynamic clustering, specifically the scheme described in [30].
The notation used in this paper is as follows. An italic variable a or A denotes a scalar, while boldface lowercase and uppercase variables a and A denote a vector and matrix, respectively. I_{ M } is the M×M identity matrix. A^{T} and A^{H} denote the transpose and conjugate (Hermitian) transpose, respectively, of A. For a square matrix A,Tr(A),A^{−1}, and A^{−1/2} are its trace, inverse, and inverse square root, respectively. A≽0 denotes A is positive semidefinite. a denotes the absolute value of the variable, and A denotes the determinant of a (square) matrix.
System model, design, and achievable weighted sum rate maximization
We consider the downlink of a coordinated multicell MUMIMO HetNet. Several macro BSs are colocated at each macro site, the coverage of which is partitioned into different cells each covered by an antenna array installed on a macro BS. We assume two different network layouts, the first with six macrocells per site, and the second with three macrocells per site. (For shorthand, we refer to these respectively as “sixcell” and “threecell” layouts in this paper.) Different system model characteristics are assumed, which are described in Sections 2.1 and 2.2, respectively, for the sixcell and threecell layouts. In both system models, omnidirectional pico BSs surround each macro site and overlay the macro coverage area. The macro BSs each transmit with power P_{ t }, and the intersite distance (ISD) between macro sites is fixed and denoted by D. Each macro BS is equipped with N_{macro} transmit antennas, while each pico BS has N_{pico} antennas. We assume for simplicity that N_{macro} and N_{pico} are both equal to N. Since all macro and pico BSs in a cluster jointly transmit their signals, essentially virtually forming one large antenna array, this assumption is reasonable and does not impact the proposed methods or the analysis in this work. Having N_{macro} larger than N_{pico} would of course increase the system sum rates and the number of users that could be scheduled simultaneously, but would also thereby significantly (and likely needlessly) increase the complexity and length of time of relevant simulations. Additionally, the examination herein assumes perfect and instantaneous CSI and data shared across the backhaul of the network. A centralized processor for the network is assumed to collect the CSI, e.g., by (idealized) feedback from the mobile users of the CSI obtained using pilot reference signals; it also performs scheduling and coordinates the transmissions between BSs. The specifics of the CSI gathering and coordination over the backhaul are outside the scope of this paper.
Layout 1: sixcell layout (hexagonalshaped cooperating area)
In our first HetNet model, the coverage area of the six cells per site overall forms a hexagonalshaped region. Each macro BS covers a 60° angle of the area with a directional antenna. The macro site is surrounded by 12 lowpowered pico BSs that form picocells overlaying the macro coverage area (see Fig. 1). Of the six cells per site, two adjacent ones are coordinated at any given time to form an effectively larger cell area. The picocells also coordinate within whatever cluster that the macrocell they overlay is part of. Without loss of generality, we may consider any arbitrary macro site (with coverage area shown in green) and the clusters it participates in (shown by the red dashed lines). Therefore, the BSs of any macro site contribute to three different clusters.
As depicted in Fig. 1, two different patterns of clustering are possible, in which different adjacent cells cooperate with each other. All cells within each thick red dashed hexagon coordinate signals from their BSs to form a cluster; one example cluster in each pattern is emphasized in the figure for clarity. As Fig. 1a depicts, those users in a cluster that are located near the border of the cluster (for example, at location “1”) experience the poorest channel conditions from the BSs in the cooperating set. By rotating the clustering pattern by 60° around any macro site (see Fig. 1b), those previously poorcoverage users are now in the middle of the cluster (i.e., they will have better channel gains or higher achievable rates). Therefore, most users will have the opportunity to have a higher chance of being scheduled and to achieve reasonably good data rates for a fraction of the overall transmission time. With users being scheduled primarily during their most favorable clustering pattern, their corresponding rates will be higher than otherwise. Averaging the throughput over all transmission periods and clustering patterns, the overall achievable transmission sum rate of the users will be improved.
There are K users uniformly distributed over the coverage area of each macro site, each user equipped with M receive antennas. K_{c(i)} is the number of users assigned to cluster c(i), from which U_{c(i)} users are served, where i refers to the ith pattern of clustering. Each cluster transmits coordinated data signals from all its BSs to its scheduled users.
Layout 2: threecell layout (cloverleafshaped cooperating area)
For the second HetNet model, which is more commonly used in LTEadvanced design [32] and is called a cloverleaf model, each cell in a macro site is covered by a highpowered BS, which is located at a corner of the cell. The directional antenna at a macro BS covers a hexagonalshaped cell within the angle of 120°. Each macrocell is overlaid by four lowpowered omnidirectional pico BSs. These are located near the four edges of the macrocell that are the most distant from the macro BS, as depicted in Fig. 2. Any three adjacent macrocells and their constituent picocells may form a cluster, if the macrocells share a corner that is not a site. Therefore, considering an arbitrary macro site and its corresponding three macrocells (shown in green in Fig. 2), the macro BSs may belong to two or three independent clusters (shown by the red dashed lines).
As depicted in Fig. 2, five different patterns of clustering are possible. We again highlight one example cluster in each pattern for clarity. Those users that are located near the edge of the cluster experience poor channel conditions from the BSs in cooperating set. Consequently, their achievable rates will be smaller compared to the users in the middle of the cluster. By rotating the clustering pattern (see Fig. 2b), a portion of those previously poorcoverage users are now in the middle of the cluster, and some of the users, previously located at the middle of cluster, are now near the edge of the cluster. To put the remainder of the clusteredge users in Fig. 2a near the middle of a cluster, more rotations are required, which are depicted in Fig. 2c–e. Therefore, in this layout after five intervals of rotation all users have the opportunity of being at least once in the middle of the cooperating area; they thus have a higher chance of being scheduled and achieving reasonably good data rates.
There are K users, each equipped with M receive antennas, uniformly distributed over each cell. This is in contrast to the sixcell layout, which has K users distributed over the coverage area of the macro site. Thus, for the threecell layout, there are 3K users per site, i.e., K_{c(i)}=3K.
For both layouts, other patterns of clustering could in theory be used, e.g., by coordinating more macrocells together in a cluster. However, please consider the corners of each macrocell that do not contain a macro BS site. These locations have the worst SINR when no coordination occurs. The patterns that we use cluster the smallest possible number of macro BSs such that it allows each of those corners to be in the center of the cluster in one of the patterns. At the same time, the duration between any given corner being in the center (as the scheme rotates through the patterns) is also the smallest possible for the number of macrocells per cluster being used. (Note there are two of these corners per macrocell in the sixcell layout, and five such corners in the threecell layout, hence leading to two and five patterns, respectively, for the layouts.)
Cluster rotation in general network layouts
There are, in general, two “rotation” aspects to cluster rotation. The first can be viewed as a physical rotation. Please note the highlighted cluster in Figs. 1 and 2 (denoted by dashed and solid vertical lines, respectively). As the cluster patterns change, that cluster, in a sense, can be imagined as rotating around some location in the network. In the two cases depicted by Figs. 1 and 2, that location is the macro site in the middle of each subfigure, though this need not be the case in general. The second aspect of rotation is the periodic rotation through a set of cluster patterns, in a roundrobin fashion. This latter aspect is more general to any arbitrary cell layout. The first aspect may not necessarily be applicable, or at least quite so readily visible, as the second. For example, the five patterns in Fig. 2 could be ordered arbitrarily. If so, the physical rotation aspect would not be as apparent, but the rotation through the set of (reordered) patterns would still occur.
While we investigate two regular gridlike cell layouts herein, the concept of cluster rotation can also be applied to more general irregular layouts. For such irregular layouts, it would first be necessary to determine sets of BSs in the network for coordination and then assign different clustering patterns to them. This may not be as simple as with a regular cell layout, but remains feasible, given a set of BS locations and coverage areas and/or where interference results without coordination. Voronoi diagrams of order n [36] could be of use to locate regions of coordinated BSs, by identifying the n nearest BSs at any given location; the distances should also be weighted based on the type/tier of each transmitting node. The system can then rotate through those patterns just as in this work.
Complexity comparison of dynamic clustering and rotating clustering
Fully dynamic clustering (whether the scheme in [30] or otherwise) results in significantly higher overhead in computational load and signaling. The system must determine and exchange possible choices of BSs for each user, run some sort of optimization or other routine to determine the choice of which BSs to serve which user, and finally communicate these choices across the network and to the users. This could occur potentially as often as every scheduling interval, though the system could also perform these operations less frequently. In comparison, almost none of those computations are required with rotating clustering since the sets of clusters are predetermined beforehand and known at all transmitting nodes. The additional overhead beyond that of static clustering is simply the same as the last stage of fully dynamic clustering, i.e., to periodically inform the users what their new cluster will be.
Furthermore, there are additional savings in complexity in regard to cell association. With rotating clustering, the association of a user to a specific anchor BS has much less impact on the network’s operation (disregarding the context of high user mobility and/or handoff, which are outside the scope of this work). Note that a user receives data from a macrocell and all picocells overlaying that macrocell. Borders between macrocells (where the received power from the BSs of those cells are equal) are statistically identical; at times, that cell border may also be a cluster border, while at other times, it will not. Thus, a complicated cell association scheme is not required. Whether a user chooses an anchor BS by closest distance, highest average received power, adding on a tierdependent association bias factor, etc., the performance of the scheme is by and large unchanged. Essentially, users can be considered more to be associated with a cluster rather than with an individual cell; in terms of performance, it is largely equivalent to associate with any one of the cells in the cluster. There may still be, for example, considerations of offloading traffic, but these would now be between clusters rather than between cells. In any event, such factors are beyond the scope of this paper.
Achievable weighted sum rate maximization and user scheduling
For both layouts, as stated earlier, averaging the throughput over all transmission periods and clustering patterns will improve the overall achievable sum rate of the users, with users scheduled primarily during their most favorable pattern. Defining T_{ cl } as a specific clustering pattern duration in units of scheduling intervals, rotation to the next pattern will occur every T_{ cl } scheduling intervals. Denoting B_{c(i)} as the number of BSs in the c(i)th cluster of the ith pattern, the aggregate downlink channel \(\mathbf {H}_{c(i),k} \in \mathcal {C}^{M\times B_{c(i)} N}\) of the kth user from all these B_{c(i)} BSs is defined by H_{c(i),k}=[H_{c(i),k}(1),⋯,H_{c(i),k}(B_{c(i)})], where \(\mathbf {H}_{c(i),k} (b) \in \mathcal {C}^{M\times N}\) denotes the downlink channel matrix between the kth user and bth BS of the cluster. Each element of H_{c(i),k}(b), denoted by h_{c(i),k}(b,m,n), is the complex downlink channel signal strength coefficient between the mth receiving antenna of the kth user and the nth transmitting antenna of the bth BS in the c(i)th cluster. This coefficient includes path loss, lognormal shadowing, and Rayleigh fading, and is modeled by
z_{c(i),k}(b,m,n) represents smallscale frequencyflat Rayleigh fading with an i.i.d. complex Gaussian random variable distributed as \(~\mathcal {CN}(0,1)\). R_{ m } is the reference distance^{Footnote 2}, and Γ_{0} is a scaling factor controlling the reference signaltonoise ratio (SNR) at a distance of R_{ m } in the boresight direction of the directional antenna. The distance between user k and BS b in cluster c(i) is represented by d_{c(i),k}(b), and α(b) is the path loss exponent for BS b. P_{ t }(b) is the transmit power of BS b, and ρ_{c,k}(b) denotes the lognormal shadow fading coefficient with standard deviation σ_{ ρ }. The antenna pattern A(θ,b) of a macro BS, where θ is the angle between the direction of interest and the boresight of the antenna at BS b, is defined as described in [32, 37]; A(θ,b) is equal to unity for pico BSs with omnidirectional antennas.
All B_{c(i)} BSs of cluster c(i) cooperatively transmit the data vector \(\mathbf {s}_{c(i),k} \in \mathcal {C}^{M\times 1}\) for user k using the aggregate precoding matrix \(\mathbf {W}_{c(i),k} \in \mathcal {C}^{B_{c(i)} N\times M}\). The received signal \(\mathbf {y}_{k} \in \mathcal {C}^{M\times 1}\) for user k is given by
The first term in (2) is the received signal from cluster c(i), to which the user belongs, while the second term describes the interference from other clusters. Applying the central limit theorem, the total interference signal from all clusters not including c(i), denoted by \(\check {c}(i)\neq c(i)\), is approximated by an M×1 complex Gaussian random vector with zero mean and standard deviation σ_{ I }. To estimate the standard deviation of this interference, it is assumed that all BSs outside the cluster c(i) are transmitting with full power, representing the worst case for ICI. The interference from these BSs experienced at different locations within the cluster c(i) is determined and averaged via Monte Carlo simulation over many channel realizations. The standard deviation of these realizations is used as the value of σ_{ I }. The last term \(\mathbf {n}_{k} \in \mathcal {C}^{M\times 1}\) is a complex additive white Gaussian noise vector with each element having zero mean and unity variance. The summation of interference and noise is denoted by Z_{c(i),k}, which with the Gaussian interference approximation ends up as a complex Gaussian random vector with zero mean and variance \(\sigma _{I}^{2} + 1\). For convenience of calculation, the interferenceplusnoise power is normalized at the receiver. This is equivalent to applying a filter at the receiver of \(\mathbf {Q}_{r} = \left (\sigma _{I}^{2} + 1\right)^{1/2}\mathbf {I}_{M}\). Hence, by defining \(\tilde {\mathbf {H}}_{c(i),k}= \mathbf {Q}_{r}\mathbf {H}_{c(i),k}\) as the postprocessed equivalent channel matrix and \(\tilde {\mathbf {Z}}_{c(i),k}= \mathbf {Q}_{r}\mathbf {Z}_{c(i),k}\) as the normalized interference plus noise, (2) is revised as
We choose to use the SZFDPC precoding technique, where the encoding order of the users is very important for maximization of the achievable weighted sum rate. Given a set of users with order \(\pi _{c(i)}^{j}\) and defining the user encoded at position k as \(\pi _{c(i),k}^{j}\), the postprocessed received signal can be modified and expanded as
The two summations in the second and third line of (4) represent the intracluster interference for user k. In SZFDPC, the precoding matrix \(\mathbf {W}_{c(i),\pi _{k}^{j} }\) is constrained to lie in the null space of the channel matrices of all users encoded before \(\pi _{c(i),k}^{j}\); the aggregate channel matrix of previously encoded users is defined as \(\mathbf {H}_{k1}=\left [\tilde {\mathbf {H}}_{c(i),\pi _{c(i),1}^{j}}^{T},\ldots,\tilde {\mathbf {H}}_{c(i),\pi _{c(i),k1}^{j}}^{T} \right ]^{T}\). The precoding matrix cancels the intracell interference from the summation in the third line of (4), while the effect of the remaining intracell interference represented by the summation in the second line of (4) is removed by using DPC. Using singular value decomposition of H_{k−1}, for a given ordered user \(\pi _{c(i),k}^{j}\), its achievable rate \(R_{c(i),\pi _{c(i),k}^{j}}\) is given by
\(\mathbf {Q}_{c(i),\pi _{c(i),k}^{j}}\) is the transmit covariance matrix for user \(\pi _{c(i),k}^{j}\) in cluster c(i), and \(\mathbf {V}_{k1}^{0}\) are orthonormal basis vectors for the joint null space of H_{k−1} for the users before \(\pi _{c(i),k}^{j}\) in the encoding order; \(\mathbf {V}_{0}^{0} \triangleq \mathbf {I}_{B_{c(i)}N}\).
The throughput maximization criterion results in the selection of a scheduled vector of users that achieves the largest sum rate among all possible vectors of users. Those users who have better channel gains have a higher likelihood to be selected by an MT scheduler. Thus, users with poorer channel gains may be very infrequently (and potentially never) selected by the scheduler, which is not fair. In PF scheduling, each user has a weight related to its priority for being chosen by the scheduler. The scheduler adjusts each weight based on the average achievable rates in the user’s history. A PF scheduler chooses those users whose instantaneous rates relative to their average rates are better than the others and uses a weighted sum rate as its scheduling metric, i.e., the combination of those users with maximum weighted sum rate will be chosen to be scheduled. If a user has been selected by the scheduler often, its weight for the next interval will be decreased (as its average rate increases), i.e., its chance to be chosen in the next scheduling interval diminishes. Meanwhile, another user with a worse channel matrix may have more opportunity to be scheduled in the next interval simply by having higher weight. Using this method provides more fairness in the network among all users.
In each cluster, the maximum achievable weighted sum rate WSR_{c(i)} is given by^{Footnote 3}
where \(\mu _{c(i),\pi _{c(i),k}^{j}}(t)\) is the priority weight of the kth user during the tth scheduling interval in cluster c(i). In PF scheduling, for the lth user out of K_{c(i)}, \(\mu _{c(i),l} (t) = 1/\bar {R}_{c(i),l} (t)\), where \(\bar {R}_{c(i),l} (t)\) is the average achievable data rate of the lth user at time t, averaged over a window of the past t_{ c } intervals. In each time interval, \(\bar {R}_{c(i),l}(t)\) (and thus μ_{c(i),l}(t)) is updated by an exponential filter as
R_{c(i),l}(t) is the instantaneous rate of the lth user, and is obtained from (5), assuming the lth user is scheduled in position k of the ordered scheduling vector \(\pi _{c(i)}^{j}\). One important special case of achievable weighted sum rate maximization is MT, which is defined by setting μ_{c(i),l} to a constant of 1 for all users. Let us define the best ordered user vector as π^{∗}. Then, in any clustering pattern i, the maximum average achievable weighted sum rate over the area of an arbitrary macrocell, averaged over time t when using pattern i, is given as
where w_{c(i)} is the number of macrocells in cluster c(i).
To solve the optimization problem in (6) using (5) as \(R_{c(i),\pi _{c(i),k}^{j}}\), we must consider the \(\mu _{c(i),\pi _{c(i),k}^{j}}(t)\) weights when calculating \(\mathbf {Q}_{c(i),\pi _{c(i),k}^{j}}\) using the waterfilling algorithm, which allocates power over the eigenmodes of the blockdiagonal matrix formed using the effective channel matrices^{Footnote 4}\(\mathbf {G}_{c(i),\pi _{c(i),k}^{j}} = \tilde {\mathbf {H}}_{c(i),\pi _{c(i),k}^{j}}\mathbf {V}_{k1}^{0}\). The user selection within a cluster is performed by using a SAS algorithm similar to what we proposed in [35] and described in Algorithm 3 therein. For ease of reference, the pseudocode of the SAS algorithm is described in Algorithm 1 here. The main difference between [35] and here is that the solution values s_{ x } and \(s_{\hat {\mathbf {x}}}\) are now achievable weighted sum rates as per (6). The ordered user vectors x and \(\hat {\mathbf {x}}\) are used for \(\pi _{c(i)}^{j}\) in (6). The rest of the operation of the algorithm is unchanged. The SAS algorithm operates in parallel separately for each cluster.
Two positive variables B_{1} and B_{2} limit the iterations and control how closely the algorithm approaches the optimal solution with the tradeoff of the algorithm’s performance and its acceptable complexity. The larger the values of B_{1} and B_{2}, the closer the algorithm comes to the optimal solution, but the computational complexity also increases as the algorithm iterates longer. The SAS algorithm starts with the variable τ_{ t } (analogous to the temperature in metallurgical annealing) equal to τ_{hot}. It continues until τ_{ t } “cools” to the value of \(\phantom {\dot {i}\!}\tau _{f} = \tau _{\text {hot}} \phi ^{B_{1}}\). ϕ is a uniformly distributed random variable in the interval (0,1). The neighborhood function N(x(n)) at random either deletes a random user from the vector x(n), adds a random unscheduled user (if possible), replaces one scheduled user with a random unscheduled one, or swaps the encoding order position of two random scheduled users. Each of these actions has an equal probability of being chosen.
We refer the reader to [35] for more details on our SAS algorithm. Note that no change to the scheduling algorithm is required for rotating clustering; better cluster patterns for users are automatically detected by the algorithm through the corresponding more favorable channel gains and/or achievable rates during that pattern, making the users more likely to be scheduled during those better patterns.
Simulation setup and results
In this section, the simulations of the proposed rotating clustering mechanisms are presented and compared with fixed clustering of the otherwise identical cooperative HetNet employing the SZFDPC precoding technique and SAS algorithm. Both MT and PF scheduling are considered. The average achievable sum rate and the average achievable rate per user are determined using the Monte Carlo simulation method. The numbers of transmitter and receiver antennas are assumed to be N_{macro}=N_{pico}=2 and M=2, respectively. These values enable a manageably low simulation complexity, yet still allow a demonstration of the effect of rotating clusters in coordinated MUMIMO systems. It is assumed that the ISD in both layouts has the same value of^{Footnote 5} D=1732 m. In the sixcell layout, there are 12 pico BSs spaced evenly on the imaginary circle with radius 693 m around each macro site. For the threecell layout, four picos are located on a circle of radius 356 m centered at the center point of each macrocell coverage area^{Footnote 6}. The scaling factor for the SNR, i.e., Γ_{0} in (1), is set to result in an SNR of 9.6 dB at the distance R_{ m }=866 m in the sixcell layout, and an SNR of 4.7 dB at R_{ m }=1155 m for the threecell layout. In both cases, R_{ m } is measured from the macro site in the direction of the BS antenna boresight. The transmitting power P_{ t }(b) of each macro BS is 40 times greater than that of each pico BS. The path loss exponent α(b) is assumed to be 3.91 and 3.67, and the standard deviation of the lognormal shadow fading is σ_{ ρ }=6 and 4 dB, respectively, for macro and pico BSs^{Footnote 7}.
The total interference from the other clusters outside the target cluster is approximated by the complex Gaussian random vector with zero mean and standard deviation σ_{ I }. The value of σ_{ I } is measured and averaged over the area of an entire cluster by employing similar methodology for both layouts as was used in [35]. The mean value of σ_{ I } in the sixcell layout is measured as 13.8. In the threecell layout, all five cluster patterns are not exactly symmetric nor statistically identical to each other. As depicted in Fig. 2, the cluster patterns in Fig. 2a, c, and e are similar in that for all three patterns, two sites contribute to a given cluster. Of the three macro BSs covering the cluster, two of them are colocated at the same site. However, in the cluster patterns of Fig. 2b, d, there are three equidistant macro BSs per cluster, with each BS belonging to a different site. This difference creates asymmetry in some features of these patterns. Most notably, the mean standard deviation of the interference across the clusters of Fig. 2a, c, and e is equal, but different from that of Fig. 2b, d, with values of 21.3 and 26.3, respectively, for the two cases. We account for these differences in our simulations. In the SAS algorithm, we use parameter values corresponding to SAm (with memory) case #18 in [35]; we refer readers to that reference for details. A summary of the simulation parameters and their values is provided in Table 1.
Simulation results for sixcell layout
The clustering pattern is changed every T_{ cl } scheduling intervals, and we set t_{ c }=100 for the size of the averaging window in the PF metric. We consider a minimum of four full sets of rotations through both patterns for our simulations (with that minimum occurring at the slowest rotation speed). The cumulative distribution functions (CDFs) of the average achievable rates per user for K_{ c }=12 and various T_{ cl } values are depicted in Fig. 3. As this figure shows, rotating clustering has the most benefit in particular for those users with poorer channel gains (e.g., those with an average achievable rate around the 5th percentile). In comparison, users with better channels (who achieve higher average rates, such as those around the 90th percentile) see their average achievable rate drop with rotating clustering. The poorer 5th percentile users are those having low SINRs in a cluster in one of the patterns. Their achievable rates significantly benefit from rotation because they are located in a position in the cluster where they can have a better SINR after the rotation. The better users trade off their average achievable rate to provide higher overall fairness to the system.
Considering different rotation speeds, Fig. 3 demonstrates that the rotation rate has an impact on each user’s average rate. This is the most easily seen for lowerrate users like those around the 5th percentile. The 5th percentile peruser rates generally increase with decreasing T_{ cl }, but eventually reach an upper limit, where even faster rotation yields no further significant gains. With PF scheduling, the gain in 5th percentile rate over that achieved with fixed clustering ranges between 23.5% to 28%. The gains in rates per user with PF scheduling are further emphasized in Fig. 4, which depicts, for each percentile of the average peruser rate, the percentage gain in average achievable peruser rates obtained when using rotating clustering, relative to the rates achieved with fixed clustering. As can be seen, the majority of the achieved peruser rates (those up to about the 65th to 72nd percentile) experience a gain with rotating clustering, at the expense of the rest. However, even at those remaining upper percentiles, the relative loss in rate is much smaller than the relative gain for the lower percentiles; at worst, the upper percentile rates drop by about 5%.
In each cluster, there are some users with particularly poor channel gains and even cluster rotation cannot help improve them much (such as those users located near the cell borders). These users have a very low chance to be selected by the MT scheduler; their average achieved rate is very close to (and sometimes equal to) zero, especially for fixed clustering. Cluster rotation helps these users by occasionally giving them better channels; however, there still remain some users that suffer from starvation. Nonetheless, although the probability of starvation is still nonzero for MT scheduling, it is reduced significantly (by a factor of over half for K_{ c }=12) by using cluster rotation, as seen in Fig. 3.
With rotation, the best users’ average achievable rates are reduced because they may be scheduled less often; however, they achieve higher instantaneous rates when they are selected. As an example, consider a theoretical extreme case of maximum throughput scheduling, where with fixed clustering, a few users would be scheduled most of the time with high throughput. With rotating clustering, instead about twice as many users would be scheduled overall, half of this total in each clustering pattern. As they would be scheduled about half as often (mostly only during their favorable pattern), their average achievable rate would also be about halved. However, they would achieve somewhat better instantaneous rates when scheduled, leading to an increased (albeit slightly) achievable sum rate. Similar and larger effects on the instantaneous and average achievable rates are seen with PF scheduling. As expected, while the average sum rate of PF is less than that of maximum throughput scheduling, its 5th percentile average achievable peruser rates are higher than for maximum throughput, and there is less overall variation in the average peruser rates achieved.
Figure 5 shows the average achievable sum rate for maximum throughput and proportionally fair scheduling vs. K_{ c } over the area of a macrocell in an arbitrary cluster. In Fig. 5, T_{ cl }=100, which is equal to t_{ c }. As seen, rotating clustering outperforms fixed clustering for both scheduling metrics. However, rotating clustering increases the average achievable sum rate with PF relatively more than with maximum throughput. For instance, for MT scheduling, rotating clustering provides slightly higher throughput, increasing about 0.85% and 0.75% for K_{ c }=4 and K_{ c }=12, respectively, while for PF scheduling, the throughput is more significantly higher (about 1.4% and 3.2%, respectively, for K_{ c }=4 and 12). This is expected because the users that take the most advantage of the rotation are the users either near the border of the cluster or in poor coverage areas, who are scheduled more often with PF than with maximum throughput.
In Fig. 6, the average achievable sum rate vs. K_{ c } for proportionally fair scheduling considering different rotation speeds is presented. As seen, while rotating clustering still outperforms fixed clustering in terms of sum rate, faster rotation (i.e., smaller T_{ cl }) yields diminishing gains. For instance, rotating clustering provides higher throughput with respect to fixed clustering with faster rotation^{Footnote 8}, increasing by about 2.1%, 3.2%, 4.1%, and 4.5% with K_{ c }=12 for T_{ cl }=200, 100, 50, and 25, respectively. Considering an upper limit seen in the sum rate, rotating faster would not provide any further significant increase, but would increase complexity in signaling overhead for cluster setup. For our system, the limit is reached at about T_{ cl }=50 (half of t_{ c }). Smaller values of T_{ cl } (e.g., T_{ cl }=25) yield almost no additional sum rate; the additional gain in sum rate relative to fixed clustering is only a few tenths of a percentage.
Assuming the scheduler has selected a user with poor channel gains in a cluster, the chance of this user being chosen again by the scheduler is comparatively low since its priority weight will have likely dropped in accordance with the update of its average rate by the exponential filter. The user’s priority will gradually increase as time passes if the user is not scheduled. If the cluster pattern change interval is smaller than t_{ c }, the possibility of that user being in a better position in another cluster pattern, and consequently dramatically improving its priority by virtue of its higher SINR and thus achievable rate, is increased. Similarly, a user with high SINR in a cluster that finds itself near a cluster edge after the pattern rotates will have its priority suddenly drop. It will be less likely to be scheduled until either the PF scheduling window passes or the cluster pattern rotates back, whichever comes first. Hence, faster rotations relative to t_{ c } can potentially result in higher priority weights and higher sum rates. However, decreasing T_{ cl } does not imply a linear increase in priority weights.
The upper limit on the performance with increasing rotation speed is understandable. In the PF scheduler, there are two factors influencing whether a user is scheduled: its potential instantaneous rate and its weight. The former is largely influenced by the cluster rotation, whereas the latter, among other things, is an indicator of how long the user has gone without being scheduled. The weight is updated with an exponential filter, meaning its value increases exponentially with time. Specifically, the increase is determined relative to the time constant (averaging window) t_{ c }, expressed as a number of scheduling intervals. If a user is not scheduled for ν scheduling intervals, its weight will increase by a factor of roughly exp(ν/t_{ c }). If the clustering pattern rotation interval is small relative to t_{ c }, the conditions for a user will become favorable or unfavorable sooner than the time it takes for the weight to change significantly. In other words, when T_{ cl } is a fraction of t_{ c }, the weights change little between the instances in which a given user is scheduled. Hence, the specific value of T_{ cl } is no longer much of a factor, leading to the upper limit on (saturation of) the performance with increasing rotation speed.
With larger K_{ c }, cluster rotation is more effective at increasing the sum rate; for example, the achievable gains with T_{ cl }=100 are 1.4% and 3.2% for K_{ c }=4 and 12, respectively, using PF scheduling. Faster rotation with larger K_{ c } is more beneficial. For instance, with K_{ c }=8 and comparing the rotation speed of T_{ cl }=200 to T_{ cl }=100 and 50, the sum rate gains are 0.9% and 1.4%, respectively, whereas with K_{ c }=16, the gain increments are 1.4% and 2.4%. The probability of having users that are located in the cluster with favorable channel gains is increased with larger K_{ c }, simply as a result of multiuser diversity. Thus, being able to schedule more of these users improves the sum rate. Recall, though, that users are uniformly distributed over the entire coverage area. This statistically results in a higher proportion of users who are farther from a BS (with poorer channel gains) than those who are nearer. An increase in K_{ c } thus also means an increase in the total number of “poorer” users. As was seen earlier, it is those users who benefit the most from rotation, explaining further why the gains with rotation are better with higher K_{ c }.
An increase in K_{ c } also means more delays in scheduling users, as a larger pool competes for the same limited resources. Faster rotation also means less of a wait for any given user’s cluster pattern to be at its best, and thus potentially for the user to be scheduled, improving its average rate. In other words, faster rotation somewhat compensates for the increased scheduling delays as K_{ c } grows, thus leading to the higher gains in rate observed with faster rotation at larger K_{ c }.
Figures 7 and 8 further demonstrate the effects of rotating clustering (with T_{ cl }=100) by depicting the average achievable rate per user based on the user’s position within the cellular network; Fig. 7 depicts MT scheduling, whereas Fig. 8 depicts PF. Figures 7a and 8a show the average achievable rates with fixed clustering. The highest rates are unsurprisingly achieved by users nearest to a BS, especially macro BSs, but also to a lesser extent pico BSs. The lowest achievable rates (darkest red) are seen near any macrocell corner where three clusters meet. We will henceforth refer to these areas as the “corner” areas for shorthand. Lower rates are also seen in the area around the border of each cell, whereas somewhat higher achievable rates (orange to yellow to white) are seen near areas that correspond to the directions of the macro BS boresights. For any macro BS, users located near the left or right borders of the cell (i.e., at a 30° angle relative to the antenna boresight) receive weaker signals on account of the antenna directivity pattern. This situation is at its worst near the corners of the macrocells, at the furthest distance from the site. Furthermore, although the corners are surrounded by picos, the received signals from those picos are also weak due to being beyond the picocell borders. Hence, even with clustering (either fixed or rotating) and coordination, users in the corners achieve consistently smaller user rates, compared to the users in other locations of the cluster.
Figures 7b and 8b show the achievable rates with rotating clustering. In comparison, the average rates (or the colors) are more evenly distributed over the entire area, indicating higher overall fairness. As expected, the largest increase in average achievable rate is experienced by users that are closest to the cluster borders in the fixed scheme. Those users previously received the worst SINRs and/or the most ICI. In contrast, most users who are located close to the macro BSs or near the directions of the macro boresights receive very good signal power. The largest decrease in average achievable rates relative to the fixed scheme is by that latter group of users, as well as to a lesser extent users nearby pico BSs. It is these latter users who trade off some of their average rates to provide more fairness and uniformity of throughput across the coverage area. It is interesting to note that there are fairly large areas where the average achievable rate of a user does not change much with rotating clustering. These areas and users are those located in the interior of the cluster. Their conditions (in terms of both useful signals and interference) are more or less the same under either cluster pattern. Hence, rotation does not change their situation much, and so, their achievable rates do not change appreciably from the fixed scheme either. There is also little difference in the rates seen in the cell corner areas with rotation. The rather small changes that exist there are not readily visible in Figs. 7 and 8. However, where three clusters meet in the fixed scheme, those corners do experience a small boost in rates with rotation due to periodically increased coordination. Likewise, the corners at the cluster centers in the fixed scheme see a small decrease in rates with rotation, as signals received there are no longer always coordinated from all nearby BSs.
Simulation results for threecell layout
Similarly to the previous section, the clustering pattern changes (rotates) every T_{ cl } scheduling intervals. For the sixcell layout, as described in Section 3.1, we simulate for a minimum of four complete sets of rotations. Assuming T_{ cl }=200, which corresponds to the slowest (nonzero) rotation speed, the total number of simulated channel realizations (or scheduling intervals) for every drop of users is thus 1600. To compare the threecell layout with the sixcell layout, we consider two different scenarios. First, we keep the same total number of 1600 channel realizations per drop with four complete sets of rotations between the five clustering patterns. We also maintain the longest pattern duration as 2t_{ c }. Thus, t_{ c } for the PF window becomes 1600 realizations/drop ÷4 sets/drop ÷5 patterns/set ÷2 windows/pattern =40 (realizations per window). This gives us different rotation speeds corresponding to pattern durations of T_{ cl }=[10,20,40,80]. Secondly, we keep the value of t_{ c } equal to what we use in Section 3.1, i.e., 100, and the same pattern durations, i.e., T_{ cl }=[25,50,100,200]. With still four full sets of rotations at minimum, this results in 4000 channel realizations per drop. The simulations with these two scenarios were run for K_{ c }=12, and the results were compared with each other. We observed that the results of both scenarios were very close to each other^{Footnote 9}. Thus, we present only the results for where the number of channel realizations per drop equals^{Footnote 10} 1600 and t_{ c }=40.
In Fig. 9, the CDF of the average achievable rates per user for K_{ c }= 12 is illustrated. Similarly to the results depicted by Fig. 3, rotating clustering again yields higher peruser rates for those who are located near cluster edges or in otherwise poor coverage areas in the threecell layout (e.g., the users with rates around the 5th percentile). Comparison of these two figures is also interesting. As seen in Fig. 3 with MT scheduling, while rotating clustering improves the peruser rates in the sixcell layout, there are still some users whose achievable rates are very low, even to the point of starvation. However, cluster rotation in the threecell layout improves the average throughput of users considerably such that the probability of very low rates and/or starvation becomes almost zero. There is a larger tradeoff in the highpercentile users’ rates to achieve this, though.
In the threecell layout and with PF scheduling, the rotation speed corresponding to T_{ cl }=t_{ c } yields a peruser rate increase of about 0.5 bits/s/Hz at the 5th percentile, while the improvement with the sixcell layout is around two thirds of that. In relative terms, the 5thpercentile rate improvements with rotating clustering (at T_{ cl }=t_{ c }) over fixed in the six and threecell layouts are, respectively, about 25.7% and 47.3%. This indicates cluster rotation is even more effective at improving the clusteredge user rates in the threecell layout. Considering the faster rotation speed corresponding to T_{ cl }=t_{ c }/2, the additional 5thpercentile users’ rate increase (relative to T_{ cl }=t_{ c }) is about 1.3% in the sixcell layout, while this increase is about 3.7% using the threecell layout. This shows that faster rotation in the threecell layout is more beneficial than in the sixcell layout to help clusteredge users to gain further higher throughput. This result is not surprising, considering the former rotates through a larger set of patterns.
We also investigate the user rates vs. their position within the network in Fig. 10 (MT scheduling) and Fig. 11 (PF scheduling). Basically, the influence of cluster rotation on achievable peruser rates with MT and PF scheduling is similar to what we have discussed in Section 3.1. As before, it is users located at the cell corners away from the sites that achieve the lowest rates. To a lesser extent, users near the middle of cells, not especially near a macro BS nor a pico BS, also achieve somewhat lower rates. As is obvious from Figs. 10 and 11, the achieved users’ rates are distributed more uniformly with rotating clustering than with fixed clustering for both MT and PF scheduling. Furthermore, the overall rates are increased. The achievable throughput of those users who are located in the corners of a cell is improved, and the size of the areas experiencing lower rates is diminished. This is particularly noticeable in Fig. 10 for MT scheduling; the colors in the cell corners increase from dark red to orange.
Cellcenter users are also significantly affected by cluster rotation. Note that in fixed clustering, and in the patterns in Fig. 2a, c, and e, two parts of the cluster come from the same site. Because of the macro BS antenna pattern, the coverage of the beams from those two BSs has little overlap. Hence, those two cells are essentially impacted by, at best, two coordinated signals: one from their own cell and one from the third BS at the other site that contributes to the cluster. However, with rotation through the patterns in Fig. 2b, d, receiving a significant coordinated signal from three BSs is more common. Thus, users in those two cells in the cluster see the most improvement from rotation^{Footnote 11}. The tradeoff now comes from users in the third cell of the fixed scheme, who with cluster rotation now are forced to experience less advantageous cluster patterns for the sake of overall network fairness.
Figure 12 demonstrates the achievable sum rates vs. K_{ c } for the threecell layout. Much like the sixcell layout, MT scheduling again does not display a gain in sum rate from cluster rotation, for similar reasons as described in Section 3.1. However, the throughput with PF scheduling does again increase considerably. For example, with K_{ c }= 12, comparing the achievable throughput of rotating vs. fixed clustering shows an increase of 4.5% and 6.7%, for T_{ cl } equal to t_{ c } and t_{ c }/2, respectively. Comparing the six and threecell layouts with the same number of users in each cluster, the sixcell layout generally yields higher values for area spectral efficiency on average. For example, the average total sum rates per macrocell for K_{ c }=12 are 8.8 and 13.6 bits/s/Hz, respectively, for the six and threecell layouts with PF using fixed clustering. Yet, also note that the macrocell area in the sixcell layout is half of that in the threecell layout (see Endnote 5). Hence, the sum rate per unit area (e.g., per km^{2}) is larger in the sixcell layout. However, the peruser rate improvement achieved with cluster rotation in the threecell layout is much more significant. Faster rotation yields higher gains in this layout than in the sixcell layout. For instance, the additional sum rate improvement achieved by setting T_{ cl }=t_{ c }/2 relative to T_{ cl }=t_{ c } is 2.1% in the threecell layout, while this gain in the sixcell layout is only 0.8%. Although faster rotation improves the average sum rate, the upper limit for higher speeds of rotation is still similar to what we described in Section 3.1. Thus, we only present the results for T_{ cl } equal to t_{ c } and t_{ c }/2. All these comparisons demonstrate that the rotating cluster method overall helps the system performance in the cloverleafshaped network layout more than in the hexagonalshaped layout.
Comparison of rotating cluster method with dynamic cluster method
In [30], a dynamic clustering method was proposed and compared with static clustering. There are several differences in the simulation methodology in that paper compared to our own (e.g., a singletier homogeneous network, differing numbers of users and antennas, the precoding method). However, it can still serve as a rough guide for the performance benefits of a fully dynamic scheme over static clustering and that can be used for comparison with our less complex rotating clustering method. The simulations in [30] assumed three macrocells per site, four transmit antennas per macrocell, two antennas per user, and PF scheduling. The dynamic scheme was shown to have approximately an 18% gain in average sum rate per cell (Fig. 5 in [30]) vs. a static clustering scheme of three cells per cluster with one macrocell contributing from each site (similar to the macrocells in our threecell layout). The gain in 5th percentile user rate was about 22% (Fig. 6 in [30]); this gain approximately doubled when the maximum cluster size of the dynamic scheme increased from three to six, though at the same time also enforcing that just a single data stream be sent to each user (Fig. 9 in [30]).
In comparison, for our sixcell layout, the 5th percentile user rates for K_{ c }= 12 and PF scheduling increase by about 23.5% to 28% over static clustering for the minimum and maximum examined rotation speeds. In the threecell layout, the gains were even higher, i.e., 47–53% for K_{ c }= 12. The gain in sum rate per cell was lower though, about 2.1–4.5% for K_{ c }= 12 in the sixcell layout, and about 4.5–6.7% for K_{ c }= 12 in the threecell layout. However, we remind the reader that our rotating cluster scheme was primarily designed to help clusteredge users (e.g., those around the 5th percentile); any gains in cell sum rate are an added secondary bonus. Hence, we can see that, compared to the fully dynamic scheme of [30], our rotating scheme provides gains for clusteredge users that are on par with or better than those in [30], but at the tradeoff of lower gains in the sum rate per cell.
Conclusions
We have considered the downlink of a coordinated heterogeneous MIMO cellular network (including macro and pico BSs) in hexagonalshaped and cloverleafshaped cell layouts. A different rotating set of cellcluster patterns has been proposed for each layout. The proposed scheme’s performance, considering two user scheduling metrics, has been evaluated by simulation. The results demonstrate that the proposed cluster rotation scheme performs significantly better than fixed clustering for both metrics and both layouts, while being less complex than fully dynamic clustering. The average achievable rate per user has been improved for clusteredge users, which is the primary goal of cluster rotation. Additionally, the average achievable sum rate has also been improved, which is a secondary benefit. User rates also become more evenly distributed over the network coverage area. A comparison with fully dynamic clustering shows cluster rotation yields similar performance gains for clusteredge users.
We have also evaluated the effect of different cluster rotation speeds on the system performance in both cellular layouts. The results demonstrate improving performance with increasing speed of rotation up to an upper limit on that speed, beyond which further increases yield no notable additional gains in sum rate or peruser rate. The cloverleafshaped layout gains more in performance from cluster rotation than the hexagonalshaped one. This indicates in general that some network layouts (and their associated cluster patterns) may benefit more from cluster rotation than others.
Notes
 1.
This terminology differs from what we used in [19, 20]. In our earlier papers, we referred to each of the six parts of the hexagonal area as a sector, and the six sectors together constituted a single cell. Now, we instead refer to the former sectors as cells, and the six macrocells have their BSs colocated at one site.
 2.
R_{ m } is defined as the farthest distance from the macro BS to the edge of its cell in the direction of its boresight. Since we fix the value D of the ISD for both layouts in our simulations, R_{ m } in the sixcell layout is D/2 while it is 2D/3 for the threecell layout.
 3.
In (6), the sumtrace (transmit power) constraint of 1 on \(\mathbf {Q}_{c(i),\pi _{c(i),k}^{j}}\) assumes the transmit power P_{ t }(b) for BS b is embedded in the channel matrix, as we have done in (1); this method is equivalent to not embedding the power in the channel matrix, but instead using a transmit power constraint of P_{ t }(b) for BS b. Embedding the power in the channel matrix also allows the rates to be calculated independently of the type of transmission nodes (and their specific transmitted power levels) used in the network. For simplicity of calculation, we have used a sumpower constraint over all coordinated BS antennas instead of perBS constraints.
 4.
This is similar to [14, Eq. (32)], except replacing the h_{ k }w_{ k }^{2} terms there with squared singular values of \(\mathbf {G}_{c(i),\pi _{c(i),k}^{j}}\).
 5.
Using this ISD, the circle circumscribing the macro site coverage area in the sixcell layout has a radius of \(D/ \sqrt {3}\), which equals 1 km. The radius of the circle circumscribing a single macrocell in the threecell layout is D/3. Using simple geometry, the area covered by a macrocell in the threecell layout can be calculated as twice to that of a macrocell in the sixcell layout. Since the sixcell layout has twice as many macrocells, the total coverage area of a macro site is the same in both layouts.
 6.
In both layouts, the location of a pico BS is determined such that the border of its coverage area touches the border of the macrocell coverage area it is overlaid on. The picocell borders themselves are determined by where there is an equal SINR from the pico BS and from the macro BS of the macrocell that the picocell overlays. (The hexagonal picocell borders shown in the figures are an approximation to the true shape of the borders, meant to aid the visualization of the layout.)
 7.
The values of the path loss exponents and the shadow fading standard deviations used here for macrocells and picocells correspond to the Urban Macro (UMa) nonlineofsight (NLoS) scenario and the Urban Micro (UMi) NLoS scenario, respectively, as found in [32].
 8.
A similar evaluation has been performed for MT; the results show no notable improvements with higher rotation speeds. This is also expected because MT overall gains very little in sum rate by using cluster rotation in contrast to PF.
 9.
This is likely as a result of the assumption that there is no temporal correlation in the smallscale fading component of the users’ channel gains, i.e., each realization of z_{c(i),k}(b,m,n) in (1) is independent from any other. If there was temporal correlation, the results of the two scenarios would likely differ.
 10.
For certainty, we have simulated all cases for both scenarios and in each case the achieved results were similar. Aside from the fact that fewer realizations means less computation time for the simulations, using 1600 realizations for both the sixcell and threecell layouts means a comparison of the results between the two layouts is somewhat more statistically equitable.
 11.
Abbreviations
 BS:

Base station
 CDF:

Cumulative distribution function
 CoMP:

Coordinated multipoint
 CSI:

Channel state information
 DPC:

Dirty paper coding
 HetNet:

Heterogeneous network
 ICI:

Intercluster interference
 i.i.d.:

Independent and identically distributed
 IMT:

International Mobile Telecommunications
 ISD:

Intersite distance
 ITUR:

International Telecommunication Union – Radiocommunication Sector
 LTE:

Long Term Evolution
 MIMO:

Multipleinput multipleoutput
 MT:

Maximum throughput
 MUMIMO:

Multiuser multipleinput multipleoutput
 NLoS:

Nonlineofsight
 PF:

Proportionally fair
 SAS:

Simulated annealing (user) scheduling
 SINR:

Signaltointerferenceplusnoise ratio
 SNR:

Signaltonoise ratio
 SZFDPC:

Successive zeroforcing dirty paper coding
 UMa:

Urban Macro
 UMi:

Urban Micro
 WSR:

Weighted sum rate
References
 1
JG Andrews, S Buzzi, W Choi, SV Hanly, A Lozano, ACK Soong, JC Zhang, What will 5G be?IEEE J. Sel. Areas Commun. 32(6), 1065–1082 (2014). https://doi.org/10.1109/JSAC.2014.2328098.
 2
D Gesbert, M Kountouris, RW Heath Jr., CB Chae, T Sälzer, Shifting the MIMO paradigm. IEEE Signal Process. Mag. 24(5), 36–46 (2007). https://doi.org/10.1109/MSP.2007.904815.
 3
MK Karakayali, GJ Foschini, RA Valenzuela, Network coordination for spectrally efficient communications in cellular systems. IEEE Wireless Commun. 13(4), 56–61 (2006). https://doi.org/10.1109/MWC.2006.1678166.
 4
D Lee, H Seo, B Clerckx, E Hardouin, D Mazzarese, S Nagata, K Sayana, Coordinated multipoint transmission and reception in LTEadvanced: deployment scenarios and operational challenges. IEEE Commun. Mag. 50(2), 148–155 (2012). https://doi.org/10.1109/MCOM.2012.6146494.
 5
AH Sakr, E Hossain, Locationaware crosstier coordinated multipoint transmission in twotier cellular networks. IEEE Trans. Wireless Commun. 13(11), 6311–6325 (2014). https://doi.org/10.1109/TWC.2014.2348565.
 6
Z Xu, C Yang, GY Li, Y Liu, S Xu, Energyefficient CoMP precoding in heterogeneous networks. IEEE Trans. Sig. Process. 62(4), 1005–1017 (2014). https://doi.org/10.1109/TSP.2013.2296279.
 7
A Damnjanovic, J Montojo, Y Wei, T Ji, T Luo, M Vajapeyam, T Yoo, O Song, D Malladi, A survey on 3GPP heterogeneous networks. IEEE Wireless Commun. 18(3), 10–21 (2011). https://doi.org/10.1109/MWC.2011.5876496.
 8
A Ghosh, N Mangalvedhe, R Ratasuk, B Mondal, M Cudak, E Visotsky, TA Thomas, JG Andrews, P Xia, HS Jo, HS Dhillon, TD Novlan, Heterogeneous cellular networks: from theory to practice. IEEE Commun. Mag. 50(6), 54–64 (2012). https://doi.org/10.1109/MCOM.2012.6211486.
 9
JG Andrews, Seven ways that HetNets are a cellular paradigm shift. IEEE Commun. Mag. 51(3), 136–144 (2013). https://doi.org/10.1109/MCOM.2013.6476878.
 10
D Gesbert, S Hanly, H Huang, S Shamai Shitz, O Simeone, W Yu, Multicell MIMO cooperative networks: a new look at interference. IEEE J. Sel. Areas Commun. 28(9), 1380–1408 (2010). https://doi.org/10.1109/JSAC.2010.101202.
 11
M Eslami, RC Elliott, WA Krzymień, M AlShalash, Locationassisted clustering and scheduling for coordinated homogeneous and heterogeneous cellular networks. Trans. Emerging Telecommun. Technol. (ETT). 24(1), 84–101 (2013). https://doi.org/10.1002/ett.2577.
 12
FP Kelly, AK Maulloo, DKH Tan, Rate control for communication networks: shadow prices, proportional fairness and stability. J. Oper. Res. Soc. 49(3), 237–252 (1998). https://doi.org/10.1057/palgrave.jors.2600523.
 13
P Viswanath, DNC Tse, R Laroia, Opportunistic beamforming using dumb antennas. IEEE Trans. Inf. Theory. 48(6), 1277–1294 (2002). https://doi.org/10.1109/TIT.2002.1003822.
 14
VKN Lau, Optimal downlink spacetime scheduling design with convex utility functions— multipleantenna systems with orthogonal spatial multiplexing. IEEE Trans. Veh. Technol. 54(4), 1322–1333 (2005). https://doi.org/10.1109/TVT.2005.851337.
 15
T Yoo, A Goldsmith, On the optimality of multiantenna broadcast scheduling using zeroforcing beamforming. IEEE J. Sel. Areas Commun. 24(3), 528–541 (2006). https://doi.org/10.1109/JSAC.2005.862421.
 16
A Papadogiannis, D Gesbert, E Hardouin, in Proc. 2008 IEEE Int. Conf. Commun. (ICC’08). A dynamic clustering approach in wireless networks with multicell cooperative processing (IEEE, Beijing, 2008), pp. 4033–4037. https://doi.org/10.1109/ICC.2008.757.
 17
Dynamic cell clustering for CoMP. Technical Report R1090657, LG Electronics, Athens, Greece, 3GPP TSG RAN WG1 Meeting No.56 (2009).
 18
JY Hwang, J Kim, T Kim, Y Han, A periodic frequency band rotation scheme for multicell coordination clustering. IEEE Commun. Lett. 15(9), 956–958 (2011). https://doi.org/10.1109/LCOMM.2011.071311.110684.
 19
H Purmehdi, RC Elliott, WA Krzymień, J Melzer, in Proc. 2014 IEEE Int. Conf. Commun. (ICC’14). Rotating clustering with simulated annealing user scheduling for coordinated heterogeneous MIMO cellular networks (IEEE, Sydney, 2014), pp. 5293–5298. https://doi.org/10.1109/ICC.2014.6884162.
 20
H Purmehdi, RC Elliott, WA Krzymień, J Melzer, in Proc. 2014 Asilomar Conf. Signals, Systems and Comput. Effect of cluster rotation speed in coordinated heterogeneous MIMO cellular networks with proportionally fair user scheduling (IEEE, Pacific Grove, 2014), pp. 1816–1820. https://doi.org/10.1109/ACSSC.2014.7094781.
 21
N Lee, RW Heath Jr., D MoralesJimenez, A Lozano, in Proc. 2013 IEEE GLOBECOM  1st Int. Workshop CloudProcess. Heterogeneous Mobile Commun. Netw. Base station cooperation with dynamic clustering in superdense cloudRAN (IEEE, Atlanta, 2013), pp. 784–788. https://doi.org/10.1109/GLOCOMW.2013.6825084.
 22
J Zhang, R Chen, JG Andrews, A Ghosh, RW Heath Jr., Networked MIMO with clustered linear precoding. IEEE Trans. Wireless Commun. 8(4), 1910–1921 (2009). https://doi.org/10.1109/TWC.2009.080180.
 23
F Boccardi, H Huang, A Alexiou, in Proc. 2008 42nd Asilomar Conf. Signals, Systems and Comput. Network MIMO with reduced backhaul requirements by MAC coordination (IEEE, Pacific Grove, 2008), pp. 1125–1129. https://doi.org/10.1109/ACSSC.2008.5074589.
 24
JM Moon, DH Cho, in Proc. IEEE 73rd Veh. Technol. Conf. (VTC 2011Spring). Intercluster interference management based on cellclustering in network MIMO systems (IEEE, Budapest, 2011), pp. 1–6. https://doi.org/10.1109/VETECS.2011.5956662.
 25
J Liu, D Wang, in Proc. 2009 Int. Conf. Wireless Commun. Signal Process. (WCSP 2009). An improved dynamic clustering algorithm for multiuser distributed antenna system (IEEE, Nanjing, 2009), pp. 1–5. https://doi.org/10.1109/WCSP.2009.5371570.
 26
S Zhou, J Gong, Z Niu, Jia Y, P Yang, in Proc. 2009 IEEE Global Telecommun. Conf. (GLOBECOM 2009). A decentralized framework for dynamic downlink base station cooperation (IEEE, Honolulu, 2009), pp. 1–6. https://doi.org/10.1109/GLOCOM.2009.5425212.
 27
R Weber, A Garavaglia, M Schulist, S Brueck, A Dekorsy, in Proc. IEEE 73rd Veh. Technol. Conf. (VTC 2011Spring). Selforganizing adaptive clustering for cooperative multipoint transmission (IEEE, Budapest, 2011), pp. 1–5. https://doi.org/10.1109/VETECS.2011.5956490.
 28
A Papadogiannis, HJ Bang, D Gesbert, E Hardouin, Efficient selective feedback design for multicell cooperative networks. IEEE Trans. Veh. Technol. 60(1), 196–205 (2011). https://doi.org/10.1109/TVT.2010.2089707.
 29
J Gong, S Zhou, Z Niu, L Geng, M Zheng, in Proc. 2011 IEEE Global Telecommun. Conf. (GLOBECOM 2011). Joint scheduling and dynamic clustering in downlink cellular networks (IEEE, Houston, 2011), pp. 1–5. https://doi.org/10.1109/GLOCOM.2011.6133679.
 30
P Baracca, F Boccardi, N Benvenuto, A dynamic clustering algorithm for downlink CoMP systems with multiple antenna UEs. EURASIP J. Wireless Commun. Netw. 2014(125), 1–14 (2014). https://doi.org/10.1186/168714992014125.
 31
AD Dabbagh, DJ Love, Precoding for multiple antenna Gaussian broadcast channels with successive zeroforcing. IEEE Trans. Signal Process. 55(7), 3837–3850 (2007). https://doi.org/10.1109/TSP.2007.894285.
 32
Guidelines for evaluation of radio interface technologies for IMTadvanced. Technical Report ITUR M.21351, Int. Telecommun. Union (2009).
 33
Coordinated multipoint operation for LTE physical layer aspects (release 11). Technical Report 3GPP TR 36.819 v11.2.0 (201309), 3rd Generation Partnership Project (2013).
 34
E Hossain, DI Kim, VK Bhargava (eds.), Cooperative Cellular Wireless Networks (Cambridge Univ. Press, Cambridge, 2011).
 35
H Purmehdi, RC Elliott, WA Krzymień, Reducedcomplexity user scheduling algorithms for coordinated heterogeneous MIMO networks. IEEE Trans. Veh. Technol. 65(8), 6184–6203 (2016). https://doi.org/10.1109/TVT.2015.2477309.
 36
DT Lee, On knearest neighbor Voronoi diagrams in the plane. IEEE Trans. Comput. C31(6), 478–487 (1982). https://doi.org/10.1109/TC.1982.1676031.
 37
Spatial channel model for multiple input multiple output (MIMO) simulations (release 13). Technical Report 3GPP TR 25.996 v14.0.0 (201703), 3rd Generation Partnership Project (2017).
Acknowledgements
This work has been facilitated by the computing resources of Information Services and Technology (IST) at the University of Alberta, WestGrid and Compute/Calcul Canada. This work was presented in part at the IEEE International Conference on Communications, Sydney, Australia, June 2014 and the Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, November 2014.
Funding
This work was funded in part by TELUS Communications, Rohit Sharma Professorship, Natural Sciences and Engineering Research Council (NSERC) of Canada, Telecommunications Research Laboratories, and Mitacs.
Availability of data and materials
Please contact the corresponding author at wak@ece.ualberta.ca.
Author information
Affiliations
Contributions
The authors have contributed jointly to this manuscript. All authors have read and approved the final manuscript.
Corresponding author
Correspondence to Witold A. Krzymień.
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Received
Accepted
Published
DOI
Keywords
 Heterogeneous multipleantenna cellular networks
 Multiuser MIMO downlink
 Network coordination
 Cell clustering
 Intercell and intercluster interference mitigation
 Cluster pattern rotation