 Research
 Open Access
 Published:
A dynamic clustering algorithm for downlink CoMP systems with multiple antenna UEs
EURASIP Journal on Wireless Communications and Networking volume 2014, Article number: 125 (2014)
Abstract
Coordinated multipoint (CoMP) schemes have been widely studied in the recent years to tackle intercell interference. In practice, latency and throughput constraints on the backhaul allow the organization of only small clusters of base stations (BSs) where joint processing (JP) can be implemented. In this work, we focus on downlink CoMPJP with multiple antenna user equipments (UEs) where the additional degrees of freedom are used to suppress the residual interference by using an interference rejection combiner (IRC) and allow a multistream transmission. The main contribution of this paper is the development of a novel dynamic BS clustering algorithm with corresponding UE scheduling. In particular, we first define a set of candidate BS clusters depending on longterm channel conditions. Then, in each time block, we develop a resource allocation scheme where: (a) for each candidate BS cluster, with corresponding scheduled UEs, a weighted sum rate is estimated and then (b) we select the set of nonoverlapping BS clusters that maximizes the downlink system weighted sum rate. Numerical results show that much higher rates are achieved when UEs are equipped with multiple antennas and dynamic BS clustering is used.
1 Introduction
Coordination among base stations (BSs) has been widely studied in the recent years to tackle intercell interference which strongly limits the rates achieved in cellular systems, in particular by the user equipments (UEs) at the celledge [1]. Supported by the first results promising huge gains with respect to the baseline noncooperative system [2], a lot of attention has been paid to the topic both in the academia [3, 4] and in the industry [5, 6]. These techniques, known in the industry as coordinated multipoint (CoMP), are classified into (a) coordinated scheduling/beamforming (CS/CB), which requires channel state information (CSI) but no data sharing among the BSs, and (b) joint processing (JP), which requires both CSI and data sharing among the BSs. This paper focuses on downlink CoMPJP, where BSs jointly serve the scheduled UEs by sharing the data to be sent. Although CoMPJP is a very promising technique, many issues make its implementation still challenging. First, CSI at the transmitter may be unreliable because of noise on channel estimation in time division duplex (TDD) systems and limited bandwidth available for feedback in frequency division duplex systems. Then, sharing UE data among all the BSs is generally limited by throughput and delay constraints in the backhaul infrastructure. A possible approach to deal with backhaul throughput constraints relies on partial sharing of UE data among the BSs, i.e., a BS serving a certain UE may have only a partial knowledge of the data to be sent toward that UE [7, 8]. Although the promising results achieved under idealistic assumptions, partial UE data sharing has not found application in real systems mainly due to its complexity. To deal with limited throughput backhaul, most of the works in the literature focus on the simpler clustering approach where the BSs are organized in clusters and joint processing is applied within each cluster by sharing the whole data to be sent among all the BSs of the cluster. However, even if intracluster interference is mitigated by using CoMP schemes within each cluster, UEs at the cluster border suffer strong intercluster interference (ICI). Many clustering schemes have been developed in the literature to deal with ICI. In [9], static clustering with block diagonalization is considered and precoders are designed in each cluster by nullifying the interference towards UEs of neighboring clusters close to the border. A more flexible solution is obtained with dynamic clustering[10, 11] where the set of clusters changes over time by adapting to the network conditions. In particular, in [10], a greedy algorithm is developed where, for each cluster, the first BS is selected randomly to guarantee fairness, while the remaining BSs are selected by maximizing the cluster sum rate. In [11] instead, based on the longterm channel conditions, it is defined a set of candidate clusterings and then in each time block is selected the most suitable one. In [12], selected clusters maximize the increase of the achievable UE rate, whereas in [12, 13], they minimize the interference power. In [14], a BS negotiation algorithm is used for cluster formation within a given cluster size. In [15], active clusters are selected by minimizing an overall cost function which depends on the UE average received power. A framework for feedback and backhaul overhead reduction is developed in [16] where each UE feeds back CSI only to a subset of BSs, and UEs associated to the same subset are grouped together. In [17], a greedy UE scheduling algorithm with overlapping clusters is proposed where precoders are designed by considering the layered virtual signal to interference plus noise ratio (SINR) criterion [18]. In [19], an iterative algorithm is proposed to jointly optimize beamforming and clustering in heterogeneous networks.
However, most of the works on dynamic clustering ([10–13, 16, 17]) assume that UEs are equipped with only one antenna, although the Long Term Evolution (LTE) Advanced standard developed by the 3rd Generation Partnership Project (3GPP) considers that UEs may be equipped with up to eight antennas [20]. Although this number seems a bit optimistic for current mobile devices, the technological innovation may allow in the nearfuture manufacturing smartphones or tablets with numerous antennas and hence much more attention should be paid to the study of CoMP schemes with multiple antenna UEs [21, 22]. Therefore, in this work, we consider downlink CoMPJP with a constraint on the maximum cluster size and propose a novel dynamic BS clustering and UE scheduling algorithm by explicitly considering that UEs are equipped with multiple antennas. In our proposal, UEs exploit these additional degrees of freedom by implementing interference rejection combiner (IRC) [23] to partially suppress ICI and being served by means of a multistream transmission. Moreover, differently from many works on dynamic clustering where UE selection is not considered and a simple round robin scheduler is implemented ([10, 12–14, 16, 19]), here, we assume UE scheduling as a part of the optimization. In our approach, we first define a set of candidate BS clusters depending on longterm properties of the channels. Then, in each time block, the proposed algorithm follows a twostep procedure: (a) a weighted sum rate is estimated for each candidate cluster by performing UE selection, precoding design, power allocation, and transmission rank selection and then (b) the central unit (CU) coordinating all the BSs schedules the set of nonoverlapping candidate clusters that maximizes the system weighted sum rate under the assumption of perfect successive interference cancellation (SIC) with IRC at each UE.
For a performance comparison, we use the effective achievable rate at UEs, by assuming that CSI is perfectly known at the receiver. In particular, we evaluate the achievable rate of the proposed solution in a LTETDD scenario and compare it against a baseline singlecell processing (SCP) scheme and two static clustering schemes, where clusters do not dynamically adapt to the network conditions. Numerical results show that the achievable rates strongly increase with the number of UE antennas. Moreover, as with CoMP part of the interference is managed at the transmit side, multistream transmission is more effective with the proposed scheme than with SCP. However, as most of the gain is due to the interference suppression capability of the IRC, the relative gain achieved by the proposed scheme with respect to SCP decreases by increasing the number of UE antennas. Finally, a further decrease of this gain is observed when imperfect CSI is considered at BSs.
Notation. We use (·)^{T} to denote transpose and (·)^{H} conjugate transpose. 0_{N×M} denotes the matrix of size N×M with all zero entries, I_{ N } the identity matrix of size N, tr(X) the trace of matrix X, det(X) the determinant of matrix X, vec(X) the vectorization of X, ∥X∥ the Frobenius norm of X, [X]_{n,m} the entry on row n and column m of X, [X]_{·,m} the m th column of X, and diag(x) the diagonal matrix with the entries of vector x on the diagonal. Expectation is denoted by $\mathbb{E}\left[\xb7\right]$.
2 System model
We consider a TDD system where a set of BSs $\mathcal{J}=\{1,2,\dots ,J\}$, each equipped with M antennas, is serving a set of UEs $\mathcal{K}=\{1,2,\dots ,K\}$, each equipped with N antennas, with K>J M. As the overall number of transmitting antennas is not sufficient to serve all the UEs at the same time, UE scheduling is part of the optimization problem. We assume a block fading channel model and denote with H_{k,j}(t), t=0,1,…,T−1, the multipleinput multipleoutput (MIMO) channel matrix of size N×M between BS j and UE k in block t. We consider that the entries of matrix H_{k,j}(t) are identically distributed zeromean complex Gaussian random variables, i.e., ${\left[{\mathit{H}}_{k,j}\left(t\right)\right]}_{n,m}\sim \mathcal{C}\mathcal{N}\left(0,{\sigma}_{k,j}^{2}\right)$, for n=0,1,…,N−1 and m=0,1,…,M−1, where ${\sigma}_{k,j}^{2}$ represents the large scale fading between BS j and UE k, which depends on path loss and shadowing. We assume that the statistical description of the channels does not change for all the T blocks, whereas fast fading realizations are independent among different blocks. Then, we denote with ${\mathit{\Sigma}}_{k,j}=\mathbb{??}\left[\text{vec}\left({\mathit{H}}_{k,j}\left(t\right)\right)\text{vec}{\left({\mathit{H}}_{k,j}\left(t\right)\right)}^{H}\right]$ the covariance matrix of the channel matrix H_{k,j}(t). We indicate with L_{ E } the number of resource elements, i.e., time slots, forming a block. Note that the block fading model considered in this work can be adapted to represent a more realistic channel which changes continuously both in time and in frequency by suitably selecting the number of resource elements in each block. In fact, by denoting with W_{ C } and T_{ C } the coherence bandwidth and time of the channel, respectively, we have L_{ E }=W_{ C }T_{ C }.
We assume that the BSs are coordinated by a CU, and the backhaul links have zero latency and are errorfree. Each block is organized in three phases: (a) in the first phase, all the UEs send pilot sequences to allow channel estimation at BSs, (b) in the second phase, BS clustering, UE scheduling, beamforming design, transmission rank selection, and power optimization are performed by the CU, and finally, (c) in the third phase, the BSs perform data transmission toward the set of scheduled UEs.
For the sake of clarity, we report in Table 1 the list of important symbols used in this work.
2.1 First phase: uplink pilot transmission
The first L_{ T } resource elements of each block are allocated to the uplink pilot transmission performed by the UEs. We assume that orthogonal sequences, each of length L_{ T }, are employed by the UEs; thus, interference on channel estimation is avoided at BSs: in the considered scenario, this is achieved by imposing a minimum length of the training sequence of L_{ T }≥N K. By denoting with P^{(UE)} the maximum power available at each UE and ${\sigma}_{n}^{2}$ the thermal noise power, under the assumption of perfect reciprocity, BS j estimates the channel matrix H_{k,j}(t) connecting UE k to itself from the observation
where ${\eta}_{k,j,n,m}\left(t\right)\sim \mathcal{C}\mathcal{N}\left(0,\frac{N{\sigma}_{n}^{2}}{{L}_{T}{P}^{\left(\text{UE}\right)}}\right)$. By assuming that BS j knows the covariance matrix Σ_{k,j}, the minimum mean square error (MMSE) estimate ${\widehat{\mathit{H}}}_{k,j}\left(t\right)$ of H_{k,j}(t) given the observation (1) can be written as ([24], [Ch. 10])
where [η_{k,j}(t)]_{n,m}=η_{k,j,n,m}(t).
Note that in the case of uncorrelated channels, i.e., when Σ_{k,j}=I_{ MN }, the expression in (2) turns out to be
2.2 Second phase: resource allocation at the CU
After uplink pilot transmission, each BS j forwards the channel estimates ${\widehat{\mathit{H}}}_{k,j}\left(t\right)$, $k\in \mathcal{K}$, to the CU, which, in turn, organizes BSs in clusters and schedules in each time block t a subset $\mathcal{S}\left(t\right)\subseteq \mathcal{K}$ of UEs.
In this work, we consider dynamic multistream transmission and we denote with l_{ k }(t) the transmission rank allocated to UE k in block t, i.e., the number of streams sent toward UE k. Let us denote with G_{k,j}(t) the M×l_{ k }(t) beamforming matrix used by BS j to serve UE k and with ${\mathit{P}}_{k}\left(t\right)=\left[{P}_{k,0}\left(t\right),{P}_{k,1}\left(t\right),\dots ,{P}_{k,{l}_{k}\left(t\right)1}\left(t\right)\right]$ the power allocation vector for UE k.A basic approach to BS clustering is static, i.e., BS clusters are fixed and do not change over time. For example, in the hexagonal setup with seven sites and three BSs per site, reported in Figure 1, typical standard configurations are the following: •Single cell processing (SCP), where no cooperation is allowed among the BSs and each UE is served by its anchor BS, i.e., the BS characterized by the highest signal to noise ratio (SNR) (baseline scheme).•Intrasite cooperation (ISC), where seven clusters are constructed, each one composed by three colocated BSs (see Figure 2), and each UE is served by the best site in terms of average SNR.•Static clustering (SC), where still seven static clusters are constructed, but with cooperation allowed among three BSs of three different sites as shown in Figure 3.
Here, $\mathcal{C}=\left\{1,2,\dots ,\left\mathcal{C}\right\right\}$ denotes the set of integers, each identifying a candidate BS cluster, while ${\mathcal{J}}_{c}\left(t\right)\subseteq \mathcal{J}$ is the cth candidate BS cluster and ${\mathcal{S}}_{c}\left(t\right)\subseteq \mathcal{K}$ the corresponding set of scheduled UEs. For complexity reasons, in this work, we consider nonoverlapping clusters: hence, in each time block t, the set of BSs is partitioned into nonoverlapping clusters and no UE can be served in the same time block by two different BS clusters. Although a solution with overlapping clusters would provide higher rates, it would be much more challenging in terms of computational complexity, in particular when the number of BSs J managed by the CU is high.
We denote with ${\widehat{\mathit{H}}}_{k}^{\left(c\right)}\left(t\right)$ the matrix of size $N\times M\left{\mathcal{J}}_{c}\left(t\right)\right$ collecting the MIMO channel estimated by BSs in cluster ${\mathcal{J}}_{c}\left(t\right)$ toward UE k and with ${\mathit{G}}_{k}^{\left(c\right)}\left(t\right)$ the corresponding precoding matrix of size $M\left{\mathcal{J}}_{c}\left(t\right)\right\times {l}_{k}\left(t\right)$, with ${\u2225{\left[{\mathit{G}}_{k}^{\left(c\right)}\left(t\right)\right]}_{\xb7,l}\u2225}^{2}=1$, l=0,1,…,l_{ k }(t)−1. Due to the imperfect CSI at BSs, the signal received by UE k is modeled at the CU as
where ${\mathit{s}}_{k}\left(t\right)\sim \mathcal{C}\mathcal{N}\left({0}_{{l}_{k}\left(t\right)\times 1},\text{diag}\left({\mathit{P}}_{k}\left(t\right)\right)\right)$ is the data symbol vector transmitted toward UE k, ${\mathit{n}}_{k}\left(t\right)\sim \mathcal{C}\mathcal{N}\left({0}_{N\times 1},{\sigma}_{n}^{2}{\mathit{I}}_{N}\right)$ is the thermal noise at the UE antennas and ${\widehat{\mathit{i}}}_{k}^{\left(c\right)}\left(t\right)$ is the estimate of the ICI suffered by UE k. Note that the exact value of ${\widehat{\mathit{i}}}_{k}^{\left(c\right)}$ depends on the beamformers used by other clusters to serve their own UEs. By indicating with P^{(BS)} the power available at each BS, beamformers are designed by assuming perBS power constraints, i.e.,
In this work, we make the following assumptions regarding power allocation and beamforming design.

Equal power is allocated to the streams sent toward the UEs scheduled within the same cluster, i.e., P_{k,l}(t)=P^{(c)}(t), $k\in {\mathcal{S}}_{c}\left(t\right)$, l=0,1,…,l_{ k }(t)−1, where P^{(c)}(t) can be analytically computed from (5) as
$${P}^{\left(c\right)}\left(t\right)=\frac{{P}^{\left(\text{BS}\right)}}{\underset{j\in {\mathcal{J}}_{c}\left(t\right)}{max}\sum _{k\in {\mathcal{S}}_{c}\left(t\right)}\sum _{l=0}^{{l}_{k}\left(t\right)1}{\u2225{\left[{\mathit{G}}_{k,j}\left(t\right)\right]}_{\xb7,l}\u2225}^{2}}\phantom{\rule{0.3em}{0ex}}.$$(6) 
Beamformers are designed by using the multiuser eigenmode transmission (MET) scheme [25], where the precoding matrix used to serve UE k is optimized with the aim of nullifying the interference toward the eigenmodes selected for the coscheduled UEs $m\in {\mathcal{S}}_{c}\left(t\right)\setminus \left\{k\right\}$. In detail, let ${\widehat{\mathit{H}}}_{k}^{\left(c\right)}\left(t\right)={\widehat{\mathit{U}}}_{k}^{\left(c\right)}\left(t\right){\widehat{\mathit{\Sigma}}}_{k}^{\left(c\right)}\left(t\right){\widehat{\mathit{V}}}_{k}^{\left(c\right)H}\left(t\right)$ be the singular value decomposition (SVD) of matrix ${\widehat{\mathit{H}}}_{k}^{\left(c\right)}\left(t\right)$, where the eigenvalues in ${\widehat{\mathit{\Sigma}}}_{k}^{\left(c\right)}\left(t\right)$ are arranged so that the ones selected for transmission toward UE k appear in the leftmost columns. By defining matrix
$$\begin{array}{l}{\widehat{\mathit{\Gamma}}}_{k}^{\left(c\right)}\left(t\right)=\\ \left[{\left[{\widehat{\mathit{\Sigma}}}_{k}^{\left(c\right)}\left(t\right)\right]}_{0,0}{\left[{\widehat{\mathit{V}}}_{k}^{\left(c\right)}\left(t\right)\right]}_{\xb7,0},{\left[{\widehat{\mathit{\Sigma}}}_{k}^{\left(c\right)}\left(t\right)\right]}_{1,1}{\left[{\widehat{\mathit{V}}}_{k}^{\left(c\right)}\left(t\right)\right]}_{\xb7,1},\right.\\ {\left(\right)close="]">\dots ,{\left[{\widehat{\mathit{\Sigma}}}_{k}^{\left(c\right)}\left(t\right)\right]}_{{l}_{k}\left(t\right)1,{l}_{k}\left(t\right)1}{\left[{\widehat{\mathit{V}}}_{k}^{\left(c\right)}\left(t\right)\right]}_{\xb7,{l}_{k}\left(t\right)1}}^{}H& \phantom{\rule{0.3em}{0ex}},\end{array}$$(7) 
precoding matrix used to serve UE k satisfies constraints
$${\widehat{\mathit{\Gamma}}}_{m}^{\left(c\right)}\left(t\right){\widehat{\mathit{G}}}_{k}^{\left(c\right)}\left(t\right)={0}_{{l}_{m}\left(t\right)\times {l}_{k}\left(t\right)}\phantom{\rule{0.3em}{0ex}},\phantom{\rule{1em}{0ex}}k\ne m\phantom{\rule{0.3em}{0ex}}.$$(8)
Note that MET has been proven to outperform in a MIMO broadcast channel other linear precoding schemes such as block diagonlization [25], whereas the assumption of equal power allocation among the scheduled streams reduces the computational complexity and is asymptotically optimal at high SNR.
The main contribution of this work, described in Section 3, is a practical algorithm for dynamic BS clustering and corresponding UE scheduling developed in the considered setup with multiple antenna BSs, with equal power allocation and MET, that transmit toward multiple antenna UEs. Moreover, we recall that at UEs, the multiple receive antennas are used to perform IRC with SIC ([26], Ch. 10); in fact, while the rank l_{ k }(t) allocated to UE k is given by the number of columns of the precoder, the remaining degrees of freedom at UE are used to partially suppress the residual ICI. Note that IRC both minimizes the mean square error and maximizes the SINR at the detection point [23].
2.3 Third phase: downlink data transmission
BS clusters serve the scheduled UEs by using the L_{ E }−L_{ T } resource elements still available in block t. By defining matrix ${\mathit{G}}_{k}\left(t\right)={\left[{\mathit{G}}_{k,1}^{T}\left(t\right),{\mathit{G}}_{k,2}^{T}\left(t\right),\dots ,{\mathit{G}}_{k,J}^{T}\left(t\right)\right]}^{T}$ and matrix H_{ k }(t)=[H_{k,1}(t),H_{k,2}(t),…,H_{k,J}(t)], the signal received by UE k can be written as
3 Dynamic clustering algorithm
In this section, we drop the block index t for the sake of clarity. With respect to UE k, let us define the function ${f}_{k}:\mathcal{J}\to \mathcal{J}$, which orders the BSs on the basis of the large scale fading component of the channel, i.e., ${\sigma}_{k,{f}_{k}\left({c}_{1}\right)}^{2}>{\sigma}_{k,{f}_{k}\left({c}_{2}\right)}^{2}$ if c_{1}<c_{2}. Then, we indicate with ${\mathcal{J}}_{k}^{\left(u\right)}$ the cluster of the u BSs with the strongest average channel toward UE k, i.e.,
Hence, f_{ k }(1) is the anchor BS for UE k.
In a network with J BSs and a maximum cluster size of J_{MAX}, the number of possible BS clusters that can be constructed to serve a given UE is
which rapidly increases with J. However, as most of the interference at each UE comes from the closest BSs, we can limit the number of candidate BS clusters. Hence, we assume that set identifies all and only the sets ${\mathcal{J}}_{k}^{\left(u\right)}$ whose size is not bigger than J_{MAX}. As an example, selecting J_{MAX}=3, set identifies three candidate clusters for each UE k: (i) the first cluster includes only its anchor BS, ${\mathcal{J}}_{k}^{\left(1\right)}$, (ii) the second cluster is composed of the two closest BSs, ${\mathcal{J}}_{k}^{\left(2\right)}$, and (iii) the third cluster is composed of the three closest BSs, ${\mathcal{J}}_{k}^{\left(3\right)}$. Note that, as different UEs often have the same candidate clusters, the cardinality of set turns out to be much lower than K J_{MAX}. The considered assumption yields an important saving in terms of computational complexity by strongly limiting the number of candidate clusters with respect to (11): this complexity saving is evaluated in Section 4 for a typical LTE scenario.
Now, clusters ${\mathcal{J}}_{k}^{\left(u\right)}$, with u≤J_{MAX}, are ordered by an integer index c, and for each cluster ${\mathcal{J}}_{c}$, $c\in \mathcal{C}$, we define the corresponding set ${\mathcal{U}}_{c}$ of UEs that can be scheduled for reception, which is formed by the UEs whose anchor BS belongs to ${\mathcal{J}}_{c}$, i.e.,
We highlight that (12) allows BSs in cluster ${\mathcal{J}}_{c}$ to serve all the UEs in its coverage area, even UEs close to the border. Although a different choice could be taken for instance by forcing clusters to serve only the UEs far away from the border, it has been shown in [27] that this alternative choice provides worse performance than (12) when a huge network is considered and fairness among the UEs is taken into account.
In this work, we propose an algorithm for dynamic BS clustering and UE scheduling at the CU which follows a twostep procedure.

1.
For each candidate BS cluster ${\mathcal{J}}_{c}$, we estimate the weighted sum rate ${\widehat{R}}^{\left(c\right)}$ by selecting a suitable subset of UEs ${\mathcal{S}}_{c}\subseteq {\mathcal{U}}_{c}$, designing precoders, selecting transmission ranks, and allocating powers.

2.
After computing the weighted sum rate ${\widehat{R}}^{\left(c\right)}$ for all the candidate BS clusters in , the CU schedules a set of nonoverlapping BS clusters, where each BS belongs to at most one cluster.
Moreover, based on (12), we observe that UE k can be selected only by candidate clusters that include its anchor BS f_{ k }(1). Hence, if we enforce a nonoverlapping solution, each UE is never scheduled by two different nonoverlapping clusters in the same block. However, we highlight that the proposed dynamic solution allows the flexibility of scheduling a given UE in different clusters across successive blocks.
In the rest of this section, we describe more in detail the above two main steps of the algorithm. We stress that the candidate cluster selection, i.e., the construction of set depends on the large scale fading: hence, in our model, it should be performed only every T blocks. On the other hand, the twostep algorithm described above follows a fast fading timescale and therefore must be implemented in each block.In Figure 4, we report a flow chart which summarizes the proposed system configuration.
3.1 Cluster weighted sum rate estimation
The CU estimates the weighted sum rate ${\widehat{R}}^{\left(c\right)}$ that can be achieved by candidate cluster ${\mathcal{J}}_{c}$ by modeling the signal received by UE $k\in {\mathcal{U}}_{c}$ as in (4). Besides the errors due to the imperfect CSI (2), the estimated ICI ${\widehat{\mathit{i}}}_{k}^{\left(c\right)}$ depends on the precoders used by the other clusters scheduled by the CU. In our framework, as the CU schedules the clusters after estimating the weighted sum rate achievable in each candidate, we simply assume ${\widehat{\mathit{i}}}_{k}^{\left(c\right)}\sim \mathcal{C}\mathcal{N}\left({0}_{N\times 1},{\xi}_{k}^{\left(c\right)}{\mathit{I}}_{N}\right)$ with
Note that (13) represents the average ICI power at the UE k when all the BSs outside cluster c are transmitting at full power ([28], (2)), and for each candidate cluster allows the computation of ${\widehat{R}}^{\left(c\right)}$ independently of the other candidates.
From (4), we introduce the interference plus noise covariance matrix
By using (4) and (14), we can write the estimate at the CU of the rate ${\widehat{R}}_{k}$ achieved by UE k as
Note that the estimate of the rate achieved by UE k in (15) is computed by assuming that the multiple receive antennas are exploited by performing SIC with IRC ([26], Ch. 10): the rank l_{ k } allocated to UE k is given by the number of columns of precoder ${\mathit{G}}_{k}^{\left(c\right)}$, whereas the remaining degrees of freedom are used to partially suppress the residual ICI by IRC. Then, the weighted sum rate ${\widehat{R}}^{\left(c\right)}$ is estimated at the CU by solving the following optimization problem:
s.t.
where scaling factor α_{ k } in (16a) represents the quality of service (QoS) for UE k which depends on the employed scheduler.
Maximization (16) is a wellstudied multiuser MIMO problem [29] involving (a) UE selection, (b) transmission rank selection, (c) precoding design, and (d) power allocation.
We solve problem (16) by enforcing the assumptions of equal power allocation among the streams sent within cluster ${\mathcal{J}}_{c}$ and MET introduced in Section 2.2. Moreover, the eigenmodes (and accordingly the set ${\mathcal{S}}_{c}$ of scheduled UEs and the transmission rank allocated to each UE $k\in {\mathcal{S}}_{c}$) are selected by using a greedy iterative algorithm which, at each iteration, includes the eigenmode which maximizes the weighted sum rate ${\widehat{R}}^{\left(c\right)}$ among the ones not scheduled in the previous iterations. The algorithm starts with no UE scheduled and stops when no increase in the weighted sum rate ${\widehat{R}}^{\left(c\right)}$ is observed. Cluster ${\mathcal{J}}_{c}$, among the $N\left{\mathcal{U}}_{c}\right$ possible eigenmodes, selects a maximum of $M\left{\mathcal{J}}_{c}\right$ eigenmodes, due to the limited number of BS antennas. Note that the considered method flexibly adapts to the channel conditions by allowing the allocation of (a) different ranks to different UEs in the same block and (b) different ranks to the same UE across successive blocks.
3.2 Clustering optimization
After computing all the weighted sum rates ${\widehat{R}}^{\left(c\right)}$, $c\in \mathcal{C}$, the CU schedules the set of nonoverlapping clusters that maximizes the system weighted sum rate. In detail, by defining
we consider that each BS belongs to at most one cluster, i.e., we impose
Therefore, at the CU, the clustering optimization is performed by solving the following linear integer optimization problem
s.t. (17).
Note that (18) differs from the optimization carried out in [15] where the objective function simply depends on the received power measured by the UEs.
Maximization (18) is the optimization version of the set packing problem, which is shown to be NPhard [30]. Hence, as the exhaustive search is not a viable method to solve (18), we propose a greedy iterative algorithm which is reported in Algorithm 1: the proposed solution basically selects at each iteration the best (in terms of system weighted sum rate) cluster and ends when each BS has been assigned to at least one cluster. In detail, let ${\mathcal{C}}^{\left(A\right)}\left(n\right)$ be the set identifying the candidate BS clusters considered at iteration n. The algorithm starts by imposing ${\mathcal{C}}^{\left(A\right)}\left(1\right)\leftarrow \mathcal{C}$ and ends when ${\mathcal{C}}^{\left(A\right)}\left(n\right)=\varnothing $. Note that ${\mathcal{C}}^{\left(A\right)}\left(n\right)$ identifies all the candidate clusters that do not overlap with the clusters scheduled in the previous iterations. At iteration n, we select cluster $w\in {\mathcal{C}}^{\left(A\right)}\left(n\right)$ that maximizes the perBS weighted sum rate, i.e.,
and we remove from ${\mathcal{C}}^{\left(A\right)}\left(n\right)$ all the indexes identifying clusters that partially overlap with ${\mathcal{J}}_{w}$. Note that in criterion (19), we normalize the cluster weighted sum rate ${\widehat{R}}^{\left(c\right)}$ to the number of BSs $\left{\mathcal{J}}_{c}\right$ included in the cluster with the aim of scheduling big clusters only if this really provides an improvement. Let us consider as an example a basic scenario with J=2: by using (19), we schedule the cluster of two BSs only if its weighted sum rate ${\widehat{R}}^{\left(c\right)}$ is higher than that achieved when the two BSs are uncoordinated (SCP scheme).
By denoting with ${x}_{c}^{(\ast )}$ the greedy solution to (18) obtained by applying the proposed algorithm, the set of UEs scheduled in the current block turns out to be
4 Numerical results
We consider a hexagonal cellular scenario where J=21 BSs, each equipped with M=4 antennas, are organized in seven sites, each with three colocated BSs (see also Figure 1). We consider ten UEs randomly dropped in the coverage area of each BS, with K=210 UEs overall. The power available at each BS is P^{(BS)}=46 dBm, the power available at each UE is P^{(UE)}=23 dBm, and the thermal noise power is ${\sigma}_{n}^{2}=101$ dBm. The large scale fading between BS j and UE k can be written as
where d_{k,j} is the distance between BS j and UE k, ν=3.5 is the path loss coefficient, Γ^{(CE)}_{dB}=10 dB is the average SNR when an UE is at the cell edge, ${e}^{{\zeta}_{k,j}}$ is the lognormal shadowing with 8 dB as standard deviation, and A(θ_{k,j}) models the antenna gain as a function of the direction θ_{k,j} of UE k with respect to the antennas of BS j, with
where θ_{3d B}=(70/180)π and A_{ s }_{dB}=20 dB ([31], (21.3)). We consider an intersite distance of 500 m and a minimum distance d_{min}=35 m between BSs and UEs. Wraparound is used to deal with boundary effects [32]. We also assume that channels are correlated by considering the popular Kronecker model [33]. By denoting with R_{BS} the square correlation matrix of size M at the BS, with tr(R_{BS})=M, and with R_{UE} the square correlation matrix of size N at the UE, with tr(R_{UE})=N, we can write
where ${\stackrel{\u0304}{\mathit{H}}}_{k,j}\left(t\right)$ is a matrix of size N×M whose entries are independent and identically distributed zeromean complex Gaussian random variables with ${\sigma}_{k,j}^{2}$ as statistical power.
Results are obtained by simulating 100 UE drops and T=200 block channel realizations for each UE drop. We assume that proportional fair scheduling [34] is implemented to provide fairness among UEs, i.e., ${\alpha}_{k}\left(t\right)=1/{\stackrel{~}{R}}_{k}\left(t\right)$, with ${\stackrel{~}{R}}_{k}(t+1)=(1\gamma ){\stackrel{~}{R}}_{k}\left(t\right)+\gamma {R}_{k}\left(t\right)$, t=0,1,…,T−1, where γ=0.1 is the forgetting factor and we initialize ${\stackrel{~}{R}}_{k}\left(0\right)=\underset{2}{log}\left(1+{P}^{\left(\text{BS}\right)}{\sigma}_{k,{j}_{k}}^{2}/{\sigma}_{n}^{2}\right)$. However, to allow the scheduler to reach a steady state, only the last T/2 channels of each UE drop are considered for system performance evaluation.
We compare the developed scheme based on dynamic clustering (DC) against the three static schemes SCP, ISC, and SC, introduced in Section 2. Moreover, we assume that UE scheduling, beamforming design, transmission rank selection, and power allocation are performed, as described in Sections 2.2 and 3.1, also for the static schemes: in particular, UEs are served by using MET [25] with equal power allocation among the eigenmodes and a greedy UE selection is performed within each BS cluster.
For performance evaluation, we assume perfect CSI at the UE side, which employs IRC with SIC, and perfect detection, i.e., there is no error propagation. Moreover, in Section 4.6, we further provide some numerical results that validate the assumption of perfect CSI at the UE employed in most of the CoMP literature.
From (9), the interference plus noise covariance matrix at the UE can be written as
The effective rate achieved by UE k turns out to be ([26], Ch. 10)
In (25), the overhead due to the UE pilot transmission is taken into account in the scaling factor before the logarithm.
The proposed schemes are compared in terms of:

UE rate, defined as
$${\stackrel{\u0304}{R}}_{k}=\frac{2}{T}\sum _{t=T/2}^{T1}{R}_{k}\left(t\right)\phantom{\rule{0.3em}{0ex}}$$(26) 
Average cell rate, defined as
$${\stackrel{\u0304}{R}}_{\text{cell}}=\frac{1}{J}\sum _{k\in \mathcal{K}}{\stackrel{\u0304}{R}}_{k}\phantom{\rule{0.3em}{0ex}}$$(27)
First, to evaluate the complexity saving achieved by the candidate cluster selection described at the beginning of Section 3, we show in Table 2 the 5th, the 50th, and the 95th percentiles of the number of candidate clusters considered with DC by assuming J_{MAX}=3. By adapting the candidate clusters to the longterm channel conditions, we have a saving of about 80% in terms of with respect to the full search (11); in fact, with our approach, we ignore candidate clusters that include far apart BSs.
4.1 Effect of multiple antennas at UEs
In this section, we consider perfect CSI at BSs, i.e., ${\widehat{\mathit{H}}}_{k,j}\left(t\right)={\mathit{H}}_{k,j}\left(t\right)$ in (2), uncorrelated antennas, i.e., in (23) R_{BS}=I_{ M } and R_{UE}=I_{ N }, and we assume J_{MAX}=3 with DC for a fair comparison against ISC and SC in terms of maximum cluster size. In Figures 5 and 6, we report the average cell rate and the fifth percentile of the UE rate for three values of the number N of UE antennas, respectively. First, we observe a significative performance improvement by adding antennas at the UE side. For instance, with SCP by increasing N from 1 to 4, there is a gain of about 76% in terms of the fifth percentile of the UE rate. Two factors mainly contribute to this gain: (a) UEs with lower SINR use IRC to limit the impact of residual ICI not managed at the transmit side and (b) UEs with higher SINR can be served by multiple streams of data. From Table 3, where we report the distribution of the number of streams l_{ k } with N=4, we note that with SCP more than 80% of the transmissions are rank 1. On the other hand, with DC, as the interference level suffered by the UEs is lower, about 33% of transmissions are multistream. This shows that in general, most of the gain is due to the IRC and multistream transmission plays a nonnegligible role only with DC. Then, we observe that ISC provides a moderate gain with respect to SCP in terms of cell rate, but almost no gain in terms of the fifth percentile of the UE rate, whereas the opposite happens with SC. Indeed, ISC of Figure 2, by only allowing cooperation among the sectors of the same site, partially helps the UEs close to the site border, which however get better performance with SC of Figure 3. Moreover, we also observe that the performance gain achieved by DC over SCP decreases by adding more antennas at the UE side. In fact, as the gain of using multiple antenna UEs is mainly due to the IRC which cancels ICI, the benefits of increasing N are seen more in a noncooperative scenario, where the residual ICI is higher with respect to DC. In detail, from Figure 6, the performance gain achieved by DC over SCP drops from about 43% with N=1 to about 28% with N=4.
To further understand the role of IRC and multistream transmission with both SCP and CoMP, we consider now a simplified solution to (16) obtained by assuming a maximum number of streams l^{(MAX)} that can be transmitted toward each UE and limiting the eigenmodes that can be used for each UE to only the strongest l^{(MAX)} eigenmodes. In Table 4, we report the average cell rate and the fifth percentile of the UE rate by considering N=4 and l^{(MAX)}=1,4. Note that l^{(MAX)}=1 means that each UE is always served by rank 1 transmissions along its strongest eigenmode and implements IRC, whereas l^{(MAX)}=N=4 means that there are no constraints on the number of data streams. Again, as the level of ICI is higher with SCP, in terms of the fifth percentile of the UE rate, the gain achieved by DC over SCP decreases from about 28% with l^{(MAX)}=4 to about 18% with l^{(MAX)}=1. These results confirm that the gain of CoMP with respect to the baseline noncooperative scheme can still be important when UEs are equipped with multiple antennas, but only if multistream transmission is properly exploited.
4.2 Effect of antenna correlation
In Figure 7, we consider N=4, l^{(MAX)}=1, and perfect CSI at BSs, and we introduce correlation among UE antennas by assuming that R_{UE} is a symmetric Toeplitz matrix whose first column is [R_{UE}]_{·,0}=[1,β,…,β^{N−1}]^{T}, and plot the fifth percentile of the UE rate in terms of β. As expected, a higher rate is achieved with lowcorrelated antennas, i.e., for lower values of β. Moreover, we also observe that the gain achieved by DC over SCP decreases by decreasing β: in detail, this gain drops from 25% with β=0.9 to 18% with β=0.1. In fact, by decreasing the correlation among UE antennas, we improve the interference suppression capability of IRC. These results confirm that it is not worthy to add more antennas at the UE when they are strongly correlated.
4.3 Effect of UE selection
In this section, we consider N=4, l^{(MAX)}=1, and perfect CSI at the BSs, and we evaluate how the heuristic greedy UE selection method introduced in Section 3.1 to solve problem (16) and the proposed dynamic clustering scheme impact the gain achieved by CoMP over SCP. Therefore, in Figure 8, we evaluate SCP and DC with J_{MAX}=3 in terms of the fifth percentile of the UE rate by considering a round robin UE selection with S^{(BS)}=1,2,3,4 UEs scheduled by each BS: note that when the UEs are selected in a round robin fashion, the gain achieved by CoMP over SCP depends only on the clustering scheme, as no joint optimization of UE scheduling and clustering is performed. First, we observe that S^{(BS)}=2 represents the optimal value for both SCP and DC. In fact, when the UEs are scheduled in a round robin fashion, increasing the value of S^{(BS)}, while it allows more UEs to be served in each block, it also increases the likelihood of selecting UEs whose channels are almost parallel, thus degrading the achieved rates. Then, when we look at the impact of the UE selection method, we observe that with DC, a gain of about 37% is achieved by the greedy method over the round robin approach (see also Table 4). Finally, we observe that the gain achieved by DC over SCP decreases from about 18% with the greedy method to about 11% with the round robin approach.
4.4 Effect of cluster size
In Figure 9, by assuming N=1,4, l^{(MAX)}=1, uncorrelated antennas and perfect CSI at BSs, we compare SCP and DC in terms of the fifth percentile of the UE rate for four values of the maximum cluster size J_{MAX}. An important gain is observed with CoMP by increasing J_{MAX}: for instance, the gain achieved by DC over SCP increases from 43% (18%) with J_{MAX}=3 to about 84% (40%) with J_{MAX}=6 when N=1 (N=4). These results show that although the strongest interferers are managed by CoMP with J_{MAX}=3, the ICI suffered by UEs is still very high and strongly limits system performance. Hence, a general comment is that BS clusters of higher dimension should be employed if the backhaul infrastructure is able to handle it.
4.5 Effect of imperfect CSI at BSs
In this section, we assume that the CSI at BSs is affected by noise (2) due to the finite number of resource elements L_{ T } allocated to the pilot transmissions in each block. After denoting with f_{d} the maximum Doppler frequency, and with ${\stackrel{\u0304}{\tau}}_{\text{rms}}$ the rootmean square delay spread of the channel, we define, respectively, the coherence bandwidth ([35], Ch. 4) and coherence time ([36], Ch. 4) of the channel as
Note that above expressions are only used to determine the block size L_{ E } such that the channel can be modeled as uncorrelated between adjacent blocks. Indeed, if f_{d} or ${\stackrel{\u0304}{\tau}}_{\text{rms}}$ increase, L_{ E } is reduced and this lowers the rate of each UE as given by (25). Due to the problem of obtaining a reliable CSI at BSs in a high mobility scenario, in the following, we consider f_{d}=5 Hz, which at 2.5 GHz carrier frequency roughly corresponds to a mobile velocity of 2 km/h ([31], Ch. 21). In this section, we also assume N=4, l^{(MAX)}=1, uncorrelated antennas and J_{MAX}=3 with DC.
In Figure 10, we consider the extended pedestrian A (EPA) model, which is a very low frequency selective channel with ${\stackrel{\u0304}{\tau}}_{\text{rms}}=43$ ns ([31], Tab. 21.2). In detail, we show the fifth percentile of the UE rate versus the ratio L_{ T }/L_{ E }, which represents the fraction of resources used for pilot transmission. The dashed lines are the rates computed in Section 4.1 by assuming perfect CSI at BSs. We observe that in this case, rate performance close to the perfect CSI case can be achieved by properly increasing the value of L_{ T }. Moreover, note that SCP approaches the best performance faster than CoMP schemes. In fact, while with SCP only the channels between a BS and its anchored UEs are used for precoding design, with CoMP precoders are optimized on the basis also of the channels between some other auxiliary BSs and these UEs. As these channels are generally characterized by a lower SNR with respect to the channel between a BS and its anchored UEs, more pilots are necessary to collect a reliable CSI at transmit side.
In Figure 11, we plot the fifth percentile of the UE rate versus the ratio L_{ T }/L_{ E } for the very frequencyselective extended typical urban (ETU) channel model, characterized by ${\stackrel{\u0304}{\tau}}_{\text{rms}}=991$ ns ([31], Tab. 21.2). In this case, we observe that rates increase with L_{ T } up to a maximum and then decrease. In fact, increasing the value of L_{ T } has two conflicting effects: (a) from (3) a more reliable CSI is collected at BSs thus improving performance and (b) a lower number of resource elements is allocated to data transmission thus obviously reducing the achievable rate. Clearly, for lower values of L_{ T }, the effect of a better CSI dominates, whereas for higher values of L_{ T }, the CSI is reliable enough for the SINR level of the UEs, and a further increase of the number of pilots represents only a waste of resources. Even if we are still considering a low mobility scenario, due to the higher frequency selectivity of the ETU channel model, no scheme reaches the rates achieved with perfect CSI. Then, as observed for the EPA model, the fraction of resources allocated to pilots necessary to reach the peak in performance is lower for SCP (L_{ T }/L_{ E }≈0.02) than that for DC (L_{ T }/L_{ E }≈0.03). By choosing for each scheme the value of L_{ T } which provides the best rate, the performance gain achieved by DC over SCP decreases with respect to the perfect CSI case to about 16%.
4.6 Effect of imperfect CSI at UEs
In this section, we elaborate on the assumption of perfect CSI at UEs considered in (25). Indeed, it has already been shown [37, 38] that in a similar setup, the overhead required to obtain a reliable CSI at UEs is almost negligible when compared to the overhead necessary to acquire CSI at BSs. This result is simply explained by the huge difference in terms of available power at UEs and BSs, which is 23 dB in a typical LTE scenario: hence, for the same quality in CSI estimate, many more resources are needed for an estimate at BSs than for an estimate at UEs. Moreover, in our model, UE k, in order to implement IRC, needs to know the interference covariance matrix Ψ_{ k }(t) (24) of size N×N, which also depends on the precoders used by the interfering clusters serving their own UEs. Two estimates of Ψ_{ k }(t) at UE k are (a) a simple average of the received samples as proposed in ([39], pag. 10) to directly estimate Ψ_{ k }(t) or (b) the use of orthogonal training sequences among different BS clusters to estimate the cascade channel precoders H_{ k }(t)G_{ m }(t), m≠k, which, in turn, are used to construct matrix Ψ_{ k }(t) from (24). Differently from Section 4.5 where the results are shown in terms of the length of the uplink training sequence L_{ T }, here, we assume perfect CSI at BSs and set the length of the downlink training sequences employed by the clusters to its minimum vale J M, equal to the maximum number of streams that can be sent by the BSs. We report in Table 5 the average cell rate and the fifth percentile of the UE rate with SCP and DC by also assuming N=4, l^{(MAX)}=1, uncorrelated antennas, and J_{MAX}=3. We consider the ETU channel in a low mobility scenario with f_{ d }=5 Hz. As expected, when compared to the perfect CSI case reported in Table 4, the performance loss when imperfect CSI is assumed at UEs turns out to be less than 1%, i.e., almost negligible.
5 Conclusions
In this paper, we have considered a downlink CoMPJP system and, by assuming a maximum cluster size, we have developed a dynamic BS clustering algorithm where the clusters change over time adapting to the channel conditions. We consider that UEs are equipped with multiple antennas that implement IRC and are served by a multistream transmission. The proposed algorithm first defines a set of candidate BS clusters depending on the large scale channel fading. Then, a twostep procedure is applied following a fast fading time scale: (a) first, a weighted sum rate is estimated within each candidate BS cluster by performing UE selection, precoding, power and transmission rank selection, and then (b) the CU schedules the set of nonoverlapping BS clusters that maximizes the estimated system weighted sum rate. Numerical results show that much higher effective rates can be achieved when UEs are equipped with multiple antennas. In fact, by reducing the level of interference suffered by UEs, the proposed approach exploits more the multistream transmission than SCP. However, as most of the gain is due to the IRC, the gain achieved by the proposed approach decreases with respect to SCP by increasing the number of UE antennas. Finally, when channel estimation is considered at BSs, the gain promised in the perfect CSI scenario may be achieved only in part; in fact, a better estimate requires a longer training sequence and this lowers the system rate.
References
 1.
Marsch P, Fettweis G: Coordinated multipoint in mobile communications. Cambridge University Press, Cambridge, England; 2011.
 2.
Karakayali MK, Foschini GJ, Valenzuela RA: Network coordination for spectrally efficient communications in cellular systems. IEEE Wireless Commun Mag 2006, 13(4):5661. 10.1109/MWC.2006.1678166
 3.
Gesbert D, Hanly S, Huang H, Shamai S, Simeone O, Yu W: Multicell MIMO cooperative networks: a new look at interference. IEEE J. Sel. Areas Commun 2010, 28(9):13801408.
 4.
Björnson E, Jorswieck E: Optimal resource allocation in coordinated multicell systems. Foundations and Trends in Communications and Information Theory 2012, 9(23):113381.
 5.
3GPP TR 36.819 v11.1.0: Coordinated multipoint operation for LTE physical layer aspects (Release 11). 2011.
 6.
Irmer R, Droste H, Marsch P, Grieger M, Fettweis G, Brueck S, Mayer HP, Thiele L, Jungnickel V: Coordinated multipoint: concepts, performance, and field trial results. IEEE Commun. Mag 2011, 49(2):102111.
 7.
Zakhour R, Gesbert D: Optimized data sharing in multicell MIMO with finite backhaul capacity. IEEE Trans. Signal Process 2011, 59(12):61026111.
 8.
Baracca P, Tomasin S, Benvenuto N: Constellation quantization in constrained backhaul downlink network MIMO. IEEE Trans. Commun 2012, 60(3):830839.
 9.
Zhang J, Chen R, Andrews JG, Ghosh A, Heath RW: Networked MIMO with clustered linear precoding. IEEE Trans. Wireless Commun 2009, 8(4):19101921.
 10.
Papadogiannis A, Gesbert D, Hardouin E: A dynamic clustering approach in wireless networks with multicell cooperative processing. In Proc. IEEE International Conference on Communications (ICC). Beijing, China; 2008.
 11.
Boccardi F, Huang H, Alexiou A: Network MIMO with reduced backhaul requirements by MAC coordination. In Proc. IEEE Conference on Signals, Systems and Computers (Asilomar). Pacific Grove, CA; 2008.
 12.
Moon JM, Cho DH: Intercluster interference management based on cellclustering in network MIMO systems. In Proc. IEEE Vehicular Technology Conference (VTC Spring). Budapest, Hungary; 2011.
 13.
Liu J, Wang D: An improved dynamic clustering algorithm for multiuser distributed antenna system. In Proc. IEEE International Conference on Wireless Communications & Signal Processing (WCSP). Nanjing, China; 2009.
 14.
Zhou S, Gong J, Niu Z, Jia Y, Yang P: A decentralized framework for dynamic downlink base station cooperation. In Proc. IEEE Global Communications Conference (GLOBECOM). Honolulu, HI; 2009.
 15.
Weber R, Garavaglia A, Schulist M, Brueck S, Dekorsy A: Selforganizing adaptive clustering for cooperative multipoint transmission. In Proc. IEEE Vehicular Technology Conference (VTC Spring). Budapest, Hungary; 2011.
 16.
Papadogiannis A, Bang HJ, Gesbert D, Hardouin E: Efficient selective feedback design for multicell cooperative networks. IEEE Trans. Veh. Technol 2011, 60(1):196205.
 17.
Gong J, Zhou S, Niu Z, Geng L, Zheng M: Joint scheduling and dynamic clustering in downlink cellular networks. In Proc. IEEE Global Communications Conference (GLOBECOM). Houston, TX; 2011.
 18.
Zakhour R, Gesbert D: Distributed multicellMISO precoding using the layered virtual SINR framework. IEEE Trans. Wireless Commun 2010, 9(8):24442448.
 19.
Hong M, Sun R, Baligh H, Luo ZQ: Joint base station clustering and beamformer design for partial coordinated transmission in heterogeneous networks. IEEE J. Sel. Areas Commun 2013, 31(2):226240.
 20.
Boccardi F, Clerckx B, Ghosh A, Hardouin E, Jöngren G, Kusume K, Onggosanusi E, Tang Y: Multipleantenna techniques in LTEadvanced. IEEE Commun. Mag 2012, 50(3):114121.
 21.
Hwang I, Chae CB, Lee J, Heath RW: Multicell cooperative systems with multiple receive antennas. IEEE Wireless Commun. Mag 2013, 20(1):5058.
 22.
Clerckx B, Lee H, Hong YJ, Kim G: A practical cooperative multicell MIMOOFDMA network based on rank coordination. IEEE Trans. Wireless Commun 2013, 12(4):14811491.
 23.
Winters J: Optimum combining in digital mobile radio with cochannel interference. IEEE J. Sel. Areas Commun 1984, 2(4):528539.
 24.
Kay S: Fundamentals of statistical signal processing, volume I: estimation theory. Prentice Hall, Upper Saddle River, New Jersey; 1993.
 25.
Boccardi F, Huang H: A nearoptimum technique using linear precoding for the MIMO broadcast channel. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Honolulu, HI; 2007.
 26.
Tse D, Viswanath P: Fundamentals of wireless communication. Cambridge University Press, Cambridge, England; 2005.
 27.
Baracca P, Boccardi F, Braun V: A dynamic joint clustering scheduling algorithm for downlink CoMP systems with limited CSI. In Proc. IEEE International Symposium on Wireless Communication Systems (ISWCS). Paris, France; 2012.
 28.
Huh H, Tulino AM, Caire G: Network MIMO with linear zeroforcing beamforming: large system analysis, impact of channel estimation, and reducedcomplexity scheduling. IEEE Trans. Inf. Theory 2012, 58(5):29112934.
 29.
Spencer QH, Peel CB, Swindlehurst AL, Haardt M: An introduction to the multiuser MIMO downlink. IEEE Commun. Mag 2004, 42(10):6067. 10.1109/MCOM.2004.1341262
 30.
Hoffman K, Padberg M: Set covering, packing and partitioning problems. Springer Encyclopedia of Optimization 2001, 23482352.
 31.
Sesia S, Toufik I, Baker M: LTE: The UMTS Long Term Evolution. John Wiley & Sons, Hoboken, New Jersey; 2009.
 32.
Hytönen T: Optimal wraparound network simulation. Helsinki University of Technology, Report A432 2001.
 33.
Kermoal JP, Schumacher L, Pedersen KI, Mogensen PE, Frederiksen F: A stochastic MIMO radio channel model with experimental validation. IEEE J. Sel. Areas Commun 2002, 20(6):12111226. 10.1109/JSAC.2002.801223
 34.
Viswanath P, Tse D, Laroia R: Opportunistic beamforming using dumb antennas. IEEE Trans. Inf. Theory 2002, 48(6):12771294. 10.1109/TIT.2002.1003822
 35.
Benvenuto N, Cherubini G: Algorithms for communications systems and their applications. John Wiley & Sons, Hoboken, New Jersey; 2002.
 36.
Rappaport T: Wireless communications: principles and practice. Prentice Hall, Upper Saddle River, New Jersey; 2002.
 37.
Marzetta T, Hochwald BM: Fast transfer of channel state information in wireless systems. IEEE Trans. Signal Process 2006, 54(4):12681278.
 38.
Gomadam KS, Papadopoulos HC, Sundberg CEW: Techniques for multiuser MIMO with twoway training. In Proc. IEEE International Conference on Communications (ICC). Beijing, China; 2008.
 39.
Hoydis J, Hosseini K, ten Brink S, Debbah M: Making smart use of excess antennas: massive MIMO, small cells, and TDD. Bell Labs Tech. J 2013, 18(2):521. 10.1002/bltj.21602
Acknowledgments
Part of this work has been performed in the framework of the FP7 project ICT317669 METIS, which is partly funded by the European Union. The authors would like to acknowledge the contributions of their colleagues in METIS, although the views expressed are those of the authors and do not necessarily represent the project. Part of this work has been presented at the International Symposium on Wireless Communication Systems (ISWCS) 2012, Paris (France), and at the International Conference on Signal Processing, Computing and Control (ISPCC) 2013, Shimla (India). This work was carried out when Federico Boccardi was with Bell Labs, AlcatelLucent.
Author information
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Baracca, P., Boccardi, F. & Benvenuto, N. A dynamic clustering algorithm for downlink CoMP systems with multiple antenna UEs. J Wireless Com Network 2014, 125 (2014) doi:10.1186/168714992014125
Received:
Accepted:
Published:
Keywords
 Resource allocation and interference management
 MIMO systems
 Cellular technology