A dynamic clustering algorithm for downlink CoMP systems with multiple antenna UEs

Baracca, Paolo; Boccardi, Federico; Benvenuto, Nevio

doi:10.1186/1687-1499-2014-125

Research
Open access
Published: 08 August 2014


EURASIP Journal on Wireless Communications and Networking volume 2014, Article number: 125 (2014)


Abstract

Coordinated multi-point (CoMP) schemes have been widely studied in recent years to tackle inter-cell interference. In practice, latency and throughput constraints on the backhaul allow the organization of only small clusters of base stations (BSs) where joint processing (JP) can be implemented. In this work, we focus on downlink CoMP-JP with multiple antenna user equipments (UEs), where the additional degrees of freedom are used to suppress the residual interference by means of an interference rejection combiner (IRC) and to allow a multi-stream transmission. The main contribution of this paper is the development of a novel dynamic BS clustering algorithm with corresponding UE scheduling. In particular, we first define a set of candidate BS clusters depending on long-term channel conditions. Then, in each time block, we develop a resource allocation scheme where (a) for each candidate BS cluster, with corresponding scheduled UEs, a weighted sum rate is estimated and (b) we select the set of non-overlapping BS clusters that maximizes the downlink system weighted sum rate. Numerical results show that much higher rates are achieved when UEs are equipped with multiple antennas and dynamic BS clustering is used.

1 Introduction

Coordination among base stations (BSs) has been widely studied in recent years to tackle inter-cell interference, which strongly limits the rates achieved in cellular systems, in particular by the user equipments (UEs) at the cell edge [1]. Supported by the first results promising huge gains with respect to the baseline non-cooperative system [2], a lot of attention has been paid to the topic both in academia [3, 4] and in industry [5, 6]. These techniques, known in the industry as coordinated multi-point (CoMP), are classified into (a) coordinated scheduling/beamforming (CS/CB), which requires channel state information (CSI) but no data sharing among the BSs, and (b) joint processing (JP), which requires both CSI and data sharing among the BSs. This paper focuses on downlink CoMP-JP, where BSs jointly serve the scheduled UEs by sharing the data to be sent. Although CoMP-JP is a very promising technique, many issues make its implementation still challenging. First, CSI at the transmitter may be unreliable because of noise on channel estimation in time division duplex (TDD) systems and limited bandwidth available for feedback in frequency division duplex systems. Second, sharing UE data among all the BSs is generally limited by throughput and delay constraints in the backhaul infrastructure. A possible approach to deal with backhaul throughput constraints relies on partial sharing of UE data among the BSs, i.e., a BS serving a certain UE may have only a partial knowledge of the data to be sent toward that UE [7, 8]. Despite the promising results achieved under idealistic assumptions, partial UE data sharing has not found application in real systems, mainly due to its complexity. To deal with limited throughput backhaul, most of the works in the literature focus on the simpler clustering approach where the BSs are organized in clusters and joint processing is applied within each cluster by sharing the whole data to be sent among all the BSs of the cluster.
However, even if intra-cluster interference is mitigated by using CoMP schemes within each cluster, UEs at the cluster border suffer strong inter-cluster interference (ICI). Many clustering schemes have been developed in the literature to deal with ICI. In [9], static clustering with block diagonalization is considered and precoders are designed in each cluster by nullifying the interference towards UEs of neighboring clusters close to the border. A more flexible solution is obtained with dynamic clustering [10, 11], where the set of clusters changes over time by adapting to the network conditions. In particular, in [10], a greedy algorithm is developed where, for each cluster, the first BS is selected randomly to guarantee fairness, while the remaining BSs are selected by maximizing the cluster sum rate. In [11], instead, a set of candidate clusterings is defined based on the long-term channel conditions, and the most suitable one is selected in each time block. In [12], selected clusters maximize the increase of the achievable UE rate, whereas in [12, 13], they minimize the interference power. In [14], a BS negotiation algorithm is used for cluster formation within a given cluster size. In [15], active clusters are selected by minimizing an overall cost function which depends on the UE average received power. A framework for feedback and backhaul overhead reduction is developed in [16], where each UE feeds back CSI only to a subset of BSs, and UEs associated to the same subset are grouped together. In [17], a greedy UE scheduling algorithm with overlapping clusters is proposed where precoders are designed by considering the layered virtual signal to interference plus noise ratio (SINR) criterion [18]. In [19], an iterative algorithm is proposed to jointly optimize beamforming and clustering in heterogeneous networks.

However, most of the works on dynamic clustering ([10–13, 16, 17]) assume that UEs are equipped with only one antenna, although the Long Term Evolution (LTE) Advanced standard developed by the 3rd Generation Partnership Project (3GPP) considers that UEs may be equipped with up to eight antennas [20]. Although this number seems a bit optimistic for current mobile devices, technological innovation may soon allow the manufacturing of smartphones or tablets with numerous antennas, and hence much more attention should be paid to the study of CoMP schemes with multiple antenna UEs [21, 22]. Therefore, in this work, we consider downlink CoMP-JP with a constraint on the maximum cluster size and propose a novel dynamic BS clustering and UE scheduling algorithm by explicitly considering that UEs are equipped with multiple antennas. In our proposal, UEs exploit these additional degrees of freedom by implementing an interference rejection combiner (IRC) [23] to partially suppress ICI and by being served by means of a multi-stream transmission. Moreover, unlike many works on dynamic clustering where UE selection is not considered and a simple round robin scheduler is implemented ([10, 12–14, 16, 19]), here we assume UE scheduling as a part of the optimization. In our approach, we first define a set of candidate BS clusters depending on long-term properties of the channels. Then, in each time block, the proposed algorithm follows a two-step procedure: (a) a weighted sum rate is estimated for each candidate cluster by performing UE selection, precoding design, power allocation, and transmission rank selection and then (b) the central unit (CU) coordinating all the BSs schedules the set of non-overlapping candidate clusters that maximizes the system weighted sum rate under the assumption of perfect successive interference cancellation (SIC) with IRC at each UE.

For a performance comparison, we use the effective achievable rate at UEs, by assuming that CSI is perfectly known at the receiver. In particular, we evaluate the achievable rate of the proposed solution in an LTE-TDD scenario and compare it against a baseline single-cell processing (SCP) scheme and two static clustering schemes, where clusters do not dynamically adapt to the network conditions. Numerical results show that the achievable rates strongly increase with the number of UE antennas. Moreover, since with CoMP part of the interference is managed at the transmit side, multi-stream transmission is more effective with the proposed scheme than with SCP. However, as most of the gain is due to the interference suppression capability of the IRC, the relative gain achieved by the proposed scheme with respect to SCP decreases as the number of UE antennas increases. Finally, a further decrease of this gain is observed when imperfect CSI is considered at BSs.

Notation. We use (·)^T to denote transpose and (·)^H conjugate transpose. 0_N×M denotes the matrix of size N×M with all zero entries, I_N the identity matrix of size N, tr(X) the trace of matrix X, det(X) the determinant of matrix X, vec(X) the vectorization of X, ∥X∥ the Frobenius norm of X, [X]_n,m the entry on row n and column m of X, [X]_·,m the m-th column of X, and diag(x) the diagonal matrix with the entries of vector x on the diagonal. Expectation is denoted by $E [\cdot]$ .

2 System model

We consider a TDD system where a set of BSs $J = \{1, 2, \dots, J\}$ , each equipped with M antennas, is serving a set of UEs $K = \{1, 2, \dots, K\}$ , each equipped with N antennas, with K>J M. As the overall number of transmitting antennas is not sufficient to serve all the UEs at the same time, UE scheduling is part of the optimization problem. We assume a block fading channel model and denote with H_k,j(t), t=0,1,…,T−1, the multiple-input multiple-output (MIMO) channel matrix of size N×M between BS j and UE k in block t. We consider that the entries of matrix H_k,j(t) are identically distributed zero-mean complex Gaussian random variables, i.e., ${[H_{k, j} (t)]}_{n, m} \sim C N (0, σ_{k, j}^{2})$ , for n=0,1,…,N−1 and m=0,1,…,M−1, where $σ_{k, j}^{2}$ represents the large scale fading between BS j and UE k, which depends on path loss and shadowing. We assume that the statistical description of the channels does not change for all the T blocks, whereas fast fading realizations are independent among different blocks. Then, we denote with $Σ_{k, j} = E [vec (H_{k, j} (t)) vec {(H_{k, j} (t))}^{H}]$ the covariance matrix of the channel matrix H_k,j(t). We indicate with L_E the number of resource elements, i.e., time slots, forming a block. Note that the block fading model considered in this work can be adapted to represent a more realistic channel which changes continuously both in time and in frequency by suitably selecting the number of resource elements in each block. In fact, by denoting with W_C and T_C the coherence bandwidth and time of the channel, respectively, we have L_E=W_C T_C.

We assume that the BSs are coordinated by a CU, and the backhaul links have zero latency and are error-free. Each block is organized in three phases: (a) in the first phase, all the UEs send pilot sequences to allow channel estimation at BSs, (b) in the second phase, BS clustering, UE scheduling, beamforming design, transmission rank selection, and power optimization are performed by the CU, and finally, (c) in the third phase, the BSs perform data transmission toward the set of scheduled UEs.

For the sake of clarity, we report in Table 1 the list of important symbols used in this work.

Table 1 List of symbols, time block t has been omitted for simplicity


2.1 First phase: uplink pilot transmission

The first L_T resource elements of each block are allocated to the uplink pilot transmission performed by the UEs. We assume that orthogonal sequences, each of length L_T, are employed by the UEs; thus, interference on channel estimation is avoided at BSs: in the considered scenario, this is achieved by imposing a minimum length of the training sequence of L_T≥N K. By denoting with P^(UE) the maximum power available at each UE and $σ_{n}^{2}$ the thermal noise power, under the assumption of perfect reciprocity, BS j estimates the channel matrix H_k,j(t) connecting UE k to itself from the observation

o_{k, j, n, m} (t) = {[H_{k, j} (t)]}_{n, m} + η_{k, j, n, m} (t), \quad n = 0, 1, \dots, N - 1, \; m = 0, 1, \dots, M - 1,

(1)

where $η_{k, j, n, m} (t) \sim C N (0, \frac{N σ_{n}^{2}}{L_{T} P^{(UE)}})$ . By assuming that BS j knows the covariance matrix Σ_k,j, the minimum mean square error (MMSE) estimate ${\hat{H}}_{k, j} (t)$ of H_k,j(t) given the observation (1) can be written as ([24], [Ch. 10])

vec ({\hat{H}}_{k, j} (t)) = Σ_{k, j} {(Σ_{k, j} + \frac{N σ_{n}^{2}}{L_{T} P^{(UE)}} I_{MN})}^{- 1} (vec (H_{k, j} (t)) + vec (η_{k, j} (t))),

(2)

where [η_k,j(t)]_n,m=η_k,j,n,m(t).

Note that in the case of uncorrelated channels, i.e., when $Σ_{k, j} = σ_{k, j}^{2} I_{MN}$ , the expression in (2) turns out to be

{\hat{H}}_{k, j} (t) = \frac{1}{1 + \frac{N σ_{n}^{2}}{L_{T} P^{(UE)} σ_{k, j}^{2}}} (H_{k, j} (t) + η_{k, j} (t)) .

(3)
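As a sanity check on (2) and (3), the following minimal NumPy sketch applies the general MMSE filter with a covariance equal to $σ_{k, j}^{2} I_{MN}$ and compares it with the scalar shrinkage of (3). The function name `mmse_estimate`, the variable names, and all numerical values are illustrative assumptions, not part of the paper; `noise_var` stands for the quantity N σ_n² / (L_T P^(UE)).

```python
import numpy as np

def mmse_estimate(obs, cov, noise_var):
    """Apply eq. (2): vec(H_hat) = Sigma (Sigma + noise_var * I)^-1 vec(obs)."""
    n_rows, n_cols = obs.shape
    dim = n_rows * n_cols
    # vec() stacks the columns, hence Fortran ("F") ordering.
    vec_obs = obs.reshape(dim, order="F")
    weight = cov @ np.linalg.inv(cov + noise_var * np.eye(dim))
    return (weight @ vec_obs).reshape((n_rows, n_cols), order="F")

# Illustrative values only (not the paper's simulation parameters).
N, M = 2, 4
sigma2_kj = 0.5      # large scale fading sigma_{k,j}^2
noise_var = 0.1      # stands for N * sigma_n^2 / (L_T * P_UE)
rng = np.random.default_rng(0)

def complex_gaussian(var, shape):
    """Draw i.i.d. CN(0, var) samples."""
    return np.sqrt(var / 2) * (rng.standard_normal(shape)
                               + 1j * rng.standard_normal(shape))

H = complex_gaussian(sigma2_kj, (N, M))
obs = H + complex_gaussian(noise_var, (N, M))          # observation (1)

H_hat_general = mmse_estimate(obs, sigma2_kj * np.eye(N * M), noise_var)
H_hat_scalar = obs / (1.0 + noise_var / sigma2_kj)     # eq. (3)
print(np.allclose(H_hat_general, H_hat_scalar))
```

With a correlated covariance matrix, only the `cov` argument changes; the scalar form no longer applies.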

2.2 Second phase: resource allocation at the CU

After uplink pilot transmission, each BS j forwards the channel estimates ${\hat{H}}_{k, j} (t)$ , $k \in K$ , to the CU, which, in turn, organizes BSs in clusters and schedules in each time block t a subset $S (t) \subseteq K$ of UEs.

In this work, we consider dynamic multi-stream transmission and we denote with l_k(t) the transmission rank allocated to UE k in block t, i.e., the number of streams sent toward UE k. Let us denote with G_k,j(t) the M×l_k(t) beamforming matrix used by BS j to serve UE k and with $P_{k} (t) = [P_{k, 0} (t), P_{k, 1} (t), \dots, P_{k, l_{k} (t) - 1} (t)]$ the power allocation vector for UE k.

A basic approach to BS clustering is static, i.e., BS clusters are fixed and do not change over time. For example, in the hexagonal setup with seven sites and three BSs per site, reported in Figure 1, typical standard configurations are the following:

• Single cell processing (SCP), where no cooperation is allowed among the BSs and each UE is served by its anchor BS, i.e., the BS characterized by the highest signal to noise ratio (SNR) (baseline scheme).
• Intra-site cooperation (ISC), where seven clusters are constructed, each one composed by three co-located BSs (see Figure 2), and each UE is served by the best site in terms of average SNR.
• Static clustering (SC), where still seven static clusters are constructed, but with cooperation allowed among three BSs of three different sites as shown in Figure 3.

Here, $C = \{1, 2, \dots, |C|\}$ denotes the set of integers, each identifying a candidate BS cluster, while $J_{c} (t) \subseteq J$ is the c-th candidate BS cluster and $S_{c} (t) \subseteq K$ the corresponding set of scheduled UEs. For complexity reasons, in this work, we consider non-overlapping clusters: hence, in each time block t, the set of BSs is partitioned into non-overlapping clusters and no UE can be served in the same time block by two different BS clusters. Although a solution with overlapping clusters would provide higher rates, it would be much more challenging in terms of computational complexity, in particular when the number of BSs J managed by the CU is high.

We denote with ${\hat{H}}_{k}^{(c)} (t)$ the matrix of size $N \times M |J_{c} (t)|$ collecting the MIMO channel estimated by BSs in cluster $J_{c} (t)$ toward UE k and with $G_{k}^{(c)} (t)$ the corresponding precoding matrix of size $M |J_{c} (t)| \times l_{k} (t)$ , with ${∥{[G_{k}^{(c)} (t)]}_{\cdot, l}∥}^{2} = 1$ , l=0,1,…,l_k(t)−1. Due to the imperfect CSI at BSs, the signal received by UE k is modeled at the CU as

{\hat{r}}_{k}^{(c)} (t) = {\hat{H}}_{k}^{(c)} (t) G_{k}^{(c)} (t) s_{k} (t) + \sum_{m \in S_{c} (t) \setminus \{k\}} {\hat{H}}_{k}^{(c)} (t) G_{m}^{(c)} (t) s_{m} (t) + n_{k} (t) + {\hat{i}}_{k}^{(c)} (t),

(4)

where $s_{k} (t) \sim C N (0_{l_{k} (t) \times 1}, diag (P_{k} (t)))$ is the data symbol vector transmitted toward UE k, $n_{k} (t) \sim C N (0_{N \times 1}, σ_{n}^{2} I_{N})$ is the thermal noise at the UE antennas and ${\hat{i}}_{k}^{(c)} (t)$ is the estimate of the ICI suffered by UE k. Note that the exact value of ${\hat{i}}_{k}^{(c)}$ depends on the beamformers used by other clusters to serve their own UEs. By indicating with P^(BS) the power available at each BS, beamformers are designed by assuming per-BS power constraints, i.e.,

\begin{matrix} \sum_{k \in S_{c} (t)} tr (G_{k, j}^{H} (t) G_{k, j} (t) diag (P_{k} (t))) \leq P^{(BS)}, j \in J_{c} (t) . \end{matrix}

(5)

In this work, we make the following assumptions regarding power allocation and beamforming design.

Equal power is allocated to the streams sent toward the UEs scheduled within the same cluster, i.e., P_k,l(t)=P^(c)(t), $k \in S_{c} (t)$ , l=0,1,…,l_k(t)−1, where P^(c)(t) can be analytically computed from (5) as
$P^{(c)} (t) = \frac{P^{(BS)}}{max_{j \in J_{c} (t)} \sum_{k \in S_{c} (t)} \sum_{l = 0}^{l_{k} (t) - 1} {∥{[G_{k, j} (t)]}_{\cdot, l}∥}^{2}} .$
(6)
Beamformers are designed by using the multi-user eigenmode transmission (MET) scheme [25], where the precoding matrix used to serve UE k is optimized with the aim of nullifying the interference toward the eigenmodes selected for the co-scheduled UEs $m \in S_{c} (t) \setminus \{k\}$ . In detail, let ${\hat{H}}_{k}^{(c)} (t) = {\hat{U}}_{k}^{(c)} (t) {\hat{Σ}}_{k}^{(c)} (t) {\hat{V}}_{k}^{(c) H} (t)$ be the singular value decomposition (SVD) of matrix ${\hat{H}}_{k}^{(c)} (t)$ , where the singular values in ${\hat{Σ}}_{k}^{(c)} (t)$ are arranged so that the ones selected for transmission toward UE k appear in the leftmost columns. By defining matrix
${\hat{Γ}}_{k}^{(c)} (t) = {[{[{\hat{Σ}}_{k}^{(c)} (t)]}_{0, 0} {[{\hat{V}}_{k}^{(c)} (t)]}_{\cdot, 0}, \; {[{\hat{Σ}}_{k}^{(c)} (t)]}_{1, 1} {[{\hat{V}}_{k}^{(c)} (t)]}_{\cdot, 1}, \; \dots, \; {[{\hat{Σ}}_{k}^{(c)} (t)]}_{l_{k} (t) - 1, l_{k} (t) - 1} {[{\hat{V}}_{k}^{(c)} (t)]}_{\cdot, l_{k} (t) - 1}]}^{H},$
(7)
precoding matrix used to serve UE k satisfies constraints
${\hat{Γ}}_{m}^{(c)} (t) {\hat{G}}_{k}^{(c)} (t) = 0_{l_{m} (t) \times l_{k} (t)}, k \neq m .$
(8)
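To make the equal power rule (6) concrete, the following sketch computes P^(c) from a list of cluster precoders. It assumes that the rows of each precoder are grouped BS by BS (M rows per BS), which is one possible layout of $G_{k}^{(c)}$; the function name and the toy numbers are hypothetical.

```python
import numpy as np

def equal_stream_power(precoders, num_bs, ant_per_bs, p_bs):
    """Eq. (6): largest common per-stream power meeting every per-BS constraint.

    precoders: list of arrays, one per scheduled UE, each of shape
               (num_bs * ant_per_bs, l_k); rows are grouped BS by BS.
    """
    load = np.zeros(num_bs)
    for G in precoders:
        for j in range(num_bs):
            rows = slice(j * ant_per_bs, (j + 1) * ant_per_bs)
            # Sum over streams of the squared norm of BS j's part of each column.
            load[j] += np.sum(np.abs(G[rows, :]) ** 2)
    # The most loaded BS transmits exactly at p_bs; the others stay below it.
    return p_bs / load.max()

# Toy cluster: two single-antenna BSs, one UE served with one unit-norm stream.
G = np.array([[0.6], [0.8]])
p_c = equal_stream_power([G], num_bs=2, ant_per_bs=1, p_bs=1.0)
print(p_c)  # BS 1 carries 0.64 of the stream energy, so p_c = 1 / 0.64
```

Because the denominator in (6) is a maximum over the BSs of the cluster, only the most loaded BS uses its full power budget; this is the price of the simplification with respect to an optimized per-stream allocation.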

Note that MET has been proven to outperform other linear precoding schemes, such as block diagonalization, in a MIMO broadcast channel [25], whereas the assumption of equal power allocation among the scheduled streams reduces the computational complexity and is asymptotically optimal at high SNR.

The main contribution of this work, described in Section 3, is a practical algorithm for dynamic BS clustering and corresponding UE scheduling developed in the considered setup with multiple antenna BSs, with equal power allocation and MET, that transmit toward multiple antenna UEs. Moreover, we recall that at UEs, the multiple receive antennas are used to perform IRC with SIC ([26], Ch. 10); in fact, while the rank l_k(t) allocated to UE k is given by the number of columns of the precoder, the remaining degrees of freedom at UE are used to partially suppress the residual ICI. Note that IRC both minimizes the mean square error and maximizes the SINR at the detection point [23].

2.3 Third phase: downlink data transmission

BS clusters serve the scheduled UEs by using the L_E−L_T resource elements still available in block t. By defining matrix $G_{k} (t) = {[G_{k, 1}^{T} (t), G_{k, 2}^{T} (t), \dots, G_{k, J}^{T} (t)]}^{T}$ and matrix H_k(t)=[H_k,1(t),H_k,2(t),…,H_k,J(t)], the signal received by UE k can be written as

r_{k} (t) = H_{k} (t) G_{k} (t) s_{k} (t) + \sum_{m \in S (t) \setminus \{k\}} H_{k} (t) G_{m} (t) s_{m} (t) + n_{k} (t) .

(9)

3 Dynamic clustering algorithm

In this section, we drop the block index t for the sake of clarity. With respect to UE k, let us define the function $f_{k} : J \to J$ , which orders the BSs on the basis of the large scale fading component of the channel, i.e., $σ_{k, f_{k} (c_{1})}^{2} > σ_{k, f_{k} (c_{2})}^{2}$ if c₁<c₂. Then, we indicate with $J_{k}^{(u)}$ the cluster of the u BSs with the strongest average channel toward UE k, i.e.,

J_{k}^{(u)} = \{f_{k} (1), f_{k} (2), \dots, f_{k} (u)\} .

(10)

Hence, f_k(1) is the anchor BS for UE k.

In a network with J BSs and a maximum cluster size of J_MAX, the number of possible BS clusters that can be constructed to serve a given UE is

\sum_{j = 1}^{J_{MAX}} \binom{J}{j},

(11)

which rapidly increases with J. However, as most of the interference at each UE comes from the closest BSs, we can limit the number of candidate BS clusters. Hence, we assume that set $C$ identifies all and only the sets $J_{k}^{(u)}$ whose size is not bigger than J_MAX. As an example, selecting J_MAX=3, set $C$ identifies three candidate clusters for each UE k: (i) the first cluster includes only its anchor BS, $J_{k}^{(1)}$ , (ii) the second cluster is composed of the two closest BSs, $J_{k}^{(2)}$ , and (iii) the third cluster is composed of the three closest BSs, $J_{k}^{(3)}$ . Note that, as different UEs often have the same candidate clusters, the cardinality of set $C$ turns out to be much lower than K J_MAX. The considered assumption yields an important saving in terms of computational complexity by strongly limiting the number of candidate clusters with respect to (11): this complexity saving is evaluated in Section 4 for a typical LTE scenario.
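The candidate construction can be sketched in a few lines of Python. The sketch below evaluates the full-search count (11) and builds the clusters $J_{k}^{(u)}$, u ≤ J_MAX, from a table of large scale fading values. BS and UE indices are 0-based here (the text uses 1-based indices), and the function names and gain values are made up for illustration.

```python
from math import comb

def full_search_count(num_bs, max_size):
    """Eq. (11): number of BS clusters with at most max_size members."""
    return sum(comb(num_bs, j) for j in range(1, max_size + 1))

def candidate_clusters(large_scale, max_size):
    """Collect the clusters J_k^(u), u = 1..max_size, over all UEs.

    large_scale[k][j] is the large scale fading between BS j and UE k.
    Returns a set of frozensets, so identical clusters are stored once.
    """
    candidates = set()
    for gains in large_scale:
        # f_k: BS indices sorted by decreasing large scale fading.
        order = sorted(range(len(gains)), key=lambda j: gains[j], reverse=True)
        for u in range(1, min(max_size, len(order)) + 1):
            candidates.add(frozenset(order[:u]))
    return candidates

# Toy example with 3 UEs and 4 BSs (made-up gains).
gains = [
    [0.90, 0.50, 0.10, 0.05],
    [0.80, 0.60, 0.20, 0.01],
    [0.10, 0.20, 0.90, 0.70],
]
cands = candidate_clusters(gains, max_size=3)
print(len(cands), "candidates instead of", full_search_count(4, 3))
print(full_search_count(21, 3))  # full search for J = 21 and J_MAX = 3
```

In the toy example, the first two UEs rank the BSs identically and therefore contribute the same three clusters, which illustrates why the number of candidates stays below K J_MAX.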

Now, clusters $J_{k}^{(u)}$ , with u≤J_MAX, are ordered by an integer index c, and for each cluster $J_{c}$ , $c \in C$ , we define the corresponding set $U_{c}$ of UEs that can be scheduled for reception, which is formed by the UEs whose anchor BS belongs to $J_{c}$ , i.e.,

U_{c} = \{k \in K : f_{k} (1) \in J_{c}\} .

(12)

We highlight that (12) allows BSs in cluster $J_{c}$ to serve all the UEs in its coverage area, even UEs close to the border. Although a different choice could be taken for instance by forcing clusters to serve only the UEs far away from the border, it has been shown in [27] that this alternative choice provides worse performance than (12) when a huge network is considered and fairness among the UEs is taken into account.
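A minimal sketch of (12), assuming a hypothetical list `anchor_bs` that stores f_k(1) for each UE k (0-based indices):

```python
def schedulable_ues(cluster, anchor_bs):
    """Eq. (12): UEs whose anchor BS f_k(1) belongs to the cluster."""
    return {k for k, bs in enumerate(anchor_bs) if bs in cluster}

# Made-up anchors for five UEs in a three-BS network.
anchor_bs = [0, 0, 2, 1, 2]
u_a = schedulable_ues({0, 1}, anchor_bs)
u_b = schedulable_ues({2}, anchor_bs)
print(u_a, u_b)
```

Since every UE has exactly one anchor BS, two clusters that share no BS always yield disjoint sets of schedulable UEs.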

In this work, we propose an algorithm for dynamic BS clustering and UE scheduling at the CU which follows a two-step procedure.

1. For each candidate BS cluster $J_{c}$ , we estimate the weighted sum rate ${\hat{R}}^{(c)}$ by selecting a suitable subset of UEs $S_{c} \subseteq U_{c}$ , designing precoders, selecting transmission ranks, and allocating powers.
2. After computing the weighted sum rate ${\hat{R}}^{(c)}$ for all the candidate BS clusters in $C$ , the CU schedules a set of non-overlapping BS clusters, where each BS belongs to at most one cluster.

Moreover, based on (12), we observe that UE k can be selected only by candidate clusters that include its anchor BS f_k(1). Hence, if we enforce a non-overlapping solution, each UE is never scheduled by two different non-overlapping clusters in the same block. However, we highlight that the proposed dynamic solution allows the flexibility of scheduling a given UE in different clusters across successive blocks.

In the rest of this section, we describe the above two main steps of the algorithm in more detail. We stress that the candidate cluster selection, i.e., the construction of set $C$ , depends on the large scale fading: hence, in our model, it should be performed only every T blocks. On the other hand, the two-step algorithm described above follows a fast fading time-scale and therefore must be implemented in each block. In Figure 4, we report a flow chart which summarizes the proposed system configuration.

3.1 Cluster weighted sum rate estimation

The CU estimates the weighted sum rate ${\hat{R}}^{(c)}$ that can be achieved by candidate cluster $J_{c}$ by modeling the signal received by UE $k \in U_{c}$ as in (4). Besides the errors due to the imperfect CSI (2), the estimated ICI ${\hat{i}}_{k}^{(c)}$ depends on the precoders used by the other clusters scheduled by the CU. In our framework, as the CU schedules the clusters after estimating the weighted sum rate achievable in each candidate, we simply assume ${\hat{i}}_{k}^{(c)} \sim C N (0_{N \times 1}, ξ_{k}^{(c)} I_{N})$ with

ξ_{k}^{(c)} = P^{(BS)} \sum_{j \in J ∖ J_{c}} σ_{k, j}^{2} .

(13)

Note that (13) represents the average ICI power at the UE k when all the BSs outside cluster c are transmitting at full power ([28], (2)), and for each candidate cluster allows the computation of ${\hat{R}}^{(c)}$ independently of the other candidates.
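The worst-case ICI power in (13) only needs the large scale fading values of the BSs outside the cluster. A minimal sketch, with hypothetical names and made-up linear-scale values:

```python
def ici_power(large_scale_k, cluster, p_bs):
    """Eq. (13): P_BS times the sum of sigma_{k,j}^2 over BSs outside the cluster."""
    return p_bs * sum(g for j, g in enumerate(large_scale_k) if j not in cluster)

# UE k sees four BSs; BSs 0 and 1 form the candidate cluster.
sigma2_k = [0.90, 0.50, 0.10, 0.05]
xi = ici_power(sigma2_k, cluster={0, 1}, p_bs=2.0)
print(xi)
```

Enlarging the cluster removes terms from the sum, so the assumed ICI power never increases with the cluster size.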

From (4), we introduce the interference plus noise covariance matrix

{\hat{Ψ}}_{k}^{(c)} = (σ_{n}^{2} + ξ_{k}^{(c)}) I_{N} + \sum_{m \in S_{c} \setminus \{k\}} {\hat{H}}_{k}^{(c)} G_{m}^{(c)} diag (P_{m}) G_{m}^{(c) H} {\hat{H}}_{k}^{(c) H} .

(14)

By using (4) and (14), we can write the estimate at the CU of the rate ${\hat{R}}_{k}$ achieved by UE k as

{\hat{R}}_{k} = (1 - \frac{L_{T}}{L_{E}}) \log_{2} \det (I_{N} + {\hat{H}}_{k}^{(c)} G_{k}^{(c)} diag (P_{k}) G_{k}^{(c) H} {\hat{H}}_{k}^{(c) H} {\hat{Ψ}}_{k}^{(c) - 1}) .

(15)
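The rate expression (15) can be evaluated numerically as in the following NumPy sketch. The function name and argument layout are assumptions made for illustration; the inputs are the estimated channel, the precoder, the stream powers, and the interference plus noise covariance matrix of (14).

```python
import numpy as np

def estimated_rate(H, G, powers, Psi, L_T, L_E):
    """Eq. (15): (1 - L_T/L_E) * log2 det(I + H G diag(P) G^H H^H Psi^-1)."""
    n_rx = H.shape[0]
    signal = H @ G @ np.diag(powers) @ G.conj().T @ H.conj().T
    arg = np.eye(n_rx) + signal @ np.linalg.inv(Psi)
    # slogdet is numerically safer than det followed by log.
    _, logdet = np.linalg.slogdet(arg)
    return (1.0 - L_T / L_E) * logdet / np.log(2.0)

# Scalar sanity check: unit channel, SNR 3, 20% pilot overhead.
H = np.array([[1.0]])
G = np.array([[1.0]])
rate = estimated_rate(H, G, [3.0], np.array([[1.0]]), L_T=2, L_E=10)
print(rate)  # 0.8 * log2(1 + 3)
```

In the single-antenna, single-stream case the expression collapses to the familiar log2(1 + SINR) scaled by the pilot overhead factor.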

Note that the estimate of the rate achieved by UE k in (15) is computed by assuming that the multiple receive antennas are exploited by performing SIC with IRC ([26], Ch. 10): the rank l_k allocated to UE k is given by the number of columns of precoder $G_{k}^{(c)}$ , whereas the remaining degrees of freedom are used to partially suppress the residual ICI by IRC. Then, the weighted sum rate ${\hat{R}}^{(c)}$ is estimated at the CU by solving the following optimization problem:

\begin{matrix} {\hat{R}}^{(c)} = max_{S_{c} \subseteq U_{c}, {\{G_{k}^{(c)}\}}_{k \in S_{c}}, {\{P_{k}\}}_{k \in S_{c}}} & \sum_{k \in S_{c}} α_{k} {\hat{R}}_{k} \end{matrix}

(16a)

s.t.

\sum_{k \in S_{c}} tr (G_{k, j}^{H} G_{k, j} diag (P_{k})) \leq P^{(BS)}, j \in J_{c},

(16b)

where scaling factor α_k in (16a) represents the quality of service (QoS) for UE k which depends on the employed scheduler.

Maximization (16) is a well-studied multi-user MIMO problem [29] involving (a) UE selection, (b) transmission rank selection, (c) precoding design, and (d) power allocation.

We solve problem (16) by enforcing the assumptions of equal power allocation among the streams sent within cluster $J_{c}$ and MET introduced in Section 2.2. Moreover, the eigenmodes (and accordingly the set $S_{c}$ of scheduled UEs and the transmission rank allocated to each UE $k \in S_{c}$ ) are selected by using a greedy iterative algorithm which, at each iteration, includes the eigenmode which maximizes the weighted sum rate ${\hat{R}}^{(c)}$ among the ones not scheduled in the previous iterations. The algorithm starts with no UE scheduled and stops when no increase in the weighted sum rate ${\hat{R}}^{(c)}$ is observed. Cluster $J_{c}$ , among the $N |U_{c}|$ possible eigenmodes, selects a maximum of $M |J_{c}|$ eigenmodes, due to the limited number of BS antennas. Note that the considered method flexibly adapts to the channel conditions by allowing the allocation of (a) different ranks to different UEs in the same block and (b) different ranks to the same UE across successive blocks.
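The greedy selection loop described above can be sketched generically as follows. The objective is passed in as a callable, so the sketch does not reproduce the MET precoder design; the `toy_rate` function below is a deliberately simplified stand-in (fixed eigenmode gains, total power split equally) and is an assumption of this example, not the paper's rate estimate.

```python
from math import log2

def greedy_select(candidates, objective, max_selected):
    """Add, one at a time, the candidate that most increases objective(selected)."""
    selected = []
    best_value = 0.0                     # nothing scheduled at the start
    remaining = list(candidates)
    while remaining and len(selected) < max_selected:
        value, choice = max(
            ((objective(selected + [c]), c) for c in remaining),
            key=lambda pair: pair[0],
        )
        if value <= best_value:
            break                        # no candidate increases the objective
        selected.append(choice)
        remaining.remove(choice)
        best_value = value
    return selected, best_value

# Hypothetical eigenmodes: (UE, mode index, gain).
modes = [("ue0", 0, 4.0), ("ue0", 1, 0.05), ("ue1", 0, 2.0)]

def toy_rate(selection, total_power=10.0):
    """Stand-in objective: equal power split over the selected modes."""
    share = total_power / len(selection)
    return sum(log2(1.0 + gain * share) for _, _, gain in selection)

chosen, value = greedy_select(modes, toy_rate, max_selected=4)
print([m[:2] for m in chosen], round(value, 3))
```

In this toy run the weak second eigenmode of `ue0` is left out, because adding it would dilute the power of the two stronger modes; `max_selected` plays the role of the limit $M |J_{c}|$.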

3.2 Clustering optimization

After computing all the weighted sum rates ${\hat{R}}^{(c)}$ , $c \in C$ , the CU schedules the set of non-overlapping clusters that maximizes the system weighted sum rate. In detail, by defining

a_{j, c} = \begin{cases} 1, & j \in J_{c}, \\ 0, & \text{otherwise}, \end{cases} \qquad x_{c} = \begin{cases} 1, & \text{CU schedules candidate cluster } J_{c}, \\ 0, & \text{otherwise}, \end{cases}

we consider that each BS belongs to at most one cluster, i.e., we impose

\sum_{c \in C} a_{j, c} x_{c} \leq 1, j \in J .

(17)

Therefore, at the CU, the clustering optimization is performed by solving the following linear integer optimization problem

max_{x_{c}, c \in C} \sum_{c \in C} {\hat{R}}^{(c)} x_{c},

(18)

s.t. (17).

Note that (18) differs from the optimization carried out in [15] where the objective function simply depends on the received power measured by the UEs.

Maximization (18) is the optimization version of the set packing problem, which is shown to be NP-hard [30]. Hence, as the exhaustive search is not a viable method to solve (18), we propose a greedy iterative algorithm which is reported in Algorithm 1: the proposed solution basically selects at each iteration the best (in terms of system weighted sum rate) cluster and ends when each BS has been assigned to at least one cluster. In detail, let $C^{(A)} (n)$ be the set identifying the candidate BS clusters considered at iteration n. The algorithm starts by imposing $C^{(A)} (1) \leftarrow C$ and ends when $C^{(A)} (n) = \emptyset$ . Note that $C^{(A)} (n)$ identifies all the candidate clusters that do not overlap with the clusters scheduled in the previous iterations. At iteration n, we select cluster $w \in C^{(A)} (n)$ that maximizes the per-BS weighted sum rate, i.e.,

w = \underset{c \in C^{(A)} (n)}{\arg\max} \frac{{\hat{R}}^{(c)}}{|J_{c}|},

(19)

and we remove from $C^{(A)} (n)$ all the indexes identifying clusters that partially overlap with $J_{w}$ . Note that in criterion (19), we normalize the cluster weighted sum rate ${\hat{R}}^{(c)}$ to the number of BSs $|J_{c}|$ included in the cluster with the aim of scheduling big clusters only if this really provides an improvement. Let us consider as an example a basic scenario with J=2: by using (19), we schedule the cluster of two BSs only if its weighted sum rate ${\hat{R}}^{(c)}$ is higher than that achieved when the two BSs are uncoordinated (SCP scheme).
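The greedy procedure can be sketched as follows, assuming the estimated weighted sum rates are already available. The dictionary-based data layout, the function name, and the example rates are illustrative choices, not taken from the paper.

```python
def greedy_clustering(clusters, rates):
    """Greedy approximation of (18) under constraint (17).

    clusters: dict mapping cluster index c to a frozenset of BS indices J_c.
    rates:    dict mapping c to the estimated weighted sum rate of cluster c.
    """
    available = set(clusters)            # candidates considered at iteration 1
    scheduled = []
    while available:
        # Criterion (19): best weighted sum rate per BS.
        w = max(available, key=lambda c: rates[c] / len(clusters[c]))
        scheduled.append(w)
        # Drop every candidate sharing a BS with J_w (this includes w itself).
        available = {c for c in available if not (clusters[c] & clusters[w])}
    return scheduled

# Toy network with three BSs and five candidate clusters (made-up rates).
clusters = {
    0: frozenset({0}), 1: frozenset({1}), 2: frozenset({2}),
    3: frozenset({0, 1}), 4: frozenset({1, 2}),
}
rates = {0: 3.0, 1: 2.0, 2: 4.0, 3: 7.0, 4: 5.0}
picked = greedy_clustering(clusters, rates)
print(picked, sum(rates[c] for c in picked))
```

In this example, BS 2 is first scheduled alone (4.0 per BS), which removes the candidate {1, 2}; the pair {0, 1} is then preferred over its two single-BS alternatives because 3.5 per BS exceeds both 3.0 and 2.0.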

By denoting with $x_{c}^{(*)}$ the greedy solution to (18) obtained by applying the proposed algorithm, the set of UEs scheduled in the current block turns out to be

S = ⋃_{c \in C : x_{c}^{(*)} = 1} S_{c} .

(20)

4 Numerical results

We consider a hexagonal cellular scenario where J=21 BSs, each equipped with M=4 antennas, are organized in seven sites, each with three co-located BSs (see also Figure 1). We consider ten UEs randomly dropped in the coverage area of each BS, with K=210 UEs overall. The power available at each BS is P^(BS)=46 dBm, the power available at each UE is P^(UE)=23 dBm, and the thermal noise power is $σ_{n}^{2} = - 101$ dBm. The large scale fading between BS j and UE k can be written as

σ_{k, j}^{2} = Γ^{(CE)} {(\frac{d^{(CE)}}{d_{k, j}})}^{ν} e^{ζ_{k, j}} A (θ_{k, j}),

(21)

where d_k,j is the distance between BS j and UE k, ν=3.5 is the path loss coefficient, Γ^(CE)|_dB=10 dB is the average SNR when a UE is at the cell edge, $e^{ζ_{k, j}}$ is the lognormal shadowing with 8 dB as standard deviation, and A(θ_k,j) models the antenna gain as a function of the direction θ_k,j of UE k with respect to the antennas of BS j, with

{A (θ_{k, j})|}_{dB} = - min \{12 {(θ_{k, j} / θ_{3 dB})}^{2}, A_{s}\},

(22)

where θ_{3 dB}=(70/180)π and A_s|_dB=20 dB ([31], (21.3)). We consider an inter-site distance of 500 m and a minimum distance d_min=35 m between BSs and UEs. Wraparound is used to deal with boundary effects [32]. We also assume that channels are correlated by considering the popular Kronecker model [33]. By denoting with R_BS the square correlation matrix of size M at the BS, with tr(R_BS)=M, and with R_UE the square correlation matrix of size N at the UE, with tr(R_UE)=N, we can write

H_{k, j} (t) = R_{UE}^{1 / 2} {\bar{H}}_{k, j} (t) {(R_{BS}^{1 / 2})}^{H},

(23)

where ${\bar{H}}_{k, j} (t)$ is a matrix of size N×M whose entries are independent and identically distributed zero-mean complex Gaussian random variables with $σ_{k, j}^{2}$ as statistical power.
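The channel model (21)-(23) can be sketched numerically as follows. Several choices here are assumptions made only for the example: the shadowing term is passed in dB, the cell-edge distance `d_ce` is an arbitrary value, the correlation matrices follow an exponential model with coefficient `rho`, and the Cholesky factor is used as the matrix square root (it satisfies R = R^{1/2} (R^{1/2})^H).

```python
import math
import numpy as np

def antenna_gain_db(theta, theta_3db=70.0 / 180.0 * math.pi, a_s_db=20.0):
    """Eq. (22): parabolic antenna pattern with a floor of -a_s_db."""
    return -min(12.0 * (theta / theta_3db) ** 2, a_s_db)

def large_scale_fading(d, d_ce, theta, shadow_db, gamma_ce_db=10.0, nu=3.5):
    """Eq. (21), with the shadowing term given in dB (a choice made here)."""
    total_db = gamma_ce_db + shadow_db + antenna_gain_db(theta)
    return 10.0 ** (total_db / 10.0) * (d_ce / d) ** nu

def kronecker_channel(sigma2, R_ue, R_bs, rng):
    """Eq. (23): H = R_UE^{1/2} H_bar (R_BS^{1/2})^H, H_bar entries CN(0, sigma2)."""
    n, m = R_ue.shape[0], R_bs.shape[0]
    H_bar = np.sqrt(sigma2 / 2.0) * (
        rng.standard_normal((n, m)) + 1j * rng.standard_normal((n, m))
    )
    # Cholesky factors L satisfy R = L L^H, one valid choice of square root.
    return np.linalg.cholesky(R_ue) @ H_bar @ np.linalg.cholesky(R_bs).conj().T

# Illustrative setup: N = 2 UE antennas, M = 4 BS antennas.
rng = np.random.default_rng(1)
N, M, rho = 2, 4, 0.5
R_ue = np.array([[rho ** abs(a - b) for b in range(N)] for a in range(N)])
R_bs = np.array([[rho ** abs(a - b) for b in range(M)] for a in range(M)])

# UE at the (assumed) cell-edge distance, on boresight, without shadowing.
sigma2 = large_scale_fading(d=250.0, d_ce=250.0, theta=0.0, shadow_db=0.0)
H = kronecker_channel(sigma2, R_ue, R_bs, rng)
print(sigma2, H.shape)
```

The exponential correlation matrices have unit diagonal, so they meet the trace normalizations tr(R_BS)=M and tr(R_UE)=N stated above.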

Results are obtained by simulating 100 UE drops and T=200 block channel realizations for each UE drop. We assume that proportional fair scheduling [34] is implemented to provide fairness among UEs, i.e., $α_{k} (t) = 1 / {\tilde{R}}_{k} (t)$ , with ${\tilde{R}}_{k} (t + 1) = (1 - γ) {\tilde{R}}_{k} (t) + γ R_{k} (t)$ , t=0,1,…,T−1, where γ=0.1 is the forgetting factor and we initialize ${\tilde{R}}_{k} (0) = \log_{2} (1 + P^{(BS)} σ_{k, j_{k}}^{2} / σ_{n}^{2})$ . However, to allow the scheduler to reach a steady state, only the last T/2 channels of each UE drop are considered for system performance evaluation.
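The proportional fair weights and the average rate recursion can be sketched as follows; the function names and the numerical values are hypothetical.

```python
def pf_weight(avg_rate):
    """Proportional fair weight: alpha_k(t) = 1 / R_tilde_k(t)."""
    return 1.0 / avg_rate

def update_average_rate(avg_rate, served_rate, gamma=0.1):
    """R_tilde_k(t+1) = (1 - gamma) * R_tilde_k(t) + gamma * R_k(t)."""
    return (1.0 - gamma) * avg_rate + gamma * served_rate

# A UE with average rate 2.0 that is not served in a block (R_k(t) = 0)
# sees its average decay and its weight grow.
avg = 2.0
skipped = update_average_rate(avg, 0.0)
served = update_average_rate(avg, 4.0)
print(pf_weight(avg), pf_weight(skipped), pf_weight(served))
```

A UE that is not scheduled gains priority in the following blocks, which is how the weights α_k enforce fairness in (16a).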

We compare the developed scheme based on dynamic clustering (DC) against the three static schemes SCP, ISC, and SC, introduced in Section 2. Moreover, we assume that UE scheduling, beamforming design, transmission rank selection, and power allocation are performed, as described in Sections 2.2 and 3.1, also for the static schemes: in particular, UEs are served by using MET [25] with equal power allocation among the eigenmodes and a greedy UE selection is performed within each BS cluster.

For performance evaluation, we assume perfect CSI at the UE side, which employs IRC with SIC, and perfect detection, i.e., there is no error propagation. Moreover, in Section 4.6, we further provide some numerical results that validate the assumption of perfect CSI at the UE employed in most of the CoMP literature.

From (9), the interference plus noise covariance matrix at the UE can be written as

\begin{matrix} Ψ_{k} (t) = σ_{n}^{2} I_{N} + \sum_{m \in S (t) ∖ {k}} H_{k} (t) G_{m} (t) diag (P_{m} (t)) G_{m}^{H} (t) H_{k}^{H} (t) . \end{matrix}

(24)

The effective rate achieved by UE k turns out to be ([26], Ch. 10)

R_{k} (t) = (1 - \frac{L_{T}}{L_{E}}) \log_{2} \det (I_{N} + H_{k} (t) G_{k} (t) diag (P_{k} (t)) G_{k}^{H} (t) H_{k}^{H} (t) Ψ_{k}^{- 1} (t)) .

(25)

In (25), the overhead due to the UE pilot transmission is taken into account in the scaling factor before the logarithm.
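As an illustration of (25), the sketch below evaluates the effective rate from two N×N matrices: the desired-signal covariance H_k G_k diag(P_k) G_k^H H_k^H and the interference plus noise covariance Ψ_k of (24). It relies on the identity det(I + SΨ^{-1}) = det(Ψ + S)/det(Ψ), so no matrix inverse is needed. The function names and the nested-list matrix representation are assumptions made for this example only.

```python
import math


def det(matrix):
    """Determinant of a square complex matrix (Gaussian elimination, partial pivoting)."""
    a = [[complex(x) for x in row] for row in matrix]
    n = len(a)
    result = 1.0 + 0.0j
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) == 0.0:
            return 0.0j
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            result = -result  # a row swap flips the sign of the determinant
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return result


def effective_rate(signal_cov, interf_cov, l_t, l_e):
    """Effective rate (25) in bit/s/Hz, including the pilot overhead factor.

    signal_cov: H_k G_k diag(P_k) G_k^H H_k^H (Hermitian, N x N)
    interf_cov: Psi_k from (24) (Hermitian positive definite, N x N)
    """
    n = len(signal_cov)
    total = [[signal_cov[i][j] + interf_cov[i][j] for j in range(n)]
             for i in range(n)]
    ratio = (det(total) / det(interf_cov)).real
    return (1.0 - l_t / l_e) * math.log2(ratio)
```

For a single receive antenna the expression reduces to the familiar (1 − L_T/L_E) log2(1 + SINR).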

The proposed schemes are compared in terms of:

UE rate, defined as
${\bar{R}}_{k} = \frac{2}{T} \sum_{t = T / 2}^{T - 1} R_{k} (t)$
(26)
Average cell rate, defined as
${\bar{R}}_{cell} = \frac{1}{J} \sum_{k \in K} {\bar{R}}_{k}$
(27)
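The two metrics in (26) and (27), together with the fifth percentile used throughout the results, can be computed as sketched below. The percentile definition (linear interpolation between order statistics) is an assumption, since the text does not specify one, and the function names are hypothetical.

```python
def ue_rate(rates_over_time):
    """UE rate (26): average over the last T/2 blocks, with T = len(rates_over_time) even."""
    t_total = len(rates_over_time)
    return 2.0 / t_total * sum(rates_over_time[t_total // 2:])


def average_cell_rate(ue_rates, num_bs):
    """Average cell rate (27): sum of all UE rates divided by the number J of BSs."""
    return sum(ue_rates) / num_bs


def percentile(values, q):
    """q-th percentile (0 to 100) by linear interpolation between order statistics."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(ordered) - 1)
    return ordered[lo] + (ordered[hi] - ordered[lo]) * (pos - lo)
```

Discarding the first T/2 blocks in `ue_rate` matches the steady-state argument made for the proportional fair scheduler.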

First, to evaluate the complexity saving achieved by the candidate cluster selection described at the beginning of Section 3, we show in Table 2 the 5th, the 50th, and the 95th percentiles of the number of candidate clusters considered with DC by assuming J_MAX=3. By adapting the candidate clusters to the long-term channel conditions, we have a saving of about 80% in terms of the number of candidate clusters with respect to the full search (11); in fact, with our approach, we ignore candidate clusters that include far apart BSs.
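To give a feel for the size of an exhaustive search, the sketch below counts every non-empty set of at most J_MAX BSs out of J. This is only an assumption about what the full search enumerates, since (11) is not reproduced in this section, and the function name is hypothetical.

```python
from math import comb


def full_search_cluster_count(num_bs, max_size):
    """Number of non-empty BS subsets with at most max_size elements."""
    return sum(comb(num_bs, size) for size in range(1, max_size + 1))
```

Under this assumption, `full_search_cluster_count(21, 3)` gives the number of candidates for the J=21, J_MAX=3 setup considered here, and the count grows quickly with J_MAX.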

Table 2 Number of candidate clusters with J_MAX = 3: comparison between DC and exhaustive search

4.1 Effect of multiple antennas at UEs

In this section, we consider perfect CSI at BSs, i.e., ${\hat{H}}_{k, j} (t) = H_{k, j} (t)$ in (2), uncorrelated antennas, i.e., in (23) R_BS=I_M and R_UE=I_N, and we assume J_MAX=3 with DC for a fair comparison against ISC and SC in terms of maximum cluster size. In Figures 5 and 6, we report the average cell rate and the fifth percentile of the UE rate, respectively, for three values of the number N of UE antennas. First, we observe a significant performance improvement by adding antennas at the UE side. For instance, with SCP by increasing N from 1 to 4, there is a gain of about 76% in terms of the fifth percentile of the UE rate. Two factors mainly contribute to this gain: (a) UEs with lower SINR use IRC to limit the impact of residual ICI not managed at the transmit side and (b) UEs with higher SINR can be served by multiple streams of data. From Table 3, where we report the distribution of the number of streams l_k with N=4, we note that with SCP more than 80% of the transmissions are rank 1. On the other hand, with DC, as the interference level suffered by the UEs is lower, about 33% of transmissions are multi-stream. This shows that in general, most of the gain is due to the IRC and multi-stream transmission plays a non-negligible role only with DC. Then, we observe that ISC provides a moderate gain with respect to SCP in terms of cell rate, but almost no gain in terms of the fifth percentile of the UE rate, whereas the opposite happens with SC. Indeed, ISC (Figure 2) only allows cooperation among the sectors of the same site and therefore only partially helps the UEs close to the site border, which get better performance with SC (Figure 3). Moreover, we also observe that the performance gain achieved by DC over SCP decreases by adding more antennas at the UE side.
In fact, as the gain of using multiple antenna UEs is mainly due to the IRC which cancels ICI, the benefits of increasing N are seen more in a non-cooperative scenario, where the residual ICI is higher with respect to DC. In detail, from Figure 6, the performance gain achieved by DC over SCP drops from about 43% with N=1 to about 28% with N=4.

Table 3 Distribution (%) of l_k with N = 4

To further understand the role of IRC and multi-stream transmission with both SCP and CoMP, we consider now a simplified solution to (16) obtained by assuming a maximum number of streams l^(MAX) that can be transmitted toward each UE and limiting the eigenmodes that can be used for each UE to only the strongest l^(MAX) eigenmodes. In Table 4, we report the average cell rate and the fifth percentile of the UE rate by considering N=4 and l^(MAX)=1,4. Note that l^(MAX)=1 means that each UE is always served by rank 1 transmissions along its strongest eigenmode and implements IRC, whereas l^(MAX)=N=4 means that there are no constraints on the number of data streams. Again, as the level of ICI is higher with SCP, in terms of the fifth percentile of the UE rate, the gain achieved by DC over SCP decreases from about 28% with l^(MAX)=4 to about 18% with l^(MAX)=1. These results confirm that the gain of CoMP with respect to the baseline non-cooperative scheme can still be important when UEs are equipped with multiple antennas, but only if multi-stream transmission is properly exploited.

Table 4 Average cell rate and fifth percentile of the UE rate with N = 4 and l^(MAX) = 1,4

4.2 Effect of antenna correlation

In Figure 7, we consider N=4, l^(MAX)=1, and perfect CSI at BSs, and we introduce correlation among UE antennas by assuming that R_UE is a symmetric Toeplitz matrix whose first column is [R_UE]_{·,0}=[1,β,…,β^(N−1)]^T, and plot the fifth percentile of the UE rate as a function of β. As expected, a higher rate is achieved with low-correlated antennas, i.e., for lower values of β. Moreover, we also observe that the gain achieved by DC over SCP decreases as β decreases: in detail, this gain drops from 25% with β=0.9 to 18% with β=0.1. In fact, by decreasing the correlation among UE antennas, we improve the interference suppression capability of IRC. These results confirm that it is not worthwhile to add more antennas at the UE when they are strongly correlated.
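The UE correlation matrix used in this section is easy to construct explicitly: a symmetric Toeplitz matrix with first column [1,β,…,β^(N−1)]^T has entry β^|i−j| in row i and column j. A minimal sketch, with a hypothetical function name:

```python
def ue_correlation_matrix(n, beta):
    """Symmetric Toeplitz matrix with first column [1, beta, ..., beta**(n-1)]."""
    return [[beta ** abs(i - j) for j in range(n)] for i in range(n)]
```

The diagonal is all ones, so tr(R_UE)=N as required by the Kronecker model (23), and β=0 gives the identity matrix of the uncorrelated case.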

4.3 Effect of UE selection

In this section, we consider N=4, l^(MAX)=1, and perfect CSI at the BSs, and we evaluate how the heuristic greedy UE selection method introduced in Section 3.1 to solve problem (16) and the proposed dynamic clustering scheme impact the gain achieved by CoMP over SCP. Therefore, in Figure 8, we evaluate SCP and DC with J_MAX=3 in terms of the fifth percentile of the UE rate by considering a round robin UE selection with S^(BS)=1,2,3,4 UEs scheduled by each BS: note that when the UEs are selected in a round robin fashion, the gain achieved by CoMP over SCP depends only on the clustering scheme, as no joint optimization of UE scheduling and clustering is performed. First, we observe that S^(BS)=2 represents the optimal value for both SCP and DC. In fact, when the UEs are scheduled in a round robin fashion, increasing S^(BS) allows more UEs to be served in each block, but it also increases the likelihood of selecting UEs whose channels are almost parallel, thus degrading the achieved rates. Then, when we look at the impact of the UE selection method, we observe that with DC, a gain of about 37% is achieved by the greedy method over the round robin approach (see also Table 4). Finally, we observe that the gain achieved by DC over SCP decreases from about 18% with the greedy method to about 11% with the round robin approach.

4.4 Effect of cluster size

In Figure 9, by assuming N=1,4, l^(MAX)=1, uncorrelated antennas and perfect CSI at BSs, we compare SCP and DC in terms of the fifth percentile of the UE rate for four values of the maximum cluster size J_MAX. An important gain is observed with CoMP by increasing J_MAX: for instance, the gain achieved by DC over SCP increases from 43% (18%) with J_MAX=3 to about 84% (40%) with J_MAX=6 when N=1 (N=4). These results show that although the strongest interferers are managed by CoMP with J_MAX=3, the ICI suffered by UEs is still very high and strongly limits system performance. Hence, a general comment is that BS clusters of higher dimension should be employed if the backhaul infrastructure is able to handle it.

4.5 Effect of imperfect CSI at BSs

In this section, we assume that the CSI at BSs is affected by noise (2) due to the finite number of resource elements L_T allocated to the pilot transmissions in each block. After denoting with f_d the maximum Doppler frequency, and with ${\bar{τ}}_{rms}$ the root-mean square delay spread of the channel, we define, respectively, the coherence bandwidth ([35], Ch. 4) and coherence time ([36], Ch. 4) of the channel as

W_{C} = \frac{1}{{\bar{τ}}_{rms}},

(28a)

T_{C} = \frac{0.423}{f_{d}} .

(28b)

Note that the above expressions are only used to determine the block size L_E such that the channel can be modeled as uncorrelated between adjacent blocks. Indeed, if f_d or ${\bar{τ}}_{rms}$ increases, L_E is reduced and this lowers the rate of each UE as given by (25). Because reliable CSI is difficult to obtain at BSs in a high mobility scenario, in the following we consider f_d=5 Hz, which at 2.5 GHz carrier frequency roughly corresponds to a mobile velocity of 2 km/h ([31], Ch. 21). In this section, we also assume N=4, l^(MAX)=1, uncorrelated antennas and J_MAX=3 with DC.
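The quantities in (28a) and (28b), and the velocity quoted for f_d=5 Hz, can be checked numerically. The sketch below assumes the standard Doppler relation f_d = v f_c / c, which is not written out in the text; the function names are hypothetical.

```python
SPEED_OF_LIGHT = 299_792_458.0  # m/s


def coherence_bandwidth(tau_rms):
    """Coherence bandwidth (28a) in Hz for an rms delay spread in seconds."""
    return 1.0 / tau_rms


def coherence_time(f_d):
    """Coherence time (28b) in seconds for a maximum Doppler frequency in Hz."""
    return 0.423 / f_d


def velocity_kmh(f_d, carrier_hz):
    """Mobile velocity in km/h from f_d = v * f_c / c (assumed relation)."""
    return f_d * SPEED_OF_LIGHT / carrier_hz * 3.6
```

With f_d=5 Hz and a 2.5 GHz carrier, `velocity_kmh` returns about 2.16 km/h, consistent with the rounded value of 2 km/h, and `coherence_time(5.0)` is about 85 ms.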

In Figure 10, we consider the extended pedestrian A (EPA) model, which is a very low frequency selective channel with ${\bar{τ}}_{rms} = 43$ ns ([31], Tab. 21.2). In detail, we show the fifth percentile of the UE rate versus the ratio L_T/L_E, which represents the fraction of resources used for pilot transmission. The dashed lines are the rates computed in Section 4.1 by assuming perfect CSI at BSs. We observe that in this case, rate performance close to the perfect CSI case can be achieved by properly increasing the value of L_T. Moreover, note that SCP approaches the best performance faster than CoMP schemes. In fact, while with SCP only the channels between a BS and its anchored UEs are used for precoding design, with CoMP the precoders are also optimized on the basis of the channels between some other auxiliary BSs and these UEs. As these channels are generally characterized by a lower SNR with respect to the channel between a BS and its anchored UEs, more pilots are necessary to collect a reliable CSI at the transmit side.

In Figure 11, we plot the fifth percentile of the UE rate versus the ratio L_T/L_E for the very frequency-selective extended typical urban (ETU) channel model, characterized by ${\bar{τ}}_{rms} = 991$ ns ([31], Tab. 21.2). In this case, we observe that rates increase with L_T up to a maximum and then decrease. In fact, increasing the value of L_T has two conflicting effects: (a) from (3) a more reliable CSI is collected at BSs thus improving performance and (b) a lower number of resource elements is allocated to data transmission thus obviously reducing the achievable rate. Clearly, for lower values of L_T, the effect of a better CSI dominates, whereas for higher values of L_T, the CSI is reliable enough for the SINR level of the UEs, and a further increase of the number of pilots represents only a waste of resources. Even if we are still considering a low mobility scenario, due to the higher frequency selectivity of the ETU channel model, no scheme reaches the rates achieved with perfect CSI. Then, as observed for the EPA model, the fraction of resources allocated to pilots necessary to reach the peak in performance is lower for SCP (L_T/L_E≈0.02) than that for DC (L_T/L_E≈0.03). By choosing for each scheme the value of L_T which provides the best rate, the performance gain achieved by DC over SCP decreases with respect to the perfect CSI case to about 16%.

4.6 Effect of imperfect CSI at UEs

In this section, we elaborate on the assumption of perfect CSI at UEs considered in (25). Indeed, it has already been shown [37, 38] that in a similar setup, the overhead required to obtain a reliable CSI at UEs is almost negligible when compared to the overhead necessary to acquire CSI at BSs. This result is simply explained by the large difference in available power between BSs and UEs, which is 23 dB in a typical LTE scenario: hence, for the same quality in CSI estimate, many more resources are needed for an estimate at BSs than for an estimate at UEs. Moreover, in our model, UE k, in order to implement IRC, needs to know the interference covariance matrix Ψ_k(t) (24) of size N×N, which also depends on the precoders used by the interfering clusters serving their own UEs. Two ways of estimating Ψ_k(t) at UE k are (a) a simple average of the received samples as proposed in ([39], p. 10) to directly estimate Ψ_k(t) or (b) the use of orthogonal training sequences among different BS clusters to estimate the cascade of channel and precoder H_k(t)G_m(t), m≠k, which, in turn, is used to construct matrix Ψ_k(t) from (24). Differently from Section 4.5 where the results are shown in terms of the length of the uplink training sequence L_T, here, we assume perfect CSI at BSs and set the length of the downlink training sequences employed by the clusters to its minimum value JM, equal to the maximum number of streams that can be sent by the BSs. We report in Table 5 the average cell rate and the fifth percentile of the UE rate with SCP and DC by also assuming N=4, l^(MAX)=1, uncorrelated antennas, and J_MAX=3. We consider the ETU channel in a low mobility scenario with f_d=5 Hz. As expected, when compared to the perfect CSI case reported in Table 4, the performance loss when imperfect CSI is assumed at UEs turns out to be less than 1%, i.e., almost negligible.
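Option (a) above, averaging received samples, amounts to a sample covariance estimate. The sketch below is a generic version of that idea, not the specific procedure of [39]: it assumes the UE has a set of length-N received vectors that contain only interference plus noise, and the function name is hypothetical.

```python
def sample_covariance(samples):
    """Estimate an N x N covariance matrix as the average of y y^H over vectors y."""
    n = len(samples[0])
    cov = [[0j] * n for _ in range(n)]
    for y in samples:
        for i in range(n):
            for j in range(n):
                cov[i][j] += y[i] * complex(y[j]).conjugate()
    count = len(samples)
    return [[entry / count for entry in row] for row in cov]
```

The estimate is Hermitian by construction, and its accuracy improves with the number of averaged samples, which is why the associated overhead depends on how many resource elements are available for the average.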

Table 5 Average cell rate and fifth percentile of the UE rate for the ETU channel with imperfect CSI at UEs

5 Conclusions

In this paper, we have considered a downlink CoMP-JP system and, by assuming a maximum cluster size, we have developed a dynamic BS clustering algorithm where the clusters change over time adapting to the channel conditions. We consider UEs equipped with multiple antennas that implement IRC and are served by a multi-stream transmission. The proposed algorithm first defines a set of candidate BS clusters depending on the large scale channel fading. Then, a two-step procedure is applied on the fast fading time scale: (a) first, a weighted sum rate is estimated within each candidate BS cluster by performing UE selection, precoding, power allocation, and transmission rank selection, and then (b) the CU schedules the set of non-overlapping BS clusters that maximizes the estimated system weighted sum rate. Numerical results show that much higher effective rates can be achieved when UEs are equipped with multiple antennas. In fact, by reducing the level of interference suffered by UEs, the proposed approach exploits multi-stream transmission more than SCP does. However, as most of the gain is due to the IRC, the gain achieved by the proposed approach over SCP decreases as the number of UE antennas increases. Finally, when channel estimation is considered at BSs, the gain promised in the perfect CSI scenario may be achieved only in part; in fact, a better estimate requires a longer training sequence and this lowers the system rate.

References

1. Marsch P, Fettweis G: Coordinated multi-point in mobile communications. Cambridge University Press, Cambridge, England; 2011.
2. Karakayali MK, Foschini GJ, Valenzuela RA: Network coordination for spectrally efficient communications in cellular systems. IEEE Wireless Commun Mag 2006, 13(4):56-61. 10.1109/MWC.2006.1678166
3. Gesbert D, Hanly S, Huang H, Shamai S, Simeone O, Yu W: Multi-cell MIMO cooperative networks: a new look at interference. IEEE J. Sel. Areas Commun 2010, 28(9):1380-1408.
4. Björnson E, Jorswieck E: Optimal resource allocation in coordinated multi-cell systems. Foundations and Trends in Communications and Information Theory 2012, 9(2-3):113-381.
5. 3GPP TR 36.819 v11.1.0: Coordinated multi-point operation for LTE physical layer aspects (Release 11). 2011.
6. Irmer R, Droste H, Marsch P, Grieger M, Fettweis G, Brueck S, Mayer H-P, Thiele L, Jungnickel V: Coordinated multipoint: concepts, performance, and field trial results. IEEE Commun. Mag 2011, 49(2):102-111.
7. Zakhour R, Gesbert D: Optimized data sharing in multicell MIMO with finite backhaul capacity. IEEE Trans. Signal Process 2011, 59(12):6102-6111.
8. Baracca P, Tomasin S, Benvenuto N: Constellation quantization in constrained backhaul downlink network MIMO. IEEE Trans. Commun 2012, 60(3):830-839.
9. Zhang J, Chen R, Andrews JG, Ghosh A, Heath RW: Networked MIMO with clustered linear precoding. IEEE Trans. Wireless Commun 2009, 8(4):1910-1921.
10. Papadogiannis A, Gesbert D, Hardouin E: A dynamic clustering approach in wireless networks with multi-cell cooperative processing. In Proc. IEEE International Conference on Communications (ICC). Beijing, China; 2008.
11. Boccardi F, Huang H, Alexiou A: Network MIMO with reduced backhaul requirements by MAC coordination. In Proc. IEEE Conference on Signals, Systems and Computers (Asilomar). Pacific Grove, CA; 2008.
12. Moon J-M, Cho D-H: Inter-cluster interference management based on cell-clustering in network MIMO systems. In Proc. IEEE Vehicular Technology Conference (VTC Spring). Budapest, Hungary; 2011.
13. Liu J, Wang D: An improved dynamic clustering algorithm for multi-user distributed antenna system. In Proc. IEEE International Conference on Wireless Communications & Signal Processing (WCSP). Nanjing, China; 2009.
14. Zhou S, Gong J, Niu Z, Jia Y, Yang P: A decentralized framework for dynamic downlink base station cooperation. In Proc. IEEE Global Communications Conference (GLOBECOM). Honolulu, HI; 2009.
15. Weber R, Garavaglia A, Schulist M, Brueck S, Dekorsy A: Self-organizing adaptive clustering for cooperative multipoint transmission. In Proc. IEEE Vehicular Technology Conference (VTC Spring). Budapest, Hungary; 2011.
16. Papadogiannis A, Bang HJ, Gesbert D, Hardouin E: Efficient selective feedback design for multicell cooperative networks. IEEE Trans. Veh. Technol 2011, 60(1):196-205.
17. Gong J, Zhou S, Niu Z, Geng L, Zheng M: Joint scheduling and dynamic clustering in downlink cellular networks. In Proc. IEEE Global Communications Conference (GLOBECOM). Houston, TX; 2011.
18. Zakhour R, Gesbert D: Distributed multicell-MISO precoding using the layered virtual SINR framework. IEEE Trans. Wireless Commun 2010, 9(8):2444-2448.
19. Hong M, Sun R, Baligh H, Luo Z-Q: Joint base station clustering and beamformer design for partial coordinated transmission in heterogeneous networks. IEEE J. Sel. Areas Commun 2013, 31(2):226-240.
20. Boccardi F, Clerckx B, Ghosh A, Hardouin E, Jöngren G, Kusume K, Onggosanusi E, Tang Y: Multiple-antenna techniques in LTE-advanced. IEEE Commun. Mag 2012, 50(3):114-121.
21. Hwang I, Chae C-B, Lee J, Heath RW: Multicell cooperative systems with multiple receive antennas. IEEE Wireless Commun. Mag 2013, 20(1):50-58.
22. Clerckx B, Lee H, Hong Y-J, Kim G: A practical cooperative multicell MIMO-OFDMA network based on rank coordination. IEEE Trans. Wireless Commun 2013, 12(4):1481-1491.
23. Winters J: Optimum combining in digital mobile radio with cochannel interference. IEEE J. Sel. Areas Commun 1984, 2(4):528-539.
24. Kay S: Fundamentals of statistical signal processing, volume I: estimation theory. Prentice Hall, Upper Saddle River, New Jersey; 1993.
25. Boccardi F, Huang H: A near-optimum technique using linear precoding for the MIMO broadcast channel. In Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Honolulu, HI; 2007.
26. Tse D, Viswanath P: Fundamentals of wireless communication. Cambridge University Press, Cambridge, England; 2005.
27. Baracca P, Boccardi F, Braun V: A dynamic joint clustering scheduling algorithm for downlink CoMP systems with limited CSI. In Proc. IEEE International Symposium on Wireless Communication Systems (ISWCS). Paris, France; 2012.
28. Huh H, Tulino AM, Caire G: Network MIMO with linear zero-forcing beamforming: large system analysis, impact of channel estimation, and reduced-complexity scheduling. IEEE Trans. Inf. Theory 2012, 58(5):2911-2934.
29. Spencer QH, Peel CB, Swindlehurst AL, Haardt M: An introduction to the multi-user MIMO downlink. IEEE Commun. Mag 2004, 42(10):60-67. 10.1109/MCOM.2004.1341262
30. Hoffman K, Padberg M: Set covering, packing and partitioning problems. Springer Encyclopedia of Optimization 2001, 2348-2352.
31. Sesia S, Toufik I, Baker M: LTE: The UMTS Long Term Evolution. John Wiley & Sons, Hoboken, New Jersey; 2009.
32. Hytönen T: Optimal wrap-around network simulation. Helsinki University of Technology, Report A432 2001.
33. Kermoal JP, Schumacher L, Pedersen KI, Mogensen PE, Frederiksen F: A stochastic MIMO radio channel model with experimental validation. IEEE J. Sel. Areas Commun 2002, 20(6):1211-1226. 10.1109/JSAC.2002.801223
34. Viswanath P, Tse D, Laroia R: Opportunistic beamforming using dumb antennas. IEEE Trans. Inf. Theory 2002, 48(6):1277-1294. 10.1109/TIT.2002.1003822
35. Benvenuto N, Cherubini G: Algorithms for communications systems and their applications. John Wiley & Sons, Hoboken, New Jersey; 2002.
36. Rappaport T: Wireless communications: principles and practice. Prentice Hall, Upper Saddle River, New Jersey; 2002.
37. Marzetta T, Hochwald BM: Fast transfer of channel state information in wireless systems. IEEE Trans. Signal Process 2006, 54(4):1268-1278.
38. Gomadam KS, Papadopoulos HC, Sundberg C-EW: Techniques for multi-user MIMO with two-way training. In Proc. IEEE International Conference on Communications (ICC). Beijing, China; 2008.
39. Hoydis J, Hosseini K, ten Brink S, Debbah M: Making smart use of excess antennas: massive MIMO, small cells, and TDD. Bell Labs Tech. J 2013, 18(2):5-21. 10.1002/bltj.21602

Acknowledgments

Part of this work has been performed in the framework of the FP7 project ICT-317669 METIS, which is partly funded by the European Union. The authors would like to acknowledge the contributions of their colleagues in METIS, although the views expressed are those of the authors and do not necessarily represent the project. Part of this work has been presented at the International Symposium on Wireless Communication Systems (ISWCS) 2012, Paris (France), and at the International Conference on Signal Processing, Computing and Control (ISPCC) 2013, Shimla (India). This work was carried out when Federico Boccardi was with Bell Labs, Alcatel-Lucent.

Author information

Authors and Affiliations

Bell Labs, Alcatel-Lucent, Lorenzstrasse 10, Stuttgart, 70435, Germany
Paolo Baracca
Vodafone, Newbury, Berkshire, RG14 2PZ, UK
Federico Boccardi
Department of Information Engineering, University of Padova, Via G. Gradenigo 6/b, Padova, 35131, Italy
Nevio Benvenuto

Corresponding author

Correspondence to Paolo Baracca.

Additional information

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

About this article

Cite this article

Baracca, P., Boccardi, F. & Benvenuto, N. A dynamic clustering algorithm for downlink CoMP systems with multiple antenna UEs. J Wireless Com Network 2014, 125 (2014). https://doi.org/10.1186/1687-1499-2014-125

Received: 13 March 2014
Accepted: 11 July 2014
Published: 08 August 2014
DOI: https://doi.org/10.1186/1687-1499-2014-125
