Practical decentralized high-performance coordinated beamforming for both downlink and uplink in time-division duplex systems

Lu, Enoch; Lu, I-Tai

doi:10.1186/1687-1499-2013-251

Research
Open access
Published: 29 October 2013

Practical decentralized high-performance coordinated beamforming for both downlink and uplink in time-division duplex systems

Enoch Lu¹ &
I-Tai Lu¹

EURASIP Journal on Wireless Communications and Networking volume 2013, Article number: 251 (2013) Cite this article

1225 Accesses
1 Citations
Metrics details

Abstract

Coordinated beamforming (CBF) has been studied in hope of mitigating the inter-cell interference experienced by cell-edge users. Unfortunately, due to the limitations and/or impracticalities of the proposed designs, the expected performance gains have yet to be realized. Relying on channel soundings from the users and equivalent channel soundings from the cell sites (all on the same frequency), both downlink and uplink decentralized frameworks (and various example designs) are proposed in this paper for the practical transceiver and signaling design of a K-pair system desiring to employ CBF. Remarkably, the proposed singular value decomposition (SVD) example design for both frameworks is equivalent to a centralized interference alignment (IA) design. Furthermore, three other proposed design examples achieve better bit error rate, mean square error, and sum capacity performances than the proposed IA-equivalent SVD example design, respectively. In addition, higher sum capacities than the generalized iterative approach, a centralized minimum mean square error CBF design, are numerically observed. Clearly, practical CBF designs which can deliver the expected performance gains are finally available.^a

1. Introduction

Coordinated multipoint transmission/reception (CoMP) [1–3] is currently attracting a lot of attention (e.g., in Long Term Evolution-Advanced (LTE-A)). It, being based on network multiple-input and multiple-output (MIMO), has multiple cell sites that coordinate their transmissions or receptions (intra-site CoMP can involve only one cell site but this technicality is ignored here for the sake of clarity). If successfully implemented, CoMP can mitigate the inter-site interference, e.g., the inter-cell interference experienced by cell-edge users. In addition, it can improve the coverage and spectral efficiency of the next generation cellular systems [1]. Based on [3], CoMP is divided into four types: joint transmission, dynamic cell selection, coordinated scheduling, and coordinated beamforming (CBF). CBF, the type studied in this paper, has the involved cell sites that coordinate their transmissions/receptions and transceiver designs in order to minimize the inter-site interference experienced by their users (each user’s data comes from or goes to only one of the cell sites). The coordination of the multiple cell sites must not incur too high of a load on the network. Yet it must provide the desired performance. Thus far, no scheme has achieved this difficult goal. On the contrary, the gap between gains envisaged with perfect channel feedback (in academia and industry) and with practical feedback schemes remains high [4]. So, the search continues for a practical and high-performing CoMP scheme.

There are four fundamental reasons why most of the proposed CBF designs are impractical. Firstly and most importantly, the vast majority of the proposed CBF designs, in comparison with their said benefits, require too much information exchange between the nodes. Many of the proposed designs assume that a central processing unit has full perfect channel state information (CSI) and performs the entire transceiver design (e.g., [5–7]). Others assume that some type of iterative process can occur between the cell sites and/or the users (e.g., [8–10]). All in all, most designs assume that a large amount of information (whether it be CSI, precoders, decoders, etc.) can be transported through the network. This is especially true with regard to the CSI if the system is frequency-division duplex (FDD) in nature.

Secondly, many of the proposed CBF transceiver designs are too complex. Due to the number of design variables and the coupling between them, the majority of the designs are iterative in nature (e.g., [5–11]). Though some have proofs of convergence, there is generally no guarantee of how many iterations (equivalently, time) are needed. As the channels are time-varying, this is highly undesirable. Thirdly, many of the designs do not decouple the transceiver designs of the cells involved. Their designs for one cell’s (a cell site and its user(s)) precoder(s) and decoder(s) are generally heavily dependent on the modulation coding schemes (MCS), number of data streams, transmit powers, etc. of the other cells. This can greatly complicate the schedulers. Last but not the least, most of the designs are done using an average total power constraint per transmitter (e.g., [5, 6, 8–11]). Since in practice, each antenna is connected to its own power amplifier, this is not sufficient; the instantaneous power per antenna must be adequately constrained.

In this work, both downlink and uplink frameworks (and various example designs) are proposed for the practical transceiver and signaling design of a K-pair system desiring to employ CBF (the downlink framework has been partially presented in [12]). Relying on channel soundings from the users and equivalent channel soundings from the cell sites - all on the same frequency (possible in time-division duplex (TDD) systems) - both frameworks are able to overcome the four critical issues listed above. Firstly, the frameworks are decentralized and require a low amount of information exchange. Neither central processing unit is used nor is there any explicit feedback or feed-forward of CSI, precoders, and decoders; each node obtains all the CSI it needs from the channel soundings/equivalent channel soundings. Each node designs its own precoder or decoder. Furthermore, time synchronization of the cells is essentially only needed for the initial channel sounding.

Secondly, the frameworks’ transceiver designs at each cell site and user are of a low complexity. At a cell site, the transceiver designs consist of only two steps: (a) using a singular value decomposition (SVD) to form the nulling portion of the precoder or decoder (the nulling action is essentially a block diagonalization [13] of the overall system channel matrix) and (b) applying a single-user MIMO closed-form solution. At a user, they are even simpler - apply a single-user MIMO closed-form solution. Thirdly, the frameworks completely decouple the transceiver designs of the different pairs. It is proved that its design for a pair does not depend on the MCSs, number of data streams, transmit powers, etc. of the other pairs. Lastly, the frameworks allow limits to be imposed on the instantaneous transmit power of each antenna.

Even though the proposed frameworks are practical, their performances are comparable to those which are not. It is proven that one of the proposed example designs (specifically, the SVD design) for both frameworks is equivalent to a centralized interference alignment (IA) design. Furthermore, it is shown numerically that three other proposed examples achieve better bit error rate, mean square error, and sum capacity performances than the proposed IA-equivalent SVD example, respectively. In addition, higher sum capacities than the generalized iterative approach, a centralized minimum mean square error (MMSE) CBF design, are numerically observed for some proposed examples. Clearly, here are practical low-information exchange overhead CBF designs which are able to deliver the long-awaited performance gain.

There are other CBF schemes in the literature which result in zero inter-site interference. They are however, to the best of our knowledge, significantly different from our proposal. For example, the papers with single antenna receivers (e.g., [7, 9]) do not need a decoder. The papers with all multiantenna nodes (e.g., [14] and other IA schemes) run into the four practicality issues discussed before. Lastly, the vast majority of them are purely transceiver designs - they do not consider signaling as is done here.

We have one last introductory comment. The strong aspects of this work are in the practical side: the local channel information (obtained via sounding in TDD mode) is adequate, and each cell site can construct its precoder and decoder locally without further information exchange among different cell sites and users. However, the practicality of the proposed approach may be reduced if some practical challenges faced in a large scale network (such as many antennas, a lot of users, channel estimation error, and block diagonalization error) cannot be properly addressed. Fortunately, the new developments on cloud-radio access network (C-RAN), massive MIMO, spatial user scheduling, etc., have addressed a lot of these practical issues and have helped to demonstrate the importance and timeliness of this work. These practical challenges and their possible solutions will be briefly discussed in the Conclusions section.

The notation of this paper is as follows. All boldface letters indicate vectors (lower case) or matrices (upper case). A^′, $\bar{A}$ , A^*, tr(A), E(A), and rank(A) stand for the transpose, conjugate, conjugate transpose, trace, expectation, and rank of A, respectively. λ_max(A) denotes the largest eigenvalue of A. [a]_i denotes the i th element of a. diag […] denotes the diagonal matrix with elements […] on the main diagonal. I_r signifies a r × r identity matrix. 0 signifies a zero matrix with proper dimension. A > 0 denotes that A is a positive definite matrix. (a)⁺ ≜ max (0,a). CN (μ,σ²) denotes a complex normal random variable with mean μ and variance σ².

2. Proposed frameworks

2.1. System model

The system considered has K cell site-user pairs; the k^th cell site and user only want to send data to each other (k = 1,…, K). The k^th cell site and k^th user have b_k and u_k antennas, respectively. It is assumed that b_k > u_l, ∀k, ∀l. Since the cell sites have more antenna elements than the users, the cell sites will carry out the interference mitigation task for both downlink and uplink scenarios in the proposed frameworks.

In the downlink scenario (see Figure 1), the received signal vector at the k^th user is given by

y_{k} = H_{kk} F_{k} s_{k} + \sum_{l = 1, l \neq k}^{K} H_{kl} F_{l} s_{l} + n_{k} .

(1)

There are m_k data streams for the k^th pair where m_k ≤ u_k. The source (data) to be transmitted from the k^th cell site to the k^th user, s_k, is m_k × 1 and is characterized by its positive definite source covariance matrix, $Φ_{s k} = E (s_{k} s_{k}^{*}) = σ_{s k}^{2} I_{m_{k}}$ ; s_k is precoded by F_k, the b_k × m_k precoder, and then transmitted. The channel from the k^th cell site to the l th user is u_l × b_k and is denoted by H_lk. The noise vector, n_k, is u_k × 1 and is characterized by its noise covariance matrix, $Φ_{n} k_{} = E (n_{k} n_{k}^{*}) = β_{k} I_{u_{k}}$ . The sources and noises of different nodes are all independent of each other and zero-mean. Once the k^th user receives y_k, it applies its m_k × u_k decoder, G_k, to process the received vector.

The notation and definitions for the uplink scenario (see Figure 2) are analogous to those for the downlink. An underline ‘_’ is added below the downlink variables to obtain the corresponding uplink ones. For convenience, we will also denote the antenna numbers at the cell site and the user as b_k and u_k, respectively. Thus, the receive signal vector is

{\underline{y}}_{k} = {\underline{H}}_{kk} {\underline{F}}_{k} {\underline{s}}_{k} + \sum_{l = 1, l \neq k}^{K} {\underline{H}}_{kl} {\underline{F}}_{l} {\underline{s}}_{l} + {\underline{n}}_{k},

(2)

where s_l, ${\underline{Φ}}_{sl} = {\underline{σ}}_{sl}^{2} I_{{\underline{m}}_{l}}$ , m_l ( m_l ≤ u_l < b_l) and F_l are the source vector, source covariance matrix, number of data stream, and precoder of the l th user, respectively; n_k, $Φ_{n}_{k} = {\underline{β}}_{k} I_{{\underline{b}}_{k}}$ , and G_k are the noise vector, noise covariance matrix, and decoder of the k^th cell site; and H_kl is the uplink channel matrix from the l th user to the k^th cell site. Note that the downlink and uplink channels are reciprocal, i.e.,

{\underline{H}}_{kl} = H_{lk}^{'}, \forall k, \forall l .

2.2. Proposed framework for the downlink scenario

The proposed framework has five phases. In the first phase, the users perform channel soundings so that each cell site can estimate its reverse channels, i.e., so that the k^th cell site can estimate H_kl, ∀l. Due to reciprocity, the k^th cell site can thus have an estimate of $H_{lk} = {\underline{H}}_{kl}^{'}, \forall l$ . In the second phase, each of the cell sites uses its channel estimates to design its own precoder. The k^th cell site designs its precoder F_k = F_k,LF_k,R by first designing its b_k × d_k left precoder F_k,L for inter-pair interference mitigation and then its d_k × m_k right precoder F_k,R for performance enhancement. The parameter d_k denotes the maximum number of data streams allowed for the k^th pair where both intra-data-stream and inter-pair interference can be mitigated (see feasibility condition in (23a)).

To avoid the intra-data-stream interference for the k^th pair, we need b_k ≥ d_k ≥ m_k. The condition required for mitigating the inter-pair interference is a bit more complicated. Here, we choose for the k^th cell site the d_k columns of F_k,L to be an orthonormal basis for the null space of

\begin{array}{l} \begin{array}{c} A_{k} = & {[\begin{array}{c} H_{1 k}^{'} & \dots & H_{k_{-} k}^{'} & | & H_{k_{+} k}^{'} & \dots & H_{Kk}^{'} \end{array}]}^{'}, & if 1 < k < K \end{array} \\ A_{k} = {[\begin{array}{c} H_{k_{+} k}^{'} & \dots & H_{Kk}^{'} \end{array}]}^{'}, if k = 1 \\ A_{k} = {[\begin{array}{c} H_{1 k}^{'} & \dots & H_{k_{-} k}^{'} \end{array}]}^{'}, if k = K \end{array}

(3)

where k₋ = k − 1 and k₊ = k + 1. It can be easily done using the SVD. Note that $d_{k} = b_{k} - \sum_{i \neq k} u_{k} \geq m_{k}$ if A_k has full rank. This restriction to the antenna setup due to the need for d_k to be greater than or equal to m_k is discussed in more detail in the section on feasibility conditions, in Section 4.2.

Since each cell site picks its left precoder in this way, the nulling constraint,

H_{lk} F_{k} = 0, \forall l \neq k,

(4)

is satisfied for any F_k,R because H_lkF_k = H_lkF_k,LF_k,R = (H_lkF_k,L)F_k,R = 0F_k,R = 0, ∀ l ≠ k. There will be no inter-pair interference at the users (note that the receiver processing is not taken into account when calculating the null space because we want to minimize the signaling load).

From the perspective of F_k,R, the entire system is simply a single-user MIMO system where F_k,R and H_kkF_k,L are the equivalent precoder and channel matrix, respectively. Using the nulling constraint in (4), (1) reduces to

y_{k} = H_{kk} F_{k} s_{k} + n_{k} = (H_{kk} F_{k, L}) F_{k, R} s_{k} + n_{k} .

(5)

Using H_kkF_k,L, the k^th cell site will thus design F_k,R to optimize its own link subject to some power constraint on F_k. In addition to H_kkF_k,L, the noise covariance matrix $Φ_{n k} = E (n_{k} n_{k}^{*})$ of the k^th user may also be employed in the design of F_k if its estimate is available at the k^th cell site (see Section 3).

In the third phase, each of the cell sites performs an equivalent channel sounding with either its designed precoder or left precoder. The reason why there are two choices is to allow two different decoder designs. Depending on which case, the k^th user can estimate H_kkF_k or H_kkF_{k, L}. With H_kkF_k, the k^th user can design the MMSE decoder while with H_kkF_{k, L}, it can design the SVD one. Since the designed precoder causes no interference to the other users, this and the final two phases do not need synchronization among the pairs. Furthermore, orthogonal pilots are not needed in this phase. In the fourth phase, each user uses its noise covariance matrix Φ_{n
k} and the estimate of H_kkF_k or H_kkF_k,L from phase 3 to design its decoder G_k (see Section 3). Once finished, the fifth and final phase, the data transmission, can now occur. The five phases are summarized in Table 1.

Table 1 Five phases of the proposed frameworks

Full size table

2.3. Proposed framework for the uplink scenario

The proposed framework for the uplink scenario also has five phases. In the first phase, the users perform channel soundings so that each cell site can estimate its channels, i.e., so that the k^th cell site can estimate H_kl, ∀l. In the second phase, each of the cell sites uses its channel estimates to design its own decoder. In particular, the k^th cell site partitions its decoder G_k = G_k,LG_k,R where the d_k × b_k right decoder G_k,R is employed for inter-pair interference mitigation and the m_k × d_k left decoder G_k,L for the performance enhancement. The parameter d_k denotes the maximum number of data streams allowed for the k^th pair where both intra-data-stream and inter-pair interference can be mitigated (see feasibility condition in (23b)).

To avoid the intra-data-stream interference for the k^th pair, we need b_k ≥ d_k ≥ m_k. For mitigating the inter-pair interference, we choose for the k^th cell site the d_k rows of its right decoder G_k,R to be an orthonormal basis of the left null space for

\begin{array}{l} \begin{array}{c} {\underline{A}}_{k} = & [\begin{array}{c} {\underline{H}}_{k 1} & \dots & {\underline{H}}_{{kk}_{-}} & | & {\underline{H}}_{{kk}_{+}} & \dots & {\underline{H}}_{kK} \end{array}], & if 1 < k < K \end{array} \\ {\underline{A}}_{k} = [\begin{array}{c} {\underline{H}}_{{kk}_{+}} & \dots & {\underline{H}}_{kK} \end{array}], if k = 1 \\ {\underline{A}}_{k} = [\begin{array}{c} {\underline{H}}_{k 1} & \dots & {\underline{H}}_{{kk}_{-}} \end{array}], if k = K . \end{array}

(6)

where k₋ = k − 1 and k₊ = k + 1. This is easily done using the SVD. Note that ${\underline{d}}_{k} = {\underline{b}}_{k} - \sum_{i \neq k} {\underline{u}}_{k} \geq {\underline{m}}_{k}$ if ${\underline{A}}_{k}$ has full rank. This restriction to the antenna setup due to the need for d_k to be greater than or equal to m_k is discussed in more detail in the section on feasibility conditions, Section 4.2.

Since each cell site picks its right decoder in this way, the nulling condition

{\underline{G}}_{k} {\underline{H}}_{kl} = 0, \forall l \neq k,

(7)

is satisfied for any ${\underline{G}}_{k, L} ({\underline{G}}_{k} {\underline{H}}_{kl} = {\underline{G}}_{k, L} {\underline{G}}_{k, R} {\underline{H}}_{kl} = {\underline{G}}_{k, L} ({\underline{G}}_{k, R} {\underline{H}}_{kl}) = {\underline{G}}_{k, L} 0 = 0, \forall l \neq k) .$ There will be no inter-pair interference at the cell sites after the right decoders (note that the transmitter processing is not taken into account when calculating the null space because we want to minimize the signaling load).

After decoding by ${\underline{G}}_{k, R}$ , (2) becomes

({\underline{G}}_{k, R} {\underline{y}}_{k}) = ({\underline{G}}_{k, R} {\underline{H}}_{kk}) {\underline{F}}_{k} {\underline{s}}_{k} + ({\underline{G}}_{k, R} {\underline{n}}_{k}) .

(8)

For the k^th pair, the entire system simply becomes a single-user MIMO system where ${\underline{G}}_{k, L}$ , ${\underline{G}}_{k, R} {\underline{y}}_{k}$ , ${\underline{G}}_{k, R} {\underline{H}}_{kk}$ , and ${\underline{G}}_{k, R} {\underline{n}}_{k}$ are the equivalent decoder, received signal vector, channel matrix, and noise vector, respectively. Given G_{k, R}, its estimate of H_kk, and its estimate of Φ_{n
k}, the k^th cell site thus designs its left decoder G_k,L (see Section 3).

In the third phase, each of the cell sites performs an equivalent channel sounding using the transpose of its designed right decoder so that its user can estimate the equivalent channel, i.e., so that the k^th user can estimate G_k,RH_kk. Note that due to the nature of G_k,R, this equivalent channel sounding causes no interference at users l, ∀l ≠ k, and does not have to be synchronized with that of the other pairs. Furthermore, orthogonal pilots are not needed here. In the fourth phase, each user uses its estimate of the equivalent channel from phase 3 to design its precoder subject to some power constraint (the k^th user uses G_k,RH_kk to design F_k). In addition to G_k,RH_kk, the k^th user may also use an estimate of the equivalent noise covariance $E ({\underline{G}}_{k, R} {\underline{n}}_{k} {\underline{n}}_{k}^{*} {\underline{G}}_{k, R}^{*}) = {\underline{G}}_{k, R} {\underline{Φ}}_{n k} {\underline{G}}_{k, R}^{*} = {\underline{β}}_{k} I_{{\underline{d}}_{k}}$ (if available) in its design of F_k (see Section 3). Once finished with the fourth phase, the fifth and final phase, the data transmission, can now occur. The five phases for the uplink are summarized in Table 1.

3. Example precoder-decoder designs

3.1. Conventional and equivalent single-user MIMO systems

As mentioned in Section 2, the nulling constraint (4) (or (7)) decouples the K-pair system into K-independent equivalent single-user MIMO systems as described by (5) for the downlink (or (8) for the uplink). The remaining tasks are to design the precoder and decoder of each equivalent single-user MIMO system for facilitating efficient data transmission. In this subsection, we will compare a conventional single-user MIMO system (shown in Figure 3) with the equivalent single-user MIMO systems for both downlink and uplink scenarios. Firstly, the corresponding variables for the conventional and equivalent single-user MIMO systems are listed in Table 2. Secondly, differences with respect to power constraints and noise covariance will be discussed.

Table 2 Corresponding variables of the conventional and equivalent single-user MIMO systems

Full size table

Regarding the power constraints, there exists no difference between the conventional and uplink equivalent single-user MIMO systems; but there appears to be some difference between the conventional and downlink equivalent single-user MIMO systems. The average total power (ATP) constraint for the conventional single-user MIMO system is

P = tr \{E (Fs s^{*} F^{*})\} = tr \{F F^{*}\} σ_{s}^{2} .

(9a)

The constraint is named as such because it constrains the average power, tr{E(Fss*F*)}, to P. Likewise, the ATP constraint for the downlink equivalent single-user MIMO system is

P_{k} = tr \{E (F_{k, R} s_{k} s_{k}^{*} F_{k, R}^{*})\} = tr \{F_{k, R} F_{k, R}^{*}\} σ_{s k}^{2} .

(9b)

The downlink equivalent single-user MIMO system, however, is not the original downlink system. Does (9b) really constrain the ATP of the k^th cell site to P_k? Surprisingly, yes.

\begin{array}{l} tr {E (F_{k} s_{k} s_{k}^{*} F_{k}^{*})} & = tr {F_{k} F_{k}^{*}} σ_{s k}^{2} = tr {F_{k, L} F_{k, R} F_{k, R}^{*} F_{k, L}^{*}} σ_{s k}^{2} \\ = tr \{F_{k, R} F_{k, R}^{*} F_{k, L}^{*} F_{k, L}\} σ_{s k}^{2} = tr {F_{k, R} F_{k, R}^{*}} σ_{s k}^{2} \\ = tr {E (F_{k, R} s_{k} s_{k}^{*} F_{k, R}^{*})} = P_{k} \end{array}

(10)

because F_k,L is chosen using (3) and thus guarantees $F_{k, L}^{*} F_{k, L} = I_{d_{k}} .$

Define

\begin{array}{l} L = λ_{max} \{E (Fs s^{*} F^{*})\} = λ_{max} \{F E (s s^{*}) F^{*}\} \\ = λ_{max} \{F F^{*}\} σ_{s}^{2} . \end{array}

(11a)

The instantaneous array power (IAP) constraint for the conventional single-user MIMO system is

\begin{array}{l} max_{i, s} \{{|{[Fs]}_{i}|}^{2}\} \leq max_{s} \{s^{*} F^{*} Fs\} \leq λ_{max} (F^{*} F) \cdot max_{s} (s^{*} s) \\ = \frac{L}{σ_{s}^{2}} max_{s} (s^{*} s) . \end{array}

(11b)

It is named as such because it constrains the instantaneous sum power of the antenna array (and hence the instantaneous peak power of each antenna) of the transmitter. The physical meanings of L can be understood from the following two special cases. Firstly, if the precoder F is a unitary matrix, one obtains from (11a)

L = σ_{s}^{2}

(11c)

which represents the average power of each data stream. Thus, the ATP in (9a) is equivalent to the IAP in (11a) if P = mL when F is unitary. Secondly, if a constant amplitude modulation scheme is used and the system is fully loaded, $s^{*} s = m σ_{s}^{2}$ , one obtains from (11a) and (11b) as

\frac{1}{m} max_{s} \{s^{*} F^{*} Fs\} \leq \frac{1}{m} λ_{max} (F^{*} F) \cdot m σ_{s}^{2} = L

(11d)

which represents the upper bound of the spatial average of the instantaneous antenna sum power.

From (11a), the IAP constraint for the downlink equivalent single-user MIMO system is

L_{k} = λ_{max} \{E (F_{k, R} s_{k} s_{k}^{*} F_{k, R}^{*})\} .

(12)

Again, the downlink equivalent single-user MIMO system, however, is not the original downlink system. Does (12) really constrain the IAP of the k^th cell site to $L_{k} max_{s_{k}} (s_{k}^{*} s_{k}) / σ_{sk}^{2} ?$ The answer is yes.

\begin{array}{l} λ_{max} {E (F_{k} s_{k} s_{k}^{*} F_{k}^{*})} & = λ_{max} {F_{k} F_{k}^{*}} σ_{s k}^{2} \\ = λ_{max} {F_{k, L} F_{k, R} F_{k, R}^{*} F_{k, L}^{*}} σ_{s k}^{2} \\ = λ_{max} \{F_{k, R} F_{k, R}^{*} F_{k, L}^{*} F_{k, L}\} σ_{s k}^{2} \\ = λ_{max} {F_{k, R} F_{k, R}^{*}} σ_{s k}^{2} \\ = λ_{max} {E (F_{k, R} s_{k} s_{k}^{*} F_{k, R}^{*})} = L_{k} \end{array}

(13)

Thus, we conclude that the ATP and IAP constraints can also be employed, without any modification, for the downlink equivalent single-user MIMO system. Summarized in Table 3 are the ATP and IAP constraints for the conventional and equivalent single-user MIMO systems.

Table 3 Covariance matrices and ATP and IAP constraints of conventional and equivalent single-user MIMO systems

Full size table

Regarding the noise covariance, there exists no difference between the conventional and downlink equivalent single-user MIMO systems. But there appears to be some difference between the conventional and uplink equivalent single-user MIMO systems. The noise covariance is E(nn*) = β I for the former and $E ({\underline{G}}_{k, R} {\underline{n}}_{k} {\underline{n}}_{k}^{*} {\underline{G}}_{k, R}^{*})$ for the latter. However,

{\underline{G}}_{k, R} {\underline{Φ}}_{n k} {\underline{G}}_{k, R}^{*} = {\underline{β}}_{k} {\underline{G}}_{k, R} {\underline{G}}_{k, R}^{*} = {\underline{β}}_{k} I_{{\underline{d}}_{k}}

(14)

because the rows of G_k,R are chosen to be an orthonormal basis for the left null space of A_k in (6). Applying (14), we see the noise covariance for both the conventional single-user MIMO system and the uplink equivalent single-user MIMO system is just at scalar times an identity matrix. Summarized in Table 3 are the covariance matrices for the conventional and equivalent single-user MIMO systems.

3.2. Example designs

The main conclusion of the previous section and Tables 2 and 3 is that for the ATP or IAP constraints, the downlink and uplink equivalent single-user MIMO systems can be treated as conventional single-user MIMO systems. Thus, all example designs in this subsection are (a) given using the notation of the conventional single-user MIMO system (F, G, H, etc.) and (b) are applicable to both downlink and uplink equivalent single-user MIMO systems.

Presented here are practical minimum mean square error (PMMSE), practical maximum mutual information (PMMI), practical minimum symbol error rate (PMBER), and practical SVD (PSVD) designs for the precoder and decoder. Each design is subject to either the ATP or the IAP constraint. The word ‘practical’ is used as a reminder that these designs are for the proposed practical decentralized downlink and uplink frameworks which have been discussed in Section 2 and summarized in Table 1. Based on Figure 3 and Tables 2 and 3, the four design approaches are outlined in Table 4.

Table 4 Four designs of precoder F and decoder G subject to either ATP or IAP constraint

Full size table

The first three approaches (PMMSE, PMMI, or PMBER) in Table 4 are formulated as optimization problems. The cost functions of PMMSE and PMMI are mean square error (to be minimized) and mutual information (to be maximized), respectively. The PMBER approach maximizes for each pair a lower bound for the minimum distance between symbol hypotheses and is an approximate alphabet-independent minimum bit error rate (BER) design. For the PMMSE design subject to ATP or IAP constraint (denoted as PMMSE-ATP or PMMSE-IAP, respectively), the solution can be readily obtained by applying Lemma 1 or 3 of [15]. Similarly, Lemmas 2 and 4 of [15] can be applied for the PMMI-ATP and PMMI-IAP problems, respectively, and Lemmas 5 and 6 of [15] for the PMBER-ATP and PMBER-IAP problems, respectively. The closed-form solutions of these three approaches are provided in [15] and summarized in Appendix for the convenience of the readers.

Not only these closed-form solutions are optimum for their respective problems in Table 4 (see proofs in [15]), they also fit perfectly into the proposed frameworks. The only requirement which is not met is that the transmitter (i.e., the cell site for the downlink scenario or the user for the uplink scenario) needs an estimate of the noise covariance of the receiver, which increases the network load. As such, some systems may prefer to use other closed-form solutions instead. One such option is to adopt the PSVD approach (the last design listed in Table 4) where the transmitter does not need to know the noise covariance of the receiver. In PSVD, the precoder and decoder can be derived directly by performing SVD of the channel (as shown in Table 4):

\begin{array}{l} F = α [t_{1} \dots t_{m}], G = W^{*} \\ with α^{2} = P / m σ_{s}^{2} (for ATP); α^{2} = L σ_{s}^{2} (for IAP) \end{array}

(15)

In (15), vectors t₁ … t_m and matrix W are given in Table 4. The PSVD-ATP and PSVD-IAP only differ in the α s they use. If L = P/m, the two α s are the same and thus the PSVD-ATP and PSVD-IAP are the same. Alternatively, instead of using the SVD decoder in (15), the MMSE decoder in (29) can also be employed for the PSVD approach. Note that the decoder is designed at the receiver. Thus, in this case, the transmitter still does not need to estimate the noise covariance, unlike in the other three approaches in the Appendix.

4. Properties of proposed approach

4.1. Optimality

In the following, the optimality of the PMMSE solution for the downlink and uplink frameworks will be proven. The optimality of PMMI and PMBER results can be established following the same procedure and are therefore omitted. Consider the downlink equivalent single-user MIMO system first. The MMSE cost function and the power constraint are (from Tables 2,3,4)

\begin{array}{l} min_{G_{k}, F_{k, R}} tr & \{σ_{s k}^{2} (G_{k} (H_{kk} F_{k, L}) F_{k, R} - I_{m_{k}}) {(G_{k} (H_{kk} F_{k, L}) F_{k, R} - I_{m_{k}})}^{*} \\ + β_{k} G_{k} G_{k}^{*}\} \end{array}

(16)

\begin{array}{l} P_{k} = tr \{F_{k, R} F_{k, R}^{*}\} σ_{s k}^{2} for ATP; or L_{k} \\ = λ_{max} \{F_{k, R} F_{k, R}^{*}\} σ_{s k}^{2} for IAP . \end{array}

(17)

Since F_k = F_k,LF_k,R and $I_{d_{k}} = F_{k, L}^{*} F_{k, L},$ (16) and (17) become

min_{G_{k}, F_{k, R}} tr {\{σ_{s k}^{2} (G_{k} H_{kk} F_{k} - I_{m_{k}}) {(G_{k} H_{kk} F_{k} - I_{m_{k}})}^{*} + β_{k} G_{k} G_{k}^{*}\}}_{F_{k} = F_{k, L} F_{k, R}}

(18)

\begin{array}{l} P_{k} = tr \{F_{k} F_{k}^{*}\} σ_{s k}^{2} for ATP; or L_{k} \\ = λ_{max} \{F_{k} F_{k}^{*}\} σ_{s k}^{2} for IAP . \end{array}

(19)

Note that (18) and (19) define the PMMSE problems for the k^th pair data transmission in the downlink framework. Since F_k,L has been determined previously for inter-pair interference cancellation and is known, the optimal solution {G_k, F_k,R} for the downlink equivalent problem in (16) and (17) (see [15]) can be used to construct the optimal decoder and precoder ${\{G_{k}, F_{k}\}}_{F_{k} = F_{k, L} F_{k, R}}$ of the k^th pair in the downlink framework. Next, consider the uplink equivalent single-user MIMO system. The cost function and the power constraint are

\begin{array}{l} min_{{\underline{G}}_{k, L}, {\underline{F}}_{k}} tr \{σ_{s k}^{2} & ({\underline{G}}_{k, L} ({\underline{G}}_{k, R} {\underline{H}}_{kk}) {\underline{F}}_{k} - I_{{\underline{m}}_{k}}) \\ {({\underline{G}}_{k, L} ({\underline{G}}_{k, R} {\underline{H}}_{kk}) {\underline{F}}_{k} - I_{{\underline{m}}_{k}})}^{*} \\ + {\underline{β}}_{k} {\underline{G}}_{k, L} {\underline{G}}_{k, L}^{*}\} \end{array}

(20)

\begin{array}{l} {\underline{P}}_{k} = tr \{{\underline{F}}_{k} {\underline{F}}_{k}^{*}\} {\underline{σ}}_{s k}^{2} for ATP; or {\underline{L}}_{k} \\ = λ_{max} \{{\underline{F}}_{k} {\underline{F}}_{k}^{*}\} {\underline{σ}}_{s k}^{2} for IAP . \end{array}

(21)

Since ${\underline{G}}_{k, L} {\underline{G}}_{k, R} = {\underline{G}}_{k}$ and ${\underline{G}}_{k, R} {\underline{G}}_{k, R}^{*} = I_{{\underline{d}}_{k}}$ , (20) is the same as

\begin{array}{l} min_{G_{k}, F_{k, R}} tr & \{{\underline{σ}}_{s k}^{2}, ({\underline{G}}_{k} {\underline{H}}_{kk} {\underline{F}}_{k} - I_{{\underline{m}}_{k}}), {({\underline{G}}_{k} {\underline{H}}_{kk} {\underline{F}}_{k} - I_{{\underline{m}}_{k}})}^{*} \\ {+ {\underline{β}}_{k} {\underline{G}}_{k} {\underline{G}}_{k}^{*}\}}_{{\underline{G}}_{k} = {\underline{G}}_{k, L} {\underline{G}}_{k, R}} \end{array}

(22)

Note that (22) and (21) define the PMMSE problems for the k^th pair data transmission in the uplink framework. Since G_k,R has been determined previously for inter-pair interference cancellation and is known, the optimal solution {G_k,L, F_k} for the uplink equivalent problem in (20) and (21) (see [15]) can be used to construct the optimal decoder and precoder {G_k = G_k,LG_k,R, F_k} of the k^th pair in the uplink framework.

4.2. Feasibility conditions

Regardless of which scenario, the k^th pair’s data transmission will only be feasible if its equivalent channel (H_kkF_k,L in downlink and G_k,RH_kk in uplink) has sufficient rank. The goal of this subsection is thus to derive necessary and sufficient conditions for

rank (H_{kk} F_{k} {, L}_{}) \geq m_{k},

(23a)

rank (G_{k} {, R}_{} H_{kk}) \geq m_{k} .

(23b)

Data transmission for the k^th pair is feasible in the downlink framework if and only if it is feasible in the uplink framework. The reason is twofold. First, since A_k = A_k^′, F_k,L^′ is a valid choice for G_k,R, and G_k,R^′ is a valid choice for F_k,L. Second, with m_k = m_k and G_k,R = F_k,L^′, (23a) holds if and only if (23b) holds. As such, the following will only focus on the downlink.

Without loss of generality, let 0 < m_k ≤ u_k and, let A_k and H_kkF_k,L be full rank. By observing that (23a) holds if and only if d_k ≥ m_k and by applying the rank nullity theorem, the necessary and sufficient condition for (23a),

b_{k} - m_{k} \geq \sum_{l = 1, l \neq k}^{K} u_{l},

(24a)

is obtained. Interestingly, the feasibility of data transmission depends solely on the number of antennas and data streams - not on the particular F_k,L chosen or the channel realization (assuming A_k and H_kkF_k,L be full rank). Similarly, the necessary and sufficient condition for (23b) is

{\underline{b}}_{k} - {\underline{m}}_{k} \geq \sum_{l = 1, l \neq k}^{K} {\underline{u}}_{l},

(24b)

4.3. Equivalencies between downlink and uplink frameworks

Let m_k = m_k, $Φ_{s k} = {\underline{Φ}}_{s k} = σ_{s k}^{2} I_{m_{k}}$ , $Φ_{n k} = β_{k} I_{u_{k}}$ , ${\underline{Φ}}_{n k} = β_{k} I_{{\underline{b}}_{k}}$ , and let (24a) and (24b) hold. Also, let both downlink and uplink be under the same ATP or IAP. Then, there are actually equivalencies between the performances of the downlink and uplink frameworks for the k^th pair:

a)
Let ${\underline{G}}_{k, R} = F_{k, L}^{'}$ and one of the optimum closed-form solutions in (29) to (34) is employed. If the k ^th pair uses the same solution for both downlink and uplink, its downlink and uplink mean square error (MSE) per stream, signal to interference and noise (SINR) per stream, and mutual information are the same.
b)
For a given power constraint (ATP or IAP), the lowest achievable sum MSE (derived from PMMSE) and highest achievable mutual information (derived from PMMI) for the k ^th pair are the same for the downlink and uplink frameworks.

Here is a rough sketch of the proof. When the optimum closed-form solutions in (29) to (34) are used, the MSE per stream, SINR per stream, and mutual information are essentially functions of only the eigenvalues of the matrix Ξ in (28). With appropriate variable mappings, this matrix is

Ξ_{k} = β_{k}^{- 1} F_{k, L}^{*} H_{kk}^{*} H_{kk} F_{k, L},

(25a)

\begin{array}{l} {\underline{Ξ}}_{k} = β_{k}^{- 1} {\underline{H}}_{kk}^{*} {\underline{G}}_{k, R}^{*} {\underline{G}}_{k, R} {\underline{H}}_{kk} \\ = β_{k}^{- 1} {\bar{H}}_{kk} {\bar{F}}_{k, L} F_{k, L}^{'} H_{kk}^{'}, \end{array}

(25b)

for the downlink and uplink frameworks, respectively. In (25b), ${\underline{G}}_{k, R} = F_{k, L}^{'}$ and ${\underline{H}}_{kk} = H_{kk}^{'}$ have been employed. As Ξ_k and ${\underline{Ξ}}_{k}$ have the same non-zero eigenvalues, ‘point (a)’ follows because the MSE per stream, SINR per stream, and mutual information are functions of these non-zero eigenvalues. ‘Point (b)’ can be proved from point (a) and the fact that solutions (29) to (34) are optimum in their respective sense (see Table 4 and the proofs in [15]).

4.4. Equivalencies among some optimal solutions

Consider the downlink scenario (the uplink scenario is analogous and is omitted). It has been shown in (A4) that PMMSE-IAP and PMMI-IAP are equivalent (i.e., they have the same precoder and decoder). It has also been shown in that if the power constraint parameters in (9a) and (11a) are related by L = P/m, the PSVD-ATP and PSVD-IAP are equivalent. Interestingly, if the MMSE decoder is employed for PSVD approach and $Φ_{n k} = β_{k} I_{u_{k}}$ , the PMMSE-IAP, PMMI-IAP, and PSVD-IAP are equivalent for any L_k. Thus, if L_k = P_k/m_k, $Φ_{n k} = β_{k} I_{u_{k}}$ and the MMSE decoder is employed for PSVD approach, the PMMSE-IAP, PMMI-IAP, PSVD-IAP, and PSVD-ATP are equivalent. When there is only one data stream per pair, PMMSE-ATP, PMMI-ATP, and PMBER-ATP are exactly the same (since Λ in (28) is just a scalar in this case, (30), (32), and (34) will all yield the same Ω in (29)). When in addition, $Φ_{n k} = β_{k} I_{u_{k}}$ and the MMSE decoder is employed for PSVD approach, PSVD-ATP is also MMSE and max information rate. These equivalencies (summarized in Table 5) can be derived, with some work, using (28) and the closed-form solutions for the PMMSE-IAP and PMMI-IAP problem.

Table 5 Equivalencies among various optimal solutions

Full size table

4.5. Relationship to interference alignment

Firstly, we will show that some of our example designs satisfy the IA conditions. In the downlink scenario (the uplink scenario is analogous and is omitted), a set of precoders {F_k} and decoders {G_k} achieve IA [16] when

rank (G_{k} H_{kk} F_{k}) = m_{k},

(26a)

G_{k} H_{kl} F_{l} = 0, \forall l \neq k, \forall k .

(26b)

Let (24a) hold for every pair. Looking at the downlink framework, one can easily see that all of its implementations satisfy (26b); the left precoders result in A_kF_k = 0, ∀k, or equivalently H_klF_l = 0, ∀l ≠ k, ∀k. In addition, one can easily see that some of its implementations (e.g., PSVD-ATP in Section 3.2) satisfy (26a). As a result, constructive proofs are obtained for the feasibility of IA in the downlink scenario when (26a) holds for every pair. Thus, the PSVD-ATP is an IA-equivalent implementation.

Secondly, we will show that an IA solution satisfies the nulling constraint of our schemes when each pair’s number of data streams is equal to its user’s number of antennas, i.e., m_k = u_k. With m_k = u_k, any {F_k,G_k} which achieves IA must therefore satisfy (a) (26a) ∀k; (b) G_k⁻¹ exists, ∀k; and finally (c) the nulling constraint H_klF_l = 0, ∀l ≠ k, ∀k (equivalently A_kF_k = 0, ∀k).

Finally, the example transceiver designs in Appendix are optimal for their metrics under their power constraints and nulling constraints A_kF_k = 0, ∀k. However, the IA-equivalent implementation (such as PSVD) is not designed under those criteria and conditions and, therefore, may not be optimal. Thus, the performance of an example transceiver design in Appendix will be at least as good as that of the IA-equivalent PSVD design for its given metric and power constraint, e.g., the PMMSE-ATP will obtain a MSE at least a small as the MSE of the IA-equivalent PSVD design (see [5, 16]). On the other hand, the example designs in Appendix may not be able to achieve easily what general IA designs can [17, 18].

5. Numerical results

To demonstrate the performance of the proposed frameworks, this section presents simulation results for typical CBF K- pair systems under the downlink scenario. The results for the uplink scenario are not presented due to the equivalencies (Section 4.3) and the similarities in the results. Three configurations are considered. In configuration A, K = 2, b_k = 4, u_k = 2, ∀k; in configuration B, K = 2, b_k = 8, u_k = 4, ∀k; and in configuration C, K = 4, b_k = 8, u_k = 2, ∀k. In configuration A, two cases are considered. In case A-1, m_k = 1, ∀k (partially loaded), and in case A-2, m_k = 2, ∀k (fully loaded). In configurations B and C, only the fully loaded case is presented. Therefore, m_k = 4, ∀k, in configuration B and m_k = 2, ∀k, in configuration C. As (24a) is satisfied for all cases, data transmission using the proposed framework is always feasible in all of them. No matter for which case, the source covariance matrices are identity matrices. Each data stream consists of uncoded BPSK modulated symbols. The noises are independently identically distributed CN (0,ϵ) random variables and $Φ_{n k} = β_{k} I_{u_{k}} = ϵ I_{u_{k}}$ , ∀k. The channel elements are independently identically distributed CN (0,1) random variables in each case.

For each case, five designs under the ATP condition are considered: the GIA-ATP, PMMSE-ATP, PMMI-ATP, PMBER-ATP, and PSVD-ATP. For comparison, the PMMSE design under the IAP condition (i.e., PMMSE-IAP) is also included. The MMSE decoder is employed for all designs. The GIA-ATP (see [5]) is a centralized MMSE design and is included as a performance benchmark. Note that the designs considered here are but a subset of the possible implementations. One can derive others using results from [19, 20]. The various equivalence relations among different designs under special conditions are discussed in Section 4.4 and summarized in Table 5.

Because PMMSE-IAP’s $F_{k, R} = \sqrt{L_{k}} V_{k}$ , ∀k, its tr{F_kF_k^*} = m_kL_k, ∀k. In addition, its L_k’s are chosen such that

\frac{P}{b_{k}} = max_{i, s_{k}} \{{|{[F_{k} s_{k}]}_{i}|}^{2}\} = max_{i, s_{k}} \{L_{k} {|{[F_{k, L} V_{k} s_{k}]}_{i}|}^{2}\}, \forall k,

(27)

i.e., so that the maximum instantaneous antenna power of each cell site is equal to P, the total average power for a cell site under the ATP constraint divided by its number of antennas. Note that the average power under the IAP constraint will thus be upper bounded by P. For the sake of comparison, perfect CSI is used in the GIA-ATP. In addition, no errors are incurred by the channel soundings and the equivalent channel soundings in the proposed designs.

The sum MSEs, system BERs, and sum capacities for case A-1 versus signal-to-noise ratio (SNR) ≜ 10log₁₀(P/ϵ) are plotted in Figure 4a,b,c, respectively. They are obtained by averaging over 15 channel realizations. First, let us look at the sum MSEs of the six designs in Figure 4a. The GIA-ATP and the PMMSE-IAP provide the best and worst performances, respectively. The other four designs result in exactly the same performance (because there is only one data stream per pair and these four results are equivalent; see Table 5). Furthermore, the sum MSEs of all designs is merging together as the SNR increases. The better performance of the GIA-ATP is expected; it is a centralized MMSE design and its precoders do not necessarily need to null out the interfering channels. The PMMSE-IAP’s performance is behind the others because its average total power per cell site is less than P.

Next, let us look at the system BER results. All of the BERs are very good and the performance order of the designs is the same as with the sum MSEs. For the sum capacity results, the designs are still in the same performance order. In addition, all of the curves have approximately the same slope. Though the GIA-ATP is a MMSE design, it has the highest sum capacity. This can be attributed to it being a centralized design while the others are distributed. Moreover, the PMMI-ATP cannot do any waterfilling between data streams because there is only one data stream.

The sum MSEs, system BERs and sum capacities for case A-2 versus SNR are plotted in Figure 5a,b,c, respectively. Since each pair has more than one data stream, the equivalencies between PMMSE-ATP, PMMI-ATP, PMBER-ATP, and PSVD-ATP no longer hold. Note that the performance of each of the three example designs (PMMSE-ATP, PMMI-ATP, and PMBER-ATP) in Appendix is better than that of PSVD-ATP (the equivalent centralized IA design) under its metric and power constraint. Granted, the PMBER-ATP design is using an approximate minimum BER metric. Consequently, it is observed that the PSVD-ATP does slightly outperform it in low SNRs.

First, let us look at the sum MSEs of the six designs. The performance order is, from best to worst, the GIA-ATP, PMMSE-ATP, PSVD-ATP, PMMI-ATP, PMBER-ATP, and PMMSE-IAP. The sum MSEs of all designs is merging together as the SNR increases. Because m_k = 2 and Φ_{n
k} = ϵ I₂, ∀k, the PSVD-ATP is MMSE subject to IAP in (12) with L_k = P/2, ∀k. There are two interesting remarks: (a) the optimum performances under ATP in (9b) and IAP in (12) are similar when the average total power of the two are the same and (b) the difference in the performances of the PMMSE-IAP and PSVD-ATP is due to the value of L_k used in (12).

For the system BER results, the performance order is, from best to worst, the GIA-ATP, PMBER-ATP and PMMSE-ATP, PSVD-ATP, PMMI-ATP, and PMMSE-IAP. Interestingly, the PMBER-ATP, using an approximate minimum BER design, provides excellent results. In addition, the PMMSE-ATP, though designed for MSE, provides essentially the same BER results as the PMBER-ATP.

For the sum capacity results, the performance order is dependent on the SNR. Among the five decentralized practical designs, the PMMI-ATP has the largest sum capacity (for all SNRs) because it is designed to maximize the mutual information. The PMMSE-IAP has the smallest sum capacity because its transmitted power is less than that used by other designs under the ATP condition. PMBER-ATP has the second smallest sum capacity because it is designed to maximize only the minimum eigenvalue of the matrix shown in Table 4. Remarkably, the GIA-ATP, though it is a centralized design, does not always have the highest sum capacity. Moreover, two other much simpler decentralized designs, PSVD-ATP and PMMSE-ATP, have achieved similar sum capacities as the centralized GIA-ATP. In fact, PSVD-ATP has a slightly larger sum capacity than GIA-ATP and PMMSE-ATP for high SNRs. This is because PSVD-ATP is equivalent to PSVD-IAP (if L_k = P/2, ∀k) and, furthermore, PSVD-IAP is equivalent to PMMI-IAP (since m_k = 2 and Φ_{n
k} = ϵ I₂, ∀k). Thus, the PSVD-ATP is max information rate subject to IAP in (12), i.e., PMMI-IAP, with L_k = P/2, ∀k.

Note that Figure 4a,b,c presents the single data stream results and Figure 5a,b,c presents the two data stream results. Comparing Figure 4a with Figure 5a, the MSEs in Figure 4a are smaller than half of the sum MSEs (i.e., the corresponding average MSEs over the two data streams) in Figure 5a. Comparing Figure 4b with Figure 5b, the BERs in Figure 4b are smaller than the corresponding average BERs over the two data streams in Figure 5b. Comparing Figure 4c with Figure 5c, the capacities in Figure 4c are larger than half of the sum capacities (i.e., the corresponding average capacities over the two data streams) in Figure 5c. All of the above observations are due to the fact that each of the two communication pairs in configuration A is an equivalent 2 by two single-user MIMO system, and the two eigenchannel gains of the equivalent 2 by two single-user MIMO system are usually very different. Thus, one of the two data streams in case A-2 (results presented in Figure 5a,b,c) must go through the eigenchannel with the smaller channel gain. But the single data stream in case A-1 can always use the eigenchannel with the larger channel gain (results presented in Figure 4a,b,c). Thus, the per-stream performances in Figure 4a,b,c are generally better than those in Figure 5a,b,c.

To demonstrate the usefulness of the proposed schemes, we also present the numerical results of the two larger systems. In configuration B, the number of antennas at each user is twice of that in configuration A; in configuration C, the number of users is twice of that in configuration A. Obviously, the number of antennas at the cell site in configuration B or C needs to be twice of that in configuration A as well. The MSEs, system BERs and sum capacities for configuration B are plotted in Figure 6a,b,c, respectively. In addition, the MSEs, system BERs, and sum capacities for configuration C are plotted in Figure 7a,b,c, respectively. Although the systems are larger, the observations made for configuration A can also be made for configurations B and C. Moreover, comparing Figures 6a or 7a with Figure 5a, the MSEs in Figures 6a or 7a are around twice of the MSEs in Figure 5a. Comparing Figures 6b or 7b with Figure 5b, the BERs in Figures 6b or 7b are slightly larger than the BERs in Figure 5b. Comparing Figures 6c or 7c with Figure 5c, the capacities in Figures 6c or 7c are twice of the capacities in Figure 5c. All of the above observations are due to the fact that the system represented by Figure 6a,b,c or by Figure 7a,b,c is twice in size (in terms of the number of the antennas) of the system represented by Figure 5a,b,c.

6. Conclusions

Two frameworks (and various example designs) are proposed for the practical transceiver and signaling design of a K-pair system desiring to employ CBF. Though one is for the downlink scenario and the other for the uplink scenario, they are very similar. Firstly, both of them use the same mechanisms (e.g., channel soundings from the users, equivalent channel soundings from the cell sites, decoupling the system into K single-user MIMO systems) and have the same feasibility conditions. Secondly, there exist equivalencies between their performances. Thirdly, both have implementations which are constructive proofs for IA. For example, one of the example designs, the PSVD, is shown to be equivalent to a centralized IA design. Unlike [21], there is no difficulty dealing with more than one data stream per pair. Fourthly, optimum closed-form solutions are able to be given for both of them. Fifthly, the performances of these optimum closed-form solutions in their corresponding design metrics are at least as good as those of the centralized IA-equivalent PSVD design. For example, the information rates of the max information rate closed-form solutions (PMMI) are at least as high as those of the centralized IA-equivalent PSVD design. Sixthly, the numerical results show that they both have implementations which obtain higher sum capacities than the GIA (a centralized MMSE approach). Clearly, they are both frameworks for practical low-information exchange CBF designs which are able to deliver the long-awaited performance gain.

Over the years, there has been much debate over whether to use TDD or FDD. In the light of CBF and this paper’s proposed designs, it becomes clear that the ability of TDD to support channel soundings in the reverse direction is a great underused advantage. We envision that this ability will be a key for implementing and fully harnessing the benefits of other MIMO techniques as well. As both LTE and wireless interoperability for microwave access (WiMAX) utilize TDD, the newly proposed C-RAN network [22–24] also utilizes TDD. It may not be long before reverse channel sounding enabled MIMO techniques, such as this paper’s proposed designs, are employed.

The usefulness of CBF is not limited to mitigating the inter-site interference between cell sites of a cellular system. It can be used whenever multiple transmissions are using the same frequency at the same time. Due to the practical decentralized nature of this paper’s proposed designs, it seems possible that CBF will be used to mitigate the interference that macrocells and femtocells cause to each other [25]. In addition, it seems possible that it will be used in non-cellular systems as well (e.g., ad hoc networks, mesh networks).

Although only the analysis of a K-pair system is presented in this paper, the five phases of the proposed frameworks in Table 1 can be extended to deal with a multiuser scenario where each cell site needs to talk to multiple users in its cell simultaneously. The closed-form optimal solutions (e.g., the ones in Appendix) for the K-pair system are no longer available for the K-multiuser system. Many multiuser precoder-decoder designs are available for the K-multiuser system. But, they may require additional signaling load. Investigation will be needed to determine how to trade off between performance and signaling load for the K-multiuser system. The number of users which can be served simultaneously is limited by the number of antennas of the cell sites. If many users exist in the same cell, some kind of user scheduling or selection scheme [26, 27] is required. One possibility is frequency multiplexing, a natural solution in orthogonal frequency-division multiplexing (OFDM) systems like LTE.

Currently, the practicality of the proposed approach is somewhat limited by the fact that the antenna setup is restricted and, therefore, only a small number of cell or user antennas can be supported. However, the proposed approach is very promising for a future large-scale network, because the current research trend is massive MIMO [28] where huge cell site antenna arrays are employed.

There is a concern about the effectiveness of zero forcing used for block diagonalization in a cellular system since the users may not be of the same distances from the cell site. The problem can be mitigated to a certain extent by power control where the received powers of all users are controlled to be at the same level at the cell site. Smart scheduling, like frequency multiplexing, can also be employed to group users with similar SNRs together. In addition, it is shown in [28] that the channel matrix for a massive MIMO system tends to be well conditioned and, therefore, the zero-forcing technique may becomes more appealing as the number of antennas increases. There is also a concern about the effects due to the channel estimation error. It is shown in [28] that as the number of antennas increases, the thermal noise can be averaged out so that the system is predominantly limited by interference from other transmitters. In summary, all these practical limiting factors are reduced as the number of antennas increases. We conclude that the more massive MIMO and TDD technologies advance, the more practical and promising our proposed approach will become.

Endnote

^aA part of this work has been presented in IEEE Sarnoff Symposium 2011 (see [12]).

Appendix

Closed-form solutions from Scaglione et al

For the PMMSE design subject to ATP or IAP constraint (denoted as PMMSE-ATP or PMMSE-IAP, respectively), the solution can be readily obtained by applying Lemma 1 or 3 of [15]. Similarly, Lemmas 2 and 4 of [15] can be applied for the PMMI-ATP and PMMI-IAP problems, respectively, and Lemmas 5 and 6 of [15] can be applied for the PMBER-ATP and PMBER-IAP problems, respectively. These closed-form solutions are summarized below for the convenience of the readers.

Define an eigendecomposition of

with the eigenvalues arranged in descending order, Λ = diag(λ₁…λ_m), and V = $[\begin{array}{c} q_{1} & \dots & q_{m} \end{array}]$ . By the Lemmas 1 to 6 of [15], an optimum closed-form solution of F and G to each of the six problems is

\begin{array}{l} F = VΩ, Ω = diag (\begin{array}{c} ω_{1} & ω_{2} & \dots & ω_{m} \end{array}) \\ G = σ_{s}^{2} F^{*} H^{*} {(σ_{s}^{2} HF F^{*} H^{*} + β I_{d})}^{- 1} \end{array}

(29)

where the entries of the diagonal matrix Ω depends on the particular problem. For the PMMSE-ATP problem,

ω_{i} = \sqrt{{(\frac{P + \sum_{j = 1}^{M} {(λ_{j})}^{- 1}}{σ_{s}^{2} \sum_{j = 1}^{M} {(λ_{j})}^{- 1 / 2}} {(λ_{i})}^{- 1 / 2} - \frac{1}{σ_{s}^{2} λ_{i}})}^{+}}, \forall i;

(30)

where M ≤ m is chosen so that ω_i >0 when i ≤ M and ω_i =0 when M < i ≤ m. For the PMMSE-IAP and PMMI-IAP problems,

ω_{i} = \sqrt{\frac{L}{σ_{s}^{2}}}, \forall i .

(31)

Hence, the optimum closed-form solutions provided for the PMMSE-IAP and PMMI-IAP problems are the same. For the PMBER-ATP problem,

ω_{i} = \sqrt{P} / \sqrt{σ_{s}^{2} λ_{i} tr (Λ^{- 1})}, \forall i .

(32)

For the PMBER-IAP problem,

ω_{i} = \sqrt{\frac{L λ_{m}}{σ_{s}^{2} λ_{i}}}, \forall i .

(33)

For the PMMI-ATP problem,

ω_{i} = \sqrt{{(\frac{P + \sum_{j = 1}^{J} {(λ_{j})}^{- 1}}{σ_{s}^{2} J} - \frac{1}{σ_{s}^{2} λ_{i}})}^{+}}, \forall i;

(34)

where J ≤ m is chosen so that ω_i >0 when i ≤ J and ω_i = 0 when J < i ≤ m.

References

3GPP mobile broadband innovation path to 4G: release 9, release 10 and beyond: HSPA+, SAE/LTE and LTE-Advanced. 2010. . Accessed 27 January 2011 http://www.4gamericas.org/documents/3GPP_Rel-9_Beyond%20Feb%202010.pdf
3GPP Releases 9–12. Accessed 27 January 2011 http://www.3gpp.org/ftp/Information/WORK_PLAN/Description_Releases/
Sawahashi M, Kishiyama Y, Morimoto A, Nishikawa D, Tanno M: Coordinated multipoint transmission/reception techniques for LTE-Advanced. IEEE Wirel. Commun. Mag. 2010, 17(3):26-34.
Article Google Scholar
Ziera A: Advanced network topologies and spectrally efficient air interface solutions for LTE-Advanced and beyond. Amsterdam, Netherlands: LTE World Summit 2010; 2010.
Google Scholar
Lu E, Li J, Lu IT: Comparison of coordinated beamforming and non-coordinated multipoint using MMSE transceiver designs. Princeton, NJ: Proceedings of the 33rd IEEE Sarnoff Symposium; 2010.
Google Scholar
Peters SW, Heath RW Jr: Interference alignment via alternating minimization. Taipei: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP); 2009.
Book Google Scholar
Jorswieck E, Larsson E: The MISO interference channel from a game-theoretic perspective: a combination of selfishness and altruism achieves pareto optimality. Las Vegas: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2008.
Google Scholar
Gomadam KS, Cadambe VR, Jafar SA: Approaching the capacity of wireless networks through distributed interference alignment. 2008. Mar. 26, 2008 Accessed 27 January 2011 http://arxiv.org/abs/0803.3816v1
Book Google Scholar
Tölli A, Pennanen H, Petri K: Distributed coordinated multi-cell transmission based on dual decomposition. Honolulu: Proceedings of the IEEE Global Telecom Conference (GLOBECOM); 2009.
Book Google Scholar
Zakhour R, Ho Z, Gesbert D: Distributed beamforming coordination in multicell MIMO channels. Barcelona: Proceedings of the IEEE 69th Vehicular Technology Conference; 2009.
Book Google Scholar
Sung H, Park S-H, Lee K-J, Lee I: Linear precoder designs for K -user interference channels. IEEE Trans Wireless Comm 2010, 9(1):291-301.
Article Google Scholar
Lu E, Lu I-T: Practical decentralized high-performance coordinated beamforming. Princeton: Proceedings of IEEE Sarnoff Symposium; 2011.
Book Google Scholar
Spencer QH, Swindlehurst AL, Haardt M: Zero-forcing methods for downlink spatial multiplexing in multiuser MIMO channels. IEEE Transac Signal Proc 2004, 52(2):461-471. 10.1109/TSP.2003.821107
Article MathSciNet Google Scholar
Park J, Sung Y, Poor HV: On beamformer design for multiuser MIMO interference channels. 2010. , 2010, Accessed 27 January 2011 http://arxiv.org/abs/1011.6121
Google Scholar
Scaglione A, Stoica P, Barbarossa S, Giannakis GB, Sampath H: Optimal designs for space-time linear precoders and decoders. IEEE Trans Signal Proc 2002, 50(5):1051-1064. 10.1109/78.995062
Article Google Scholar
Lu E, Lu I-T, Li J: Interference alignment: a building block of coordinated beamforming transceiver designs. J Comm 2011, 6(5):409-419.
Article Google Scholar
Jafar S: Linear interference alignment. In Interference Alignment: A New Look at Signal Dimensions in a Communication Network. Foundations and Trends in Communications and Information Theory, 7, 1; 2013. , Accessed 10 January 2013 https://sites.google.com/site/interferencealignment/linear-interference-alignment
Google Scholar
Jafar SA: New challenges and solutions. In Interference Alignment: A New Look at Signal Dimensions in a Communication Network. Foundations and Trends in Communications and Information Theory, 7, 1; 2013. , Accessed 10 January 2013 https://sites.google.com/site/interferencealignment/4-new-challenges-and-solutions/4-4-channel
Google Scholar
Palomar DP, Cioffi JM, Lagunas MA: Joint Tx-Rx beamforming design for multicarrier MIMO channels: a unified framework for convex optimization. IEEE Trans Signal Proc 2003, 51(9):2381-2401. 10.1109/TSP.2003.815393
Article Google Scholar
Palomar DP: Unified framework for linear MIMO transceivers with shaping constraints. IEEE Comm Lett 2004, 8(12):697-699. 10.1109/LCOMM.2004.837647
Article Google Scholar
Yetis CM, Gou T, Jafar SA, Kayran AH: On feasibility of interference alignment in MIMO interference networks. IEEE Trans. Signal Proc 2010, 58(9):4771-4782.
Article MathSciNet Google Scholar
C-RAN, C-RAN: The road towards Green Radio Access Network, (C-RAN 2013). 2013. Accessed 10 January 2013 http://labs.chinamobile.com/cran/
Google Scholar
TelecomEngine: How C-RAN architecture can reduce costs for mobile backhaul deployments. (Telecommuncations Online & Horizon House Publications 2013). 2013. , Accessed 10 January 2013 http://www.telecomengine.com/article/how-c-ran-architecture-can-reduce-costs-mobile-backhaul-deployments
Google Scholar
C-RAN. 2013. , Accessed 10 January 2013 http://www.c-ran.com/portal.php
Giese J, Amin MA: Performance upper bounds for coordinated beam selection in LTE-Advanced. Proc 2010 Int ITG Workshop Smart Antenna (WSA) 2010.
Google Scholar
Yoo T, Goldsmith A: On the optimality of multiantenna broadcast scheduling using zero-forcing beamforming. IEEE J Sel Areas Commun 2006, 24: 528-541.
Article Google Scholar
Li J, Li Y, Lu I: UE centric coordinated beamforming in multi-cell MU-MIMO systems. IEEE Sarnoff 2011.
Google Scholar
Rusek F, Persson D, Buon Kiong L, Larsson EG, Marzetta TL, Edfors O, Tufvesson F: Scaling Up MIMO: opportunities and challenges with very large arrays. Signal Proc Mag, IEEE 2013, 30(1):40-60.
Article Google Scholar

Download references

Acknowledgement

We would like to thank Professor Peter Voltz for his gracious help.

Author information

Authors and Affiliations

Department of ECE, Polytechnic Institute of NYU, 6 Metrotech Center, Brooklyn, NY, 11201, USA
Enoch Lu & I-Tai Lu

Authors

Enoch Lu
View author publications
You can also search for this author in PubMed Google Scholar
I-Tai Lu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Enoch Lu.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Authors’ original file for figure 12

Authors’ original file for figure 13

Authors’ original file for figure 14

Authors’ original file for figure 15

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Lu, E., Lu, IT. Practical decentralized high-performance coordinated beamforming for both downlink and uplink in time-division duplex systems. J Wireless Com Network 2013, 251 (2013). https://doi.org/10.1186/1687-1499-2013-251

Download citation

Received: 25 February 2013
Accepted: 09 October 2013
Published: 29 October 2013
DOI: https://doi.org/10.1186/1687-1499-2013-251

Practical decentralized high-performance coordinated beamforming for both downlink and uplink in time-division duplex systems

Abstract

1. Introduction

2. Proposed frameworks

2.1. System model

2.2. Proposed framework for the downlink scenario

2.3. Proposed framework for the uplink scenario

3. Example precoder-decoder designs

3.1. Conventional and equivalent single-user MIMO systems

3.2. Example designs

4. Properties of proposed approach

4.1. Optimality

4.2. Feasibility conditions

4.3. Equivalencies between downlink and uplink frameworks

4.4. Equivalencies among some optimal solutions

4.5. Relationship to interference alignment

5. Numerical results

6. Conclusions

Endnote

Appendix

Closed-form solutions from Scaglione et al

References

Acknowledgement

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords