Open Access

Distributed resource allocation for MISO downlink systems via the alternating direction method of multipliers

  • Satya Krishna Joshi,
  • Marian Codreanu, and
  • Matti Latva-aho
EURASIP Journal on Wireless Communications and Networking 2014, 2014:1

https://doi.org/10.1186/1687-1499-2014-1

Received: 14 April 2013

Accepted: 16 December 2013

Published: 4 January 2014

Abstract

We provide distributed algorithms for the radio resource allocation problem in multicell downlink multi-input single-output systems. Specifically, we consider the problems of (1) minimizing the total transmit power subject to a signal-to-interference-plus-noise ratio (SINR) constraint for each user and (2) SINR balancing subject to a total transmit power constraint. We propose consensus-based distributed algorithms and a fast solution method via the alternating direction method of multipliers (ADMM). First, we derive a distributed algorithm for minimizing the total transmit power. Then, in conjunction with a bracketing method, a distributed algorithm for SINR balancing is derived. Numerical results show that the proposed distributed algorithms achieve the optimal centralized solution.

Keywords

Distributed optimization; Multicell networks; Minimum power beamforming; Signal-to-interference-plus-noise ratio (SINR) balancing; Alternating direction method of multipliers (ADMM); Dual decomposition; Second-order cone program (SOCP)

1 Introduction

We provide distributed algorithms for the problem of resource allocation in multicell downlink systems with linear precoding. The base stations (BSs) are assumed to have multiple antennas, while all the users are equipped with a single antenna. Full channel state information is assumed to be available at both the BSs and the users, and all the users share the same frequency bandwidth. Under this setting, we consider the following two optimization problems: P1, minimization of the total transmission power subject to a minimum signal-to-interference-plus-noise ratio (SINR) constraint for each user, and P2, SINR balancing subject to a total transmit power constraint at the BSs.

Several centralized algorithms for problems P1 and P2 have been proposed in the literature; see, e.g.,[1-6] for problem P1 and[7-10] for problem P2. Unfortunately, centralized methods are not practical for resource allocation due to the high overhead required for collecting all channel state information at a central processing unit. Therefore, to share the workload of the central controller and to avoid excessive backhaul overhead, distributed algorithms are desirable in practice.

Distributed methods for problem P1 for multiple-input single-output (MISO) multicell wireless systems have been proposed in[11-15]. The algorithm in[11] is based on uplink-downlink duality, where the minimum-power downlink beamformer design problem is solved via a dual uplink problem. The algorithm in[11] is a multicell generalization of the algorithm proposed in[16] for the single-cell case. In[12] the dual decomposition method is adopted, and the algorithm in[13] is based on primal decomposition. In both primal and dual decomposition methods[17], the master problem is solved using an iterative method such as the subgradient method[18]. Even though problem P1 is a convex problem and can be easily solved via primal or dual decomposition (see[12, 13]), the convergence of those algorithms is slow and highly sensitive to the subgradient step size[18]. A game-theoretic approach is considered in[14].

Problem P2 is a quasiconvex problem[16]. Thus, centralized methods based on bisection search[19] are commonly used, e.g., in[10, 16]. Combining the bisection search with uplink-downlink SINR duality, a distributed algorithm is proposed in[20]. The algorithm in[20] is a hierarchical iterative algorithm consisting of outer and inner iterations, where the bisection search is carried out in the outer iteration and uplink-downlink SINR duality is exploited in the inner iterations.
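The bisection search used by these centralized methods is easy to sketch. The following is a minimal illustration in Python, assuming only a generic monotone feasibility oracle; the toy oracle below, with a largest achievable target of 2.5, is purely hypothetical and stands in for an SINR feasibility check.

```python
def bisection_max(feasible, lo, hi, tol=1e-6):
    """Return (approximately) the largest t in [lo, hi] with feasible(t)
    True, assuming feasibility is monotone in t: achievable below some
    unknown threshold, unachievable above it."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if feasible(mid):
            lo = mid   # mid is achievable: continue in the upper half
        else:
            hi = mid   # mid is unachievable: continue in the lower half
    return lo

# Hypothetical oracle standing in for "is the common SINR target t
# achievable under the power constraints"; the true maximum here is 2.5.
best = bisection_max(lambda t: t <= 2.5, 0.0, 10.0)
```

Each iteration halves the search interval, so the number of (possibly expensive) feasibility checks grows only logarithmically in the required accuracy.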

The alternating direction method of multipliers (ADMM) is a simple but powerful algorithm that is well suited to distributed convex optimization. ADMM combines the benefits of dual decomposition and augmented Lagrangian methods[21]. Hence, ADMM is numerically more stable than dual decomposition and therefore suitable for many practical optimization problems[21]. Due to its superior stability properties and decomposability, ADMM has been applied to a wide range of applications, such as compressed sensing[22], image restoration[23], and signal processing[15, 24, 25]; see[21] for a recent survey. In many applications, the ADMM method has been observed to converge fast[21, 26, 27], and when the objective function is strongly convex and Lipschitz continuous, ADMM can even guarantee a linear rate of convergence[28, 29].

The main contribution of this paper is to propose consensus-based distributed algorithms for problems P1 and P2, together with a fast solution method via ADMM[21]. ADMM turns the original problem into a series of iterative steps, namely, a local variable update, a global variable update, and a dual variable update[21]. The local and dual variable updates are carried out independently in parallel by all BSs, while the global variable update is carried out via BS coordination. In this paper, we first derive a distributed algorithm for problem P1. Then, we extend the formulation of problem P1 to derive a distributed algorithm for problem P2. In particular, we recast the problem into a more tractable form and combine a bracketing method (e.g., golden section search)[30, 31] with ADMM to derive the distributed algorithm for problem P2.

Recently, for problem P1, an ADMM-based algorithm that accounts for uncertainty in the channel measurements has been proposed in[15]. In our paper, we consider perfect channel state information (CSI) and use the consensus technique to solve the problem. Then, we apply ADMM to derive the distributed algorithm. The consensus technique can be easily decomposed into a set of subproblems suitable for distributed implementation[21, 24]. Hence, the algorithm formulation in this paper is more intuitive than that provided in[15]. It is worth noting that this paper extends our recent work[32] to the SINR balancing problem (i.e., problem P2). In addition, for problem P1, we show that the proposed distributed algorithm converges to the optimal centralized solution. Moreover, for problem P1, we also provide a method to find the ADMM penalty parameter that leads to faster convergence of the algorithm.

Note that problem P2 is a quasiconvex problem. To the best of our knowledge, there is no convergence theory for the ADMM method applied to quasiconvex problems. However, if each step of the ADMM iteration is tractable, the ADMM algorithm can still be used to derive (possibly suboptimal) distributed methods for problem P2[21], Chapter 9. Though these methods are not provably optimal, our numerical results show that the proposed algorithm converged to the optimal solution in all simulated cases.

The remainder of this paper is organized as follows. The considered MISO system model and problem formulation are described in Section 2. The distributed algorithm for sum power minimization (P1) is derived in Section 3. Next, in Section 4, we derive the distributed algorithm for SINR balancing problem (P2). The numerical results are presented in Section 5, and Section 6 concludes our paper.

Notations: All boldface lower case and upper case letters represent vectors and matrices, respectively, and calligraphic letters represent sets. The notation $\mathbb{C}^T$ denotes the set of complex $T$-vectors, $|x|$ denotes the absolute value of the scalar $x$, $\|\mathbf{x}\|_2$ denotes the Euclidean norm of the vector $\mathbf{x}$, $\mathbf{I}$ denotes the identity matrix, and $\mathcal{CN}(\mathbf{m}, \mathbf{C})$ denotes the complex circular symmetric Gaussian vector distribution with mean $\mathbf{m}$ and covariance matrix $\mathbf{C}$. The superscripts $(\cdot)^{\mathrm{H}}$ and $(\cdot)^\star$ are used to denote the Hermitian transpose of a matrix and a solution of an optimization problem, respectively.

2 System model and problem formulation

A multicell MISO downlink system, with $N$ BSs each equipped with $T$ transmit antennas, is considered. The set of all BSs is denoted by $\mathcal{N}$, and we label the BSs with the integer values $n = 1, \ldots, N$. The transmission region of each BS is modeled as a disc with radius $R_{\mathrm{BS}}$ centered at the location of the BS. A single data stream is transmitted to each user. We denote the set of all data streams in the system by $\mathcal{L}$, and we label them with the integer values $l = 1, \ldots, L$. The transmitter node (i.e., the BS) of the $l$th data stream is denoted by $\mathrm{tran}(l)$, and the receiver node of the $l$th data stream is denoted by $\mathrm{rec}(l)$. We have $\mathcal{L} = \bigcup_{n \in \mathcal{N}} \mathcal{L}(n)$, where $\mathcal{L}(n)$ denotes the set of data streams transmitted by the $n$th BS. Note that the intended users of the data streams transmitted by each BS are necessarily located inside the transmission region of the BS (see Figure 1).
Figure 1

Multicell networks. (a) Multicell network 1, where $\mathcal{N} = \{1, 2\}$, $\mathcal{L}(1) = \{1, 2, 3, 4\}$, $\mathcal{L}(2) = \{5, 6, 7, 8\}$, and $\mathcal{L}_{\mathrm{int}} = \{2, 8\}$; (b) Multicell network 2, where $\mathcal{N} = \{1, 2, 3, 4, 5, 6, 7\}$, $\mathcal{L}(1) = \{1, 2, 3\}$, $\mathcal{L}(2) = \{4, 5, 6\}$, $\mathcal{L}(3) = \{7, 8, 9\}$, $\mathcal{L}(4) = \{10, 11, 12\}$, $\mathcal{L}(5) = \{13, 14, 15\}$, $\mathcal{L}(6) = \{16, 17, 18\}$, $\mathcal{L}(7) = \{19, 20, 21\}$, and $\mathcal{L}_{\mathrm{int}} = \{1, 2, 3, 4, 5, 7, 10, 12, 14, 15, 16, 17, 18, 19\}$.

The antenna signal vector transmitted by the $n$th BS is given by
\[
\mathbf{x}_n = \sum_{l \in \mathcal{L}(n)} d_l \mathbf{m}_l,
\]
(1)

where $d_l \in \mathbb{C}$ and $\mathbf{m}_l \in \mathbb{C}^T$ represent the information symbol and the transmit beamformer associated with the $l$th data stream, respectively. We assume that $d_l$ is normalized such that $\mathbb{E}\{|d_l|^2\} = 1$. Moreover, we assume that the data streams are independent, i.e., $\mathbb{E}\{d_l d_j^*\} = 0$ for all $l \neq j$, where $l, j \in \mathcal{L}$.

The signal received at rec(l) can be expressed as
\[
y_l = d_l \mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l
+ \underbrace{\sum_{j \in \mathcal{L}(\mathrm{tran}(l)),\, j \neq l} d_j \mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j}_{\text{intra-cell interference}}
+ \underbrace{\sum_{n \in \mathcal{N} \setminus \{\mathrm{tran}(l)\}} \sum_{j \in \mathcal{L}(n)} d_j \mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j}_{\text{out-of-cell interference}}
+\; n_l,
\]
(2)
where $\mathbf{h}_{jl}^{\mathrm{H}} \in \mathbb{C}^{1 \times T}$ is the channel vector between $\mathrm{tran}(j)$ and $\mathrm{rec}(l)$, and $n_l$ is circular symmetric complex Gaussian noise with variance $\sigma_l^2$. Note that the second term on the right-hand side of (2) represents the intra-cell interference and the third term represents the out-of-cell interference. The received SINR of the $l$th data stream is given by
\[
\Gamma_l = \frac{|\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l|^2}{\sigma_l^2 + \sum_{j \in \mathcal{L}(\mathrm{tran}(l)),\, j \neq l} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2 + \sum_{n \in \mathcal{N} \setminus \{\mathrm{tran}(l)\}} z_{nl}^2},
\]
(3)

where $z_{nl}^2 = \sum_{j \in \mathcal{L}(n)} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2$ represents the power of the out-of-cell interference from the $n$th BS to $\mathrm{rec}(l)$.
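As a concrete illustration of (3), the following sketch computes $\Gamma_l$ for a toy two-BS instance with one stream per BS. The sizes, channel values, and the dictionary-based data layout are our own illustrative choices, not part of the paper's model.

```python
import numpy as np

def sinr(l, H, M, streams_of, tran, sigma2):
    """Received SINR of stream l, following (3).
    H[(j, l)]     : channel vector h_jl between tran(j) and rec(l)
    M[l]          : beamformer m_l
    streams_of[n] : streams transmitted by BS n
    tran[l]       : index of the BS transmitting stream l
    sigma2[l]     : noise variance at rec(l)
    """
    signal = abs(H[(l, l)].conj() @ M[l]) ** 2
    intra = sum(abs(H[(j, l)].conj() @ M[j]) ** 2
                for j in streams_of[tran[l]] if j != l)
    out_of_cell = sum(abs(H[(j, l)].conj() @ M[j]) ** 2
                      for n in streams_of if n != tran[l]
                      for j in streams_of[n])
    return signal / (sigma2[l] + intra + out_of_cell)

# Toy instance: 2 BSs, T = 2 antennas, one stream per BS.
streams_of = {0: [0], 1: [1]}
tran = {0: 0, 1: 1}
sigma2 = {0: 1.0, 1: 1.0}
H = {(0, 0): np.array([1.0, 0.0]), (1, 0): np.array([0.0, 1.0]),
     (1, 1): np.array([1.0, 0.0]), (0, 1): np.array([0.0, 0.0])}
M = {0: np.array([2.0, 0.0]), 1: np.array([0.0, 1.0])}
gamma0 = sinr(0, H, M, streams_of, tran, sigma2)  # 4 / (1 + 1) = 2.0
```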

The out-of-cell interference term in (3) (i.e., $\sum_{n \in \mathcal{N} \setminus \{\mathrm{tran}(l)\}} z_{nl}^2$) prevents resource allocation on an intra-cell basis and demands BS cooperation/coordination. To avoid unnecessary coordination between BSs located far apart, we make the following assumption: the transmission from the $n$th BS interferes with the $l$th data stream (transmitted by BS $b \neq n$) only if the distance between the $n$th BS and $\mathrm{rec}(l)$ is smaller than a threshold $R_{\mathrm{int}}$. The disc with radius $R_{\mathrm{int}}$ centered at the location of any BS is referred to as the interference region of the BS (see Figure 1). Thus, if the $n$th BS is located at a distance larger than $R_{\mathrm{int}}$ from $\mathrm{rec}(l)$, the associated $z_{nl}$ components are set to zero. Based on the assumption above, we can express $\Gamma_l$ as
\[
\Gamma_l = \frac{|\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l|^2}{\sigma_l^2 + \sum_{j \in \mathcal{L}(\mathrm{tran}(l)),\, j \neq l} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2 + \sum_{n \in \mathcal{N}_{\mathrm{int}}(l)} z_{nl}^2},
\]

where $\mathcal{N}_{\mathrm{int}}(l) \subseteq \mathcal{N} \setminus \{\mathrm{tran}(l)\}$ is the set of out-of-cell interfering BSs that are located at a distance less than $R_{\mathrm{int}}$ from $\mathrm{rec}(l)$. For example, in Figure 1a, we have $\mathcal{N}_{\mathrm{int}}(2) = \{2\}$, $\mathcal{N}_{\mathrm{int}}(8) = \{1\}$, and $\mathcal{N}_{\mathrm{int}}(l) = \emptyset$ for all $l \in \{1, 3, 4, 5, 6, 7\}$. Moreover, it is useful to define the set $\mathcal{L}_{\mathrm{int}}$ of all data streams that are subject to out-of-cell interference, i.e., $\mathcal{L}_{\mathrm{int}} = \{l \mid l \in \mathcal{L},\ \mathcal{N}_{\mathrm{int}}(l) \neq \emptyset\}$. For example, in Figure 1a, we have $\mathcal{L}_{\mathrm{int}} = \{2, 8\}$.

The total transmitted power of the multicell downlink system can be expressed as
\[
P = \sum_{n \in \mathcal{N}} \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2.
\]
Assuming that the SINR $\Gamma_l$ is subject to the constraint $\Gamma_l \geq \gamma_l$ for each user $l \in \mathcal{L}$, the problem of minimizing the total transmitted power (i.e., P1) can be expressed as
\[
\mathrm{P1:}\quad
\begin{array}{ll}
\text{minimize} & \sum_{n \in \mathcal{N}} \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 \\[1ex]
\text{subject to} & \dfrac{|\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l|^2}{\sigma_l^2 + \sum_{j \in \mathcal{L}(\mathrm{tran}(l)),\, j \neq l} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2 + \sum_{n \in \mathcal{N}_{\mathrm{int}}(l)} z_{nl}^2} \geq \gamma_l, \quad l \in \mathcal{L} \\[2ex]
& z_{nl}^2 = \sum_{j \in \mathcal{L}(n)} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2, \quad l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l),
\end{array}
\]
(4)

with variables $\{\mathbf{m}_l\}_{l \in \mathcal{L}}$ and $\{z_{nl}\}_{l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$.

Providing fairness among the users under per-BS power constraints (i.e., $\sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 \leq p_n^{\max}$) is another important resource allocation problem. One way of providing fairness among the users is by maximizing the minimum SINR (i.e., P2)[16], which can be formulated as
\[
\mathrm{P2:}\quad
\begin{array}{ll}
\text{maximize} & \min_{l \in \mathcal{L}}\ \dfrac{|\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l|^2}{\sigma_l^2 + \sum_{j \in \mathcal{L}(\mathrm{tran}(l)),\, j \neq l} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2 + \sum_{n \in \mathcal{N}_{\mathrm{int}}(l)} z_{nl}^2} \\[2ex]
\text{subject to} & z_{nl}^2 = \sum_{j \in \mathcal{L}(n)} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2, \quad l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l) \\[1ex]
& \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 \leq p_n^{\max}, \quad n \in \mathcal{N},
\end{array}
\]
(5)
where the variables are $\{\mathbf{m}_l\}_{l \in \mathcal{L}}$ and $\{z_{nl}\}_{l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$. Finally, to improve the readability of the paper, we summarize the sets used in this paper in Table 1.
Table 1 Summary of sets

Set | Description
$\mathcal{N}$ | Set of all BSs
$\mathcal{L}$ | Set of all data streams
$\mathcal{L}(n)$ | Set of data streams transmitted by the $n$th BS
$\mathcal{N}_{\mathrm{int}}(l)$ | Set of out-of-cell BSs interfering with the $l$th data stream
$\mathcal{L}_{\mathrm{int}}$ | Set of all data streams that are subject to out-of-cell interference
$\mathcal{I}_{\mathrm{int}}(n)$ | Set of links for which BS $n$ acts as the out-of-cell interferer
$\mathcal{L}_{\mathrm{int}}(n)$ | Set of links in BS $n$ that are affected by out-of-cell interference

3 Sum power minimization

In this section, we derive a distributed algorithm for problem (4), i.e., P1. First, we equivalently reformulate the original problem (4) as a global consensus problem. Then, we derive the proposed distributed algorithm based on ADMM[21].

3.1 An equivalent reformulation: sum power minimization

We start by reformulating the sum power minimization problem (4) as
\[
\begin{array}{ll}
\text{minimize} & \sum_{n \in \mathcal{N}} \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 \\[1ex]
\text{subject to} & \dfrac{|\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l|^2}{\sigma_l^2 + \sum_{j \in \mathcal{L}(\mathrm{tran}(l)),\, j \neq l} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2 + \sum_{n \in \mathcal{N}_{\mathrm{int}}(l)} z_{nl}^2} \geq \gamma_l, \quad l \in \mathcal{L} \\[2ex]
& z_{nl}^2 \geq \sum_{j \in \mathcal{L}(n)} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2, \quad l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l),
\end{array}
\]
(6)

where the variables are $\{\mathbf{m}_l\}_{l \in \mathcal{L}}$ and $\{z_{nl}\}_{l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$. Problems (4) and (6) are equivalent, since it can easily be shown (e.g., by contradiction) that the second inequality holds with equality at the optimal point.

Recall that $z_{nl}^2$ in problem (6) represents the power of the out-of-cell interference caused by the $n$th BS at $\mathrm{rec}(l)$; hence, the variable $z_{nl}$ couples exactly two BSs (i.e., BS $n$ and BS $\mathrm{tran}(l)$). We use the consensus technique to distribute problem (6) over the BSs. The method consists of introducing, at each BS, local copies of the coupling variables $z_{nl}$ for all $l \in \mathcal{L}_{\mathrm{int}}$, $n \in \mathcal{N}_{\mathrm{int}}(l)$ (see Figure 2).
Figure 2

Illustration of BS coupling and of introducing local copies to decouple the problem. BS 2 and BS 3 are coupled with BS 1 via the coupling variables $z_{2l}$ and $z_{3l}$, respectively. To distribute the problem, a local copy $x_{1,2l}$ of $z_{2l}$ at BS 1 and a local copy $x_{2,2l}$ of $z_{2l}$ at BS 2 are introduced. Similarly, a local copy $x_{1,3l}$ of $z_{3l}$ at BS 1 and a local copy $x_{3,3l}$ of $z_{3l}$ at BS 3 are introduced.

Let us define $x_{k,nl}$ as the local copy of $z_{nl}$ at BS $k$. Thus, for each $z_{nl}$, we make two local copies, i.e., $x_{n,nl}$ at BS $n$ and $x_{\mathrm{tran}(l),nl}$ at BS $\mathrm{tran}(l)$. Then, problem (6) can be written equivalently in global consensus form as
\[
\begin{array}{ll}
\text{minimize} & \sum_{n \in \mathcal{N}} \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 \\[1ex]
\text{subject to} & \dfrac{|\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l|^2}{\sigma_l^2 + \sum_{j \in \mathcal{L}(\mathrm{tran}(l)),\, j \neq l} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2 + \sum_{b \in \mathcal{N}_{\mathrm{int}}(l)} x_{n,bl}^2} \geq \gamma_l, \quad n \in \mathcal{N},\ l \in \mathcal{L}(n) \\[2ex]
& x_{n,nl}^2 \geq \sum_{j \in \mathcal{L}(n)} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2, \quad l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l) \\[1ex]
& x_{k,nl} = z_{nl}, \quad k \in \{n, \mathrm{tran}(l)\},\ l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l),
\end{array}
\]
(7)

with variables $\{\mathbf{m}_l\}_{l \in \mathcal{L}}$, $\{x_{k,nl}\}_{k \in \{n,\mathrm{tran}(l)\},\, l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$, and $\{z_{nl}\}_{l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$. Note that in the SINR constraints of problem (7), we have replaced $z_{bl}$ by the local copy $x_{n,bl}$ and used $\mathcal{L} = \bigcup_{n \in \mathcal{N}} \mathcal{L}(n)$. In the second set of inequality constraints of (7), we have replaced $z_{nl}$ by the local copy $x_{n,nl}$. The last set of equality constraints of (7) are called consistency constraints, and they enforce the local copies $\{x_{k,nl}\}_{k \in \{n, \mathrm{tran}(l)\}}$ to be equal to the corresponding global variable $z_{nl}$.

Problem (7) is not a convex problem. However, by following the approach of[16], Section IV-B, it can be equivalently cast as a convex problem. To do this, let us define the matrix $\mathbf{M}_n = [\mathbf{m}_l]_{l \in \mathcal{L}(n)}$ obtained by concatenating the column vectors $\mathbf{m}_l$. Then, by following the approach of[16], Section IV-B, problem (7) can be equivalently reformulated as the convex problem
\[
\begin{array}{ll}
\text{minimize} & \sum_{n \in \mathcal{N}} \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 \\[1ex]
\text{subject to} &
\begin{bmatrix} \sqrt{1 + \tfrac{1}{\gamma_l}}\, \mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{ll} \\ \tilde{\mathbf{x}}_l \\ \sigma_l \end{bmatrix} \succeq_{\mathrm{SOC}} 0, \quad n \in \mathcal{N},\ l \in \mathcal{L}(n) \\[3ex]
& \begin{bmatrix} x_{n,nl} \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{jl} \end{bmatrix} \succeq_{\mathrm{SOC}} 0, \quad l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l) \\[2ex]
& x_{k,nl} = z_{nl}, \quad k \in \{n, \mathrm{tran}(l)\},\ l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l),
\end{array}
\]
(8)

with variables $\{\mathbf{M}_n\}_{n \in \mathcal{N}}$, $\{x_{k,nl}\}_{k \in \{n,\mathrm{tran}(l)\},\, l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$, and $\{z_{nl}\}_{l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$, where the vector $\tilde{\mathbf{x}}_l = \{x_{n,bl}\}_{b \in \mathcal{N}_{\mathrm{int}}(l)}$, the vector $\mathbf{h}_{jl}$ in the second set of constraints denotes the channel from BS $n$ to link $l$ (i.e., the index $j$ in the second set of constraints denotes an arbitrary link in $\mathcal{L}(n)$), and the notation $\succeq_{\mathrm{SOC}}$ denotes the generalized inequality with respect to the second-order cone[16, 19].
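The equivalence behind this conic reformulation can be checked numerically: after rotating $\mathbf{m}_l$ so that $\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l$ is real and nonnegative (which changes neither the objective nor any constraint), the cone condition $\sqrt{1 + 1/\gamma_l}\, \mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l \geq \|[\mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{ll};\ \tilde{\mathbf{x}}_l;\ \sigma_l]\|_2$ holds exactly when $\Gamma_l \geq \gamma_l$. A numpy sketch on random toy instances (all sizes and parameter values below are our own illustrative choices):

```python
import numpy as np

def soc_matches_sinr(seed, T=4, Ln=3, gamma=1.5, sigma=0.7):
    """On one random instance, check that the second-order cone condition
    of (8) holds exactly when SINR_0 >= gamma. Toy sizes/values."""
    rng = np.random.default_rng(seed)
    M = rng.standard_normal((T, Ln)) + 1j * rng.standard_normal((T, Ln))
    h = rng.standard_normal(T) + 1j * rng.standard_normal(T)
    x_tilde = rng.standard_normal(2)     # out-of-cell amplitudes x_{n,bl}
    # Rotate m_0 so that h^H m_0 is real and nonnegative; the phase
    # affects neither the power objective nor the constraints.
    M[:, 0] *= np.exp(-1j * np.angle(h.conj() @ M[:, 0]))
    signal = abs(h.conj() @ M[:, 0]) ** 2
    interf = sum(abs(h.conj() @ M[:, j]) ** 2 for j in range(1, Ln))
    sinr_ok = signal >= gamma * (sigma**2 + interf + x_tilde @ x_tilde)
    lhs = np.sqrt(1 + 1 / gamma) * (h.conj() @ M[:, 0]).real
    rhs = np.linalg.norm(np.r_[M.conj().T @ h, x_tilde, [sigma]])
    return sinr_ok == (lhs >= rhs)
```

Squaring both sides of the cone inequality and noting that $\|\mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{ll}\|_2^2$ already contains the signal term $|\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l|^2$ recovers the SINR constraint directly.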

In problem (8), the objective function and the first set of inequality constraints are separable in $n \in \mathcal{N}$ (one for each BS). Also, it can easily be shown that the second set of inequality constraints of (8) is separable in $n \in \mathcal{N}$. To do this, let us denote by $\mathcal{I}_{\mathrm{int}}(n)$ the set of links for which BS $n$ acts as an out-of-cell interferer, i.e., $\mathcal{I}_{\mathrm{int}}(n) = \{l \mid l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l)\}$. Then, by noting that the sets $\{(n,l) \mid l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l)\}$ and $\{(n,l) \mid n \in \mathcal{N},\ l \in \mathcal{I}_{\mathrm{int}}(n)\}$ are identical, the second set of inequality constraints of (8) can be written as
\[
\begin{bmatrix} x_{n,nl} \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{jl} \end{bmatrix} \succeq_{\mathrm{SOC}} 0, \quad n \in \mathcal{N},\ l \in \mathcal{I}_{\mathrm{int}}(n),
\]
(9)

which is separable in $n \in \mathcal{N}$. Observe that, without the consistency constraints, problem (8) could now be easily decoupled into $N$ subproblems, one for each BS.

We next express problem (8) more compactly. To do this, we first express the consistency constraints of problem (8) using vector notation that collects the local and global variables associated with BS $n$. By using the equivalence between the sets $\{(n,l) \mid l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l)\}$ and $\{(n,l) \mid n \in \mathcal{N},\ l \in \mathcal{I}_{\mathrm{int}}(n)\}$, let us express the consistency constraints of problem (8) as
\[
\begin{array}{ll}
x_{n,nl} = z_{nl}, & n \in \mathcal{N},\ l \in \mathcal{I}_{\mathrm{int}}(n) \\
x_{\mathrm{tran}(l),nl} = z_{nl}, & l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l).
\end{array}
\]
(10)
In the first set of equalities of (10), $\{x_{n,nl}\}_{l \in \mathcal{I}_{\mathrm{int}}(n)}$ is a set of local variables associated with BS $n$. Similarly, to identify a set of local variables associated with BS $n$ in the second set of equalities of (10), let us define $\mathcal{L}_{\mathrm{int}}(n)$ as the set of links in BS $n$ that are affected by out-of-cell interference, i.e., $\mathcal{L}_{\mathrm{int}}(n) = \{l \mid l \in \mathcal{L}_{\mathrm{int}} \cap \mathcal{L}(n)\}$. Then, by noting that $\mathcal{L}_{\mathrm{int}} = \bigcup_{n \in \mathcal{N}} \mathcal{L}_{\mathrm{int}}(n)$, we can rewrite (10) as
\[
\begin{array}{ll}
x_{n,nl} = z_{nl}, & n \in \mathcal{N},\ l \in \mathcal{I}_{\mathrm{int}}(n) \\
x_{\mathrm{tran}(l),bl} = z_{bl}, & n \in \mathcal{N},\ l \in \mathcal{L}_{\mathrm{int}}(n),\ b \in \mathcal{N}_{\mathrm{int}}(l).
\end{array}
\]
(11)

Clearly, in the second set of equalities of (11), $\{x_{\mathrm{tran}(l),bl}\}_{l \in \mathcal{L}_{\mathrm{int}}(n),\, b \in \mathcal{N}_{\mathrm{int}}(l)}$ is a set of local variables associated with BS $n$.

We now write (11) compactly using vector notation. Let us define the vectors $\mathbf{x}_n$ and $\mathbf{z}_n$ as
\[
\begin{array}{l}
\mathbf{x}_n = \big\{ \{x_{n,nl}\}_{l \in \mathcal{I}_{\mathrm{int}}(n)},\ \{x_{\mathrm{tran}(l),bl}\}_{l \in \mathcal{L}_{\mathrm{int}}(n),\, b \in \mathcal{N}_{\mathrm{int}}(l)} \big\} \\[1ex]
\mathbf{z}_n = \big\{ \{z_{nl}\}_{l \in \mathcal{I}_{\mathrm{int}}(n)},\ \{z_{bl}\}_{l \in \mathcal{L}_{\mathrm{int}}(n),\, b \in \mathcal{N}_{\mathrm{int}}(l)} \big\}.
\end{array}
\]
(12)
Then, (11) can be compactly written as
\[
\mathbf{x}_n = \mathbf{z}_n, \quad n \in \mathcal{N}.
\]
(13)

Note that $\mathbf{x}_n$ is a collection of the local variables associated with BS $n$, and $\mathbf{z}_n$ is a collection of the global variables associated with the components of $\mathbf{x}_n$.

Furthermore, for the sake of brevity, let us define the set
\[
\mathcal{M}_n = \left\{ (\mathbf{M}_n, \mathbf{x}_n) \;\middle|\;
\begin{bmatrix} \sqrt{1+\tfrac{1}{\gamma_l}}\, \mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{ll} \\ \tilde{\mathbf{x}}_l \\ \sigma_l \end{bmatrix} \succeq_{\mathrm{SOC}} 0,\ l \in \mathcal{L}(n);\quad
\begin{bmatrix} x_{n,nl} \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{jl} \end{bmatrix} \succeq_{\mathrm{SOC}} 0,\ l \in \mathcal{I}_{\mathrm{int}}(n)
\right\},
\]
(14)
and the following function
\[
f_n(\mathbf{M}_n, \mathbf{x}_n) =
\begin{cases}
\sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 & (\mathbf{M}_n, \mathbf{x}_n) \in \mathcal{M}_n \\
\infty & \text{otherwise}.
\end{cases}
\]
(15)
Then, by using notations (13), (14), and (15), consensus problem (8) can be written compactly as
\[
\begin{array}{ll}
\text{minimize} & \sum_{n \in \mathcal{N}} f_n(\mathbf{M}_n, \mathbf{x}_n) \\
\text{subject to} & \mathbf{x}_n = \mathbf{z}_n, \quad n \in \mathcal{N},
\end{array}
\]
(16)

where the variables are $\mathbf{M}_n$, $\mathbf{x}_n$, and $\mathbf{z}_n$ for all $n \in \mathcal{N}$.

3.2 Distributed algorithm via ADMM: sum power minimization

In this section, we derive a distributed algorithm for problem (16). The proposed algorithm is based on ADMM[21]. We start by writing the augmented Lagrangian[33] of problem (16) as
\[
L_\rho\big(\{\mathbf{M}_n, \mathbf{x}_n\}_{n\in\mathcal{N}}, \{\mathbf{z}_n\}_{n\in\mathcal{N}}, \{\mathbf{u}_n\}_{n\in\mathcal{N}}\big)
= \sum_{n \in \mathcal{N}} \Big( f_n(\mathbf{M}_n, \mathbf{x}_n) + \mathbf{u}_n^{\mathrm{T}}(\mathbf{x}_n - \mathbf{z}_n) + \frac{\rho}{2} \|\mathbf{x}_n - \mathbf{z}_n\|_2^2 \Big),
\]
(17)

where $\{\mathbf{u}_n\}_{n\in\mathcal{N}}$ are the dual variables associated with the equality constraints of (16), and $\rho > 0$ is a penalty parameter that adds a quadratic penalty to the standard Lagrangian $L_0$ for violation of the equality constraints of problem (16).

Each iteration of the ADMM algorithm consists of the following three steps[21]:
\[
(\mathbf{M}_n^{i+1}, \mathbf{x}_n^{i+1}) = \underset{\mathbf{M}_n,\, \mathbf{x}_n}{\operatorname{argmin}}\; L_\rho\big(\mathbf{M}_n, \mathbf{x}_n, \mathbf{z}_n^i, \mathbf{u}_n^i\big), \quad n \in \mathcal{N}
\]
(18)
\[
\{\mathbf{z}_n^{i+1}\}_{n\in\mathcal{N}} = \underset{\{\mathbf{z}_n\}_{n\in\mathcal{N}}}{\operatorname{argmin}}\; L_\rho\big(\{\mathbf{M}_n^{i+1}, \mathbf{x}_n^{i+1}\}_{n\in\mathcal{N}}, \{\mathbf{z}_n\}_{n\in\mathcal{N}}, \{\mathbf{u}_n^i\}_{n\in\mathcal{N}}\big)
\]
(19)
\[
\mathbf{u}_n^{i+1} = \mathbf{u}_n^i + \rho\big(\mathbf{x}_n^{i+1} - \mathbf{z}_n^{i+1}\big), \quad n \in \mathcal{N},
\]
(20)

where the superscript $i$ is the iteration counter. Steps (18) and (20) are completely decentralized, and hence they can be carried out independently in parallel at each BS. Note that each component of $\mathbf{z}_n$ couples two local variables associated with adjacent BSs (see the consistency constraints of problem (8)). Thus, step (19) requires gathering the updated local variables $(\mathbf{M}_n^{i+1}, \mathbf{x}_n^{i+1})$ and the dual variables $\mathbf{u}_n^i$ from all $N$ BSs. In the sequel, we first explain in detail how to solve the ADMM steps (18) and (19). Then, we summarize the proposed ADMM-based distributed algorithm.
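The structure of steps (18) to (20) can be illustrated on a small global consensus problem. The example below is our own toy instance, unrelated to the beamforming subproblems: each of $N$ agents holds a private quadratic $(x_n - a_n)^2$, and the consensus constraint $x_n = z$ forces the optimum $z^\star = \operatorname{mean}(a)$.

```python
import numpy as np

# Toy global consensus problem:
#   minimize sum_n (x_n - a_n)^2   subject to x_n = z, n = 1..N,
# whose optimal consensus value is z* = mean(a).
a = np.array([1.0, 4.0, 7.0])
N, rho = len(a), 1.0
x, z, u = np.zeros(N), 0.0, np.zeros(N)  # local, global, dual variables

for _ in range(100):
    # (18) local updates, fully parallel across agents:
    # argmin_x (x - a_n)^2 + u_n (x - z) + (rho/2)(x - z)^2
    x = (2.0 * a + rho * z - u) / (2.0 + rho)
    # (19) global update: average of local copies plus scaled duals
    z = np.mean(x + u / rho)
    # (20) dual updates, also fully parallel
    u = u + rho * (x - z)
```

With these closed-form local updates the iterates converge geometrically to the consensus value $z^\star = 4$; in the beamforming problem, the local step is the SOCP (23) instead of a scalar quadratic.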

The local variable update $(\mathbf{M}_n^{i+1}, \mathbf{x}_n^{i+1})$ in (18) is a solution of the following optimization problem
\[
\begin{array}{ll}
\text{minimize} & f_n(\mathbf{M}_n, \mathbf{x}_n) + \mathbf{u}_n^{i\mathrm{T}}(\mathbf{x}_n - \mathbf{z}_n^i) + \frac{\rho}{2}\|\mathbf{x}_n - \mathbf{z}_n^i\|_2^2
\end{array}
\]
(21)
with variables $\mathbf{M}_n$ and $\mathbf{x}_n$. Here, we write $\mathbf{u}_n^{i\mathrm{T}}$ instead of $(\mathbf{u}_n^i)^{\mathrm{T}}$ to lighten the notation. Let $\mathbf{v}_n = (1/\rho)\mathbf{u}_n$ (i.e., $\mathbf{v}_n$ is a scaled dual variable). Then, by using notations (14) and (15), problem (21) can be equivalently expressed as
\[
\begin{array}{ll}
\text{minimize} & \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 + \frac{\rho}{2} \|\mathbf{x}_n - \mathbf{z}_n^i + \mathbf{v}_n^i\|_2^2 \\[1ex]
\text{subject to} & \begin{bmatrix} \sqrt{1+\tfrac{1}{\gamma_l}}\, \mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{ll} \\ \tilde{\mathbf{x}}_l \\ \sigma_l \end{bmatrix} \succeq_{\mathrm{SOC}} 0, \quad l \in \mathcal{L}(n) \\[3ex]
& \begin{bmatrix} x_{n,nl} \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{jl} \end{bmatrix} \succeq_{\mathrm{SOC}} 0, \quad l \in \mathcal{I}_{\mathrm{int}}(n)
\end{array}
\]
(22)

with variables $\mathbf{M}_n = [\mathbf{m}_l]_{l \in \mathcal{L}(n)}$ and $\mathbf{x}_n$, where $\tilde{\mathbf{x}}_l = \{x_{n,bl}\}_{b \in \mathcal{N}_{\mathrm{int}}(l)}$ is a subset of $\mathbf{x}_n$ (see (12)), the vector $\mathbf{h}_{jl}$ in the second set of constraints denotes the channel from BS $n$ to link $l$ (i.e., the index $j$ denotes an arbitrary link in $\mathcal{L}(n)$), and the notation $\succeq_{\mathrm{SOC}}$ denotes the generalized inequality with respect to the second-order cone[16, 19]. Note that in the objective function of (22), we have dropped the constant term $\frac{\rho}{2}\|\mathbf{v}_n^i\|_2^2$, since it does not affect the solution of the problem.

Moreover, by writing problem (22) in epigraph form and then following the approach of[16], Section IV-B, problem (22) can be equivalently reformulated as the second-order cone program (SOCP)
\[
\begin{array}{ll}
\text{minimize} & t \\[1ex]
\text{subject to} & \begin{bmatrix} t \\ \operatorname{vec}(\mathbf{M}_n) \\ \sqrt{\rho/2}\,\big(\mathbf{x}_n - \mathbf{z}_n^i + \mathbf{v}_n^i\big) \end{bmatrix} \succeq_{\mathrm{SOC}} 0 \\[3ex]
& \begin{bmatrix} \sqrt{1+\tfrac{1}{\gamma_l}}\, \mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{ll} \\ \tilde{\mathbf{x}}_l \\ \sigma_l \end{bmatrix} \succeq_{\mathrm{SOC}} 0, \quad l \in \mathcal{L}(n) \\[3ex]
& \begin{bmatrix} x_{n,nl} \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{jl} \end{bmatrix} \succeq_{\mathrm{SOC}} 0, \quad l \in \mathcal{I}_{\mathrm{int}}(n),
\end{array}
\]
(23)

with variables $t$, $\mathbf{M}_n$, and $\mathbf{x}_n$. Let $t^\star$, $\mathbf{M}_n^\star$, and $\mathbf{x}_n^\star$ denote the solution of problem (23); then the updates are $\mathbf{M}_n^{i+1} = \mathbf{M}_n^\star$ and $\mathbf{x}_n^{i+1} = \mathbf{x}_n^\star$.

Now, we turn to the second step of the ADMM algorithm and provide a solution for the global variable update (19). The update $\{\mathbf{z}_n^{i+1}\}_{n\in\mathcal{N}}$ is a solution of the following optimization problem
\[
\begin{array}{ll}
\text{minimize} & \sum_{n\in\mathcal{N}} \mathbf{u}_n^{i\mathrm{T}}\big(\mathbf{x}_n^{i+1} - \mathbf{z}_n\big) + \frac{\rho}{2}\big\|\mathbf{x}_n^{i+1} - \mathbf{z}_n\big\|_2^2,
\end{array}
\]
(24)
with variable $\{\mathbf{z}_n\}_{n\in\mathcal{N}}$. By using the notations in (12), and further noting that the equalities (13) and the equality constraints of problem (8) are equivalent, problem (24) can be expressed in the components of $\mathbf{x}_n$, $\mathbf{z}_n$, and $\mathbf{u}_n$ as
\[
\begin{array}{ll}
\text{minimize} & \displaystyle\sum_{l \in \mathcal{L}_{\mathrm{int}}} \sum_{n \in \mathcal{N}_{\mathrm{int}}(l)} \sum_{k \in \{n, \mathrm{tran}(l)\}} \Big( u_{k,nl}^i \big(x_{k,nl}^{i+1} - z_{nl}\big) + \frac{\rho}{2}\big(x_{k,nl}^{i+1} - z_{nl}\big)^2 \Big),
\end{array}
\]
(25)

with variables $\{z_{nl}\}_{l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$, where $\{u_{k,nl}\}_{k \in \{n,\mathrm{tran}(l)\},\, l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$ are the dual variables associated with the equality constraints of problem (7).

Problem (25) decouples across $z_{nl}$, since the objective function is separable in $z_{nl}$ for all $l \in \mathcal{L}_{\mathrm{int}}$, $n \in \mathcal{N}_{\mathrm{int}}(l)$. Moreover, the objective function of problem (25) is quadratic in $z_{nl}$. Hence, by setting the gradient of (25) with respect to $z_{nl}$ equal to zero, we obtain the solution $z_{nl}^\star$, which can be expressed as
\[
z_{nl}^\star = \frac{x_{n,nl}^{i+1} + x_{\mathrm{tran}(l),nl}^{i+1} + \frac{1}{\rho}\big(u_{n,nl}^i + u_{\mathrm{tran}(l),nl}^i\big)}{2},
\]
(26)
for all $l \in \mathcal{L}_{\mathrm{int}}$, $n \in \mathcal{N}_{\mathrm{int}}(l)$. Therefore, the update is $z_{nl}^{i+1} = z_{nl}^\star$ for all $l \in \mathcal{L}_{\mathrm{int}}$, $n \in \mathcal{N}_{\mathrm{int}}(l)$. Moreover, by substituting $z_{nl}^{i+1}$ into (20), we can show that the sum of the dual variables $u_{n,nl}^i + u_{\mathrm{tran}(l),nl}^i$ is equal to zero. Thus, the update $z_{nl}^{i+1}$ further simplifies to
\[
z_{nl}^{i+1} = \frac{x_{n,nl}^{i+1} + x_{\mathrm{tran}(l),nl}^{i+1}}{2},
\]
(27)

for all $l \in \mathcal{L}_{\mathrm{int}}$, $n \in \mathcal{N}_{\mathrm{int}}(l)$. Hence, the global variable update $z_{nl}^{i+1}$ is simply the average of its local copies $x_{n,nl}^{i+1}$ and $x_{\mathrm{tran}(l),nl}^{i+1}$.
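The simplification from (26) to (27) rests on the zero-sum property of the two dual copies, which can be checked numerically on a single coupling variable (arbitrary toy values):

```python
import numpy as np

rng = np.random.default_rng(1)
rho = 2.0
# Two local copies x_{n,nl} and x_{tran(l),nl} of one coupling variable
# z_nl, with duals whose sum is zero (this property holds automatically
# from the first dual update (20) onward).
x = rng.standard_normal(2)
u = rng.standard_normal(2)
u -= u.mean()                        # enforce the zero-sum property

z = (x.sum() + u.sum() / rho) / 2.0  # general z-update (26)
z_avg = x.mean()                     # simplified update (27)
u_new = u + rho * (x - z)            # dual update (20)
```

Since the duals sum to zero, (26) collapses to the plain average (27), and the dual update (20) preserves the zero-sum property for the next iteration.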

Finally, we summarize the proposed ADMM-based distributed algorithm for sum power minimization problem (8) in Algorithm 1.

Algorithm 1 Proposed ADMM-based distributed algorithm for sum power minimization

The first step initializes the algorithm. Step 2 updates the local variables of each BS by solving problem (23); this step is completely decentralized. In step 3, the neighboring BSs that are coupled by the variable $z_{nl}$, i.e., BS $n$ and BS $\mathrm{tran}(l)$, exchange their local copies $x_{n,nl}^{i+1}$ and $x_{\mathrm{tran}(l),nl}^{i+1}$. In step 4, each BS updates the global variable $\mathbf{z}_n^{i+1}$; note that each component of the global variable update $\mathbf{z}_n^{i+1}$ is simply the average of the local copies (27). In step 5, the dual variables are updated by each BS via (20). Step 6 checks the stopping criterion, and the algorithm stops if the stopping criterion is satisfied; otherwise, the algorithm continues in an iterative manner. A method to find a feasible solution at each iteration of Algorithm 1 is provided in Section 3.3. Note that in deriving Algorithm 1, we have assumed perfect CSI for all relevant channels between BSs and receivers. The impact of imperfect CSI on the derivation of the algorithm can be found in[15].

3.3 Finding feasible solution at each iteration of Algorithm 1

In many practical applications, we have to stop the distributed algorithm after a finite number of iterations, before the algorithm converges. On the other hand, the intermediate solutions provided by Algorithm 1 do not necessarily result in a feasible solution. In particular, the SINR constraints of problem (4) may not hold, since the local copies $x_{n,nl}$ and $x_{\mathrm{tran}(l),nl}$ that correspond to the global variable $z_{nl}$ for all $l \in \mathcal{L}_{\mathrm{int}}$, $n \in \mathcal{N}_{\mathrm{int}}(l)$ may not be equal. Thus, we can get an SINR $\Gamma_l < \gamma_l$ as a solution of step 2 of Algorithm 1 for some $l \in \mathcal{L}$.

At the cost of solving one additional subproblem at each BS in each iteration, we can find a set of feasible beamformers $\mathbf{M}_n$ for all $n \in \mathcal{N}$. For this, in order to make the local copies $x_{n,nl}$ and $x_{\mathrm{tran}(l),nl}$ equal, we fix them to the consensus value $z_{nl}^i$ (i.e., $x_{n,nl} = z_{nl}^i$ and $x_{\mathrm{tran}(l),nl} = z_{nl}^i$) for all $l \in \mathcal{L}_{\mathrm{int}}$, $n \in \mathcal{N}_{\mathrm{int}}(l)$. Then, each BS solves problem (23) in the variables $t$ and $\mathbf{M}_n$, which can be expressed as
\[
\begin{array}{ll}
\text{minimize} & t \\[1ex]
\text{subject to} & \begin{bmatrix} t \\ \operatorname{vec}(\mathbf{M}_n) \end{bmatrix} \succeq_{\mathrm{SOC}} 0 \\[2ex]
& \begin{bmatrix} \sqrt{1+\tfrac{1}{\gamma_l}}\, \mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{ll} \\ \tilde{\mathbf{x}}_l \\ \sigma_l \end{bmatrix} \succeq_{\mathrm{SOC}} 0, \quad l \in \mathcal{L}(n) \\[3ex]
& \begin{bmatrix} x_{n,nl}^i \\ \mathbf{M}_n^{\mathrm{H}} \mathbf{h}_{jl} \end{bmatrix} \succeq_{\mathrm{SOC}} 0, \quad l \in \mathcal{I}_{\mathrm{int}}(n),
\end{array}
\]
(28)

where $\tilde{\mathbf{x}}_l = \{x_{n,bl}\}_{b \in \mathcal{N}_{\mathrm{int}}(l)}$ with the components fixed as above. Note that at iteration $i$, the set of beamformers $\{\mathbf{M}_n\}_{n \in \mathcal{N}}$ is feasible for the original problem (4) if problem (28) is feasible for all BSs.

3.4 Convergence of Algorithm 1 to the global optimum of problem P1

The convergence of Algorithm 1 to the global optimal solution of problem P1 (i.e., problem (4)) can be established by using[34], Proposition 4.2.

First, by applying[34], Proposition 4.2 to problem (16), we can show that the ADMM Algorithm 1 converges to the global optimal solution of problem (16) (note that problem (16) is a compact representation of problem (8)). Next, we note that the phases of the optimization variables $\{\mathbf{m}_l\}_{l \in \mathcal{L}}$ in problems (8) and (4) change neither the objective nor the constraints of either problem. Thus, the optimal solution obtained by Algorithm 1 for problem (8) is also optimal for problem (4) (i.e., problem P1).

4 SINR balancing

In this section, we derive a distributed algorithm for problem (5), i.e., P2. As for the sum power minimization problem, we begin by reformulating problem (5) in global consensus form. Then, we apply ADMM[21] to derive the distributed algorithm.

4.1 An equivalent reformulation: SINR balancing

We start by equivalently reformulating the SINR balancing problem (5) in epigraph form[19] as
\[
\begin{array}{ll}
\text{minimize} & -\gamma \\[1ex]
\text{subject to} & \dfrac{|\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l|^2}{\sigma_l^2 + \sum_{j \in \mathcal{L}(\mathrm{tran}(l)),\, j \neq l} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2 + \sum_{n \in \mathcal{N}_{\mathrm{int}}(l)} z_{nl}^2} \geq \gamma, \quad l \in \mathcal{L} \\[2ex]
& z_{nl}^2 \geq \sum_{j \in \mathcal{L}(n)} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2, \quad l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l) \\[1ex]
& \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 \leq p_n^{\max}, \quad n \in \mathcal{N},
\end{array}
\]
(29)

with variables $\gamma$, $\{\mathbf{m}_l\}_{l \in \mathcal{L}}$, and $\{z_{nl}\}_{l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$.

We now follow a similar approach as in Section 3.1 to express problem (29) in a global consensus form (i.e., we introduce local copies of the coupling variables $\gamma$ and $z_{nl}$ at each BS). Since the SINR variable $\gamma$ couples all BSs via the SINR constraints, we introduce a local copy $\alpha_n$ at each BS such that $\alpha_n = \gamma$ for all $n \in \mathcal{N}$. For the out-of-cell interference variable $z_{nl}$, we introduce local copies $x_{n,nl}$ and $x_{\mathrm{tran}(l),nl}$, respectively, at BS $n$ and BS $\mathrm{tran}(l)$, as in problem (7). Then, problem (29) in global consensus form can be expressed equivalently as
\[
\begin{array}{ll}
\text{minimize} & -\gamma \\[1ex]
\text{subject to} & \dfrac{|\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l|^2}{\sigma_l^2 + \sum_{j \in \mathcal{L}(n),\, j \neq l} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2 + \sum_{b \in \mathcal{N}_{\mathrm{int}}(l)} x_{n,bl}^2} \geq \alpha_n, \quad n \in \mathcal{N},\ l \in \mathcal{L}(n) \\[2ex]
& x_{n,nl}^2 \geq \sum_{j \in \mathcal{L}(n)} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2, \quad n \in \mathcal{N},\ l \in \mathcal{I}_{\mathrm{int}}(n) \\[1ex]
& \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 \leq p_n^{\max}, \quad n \in \mathcal{N} \\[1ex]
& x_{k,nl} = z_{nl}, \quad k \in \{n, \mathrm{tran}(l)\},\ l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l) \\[1ex]
& \alpha_n = \gamma, \quad n \in \mathcal{N},
\end{array}
\]
(30)

with variables $\gamma$, $\{\mathbf{m}_l\}_{l \in \mathcal{L}}$, $\{\alpha_n\}_{n \in \mathcal{N}}$, $\{x_{k,nl}\}_{k \in \{n,\mathrm{tran}(l)\},\, n \in \mathcal{N},\, l \in \mathcal{I}_{\mathrm{int}}(n)}$, and $\{z_{nl}\}_{l \in \mathcal{L}_{\mathrm{int}},\, n \in \mathcal{N}_{\mathrm{int}}(l)}$. Note that in the second set of inequality constraints, we use the equivalence between the sets $\{(n,l) \mid l \in \mathcal{L}_{\mathrm{int}},\ n \in \mathcal{N}_{\mathrm{int}}(l)\}$ and $\{(n,l) \mid n \in \mathcal{N},\ l \in \mathcal{I}_{\mathrm{int}}(n)\}$ (see (10)).

Now, we express problem (30) more compactly. Note that, except for the third set of inequality constraints and the last set of equality constraints of problem (30), the constraint set of problem (30) is identical to that of problem (7). Hence, we can use the variables $\mathbf{M}_n$, $\mathbf{x}_n$, and $\mathbf{z}_n$ to define the set $\mathcal{C}_n$ as given in (31)
\[
\mathcal{C}_n = \left\{ (\mathbf{M}_n, \mathbf{x}_n, \alpha_n) \;\middle|\;
\begin{array}{l}
\dfrac{|\mathbf{h}_{ll}^{\mathrm{H}} \mathbf{m}_l|^2}{\sigma_l^2 + \sum_{j \in \mathcal{L}(n),\, j \neq l} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2 + \sum_{b \in \mathcal{N}_{\mathrm{int}}(l)} x_{n,bl}^2} \geq \alpha_n,\ l \in \mathcal{L}(n) \\[2ex]
x_{n,nl}^2 \geq \sum_{j \in \mathcal{L}(n)} |\mathbf{h}_{jl}^{\mathrm{H}} \mathbf{m}_j|^2,\ l \in \mathcal{I}_{\mathrm{int}}(n) \\[1ex]
\sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 \leq p_n^{\max}
\end{array}
\right\},
\]
(31)
and the following indicator function $I_n(\mathbf{M}_n, \mathbf{x}_n, \alpha_n)$
\[
I_n(\mathbf{M}_n, \mathbf{x}_n, \alpha_n) =
\begin{cases}
0 & (\mathbf{M}_n, \mathbf{x}_n, \alpha_n) \in \mathcal{C}_n \\
\infty & \text{otherwise}.
\end{cases}
\]
(32)
Then, by using notations (31) and (32), consensus problem (30) can be rewritten compactly as
\[
\begin{array}{ll}
\text{minimize} & -\gamma + \sum_{n \in \mathcal{N}} I_n(\mathbf{M}_n, \mathbf{x}_n, \alpha_n) \\[1ex]
\text{subject to} & \mathbf{x}_n = \mathbf{z}_n, \quad n \in \mathcal{N} \\
& \alpha_n = \gamma, \quad n \in \mathcal{N},
\end{array}
\]
(33)
with variables $\gamma$ and $\{\mathbf{M}_n, \mathbf{x}_n, \mathbf{z}_n, \alpha_n\}_{n \in \mathcal{N}}$. Furthermore, by noting that $\gamma = \frac{1}{N}\sum_{n \in \mathcal{N}} \alpha_n$ (from the second set of equality constraints of (33)), problem (33) can be expressed equivalently as
\[
\begin{array}{ll}
\text{minimize} & \sum_{n \in \mathcal{N}} \Big( -\dfrac{\alpha_n}{N} + I_n(\mathbf{M}_n, \mathbf{x}_n, \alpha_n) \Big) \\[2ex]
\text{subject to} & \mathbf{x}_n = \mathbf{z}_n, \quad n \in \mathcal{N} \\
& \alpha_n = \gamma, \quad n \in \mathcal{N},
\end{array}
\]
(34)

with variables $\gamma$ and $\{\mathbf{M}_n, \mathbf{x}_n, \mathbf{z}_n, \alpha_n\}_{n \in \mathcal{N}}$.

4.2 Distributed algorithm via ADMM: SINR balancing

To derive the ADMM algorithm, we first form the augmented Lagrangian[33] of problem (34). Let u n and v n be the dual variables associated with the first and second consensus constraints of problem (34), respectively. Then, the augmented Lagrangian can be written as
\[
L_\rho\big(\{\mathbf{M}_n, \mathbf{x}_n, \alpha_n, \mathbf{u}_n, v_n, \mathbf{z}_n\}_{n \in \mathcal{N}}, \gamma\big) = \sum_{n \in \mathcal{N}} \Big( -\frac{\alpha_n}{N} + I_n(\mathbf{M}_n, \mathbf{x}_n, \alpha_n) + \mathbf{u}_n^T (\mathbf{x}_n - \mathbf{z}_n) + v_n (\alpha_n - \gamma) + \frac{\rho}{2} \|\mathbf{x}_n - \mathbf{z}_n\|_2^2 + \frac{\rho}{2} (\alpha_n - \gamma)^2 \Big),
\]
(35)
where ρ > 0 is the penalty parameter. Each iteration of ADMM consists of the following steps[21]
\[
\big(\mathbf{M}_n^{i+1}, \mathbf{x}_n^{i+1}, \alpha_n^{i+1}\big) = \underset{\mathbf{M}_n, \mathbf{x}_n, \alpha_n}{\operatorname{argmin}}\; L_\rho\big(\mathbf{M}_n, \mathbf{x}_n, \alpha_n, \mathbf{u}_n^i, v_n^i, \mathbf{z}_n^i, \gamma^i\big), \quad n \in \mathcal{N}
\]
(36)
\[
\big(\{\mathbf{z}_n^{i+1}\}_{n \in \mathcal{N}}, \gamma^{i+1}\big) = \underset{\{\mathbf{z}_n\}_{n \in \mathcal{N}},\, \gamma}{\operatorname{argmin}}\; L_\rho\big(\{\mathbf{M}_n^{i+1}, \mathbf{x}_n^{i+1}, \alpha_n^{i+1}, \mathbf{u}_n^i, v_n^i, \mathbf{z}_n\}_{n \in \mathcal{N}}, \gamma\big)
\]
(37)
\[
\mathbf{u}_n^{i+1} = \mathbf{u}_n^i + \rho \big(\mathbf{x}_n^{i+1} - \mathbf{z}_n^{i+1}\big), \quad n \in \mathcal{N}
\]
(38)
\[
v_n^{i+1} = v_n^i + \rho \big(\alpha_n^{i+1} - \gamma^{i+1}\big), \quad n \in \mathcal{N}.
\]
(39)
Note that the first step is completely decentralized. Each BS n N updates the local variables ( M n i + 1 , x n i + 1 , α n i + 1 ) by solving the following optimization problem
\[
\text{minimize} \quad -\frac{\alpha_n}{N} + I_n(\mathbf{M}_n, \mathbf{x}_n, \alpha_n) + \mathbf{u}_n^{iT} (\mathbf{x}_n - \mathbf{z}_n^i) + v_n^i (\alpha_n - \gamma^i) + \frac{\rho}{2} \|\mathbf{x}_n - \mathbf{z}_n^i\|_2^2 + \frac{\rho}{2} (\alpha_n - \gamma^i)^2,
\]
(40)
with variables α_n, M_n, and x_n. Let v_n = (1/ρ)u_n and λ_n = (1/ρ)v_n denote the scaled dual variables. Then, by combining the linear and quadratic terms of the objective function (see endnote m), problem (40) can be written as
\[
\text{minimize} \quad -\frac{\alpha_n}{N} + I_n(\mathbf{M}_n, \mathbf{x}_n, \alpha_n) + \frac{\rho}{2} \|\mathbf{x}_n - \mathbf{z}_n^i + \mathbf{v}_n^i\|_2^2 + \frac{\rho}{2} (\alpha_n - \gamma^i + \lambda_n^i)^2,
\]
(41)

with variables α_n, M_n, and x_n. Note that in the objective function of (41), the constant terms (ρ/2)‖v_n^i‖_2^2 and (ρ/2)(λ_n^i)^2 are dropped, since they do not affect the solution of the optimization problem.
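The consensus structure of the iteration (36) to (39) can be illustrated on a toy problem. The sketch below replaces the per-BS SOCP subproblems with hypothetical scalar quadratic costs f_n(α) = (α − c_n)², so the local update has a closed form; the global update is the averaging step and the dual update is (39) in scaled form. All function and variable names here are illustrative, not from the paper.

```python
# Sketch of the scalar consensus-ADMM pattern of steps (36)-(39),
# with hypothetical quadratic local costs f_n(a) = (a - c_n)^2 standing
# in for the (SOCP-based) per-BS subproblems.

def consensus_admm(c, rho=1.0, iters=200):
    N = len(c)
    alpha = [0.0] * N          # local copies alpha_n
    lam = [0.0] * N            # scaled dual variables lambda_n
    gamma = 0.0                # global consensus variable
    for _ in range(iters):
        # local update: argmin_a (a - c_n)^2 + (rho/2)(a - gamma + lam_n)^2
        alpha = [(2 * c[n] + rho * (gamma - lam[n])) / (2 + rho)
                 for n in range(N)]
        # global update; reduces to mean(alpha) since sum(lam) stays zero
        gamma = sum(a + l for a, l in zip(alpha, lam)) / N
        # dual update, scaled form of (39)
        lam = [l + a - gamma for l, a in zip(lam, alpha)]
    return gamma

print(consensus_admm([1.0, 2.0, 6.0]))  # converges to mean(c), i.e. ~3.0
```

The consensus value converges to the minimizer of the sum of the local costs, here the mean of the c_n, mirroring how γ converges to the common SINR level in Algorithm 3.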

Problem (41) is not a convex problem, because the indicator function I_n(M_n, x_n, α_n) is the indicator of the nonconvex set C_n (see (31)). However, for fixed α_n, C_n is a convex set, and hence, problem (41) can be solved easily. Therefore, to solve problem (41), we first find the optimal α_n and then find M_n and x_n.

For fixed α n , let us denote the optimal value function of problem (41) as
\[
p(\alpha_n) = \inf_{\mathbf{M}_n, \mathbf{x}_n} \Big( -\frac{\alpha_n}{N} + I_n(\mathbf{M}_n, \mathbf{x}_n, \alpha_n) + \frac{\rho}{2} \|\mathbf{x}_n - \mathbf{z}_n^i + \mathbf{v}_n^i\|_2^2 + \frac{\rho}{2} (\alpha_n - \gamma^i + \lambda_n^i)^2 \Big)
\]
(42)
\[
= \inf_{\mathbf{M}_n, \mathbf{x}_n} \Big( I_n(\mathbf{M}_n, \mathbf{x}_n, \alpha_n) + \frac{\rho}{2} \|\mathbf{x}_n - \mathbf{z}_n^i + \mathbf{v}_n^i\|_2^2 \Big) - \frac{\alpha_n}{N} + \frac{\rho}{2} (\alpha_n - \gamma^i + \lambda_n^i)^2,
\]
(43)
where (43) follows by noting that −α_n/N and (ρ/2)(α_n − γ^i + λ_n^i)² are independent of the optimization variables M_n and x_n. Then, the optimal value of problem (41) is given by
\[
p^\star = \inf_{\alpha_n} p(\alpha_n).
\]
(44)
For ease of presentation, let us express the optimal value function p(α n ) in (43) as
\[
p(\alpha_n) = \tilde{p}(\alpha_n) - \frac{\alpha_n}{N} + \frac{\rho}{2} (\alpha_n - \gamma^i + \lambda_n^i)^2,
\]
(45)
where p ~ ( α n ) is the optimal value of the following optimization problem
\[
\begin{array}{ll}
\text{minimize} & \frac{\rho}{2} \|\mathbf{x}_n - \mathbf{z}_n^i + \mathbf{v}_n^i\|_2^2 \\
\text{subject to} & \dfrac{|\mathbf{h}_{ll}^H \mathbf{m}_l|^2}{\sigma_l^2 + \sum_{j \in \mathcal{L}(n), j \neq l} |\mathbf{h}_{jl}^H \mathbf{m}_j|^2 + \sum_{b \in \mathcal{N}_{\text{int}}(l)} x_{n,bl}^2} \ge \alpha_n, \quad l \in \mathcal{L}(n) \\
& x_{n,nl}^2 \ge \sum_{j \in \mathcal{L}(n)} |\mathbf{h}_{jl}^H \mathbf{m}_j|^2, \quad l \in \mathcal{I}_{\text{int}}(n) \\
& \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 \le p_n^{\max},
\end{array}
\]
(46)

with variables x n and { m l } l L ( n ) . Note that to write (46), we have used the notations defined in (31) and (32).

Let the interval [0, α_n^max] denote the range of feasible α_n for problem (46). Note that the optimal value p̃(α_n) is a nondecreasing function of α_n ∈ [0, α_n^max] (see Appendix 1). Based on this observation, in Appendix 1, we provide the condition under which p(α_n) is a unimodal function and propose the bracketing method[30, 31] to solve problem (44). Algorithm 2 summarizes the bracketing method (golden ratio search)[30], Section 8.1, used to find the optimal α_n for problem (44).

Algorithm 2 Bracketing method to find optimal α n for problem (44)
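A minimal sketch of the golden ratio search underlying Algorithm 2, for a generic unimodal function on [0, α_n^max]; the tolerance eps plays the role of ε in the text, and the example cost is a hypothetical stand-in for p(α_n).

```python
import math

# Golden-section (bracketing) search for the minimizer of a unimodal
# function on [lo, hi], as summarized in Algorithm 2.
def golden_search(p, lo, hi, eps=1e-6):
    r = (math.sqrt(5) - 1) / 2          # inverse golden ratio, ~0.618
    a, b = lo, hi
    x1 = b - r * (b - a)
    x2 = a + r * (b - a)
    f1, f2 = p(x1), p(x2)
    while b - a > eps:
        if f1 <= f2:                    # minimum lies in [a, x2]
            b, x2, f2 = x2, x1, f1
            x1 = b - r * (b - a)
            f1 = p(x1)
        else:                           # minimum lies in [x1, b]
            a, x1, f1 = x1, x2, f2
            x2 = a + r * (b - a)
            f2 = p(x2)
    return (a + b) / 2

# hypothetical unimodal cost with minimizer at alpha = 1.25
print(golden_search(lambda a: (a - 1.25) ** 2, 0.0, 2.0))  # ~1.25
```

Each iteration shrinks the bracket by the golden ratio and reuses one of the two interior evaluations, so only one new evaluation of p(α_n) (i.e., one SOCP solve in our setting) is needed per step.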
Next, we find the x_n and M_n = {m_l}_{l ∈ L(n)} associated with the optimal α_n by solving problem (46). By writing problem (46) in epigraph form and then following the approach of[16], Section IV-B, problem (46) can be formulated equivalently as the SOCP
\[
\begin{array}{ll}
\text{minimize} & t \\
\text{subject to} & \begin{bmatrix} t \\ \sqrt{\rho/2}\, (\mathbf{x}_n - \mathbf{z}_n^i + \mathbf{v}_n^i) \end{bmatrix} \succeq_{\text{SOC}} 0 \\
& \begin{bmatrix} \sqrt{1 + \frac{1}{\alpha_n}}\, \mathbf{h}_{ll}^H \mathbf{m}_l \\ \mathbf{M}_n^H \mathbf{h}_{ll} \\ \tilde{\mathbf{x}}_l \\ \sigma_l \end{bmatrix} \succeq_{\text{SOC}} 0, \quad l \in \mathcal{L}(n) \\
& \begin{bmatrix} x_{n,nl} \\ \mathbf{M}_n^H \mathbf{h}_{jl} \end{bmatrix} \succeq_{\text{SOC}} 0, \quad l \in \mathcal{I}_{\text{int}}(n) \\
& \begin{bmatrix} \sqrt{p_n^{\max}} \\ \operatorname{vec}(\mathbf{M}_n) \end{bmatrix} \succeq_{\text{SOC}} 0,
\end{array}
\]
(47)

with variables t, x_n, and M_n, where x̃_l = {x_{n,bl}}_{b ∈ N_int(l)} is a subset of x_n (see (12)), and the vector h_jl in the third set of constraints denotes the channel from BS n to link l (i.e., the index j in the third set of constraints denotes an arbitrary link in L(n)). Note that to write problem (46) in the SOCP form (47), we first took the square root of the objective function of (46). Hence, the optimal value of problem (46) is given by p̃(α_n) = (t⋆)², where t⋆ is the solution of problem (47).

We now turn to the second step of ADMM in (37), where the global variables {z_n^{i+1}}_{n ∈ N} and γ^{i+1} are updated. By dropping the constant terms which do not affect the solution, problem (37) can be written as
\[
\text{minimize} \quad \sum_{n \in \mathcal{N}} \Big( \mathbf{u}_n^{iT} (\mathbf{x}_n^{i+1} - \mathbf{z}_n) + v_n^i (\alpha_n^{i+1} - \gamma) + \frac{\rho}{2} \|\mathbf{x}_n^{i+1} - \mathbf{z}_n\|_2^2 + \frac{\rho}{2} (\alpha_n^{i+1} - \gamma)^2 \Big),
\]
(48)

with variables { z n } n N and γ.

Problem (48) is separable in variables { z n } n N and γ. Note that minimization of problem (48) with respect to { z n } n N yields problem (24), and hence, the solution { z n } n N is given by (27). Here, we provide the solution for γ. Minimization of problem (48) with respect to γ yields the following optimization problem
\[
\text{minimize} \quad \sum_{n \in \mathcal{N}} \Big( v_n^i (\alpha_n^{i+1} - \gamma) + \frac{\rho}{2} (\alpha_n^{i+1} - \gamma)^2 \Big).
\]
(49)
Problem (49) is an unconstrained quadratic optimization problem in γ. Therefore, by setting the gradient of the objective of (49) with respect to γ equal to zero, we obtain
\[
\gamma^\star = \frac{\sum_{n \in \mathcal{N}} \big( v_n^i + \rho\, \alpha_n^{i+1} \big)}{\rho N}.
\]
(50)
Hence, the update is γ^{i+1} = γ⋆. Moreover, by substituting γ^{i+1} into (39), we can show that the sum of the dual variables Σ_{n ∈ N} v_n^i is equal to zero. Thus, the update γ^{i+1} (i.e., (50)) further simplifies to
\[
\gamma^\star = \frac{1}{N} \sum_{n \in \mathcal{N}} \alpha_n^{i+1}.
\]
(51)

We now summarize the proposed ADMM-based distributed algorithm for the SINR balancing problem in Algorithm 3.

Algorithm 3 Proposed ADMM-based distributed algorithm for SINR balancing

The computational steps of Algorithm 3 are similar to those of Algorithm 1. As in Algorithm 1, step 1 initializes the algorithm. Step 2 updates the local variables. In step 3, BSs exchange their local copies to update the global variables. Local copies x_{n,nl}^{i+1} and x_{tran(l),nl}^{i+1} are exchanged between the adjacent BS n and BS tran(l), while the local copy α_n is broadcast to all other BSs. Steps 4 and 5 update the global and dual variables, respectively. Note that steps 2, 4, and 5 are completely decentralized. Step 6 checks the stopping criterion (see endnote n). A method to find a feasible solution at each iteration of Algorithm 3 is provided in the next section.

4.3 Finding feasible solution at each iteration of Algorithm 3

Note that at each step of Algorithm 3, the locally obtained SINRs α_n, n ∈ N, are not necessarily balanced (i.e., the α_n are not necessarily equal). So, we can take the global variable γ^i, which is the average of the α_n over all n ∈ N, as the intermediate solution of Algorithm 3. However, due to the difference between the local copies x_{n,nl}^{i+1} at BS n and x_{tran(l),nl}^{i+1} at BS tran(l), and due to the maximum transmit power constraint of the BSs, the intermediate solution γ^i may not be feasible for all BSs.

Therefore, it is necessary to check the feasibility of γ^i before using it as the intermediate solution at each step of Algorithm 3. The SINR γ^i is feasible for BS n if there exists a feasible solution of problem (47) for α_n = γ^i and the given out-of-cell interference value x_n. Thus, we set α_n = γ^i and x_n = z_n^i for all n ∈ N (i.e., α_n and x_n are set equal to the consensus values). Then each BS checks the feasibility of problem (47) between steps 4 and 5 of Algorithm 3, which is equivalent to the following SOCP feasibility problem
\[
\begin{array}{ll}
\text{find} & \{\mathbf{m}_l\}_{l \in \mathcal{L}(n)} \\
\text{subject to} & \begin{bmatrix} \sqrt{1 + \frac{1}{\alpha_n}}\, \mathbf{h}_{ll}^H \mathbf{m}_l \\ \mathbf{M}_n^H \mathbf{h}_{ll} \\ \tilde{\mathbf{x}}_l \\ \sigma_l \end{bmatrix} \succeq_{\text{SOC}} 0, \quad l \in \mathcal{L}(n) \\
& \begin{bmatrix} x_{n,nl} \\ \mathbf{M}_n^H \mathbf{h}_{jl} \end{bmatrix} \succeq_{\text{SOC}} 0, \quad l \in \mathcal{I}_{\text{int}}(n) \\
& \begin{bmatrix} \sqrt{p_n^{\max}} \\ \operatorname{vec}(\mathbf{M}_n) \end{bmatrix} \succeq_{\text{SOC}} 0,
\end{array}
\]
(52)
with variable M_n = [m_l]_{l ∈ L(n)}, where x̃_l = {x_{n,bl}}_{b ∈ N_int(l)} is a subset of x_n (see (12)), and the vector h_jl in the second set of constraints denotes the channel from BS n to link l (i.e., the index j denotes an arbitrary link in L(n)). Note that γ^i is feasible for the original problem (5) only if problem (52) is feasible for all BSs. Thus, in Algorithm 3, we can update the feasible SINR γ_feas^i as
\[
\gamma_{\text{feas}}^i = \begin{cases} \gamma^i & \text{if problem (52) is feasible for all } n \in \mathcal{N} \\ \gamma_{\text{feas}}^{i-1} & \text{otherwise}, \end{cases}
\]
(53)

where γ feas 0 = 0 .

5 Numerical example

In this section, we numerically evaluate the performance of the proposed Algorithms 1 and 3. In our simulations, the two multicell wireless networks shown in Figure 1 are considered. In the first network (i.e., Figure 1a), there are N = 2 BSs with T = 4 antennas each. In the second network (i.e., Figure 1b), there are N = 7 BSs with T = 6 antennas each, located such that they form a hexagon. In both cases, the distance between the BSs is denoted by DBS. We assume that BSs have circular transmission and interference regions, where the radius of the transmission region of each BS is denoted by RBS, and the radius of the interference region of each BS is denoted by Rint. For simplicity, we assume four users per cell in the first network and three users per cell in the second network. The locations of the users associated with the BSs are arbitrarily chosen as shown in Figure 1.

We assume an exponential path loss model, where the channel matrix between BSs and users is modeled as
\[
\mathbf{h}_{jl} = \left( \frac{d_{jl}}{d_0} \right)^{-\eta/2} \mathbf{c}_{jl},
\]

where d_jl is the distance from the transmitter of data stream j (i.e., BS tran(j)) to the receiver of data stream l (i.e., user rec(l)), d_0 is the far-field reference distance[35], η is the path loss exponent, and c_jl ∈ C^T is arbitrarily drawn from the distribution CN(0, I) (i.e., a frequency-flat fading channel with uncorrelated antennas). Here, we refer to an arbitrarily generated set of fading coefficients C = {c_jl | j, l ∈ L} as a single fading realization.
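A minimal sketch of drawing one channel vector under this model; the distance value and the function name are illustrative only.

```python
import math
import random

def channel(d, d0=1.0, eta=4.0, T=4):
    """Draw one channel vector h_jl = (d/d0)^(-eta/2) * c_jl,
    with c_jl ~ CN(0, I), i.e., i.i.d. unit-variance complex Gaussian."""
    scale = (d / d0) ** (-eta / 2)
    # each complex entry has unit variance: real/imag parts ~ N(0, 1/2)
    return [scale * complex(random.gauss(0, math.sqrt(0.5)),
                            random.gauss(0, math.sqrt(0.5)))
            for _ in range(T)]

h = channel(d=2.0)   # hypothetical distance of 2 reference units
print(len(h))        # T = 4 antenna coefficients
```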

We assume that the maximum power constraint is the same for each BS, i.e., p_n^max = p_0^max for all n ∈ N, and σ_l = σ for all l ∈ L. We define the signal-to-noise ratio (SNR) operating point at a distance r as
\[
\text{SNR}(r) = \left( \frac{r}{d_0} \right)^{-\eta} \frac{p_0^{\max}}{\sigma^2}.
\]
(54)

In our simulations, we set d_0 = 1, η = 4, σ² = 1, p_0^max/σ² = 45 dB, SNR(Rint) = 0 dB, SNR(RBS) = 5 dB, and DBS = 1.5 × RBS.
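Under the definition in (54), the operating point can be computed as follows; the default arguments reflect the simulation settings above, and the helper name is illustrative.

```python
def snr(r, d0=1.0, eta=4.0, p_over_sigma2_dB=45.0):
    """SNR(r) = (r/d0)^(-eta) * p0max/sigma^2, per (54), in linear scale."""
    return (r / d0) ** (-eta) * 10 ** (p_over_sigma2_dB / 10)

# at the reference distance d0, SNR equals p0max/sigma^2 itself
print(snr(1.0))  # 10^4.5, i.e. ~31623
```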

To illustrate the convergence behavior of Algorithm 1, we consider a single fading realization and run the algorithm for both networks shown in Figure 1. For comparison, we consider the dual decomposition-based distributed algorithm (DDA) proposed in[12]. For DDA[12], we use a fixed step size α to solve the master problem (see[12]), which is based on the subgradient method[18].

Figure 3 shows the normalized power accuracy |p^i − p⋆|/p⋆, where p^i is the objective value at the ith iteration and p⋆ is the optimal objective value obtained by using the centralized algorithm[16], Section IV. The SINR target is set to γ_l = 5 dB for all l ∈ L. DDA[12] plots are drawn for the subgradient step sizes α = 10, 50, 100. For Algorithm 1, the penalty parameter is set to ρ = 0.5β, β, 2β, where β depends on the problem parameters (detailed in Appendix 2) and is defined as
\[
\beta = \max_{n \in \mathcal{N}} \sum_{l \in \mathcal{L}(n)} \frac{10^{0.1 \gamma_l}}{\|\mathbf{h}_{ll}\|_2^2}.
\]
(55)
Figure 3

Normalized accuracy versus iteration. Normalized power accuracy versus iteration for SINR γ l  = 5 dB for all l L : (a) Multicell network 1; (b) Multicell network 2.

Results show that the proposed Algorithm 1 converges much faster than DDA[12]. For example, in both multicell networks, Algorithm 1 achieves a normalized power accuracy of 10^-2 in fewer than 10 iterations, whereas DDA[12] requires more than 200 iterations to reach the same accuracy in all simulated cases in both networks. Results also show that Algorithm 1 performs very well for a wide range of values of ρ; hence, it is not very sensitive to the choice of ρ. In contrast, the convergence speed of DDA[12] is quite sensitive to the choice of the subgradient step size α.

To see the average behavior of the proposed Algorithm 1, we next consider the fading case. Here, we run Algorithm 1 for 500 fading realizations with the algorithm parameter ρ = 2β for both networks shown in Figure 1. We first present the feasibility rate of the proposed algorithm and then provide its average performance.

Figure 4 shows the feasibility rate of Algorithm 1 versus iteration for SINR targets γ_l = 5 dB and 15 dB for all l ∈ L. For comparison, we consider DDA[12] with the subgradient step size α = 50. Plots are drawn for the first 50 iterations. Results show that the proposed algorithm achieves a feasible solution for all channel realizations (for multicell network 1a, a feasible solution is found in all simulated cases; for multicell network 1b, the feasibility rate improves with the iterations). For DDA[12], however, the feasibility rate depends on the network size and the SINR target. For example, for a small network and low SINR target (i.e., multicell network 1a and SINR target γ_l = 5 dB), DDA[12] achieves a feasible solution in all simulated cases. But as the SINR target and the network size increase, the feasibility rate of DDA[12] drops significantly. For example, in multicell network 1b with SINR target γ_l = 15 dB, DDA[12] is not able to find a feasible solution for any of the fading realizations.
Figure 4

Feasibility rate versus iteration. Feasibility rate versus iteration for SINR target γ l  = 5 dB and 15 dB for all l L : (a) Multicell network 1; (b) Multicell network 2.

Figure 5 shows the average sum power versus iteration for multicell network 1a. The SINR target γ_l is set to 15 dB for all l ∈ L. For comparison, we consider the centralized algorithm[16], Section IV, and DDA[12]. DDA[12] plots are drawn for the subgradient step sizes α = 10, 50. For a fair comparison of Algorithm 1, DDA[12], and the centralized algorithm[16], Section IV, the plots are drawn for the fading realizations that are feasible for all considered algorithms. Results show that the proposed Algorithm 1 converges much faster than DDA[12] and achieves the centralized solution in fewer than ten iterations.
Figure 5

Multicell network 1: Average sum power versus iteration. Multicell network 1: Average sum power versus iteration for SINR target γ l  = 15 dB for all l L .

Figure 6 shows the average sum power versus SINR target for multicell network 1b. For comparison, we consider the centralized algorithm[16], Section IV. To track the progress of the proposed algorithm fairly over a wide range of SINR target values, each curve is averaged over the fading realizations that are feasible for all the SINR values. Plots are drawn for the average sum power at iteration numbers 10 and 50. Results show that the proposed Algorithm 1 achieves the centralized solution over a wide range of SINR target values.
Figure 6

Multicell network 2: Average sum power versus SINR. Multicell network 2: Average sum power versus SINR for ρ = 2β.

We next evaluate the performance of Algorithm 3 for the SINR balancing problem (P2). We first consider a single fading realization and run the algorithm for both networks shown in Figure 1. As a benchmark, we consider the optimal centralized algorithm proposed in[16], Section V. In the simulation, we set SNR = 5 dB, and for Algorithm 2, we set ε = 0.1 and α_n^max = 2 × 10^{0.1 × SNR} for all n ∈ N. Plots are drawn for ρ = 0.5, 1, 2.

Figure 7 shows the progress of the global variable γ over the iterations. Note that the global variable γ is the average of the SINR values {α_n}_{n ∈ N} obtained independently by all N BSs (see (51)). Results show that for all considered values of ρ, Algorithm 3 obtains an SINR γ that converges to the optimal centralized solution. Since γ is the average of the SINR values obtained independently at all N BSs, the intermediate values of γ may not be feasible for all BSs before the algorithm converges. For example, the value of γ for ρ = 0.5 is clearly infeasible at iteration steps i = 4, 5, 6, 7, 8 in Figure 7a. Therefore, to illustrate the convergence of the feasible γ, we define the following metric
\[
\gamma_{\text{best}}^i = \max_{t = 1, \ldots, i} \gamma_{\text{feas}}^t,
\]
(56)
where γ_best^i is the best feasible SINR value at the ith iteration, and γ_feas^t is the feasible SINR at the tth iteration (53). Figure 8 shows the behavior of γ_best^i over the iterations. Results show that Algorithm 3 obtains feasible values of γ that converge to the centralized solution. For example, with ρ = 0.5, the algorithm converges to the centralized solution in just ten iterations in Figure 8a.
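The bookkeeping in (53) and (56) amounts to a running maximum over the feasible intermediate SINRs. A minimal sketch, with hypothetical per-iteration SINR values and feasibility flags:

```python
def best_feasible_sinr(gammas, feasible):
    """Track gamma_feas per (53) and gamma_best per (56): keep the last
    feasible gamma, and the running maximum of it."""
    g_feas, best = 0.0, []          # gamma_feas^0 = 0
    for g, ok in zip(gammas, feasible):
        if ok:                      # problem (52) feasible for all BSs
            g_feas = g
        best.append(max(best[-1], g_feas) if best else g_feas)
    return best

# hypothetical trace of intermediate gamma^i and its feasibility flag
print(best_feasible_sinr([1.0, 2.5, 2.0, 3.0],
                         [True, False, True, True]))
# -> [1.0, 1.0, 2.0, 3.0]
```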
Figure 7

Evolution of average SINR for different algorithm parameter ρ . Progress of global variable γ for SNR = 5 dB: (a) Multicell network 1; (b) Multicell network 2.

Figure 8

Evolution of an average best SINR, that is feasible for all BSs. Feasible SINR γ best i versus iteration for SNR = 5 dB: (a) Multicell network 1; (b) Multicell network 2.

Figure 9 shows the SINR γ_best^i for different SNR values (see endnote o). Each curve is averaged over 300 fading realizations. In the simulation, the penalty parameter ρ is set to 0.5. Plots are drawn for the SINR obtained at iteration numbers 20, 30, and 50 of Algorithm 3. Results show that the proposed Algorithm 3 achieves close to the centralized solution over a wide range of SNR values without any tuning of ρ.
Figure 9

Feasible SINR versus SNR. Feasible SINR γ best i versus SNR for ρ = 0.5: (a) Multicell network 1; (b) Multicell network 2.

6 Conclusion

We have provided distributed algorithms for radio resource allocation problems in multicell downlink multi-input single-output systems. Specifically, we have considered two optimization problems: P1, minimization of the total transmission power subject to signal-to-interference-plus-noise ratio (SINR) constraints of each user, and P2, SINR balancing subject to a total transmit power constraint at the BSs. We have proposed consensus-based distributed algorithms and a fast solution method via the alternating direction method of multipliers. First, we derived a distributed algorithm for problem P1. Then, in conjunction with the bracketing method, the algorithm was extended to problem P2. Numerical results show that the proposed distributed algorithms converge very fast to the optimal centralized solution.

Endnotes

a Similar assumptions are made, e.g., in[36] in the context of arbitrary wireless networks.

b The value of Rint is chosen such that the power of the interference term is below the noise level, and this commonly used approximation is made to avoid unnecessary coordination between distant BSs. The appropriate value of Rint can be chosen to trade off between the required backhaul signaling and the optimality of the solution. The effect of nonzero z nl terms can be accurately modeled by changing the statistical characteristics of noise n l at rec(l). However, those issues are extraneous to the main focus of the paper.

c In problems (4) and (5), the set { z nl } l ∈ L int , n ∈ N int ( l ) is a collection of the z nl for which the lth user is inside the interference region of BS n. Thus, the constraint for the unconsidered out-of-cell interference terms (i.e., z nl 2 = 0 ) of users outside the interference region of BS n is dropped in problems (4) and (5).

d A more general SINR balancing problem, which can set priorities for the users (keeping the SINR values of the data streams at fixed ratios)[6], Section IV-C, can be formulated. To simplify the presentation, we consider maximization of the minimum SINR. Note that the proposed decentralized method can be easily generalized to the more general problem considered in[6], Section IV-C.

e Note that L int ( n ) L ( n ) . Hence, tran(l) = n for all l L int ( n ) .

f To simplify the presentation, here we have used a slight abuse of notation, i.e., we have considered that the sets in (12) are ordered.

g Let { u k , nl } k { n , tran ( l ) } , l L int , n N int ( l ) be the dual variables associated with the equality constraints of problem (8), then by following steps (10) to (12), one can easily express u n = { { u n , nl } l I int ( n ) , { u n , bl } l L int ( n ) , b N int ( l ) } , n N .

h Variable z nl (a component of z n ) couples the two local variables x n,nl (a component of x n ) and x tran(l),nl (a component of x tran(l)). Hence, coordination between BS n and BS tran(l) is required in step (19) to update z nl .

i For convenience, we can combine the linear and quadratic terms of problem (21) as u n iT ( x n - z n i ) + ρ 2 x n - z n i 2 2 = ρ 2 x n - z n i + v n i 2 2 - ρ 2 v n i 2 2 .

j Note that { u n } n N are the dual variables associate with the consistency constraints of problem (16). By following steps (10) to (12), we can easily show u n = { u n , nl } l I int ( n ) , { u n , bl } l L int ( n ) , b N int ( l ) , n N .

k Note that (20) in the components of u n , x n , and z n can be expressed as u k , nl i + 1 = u k , nl i + ρ ( x k , nl i + 1 - z nl i + 1 ) for all k { n , tran ( l ) } , l L int , n N int ( l ) .

l In the ADMM algorithm, the standard stopping criterion is to check the primal and dual residuals[21]. However, ADMM often produces results of acceptable practical accuracy within a few tens of iterations[21]. Since a finite number of iterations is more favorable for practical implementation, we stop the algorithm after a fixed number of iterations.

m For convenience we can combine the terms in problem (40) as a) u n iT ( x n - z n i ) + ρ 2 x n - z n i 2 2 = ρ 2 x n - z n i + v n i 2 2 - ρ 2 v n i 2 2 and b) v n i ( α n - γ i ) + ρ 2 ( α n - γ i ) 2 = ρ 2 ( α n - γ i + λ n i ) 2 - ρ 2 ( λ n i ) 2 .

n In the ADMM algorithm, the standard stopping criterion is to check the primal and dual residuals[21]. However, ADMM often produces results of acceptable practical accuracy within a few tens of iterations[21]. Since a finite number of iterations is favorable for practical implementation, we stop the algorithm after a fixed number of iterations.

o For fixed radius RBS in Figure 1, different SNRs (i.e., different SNR(RBS)) are obtained by changing p 0 max / σ 2 in (54).

p The interval [ 0 , α n max ] denotes the range of feasible α n for problem (46).

Appendices

Appendix 1

In this appendix, we propose the bracketing method[30, 31] to solve problem (44). Let us start by combining the second (linear) and third (quadratic) terms of (45), completing the square as
\[
p(\alpha_n) = \tilde{p}(\alpha_n) + \frac{\rho}{2} \left( \alpha_n - \left( \gamma^i - \lambda_n^i + \frac{1}{\rho N} \right) \right)^2 - \frac{1}{N} \left( \gamma^i - \lambda_n^i + \frac{1}{2 \rho N} \right).
\]
(57)
Without loss of generality, we drop the constant term of (57) and simplify it as
\[
p(\alpha_n) = \tilde{p}(\alpha_n) + \frac{\rho}{2} (\alpha_n - \theta)^2,
\]
(58)

where \( \theta = \gamma^i - \lambda_n^i + \frac{1}{\rho N} \).
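For completeness, the completion of the square behind (57) can be verified term by term. Writing \( a = \gamma^i - \lambda_n^i \),

\[
-\frac{\alpha_n}{N} + \frac{\rho}{2}\,(\alpha_n - a)^2
= \frac{\rho}{2}\Big(\alpha_n - a - \frac{1}{\rho N}\Big)^2 - \frac{a}{N} - \frac{1}{2 \rho N^2},
\]

which gives \( \theta = a + \frac{1}{\rho N} \) and the constant \( -\frac{1}{N}\big( \gamma^i - \lambda_n^i + \frac{1}{2 \rho N} \big) \) in (57); expanding the right-hand side reproduces the left-hand side exactly.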

Note that the optimal value p̃(α_n) is a nondecreasing function of α_n ∈ [0, α_n^max] (see endnote p). To see this, let P_i and P_j be the feasible sets of problem (46) for α_n = α_n^i and α_n = α_n^j, respectively. If α_n^j ≥ α_n^i, then it is easy to see that P_j ⊆ P_i. Hence, the optimal values satisfy p̃(α_n^j) ≥ p̃(α_n^i) for all α_n^j ≥ α_n^i with α_n^i, α_n^j ∈ [0, α_n^max]. Furthermore, there exists a partition of [0, α_n^max] as [0, ϕ] ∪ [ϕ, α_n^max] such that
\[
\tilde{p}(\alpha_n) = c, \quad \alpha_n \in [0, \phi],
\]
(59)

where c is the optimal value of problem (46) for α n  = 0.

Next, we propose to use the bracketing method[30, 31] to find the infimum of the function p(α_n) on the interval α_n ∈ [0, α_n^max]. First, in Lemma 1, we show that the function p(α_n) is a unimodal function on the interval α_n ∈ [0, α_n^max] under the condition C: θ ≤ ϕ.

Lemma 1. The function
\[
p(\alpha_n) = \tilde{p}(\alpha_n) + \frac{\rho}{2} (\alpha_n - \theta)^2
\]
(60)

is a unimodal function on the interval α_n ∈ [0, α_n^max] under condition C.

Proof:
  1. For the case θ ≤ 0, the proof is trivial, since p(α_n) is the sum of two increasing functions on the interval α_n ∈ [0, α_n^max].
  2. For the case θ > 0, let us partition [0, α_n^max] as [0, θ] ∪ [θ, α_n^max]. On the interval α_n ∈ [0, θ], the function p̃(α_n) takes the constant value c (since θ ≤ ϕ under condition C), so p(α_n) is the sum of a constant and a decreasing convex quadratic, and hence decreasing. On the interval α_n ∈ [θ, α_n^max], the function p̃(α_n) is nondecreasing and (α_n − θ)² is increasing, so p(α_n) is increasing. Thus, the function p(α_n) is unimodal. □

Lemma 1 implies that under condition C (i.e., θ ≤ ϕ), the infimum of the function p(α_n) can be found optimally by using the bracketing method[30, 31].

For the case where condition C is not satisfied (i.e., ϕ < θ), let us partition [0, α_n^max] as [0, ϕ] ∪ [ϕ, θ] ∪ [θ, α_n^max]. On the interval α_n ∈ [0, ϕ], the function p(α_n) is decreasing (since p̃(α_n) takes the constant value c there, and (α_n − θ)² is decreasing). On the interval α_n ∈ [θ, α_n^max], the function p(α_n) is increasing (since p̃(α_n) is nondecreasing and (α_n − θ)² is increasing). On the interval α_n ∈ [ϕ, θ], analytically characterizing the curvature of p(α_n) is difficult, since the curvature of p̃(α_n) depends on the numerical parameters. This implies that for the case ϕ < θ, the infimum of the function p(α_n) lies on the interval [ϕ, θ], i.e.,
\[
\underset{\alpha_n \in [0, \alpha_n^{\max}]}{\operatorname{argmin}}\; p(\alpha_n) \in [\phi, \theta].
\]
(61)

Thus, in the case ϕ < θ (i.e., if condition C is not satisfied), the solution obtained by using the bracketing method[30, 31] lies at most (θ − ϕ) away from the optimal solution. However, in all of our numerical simulations, we have always observed that the function p(α_n) is unimodal, in which case problem (44) is solved optimally by the bracketing method[30, 31]. Moreover, the convergence of the proposed Algorithm 3 to the centralized solution (see the numerical examples in Section 5) shows that the bracketing method can be used to solve problem (44).

Appendix 2

The ADMM method is guaranteed to converge for all values of its penalty parameter ρ[21]. However, the convergence rate of the ADMM algorithm is sensitive to the choice of ρ. In practice, the ADMM penalty parameter ρ is either tuned empirically for each specific application or set equal to 1 after normalizing the problem data[21], Chapter 11. Note that in Algorithm 1, to solve the local variable update (22), we can normalize the problem data (i.e., the sum power Σ_{l ∈ L(n)} ‖m_l‖₂²) by a normalizing factor β_n > 0 and set ρ = 1, which is equivalent to setting ρ = β_n in Algorithm 1 when the problem data is not normalized. To elaborate, let us express the local variable update (22) equivalently as
\[
\begin{array}{ll}
\text{minimize} & \frac{1}{\beta_n} \sum_{l \in \mathcal{L}(n)} \|\mathbf{m}_l\|_2^2 + \frac{\rho}{2} \|\mathbf{x}_n - \mathbf{z}_n^i + \mathbf{v}_n^i\|_2^2 \\
\text{subject to} & \begin{bmatrix} \sqrt{1 + \frac{1}{\gamma_l}}\, \mathbf{h}_{ll}^H \mathbf{m}_l \\ \mathbf{M}_n^H \mathbf{h}_{ll} \\ \tilde{\mathbf{x}}_l \\ \sigma_l \end{bmatrix} \succeq_{\text{SOC}} 0, \quad l \in \mathcal{L}(n) \\
& \begin{bmatrix} x_{n,nl} \\ \mathbf{M}_n^H \mathbf{h}_{jl} \end{bmatrix} \succeq_{\text{SOC}} 0, \quad l \in \mathcal{I}_{\text{int}}(n)
\end{array}
\]
(62)

with variables M_n = [m_l]_{l ∈ L(n)} and x_n, where β_n > 0 is the normalizing factor, x̃_l = {x_{n,bl}}_{b ∈ N_int(l)} is a subset of x_n (see (12)), the vector h_jl in the second set of constraints denotes the channel from BS n to link l (i.e., the index j denotes an arbitrary link in L(n)), and the notation ⪰_SOC denotes the generalized inequality with respect to the second-order cone[16, 19].

For problem (62), the optimal choice of β_n is Σ_{l ∈ L(n)} ‖m_l⋆‖₂². However, before Algorithm 1 has converged, the optimal beamformers {m_l⋆}_{l ∈ L(n)} are not available. Thus, in our simulations, to estimate β_n, we ignore the interference and noise terms and find the beamforming power that achieves the required SINR threshold γ_l, which can be expressed as
\[
\|\tilde{\mathbf{m}}_l\|_2^2 = \frac{10^{0.1 \gamma_l}}{\|\mathbf{h}_{ll}\|_2^2}, \quad l \in \mathcal{L}(n).
\]
Hence, the normalizing factor β n for problem (62) can be written as
\[
\beta_n = \sum_{l \in \mathcal{L}(n)} \|\tilde{\mathbf{m}}_l\|_2^2 = \sum_{l \in \mathcal{L}(n)} \frac{10^{0.1 \gamma_l}}{\|\mathbf{h}_{ll}\|_2^2}.
\]

Furthermore, we find β = max n N { β n } , and set ρ = β for Algorithm 1.
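A minimal sketch of this heuristic, following (55); the per-BS lists of SINR targets (in dB) and direct-channel squared norms are hypothetical inputs, and the function name is illustrative.

```python
def penalty_parameter(targets_dB, h_norm_sq):
    """beta = max_n sum_l 10^(0.1*gamma_l) / ||h_ll||^2, per (55).
    targets_dB[n][l]: SINR target of link l of BS n, in dB;
    h_norm_sq[n][l]:  squared norm of the direct channel h_ll."""
    return max(
        sum(10 ** (0.1 * g) / h for g, h in zip(gs, hs))
        for gs, hs in zip(targets_dB, h_norm_sq)
    )

# two hypothetical BSs with one link each: 5 dB targets, channel gains 1 and 2
print(penalty_parameter([[5.0], [5.0]], [[1.0], [2.0]]))  # 10^0.5 ~ 3.162
```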

Declarations

Acknowledgements

This research was supported by the Finnish Funding Agency for Technology and Innovation (Tekes), Nokia Solutions and Networks, Anite Telecoms, Renesas Mobile Europe, and Elektrobit.

Authors’ Affiliations

(1)
Centre for Wireless Communications, University of Oulu

References

  1. Bengtsson M, Ottersten B: Optimal Downlink Beamforming Using Semidefinite Optimization. In Proceedings of the Annual Allerton Conference on Communications, Control, and Computing. Urbana-Champaign IL; 1999:987-996.Google Scholar
  2. Rashid-Farrokhi F, Liu KJR, Tassiulas L: Transmit beamforming and power control for cellular wireless systems. IEEE J. Select. Areas Commun 1998, 16(8):1437-1450. 10.1109/49.730452View ArticleGoogle Scholar
  3. Visotsky E, Madhow U: Optimum Beamforming Using Transmit Antenna Arrays. In Proceedings of the IEEE Vehicular Technology Conference. Houston, TX; 1999:851-856.Google Scholar
  4. Shi Q, Razaviyayn M, Hong M, Luo ZQ: SINR constrained beamforming for a MIMO multi-user downlink system. In Proceedings of the Annual Asilomar Conference on Signals, Systems and Computers. Pacific Grove, CA; 2012:1991-1995.Google Scholar
  5. Wong KK, Zheng G, Ng TS: Convergence analysis of downlink MIMO antenna systems using second-order cone programming. In Proceedings of the IEEE Vehicular Technology Conference. Dallas; 2005:492-496.Google Scholar
  6. Codreanu M, Tölli A, Juntti M, Latva-aho M: Joint design of Tx-Rx beamformers in MIMO Downlink channel. IEEE Trans. Signal Process 2007, 55(9):4639-4655.MathSciNetView ArticleGoogle Scholar
  7. Schubert M, Boche H: Solution of the Multiuser downlink beamforming problem with individual SINR constraints. IEEE Trans. Vehic. Technol 2004, 53: 18-28. 10.1109/TVT.2003.819629View ArticleGoogle Scholar
  8. Boche H, Schubert M: Optimal multi-user interference balancing using transmit beamforming. Wireless Pers. Commun., Kluwer Acad. Publishers 2003, 26(4):305-324. 10.1023/A:1025684624987View ArticleGoogle Scholar
  9. Schubert M, Boche H: Comparison of -norm and 1-norm optimization criteria for SIR-balanced multi-user beamforming. Elsevier Signal Process 2004, 84(2):367-378. 10.1016/j.sigpro.2003.10.022View ArticleGoogle Scholar
  10. Tolli A, Codreanu M, Juntti M: Minimum SINR maximization for multiuser MIMO downlink with per BS power constraints. In Proceedings of the IEEE Global Telecommunication Conference. Kowloon; 2007:1144-1149.Google Scholar
  11. Yu HDW: Coordinated beamforming for the multicell multi-antenna wireless system. IEEE Trans. Wireless Commun 2010, 9(5):1748-1759.View ArticleGoogle Scholar
  12. Tölli A, Pennanen H, Komulainen P: Decentralized minimum power multi-cell beamforming with limited backhaul signaling. IEEE Trans. Wireless Commun 2011, 10(2):570-580.View ArticleGoogle Scholar
  13. Pennanen H, Tolli A, Latva-ahom A: Decentralized coordinated downlink beamforming via primal decomposition. IEEE Signal Process. Lett 2011, 18(11):647-650.View ArticleGoogle Scholar
  14. Nguyen D, Le-Ngoc T: Multiuser downlink beamforming in multicell wireless systems: a game theoretical approach. IEEE Trans. Signal Process 2011, 59(7):3326-3338.MathSciNetView ArticleGoogle Scholar
  15. Shen C, Chang T, Wang K, Qiu Z, Chi C: Distributed robust multicell coordinated beamforming with imperfect CSI: an ADMM approach. IEEE Trans. Signal Process 2012, 60(6):2988-3003.MathSciNetView ArticleGoogle Scholar
  16. Wiesel A, Eldar YC, Shamai S: Linear precoding via conic optimization for fixed MIMO receivers. IEEE Trans. Signal Proc 2006, 54: 161-176.View ArticleGoogle Scholar
  17. Boyd S, Xiao L, Mutapcic A, Mattingley J: Notes on decomposition methods: course reader for convex optimization II,. Stanford 2007. [[Online]. Available:. Accessed 01 March 2012 http://www.stanford.edu/class/ee364b/notes/decomposition_notes.pdf]
  18. Boyd S: Subgradient methods. 2007. [Online]. Available: http://www.stanford.edu/class/ee364b/lectures/subgrad_method_slides.pdf. Accessed 01 March 2012.
  19. Boyd S, Vandenberghe L: Convex optimization. Cambridge: Cambridge University Press; 2004.View ArticleGoogle Scholar
  20. Huang Y, Zheng G, Bengtsson M, Wong KK, Yang L, Ottersten B: Distributed multicell beamforming with limited intercell coordination. IEEE Trans. Signal Process 2011, 59(2):728-738.MathSciNetView ArticleGoogle Scholar
  21. Boyd S, Parikh N, Chu E, Peleato B, Eckstein J: Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations Trends Mach. Learn 2010, 3: 1-122. 10.1561/2200000016View ArticleGoogle Scholar
  22. Yang J, Zhang Y: Alternating direction algorithms for ℓ1-problems in compressive sensing. SIAM J. Sci. Comput 2011, 33: 250-278. 10.1137/090777761MathSciNetView ArticleGoogle Scholar
  23. Figueiredo M, Bioucas-Dias J: Restoration of poissonian images using alternating direction optimization. IEEE Trans. Image Process 2010, 19(12):3133-3145.MathSciNetView ArticleGoogle Scholar
  24. Schizas I, Ribeiro A, Giannakis G: Consensus in ad hoc WSNs with noisy links - Part I: distributed estimation of deterministic signals. IEEE Trans. Signal Process 2008, 56: 350-364.MathSciNetView ArticleGoogle Scholar
  25. Leinonen M, Codreanu M, Juntti M: Distributed consensus based joint resource and routing optimization in wireless sensor networks. In Proceedings of the Annual Asilomar Conference on Signals, Systems and Computers. Pacific Grove, CA; 2012.Google Scholar
  26. He B, Yuan X: On the O(1/n) convergence rate of the Douglas-Rachford alternating direction method. SIAM J. Numerical Anal 2012, 50(2):700-709. 10.1137/110836936MathSciNetView ArticleGoogle Scholar
  27. Mota J, Xavier J, Aguiar P, Puschel M: Distributed ADMM for model predictive control and congestion control. In Proceedings of the IEEE International Conference on Decision and Control. Maui, HI; 2012:5110-5115.Google Scholar
  28. Deng W, Yin W: On the global and linear convergence of the generalized alternating direction method of multipliers. Rice University CAAM technical report TR12-14; 2012.Google Scholar
  29. Hong M, Luo ZQ: On the linear convergence of the alternating direction method of multipliers. arXiv preprint arXiv:1208.3922; 2012.Google Scholar
  30. Mathews JH, Fink KK: Numerical Methods Using Matlab. Upper Saddle River: Prentice-Hall Inc.; 2004.Google Scholar
  31. Cheney W, Kincaid D: Numerical Mathematics and Computing. Stamford: International Thomson Publishing; 1998.Google Scholar
  32. Joshi S, Codreanu M, Latva-aho M: Distributed resource allocation for MISO downlink systems via the alternating direction method of multipliers. In Proceedings of the Annual Asilomar Conference on Signals, Systems and Computers. Pacific Grove, CA; 2012:488-493.Google Scholar
  33. Bertsekas DP: Constrained Optimization and Lagrange Multiplier Methods. Waltham: Academic Press; 1982.Google Scholar
  34. Bertsekas D, Tsitsiklis J: Parallel and Distributed Computation: Numerical Methods. Belmont: Athena Scientific; 1997.Google Scholar
  35. Kumar A, Manjunath D, Kuri J: Wireless Networking. Burlington: Elsevier Inc.; 2008.Google Scholar
  36. Gupta P, Kumar PR: The capacity of wireless networks. IEEE Trans. Inform. Theory 2000, 46(2):388-404. 10.1109/18.825799MathSciNetView ArticleGoogle Scholar

Copyright

© Joshi et al.; licensee Springer. 2014

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.