The random access NUM with multiclass traffic

Vo, Phuong L; Lee, Sungwon; Hong, Choong Seon

doi:10.1186/1687-1499-2012-242

Research
Open access
Published: 06 August 2012

The random access NUM with multiclass traffic

Phuong L Vo¹,
Sungwon Lee¹ &
Choong Seon Hong¹

EURASIP Journal on Wireless Communications and Networking volume 2012, Article number: 242 (2012) Cite this article

2061 Accesses
5 Citations
Metrics details

Abstract

In this article, we consider the network utility maximization (NUM) problem for the random access network with multiclass traffic. The utilities associated with the users are not only concave, but also nonconcave functions. Consequently, the random access NUM problem becomes more difficult to solve. Based on the successive approximation method, we propose an algorithm that jointly controls the rate and the persistent probability of the users. The proposed algorithm converges to a suboptimal solution to the original problem which also satisfies the Karush–Kuhn–Tucker conditions. We also generalize the framework so that a broader choice of utility functions can be applied.

Introduction

The network utility maximization (NUM) for the random access wireless networks is thoroughly studied in the literature, e.g.,[1–3]. The assumption of strictly concave utilities in conventional works makes the NUM to merely address the elastic traffic which is from nonreal-time applications. In current Internet, there are many kinds of traffic, both elastic and inelastic. The inelastic traffic from the real-time applications does not have the strictly concave form anymore. They are usually modeled by sigmoidal utilities, which are convex at the lower region and concave at the higher region as depicted in Figure1[4]. As a result, the analysis frameworks in[1–3] cannot be applied in the case of multiclass traffic and it is very difficult to address the nonconvexity of the problem.

The early studies that deal with the inelastic traffic in the basic NUM problem for wired networks are[5, 6]. The authors utilize the standard dual-based algorithm to allocate the rate. Certainly, this algorithm does not result to an optimal solution because of the nonconvexity of the primal problem. The duality gap is not always zero and the result is suboptimal or even infeasible. Therefore, the authors of[5] offer a ‘self-regulate’ mechanism for the users to access the network without fluctuation. On the other hand, the authors of[6] find the conditions for which the dual-based algorithm converges to a global optimum. It turns out that the link capacity must be higher than a critical value. Then, they propose the link ‘capacity provisioning’ to satisfy those conditions. Another method to solve the basic NUM is using the sum-of-square method in[7]. The nonconvex NUM is relaxed and solved by semidefinite programming. However, this method requires a centralized and offline computation. Its framework is also difficult to integrate into the cross-layer optimization problem in which the dual decomposition approach has shown its efficiency[8]. Extending the work in[6] to the random access WLANs, the authors of[9] design a dual-based algorithm to jointly allocate the rate and the persistent probability of elastic and inelastic traffic. Consequently, their algorithm only converges just in the case where the link capacities are higher than critical values. Otherwise, only the lower bound and upper bound are specified.

In this article, we address the random access NUM for multiclass traffic using the successive approximation method. The solutions to the convex approximation problems converge to a suboptimal solution which also satisfies the Karush–Kuhn–Tucker (KKT) conditions of the original problem. The successive approximation method is first introduced in[10]. It is usually applied to geometric programming in the power control problems such as[11–13]. Similar to our previous work[14] which jointly controls the rate and power in a multi-hop wireless network with multiclass traffic, the nonconcave objective of the problem is approximated to a concave function. After solving a series of approximation problems, the algorithm converges. Moreover, we generalize our analysis framework and show that a broader choice of utilities can be obtained.

The rest of the article is organized as follows. Section ‘Design of the successive approximation algorithm’ introduces the network model and propose the successive approximation algorithm. Section ‘More general utility functions and analysis’ generalizes the framework analysis and finds the conditions on the utility. The numerical results and some discussions are presented in Section ‘Numerical results and discussions’. Finally, conclusions are given in Section ‘Conclusions’.

Notations: In this article, we use italic characters to denote variables and bold characters to denote vectors. For example, $x = [x_{1}, \dots, x_{| N |}]$ , $p = [p_{1}, \dots, p_{| N |}]$ , and $c = [c_{1}, \dots, c_{| N |}]$ are $| N |$ -dimensional vectors which elements are x_i, p_i, and c_i, respectively. The words ‘user’ and ‘node’ are sometimes used interchangeably.

Design of the successive approximation algorithm

Network model

We consider a wireless LAN with the set of users N. We assume that every user is one-hop neighbor to another. Each user generates saturated traffic, i.e., it always has packets to transmit. If each user i attempts to access the medium with probability p_i, then the probability of successful transmission of user i will be $p_{i} \prod_{j \neq i} (1 - p_{j})$ . As a result, the long-term transmission rate of user i is $c_{i} p_{i} \prod_{j \neq i} (1 - p_{j})$ , where c_i is the wireless link capacity of user i. The random access NUM is stated as follows[1, 9]

\begin{align} (P 1) : max. & \sum_{i \in N} U_{i} (x_{i}) \\ s.t. & x_{i} \leq c_{i} p_{i} \prod_{j \neq i} (1 - p_{j}), \forall i \in N, \\ x^{min} ≼ x ≼ x^{max}, \\ 0 ≼ p ≼ 1, \\ variables : x, p, \end{align}

where U_i is the utility function of user i. In this article, we assume that $x^{min}$ is strictly greater than 0 to avoid dividing by zero in the mathematical analysis.

Each user is associated with a utility function. We will mention a broader choice of utility functions that can be applied to our framework later in Section ‘More general utility functions and analysis’. In this section, we consider two groups of utility functions:

1.
The concave utilities for elastic traffic
$U (x) = \{\begin{matrix} \ln (x + 1), if α = 1, \\ \frac{{(x + 1)}^{(1 - α)} - 1}{1 - α}, if α > 0 and α \neq 1; \end{matrix}$
(1)
2.
The sigmoidal utilities for inelastic traffic
$U (x) = \frac{x^{a}}{k + x^{a}}, \forall a > 1, k > 0 .$
(2)

The sigmoidal function (2) has an inflection point at $x^{in} = {(\frac{k (a - 1)}{a + 1})}^{1 / a}$ . It is convex in $(x^{min}, x^{in})$ and concave in $(x^{in}, x^{max})$ . In the literature, sigmoidal function is usually used for the real-time utility because it is small when the rate is below xⁱⁿ and increases quickly when the rate exceeds xⁱⁿ. As a result, xⁱⁿis also considered the demand of a real-time connection (see Figure1).

Similar to the articles on utility optimality of multiclass traffic, e.g.,[5, 7, 9, 15], the concave utilities usually cannot take the conventional form of α-fair utility which is ln(x) if α=1 and $\frac{x^{1 - α}}{1 - α}$ if α>0 and α≠1[16]. It is shifted 1 unit on the x-axis. With the present of sigmoidal utilities which are usually the same as (2) or $\frac{1}{1 + e^{- a (x - b)}}$ a b>0 in the literature, the utilities of the users are normalized or at least have close values at $x^{max}$ in order to be comparable. Otherwise, the inelastic flows always take the advantage over the elastic flows because of the conventional α-fair utility is negative as α>1. So the concave utilities usually have the form as (1) in these articles.

Approximation problem

Since the utilities (1) and (2) are always positive as x>0, we maximize the logarithm of the aggregate utility instead of itself and replace (P1) by an equivalent problem as follows

\begin{align} (P 2) : max. & \ln (\sum_{i \in N} U_{i} (x_{i})) \\ s.t. & x_{i} \leq c_{i} p_{i} \prod_{j \neq i} (1 - p_{j}), \forall i \in N, \\ x^{min} ≼ x ≼ x^{max}, \\ 0 ≼ p ≼ 1, \\ variables : x, p . \end{align}

(3)

The Lagrangian of (P2) is given by

\begin{align} L_{2} (x, p, ν) = ln (\sum_{i \in N} U_{i} (x_{i})) - \sum_{i \in N} ν_{i} (x_{i} - c_{i} p_{i} \prod_{j \neq i} (1 - p_{j})), \end{align}

where ν_i is the multiplier associated with the constraint $x_{i} \leq c_{i} p_{i} \prod_{j \neq i} (1 - p_{j})$ for all $i \in N$ . We have the following result

Lemma 1

(P1) and (P2) share the same optimal/suboptimal solutions. Moreover, if (x^∗,p^∗,ν^∗) is a KKT point of (P2), which means that the following conditions are satisfied

\begin{align} \nabla_{x} L_{2} (x^{*}, p^{*}, ν^{*}) = 0 and \nabla_{p} L_{2} (x^{*}, p^{*}, ν^{*}) = 0; \end{align}

(4)

\begin{align} ν_{i}^{*} (x_{i}^{*} - c_{i} p_{i}^{*} \prod_{j \neq i} (1 - p_{j}^{*})) = 0, \forall i \in N; \end{align}

(5)

\begin{align} x_{i}^{*} \leq c_{i} p_{i}^{*} \prod_{j \neq i} (1 - p_{j}^{*}), \forall i \in N; \end{align}

(6)

\begin{align} ν^{*} ≽ 0, \end{align}

(7)

then $(x^{*}, p^{*}, (\sum_{i \in N} U_{i} (x_{i}^{*})) ν^{*})$ is a KKT point of (P1).

Proof

Since logarithm is a monotonically increasing function, the first statement is obvious. We now verify the second statement. The Lagrangian of (P1) is given by

\begin{align} L_{1} (x, p, μ) = \sum_{i \in N} U_{i} (x_{i}) - \sum_{i \in N} μ_{i} (x_{i} - c_{i} p_{i} \prod_{j \neq i} (1 - p_{j})) . \end{align}

(8)

We can easily verify that (4)–(7) are equivalent to the KKT conditions of (P1), which are

\begin{align} \nabla_{x} L_{1} (x^{*}, p^{*}, μ^{*}) = 0 and \nabla_{p} L_{1} (x^{*}, p^{*}, μ^{*}) = 0; \end{align}

(9)

\begin{align} μ_{i}^{*} (x_{i}^{*} - c_{i} p_{i}^{*} \prod_{j \neq i} (1 - p_{j}^{*})) = 0, \forall i \in N; \end{align}

(10)

\begin{align} x_{i}^{*} \leq c_{i} p_{i}^{*} \prod_{j \neq i} (1 - p_{j}^{*}), \forall i \in N; \end{align}

(11)

\begin{align} μ^{*} ≽ 0, \end{align}

(12)

when $μ^{*} = (\sum_{i \in N} U_{i} (x_{i}^{*})) ν^{*}$ , for all $i \in N$ . □

We now derive an inequality to approximate (P2) to a new problem which can equivalently be transformed to a convex one. From the arithmetic-geometric mean inequality, we have $\sum_{i \in N} θ_{i} u_{i} \geq \prod_{i \in N} {(u_{i})}^{θ_{i}}$ for all u≽0, θ≻0, and 1^Tθ=1. Replacing u_i with $\frac{U_{i} (x_{i})}{θ_{i}}$ and taking the logarithm of both sides of the inequality yields

\ln (\sum_{i \in N} U_{i} (x_{i})) \geq \sum_{i \in N} θ_{i} \ln (\frac{U_{i} (x_{i})}{θ_{i}})

(13)

The equality of (13) holds if and only if

θ_{i} = \frac{U_{i} (x_{i})}{\sum_{k \in N} U_{k} (x_{k})}, \forall i \in N.

(14)

Now we consider the approximation problem as follows

\begin{align} (P 3^{τ}) : max. & \sum_{i \in N} θ_{i}^{(τ)} \ln (\frac{U_{i} (x_{i})}{θ_{i}^{(τ)}}) \\ s.t. & x_{i} \leq c_{i} p_{i} \prod_{j \neq i} (1 - p_{j}), \forall i \in N, \\ x^{min} ≼ x ≼ x^{max}, \\ 0 ≼ p ≼ 1, \\ variables : x, p . \end{align}

As we have mentioned earlier, there is a sequence of approximations. The superscript τ is used here to indicate that this is the τ th approximation problem, θ^(τ) is a fixed value in τ th approximation problem. It will be proved that, by updating θ and solving the approximation problem many times, the solution to the approximation problem converges. At the stationary point, the approximation becomes exact.

Changing the variables ${\tilde{x}}_{i} ≜ \ln (x_{i})$ as in[1, 9] to separate the product form of the constraints, the following problem is obtained

\begin{align} (P 4^{τ}) : max. & \sum_{i \in N} Ũ_{i} ({\tilde{x}}_{i}; θ_{i}^{(τ)}) \\ s.t. & {\tilde{x}}_{i} \leq {\tilde{c}}_{i} + \ln (p_{i}) + \sum_{j \neq i} \ln (1 - p_{j}), \forall i \in N, \\ {\tilde{x}}^{min} ≼ \tilde{x} ≼ {\tilde{x}}^{max}, \\ 0 ≼ p ≼ 1, \\ variables : \tilde{x}, p, \end{align}

where $Ũ_{i} ({\tilde{x}}_{i}; θ_{i}^{(τ)}) ≜ θ_{i}^{(τ)} \ln (\frac{U_{i} (e^{{\tilde{x}}_{i}})}{θ_{i}^{(τ)}})$ is a function of ${\tilde{x}}_{i}$ parameterized by θ_i, and ${\tilde{c}}_{i} ≜ \ln (c_{i})$ .

Lemma 2

The function $Ũ_{i} ({\tilde{x}}_{i}; θ_{i})$ is strictly concave for both concave and sigmoidal utilities (1) and (2).

Proof

See the Appendix for the proof. □

From Lemma 2, (P4^τ) is a convex problem; therefore, it can be solved efficiently for an optimal solution. In the next section, we will solve (P4^τ) using the dual-based decomposition approach.

Solution to the approximation problem and the algorithm

We apply the dual decomposition method to solve (P4^τ). Its Lagrangian is given by

\begin{align} L_{4} (\tilde{x}, p, λ; θ^{(τ)}) = & \sum_{i \in N} Ũ_{i} ({\tilde{x}}_{i}; θ_{i}^{(τ)}) - \sum_{i \in N} λ_{i} \\ \times ({\tilde{x}}_{i} - {\tilde{c}}_{i} - \ln (p_{i}) - \sum_{j \neq i} \ln (1 - p_{j})) . \end{align}

Hence, the dual function is

\begin{align} D (λ; θ^{(τ)}) = & max_{\begin{matrix} {\tilde{x}}^{min} ≼ \tilde{x} ≼ {\tilde{x}}^{max} \\ 0 ≼ p ≼ 1 \end{matrix}} L_{4} (\tilde{x}, p, λ; θ^{(τ)}) \\ = & \sum_{i \in N} max_{{\tilde{x}}_{i}^{min} \leq {\tilde{x}}_{i} \leq {\tilde{x}}_{i}^{max}} (Ũ_{i} ({\tilde{x}}_{i}; θ_{i}^{(τ)}) - λ_{i} {\tilde{x}}_{i}) (15) \\ + max_{0 ≼ p ≼ 1} \sum_{i \in N} λ_{i} (\ln (p_{i}) + \sum_{j \neq i} \ln (1 - p_{j})) \\ + \sum_{i \in N} λ_{i} {\tilde{c}}_{i}, (16) \end{align}

and the dual problem is $min_{λ ≽ 0} D (λ; θ^{(τ)})$ .

Since both subproblems (15) and (16) are convex problems, the first-order conditions are sufficient to establish their optimal solutions. The solution to the first subproblem (15) at time instant t is given by

{\tilde{x}}_{i}^{(τ)} (t) = {[Ũ_{i}^{″ - 1} (λ_{i}^{(τ)} (t); θ_{i}^{(τ)})]}_{{\tilde{x}}_{i}^{min}}^{{\tilde{x}}_{i}^{max}}, \forall i \in N.

(17)

where ${[z]}_{z^{min}}^{z^{max}} = min (max (z, z^{min}), z^{max})$ , the projection of z on $[z^{min}, z^{max}]$ . Solving the second subproblem (16) yields the persistent probability[1]

p_{i}^{(τ)} (t) = \frac{λ_{i}^{(τ)} (t)}{\sum_{j \in N} λ_{j}^{(τ)} (t)}, \forall i \in N.

(18)

We now apply the subgradient algorithm to solve the dual problem. $({\tilde{x}}_{i} - {\tilde{c}}_{i} - \ln (p_{i}) - \sum_{j \neq i} \ln (1 - p_{j}))$ is a subgradient of D(λ;θ^(τ)) where ${\tilde{x}}_{i}$ and p_i are specified by (17) and (18), respectively. Hence, the subgradient update is as follows[17]

\begin{align} λ_{i}^{(τ)} (t + 1) = & [λ_{i}^{(τ)} (t) - γ (t) ({\tilde{c}}_{i} + \ln (p_{i}^{(τ)} (t)) \\ {+ \sum_{j \neq i} \ln (1 - p_{j}^{(τ)} (t)) - {\tilde{x}}_{i}^{(τ)} (t))]}^{+}, \forall i \in N, \end{align}

(19)

where γ(t) is the step-size sequence, ${\tilde{x}}_{i}^{(τ)} (t)$ and $p_{i}^{(τ)} (t)$ are calculated according to (17) and (18), respectively, at time instant t. ${[a]}^{+} = max (a, 0)$ . Once again, we use the superscript τ in (17)–(19) to indicate that they are the values in solving the τ th approximation problem. From the above analysis, we develop the successive approximation algorithm for the multiclass traffic in the one-hop random access wireless network as described in Algorithm 1.

Algorithm 1 Successive approximation algorithm for multiclass traffic

1.
Initialize from θ⁽⁰⁾and any feasible point;
2.
τ:=0;
3.
loop
4.
τ:=τ + 1;
5.
t:=0;
6.
repeat
7.
t:=t + 1;
8.
Set rate, persistent probability, and multipliers according to (17), (18), and (19) respectively;
9.
until stationary;
10.
$θ_{i}^{(τ + 1)} : = \frac{U_{i} (x_{i}^{(τ) *})}{\sum_{k \in N} U_{k} (x_{k}^{(τ) *})}$ ;
11.
$x_{i}^{(τ + 1)} (0) : = x_{i}^{(τ) *}$ ;
12.
end loop

In Algorithm 1, $x_{s}^{(τ) *}$ is the stationary value of the τ th (outer-)iteration. At step 10, the new value θ is calculated by the stationary rate of previous outer-iterations. Moreover, the initial value of a new outer-iteration is the stationary value of the previous outer-iteration at step 11.

Theorem 1

If the step size satisfies γ(t)>0, $lim_{t \to \infty} γ (t) = 0$ , and $\sum_{t = 1}^{\infty} γ (t) = \infty$ , then Algorithm1 monotonically increases the aggregate utility in each outer-iteration and converges to a stationary point satisfying the KKT conditions of (P1).

Proof

See the Appendix for the proof. □

We have some discussions on the distributed implementation and the message passing mechanism of the proposed algorithm. There are two kinds of updates in Algorithm 1, the inner-updates (17)–(19) and the outer-updates (14). In each inner-iteration, a user uses the information $\sum_{j \in N} λ_{j} (t)$ to update its persistent probability according to (18). The persistent probabilities of all the nodes are also needed to update the user’s multiplier according to (19). Hence, after each inner-iteration, each user broadcasts its information (p_i and λ_i) to all the other users in the network. At the outer-iteration, each user needs the information of total utility of all the users to update its θ-value according to (14). Therefore, each user also broadcasts its current utility value to all the other users in each outer-iteration. Note that, the users update their θ-values as recognizing the stationary of the inner-iterations. The following technique can be used for the users to recognize the stationary. The users broadcast their utility periodically after each T time-slots. So, each user can always keep track of the aggregate utility value of the system. It only updates its θ-value as recognizing the stationary of this value.

Finally, there are some mechanisms to reduce the amount of message passing in the network:

1.
Each node piggybacks its information p_i, λ_i, and θ_iby inserting them into their data packets. Since all nodes are one-hop neighbors to each other, the other odes can overhear these information and update their values based on the received information.
2.
The multiplier update (19) can be a local update as follows. We rewrite the update (19) by $λ_{i}^{(τ)} (t + 1) = {[λ_{i}^{(τ)} (t) - γ (t) ({\tilde{c}}_{i} + \ln (p_{i}^{succ} (t)) - {\tilde{x}}_{i}^{(τ)} (t))]}^{+}$ , where $p_{i}^{succ} (t) = p_{i}^{(τ)} (t) \prod_{j \neq i} (1 - p_{j}^{(τ)} (t))$ is the successful transmission probability of node i. The value p^succcan be estimated locally. For example, (1) $p_{i}^{succ} \approx \frac{number of successful transmissions of i}{number of transmissions of i}$ , or (2) we can estimate the probability that the channel is idle $p^{idle} \approx \frac{number of timeslots the channel is idle}{number of timeslots}$ and the successful transmission probability will be $p_{i}^{succ} \approx p^{idle} \frac{p_{i}}{1 - p_{i}}$ due to $p^{idle} = \prod_{i \in N} (1 - p_{i})$ . By estimating this parameter locally, the multipliers can be implicitly updated. Therefore, the amount of message passing in the network is reduced significantly.

More general utility functions and analysis

In the first part of this section, we focus on the conditions of utility functions that the above analysis can still be applied. It is easy to see that the first criteria are

twice continuously differentiable and monotonically increasing function;
bounded function: U_i(x_i)>0, ∀x_i>0 and U_i(x_i) is bounded as x_iis bounded.

The important condition is that the function $Ũ_{i} ({\tilde{x}}_{i}) = θ_{i} \ln (\frac{U_{i} (e^{{\tilde{x}}_{i}})}{θ_{i}})$ must be strictly concave. Equivalently, we must have $\frac{d^{2} Ũ_{i} ({\tilde{x}}_{i})}{d {\tilde{x}}_{i}^{2}} = \frac{x_{i} U_{i} - x_{i}^{2} U_{i}^{″}}{U_{i}^{2}} U_{i}^{″} + \frac{x_{i}^{2} U_{i}^{″″}}{U_{i}} < 0$ . With the assumption U_i(x_i)>0,∀x_i>0, the condition is equivalent to

U_{i}^{″} + x_{i} U_{i}^{″″} < \frac{1}{U_{i}} x_{i} U_{i}^{″ 2}, \forall i \in N

(20)

where U_i=U_i(x_i), $U_{i}^{″} = \frac{d U_{i} (x_{i})}{d x_{i}}$ , and $U_{i}^{″″} = \frac{d^{2} U_{i} (x_{i})}{d x_{i}^{2}}$ .

We next consider the logarithm transformation from (P1) to (P2). Indeed, the log-transformation ln(u) transforms u into a ‘more’ concave function, for example, x + 1 is linear but ln(x + 1) is strictly concave; $\frac{1}{1 + e^{- a (x - b)}}, a, b > 0$ is nonconcave but $\ln (\frac{1}{1 + e^{- a (x - b)}})$ is concave. We generalize the analysis by using a general concave function f(u) which is monotonically increasing. Instead of using the approximation inequality (13) from arithmetic-geometric mean inequality, we use Jensen’s inequality

f (\sum_{i \in N} U_{i} (x_{i})) \geq \sum_{i \in N} θ_{if} (\frac{U_{i} (x_{i})}{θ_{i}}),

(21)

for all vector θ, such that θ≻0 and 1^Tθ=1. In this case, the condition on the utility function in order to perform the analysis is that $θ_{i} f (\frac{U_{i} (e^{{\tilde{x}}_{i}})}{θ_{i}})$ must be concave, or its second derivative in terms of ${\tilde{x}}_{i}$ must be negative equivalently. Hence,

U_{i}^{″} + x_{i} U_{i}^{″″} < (- \frac{f^{″″} (U_{i} / θ_{i})}{θ_{i} f^{″} (U_{i} / θ_{i})}) x_{i} U_{i}^{″ 2}, \forall i \in N.

(22)

We note at the factor $- \frac{f^{″″} (U_{i} / θ_{i})}{θ_{i} f^{″} (U_{i} / θ_{i})}$ in (22). It is always positive because f is a monotonically increasing and concave function. The higher the factor, the quicker the slope of f changes, and the more relaxed the condition of utility.

Particularly, if f(.) has the form of well-used α-fair family,

f (u) = \{\begin{matrix} \ln (u), if β = 1, \\ \frac{u^{1 - β}}{1 - β}, if β > 0, β \neq 1, \end{matrix}

(β is used here to distinguish from α parameter in (1)), we can see that the analysis in Section ‘Design of the successive approximation algorithm’ is a special case as β=1, and the condition (22) becomes exactly (20) in this case. In case of β>0 and β≠1, $- \frac{f^{″″} (U_{i} / θ_{i})}{θ_{i} f^{″} (U_{i} / θ_{i})} = \frac{β}{U_{i}}$ . So, the higher the value of β, the more relaxed the condition (22). We consider some following examples:

1.
α -fair utility $U_{i} (x_{i}) = \frac{x_{i}^{1 - α}}{1 - α}$ with 0<α<1: although this function is a canonical α-fair concave function, it cannot be applied to [1]. Lemma 1 therein requires a ‘sufficiently’ concave utility function, i.e., α>1 for the α-fair family. However, with the transform function f(u)=−1/u(which corresponding to β=2) and the new approximation (21) instead of (13), our framework can be applied.
2.
Linear/convex utility function $U_{i} (x_{i}) = x_{i}^{M}$ : if β=1, $Ũ_{i} ({\tilde{x}}_{i}; θ_{i})$ is a linear function, the analysis in Section ‘Design of the successive approximation algorithm’ cannot be applied. With the use of f(u)=−1/u which corresponds to β=2, $Ũ_{i} ({\tilde{x}}_{i}; θ_{i}) = - \frac{θ_{i}^{2}}{e^{M {\tilde{x}}_{i}}}$ is a strictly concave function. Note that this utility function certainly leads to the nonconvergence of the standard dual-based algorithm in [1, 9] because it is not a concave function.
3.
Exponential utility $U_{i} (x_{i}) = e^{x_{i}}$ : it is clear that we cannot use the standard dual-based algorithm in [1] because of the same reason as the above examples. The inequality (22) becomes $β > 1 + \frac{1}{x_{i}}$ . Therefore, if we choose β such that $β > 1 + \frac{1}{min_{i} x_{i}^{min}}$ , then the exponential utility can still be applied.

Numerical results and discussions

In this section, we use $\frac{x_{i}}{x_{i} + 1}$ as elastic utility with α=2 and $\frac{x_{i}^{4}}{x_{i}^{4} + 400}$ as inelastic utility with k=400 and a=4 (see Figure1). The rate unit for calculating utilities is Mbps. The inner-iteration is considered stationary if $|\frac{x (t) - x (t - 1)}{x (t - 1)}| ≺ 10^{- 4}$ . $x^{min} = 0.01$ Mbps and $x^{max} = c$ Mbps. The diminishing step size 0.001/t is used for Algorithm 1. λ⁰(0) is 0.1.

Convergence of the algorithm

In the first experiment, we want to examine the convergence of Algorithm 1 in case of scarce resource. We consider a network with two inelastic users. The link capacities are all 6 Mbps. With the use of standard dual-based algorithm presented in[9, Alg. 1], although the persistent probabilities of two flows converge, we cannot find any step size for the convergence of the rates. With Algorithm 1, however, both rates and persistent probabilities converge to a stationary point as shown in Figure2a,b.

We can see that although two users are symmetric, i.e., the same utilities as well as link capacities, one of them accesses the channel most of the time whereas the other one is mostly abandoned. This result shows the major difference from the resource allocation of elastic flows in which all elastic flows are fairly allocated the resource. Therefore, by using the sigmoidal utilities, the admission control is implicitly integrated as we solve the NUM. This is an advantage of using the sigmoidal utility. Also we have a remark that we rarely have fairness among inelastic users. Intuitively, when there is not enough resource for both flows, it is better to drop one flow and keep the other one than to maintain both inelastic flows with bad quality. This unfairness is also similar to the real-time system with the explicit admission control scheme. Some real-time connections can be dropped to guarantee the system performance because of the lack of the resource.

To mitigate the unfairness among the users as well as to avoid the starvation of some users in the network, we can guarantee a minimum persistent probability for each user. The constraint $0 ≼ p ≼ 1$ is replaced by the new one $p^{min} ≼ p ≼ 1$ where $max_{i} p_{i}^{min} \leq 1 / | N |$ to avoid the infeasibility. As a result, the persistent probability update (18) for each user in the τ th outer-iteration becomes $p_{i}^{(τ)} (t) = max (p_{i}^{min}, \frac{λ_{i}^{(τ)} (t)}{\sum_{j \in N} λ_{j}^{(τ)} (t)})$ for all $i \in N$ . With the new lower bound $p^{min}$ , all the users have a minimum chance to access the channel.

A heuristic implementation

We implement a heuristic algorithm in this experiment by limiting the number of inner-iterations in each τ-step to a fixed value T. As we have seen, Algorithm 1 has two levels of convergence. The outer-iterations update θ and the inner-iterations solve the convex approximation problem. Theoretically, the number of inner-iterations must be large enough for the convergence in every outer-step. In the heuristic algorithm, we limit the number of inner-iterations to a fixed value T. Moreover, we also apply a constant step size to the subgradient update (19) since it usually has a faster convergence than the diminishing step-size. It is known that with the dual-based subgradient algorithm using constant step size, the primal function sequence calculated from the running average primal values ${{\hat{x}}^{(τ)} (t) = \frac{1}{t} \sum_{k = 1}^{t} x^{(τ)} (k), t = 1, 2, \dots}$ converges to an optimal value (of P 3^τ) within an error ([18], Sec. 1.2). The feasible violation of the running average primal sequence also converges to zero. So, in the heuristic algorithm, θ_i corresponding to user i is updated according to $θ_{i}^{(τ)} = \frac{U_{i} ({\hat{x}}_{i}^{(τ - 1)} (T))}{\sum_{k \in N} U_{k} ({\hat{x}}_{k}^{(τ - 1)} (T))}$ , the running average value of the previous outer-iteration. The heuristic algorithm converges to the same solution as Algorithm 1 does in most of our experiments. However, we have a note that its convergence cannot be guaranteed theoretically. The reason is that with the dual-based subgradient update solving the approximation problem, the primal value ${\hat{x}}_{k}^{(τ - 1)} (T)$ can be infeasible. Therefore, the inequality (27) is no longer valid, i.e., we cannot guarantee a feasible improvement of the objective in every outer-iterations.

We repeat the experiment in Subsection ‘Convergence of the algorithm’ with T=5. Figure2c,d shows the evolution of rate and persistent probability with the heuristic algorithm. The convergence is much faster than the ones with stationary inner-iterations as shown in Figure2a,b. We consider another example in which there are four users, two elastic and two inelastic. The link capacities are c=[36 24 6 48] Mbps. Figure3c,d also shows the convergence of heuristic algorithm which is also much faster than that of Algorithm 1 in Figure3a,b.

Varying the initial point

Given θ, (P^4τ) as well as (P^3τ) have a unique optimal solution due to the strict convexity of (P4^τ). So, we can see that the result of Algorithm 1 only depends on choosing the initial θ⁽⁰⁾. In this experiment, we evaluate the stationary point according to different initial θ⁽⁰⁾. Let consider again the network with four users in Section ‘A heuristic implementation’. We uniformly generate 100 random initial vectors θ⁽⁰⁾and run Algorithm 1 with these 100 initial points. Figure4 shows the results of 100 experiments starting from these initial points. We can see that 72% of the experiments reach the globally optimal point x^∗=[4.20 3.36 0.01 9.03] Mbps, p^∗=[0.28 0.32 0.01 0.39], and Usum^∗=2.52.

Compare to the standard dual-based algorithm

We compare the aggregate utility archived by Algorithm 1 to the lower and upper bounds calculated from the standard dual-based algorithm in[9] as the number of users in the network increases gradually. In[9], after log-transforming the rate variables of the original NUM, the standard dual-based algorithm (Algorithm 1 therein) can achieve the stationary value of the multipliers, i.e., λ^∗, due to the convexity of the dual problem. Therefore, the lower bound is calculated by $\sum_{i \in N} U_{i} (x_{i}^{*})$ , where $p_{i}^{*} = \frac{λ_{i}^{*}}{\sum_{j \in N} λ_{j}^{*}}$ and $x_{i}^{*} = c_{i} p_{i}^{*} \prod_{j \neq i} (1 - p_{j}^{*})$ . The upper bound is the value of the dual function at the point λ^∗. Notice that this upper bound is absolutely not a feasible solution in case of nonzero duality gap.

We fix the link capacities at 12 Mbps and increase the number of users gradually. Half of the users have the elastic utilities and the other ones have the inelastic utilities. Figure5 shows that when the number of users increases, the aggregate utility also increases. It is always higher than the lower bound specified by the standard dual-based algorithm in[9].

Compare to binary exponential backoff MAC protocol

In this experiment, we want to compare our proposed algorithm to the MAC protocol running binary exponential backoff (BEB) rule such as IEEE 802.11 DCF. It is known that the window-based BEB MAC protocol implicitly maximizes it own utility function in a noncooperative game model[19]. Its equilibrium persistent probability depends on the maximum and minimum contention windows (CW). In this experiment, the minimum CW for BEB MAC is 7 time-slots and the maximum CW is 1,023 time-slots. All the links are fixed at 12 Mbps. We vary the number of users from 4 to 50. Half of the users are elastic and the other ones are inelastic. The collision probability is the probability when there are more than one user access the channel at the same time. The system throughput is calculated according to[20] with the setting parameters are listed in Table1.

Table 1 Setting parameters for subsection “Compare to binary exponential backoff MAC protocol”

Full size table

Figure6 shows the system throughput and collision probability of the proposed algorithm and BEB MAC. When the number of nodes is small, the collision of our proposed protocol is a little bit higher than that of BEB MAC and the system throughput of our proposed protocol is slightly lower than BEB MAC. However, when the number of nodes in the network increases, the collision of the BEB MAC also increases since the users use the incomplete information of the network condition in their distributed operation. With our proposed algorithm, many users tend to decrease their access probability (extend their contention window equivalently) to decrease the number of collisions for each user (see Figure6a). As a result, the system throughput of BEB MAC decreases much faster than that of our proposed protocol as we increase the number of nodes in the network (see Figure6b).

Conclusions

Based on successive approximation method, we have proposed an algorithm that converges to a KKT solution to the nonconvex NUM problem of a random access WLAN serving multiclass traffic. The equivalent problem of the original one is approximated to a new convex problem, which is solved efficiently by the dual-based decomposition approach. The algorithm converges after a sequence of approximations. We specify the necessary condition on the utilities to be used in the framework and we also generalize the analysis framework. The simulations show that our algorithm can achieve the global optimum starting from many initial points.

Appendix

Proof of Lemma 2

We prove Lemma 2 by verifying the second derivative of $Ũ_{i} ({\tilde{x}}_{i}; θ_{i})$ in terms of ${\tilde{x}}_{i}$ . For clearly presentation, we transform back to the x space and omit the superscript τ.

In case of concave utilities,

1.
if α=1, then
$\begin{align} \frac{d^{2} \tilde{U_{i}} ({\tilde{x}}_{i}; θ_{i})}{{d {\tilde{x}}_{i}}^{2}} = & \frac{d^{2}}{{d {\tilde{x}}_{i}}^{2}} (θ_{i} \ln (\frac{\ln (e^{{\tilde{x}}_{i}} + 1)}{θ_{i}})) \\ = & - \frac{θ_{i} x_{i}}{{(x_{i} + 1)}^{2} \ln^{2} (x_{i} + 1)} \\ \times (x_{i} - \ln (x_{i} + 1)) < 0 \end{align}$
(23)

because $e^{x_{i}}$ >x_i + 1 for all x_i>0.

2.
If α>0 and α≠1, then
$\begin{align} \frac{d^{2} Ũ_{i} ({\tilde{x}}_{i}; θ_{i})}{{d {\tilde{x}}_{i}}^{2}} = & \frac{d^{2}}{{d {\tilde{x}}_{i}}^{2}} (θ_{i} \ln (\frac{{(e^{{\tilde{x}}_{i}} + 1)}^{1 - α} - 1}{θ_{i} (1 - α)})) \\ = & (1 - α) ({(x_{i} + 1)}^{1 - α} - (1 - α) x_{i} - 1) \\ \times \frac{θ_{i} x_{i} {(x_{i} + 1)}^{- 1 - α}}{{({(x_{i} + 1)}^{1 - α} - 1)}^{2}} . \end{align}$
(24)

From Bernoulli’s inequality, ${(x_{i} + 1)}^{1 - α} < 1 + (1 - α) x_{i}$ if x_i>0 and 0<α<1, and ${(x_{i} + 1)}^{1 - α} > 1 + (1 - α) x_{i}$ if x_i>0 and α>1, we have (24) is negative for all x_i>0, α>0 and α≠1.

In case of sigmoidal utilities,

\begin{align} \frac{d^{2} Ũ_{i} ({\tilde{x}}_{i})}{{d {\tilde{x}}_{i}}^{2}} = & \frac{d^{2}}{{d {\tilde{x}}_{i}}^{2}} (θ_{i} \ln (\frac{e^{a {\tilde{x}}_{i}}}{θ_{i} (k + e^{a {\tilde{x}}_{i}})})) \\ = & - \frac{θ_{i} k a^{2} x_{i}^{a}}{{(k + x_{i}^{a})}^{2}} < 0, \end{align}

(25)

for all k,θ_i,a,x_i>0.

Proof of Theorem 1

Define x^(τ)(0) to be the initial point of step τ, and x(τ)^∗ to be the stationary point of step τ. First of all, we show that x(τ)^∗ is obtainable in each outer-iteration. Give θ, it is known that problem (P3^τ) has a unique optimal solution because it is a strictly convex problem with a strictly concave objective. With the assumptions on the step size γ(t)>0, $lim_{t \to \infty} γ (t) = 0$ , and $\sum_{t = 1}^{\infty} γ (t) = \infty$ , the dual-based subgradient algorithm converges to the optimal point given θ^(τ) in each τ-step according to ([17], Prop.8.2.5).

We now prove the convergence of the algorithm. Denote $G (x) ≜ \ln (\sum_{i \in N} U_{i} (x_{i}^{(τ - 1) *})$ , the objective of (P2). The solution of (P4^τ) indeed increases monotonically G(x) in each outer step:

\begin{align} G (x^{(τ - 1) *}) & = \sum_{i \in N} Ũ_{i} ({\tilde{x}}_{i}^{(τ)} (0); θ_{i}^{(τ)}) (26) \\ \leq \sum_{i \in N} Ũ_{i} ({\tilde{x}}_{i}^{(τ) *}; θ_{i}^{(τ)}) (27) \\ \leq G (x^{(τ) *}) . (28) \end{align}

Equation (26) is obtained via the replacement of $θ_{i}^{(τ)} = \frac{U_{i} (x_{i}^{(τ - 1) *})}{\sum_{j \in N} U_{j} (x_{j}^{(τ - 1) *})}$ and ${\tilde{x}}^{(τ)} (0) = {\tilde{x}}^{(τ - 1) *}$ into the right-hand size. The inequality (27) is satisfied because x(τ)^∗is an optimal point of (P4^τ) as well as (P3^τ) given θ^(τ). The inequality (28) is from (13). On the other hand, G(x) is a continuous function, so, G(x) is bounded as x is bounded. Moreover, the sequence {G(x(τ)^∗),τ=1,2,…} monotonically increases, therefore, it converges ([17], Prop.A.3). Hence, the sequence ${\sum_{i \in N} U_{i} (x_{i}^{(τ) *}), τ = 1, 2, \dots}$ also converges.

We next prove that the stationary point of Algorithm 1 is also the KKT point of (P2). The Lagrangian of P3^τis given by

\begin{align} L_{3} (x, p, ξ; θ) = & \sum_{i \in N} θ_{i} \ln (\frac{U_{i} (x_{i})}{θ_{i}}) \\ - \sum_{i \in N} ξ_{i} (x_{i} - c_{i} p_{i} \prod_{j \neq i} (1 - p_{j})) . \end{align}

(29)

If $({\tilde{x}}^{*}, p^{*})$ is an optimal solution of (P4^τ), then (x^∗p^∗), where $x^{*} = e^{{\tilde{x}}^{*}}$ is an optimal solution, hence, a KKT point of (P3^τ)[17]. Let the vector ξ^∗ be the multiplier vector corresponding with (x^∗p^∗) of (P3^τ). We note that ξ^∗ is definitely not the multiplier vector corresponding with $({\tilde{x}}^{*}, p^{*})$ of (P4^τ). The KKT conditions of (P3^τ) are

\begin{align} \nabla_{x} L_{3} (x^{*}, p^{*}, ξ^{*}; θ^{*}) = 0 and \nabla_{p} L_{3} (x^{*}, p^{*}, ξ^{*}; θ^{*}) = 0; \end{align}

(30)

\begin{align} ξ_{i}^{*} (x_{i}^{*} - c_{i} p_{i}^{*} \prod_{j \neq i} (1 - p_{j}^{*})) = 0, \forall i \in N; \end{align}

(31)

\begin{align} x_{i}^{*} \leq c_{i} p_{i}^{*} \prod_{j \neq i} (1 - p_{j}^{*}), \forall i \in N; \end{align}

(32)

\begin{align} ξ^{*} ≽ 0 . \end{align}

(33)

We can easily verify that the point $({\tilde{x}}^{*}, p^{*}, ξ^{*})$ also satisfies (4)–(7) which are the KKT conditions of (P2) if we replace $θ_{i}^{*} = \frac{U_{i} (x_{i}^{*})}{\sum_{k \in N} U_{k} (x_{k}^{*})}$ and ξ^∗=ν^∗. Hence, the theorem is proved.

References

Lee JW, Chiang M, Calderbank A: Utility-optimal random-access control. IEEE Trans. Wirel. Commun 2007, 6(7):2741-2751.
Article Google Scholar
Mohsenian-Rad A, Huang J, Chiang M, Wong V: Utility-optimal random access without message passing. IEEE Trans. Wirel. Commun 2009, 8(3):1073-1079.
Article Google Scholar
Mohsenian-Rad A, Huang J, Chiang M, Wong V: Utility-optimal random access: reduced complexity, fast convergence, and robust performance. IEEE Trans. Wirel. Commun 2009, 8(2):898-911.
Article Google Scholar
Shenker S: Fundamental design issues for the future internet. IEEE J. Sel. Areas Commun 1995, 13(7):1176-1188.
Article Google Scholar
Lee JW, Mazumdar R, Shroff N: Non-convex optimization and rate control for multi-class services in the Internet. IEEE/ACM Trans. Netw 2005, 13(4):827-840.
Article Google Scholar
Hande P, Shengyu Z, Chiang M: Distributed rate allocation for inelastic flows. IEEE/ACM Trans. Netw 2007, 15(6):1240-1253.
Article Google Scholar
Fazel M, Chiang M: Network utility maximization with nonconcave utilities using sum-of-squares method. IEEE Conference on Decision and Control 2005, 1867-1874.
Google Scholar
Chiang M, Low S, Calderbank A, Doyle J: Layering as optimization decomposition: a mathematical theory of network architectures. Proc. IEEE 2007, 95(1):255-312.
Article Google Scholar
Cheung MH, Mohsenian-Rad A, Wong V, Schober R: Random access for elastic and inelastic traffic in wlans. IEEE Trans. Wirel. Commun 2010, 9(6):1861-1866.
Article Google Scholar
Marks BR, Wright GP: A general inner approximation algorithm for nonconvex mathematical programs. Oper. Res 1978, 26(4):681-683. 10.1287/opre.26.4.681
Article MathSciNet MATH Google Scholar
Chiang M, Tan CW, Palomar D, O’Neill D, Julian D: Power control by geometric programming. IEEE Trans. Wirel. Commun 2007, 6(7):2640-2651.
Article Google Scholar
Papandriopoulos J, Dey S, Evans J: Optimal and distributed protocols for cross-layer design of physical and transport layers in manets. IEEE/ACM Trans. Netw 2008, 16(6):1392-1405.
Article Google Scholar
Tran N, Hong CS: Joint rate and power control in wireless network: a novel successive approximations method. IEEE Commun. Lett 2010, 14(9):872-874.
Article Google Scholar
Vo PL, Tran NH, Hong CS: Joint rate and power control for elastic and inelastic traffic in multihop wireless networks. IEEE Globecom 2011, 1-5.
Google Scholar
Wang WH, Palaniswami M, Low SH: Application-oriented flow control: fundamentals, algorithms and fairness. IEEE/ACM Trans. Netw 2006, 14(6):1282-1291.
Article Google Scholar
Mo J, Walrand J: Fair end-to-end window-based congestion control. IEEE/ACM Trans. Netw 2000, 8(5):556-567. 10.1109/90.879343
Article Google Scholar
Bertsekas DP, Nedić A, Ozdaglar AE: Convex Analysis and Optimization. (Athena Scientific, Belmont, MA , 2003)
Nedić A, Ozdaglar A: Cooperative distributed multi-agent optimization. Convex Optimization in Signal Processing and Communications 2010. (Cambridge University Press, Cambridge, MA)
Google Scholar
Lee JW, Ao T, Jianwei H, Chiang M, Robert A: Reverse-engineering mac: a non-cooperative game model. IEEE J. Sel. Areas Commun 2007, 25(6):1135-1147.
Article Google Scholar
Bianchi G: Performance analysis of the ieee 802.11 distributed coordination function. IEEE J. Sel. Areas Commun 2000, 18(3):535-547.
Article Google Scholar

Download references

Acknowledgements

This research was supported by the KCC (Korea Communications Commission), Korea, under the R&D program supervised by the KCA (Korea Communications Agency) (KCA-2012-08-911-05-002).

Author information

Authors and Affiliations

Department of Computer Engineering, Kyung Hee University, Seoul, Korea
Phuong L Vo, Sungwon Lee & Choong Seon Hong

Authors

Phuong L Vo
View author publications
You can also search for this author in PubMed Google Scholar
Sungwon Lee
View author publications
You can also search for this author in PubMed Google Scholar
Choong Seon Hong
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Choong Seon Hong.

Additional information

Competing interests

Both authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Vo, P.L., Lee, S. & Hong, C.S. The random access NUM with multiclass traffic. J Wireless Com Network 2012, 242 (2012). https://doi.org/10.1186/1687-1499-2012-242

Download citation

Received: 14 February 2012
Accepted: 18 July 2012
Published: 06 August 2012
DOI: https://doi.org/10.1186/1687-1499-2012-242

The random access NUM with multiclass traffic

Abstract

Introduction

Design of the successive approximation algorithm

Network model

Approximation problem

Lemma 1

Proof

Lemma 2

Proof

Solution to the approximation problem and the algorithm

Algorithm 1 Successive approximation algorithm for multiclass traffic

Theorem 1

Proof

More general utility functions and analysis

Numerical results and discussions

Convergence of the algorithm

A heuristic implementation

Varying the initial point

Compare to the standard dual-based algorithm

Compare to binary exponential backoff MAC protocol

Conclusions

Appendix

Proof of Lemma 2

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords