An OFDMA resource allocation algorithm based on coalitional games
EURASIP Journal on Wireless Communications and Networking volume 2011, Article number: 46 (2011)
Abstract
This work investigates a fair adaptive resource management criterion (in terms of transmit powers and subchannel assignment) for the uplink of an orthogonal frequency-division multiple access network, populated by mobile users with constraints in terms of target data rates. The inherent optimization problem is tackled with the analytical tools of coalitional game theory, and a practical algorithm based on Markov modeling is introduced. The proposed scheme allows the mobile devices to fulfill their rate demands exactly with a minimum utilization of network resources. Simulation results show that the average number of operations of the proposed iterative algorithm is much lower than K · N, where N and K are the numbers of allocated subcarriers and of mobile terminals, respectively.
1. Introduction
The advent of high-definition entertainment services justifies the need for wideband, high-capacity wireless communication technologies that use the available bandwidth efficiently and provide data rates close to channel capacity [1]. Multicarrier channel access techniques such as orthogonal frequency-division multiple access (OFDMA) can be exploited to increase data rates, by dividing a frequency-selective broadband channel into a multitude of orthogonal narrowband flat-fading subchannels. An intelligent and scalable joint power and bandwidth allocation mechanism is crucial to ensure the quality of service (QoS) to the consumer at a reasonable cost [2].
The problem of subcarrier and power assignment in OFDMA has been extensively considered in the literature during the last few years. The proposed solutions mainly fall into two different categories: margin-adaptive and rate-adaptive methods. The goal of margin-adaptive schemes (such as [3]) is to minimize the total transmit power expenditure to achieve the (minimum) QoS requirements. Algorithms based on the rate-adaptive criterion (such as [4]) aim on the contrary at achieving the maximum data rate subject to different QoS constraints.
Most algorithms focus on the downlink scenario, with constraints on the total power transmitted by the radio base station. In the uplink scenario, the restrictions apply on an individual basis to each user terminal, and the simplest solution to maximize channel capacity of mobile devices under a power constraint is the water filling (WF) criterion [5]. In this case, channel capacity is increased when every subcarrier is assigned to the user with the best path gain, and the power is distributed according to the WF criterion. However, the WF solution is highly unfair, since only users with the best channel gains receive an acceptable channel capacity, while users with bad channel conditions achieve very low data rates. To derive fair resource allocation schemes, we resort to other techniques, described in the following.
Generally, a resource allocation algorithm can be either centralized or distributed. In centralized schemes like [6, 7], the algorithm is executed by a central unit (like the radio base station) that is aware of the channel conditions and the demands of all mobile terminals. In a distributed model (such as [8]), each mobile terminal tries to accomplish its own (minimum) QoS autonomously. In general, centralized techniques show better performance at the expense of a higher signaling overhead between terminals and central unit, and lower scalability. In the context of distributed algorithms, several cross-layer approaches were developed (e.g., [9, 10]) to reduce the total power consumption and to support different services and traffic classes in the downlink channel of an OFDMA system. Maximizing the power efficiency in uplink OFDMA has also been tackled in [7, 11, 12] using different formulations for the joint resource allocation problem.
Recently, coalitional game theory [13, 14] has been used to address the problem of fair resource allocation for OFDMA systems using either centralized or distributed algorithms. Roughly speaking, coalitional game theory studies the actions of a group of individual agents (such as mobile devices) that compete for a common resource (such as the wireless medium) by possibly finding synergies and forming coalitions with one another. Han et al. in [6] introduce a distributed algorithm for the OFDMA uplink based on the Nash bargaining solution (NBS) [13] and the Hungarian method [15] to maximize the overall system rate under individual power and rate constraints. The NBS guarantees that each user achieves its own demand, thus providing fairness to the resource allocation. The proposed algorithm shows a complexity , without considering the expensive computational load to solve the (convex) equations of the NBS. In [16], Chee et al. propose a centralized algorithm for the OFDMA downlink scenario based on the NBS and the Raiffa-Kalai-Smorodinsky bargaining solution (RBS) [17]. The NBS guarantees the minimum rate, while the RBS bounds the maximal rate achieved by each user. The results show a good performance only when the gap between the maximum and the minimum rate is large. The complexity of this algorithm is , again without considering the solution of the RBS. In [18], Noh proposes a distributed and iterative auction-based algorithm in the OFDMA uplink scenario with incomplete information. The experimental complexity of the algorithm is . However, the simulation parameters are not realistic (three users and subcarriers), and it is thus hard to estimate the computational complexity when using real-world network parameters.
All the mentioned schemes, which represent, to the authors' knowledge, the most relevant algorithms for OFDMA resource allocation with coalitional game theory, exhibit a good trade-off between overall system rate and fairness. Unfortunately, they also present a number of common problems: (i) most algorithms are based on nonlinear programming, which is computationally expensive and hardly scalable when considering thousands of subcarriers and tens of users. Thus, they are not suitable for implementation by network designers; (ii) although the resource apportionment turns out to be fair from the users' point of view, the achieved QoS may be much larger than demanded. This implies a waste of network resources from the service provider perspective, which has not been considered by previous works; and (iii) to reduce the computational burden, each subcarrier is allocated to a mobile terminal in an exclusive manner, although this may limit the number of simultaneous connections in the uplink channel.
In this work, we aim at fulfilling each user's QoS requirement in terms of target transmit rates exactly with the best utilization of the network resources, so as to satisfy both the users and the service provider. We also aim at designing a low-complexity algorithm that allows a centralized solution for the joint power and bandwidth allocation for OFDMA uplink channels to be achieved in a few steps using typical network parameters. In our approach, we allow every subcarrier to be possibly shared among more than one user, and we add a constraint on the maximum number of used subcarriers per terminal. This is achieved by dividing the available bandwidth into a number of disjoint blocks of consecutive subcarriers and forcing each terminal to use at most one subcarrier per block. The motivation of this is twofold: we wish to (i) increase the signal-to-interference-plus-noise ratio (SINR) on the used subcarriers, which also simplifies channel estimation; and (ii) exploit the frequency diversity to increase the performance of forward error correction techniques.
The remainder of the paper is structured as follows. Section 2 introduces the basics of coalitional game theory. In Section 3, we formulate the resource allocation problem in the uplink OFDMA scenario as a coalitional game, whereas in Section 4 we introduce a solution algorithm based on Markov modeling. Section 5 presents our experimental results, and some conclusions are drawn in Section 6.
Notation: For the reader's convenience, Section 7 reports the list of symbols used throughout the paper.
2. Brief review of coalitional game theory
A coalitional game is a game where groups of players (the coalitions), instead of single players, interact and compete [13, 14]. It is denoted as , where denotes the set of players and ν the coalition function. We also denote with x_{ m } the payoff of player m in , . If is a coalition (subset) of formed in , then its members get an overall payoff , with when . In a cooperative game with transferable utility (TU), the payoff of a coalition can be expressed by a real value.
A relevant issue in coalitional games is how the players make mutual binding agreements to form the coalition that provides them with the highest payoff. When the players are better off when staying together, they tend to form the grand coalition (i.e., the coalition of all the agents) [14]. The grand coalition is formed only if the game is superadditive:
Definition 1: A TU game is superadditive if
■
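For reference, the textbook form of the superadditivity condition can be written as follows. The symbols used here (player set \mathcal{M}, coalitions S_1 and S_2) are assumptions made for illustration and may differ from the notation of the original definition.

```latex
\nu(S_1 \cup S_2) \ge \nu(S_1) + \nu(S_2)
\quad \text{for all } S_1, S_2 \subseteq \mathcal{M} \text{ with } S_1 \cap S_2 = \emptyset .
```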
An important issue in a coalitional TU game is how to distribute the payoff of the grand coalition among agents. The fundamental solution is the core solution, defined as follows:
Definition 2: Let be the set of M players of the superadditive TU game , and let ν be the payoff of the game. The core of is the set
In other words, x ∈ ℝ ^{M} is a core of if and only if no payoff distribution can improve upon . ■
Equivalently, the core of a coalitional game is the set of all payoff vectors (i.e., all those vectors whose entries add up to the same amount, equal to the utility of the grand coalition) such that the sum of all payoffs of the players in any existing coalition is no smaller than the utility of the coalition.
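The verbal description of the core translates into a direct membership test. The following is a minimal brute-force sketch, not part of the original paper: the function name `in_core`, the callable `v`, and the toy game are all hypothetical, and the enumeration of every coalition is only practical for a handful of players.

```python
from itertools import combinations

def in_core(x, v, tol=1e-9):
    """Return True if payoff vector x lies in the core of the TU game v.

    x : list of payoffs, one per player (players are indexed 0..M-1)
    v : callable mapping a frozenset of player indices to the coalition value
    """
    players = range(len(x))
    grand = frozenset(players)
    # Efficiency: the payoffs must add up to the value of the grand coalition.
    if abs(sum(x) - v(grand)) > tol:
        return False
    # Coalitional rationality: no coalition may be paid less than its own value.
    for size in range(1, len(x)):
        for members in combinations(players, size):
            if sum(x[m] for m in members) < v(frozenset(members)) - tol:
                return False
    return True

# Toy three-player game (hypothetical): a coalition is worth the square of its size.
def toy_value(coalition):
    return len(coalition) ** 2

print(in_core([3, 3, 3], toy_value))  # equal split of the grand coalition value 9
print(in_core([7, 1, 1], toy_value))  # players 1 and 2 together receive 2 < 4
```

In the toy game the equal split passes every check, whereas the second vector fails because one pair of players could do better on its own.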
For a non-superadditive coalitional game, the network formation process does not lead the players to form a grand coalition. In this case, Definition 2 does not apply. Let us redefine the core set in a general (not necessarily superadditive) coalition formation TU game. Let denote a partition of the set wherein for i ≠ j, and for i = 1, ..., m, and let Ψ denote the set of all possible partitions ψ. Let us also define , such that and for i = 1, ..., m, as a family of (non-disjoint) coalitions.
Definition 3: A core apportionment x ∈ ℝ ^{M} is a payoff distribution with the following property:
Note that, if is superadditive, . ■
The core allocation set can be found through linear programming and can also be an empty set. We can study the nonemptiness of the core without explicitly solving the core equation, using the following lemma:
Lemma 1 [13]: A necessary and sufficient condition for the core of a TU game to be nonempty is that the TU game be balanced.
Definition 4: A superadditive TU game for a family of coalitions is balanced if, for any , the inequality
holds, where is a collection of numbers in [0, 1] (balanced weights) such that
with denoting the characteristic vector whose elements are
■
Definition 5: A non-superadditive TU game for a family of coalitions is balanced if, for every balanced collection of weights , and for any ,
■
3. Problem formulation
Let us consider the uplink of a single-cell infrastructure OFDMA system with total bandwidth B, subdivided in N subcarriers with frequency spacing Δf = B/N. The cell is populated by K mobile terminals, each terminal experiencing a complex-valued channel gain H_{ kn } on the nth subcarrier to the base station and having a data rate requirement R_{ k } (in bit/s). We assume that fulfilling such constraints simultaneously by all terminals is feasible.
To ensure fairness among users, the set of available subcarriers is grouped in D blocks of N/D contiguous subcarriers , with 1 ≤ d ≤ D, as shown in Figure 1. Each terminal is allowed to take at most one subcarrier per subblock. This is done to avoid assignments of contiguous blocks of subcarriers to users that may be in a deep-fading frequency range.
Our resource allocation strategy consists in finding a vector of transmit powers P_{ k }, where P_{ k } = [p_{k 1}, ..., p_{ kN } ], with p_{ kn } representing the power allocated by terminal k over the nth subcarrier, that allows the QoS constraint R_{ k } to be satisfied. We decouple the problem into the cascade of subchannel assignment and (subsequent) power allocation.
A. Subchannel assignment
We describe here two different options to perform this function:
1) Best-carrier assignment: For every subblock , every terminal is assigned its best subcarrier . The probability of assigning the same subcarrier to multiple mobile terminals is nonzero.
2) Vacant-carrier assignment: In a sequential manner, for every subblock , every terminal is assigned its best subcarrier . However, if k ≤ N/D, we would like to ensure exclusive use of each subcarrier to better exploit the available bandwidth B (i.e., to reduce the multiple access interference). So, if has already been assigned to some other terminal ℓ < k, then terminal k is assigned the best vacant (unassigned) subcarrier to within the channel coherence bandwidth. Clearly, this is not considered if k > N/D, so that terminal k is assigned its best subcarrier in the subblock anyway. Note that the ordering of has a negligible impact on system performance when N is, as usual, sufficiently high.
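The two assignment rules can be sketched as follows. This is a simplified illustration under stated assumptions rather than the authors' implementation: the function name `assign_subcarriers` and the `gains` layout are hypothetical, subcarriers and terminals are indexed from zero, N is taken to be a multiple of D, and the coherence-bandwidth restriction of the vacant-carrier rule is ignored.

```python
def assign_subcarriers(gains, n_blocks, vacant=True):
    """Assign one subcarrier per block to every terminal.

    gains[k][n] is the channel power gain |H_kn|^2 of terminal k on subcarrier n.
    With vacant=False every terminal takes its best subcarrier in each block
    (best-carrier rule). With vacant=True a terminal takes its best subcarrier
    among those still unassigned in the block, and falls back to its overall
    best one when the block has no vacant subcarrier left (vacant-carrier rule).
    """
    n_terminals, n_subcarriers = len(gains), len(gains[0])
    size = n_subcarriers // n_blocks  # N/D contiguous subcarriers per block
    assignment = [[] for _ in range(n_terminals)]
    for d in range(n_blocks):
        block = range(d * size, (d + 1) * size)
        taken = set()
        for k in range(n_terminals):  # terminals are served sequentially
            free = [n for n in block if n not in taken]
            candidates = free if (vacant and free) else block
            best = max(candidates, key=lambda n: gains[k][n])
            assignment[k].append(best)
            taken.add(best)
    return assignment

# Hypothetical gains: K = 3 terminals, N = 4 subcarriers, D = 2 blocks.
gains = [
    [0.9, 0.1, 0.2, 0.8],
    [0.7, 0.3, 0.6, 0.5],
    [0.4, 0.5, 0.1, 0.9],
]
best_carrier = assign_subcarriers(gains, 2, vacant=False)
vacant_carrier = assign_subcarriers(gains, 2, vacant=True)
```

With K = 3 > N/D = 2, the third terminal finds no vacant subcarrier and simply takes its best one in each block, which mirrors the k > N/D case described above.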
Both assignment strategies can be modified to address the case in which each terminal is allowed to have a different number of assigned subcarriers (different D_{ k } for each mobile terminal), based on its own data rate requirement R_{ k } . This can be done, for instance, by assigning the subcarriers on a terminal basis rather than on a subblock basis. This modification to the algorithm might lead to poor performance for particular network configurations, whereas the average performance in the long run proves to be experimentally equivalent to the case of an equal number of blocks D across all users. However, for the sake of simplicity, we consider the same D for all terminals from now on.
B. Power allocation
To derive a stable solution to the power allocation subproblem, we consider it as a coalitional game, in which each subchannel is identified as a player in the game. To model the coalitional game, we build K coalitions , to be assigned to the K terminals. Each coalition , , contains the D players . Note that (i) the members of each coalition are fixed, since one player cannot move from one coalition to another; and (ii) since a subcarrier can be shared among multiple users, there exist virtual copies of it belonging to different coalitions. For the sake of notation, we will identify with a generic any of the subcarriers assigned to terminal k. The strategy of each player is represented by the optimal power expenditure , where is the maximum power expenditure over subcarrier n by terminal k. Note that (i) if , p_{ kn } = 0; and (ii) if , we can also have p_{ kn } = 0, which means that the k th terminal does not transmit on the n th subcarrier, and it thus bears an actual number of active subcarriers .
The system under investigation aims at fulfilling the QoS requirement of every terminal k in terms of target rate R_{ k } . For simplicity, we estimate the achieved data rate as the Shannon capacity C_{ k } of terminal k that can be approached by using suitable channel coding techniques [19]:
where C_{ kn } is the Shannon capacity achieved by terminal k on its subcarrier :
Clearly, C_{ kn } = 0 if , since p_{ kn } = 0. If , C_{ kn } depends on the received SINR γ_{ kn } at the base station on subcarrier n, which is a function of the strategy (i.e., the transmit power) chosen by player n (i.e., one of the D subcarriers assigned to the kth terminal), of the transmit power of other terminals on the same subcarrier (if , p_{ jn } = 0), of the corresponding channel gains, and of the power of the additive white Gaussian noise (AWGN) . Note that, in an OFDMA system, there is no interference between adjacent subcarriers. Hence, C_{ kn } considers only the intra-subcarrier noise that occurs when the same subcarrier is shared by multiple terminals. Each player causes interference only to its virtual copies, i.e., to the players of other coalitions such that , with j ≠ k and for any d', 1 ≤ d' ≤ D.
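The dependence of the rate on the powers of all terminals can be made concrete with a short sketch. It assumes the standard forms γ_kn = |H_kn|² p_kn / (σ² + Σ_{j≠k} |H_jn|² p_jn) and C_kn = Δf log2(1 + γ_kn), summed over subcarriers; these expressions and the names `capacity`, `powers`, `gains`, and `noise_power` are assumptions made for illustration, since the corresponding equations are not reproduced in this text.

```python
import math

def capacity(k, powers, gains, noise_power, delta_f):
    """Shannon capacity (bit/s) of terminal k, summed over all subcarriers.

    powers[j][n] : transmit power of terminal j on subcarrier n (0 if unused)
    gains[j][n]  : channel power gain |H_jn|^2
    """
    total = 0.0
    for n, p_kn in enumerate(powers[k]):
        if p_kn == 0:
            continue  # an unused subcarrier contributes no rate
        # Only other terminals on the same subcarrier interfere.
        interference = sum(
            gains[j][n] * powers[j][n] for j in range(len(powers)) if j != k
        )
        sinr = gains[k][n] * p_kn / (noise_power + interference)
        total += delta_f * math.log2(1.0 + sinr)
    return total

# Two terminals sharing subcarrier 0 (hypothetical, normalized values).
shared = capacity(0, [[1.0, 0.0], [1.0, 0.0]], [[6.0, 1.0], [1.0, 1.0]], 1.0, 1.0)
```

In this toy case the SINR of terminal 0 is 6 / (1 + 1) = 3, so its normalized capacity is log2(4) = 2 bit/s per unit of Δf.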
The mobile terminals and the service provider are most satisfied when each mobile terminal k achieves its own data rate requirement exactly: C_{ k } = R_{ k } . In view of this goal, we can force all players in each coalition to select their strategies (i.e., the power allocation for terminal k over the available bandwidth B) so as to maximize a utility function for the k th coalition , defined as
where u(·) is the step function, with u(y) = 1 if y ≥ 0 and u(y) = 0 otherwise (see Figure 2). If C_{ k } = R_{ k } , , earns the highest possible payoff . If C_{ k } > R_{ k } , gets a positive payoff, whereas it obtains a negative payoff if C_{ k } < R_{ k } . The factor α is a finite positive constant (much) greater than one (i.e., 1 ≪ α < +∞) that ensures to be negative when C_{ k } < R_{ k } . This is expedient to let the players distinguish a capacity C_{ k } that is lower or higher than R_{ k } only by knowing their own coalition's payoff. Note that, in practice, +∞ can be represented by the largest representable number available (e.g., 2^{64} − 1) in a given simulation platform.
The payoff of each coalition is a real number and, in our formulation, the most important parameter is the gain of each coalition, whereas the outcome of each player does not matter at all. For instance, we can equally divide the payoff of the coalition among all players. Therefore, this game is a TU one [13, 14]. The specific shape of our utility function (10) is actually immaterial and was chosen to ensure fast convergence of the iterative algorithm that will be introduced later on. We could have considered any utility function that increases as the difference C_{ k } − R_{ k } moves from ±∞ to 0, just to make sure that, for any C_{ k } ≠ R_{ k } , each coalition has an incentive to move toward C_{ k } = R_{ k } .
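As an illustration of the qualitative properties listed above, the sketch below implements one possible payoff with the required shape. It is an assumed form, 1/|C_k − R_k| − α·u(R_k − C_k), chosen only because it is infinite at C_k = R_k, positive above the target, and negative below it for any shortfall larger than 1/α; it is not claimed to be the utility function (10) of the paper, and the name `coalition_payoff` is hypothetical.

```python
import math

def coalition_payoff(c_k, r_k, alpha=5000.0):
    """Illustrative coalition payoff (assumed form, see the lead-in).

    Returns +inf when the achieved rate c_k equals the target r_k, a positive
    value when c_k > r_k, and a value shifted down by alpha when c_k < r_k.
    """
    gap = c_k - r_k
    if gap == 0:
        return math.inf
    penalty = alpha if gap < 0 else 0.0  # step function u(R_k - C_k) times alpha
    return 1.0 / abs(gap) - penalty

print(coalition_payoff(100.0, 100.0))  # target met exactly
print(coalition_payoff(110.0, 100.0))  # overshoot: positive payoff
print(coalition_payoff(90.0, 100.0))   # shortfall: negative payoff
```

On either side of the target, the payoff grows as C_k approaches R_k, so a coalition can tell from its payoff alone both the direction and the benefit of a power adjustment.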
To provide further insight into the problem, we investigate now some properties of the proposed game . As a first step, we note that the players in with the utility function (10) do not tend to form the grand coalition. This is because every player cannot leave its coalition : the members of every coalition are fixed and do not change during the game. This may appear inappropriate to the notion of a coalitional game. However, our assumption is fairly common in economic problems like the study of a bargaining game between two corporations when each corporation has its own business branches. In this case, the members (branches) of each coalition (corporation) are fixed [20].
A relevant result for our game is the following:
Theorem 1: The core of the game with utility function (10) is not empty.
Proof: The number of coalitions and the number of players in each coalition are both fixed. Since each player belongs just to one coalition, the unique balanced collection of weights is . To conclude the proof, we must verify that . Since the target rates of all terminals are assumed to be feasible, then every coalition expects C_{ k } to approach R_{ k } . Therefore, every coalition is allowed to earn the highest possible payoff.■
In the following section, we will show how the fundamental properties of our game lead to a practical allocation algorithm.
4. The best-response algorithm
We are interested in answering questions like: How do the players set their proper transmit power amounts? Dynamic learning models provide a framework for analyzing the way the players may set their proper strategies. A player adopts a certain power amount if and only if this matches its coalition's interests, and this goal can be achieved through a best-response iterative algorithm [21] based on Markov modeling [22]. Each player takes its own decisions individually, myopically, and concurrently with the others, so as to lead its own coalition's payoff toward +∞ (C_{ k } = R_{ k } ). At each (discrete) time step of the algorithm, the autonomous players simultaneously adjust their transmit powers based on a model to increase the payoff of their own coalitions. Although this leads to interference when virtual copies of the same subcarriers simultaneously change their powers, we show that this dynamic myopic procedure guarantees the maximum payoff to each coalition.
The process starts at time step t = 0 with an arbitrary assignment of the transmit powers to all K · D players in the game (that are grouped in K coalitions with players with , 1 ≤ d ≤ D). At the generic time step t, our system is in the state ω^{t} = (ψ^{t} , ν^{t}), where ψ^{t} is the set , and contains the payoffs of the coalitions in ψ^{t} . The evolution of the Markov chain is then dictated by the strategy of the game. The strategy of each player is to find the best power amount that leads to an increase in the payoff of its own coalition . In practice, player decides whether to change its power allocation, making its coalition better off, or to keep transmitting at the same power level (e.g., when its coalition's payoff is infinite). The following pseudocode snippet shows how each player takes its decision during time step t.
In this algorithm, is the "trial" value of the current payoff of the coalition when the tentative power is adopted: it is computed with for all and for any j ≠ k, and . At each step of the update process, the power step is the particular outcome (value) of a random variable uniformly distributed between 0 and , with . As detailed in Section 5, optimal values for can be found in order to minimize the algorithm's computational load, based on experimental results. If , then C_{ k } < R_{ k } , and the best strategy for player is to increase its current transmit power so as to increase its coalition's payoff. As a result of the random power stepping, the tentative power is a random number in the interval . Player accepts this value if and only if the coalition payoff increases; otherwise, it keeps transmitting at its previous value. If , the best strategy for player is on the contrary to decrease , and thus the tentative (random) transmit power belongs to the interval . At the end of each time step t, the base station computes the payoff with updated power amounts. A uniformly distributed random power stepping is adopted to increase the probability of picking the (unknown) best adjustment value, and thus both to reduce the convergence time of the algorithm and to possibly minimize the overall power consumption. As is apparent, the convergence speed of the algorithm depends not only on the parameters of the network but also on the choice of the maximum update step .
As already stated, two copies and (the virtual copies of the same subcarrier n) may happen to wish to adjust their transmit powers in a conflicting (and thus incompatible) way. If we assume that each player just follows the decision rules listed in the pseudocode above, then the probability of conflicting decisions will be high. To reduce the occurrence of this event, we modify our algorithm by requesting each player not to update its transmit power at every step of the game with a probability λ ∈ [0, 1]. At each time step t, every player selects a random number uniformly distributed in [0, 1]. If , then the player applies the algorithm and (possibly) updates , otherwise (i.e., during time step t, it skips the update process, and the value of is maintained). If λ is close to 1, then the probability of conflicting decisions tends to 0, but the algorithm will have a large convergence time, since the probability of updates is low. In addition to the conflicts described above, another potentially disruptive condition may arise between different subcarriers belonging to the same coalition: if both (myopic) players simultaneously increase their powers and , it may occur that C_{ k } > R_{ k } . To optimize the update mechanism and to cope with both negative kinds of events, we could consider a variable and adaptive threshold for each virtual copy of the same subcarrier (each player). However, to reduce the complexity of the algorithm, we assume for all the players (i.e., virtual copies of the subcarriers). As detailed in Section 5, the optimal value of λ must be selected as a suitable trade-off. Note that the value of λ is common knowledge among the players at every step of the algorithm. Nevertheless, interference between concurrent, conflicting decisions may prevent the coalitions from achieving the expected payoff. If all coalitions earn less than in the previous time step, all players revert to the previous power amount for the next time step.
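The decision taken by a single player at one time step, as described in the text, can be sketched as follows. This is a hedged illustration and not the paper's pseudocode: the function `best_response_step`, its parameter names, and the callable `payoff_fn` (which returns the coalition payoff for a tentative power of this player, with all other powers held fixed) are assumptions introduced here.

```python
import random

def best_response_step(p, p_max, step_max, lam, payoff_fn, rng=random):
    """One myopic update of a single player's transmit power.

    p         : current power of the player
    p_max     : maximum power allowed on this subcarrier
    step_max  : largest power adjustment per step
    lam       : probability of skipping the update (reduces conflicts)
    payoff_fn : maps a tentative power to the coalition payoff (others fixed)
    """
    if rng.random() < lam:
        return p                       # skip the update during this time step
    current = payoff_fn(p)
    if current == float("inf"):
        return p                       # target rate already met, nothing to gain
    step = rng.uniform(0.0, step_max)  # random power step
    if current < 0:                    # C_k < R_k: try a higher power
        trial = min(p + step, p_max)
    else:                              # C_k > R_k: try a lower power
        trial = max(p - step, 0.0)
    # Accept the tentative power only if the coalition payoff increases.
    return trial if payoff_fn(trial) > current else p
```

A full run would apply this step to all K · D players at every time step and stop once every coalition payoff is infinite; with λ close to 1 most players sit out each step, which lowers the chance of conflicting updates at the cost of slower convergence.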
There may exist network configurations in which the iterative algorithm is not guaranteed to converge. To account for these situations, we set a maximum number of operations Θ, beyond which the algorithm is stopped and the sum of the users' demands is deemed infeasible.
We show now that our proposed algorithm reaches a stable state, which corresponds to the core apportionment of the game. We model the evolution of the algorithm as the output of a finite-state Markov chain with state space Ω = {ω = (ψ, ν) | ψ ∈ Ψ, ν ∈ ℝ ^{K} }. For all time steps t, ψ^{t} = ψ belongs to the subset of all possible disjoint coalitions Ψ with exactly D members, and remains fixed for the whole duration of the algorithm. The time evolution of the algorithm as a Markov chain is due to the time variability of ν^{t}, which depends on the power levels chosen by the players in the coalitions collected by ψ^{t} . We use this notation for the sake of convenience, to emphasize that ν^{t} is directly connected to ψ^{t} .
The Markov process asymptotically tends toward a stable coalition structure state, where no player has any incentive to change its power. In other words, all coalitions get their maximum payoffs. Our algorithm guarantees that when t → ∞, this Markov chain tends toward a singleton steady state with probability 1.
Definition 6 [22]: A set Φ ⊂ Ω is an ergodic set if, for any ω ∈ Φ and ω' ∉ Φ, the probability of reaching the state ω' starting from ω is zero. Once the Markov chain falls into a state belonging to an ergodic set, it never leaves that set, and it wavers between the states in that ergodic set from then on. The probability of reaching any state in the ergodic set is strictly positive. ■
Lemma 2 [22]: In any finite Markov chain, no matter which state the process starts from, the probability of ending up in an ergodic set tends to 1 as time tends to infinity.
Definition 7 [22]: Singleton ergodic sets are called absorbing states. ■
If Φ is an absorbing state and ω ∈ Φ, the probability of remaining in state ω when starting from ω is one. In fact, absorbing states individually represent points of equilibrium.
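The notions of absorbing state and of convergence to an ergodic set can be illustrated on a small chain. The transition matrix `P` below is hypothetical and unrelated to the state space Ω of the game; the helper names are likewise assumptions.

```python
def absorbing_states(transition):
    """Indices i with transition[i][i] == 1, i.e. states that are never left."""
    return [i for i, row in enumerate(transition) if row[i] == 1.0]

def distribution_after(transition, start, steps):
    """State distribution after a number of steps, starting from one state."""
    size = len(transition)
    dist = [0.0] * size
    dist[start] = 1.0
    for _ in range(steps):
        dist = [
            sum(dist[i] * transition[i][j] for i in range(size))
            for j in range(size)
        ]
    return dist

# Hypothetical three-state chain: state 2 is absorbing, states 0 and 1 are transient.
P = [
    [0.5, 0.4, 0.1],
    [0.2, 0.5, 0.3],
    [0.0, 0.0, 1.0],
]
final = distribution_after(P, start=0, steps=200)
```

Starting from state 0, almost all of the probability mass ends up in state 2 after enough steps, in line with Lemma 2.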
Lemma 3: The state ω = (ψ, ν) is an absorbing state of the best-response process if and only if
Proof: This condition ensures that no player has any incentive to change its power amount. If this condition is met, then no coalition can get a higher payoff by deviating from state ω = (ψ, ν). Since all the target rates are feasible, this condition is also necessary.
Theorem 2: The best-response process has at least one absorbing state.
Proof: Since the best-response algorithm is a Markov process, Lemma 2 ensures that the best-response process reaches an ergodic set Φ. To conclude the proof, it is enough to show that Φ is a singleton. Suppose that the number of states in the ergodic set is |Φ| > 1. Then, all players revise their strategies without conflicting decisions with a nonzero probability. As a consequence, the Markov process moves to a new state, in which all coalitions' payoffs are higher than those achieved in the previous state. This means that the probability of going back to the previous state is null, which contradicts the notion of an ergodic set. ■
Note that Theorem 2 does not ensure the uniqueness of the ergodic set in the best-response process. There may exist several different combinations of the power allocation with which the players reach a steady state. This means that the game possesses multiple equilibria. The major finding of Theorem 2 is that, given the way the players adjust their strategies, the best-response process leads to one of the steady states, in which no player has any incentive to revise its power allocation.
Theorem 3: The set of payoffs associated with an absorbing state of the best-response process coincides with the set of core allocations:
i. if ω = (ψ, ν) is an absorbing state, then ν is a core allocation;
ii. if ν is a core allocation, then all ω = (ψ, ν) are absorbing states.
Proof:
Part (i) Suppose ω = (ψ, ν) is an absorbing state but ν is not a core allocation. In this case, there exist some coalitions that can obtain a higher payoff. This is contradictory, since the game reaches an absorbing state when every coalition gets the maximum payoff.
Part (ii) If ν is a core allocation, then no coalition can earn more by letting its members change their powers. This implies that the state will not move to a new state, and thus the current state is absorbing. ■
Coalitional games aim at identifying the best coalitions of the agents and a fair distribution of the payoff among the agents. Interestingly, in this game the absorbing state coincides with one of the Nash equilibria [13] of the game. Suppose there are K = 2 mobiles connected to a base station with N = 1 subcarrier only. In this case, the M = K · N = 2 copies of the subcarrier, each constituting a coalition, are engaged in a 2 × 2 game. Every player has two strategies: either p_{ k } = 0 or . It is straightforward to verify that, in this game, a mixed (versus pure) Nash equilibrium exists which satisfies the stability of the static game. With due attention to the notation, we can extend this result to a general case.
Theorem 4: The set of absorbing states in the best-response process and the set of Nash equilibria of the static game are asymptotically (in the long run) equivalent.
Proof: Let us consider the coalitions in the best-response process as players in a static game. Lemma 2 ensures that this process reaches an ergodic set in the long run. According to Theorem 2, this set is a singleton, and thus its member is an absorbing state. Hence, no coalition (i.e., no player in the static game) has any incentive to revise its strategy. In static games, this is the definition of a Nash equilibrium. ■
We can now conclude that the absorbing state is an extension of the Nash equilibrium, since the coalitions bind agreements with each other as economic agents and earn a vector value rather than a real number. Once the coalitions reach the absorbing state, their payoff is the highest possible (+∞), and no coalition is willing to revise its current strategy. In general, as follows from Theorem 4, the Nash equilibrium of the game is Pareto-optimal (efficient), since no other strategy can achieve a payoff greater than +∞.
5. Numerical results
In this section, we evaluate the performance of the best-response algorithm presented in Section 4. We consider some cases with different numbers of mobile terminals, target data rates, and subcarriers, showing that our suggested scheme reaches a steady state after a few steps only. To increase the convergence speed of the algorithm, we introduce a tolerance parameter ε in our utility function, such that if |C_{ k } /R_{ k } − 1| < ε, then we assume that the payoff is +∞. We can possibly set an asymmetric range [ε_{1}, ε_{2}] such that ε_{1} ≤ (C_{ k } /R_{ k } − 1) ≤ ε_{2}, so as to favor solutions with C_{ k } > R_{ k } .
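The tolerance test amounts to a one-line check on the relative rate error. The sketch below uses the asymmetric range with hypothetical default values ε_1 = 0 and ε_2 = 0.01; the function name `rate_met` is an assumption.

```python
def rate_met(c_k, r_k, eps_low=0.0, eps_high=0.01):
    """True when eps_low <= C_k / R_k - 1 <= eps_high (payoff treated as +inf)."""
    return eps_low <= c_k / r_k - 1.0 <= eps_high

print(rate_met(201e3, 200e3))  # 0.5% above the target
print(rate_met(199e3, 200e3))  # below the target
print(rate_met(210e3, 200e3))  # 5% above the target
```

With ε_1 = 0, only rates at or slightly above the target count as converged, which is how the asymmetric range favors solutions with C_k > R_k.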
We consider the following parameters for our simulations: the maximum power of each terminal k on each subcarrier n is ; the power of the ambient AWGN on each subcarrier is , and the constant in (10) is α = 5000. We also set Θ = 10K · N as the stopping criterion of the iterative algorithm, where K and N depend on the network parameters of the simulation. The path coefficients H_{ kn } , corresponding to the frequency response of the multipath wireless channel at the carrier frequency n Δf, are computed using the 24-tap ITU modified vehicular-B channel model adopted by the IEEE 802.16m standard [23]. To account for the large-scale path loss, we assumed the terminals to be uniformly distributed between 3 and 100 m. Based on numerical optimizations, the parameter λ that reduces the probability of conflicting decisions among members of different coalitions for different numbers of terminals, subcarriers, and signal bandwidths is λ = 0.97.
The initial power allocation is and . This experimentally provides the minimal power consumption at the steady state, and in most cases the minimum number of steps of the algorithm.
Figure 3 reports the behavior of the achievable rate C_{ k } as a function of the time step t in a network with K = 10 terminals, N = 1024 subcarriers, and bandwidth B = 10 MHz using the vacant-carrier assignment scheme. The target rates, reported in Figure 3 with solid markers on the right axis, are assigned randomly to each terminal using a uniform distribution in the range [100, 250] kb/s. Further parameters are as follows: tolerance ε_{1} = 0, ε_{2} = 0.01, power update step , and number of subblocks D = 32. Numerical results show the convergence of C_{ k } to the respective target rates R_{ k } after 31 steps of the best-response algorithm.
In the remainder of this section, we will evaluate the average performance of our proposed algorithm in terms of power expenditure and computational burden using realistic system parameters and extensive simulation campaigns. Note that we are not able to implement the joint resource allocation techniques available in the literature and reviewed in Section 1, mainly due to the unfeasible algorithmic complexity when using tens of terminals, hundreds of subcarriers, and high data rates (on the order of Mb/s). As a consequence, in the following we will compare our measured results with the theoretical performance provided by the literature. The complexity figures given in Section 1 will be used as a reference to compare the performance of our proposed scheme in terms of computational demand.
Figures 4 and 5 report the simulation results obtained after 500 random realizations of a network with , N = 1024, B = 10 MHz, and ε_{1} = 0, ε_{2} = 0.04, again with the vacant-carrier assignment strategy. Solid lines represent the case , whereas dashed lines depict the case . Circles, squares, upper triangles, and lower triangles correspond to D = {8, 16, 32, 64}, respectively. Figure 4 shows the average normalized power expenditure ζ_{ k } at the steady state as a function of K, computed by averaging over all terminals. This serves as a measure for the average total power consumption normalized to the maximum power expenditure available to each terminal. As can be noticed, ζ_{ k } increases for K ≥ N/D, since the number of shared subcarriers increases and the terminals must spend more power to overcome the intra-subcarrier noise. Interestingly, the power expenditure of the proposed centralized algorithm shows higher efficiency than the distributed and cross-layer schemes available in the literature (e.g., see [7, 10, 12]). For instance, when considering 500 random realizations of a system with bandwidth B = 10 MHz and N = 1024 subcarriers, and using the vacant-carrier assignment model, we find that, in the case of a total sum-rate demand of 20 Mb/s (i.e., with a spectral efficiency of 2 b/s/Hz) and R_{ k } = R = 200 kb/s (i.e., K = 100 terminals), the maximum power consumption per user is 31 μW and the average power consumption of the system is 0.53 mW. In the multicell scenario of [7], the average power expenditure for each cell is 8 mW when the achievable data rate is 40 Mb/s. When considering the cross-layer algorithm proposed in [10], the average power expenditure per mobile terminal is 0.4 W with a maximal spectral efficiency of 2 b/s/Hz, whereas the average power expenditure per mobile terminal required by the energy-efficient techniques proposed in [12] is 0.4 and 1.2 W when the achieved data rate is equal to 40 and 140 kb/s, respectively.
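The figures quoted in this example follow from simple arithmetic, sketched below in Python. The helper names are illustrative assumptions; the numerical values (B = 10 MHz, N = 1024, R = 200 kb/s, sum-rate 20 Mb/s) are those given in the text, and D = 32 is one of the simulated values.

```python
def spectral_efficiency(num_terminals, rate_bps, bandwidth_hz):
    """Aggregate spectral efficiency in b/s/Hz for K equal-rate terminals."""
    return num_terminals * rate_bps / bandwidth_hz


def terminals_for_sum_rate(sum_rate_bps, rate_bps):
    """Number of equal-rate terminals K needed to reach a total sum-rate."""
    return round(sum_rate_bps / rate_bps)


def sharing_threshold(num_subcarriers, num_subblocks):
    """Value N/D; the text reports that the normalized power expenditure
    grows once K >= N/D, because subcarriers start to be shared."""
    return num_subcarriers // num_subblocks


if __name__ == "__main__":
    K = terminals_for_sum_rate(20e6, 200e3)
    print("K =", K)
    print("efficiency =", spectral_efficiency(K, 200e3, 10e6), "b/s/Hz")
    print("N/D =", sharing_threshold(1024, 32))
```

Under these assumptions, the 20 Mb/s demand corresponds to K = 100 terminals and 2 b/s/Hz, and with D = 32 the sharing threshold N/D equals 32 terminals.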
Figure 5 shows the computational burden of our algorithm expressed in terms of the average number of operations per terminal required to reach the steady state as a function of the number of terminals K, with the vacant-carrier assignment model. The number of operations is measured experimentally by counting the number of steps required by the subchannel assignment plus the total number of trials required to update the transmit power according to the best-response algorithm. As can be seen, the number of operations increases as D increases. This is because increasing D increases the number of players K · D, which in turn increases the number of conflicting decisions. Note that the proposed algorithm is able to provide a spectral efficiency higher than 1 b/s/Hz, which occurs, for instance, when we assume more than K = 50 users with rates R_{ k } = 200 kb/s over a bandwidth B = 10 MHz in the proposed scenario, with a linear computational burden at the base station using appropriate values for the parameters. In this particular example, a good trade-off between performance and complexity is D = {8, 16} and . Using these values, the number of operations of the proposed algorithm is experimentally lower than the product K · N, and so considerably lower than the number of operations required by the schemes available in the literature (e.g., see [6, 16, 18]). Our experiments with different data rate demands show that a smaller data rate also reduces the number of operations significantly. To further reduce the number of operations, we can also increase the tolerance parameters (e.g., with ε_{2} = 0.1, we experience a reduction in the number of operations on the order of 20-30%). Note also that the spectral efficiency achieved by the proposed fair resource allocation method, while showing a linear computational burden, is comparable with that provided by sum-rate maximizing algorithms (e.g., see [24]).
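To make the complexity reference concrete, the sketch below computes the product K · N and the stopping criterion Θ = 10K · N used in the simulations, and compares an operation count against K · N. The function names and the sample count in the usage are hypothetical placeholders, not results from the paper.

```python
def reference_operations(num_terminals, num_subcarriers):
    """Reference value K * N against which the measured count is compared."""
    return num_terminals * num_subcarriers


def stopping_threshold(num_terminals, num_subcarriers, factor=10):
    """Stopping criterion Theta = factor * K * N (factor = 10 in the text)."""
    return factor * num_terminals * num_subcarriers


def below_reference(measured_operations, num_terminals, num_subcarriers):
    """True when a measured operation count is lower than K * N."""
    return measured_operations < reference_operations(num_terminals,
                                                      num_subcarriers)


if __name__ == "__main__":
    K, N = 100, 1024
    print("K*N   =", reference_operations(K, N))
    print("Theta =", stopping_threshold(K, N))
    # 40_000 is an arbitrary placeholder, not a simulation result.
    print(below_reference(40_000, K, N))
```

For K = 100 and N = 1024, the reference value is 102400 operations and the iteration budget Θ is ten times larger, so a run that stays in the linear regime terminates well before the stopping criterion is triggered.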
In practice, a reasonable value for the maximum spectral efficiency achieved by the network in the region of linear computational load in all simulated scenarios (not reported here for the sake of brevity) is slightly lower than 2 b/s/Hz. For higher spectral efficiencies, no parameter selections can achieve the optimal resource allocation with linear complexity, and the number of operations appears to increase exponentially with the number of mobile terminals. However, note that the solutions can be found in most cases.
Figures 6 and 7 depict the simulation results of a network with , N = 1024, B = 10 MHz, and ε_{1} = 0, ε_{2} = 0.04 using the best-carrier assignment model. Solid lines represent the case whereas dashed lines depict the case . Squares, upper triangles, and lower triangles correspond to D = {16, 32, 64}, respectively. Figure 6 shows the average normalized power expenditure ζ_{ k } at the steady state as a function of K. As can be seen, the average power expenditure using the best-carrier assignment model is lower than with the vacant-carrier assignment, since the terminals having better channel conditions can spend less power.
A drawback of the best-carrier assignment is the increased number of operations required by the algorithm. Figure 7 shows the average number of operations per terminal required to reach the steady state as a function of the number of terminals K. As can be seen, the best-carrier assignment model has a higher computational burden than the vacant-carrier assignment model, since the number of shared subcarriers in the best-carrier assignment model is larger than in the vacant-carrier assignment, which increases the probability of interference between simultaneous decisions in the best-reply algorithm. Note that, using the best-carrier assignment model, the case D = 16 appears to be computationally expensive.
Figure 8 shows the average number of operations per terminal in the case of a network with parameters , N = 512, B = 10 MHz, and ε_{1} = 0, ε_{2} = 0.04 using the vacant-carrier assignment model. Solid and dashed lines represent the cases and , respectively, whereas circles, squares, upper triangles, and lower triangles depict D = {8, 16, 32, 64}, respectively. Even in this case, with more severe requirements in terms of target data rates, the number of operations is shown to be lower than the product K · N, again using spectral efficiencies higher than 1 b/s/Hz.
Finally, Figure 9 shows the average number of operations per terminal in the case of a network with parameters B = 20 MHz, N = 2048, R_{ k } = 2 Mb/s, ε_{1} = 0, and ε_{2} = 0.04 with the vacant-carrier assignment model. Solid and dashed lines represent the cases and , respectively, whereas circles, squares, and upper triangles depict D = {64, 128, 256}, respectively. The number of operations is again lower than K · N even in the case of high data rate demands.
As can be seen in Figures 5, 7, 8, and 9, due to the random behavior of the proposed algorithm, there is a strict relation between the average number of operations, the network parameters, and the algorithm parameters (including the channel assignment model). Depending on the parameter selection, the average number of operations exhibits different shapes (linear or exponential behavior). Thus, deriving an analytical complexity function for the best-response algorithm is hard. However, for all tested scenarios (not reported here for the sake of brevity), there exist properly tuned values (such as D, ) that provide an average number of operations for the proposed algorithm that is lower than the product K · N, even with high data rate demands as in the cases of Figures 8 and 9. The parameter that most affects the number of operations is D. Our experiments show that, for the optimal parameter selection (i.e., when the number of operations scales linearly with N and K), the average number of active subcarriers per terminal (i.e., those which bear p_{ kn } > 0) is approximately D/2 when the vacant-carrier model is adopted. This rule of thumb can be used as a design criterion for the proposed algorithm. Let us consider Figure 10, which reports the average number of active subcarriers for each mobile terminal as a function of the achieved rate R, in the linear computational load regime and using . Dashed and solid lines depict the cases B = {10, 20} MHz, respectively, whereas circles, squares, and upper triangles represent N = {512, 1024, 2048}, respectively. For instance, when B = 20 MHz, N = 512, and R = 500 kb/s, the average number of active subcarriers is 4. If we look back at Figure 8, we can verify that the linear number of operations can be achieved using D = 8. Note that the number of active subcarriers in the case of B = 10 MHz is higher than in the case B = 20 MHz, since the subcarrier spacing is halved.
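The rule of thumb above (average number of active subcarriers ≈ D/2) can be turned into a simple design helper, sketched here in Python. The function name is hypothetical, and rounding the result to the nearest power of two is an assumption made only because all values of D used in the simulations are powers of two; the paper does not prescribe this rounding.

```python
import math


def suggest_num_subblocks(avg_active_subcarriers, num_subcarriers):
    """Suggest D ~ 2 * (average active subcarriers per terminal).

    The result is rounded to the nearest power of two (an assumption,
    not part of the paper) and capped at the number of subcarriers N.
    """
    if avg_active_subcarriers <= 0:
        raise ValueError("avg_active_subcarriers must be positive")
    target = 2.0 * avg_active_subcarriers
    exponent = max(round(math.log2(target)), 0)
    return min(2 ** exponent, num_subcarriers)


if __name__ == "__main__":
    # Example from the text: 4 active subcarriers on average, N = 512.
    print(suggest_num_subblocks(4, 512))
```

For the example in the text (4 active subcarriers on average with N = 512), the helper returns D = 8, which matches the value identified from Figure 8.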
6. Conclusion
This paper described a computationally inexpensive centralized algorithm based on coalitional game theory to address the issue of fair optimal resource allocation (in terms of subcarrier assignment and power control) for the uplink of an infrastructure OFDMA wireless network. The scheme derived here is designed to meet the required data rates exactly, thus ensuring a fair performance apportionment to both users and service providers, with the best utilization of the network resources (minimum power expenditure and good spectral efficiency). The proposed algorithm can be analyzed as a Markov model that converges to an absorbing state with unitary probability in the long run. Our criterion also allows us to trade off system performance and computational burden of the algorithm, based on the number of subblocks used to apportion the available bandwidth and the data rate requirements of the terminals. Simulations show that the target rates are achieved with a low-complexity procedure, even in the case of populated networks and stringent QoS requirements. The (greedy) best-carrier assignment rule results in a higher number of operations but a lower power expenditure compared to the case with full use of the available subcarriers. The presented coalition-based strategy appears to be a good trade-off between computational load and power efficiency in comparison with the schemes available in the literature and achieves a spectral efficiency larger than 1 b/s/Hz.
7. List of symbols
characteristic vector of coalition
B OFDM signal bandwidth
C_{ k } Shannon capacity achieved by terminal k
C_{ kn } Shannon capacity achieved by terminal k on the carrier n
d generic index for carrier subblock
D number of carrier subblocks
number of subcarriers assigned to terminal k
family of coalitions
coalitional game
H_{ kn } channel response of the channel between terminal k and the base station over carrier n
k generic index for a terminal
K number of terminals
set of terminals
m generic index for a player
M number of players
set of players
n generic index for a subcarrier
N number of carriers
set of carriers
set of carriers of the d th subblock
p_{ kn } transmit power of terminal k over carrier n
transmit power of terminal k over carrier n at time step t
tentative transmit power of terminal k over carrier n
previous tentative transmit power of terminal k over carrier n
maximum transmit power of terminal k over carrier n
maximum tentative transmit power of terminal k over carrier n
R_{ k } target data rate of terminal k
coalition (subset) of players
coalition at time step t
t generic time step
generic coalition of players
x_{ m } payoff of player m
x payoff distribution across players
α generic positive constant
γ_{ kn } received signal-to-interference-plus-noise ratio of terminal k over carrier n
Δf carrier spacing
power step to update the tentative transmit power of terminal k over carrier n
maximum power step to update the tentative transmit power of terminal k over carrier n
ε tolerance parameter
Θ stopping criterion of the iterative algorithm
λ probability of transmit power update
balanced weight of coalition
ν coalition utility function
ν set of coalition utilities
ν^{t} set of coalition utilities at time step t
uniformly distributed random variable
AWGN power
Φ ergodic set
ψ set of disjoint coalitions
ψ^{t} set of disjoint coalitions at time step t
Ψ set of all possible ψ
ω state of the Markov chain
ω^{t} state of the Markov chain at time step t
Ω state space of the Markov chain
ζ_{ k } normalized average power expenditure of terminal k
Abbreviations
AWGN: additive white Gaussian noise
NBS: Nash bargaining solution
OFDMA: orthogonal frequency-division multiple access
QoS: quality of service
RBS: Raiffa-Kalai-Smorodinsky bargaining solution
SINR: signal-to-interference-plus-noise ratio
TU: transferable utility
WF: water-filling.
References
Goldsmith AJ: Wireless Communications. Cambridge University Press, Cambridge; 2005.
Bahai AR, Saltzberg BR, Ergen M: Multi-Carrier Digital Communications: Theory and Applications of OFDM. 2nd edition. Springer, Boston; 2004.
Hassan N, Assaad M: Low complexity margin adaptive resource allocation in downlink MIMO-OFDMA system. IEEE Trans Wireless Commun 2009, 8(7):3365-3371.
Hui J, Zhou Y: Enhanced rate adaptive resource allocation scheme in downlink OFDMA system. Proceedings of the IEEE Vehicular Technology Conference (VTC), Melbourne, Australia 2006, 2464-2468.
Peng T, Wang W, Lu Q, Wang W: Subcarrier allocation based on water-filling level in OFDMA-based cognitive radio networks. Proceedings of the International Conference on Wireless Communications, Networking and Mobile Computing, Shanghai, China 2007, 196-199.
Han Z, Liu KJR: Fair multiuser channel allocation for OFDMA networks using Nash bargaining solutions and coalitions. IEEE Trans Commun 2005, 53(8):1366-1376. 10.1109/TCOMM.2005.852826
Han Z, Ji Z, Liu KJR: Non-cooperative resource competition game by virtual referee in multi-cell OFDMA networks. IEEE J Select Areas Commun 2007, 25(6):1079-1090.
Yu D, Wu D, Cai Y, Zhong W: Power allocation based on power efficiency in uplink OFDMA systems: a game theoretic approach. Proceedings of the IEEE International Conference on Communication Systems (ICCS), Singapore, Singapore 2008, 92-97.
Tralli V, Perez-Neira A, Henarejos P: A cross-layer scheduling strategy for the downlink of a MIMO-OFDMA system with heterogeneous traffic. Proceedings of the Newcom++/Cost2100 Workshop, Vienna, Austria 2009.
Dainelli G, Moretti M, Zalonis A, Dimitriou N: Distributed subcarrier allocation schemes for OFDMA multiple access systems. Proceedings of the Newcom++/Cost2100 Workshop, Vienna, Austria 2009.
Zhang YJ, Letaief KB: An efficient resource-allocation scheme for spatial multiuser access in MIMO/OFDM systems. IEEE Trans Commun 2005, 53(1):107-116. 10.1109/TCOMM.2004.840666
Yang L, He Z, Wang L, Wu W: A new utility based subcarrier and power joint allocation scheme in uplink OFDMA systems. Proceedings of the IEEE Vehicular Technology Conference (VTC), Dublin, Ireland 2007, 2756-2760.
Osborne MJ, Rubinstein A: A Course in Game Theory. MIT Press, Cambridge; 1994.
Peleg B, Sudhölter P: Introduction to the Theory of Cooperative Games. 2nd edition. Springer, Berlin; 2007.
Burkard R, Dell'Amico M, Martello S: Assignment Problems. Society for Industrial and Applied Mathematics (SIAM), Philadelphia; 2009.
Chee TK, Lim CC, Choi J: A cooperative game theoretic framework for resource allocation in OFDMA systems. Proceedings of the IEEE International Conference on Communication Systems (ICCS), Singapore, Singapore 2006.
Kalai E, Smorodinsky M: Other solutions to Nash's bargaining problem. Econometrica 1975, 4: 513-518.
Noh W: A distributed resource control for fairness in OFDMA systems: English-auction game with imperfect information. Proceedings of the IEEE Global Communications Conference (GLOBECOM), New Orleans, LA 2008, 1-6.
Biglieri E, Proakis J, Shitz SS: Fading channels: information-theoretic and communications aspects. IEEE Trans Inf Theory 1998, 44(6):2619-2692. 10.1109/18.720551
Osborne MJ, Rubinstein A: Bargaining and Markets. Academic Press, San Diego; 1990.
Agastya M: Perturbed adaptive dynamics in coalition form games. J Econ Theory 1999, 89(2):207-233. 10.1006/jeth.1999.2574
Kemeny JG, Snell JL: Finite Markov Chains. 2nd edition. Springer, Berlin; 1976.
IEEE 802.16 Broadband Wireless Access Working Group: IEEE 802.16m Evaluation Methodology Document (EMD). Technical Report IEEE 802.16m-08/004r5; 2009.
Wang T, Vandendorpe L: Resource allocation for maximizing weighted sum min-rate in downlink cellular OFDMA systems. Proceedings of the IEEE International Conference on Communications, Cape Town, South Africa 2010.
Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Shams, F., Bacci, G. & Luise, M. An OFDMA resource allocation algorithm based on coalitional games. J Wireless Com Network 2011, 46 (2011). https://doi.org/10.1186/16871499201146
DOI: https://doi.org/10.1186/16871499201146
Keywords
 Payoff
 Power Allocation
 Mobile Terminal
 Grand Coalition
 Coalitional Game