 Research
 Open Access
 Published:
Crosslayer distributed power control: a repeated game formulation to improve the sum energy efficiency
EURASIP Journal on Wireless Communications and Networking volume 2015, Article number: 257 (2015)
Abstract
The main objective of this work is to improve the energy efficiency (EE) of a multiple access channel (MAC) system, through power control, in a distributed manner. In contrast with many existing works on energyefficient power control, which ignore the possible presence of a queue at the transmitter, we consider a new generalized crosslayer EE metric. This approach is relevant when the transmitters have a nonzero energy cost even when the radiated power is zero and takes into account the presence of a finite packet buffer and packet arrival at the transmitter. As the Nash equilibrium (NE) is an energyinefficient solution, the present work aims at overcoming this deficit by improving the global energy efficiency. Indeed, as the considered system has multiple agencies each with their own interest, the performance metric reflecting the individual interest of each decisionmaker is the global energy efficiency defined then as the sum over individual energy efficiencies. Repeated games (RG) are investigated through the study of two dynamic games (finite RG and discounted RG), whose equilibrium is defined when introducing a new operating point (OP), Paretodominating the NE and relying only on individual channel state information (CSI). Accordingly, closedform expressions of the minimum number of stages of the game for finite RG (FRG) and the maximum discount factor of the discounted RG (DRG) were established. Our contributions consist of improving the system performances in terms of powers and utilities when using the new OP compared to the NE and the Nash bargaining (NB) solution. Moreover, the crosslayer model in the RG formulation leads to achieving a shorter minimum number of stages in the FRG even for higher number of users. In addition, the social welfare (sum of utilities) in the DRG decreases slightly with the crosslayer model when the number of users increases while it is reduced considerably with the Goodman model. Finally, we show that in real systems with random packet arrivals, the crosslayer power control algorithm outperforms the Goodman algorithm.
Introduction
Motivation
The design and management of green wireless networks [1–3] has become increasingly important for modern wireless networks, in particular, to manage operating costs. Futuristic (beyond 5G) cellular networks face the dual challenges of being able to respond to the explosion of data rates and also to manage network energy consumption. Due to the limited spectrum and large number of active users in modern networks, energyefficient distributed power control is an important issue. Sensor networks, which have multiple sensors sending information to a common receiver with a limited energy capacity have also recently surged in popularity. Energy minimization in sensor networks has been analyzed in many recent works [4–6].
Several of the above described systems have some common features:

1.
Multiple transmitters connected to a common receiver

2.
Lack of centralization or coordination, i.e., a distributed and decentralized network

3.
Relevance of minimizing energy consumption or maximizing energy efficiency (EE)

4.
Transmitters that have arbitrary data transmission
These features are present in many modern systems like a sensor network which has multiple sensors with limited energy connected in a distributed manner to a common receiver. These sensors do not always have information to transmit, resulting in sporadic data transmission. Another example would be several mobile devices connected to a hotspot (via wifi or even Bluetooth). Due to these features of the network, intertransmitter communication is not possible and the transmitters are independent decisionmakers. Therefore, implementing frequency or time division multiple access becomes harder, and a MAC protocol (with single carrier) is often the preferred or natural method of channel access.
Novelty
In many existing works, both networkcentric and usercentric approaches have been studied. In a networkcentric approach, the global energy efficiency (GEE) is defined as the ratio between the system benefit (sumthroughput or sumrate) and the total cost in terms of consumed power [7, 8]. However, when targeting an efficient solution in an usercentric problem, the GEE becomes not ideal as it has no significance to any of the decisionmakers. In this case, other metrics are required to reflect the individual interest of each decisionmaker. Therefore, we redefine the GEE to be the sum over individual energy efficiencies as a suitable metric of interest [9].
The major novelty of this work is in improving the sum of energy efficiencies for a communication system with all the listed features above. In such a decentralized and distributed network, as each transmitter operates independently, implementing a frequency division or a time division multiple access is not trivial. Therefore, we are interested in looking at a MAC system where all transmitters operate on the same band. Additionally, EE will be our preferred metric due to its relevance. This metric has been defined in [10] as the ratio between the average net data rate and the transmitted power. In [11, 12], the total power consumed by the transmitter was taken into account in the EE expression to design distributed power control which is one of the most wellknown techniques for improving EE. However, many of the works available on energyefficient power control consider the EE defined in [10] where the possible presence of a queue at the transmitter is ignored. In contrast with the existing works, we consider a new generalized EE based on a crosslayer approach developed recently in [13, 14]. This approach is important since it takes into account: (1) a fixed cost in terms of power namely, a cost which does not depend on the radiated power and (2) the presence of a finite packet buffer and sporadic packet arrival at the transmitter (which corresponds to including the fourth feature mentioned above). Although providing a more general model, the distributed system in [14] may operate at a point which is energyinefficient. Indeed, the point at which the system operates is a Nash equilibrium (NE) of a certain noncooperative static game. The present work aims at filling this gap by not only considering a crosslayer approach of energyefficient power control but also improving the system performance in terms of sum of energy efficiencies.
State of the art
Nash bargaining (NB) solution in a cooperative game can provide a possible efficient solution concept for the problem of interest as it is Paretoefficient. However, it generally requires global channel state information (CSI) [15]. Therefore, we are interested in improving the average performance of the system by considering longterm utilities. We focus then on repeated games (RG) where repetition allows efficient equilibrium points to be implemented. Unlike static games which are played in one shot, RG are a special case of dynamic games which consider a cooperation plan and consist in repeating at each step the same static game and the utilities result from averaging the static game utilities over time [16]. There are two relevant dynamic RG models: finite (FRG) and discounted (DRG). The FRG is defined when the number of stages during which the players interact is finite. For the DRG model, the discount factor is seen as the stopping probability at each stage [17]. The power control problem using the classic EE developed by Goodman et al in [10] has been solved with RG only in [18] where authors developed an operating point (OP) relying on individual CSI and showed that RG lead to efficient distributed solution. Here, we investigate the power control problem of a MAC system by referring to RG (finite and discounted) where the utility function is based on a crosslayer approach. Accordingly, we contribute to:

1.
Determine the closedform expressions of the minimum number of stages for the FRG and the maximum discount factor for the DRG. These two parameters identify the two considered RG.

2.
Determine a distributed solution Paretodominating the NE and improving the system performances in terms of powers and utilities compared not only to the NE but also to the NB solution even for high number of users.

3.
Show that the RG formulation when using the new EE and the new OP leads to significant gains in terms of social welfare (sum of utilities of all the users) compared to the NE

4.
Show that the following aspects of the crosslayer model improve considerably the system performances when comparing to the Goodman model even for large number of users:

The minimum number of stages in the crosslayer EE model can always be shorter than the minimum number of stages in the Goodman EE formulation.

The social welfare for the DRG in the crosslayer model decreases slightly when the number of users increases while it decreases considerably in the Goodman model.


5.
Show that in real systems with random packet arrivals, the crosslayer power control algorithm outperforms the Goodman algorithm and then the new OP with the crosslayer approach is more efficient.
Structure
This paper is structured as follows. In section 2, we define the system model under study, introduce the generalized EE metric, and define the noncooperative static game. This is followed (section 3) by the study of the NB solution. In section 4, we introduce the new OP, give the formulation of both RG models (FRG and DRG), and determine the closedform expressions of the minimum number of stages and the maximum discount factor as well. Numerical results are presented in section 5, and finally, we draw several concluding remarks.
Problem statement
System model
We consider a MAC system composed of N small transmitters communicating with a receiver. The ith transmitter transmits a signal x _{ i } with a power \(p_{i} \in \left [0, P_{i}^{\max }\right ]\) where \(P_{i}^{\max }\) is the maximum transmit power assumed identical for all users \(\left (P_{i}^{\max }=P^{\max }\right)\). The additive noise, which is the same for all users, is an additive white Gaussian noise denoted as n with zero mean and variance σ ^{2}. We assume that the users transmit their data over block fading channels. The channel gain between user i and the receiver is given by g _{ i }. Thus, the baseband signal received at the receiver is written as:
Therefore, the resulting signaltointerferenceplusnoise ratio (SINR) γ _{ i } corresponding to the ith transmitter is given by [18, 19]:
where p=(p _{1},p _{2},…,p _{ N }) defines the power vector of all users and can be written as p=(p _{ i },p _{−i }) with p _{−i }=(p _{1},…,p _{ i−1},p _{ i+1},…,p _{ N }).
The purpose of this work is to determine how each user is going to control its power in an optimum way. Game theory, as a powerful mathematical tool, helps to solve such an optimization problem where the utility function is the EE which is a function of the users powers. Since the system under study has multiple agencies each with individual interest, the sum over individual energy efficiencies will be considered as the performance metric reflecting the individual interest of each decisionmaker.
Energy efficiency metric
The EE is defined in [10] as a ratio of the net data rate to the transmit power level and is given by:
where R is the transmission rate (in bit/s) while f :[0,+∞)→[ 0,1] denotes the efficiency function which is sigmoidal and corresponds to the packet success rate verifying f(0)=0 and \(\lim \limits _{x \rightarrow +\infty } f(x)=1\). Authors of [11] were the first to consider a total transmission cost of the type radiated power (p _{ i }) + consumed power (b) to design distributed power control strategies for multiple access channels [13, 14] as follows:
In [13, 14], a more generalized EE metric has been developed by considering a packet arrival process following a Bernoulli process with a constant probability q and a finite memory buffer of size K. The new EE expression is given by:
where the function Φ identifies the packet loss due to both bad channel conditions and the finiteness of the packet buffer and is expressed as follows:
where Π _{ K }(γ _{ i }) is the stationary probability that the buffer is full and is given by:
with:
It is important to highlight that this new generalized EE given by (5) includes the conventional case of (4) when making q→1.
Static crosslayer power control game
The static crosslayer power control game is a noncooperative game which can be defined as a strategic form game [17].
Definition 1.
The game is defined by the ordered triplet \(\mathcal {G} = \left (\mathcal {N}, (\mathcal {S}_{i})_{i \in \mathcal {N}}, \allowbreak (u_{i})_{i \in \mathcal {N}} \right)\) where \(\mathcal {N}\)is the set of players (the N transmitters), \(\mathcal {S}_{1},\ldots,\mathcal {S}_{N}\) are the corresponding sets of strategies with \(\mathcal {S}_{i}=\left [0, P_{i}^{\max }\right ]\) and u _{1},…,u _{ N } are the utility functions given by:
where χ _{ i }(p) is given by Eq. (5).
In a noncooperative game, each user (player) seeks to maximize selfishly its individual utility function. The optimum solution results then by setting ∂ u _{ i }/∂ p _{ i } to zero as follows:
where \(\gamma _{i}^{\prime } = {\frac {\mathrm {d} \gamma _{i}}{\mathrm {d} p_{i}}} = {\frac {\gamma _{i}}{p_{i}}}, f^{\prime } = {\frac {\mathrm {d} f}{\mathrm {d} \gamma _{i}} }\), and \(\Phi ^{\prime } = {\frac {\mathrm {d} \Phi }{\mathrm {d} \gamma _{i}}}\).
Authors in [13, 14] proved that such equation has a unique best response. In the game \(\mathcal {G}\), this best response defines the NE and is denoted as \(\mathbf {p}^{\ast }=\left (p_{1}^{\ast },p_{2}^{\ast },\ldots,p_{N}^{\ast }\right)\). However, the NE solution is not always Paretoefficient for many scenarios. We highlight in Fig. 1 that the NE is not on the Pareto frontier (the outer boundary of the achievable utilities region). Therefore, we are motivated to design a more efficient solution than the NE. For this, as a first step we investigate the NB solution.
Nash bargaining solution
Due to the inefficiency of the NE, a Paretoefficient solution can be achieved by introducing the cooperation between the players. The resulting solution is called NB solution whose determination requires two elements [20]:

The region of achievable utilities formed by the set of the feasible utilities of all the players should be compact and convex [21]

The threat point is defined by the NE of the oneshot game [22]
Compactness and convexity of the achievable utilities region
We denote \(\mathcal {R}\) the achievable utilities region defined as follows:
As the strategies sets \(\mathcal {S}_{1},\ldots,\mathcal {S}_{N}\) are compact since \(\mathcal {S}_{i} = \left [0,P_{i}^{\max }\right ]\) and the utility function u _{ i } is continuous, the region \(\mathcal {R}\) is compact for a given channel configuration [22]. Since it is generally not convex, timesharing has been a solution to convexify it. In order to illustrate the main idea of this technique applied to our problem, let us consider a system of two users [22]. During a time fraction τ, the users use the powers (p _{1},p _{2}) to have utilities (u _{1},u _{2}). During a time fraction (1−τ), they use another combination of powers \((p_{1}^{\prime },p_{2}^{\prime })\) to have \((u_{1}^{\prime },u_{2}^{\prime })\) [15, 22]. Thus, the new achievable utilities region (for the twouser system) is:
We define \(\bar {\mathcal {R}}^{\ast }\) the Pareto boundary (the outer frontier) of the convex hull of \(\bar {\mathcal {R}}\). Figure 1 shows the convexified achievable utilities region with the NE point, the NB solution, and the Nash curve (both will be defined next).
Existence and uniqueness of the NB solution
Let \(\mathcal {R}^{NB}\) define the improvement region of utilities versus the NE, and it is given by:
The NB solution belongs to the region \(\mathcal {R}^{NB}\). Here, in the power control game \(\mathcal {G}\), there exists a unique NB solution denoted as \(\mathbf {u}^{NB}=\left (u_{1}^{NB},u_{2}^{NB},\ldots,u_{N}^{NB}\right)\) and is given by [21]:
Since the NE can always be reached and the achievable utility region is a compact convex set, the NB solution exists. It is unique since it verifies certain axioms: individual rationality and feasibility, independence of irrelevant alternatives, symmetry, Pareto optimality (efficiency), and independence of linear transformations [21]. The NB solution results from the intersection of the Pareto boundary (\(\bar {\mathcal {R}}^{\ast }\)) with the Nash curve whose form is \(m=\prod _{i=1}^{N}\left (u_{i}u_{i}^{NE}\right)\) where m is a constant chosen such that there is precisely one intersection point [22] (see Fig. 1). Although the NB solution is Paretoefficient, it generally requires global CSI at the transmitters due to the Nash product (m) introducing all the users utilities [15]. For this reason, we are looking for another efficient solution through the study of the dynamic RG.
Repeated game formulation
RG consist in their standard formulation, in repeating the same static game at every time instance and the players seek to maximize their utility averaged over the whole game duration [16]. Repetition allows efficient equilibrium points to be implemented and which can be predicted from the oneshot static game according to the Folk theorem, which provides the set of possible Nash equilibria of the repeated game [18, 23]. In a repeated game, certain agreements between players on a common cooperation plan and a punishment policy can be implemented to punish the deviators [16]. In what follows, we introduce the new OP and characterize the two RG models.
New OP
The new OP consists in setting p _{ i }g _{ i }^{2} to a constant α which is unique when maximizing the expected sum utility over all the channel states. It is given by [19]:
The power of the ith player is then deduced as follows:
The new OP Paretodominates the NE and relies on individual CSI at the transmitter. In order to implement a cooperation plan between the players, we assume, in addition to the individual CSI assumption, that every player is able to know the power of the received signal at each game stage, which is denoted by [18]:
When assuming that p _{ i }g _{ i }^{2} is set to the constant α, the received signal power can be written as:
Accordingly, each transmitter needs only its individual SINR and the constant α (depending only on p _{ i } and g _{ i }^{2}) to establish the received signal power P _{ y }. We assume that the data transmission is over block fading channels and that channel gains g _{ i }^{2} lie in a compact set \(\left [\nu _{i}^{\min }, \nu _{i}^{\max }\right ]\) [18]. Thus, the interval to which the received signal power belongs is \({\Delta = \left [\sigma ^{2}, \sigma ^{2} + \sum _{\substack {i=1}}^{N} p_{i}\nu _{i}^{\max }\right ]}\). Since the players detect a variation of the received signal power, a deviation from the cooperation plan has occurred. Indeed, when playing at the new OP, the received signal power is constant and equal to \({\frac {\sigma ^{2}(\tilde {\gamma }\,+\,1)}{1\,\,(N\,\,1)\tilde {\gamma }}}\). Consequently, when any player deviates from the new OP, the latter quantity changes and the deviation is then detected [18].
Repeated game characterization
A RG is a longterm interaction game where players react to past experience by taking into account what happened in all previous stages and make decisions about their future choices [24, 25]. The resulting payoff is an average over all the stage payoffs. We denote by t, the game stage which corresponds to the instant in which all players choose their actions. Accordingly, a profile of actions can be defined for all players as p(t)=(p _{1}(t),p _{2}(t),…,p _{ N }(t)). A history h(t) of player i at time t is the pair of vectors (P _{ y,t },p _{ i,t })=(P _{ y }(1),P _{ y }(2),…,P _{ y }(t−1),p _{ i }(1),p _{ i }(2),…,p _{ i }(t−1)) and which lies in the set \(\mathcal {H}_{t} = \left (\Delta ^{t1},\mathcal {P}_{i}^{t1}\right)\) with \(\mathcal {P}_{i} = \left [0,P_{i}^{\max }\right ] = \left [0,P^{\max }\right ]\) (as all the users have the same maximum power) [18]. Histories are fundamental in RG as they allow players to coordinate their behavior at each stage so that previous histories are known by all the players [25]. We denote δ _{ i,t } the pure strategy of the ith player. It defines the action to select after each history [18, 25]:
In RG literature, there are two important models [17]:

The finite RG where the number of stages of the game (denoted as T≥1) during which the players interact is finite

The discounted RG where the discount factor (denoted as λ∈] 0,1[) is seen as the stopping probability at each stage
The utility function of each player results from averaging over the instantaneous utilities over all the game stages in the FRG while it is a geometric average of the instantaneous utilities during the game stages in the DRG [18, 25, 26]. We denote δ=(δ _{1},δ _{2},…,δ _{ N }) the joint strategy of all players.
Definition 2.
A joint strategy δ satisfies the equilibrium condition for the repeated game defined by \(\left (\mathcal {N}, (\mathcal {S}_{i})_{i \in \mathcal {N}}, (v_{i})_{i \in \mathcal {N}} \right)\) if \(\forall i \in \mathcal {N}, \forall \delta ^{\prime }_{i}, v_{i}(\boldsymbol {\delta }) \geq v_{i}(\delta ^{\prime }_{i}, \boldsymbol {\delta }_{i})\) with \(v_{i} = {v_{i}^{T}}\) for the FRG or \(v_{i} = v_{i}^{\lambda }\) for the DRG such that:
In RG with complete information and full monitoring, the Folk theorem characterizes the set of possible equilibrium utilities. It ensures that the set of NE in a RG is precisely the set of feasible and individually rational outcomes of the oneshot game [24, 25]. A cooperation/punishment plan is established between the players before playing [18]. The players cooperate by always transmitting at the new OP with powers \(\tilde {p}_{i}\). When the power of the received signal changes, a deviation is then detected and the players punish the deviator by transmitting with their maximum transmit power \(P_{i}^{\max }\) in the FRG and by playing at the oneshot game in the DRG. In what follows, we give the equilibrium solution of each repeated game model and mention the corresponding algorithm [27–29]. It is important to note that in contrast with iterative algorithms (e.g., iterative waterfilling type algorithms), there is no convergence problem in repeated games (FRG and DRG). Indeed, the transmitters implement an equilibrium strategy (referred to as the operating point) at every stage of the repeated game.
Finite RG
The FRG is characterized by the minimum number of stages (T _{min}). If the number of stages in the game T verifies T>T _{min}, a more efficient equilibrium point can be reached. However, if it is less than T _{min}, the NE is then played. Assuming that channel gains g _{ i }^{2} lie in a compact set \(\left [\nu _{i}^{\min }, \nu _{i}^{\max }\right ]\) [18], we have the following proposition [19]:
Proposition 1 (FRG equilibrium).
: When supposing the following condition is met: T≥T _{min} with:
such that:
Then, the NE corresponding to the Tstage FRG is given by the following action plan for any (T,T _{min}) and ∀t≥1:
The quantities A, B, C, D, E, F, G, and H are defined in Appendix and \(\gamma _{i}^{\ast }\) is the SINR at the NE while \(\bar {\gamma }_{i}\) and \(\widehat {\gamma _{i}}\) are the SINRs related to the maximal utility and the utility minmax respectively (the proof of this proposition is detailed in [19]). The corresponding algorithm is as follows.
Discounted RG
In the DRG, the probability that the game stops at stage t is λ(1−λ)^{t−1} with λ∈] 0,1[ defines the discount factor [17]. Accordingly, we can express the analytic form of the maximum discount factor in a DRG when assuming that channel gains g _{ i }^{2} lie in a compact set \(\left [\nu _{i}^{\min }, \nu _{i}^{\max }\right ]\) [18].
Proposition 2 (DRG equilibrium).
: When assuming the following condition is met:
with:
Then, the NE corresponding to the DRG is given by the following action plan ∀t≥1:
For the proof, see Appendix. The corresponding algorithm is as follows.
Numerical results
In this section, we consider the efficiency function f(x)=e ^{−c/x} with \(c=2^{\frac {R}{R_{0}}}1\). It has been proven in [30, 31] that such a function is sigmoidal as it is convex on the open interval (0,c/2] and concave on (c/2,+∞). The throughput R and the used bandwidth R _{0} are equal to 1 Mbps and 1 MHz, respectively. The maximum power P ^{max} is set to 0.1 Watt while the noise variance is set to 10^{−3} W. The buffer size K, the packet arrival rate q and the consumed power b are fixed to 10, 0.5 and 5×10^{−3} W, respectively. We consider Rayleigh fading channels and a spreading factor L introducing an interference processing (1/L) in the interference term of the SINR.
In Fig. 2, we present the achievable utility region, the new OP, the NE, and the NB solution. We stress that the new OP and the NB solution dominate both the NE in the sense of Pareto. The region between the Pareto frontier and the minmax level is the possible set of equilibrium utilities of the RG according to the Folk theorem.
In order to study the efficiency of the new OP versus the NB solution and the NE, we are interested in comparing powers and utilities of the three equilibria by averaging over channel gains for different scenarios (different number of users N in the system). In Fig. 3, we plot the power and the utility that a user (in a system of N users) can reach for each equilibrium. Thus, we highlight that the new OP and the NB solution have better performances than the NE as they Paretodominate it. When N=2, we notice that the new OP and the NB solution are more efficient than the NE. It is clear that the NB solution requires less power and provides higher utility compared to the new OP, but it is important to stress that values, in terms of powers and utilities, are slightly different for both equilibria (new OP and NB solution). When N>2, we highlight that lower powers are provided with the new OP which leads also to higher values of the utilities. Thus, we notice that the new OP gives better performances than the NE and the NB solution. Therefore, the new OP contributes not only to improve the system performances better than the NE for any given scenario but also enables important gains in terms of powers and utilities when compared to the NB solution for a system with a large number of users (N>2).
We are interested in studying the performances of the social welfare \(\left (\sum _{i} u_{i}\right)\) according to the FRG versus the NE in a multiusers system. The corresponding expression is given by:
In Fig. 4, we present the ratio of the social welfare corresponding to the FRG (ω _{ FRG }) versus the NE social welfare (ω _{ NE }). We proceed by averaging over channel gains lying in a compact set such that 10 log10(ν ^{max}/ν ^{min})=20. We highlight that the social welfare of the FRG reaches higher values than the NE (ω _{ FRG }>ω _{ NE }). In addition, we notice that the social welfare ratio increases with the number of users for both models (Goodman and crosslayer). The minimum number of stages T _{min} according to the crosslayer model is much lower compared to the one related to the Goodman model. To illustrate this, when N=3,T _{min} for the Goodman model is equal to 4600 while it is 3700 for the crosslayer model. This difference becomes considerable with the increase of the number of users. Indeed, when N=4, the minimum number of stages for the Goodman EE is 14,300 while it is equal to 10,900 for the crosslayer approach.
We are interested in plotting the minimum number of stages as a function of the consumed power b and the packet arrival rate q according to both EE models. Results, obtained by averaging over channel realizations, are drawn in Figs. 5 and 6. According to Fig. 5, we stress that T _{min} increases with the number of users while it decreases with the spreading factor. It is clear that for any values of N and L, it exists a consumed power b≠0 for which T _{min} is less than T _{min} when b=0. Thus, a good choice of the fixed consumed power leads to a lower minimum number of stages for the crosslayer model compared to the Goodman model.
In Fig. 6, we highlight that the minimum number of stages is an increasing function of the packet arrival rate q according to the crosslayer model while it is a constant function for the Goodman model since the latter does not take into account the packet arrival process. One can confirm that the minimum number of stages is an increase function of the number of users as deduced previously. Simulations show that it exists a packet arrival rate q _{0} before which T _{min} of the crosslayer model is much lower than T _{min} of the Goodman model for different number of users. Simulations show that q _{0}≈0.6 and for q≥q _{0},T _{min} of the crosslayer model converges to T _{min} corresponding to the Goodman model. It is important to highlight that when N=3 and q≥q _{0},T _{min} of the crosslayer model takes higher values than T _{min} corresponding to the Goodman model but values are quite similar. With the increase of the number of users, the difference between the minimum number of stages for both models becomes noticeable. According to Figs. 5 and 6, one can conclude that the crosslayer model can be exploited for short games.
For the DRG model, we plot in the first step the improvement of the social welfare (ω _{ DRG }) versus the oneshot game (ω _{ NE }) for Goodman and crosslayer models (b=0 and b=5×10^{−3}, respectively) as a function of the spectral efficiency η=N/L. We simulated our algorithm by averaging over channel gains for different number of users. Results are given in Fig. 7. It is important to highlight that the DRG social welfare reaches higher values than the NE social welfare (ω _{ DRG }>ω _{ NE }). For low values of the spectral efficiency, the social welfare ratio is quite similar for both models while the difference becomes noticeable when the spectral efficiency takes higher values. The social welfare ratio increases with the number of users for both EE models. For each model, when N takes high values, the social welfare ratios become closer (for the crosslayer model, the curves corresponding to N=3 and N=4 are closer than with the curve of N=2).
For this reason, we studied the variation of λ _{max} as a function of η and q for both EE models and for different number of users. Results are given in Figs. 8 and 9. According to Fig. 8, we deduce how λ _{max} decreases with the number of users for both EE models. In addition, we stress that the values reached by λ _{max} becomes closer when N takes higher values. This can explain Fig. 7.
The study of the variation of λ _{max} versus the packet arrival rate q (in Fig. 9) shows that the maximum discount factor λ _{max} decreases with the number of users and with the packet arrival rate q as well. Simulations show that it exists a packet arrival rate q _{1} before which the λ _{max} corresponding to the crosslayer model takes higher values than the maximum discount factor of the Goodman model for different numbers of users. We notice that starting from q _{1}, the maximum discount factor of the crosslayer model converges to λ _{max} corresponding to the Goodman model.
In a second step, we plotted in Fig. 10 the variation of the DRG social welfare as a function of λ≤λ _{max}. We notice that ω _{ DRG } is an increase function of λ. Thus, when λ=λ _{max},ω _{ DRG } reaches highest value. However, we stress that ω _{ DRG } decreases with the number of users especially for the Goodman model while it is quite similar for the crosslayer model. This confirms that the proposed new OP is still quite efficient and can be utilized for games with high number of users.
Finally, we plot for both RG models (FRG and DRG) in Figs. 11 and 12 the social welfare when using the crosslayer approach against the constant power b for two different values of the packet arrival rate q (0.5 and 0.7). The considered system is composed of two users and the spreading factor L is fixed to 4. The idea consists in studying the efficiency of the crosslayer approach regarding the Goodman power control algorithm. Accordingly, for each packet arrival rate, we plot the social welfare with the crosslayer approach (powers at the equilibrium are determined normally according to q) and the social welfare with the crosslayer power control but when powers at the equilibrium are determined by the Goodman algorithm (p[q→1]). Indeed, the packet arrival rate is assumed constant in the Goodman model and equal to 1 (packets arrive with probability q=1). For both RG models, we stress that the crosslayer power control approach outperforms the Goodman algorithm for both values of the packet arrival rate q. Important (relative) gains are reached. To illustrate this, for q=0.5 and b=0.045 W the relative gain is higher than 50 % in the FRG and the DRG as well. Therefore, we conclude that the OP with the crosslayer approach provides better performances and is more efficient than the OP with the Goodman power control approach.
Conclusions
In this paper, we have investigated RG for distributed power control in a MAC system. As the NE is not always energyefficient, the NB solution might be a possible efficient solution since it is Paretoefficient. However, the latter, in general, requires global CSI at each transmitter node. Thus, we were motivated to investigate using the repeated game formulation and develop a new OP that simultaneously is both more efficient than the NE and achievable with only individual CSI being required at the transmitter. Also, we consider a new EE metric taking into account the presence of a queue at the transmitter with an arbitrary packet arrivals.
Cooperation plans are proposed where the new OP is considered and closedform expressions of the minimum number of stages for the FRG and the maximum discount factor for the DRG have been established. The study of the social welfare (sum of utilities of all the users) shows that considerable gains are reached compared to the NE (for the FRG and DRG). Moreover, our model proves that even with a high number of users, the FRG can always be played with a minimum number of stages shorter than when using the Goodman model. In addition, the social welfare in the DRG decreases slightly with the number of users with the crosslayer approach while it decreases considerably with the Goodman model. Finally, the comparison of the crosslayer algorithm versus the Goodman algorithm shows that in real systems with random packet arrivals, the crosslayer power control algorithm outperforms the Goodman algorithm. Thus, the new OP with the crosslayer approach is more efficient. An interesting extension to this work would be to consider the interference channel instead of the MAC channel and generalize the framework applied here. Another possible extension would be to consider the multicarrier case and the resulting repeated g ame.
Appendix
Proof of λ _{ max }
Determination of the maximal utility
Let us determine the maximal utility that a player can get and which is denoted as follows:
We denote \(\dot {p}_{i}\) the power maximizing the utility function u _{ i } and which is the solution of the following equation:
with \({\gamma _{i}^{\prime } = {\frac {\mathrm {d} \gamma _{i}}{\mathrm {d} p_{i}}} = {\frac {\gamma _{i}}{p_{i}}}}, {f^{\prime } = {\frac {\mathrm {d} f}{\mathrm {d} \gamma _{i}} }}\) and \({\Phi ^{\prime } = {\frac {\mathrm {d} \Phi }{\mathrm {d} \gamma _{i}}}}\). Therefore, the expression of the maximum utility function writes as:
with:
We have to study then the behavior of \(\dot {u}_{i}({\dot {p}_{i},\mathbf {p}_{i}})\) regarding p _{ j } for j≠i, and then we determine the sign of \({\frac {\partial \dot {u}_{i}({\dot {p}_{i},\mathbf {p}_{i}})}{\partial p_{j}}}\) which is given by:
We are interested to study the sign of the numerator:
with:
The next step would be to determine the sign of the expression \({\frac {b\partial \phi (\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}} \allowbreak + \dot {p}_{i} q \left (\frac {(1\ \ \phi (\dot {\gamma }_{i}))}{f(\dot {\gamma }_{i})}\right)^{2}\frac {\partial f(\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}}\). It is obvious that \({\dot {p}_{i} q \left (\frac {(1\ \ \phi (\dot {\gamma }_{i}))}{f(\dot {\gamma }_{i})}\right)^{2}\frac {\partial f(\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}>0}\) since f is an increasing function of the SINR. Therefore, we need to determine the sign of \({\frac {\partial \phi (\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}}\). We have:
The sign of the first term is negative while the sign of the second term is the same as ∂ Π(γ _{ i })/∂ γ _{ i } since (1−f(γ _{ i }))>0 and we have:
However \(\rho (\gamma _{i})={\frac {q(1\ \ f(\gamma _{i}))}{(1\ \ q)f(\gamma _{i})}}\) and then:
As shown in [13], we have:
The latter quantity can be expressed as:
Consequently, we have:
Therefore, \({\frac {\partial \Pi (\gamma _{i})}{\partial \gamma _{i}}<0}\) and hence \({\frac {\partial \phi (\gamma _{i})}{\partial \gamma _{i}}}<0\). In particular, we have \({\frac {\partial \phi (\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}}<0\). Thus, we have \({\left (\frac {b\partial \phi (\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}+ \dot {p}_{i} q \left (\frac {(1\ \ \phi (\dot {\gamma }_{i}))}{f(\dot {\gamma }_{i})}\right)^{2}\frac {\partial f(\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}\right)}>0\) and finally \({\frac {\partial \dot {u}_{i}({\dot {p}_{i},\mathbf {p}_{i}})}{\partial p_{j}}}<0\). We deduce then that \(\dot {u}_{i}\) is a decreasing function of p _{ j }. It reaches its maximum when p _{ j }=0 and it is minimum when \(p_{j}=p_{j}^{\max }\) (for all j≠i). When substituting p _{ j }=0 in the SINR expression, this allows the determination of the optimal power:
with: \({\gamma _{i} = \frac {p_{i}g_{i}^{2}}{\sigma ^{2}}}\).
The latter equation is a function of the SINR. We determine then the solution in terms of SINR which we denote \(\bar {\gamma }_{i}\) and for which the optimal power is \({\bar {p}_{i}=\frac {\bar {\gamma }_{i}\sigma ^{2}}{g_{i}^{2}}}\). This SINR exists due to the quasiconcavity of u _{ i } in (p _{ i },p _{−i }) [13, 14]. Then, we have:
Determination of λ _{max}
The SINR \(\tilde {\gamma }_{i}\) refers to the SINR when playing the new OP while \(\gamma _{i}^{\ast }, \bar {\gamma }_{i}\), and \(\widehat {\gamma _{i}}\) are the SINRs at the NE, at the maximal utility and at the utility minmax respectively. In order to simplify expressions, we define the following notations:
At a stage t, the equilibrium condition is [18]:
Knowing that \(\sum _{s\geq t+1}{(1\lambda)^{st}} = (1\lambda)/\lambda \), we have:
Let Ψ and Γ define the following quantities:
Thus:
References
 1
D Lister, in Proc. IEEE Internat. Conf. on Comm. Workshops (ICC Workshops 2009). An Operators View on Green Radio (1st Int. Workshop on Green Comm., 2009).
 2
J Palicot, C Roland, On the Use of Cognitive Radio for Decreasing the Electromagnetic Radiations, URSI 05, XXVIII General Assembly (2005).
 3
GreenTouch, Communications Turns Totally Green (Press Release, 2010).
 4
Y Yu, VK Prasanna, B Krishnamachari, Energy minimization for realtime data gathering in wireless sensor networks. IEEE Trans. Wirel. Commun. 5(11), 3087–3096 (2006).
 5
S Bandyopadhyay, EJ Coyle, in TwentySecond Annual Joint Conference of the IEEE Computer and Communications, 3. An Energy Efficient Hierarchical Clustering Algorithm for Wireless Sensor Networks, INFOCOM 2003 (IEEE Societies, 2003), pp. 1713–1723.
 6
M Cardei, MT Thai, Y Li, W Wu, in 24th Annual Joint Conference of the IEEE Computer and Communications Societies, 3. EnergyEfficient Target Coverage in Wireless Sensor Networks, INFOCOM 2005 (Proceedings IEEE, 2005), pp. 1976–1984.
 7
A Zappone, E Jorswieck, Energy Efficiency in Wireless Networks via Fractional Programming Theory. Foundations and Trends in Commun Inf Theory. 11(3–4), 185–396 (2015).
 8
C Isheden, Z Chong, E Jorswieck, G Fettweis, Framework for LinkLevel Energy Efficiency Optimization with Informed Transmitter. IEEE Trans. Wirel. Commun. 11(8), 2946–2957 (2012).
 9
S Lasaulce, Y Hayel, RE Azouzi, M Debbah, Introducing hierarchy in energy games. IEEE Trans. Wirel. Commun. 8(7), 3833–3843 (2009).
 10
DJ Goodman, NB Mandayam, Power control for wireless data. IEEE Pers. Commun. 7(2), 48–54 (2000).
 11
SM Betz, HV Poor, Energy efficient communications in CDMA networks: a game theoretic analysis considering operating costs. IEEE Trans. Signal Proc. 56(10), 5181–5190 (2008).
 12
A Zappone, Z Chong, E Jorswieck, S Buzzi, Energyaware competitive power control in relayassisted interference wireless networks. IEEE Trans. Wirel. Commun. 12(4), 1860–1871 (2013).
 13
VS Varma, S Lasaulce, Y Hayel, SE Elayoubi, M Debbah, Crosslayer design for green power control. IEEE Int. Conf. Commun. (ICC), 4021–4026 (2012).
 14
VS Varma, S Lasaulce, Y Hayel, SE Elayoubi, A CrossLayer Approach for Distributed EnergyEfficient Power Control in Interference Networks. IEEE Trans. Veh. Technol (2014).
 15
M Mhiri, K Cheikhrouhou, A Samet, F Mériaux, S Lasaulce, in 6th International Conference on Network Games, Control and Optimization (NETGCOOP). EnergyEfficient Spectrum Sharing in RelayAssisted Cognitive Radio Systems, (2012), pp. 86–91.
 16
S Lasaulce, M Debbah, E Altman, Methodologies for analyzing equilibria in wireless games: a look at pure, mixed, and correlated equilibria. IEEE Signal Process. Mag. 26(5), 41–52 (2009).
 17
S Lasaulce, H Tembine, Game Theory and Learning for Wireless Networks: Fundamentals and Applications (Academic Press, Elsevier, 2011).
 18
M Le Treust, S Lasaulce, A repeated game formulation of energyefficient decentralized power control. IEEE Trans. Wirel. Commun. 9(9), 2860–2869 (2010).
 19
M Mhiri, VS Varma, M Le Treust, S Lasaulce, A Samet, in First International Black Sea Conference on Communications and Networking (BlackSeaCom). On the Benefits of Repeated Game Models for Green CrossLayer Power Control in Small Cells, (2013), pp. 137–141.
 20
M Abidi, VT Vakili, in IEEE 18th International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC). A Game Theoretic Approach for SINRConstrained Power Control in 3G Cellular CDMA Communication Systems, (2007), pp. 1–5.
 21
E Hossain, D Niyato, Z Han, Dynamic Spectrum Access and Management in Cognitive Radio Networks (United States of America by Cambridge University Press, New York, 2009).
 22
EG Larsson, EA Jorswieck, Competition versus cooperation on the MISO interference channel. IEEE J. Selected Areas in Commun. 26(7), 1059–1069 (2008).
 23
J Friedman, A noncooperative equilibrium for supergames. Rev. Econ. Stud. 38(1), 1–12 (1971).
 24
S Hart, Robert Aumann’s game and economic theory. Scand. J. Econ. 108(2), 185–211 (2006).
 25
S Sorin, in Handbook of game theory, 1, ed. by RJ Aumann, S Hart. Repeated games with complete information (Elsevier Science Publishers, 1992), pp. 72–107.
 26
RJ Aumann, LS Shapley, LongTerm CompetitionA GameTheoretic Analysis. preprint, 1–26 (1976).
 27
Y Xu, J Wang, Q Wu, A Anpalagan, YD Yao, Opportunistic spectrum access in unknown dynamic environment: A gametheoretic stochastic learning solution. IEEE Trans. Wirel. Commun. 11(4), 1380–1391 (2012).
 28
H Li, Multiagent Qlearning for Alohalike spectrum access in cognitive radio systems. EURASIP J. Wirel. Commun. Netw. 2010:, 1–15 (2010).
 29
Y Song, SHY Wong, KW Lee, in MobiCom’11 Proceedings of the 17th Annual International Conference on Mobile Computing and Networking. Optimal Gateway Selection in Multidomain Wireless Networks: A Potential Game Perspective, (2011), pp. 325–336.
 30
EV Belmega, S Lasaulce, in VALUETOOLS’09 Proceedings of the 4th International ICST Conference on Performance Evaluation Methodologies and Tools. An InformationTheoretic Look at MIMO EnergyEfficient Communications, (2009).
 31
EV Belmega, S Lasaulce, Energyefficient precoding for multipleantenna terminals. IEEE Trans. Signal Proc. 59(1), 329–340 (2011).
Author information
Additional information
Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Mhiri, M., Varma, V.S., Cheikhrouhou, K. et al. Crosslayer distributed power control: a repeated game formulation to improve the sum energy efficiency. J Wireless Com Network 2015, 257 (2015) doi:10.1186/s136380150486z
Received:
Accepted:
Published:
Keywords
 Distributed power control
 Crosslayer energy efficiency
 Repeated games
 Channel state information