Skip to main content

Cross-layer distributed power control: a repeated game formulation to improve the sum energy efficiency

Abstract

The main objective of this work is to improve the energy efficiency (EE) of a multiple access channel (MAC) system, through power control, in a distributed manner. In contrast with many existing works on energy-efficient power control, which ignore the possible presence of a queue at the transmitter, we consider a new generalized cross-layer EE metric. This approach is relevant when the transmitters have a non-zero energy cost even when the radiated power is zero and takes into account the presence of a finite packet buffer and packet arrival at the transmitter. As the Nash equilibrium (NE) is an energy-inefficient solution, the present work aims at overcoming this deficit by improving the global energy efficiency. Indeed, as the considered system has multiple agencies each with their own interest, the performance metric reflecting the individual interest of each decision-maker is the global energy efficiency defined then as the sum over individual energy efficiencies. Repeated games (RG) are investigated through the study of two dynamic games (finite RG and discounted RG), whose equilibrium is defined when introducing a new operating point (OP), Pareto-dominating the NE and relying only on individual channel state information (CSI). Accordingly, closed-form expressions of the minimum number of stages of the game for finite RG (FRG) and the maximum discount factor of the discounted RG (DRG) were established. Our contributions consist of improving the system performances in terms of powers and utilities when using the new OP compared to the NE and the Nash bargaining (NB) solution. Moreover, the cross-layer model in the RG formulation leads to achieving a shorter minimum number of stages in the FRG even for higher number of users. In addition, the social welfare (sum of utilities) in the DRG decreases slightly with the cross-layer model when the number of users increases while it is reduced considerably with the Goodman model. Finally, we show that in real systems with random packet arrivals, the cross-layer power control algorithm outperforms the Goodman algorithm.

1 Introduction

1.1 Motivation

The design and management of green wireless networks [13] has become increasingly important for modern wireless networks, in particular, to manage operating costs. Futuristic (beyond 5G) cellular networks face the dual challenges of being able to respond to the explosion of data rates and also to manage network energy consumption. Due to the limited spectrum and large number of active users in modern networks, energy-efficient distributed power control is an important issue. Sensor networks, which have multiple sensors sending information to a common receiver with a limited energy capacity have also recently surged in popularity. Energy minimization in sensor networks has been analyzed in many recent works [46].

Several of the above described systems have some common features:

  1. 1.

    Multiple transmitters connected to a common receiver

  2. 2.

    Lack of centralization or coordination, i.e., a distributed and decentralized network

  3. 3.

    Relevance of minimizing energy consumption or maximizing energy efficiency (EE)

  4. 4.

    Transmitters that have arbitrary data transmission

These features are present in many modern systems like a sensor network which has multiple sensors with limited energy connected in a distributed manner to a common receiver. These sensors do not always have information to transmit, resulting in sporadic data transmission. Another example would be several mobile devices connected to a hotspot (via wifi or even Bluetooth). Due to these features of the network, inter-transmitter communication is not possible and the transmitters are independent decision-makers. Therefore, implementing frequency or time division multiple access becomes harder, and a MAC protocol (with single carrier) is often the preferred or natural method of channel access.

1.2 Novelty

In many existing works, both network-centric and user-centric approaches have been studied. In a network-centric approach, the global energy efficiency (GEE) is defined as the ratio between the system benefit (sum-throughput or sum-rate) and the total cost in terms of consumed power [7, 8]. However, when targeting an efficient solution in an user-centric problem, the GEE becomes not ideal as it has no significance to any of the decision-makers. In this case, other metrics are required to reflect the individual interest of each decision-maker. Therefore, we redefine the GEE to be the sum over individual energy efficiencies as a suitable metric of interest [9].

The major novelty of this work is in improving the sum of energy efficiencies for a communication system with all the listed features above. In such a decentralized and distributed network, as each transmitter operates independently, implementing a frequency division or a time division multiple access is not trivial. Therefore, we are interested in looking at a MAC system where all transmitters operate on the same band. Additionally, EE will be our preferred metric due to its relevance. This metric has been defined in [10] as the ratio between the average net data rate and the transmitted power. In [11, 12], the total power consumed by the transmitter was taken into account in the EE expression to design distributed power control which is one of the most well-known techniques for improving EE. However, many of the works available on energy-efficient power control consider the EE defined in [10] where the possible presence of a queue at the transmitter is ignored. In contrast with the existing works, we consider a new generalized EE based on a cross-layer approach developed recently in [13, 14]. This approach is important since it takes into account: (1) a fixed cost in terms of power namely, a cost which does not depend on the radiated power and (2) the presence of a finite packet buffer and sporadic packet arrival at the transmitter (which corresponds to including the fourth feature mentioned above). Although providing a more general model, the distributed system in [14] may operate at a point which is energy-inefficient. Indeed, the point at which the system operates is a Nash equilibrium (NE) of a certain non-cooperative static game. The present work aims at filling this gap by not only considering a cross-layer approach of energy-efficient power control but also improving the system performance in terms of sum of energy efficiencies.

1.3 State of the art

Nash bargaining (NB) solution in a cooperative game can provide a possible efficient solution concept for the problem of interest as it is Pareto-efficient. However, it generally requires global channel state information (CSI) [15]. Therefore, we are interested in improving the average performance of the system by considering long-term utilities. We focus then on repeated games (RG) where repetition allows efficient equilibrium points to be implemented. Unlike static games which are played in one shot, RG are a special case of dynamic games which consider a cooperation plan and consist in repeating at each step the same static game and the utilities result from averaging the static game utilities over time [16]. There are two relevant dynamic RG models: finite (FRG) and discounted (DRG). The FRG is defined when the number of stages during which the players interact is finite. For the DRG model, the discount factor is seen as the stopping probability at each stage [17]. The power control problem using the classic EE developed by Goodman et al in [10] has been solved with RG only in [18] where authors developed an operating point (OP) relying on individual CSI and showed that RG lead to efficient distributed solution. Here, we investigate the power control problem of a MAC system by referring to RG (finite and discounted) where the utility function is based on a cross-layer approach. Accordingly, we contribute to:

  1. 1.

    Determine the closed-form expressions of the minimum number of stages for the FRG and the maximum discount factor for the DRG. These two parameters identify the two considered RG.

  2. 2.

    Determine a distributed solution Pareto-dominating the NE and improving the system performances in terms of powers and utilities compared not only to the NE but also to the NB solution even for high number of users.

  3. 3.

    Show that the RG formulation when using the new EE and the new OP leads to significant gains in terms of social welfare (sum of utilities of all the users) compared to the NE

  4. 4.

    Show that the following aspects of the cross-layer model improve considerably the system performances when comparing to the Goodman model even for large number of users:

    • The minimum number of stages in the cross-layer EE model can always be shorter than the minimum number of stages in the Goodman EE formulation.

    • The social welfare for the DRG in the cross-layer model decreases slightly when the number of users increases while it decreases considerably in the Goodman model.

  5. 5.

    Show that in real systems with random packet arrivals, the cross-layer power control algorithm outperforms the Goodman algorithm and then the new OP with the cross-layer approach is more efficient.

1.4 Structure

This paper is structured as follows. In section 2, we define the system model under study, introduce the generalized EE metric, and define the non-cooperative static game. This is followed (section 3) by the study of the NB solution. In section 4, we introduce the new OP, give the formulation of both RG models (FRG and DRG), and determine the closed-form expressions of the minimum number of stages and the maximum discount factor as well. Numerical results are presented in section 5, and finally, we draw several concluding remarks.

2 Problem statement

2.1 System model

We consider a MAC system composed of N small transmitters communicating with a receiver. The ith transmitter transmits a signal x i with a power \(p_{i} \in \left [0, P_{i}^{\max }\right ]\) where \(P_{i}^{\max }\) is the maximum transmit power assumed identical for all users \(\left (P_{i}^{\max }=P^{\max }\right)\). The additive noise, which is the same for all users, is an additive white Gaussian noise denoted as n with zero mean and variance σ 2. We assume that the users transmit their data over block fading channels. The channel gain between user i and the receiver is given by g i . Thus, the baseband signal received at the receiver is written as:

$$ y = \sum_{\substack{i=1}}^{N} x_{i}|g_{i}|^{2} + n. $$
((1))

Therefore, the resulting signal-to-interference-plus-noise ratio (SINR) γ i corresponding to the ith transmitter is given by [18, 19]:

$$ \gamma_{i}(\mathbf{p}) = {\frac{p_{i}|g_{i}|^{2}}{\sigma^{2}+\sum_{j \neq i} p_{j}|g_{j}|^{2} }}, $$
((2))

where p=(p 1,p 2,…,p N ) defines the power vector of all users and can be written as p=(p i ,p i ) with p i =(p 1,…,p i−1,p i+1,…,p N ).

The purpose of this work is to determine how each user is going to control its power in an optimum way. Game theory, as a powerful mathematical tool, helps to solve such an optimization problem where the utility function is the EE which is a function of the users powers. Since the system under study has multiple agencies each with individual interest, the sum over individual energy efficiencies will be considered as the performance metric reflecting the individual interest of each decision-maker.

2.2 Energy efficiency metric

The EE is defined in [10] as a ratio of the net data rate to the transmit power level and is given by:

$$ \chi_{i}(\mathbf{p})=\frac{Rf(\gamma_{i}(\mathbf{p}))}{p_{i}}, $$
((3))

where R is the transmission rate (in bit/s) while f :[0,+)→[ 0,1] denotes the efficiency function which is sigmoidal and corresponds to the packet success rate verifying f(0)=0 and \(\lim \limits _{x \rightarrow +\infty } f(x)=1\). Authors of [11] were the first to consider a total transmission cost of the type radiated power (p i ) + consumed power (b) to design distributed power control strategies for multiple access channels [13, 14] as follows:

$$ \chi_{i}(\mathbf{p}) = \frac{Rf(\gamma_{i}(\mathbf{p}))}{b+p_{i}}. $$
((4))

In [13, 14], a more generalized EE metric has been developed by considering a packet arrival process following a Bernoulli process with a constant probability q and a finite memory buffer of size K. The new EE expression is given by:

$$ \chi_{i}(\mathbf{p}) = {\frac{Rq(1-\Phi(\gamma_{i}(\mathbf{p})))}{b+{\frac{qp_{i}(1\,-\,\Phi(\gamma_{i}(\mathbf{p})))}{f(\gamma_{i}(\mathbf{p}))}}}}, $$
((5))

where the function Φ identifies the packet loss due to both bad channel conditions and the finiteness of the packet buffer and is expressed as follows:

$$ \Phi(\gamma_{i})=(1-f(\gamma_{i}))\Pi_{K}(\gamma_{i}), $$
((6))

where Π K (γ i ) is the stationary probability that the buffer is full and is given by:

$$ \Pi_{K}(\gamma_{i})={\frac{\rho^{K}(\gamma_{i})}{1+\rho(\gamma_{i})+\ldots+\rho^{K}(\gamma_{i})}}, $$
((7))

with:

$$ \rho(\gamma_{i})={\frac{q(1-f(\gamma_{i}))}{(1-q)f(\gamma_{i})}}. $$
((8))

It is important to highlight that this new generalized EE given by (5) includes the conventional case of (4) when making q→1.

2.3 Static cross-layer power control game

The static cross-layer power control game is a non-cooperative game which can be defined as a strategic form game [17].

Definition 1.

The game is defined by the ordered triplet \(\mathcal {G} = \left (\mathcal {N}, (\mathcal {S}_{i})_{i \in \mathcal {N}}, \allowbreak (u_{i})_{i \in \mathcal {N}} \right)\) where \(\mathcal {N}\)is the set of players (the N transmitters), \(\mathcal {S}_{1},\ldots,\mathcal {S}_{N}\) are the corresponding sets of strategies with \(\mathcal {S}_{i}=\left [0, P_{i}^{\max }\right ]\) and u 1,…,u N are the utility functions given by:

$$ u_{i}(\mathbf{p}) = \chi_{i}(\mathbf{p}), $$
((9))

where χ i (p) is given by Eq. (5).

In a non-cooperative game, each user (player) seeks to maximize selfishly its individual utility function. The optimum solution results then by setting u i / p i to zero as follows:

$$ b\gamma_{i}^{\prime}\Phi^{\prime}(\gamma_{i})+q\left(\frac{1-\Phi(\gamma_{i})}{f(\gamma_{i})}\right)^{2}\left[f(\gamma_{i})-p_{i}\gamma_{i}^{\prime} f^{\prime}(\gamma_{i})\right]=0, $$
((10))

where \(\gamma _{i}^{\prime } = {\frac {\mathrm {d} \gamma _{i}}{\mathrm {d} p_{i}}} = {\frac {\gamma _{i}}{p_{i}}}, f^{\prime } = {\frac {\mathrm {d} f}{\mathrm {d} \gamma _{i}} }\), and \(\Phi ^{\prime } = {\frac {\mathrm {d} \Phi }{\mathrm {d} \gamma _{i}}}\).

Authors in [13, 14] proved that such equation has a unique best response. In the game \(\mathcal {G}\), this best response defines the NE and is denoted as \(\mathbf {p}^{\ast }=\left (p_{1}^{\ast },p_{2}^{\ast },\ldots,p_{N}^{\ast }\right)\). However, the NE solution is not always Pareto-efficient for many scenarios. We highlight in Fig. 1 that the NE is not on the Pareto frontier (the outer boundary of the achievable utilities region). Therefore, we are motivated to design a more efficient solution than the NE. For this, as a first step we investigate the NB solution.

Fig. 1
figure 1

Pareto efficiency of the NB solution versus the NE

3 Nash bargaining solution

Due to the inefficiency of the NE, a Pareto-efficient solution can be achieved by introducing the cooperation between the players. The resulting solution is called NB solution whose determination requires two elements [20]:

  • The region of achievable utilities formed by the set of the feasible utilities of all the players should be compact and convex [21]

  • The threat point is defined by the NE of the one-shot game [22]

3.1 Compactness and convexity of the achievable utilities region

We denote \(\mathcal {R}\) the achievable utilities region defined as follows:

$$ {}\mathcal{R} = \left\{\left(u_{1},u_{2},\ldots,u_{N} \right)|\left(p_{1},p_{2},\ldots,p_{N} \right)\in \left[0,P_{i}^{\max}\right]^{N}\right\}. $$
((11))

As the strategies sets \(\mathcal {S}_{1},\ldots,\mathcal {S}_{N}\) are compact since \(\mathcal {S}_{i} = \left [0,P_{i}^{\max }\right ]\) and the utility function u i is continuous, the region \(\mathcal {R}\) is compact for a given channel configuration [22]. Since it is generally not convex, time-sharing has been a solution to convexify it. In order to illustrate the main idea of this technique applied to our problem, let us consider a system of two users [22]. During a time fraction τ, the users use the powers (p 1,p 2) to have utilities (u 1,u 2). During a time fraction (1−τ), they use another combination of powers \((p_{1}^{\prime },p_{2}^{\prime })\) to have \((u_{1}^{\prime },u_{2}^{\prime })\) [15, 22]. Thus, the new achievable utilities region (for the two-user system) is:

$$ \begin{aligned} \bar{\mathcal{R}} = & \left\{\left(\tau u_{1}+(1-\tau)u_{1}^{\prime},\tau u_{2}+(1-\tau)u_{2}^{\prime}\right)\right.\\ &\left.|0 \leq \tau \leq 1,\; \left(u_{1},u_{2}\right)\in \mathcal{R},\; \left(u_{1}^{\prime},u_{2}^{\prime}\right)\in \mathcal{R}\right\}. \end{aligned} $$
((12))

We define \(\bar {\mathcal {R}}^{\ast }\) the Pareto boundary (the outer frontier) of the convex hull of \(\bar {\mathcal {R}}\). Figure 1 shows the convexified achievable utilities region with the NE point, the NB solution, and the Nash curve (both will be defined next).

3.2 Existence and uniqueness of the NB solution

Let \(\mathcal {R}^{NB}\) define the improvement region of utilities versus the NE, and it is given by:

$$ \mathcal{R}^{NB} = \left\{u_{i}\geq u_{i}^{NE} | i \in\ [\!1,\ldots,N]\right\}. $$
((13))

The NB solution belongs to the region \(\mathcal {R}^{NB}\). Here, in the power control game \(\mathcal {G}\), there exists a unique NB solution denoted as \(\mathbf {u}^{NB}=\left (u_{1}^{NB},u_{2}^{NB},\ldots,u_{N}^{NB}\right)\) and is given by [21]:

$$ \mathbf{u}^{NB}=\max_{\substack{u_{i} \in \mathcal{R}^{NB} \\ i \in [1,\ldots,N]}} \prod_{i=1}^{N}{\left(u_{i}-u_{i}^{NE}\right)}, $$
((14))

Since the NE can always be reached and the achievable utility region is a compact convex set, the NB solution exists. It is unique since it verifies certain axioms: individual rationality and feasibility, independence of irrelevant alternatives, symmetry, Pareto optimality (efficiency), and independence of linear transformations [21]. The NB solution results from the intersection of the Pareto boundary (\(\bar {\mathcal {R}}^{\ast }\)) with the Nash curve whose form is \(m=\prod _{i=1}^{N}\left (u_{i}-u_{i}^{NE}\right)\) where m is a constant chosen such that there is precisely one intersection point [22] (see Fig. 1). Although the NB solution is Pareto-efficient, it generally requires global CSI at the transmitters due to the Nash product (m) introducing all the users utilities [15]. For this reason, we are looking for another efficient solution through the study of the dynamic RG.

4 Repeated game formulation

RG consist in their standard formulation, in repeating the same static game at every time instance and the players seek to maximize their utility averaged over the whole game duration [16]. Repetition allows efficient equilibrium points to be implemented and which can be predicted from the one-shot static game according to the Folk theorem, which provides the set of possible Nash equilibria of the repeated game [18, 23]. In a repeated game, certain agreements between players on a common cooperation plan and a punishment policy can be implemented to punish the deviators [16]. In what follows, we introduce the new OP and characterize the two RG models.

4.1 New OP

The new OP consists in setting p i |g i |2 to a constant α which is unique when maximizing the expected sum utility over all the channel states. It is given by [19]:

$$ \tilde{\alpha} = \arg\max_{\alpha} \mathbb{E}_{g} \left[\sum_{i=1}^{N}{u_{i}(\mathbf{p})}\right]. $$
((15))

The power of the ith player is then deduced as follows:

$$ \tilde{p}_{i} = {\frac{\tilde{\alpha}}{|g_{i}|^{2}}}. $$
((16))

The new OP Pareto-dominates the NE and relies on individual CSI at the transmitter. In order to implement a cooperation plan between the players, we assume, in addition to the individual CSI assumption, that every player is able to know the power of the received signal at each game stage, which is denoted by [18]:

$$ P_{y} = \sigma^{2} + \sum_{\substack{i=1}}^{N} p_{i}|g_{i}|^{2}. $$
((17))

When assuming that p i |g i |2 is set to the constant α, the received signal power can be written as:

$$ P_{y} = \alpha\frac{\gamma_{i} + 1}{\gamma_{i}}. $$
((18))

Accordingly, each transmitter needs only its individual SINR and the constant α (depending only on p i and |g i |2) to establish the received signal power P y . We assume that the data transmission is over block fading channels and that channel gains |g i |2 lie in a compact set \(\left [\nu _{i}^{\min }, \nu _{i}^{\max }\right ]\) [18]. Thus, the interval to which the received signal power belongs is \({\Delta = \left [\sigma ^{2}, \sigma ^{2} + \sum _{\substack {i=1}}^{N} p_{i}\nu _{i}^{\max }\right ]}\). Since the players detect a variation of the received signal power, a deviation from the cooperation plan has occurred. Indeed, when playing at the new OP, the received signal power is constant and equal to \({\frac {\sigma ^{2}(\tilde {\gamma }\,+\,1)}{1\,-\,(N\,-\,1)\tilde {\gamma }}}\). Consequently, when any player deviates from the new OP, the latter quantity changes and the deviation is then detected [18].

4.2 Repeated game characterization

A RG is a long-term interaction game where players react to past experience by taking into account what happened in all previous stages and make decisions about their future choices [24, 25]. The resulting payoff is an average over all the stage payoffs. We denote by t, the game stage which corresponds to the instant in which all players choose their actions. Accordingly, a profile of actions can be defined for all players as p(t)=(p 1(t),p 2(t),…,p N (t)). A history h(t) of player i at time t is the pair of vectors (P y,t ,p i,t )=(P y (1),P y (2),…,P y (t−1),p i (1),p i (2),…,p i (t−1)) and which lies in the set \(\mathcal {H}_{t} = \left (\Delta ^{t-1},\mathcal {P}_{i}^{t-1}\right)\) with \(\mathcal {P}_{i} = \left [0,P_{i}^{\max }\right ] = \left [0,P^{\max }\right ]\) (as all the users have the same maximum power) [18]. Histories are fundamental in RG as they allow players to coordinate their behavior at each stage so that previous histories are known by all the players [25]. We denote δ i,t the pure strategy of the ith player. It defines the action to select after each history [18, 25]:

$$ \delta_{\text{\textit{i,t}}} = \left | \begin{array}{rcl} \mathcal{H}_{t} & \rightarrow & \left[0, P_{i}^{\max}\right]\\ \mathbf{h}(t) & \mapsto & p_{i}(t) \end{array} \right. $$
((19))

In RG literature, there are two important models [17]:

  • The finite RG where the number of stages of the game (denoted as T≥1) during which the players interact is finite

  • The discounted RG where the discount factor (denoted as λ] 0,1[) is seen as the stopping probability at each stage

The utility function of each player results from averaging over the instantaneous utilities over all the game stages in the FRG while it is a geometric average of the instantaneous utilities during the game stages in the DRG [18, 25, 26]. We denote δ=(δ 1,δ 2,…,δ N ) the joint strategy of all players.

Definition 2.

A joint strategy δ satisfies the equilibrium condition for the repeated game defined by \(\left (\mathcal {N}, (\mathcal {S}_{i})_{i \in \mathcal {N}}, (v_{i})_{i \in \mathcal {N}} \right)\) if \(\forall i \in \mathcal {N}, \forall \delta ^{\prime }_{i}, v_{i}(\boldsymbol {\delta }) \geq v_{i}(\delta ^{\prime }_{i}, \boldsymbol {\delta }_{-i})\) with \(v_{i} = {v_{i}^{T}}\) for the FRG or \(v_{i} = v_{i}^{\lambda }\) for the DRG such that:

$$\begin{array}{*{20}l} {v_{i}^{T}}(\boldsymbol{\delta})= & \frac{1}{T}\sum_{t=1}^{T}u_{i}(\mathbf{p}(t)) & \mathrm{for~the~FRG} \end{array} $$
((20))
$$\begin{array}{*{20}l} v_{i}^{\lambda}(\boldsymbol{\delta})= &\sum_{t=1}^{+\infty}\lambda(1-\lambda)^{t-1}u_{i}(\mathbf{p}(t)) & \mathrm{for~the~DRG} \end{array} $$
((21))

In RG with complete information and full monitoring, the Folk theorem characterizes the set of possible equilibrium utilities. It ensures that the set of NE in a RG is precisely the set of feasible and individually rational outcomes of the one-shot game [24, 25]. A cooperation/punishment plan is established between the players before playing [18]. The players cooperate by always transmitting at the new OP with powers \(\tilde {p}_{i}\). When the power of the received signal changes, a deviation is then detected and the players punish the deviator by transmitting with their maximum transmit power \(P_{i}^{\max }\) in the FRG and by playing at the one-shot game in the DRG. In what follows, we give the equilibrium solution of each repeated game model and mention the corresponding algorithm [2729]. It is important to note that in contrast with iterative algorithms (e.g., iterative water-filling type algorithms), there is no convergence problem in repeated games (FRG and DRG). Indeed, the transmitters implement an equilibrium strategy (referred to as the operating point) at every stage of the repeated game.

4.2.1 Finite RG

The FRG is characterized by the minimum number of stages (T min). If the number of stages in the game T verifies T>T min, a more efficient equilibrium point can be reached. However, if it is less than T min, the NE is then played. Assuming that channel gains |g i |2 lie in a compact set \(\left [\nu _{i}^{\min }, \nu _{i}^{\max }\right ]\) [18], we have the following proposition [19]:

Proposition 1 (FRG equilibrium).

: When supposing the following condition is met: TT min with:

$$ T_{\min} = \left\lceil \frac{\Theta}{\Lambda-\Omega}\right\rceil, $$
((22))

such that:

$$\begin{array}{lcl} \Theta & = & {\frac{A\nu_{i}^{\max}}{{b\nu_{i}^{\min}\ +\ \bar{\gamma}_{i}\sigma^{2}B}}- \frac{G\nu_{i}^{\max}}{{b\nu_{i}^{\min}\ +\ \tilde{\alpha}H}}}\\ \Lambda & = & {\frac{E\nu_{i}^{\min}}{{b\nu_{i}^{\max}\ +\ \gamma_{i}^{\ast}\left(\sigma^{2}\ +\ \sum_{j \neq i}{p_{j}^{\ast} \nu_{i}^{\max}}\right)F}}}\\ \Omega & = & {\frac{C\nu_{i}^{\min}}{{b\nu_{i}^{\max}\ +\ \widehat{\gamma}_{i}\left(\sigma^{2}\ +\ \sum_{j \neq i}{p_{j}^{\max} \nu_{i}^{\max}}\right)D}}}\\ \end{array} $$

Then, the NE corresponding to the T-stage FRG is given by the following action plan for any (T,T min) and t≥1:

$$ \delta_{\text{\textit{i,t}}} : \left | \begin{array}{ll} \tilde{p}_{i} & \text{for }t \in \{1,2,\ldots,T-T_{\min}\}\\ p_{i}^{\ast} & \text{for }t \in \{T-T_{\min}+1,\ldots,T\}\\ P_{i}^{\max} & \text{for any deviation detection} \end{array} \right. $$
((23))

The quantities A, B, C, D, E, F, G, and H are defined in Appendix and \(\gamma _{i}^{\ast }\) is the SINR at the NE while \(\bar {\gamma }_{i}\) and \(\widehat {\gamma _{i}}\) are the SINRs related to the maximal utility and the utility min-max respectively (the proof of this proposition is detailed in [19]). The corresponding algorithm is as follows.

4.2.2 Discounted RG

In the DRG, the probability that the game stops at stage t is λ(1−λ)t−1 with λ] 0,1[ defines the discount factor [17]. Accordingly, we can express the analytic form of the maximum discount factor in a DRG when assuming that channel gains |g i |2 lie in a compact set \(\left [\nu _{i}^{\min }, \nu _{i}^{\max }\right ]\) [18].

Proposition 2 (DRG equilibrium).

: When assuming the following condition is met:

$$ {\lambda \leq \frac{\Psi}{\Gamma+\Psi}}, $$
((24))

with:

$$\begin{array}{lcl} \Gamma & = & {\frac{A\nu_{i}^{\max}}{{b\nu_{i}^{\min}\ +\ \bar{\gamma}_{i}\sigma^{2}B}} - \frac{G\nu_{i}^{\max}}{{b\nu_{i}^{\min}\ +\ \tilde{\alpha}H}}}\\ \Psi & = & {\frac{G\nu_{i}^{\min}}{{b\nu_{i}^{\max}\ +\ \tilde{\alpha}H}}-\frac{E\nu_{i}^{\min}}{{b\nu_{i}^{\max}\ +\ \gamma_{i}^{\ast}\left(\sigma^{2}\ +\ \sum_{j \neq i}{p_{j}^{\ast} \nu_{i}^{\max}}\right)F}}} \end{array} $$

Then, the NE corresponding to the DRG is given by the following action plan t≥1:

$$ \delta_{\text{\textit{i,t}}} = \left | \begin{array}{ll} \tilde{p}_{i} & \text{when all other players play }\tilde{\mathbf{p}}_{-i}\\ p_{i}^{\ast} & \text{else } \end{array} \right. $$
((25))

For the proof, see Appendix. The corresponding algorithm is as follows.

5 Numerical results

In this section, we consider the efficiency function f(x)=e c/x with \(c=2^{\frac {R}{R_{0}}}-1\). It has been proven in [30, 31] that such a function is sigmoidal as it is convex on the open interval (0,c/2] and concave on (c/2,+). The throughput R and the used bandwidth R 0 are equal to 1 Mbps and 1 MHz, respectively. The maximum power P max is set to 0.1 Watt while the noise variance is set to 10−3 W. The buffer size K, the packet arrival rate q and the consumed power b are fixed to 10, 0.5 and 5×10−3 W, respectively. We consider Rayleigh fading channels and a spreading factor L introducing an interference processing (1/L) in the interference term of the SINR.

In Fig. 2, we present the achievable utility region, the new OP, the NE, and the NB solution. We stress that the new OP and the NB solution dominate both the NE in the sense of Pareto. The region between the Pareto frontier and the min-max level is the possible set of equilibrium utilities of the RG according to the Folk theorem.

Fig. 2
figure 2

Pareto-dominance of the new OP and the NB solution versus the NE (L=2)

In order to study the efficiency of the new OP versus the NB solution and the NE, we are interested in comparing powers and utilities of the three equilibria by averaging over channel gains for different scenarios (different number of users N in the system). In Fig. 3, we plot the power and the utility that a user (in a system of N users) can reach for each equilibrium. Thus, we highlight that the new OP and the NB solution have better performances than the NE as they Pareto-dominate it. When N=2, we notice that the new OP and the NB solution are more efficient than the NE. It is clear that the NB solution requires less power and provides higher utility compared to the new OP, but it is important to stress that values, in terms of powers and utilities, are slightly different for both equilibria (new OP and NB solution). When N>2, we highlight that lower powers are provided with the new OP which leads also to higher values of the utilities. Thus, we notice that the new OP gives better performances than the NE and the NB solution. Therefore, the new OP contributes not only to improve the system performances better than the NE for any given scenario but also enables important gains in terms of powers and utilities when compared to the NB solution for a system with a large number of users (N>2).

Fig. 3
figure 3

Better performances in terms of power and utility with the new OP for different number of users N

We are interested in studying the performances of the social welfare \(\left (\sum _{i} u_{i}\right)\) according to the FRG versus the NE in a multi-users system. The corresponding expression is given by:

$$ {}{\fontsize{9pt}{9.6pt}\selectfont{\begin{aligned} \frac{w_{FRG}}{w_{NE}}\,=\, \frac{\sum_{i=1}^{N}\left(\sum_{t=1}^{T-T_{\min}}\tilde{u}_{i}{(\mathbf{p}(t))} +\sum_{t=T-T_{\min}+1}^{T}u^{\ast}_{i}{(\mathbf{p}(t))}\!\right)}{\sum_{i=1}^{N}\sum_{t=1}^{T}u^{\ast}_{i}{(\mathbf{p}(t))}}. \end{aligned}}} $$
((26))

In Fig. 4, we present the ratio of the social welfare corresponding to the FRG (ω FRG ) versus the NE social welfare (ω NE ). We proceed by averaging over channel gains lying in a compact set such that 10 log10(ν max/ν min)=20. We highlight that the social welfare of the FRG reaches higher values than the NE (ω FRG >ω NE ). In addition, we notice that the social welfare ratio increases with the number of users for both models (Goodman and cross-layer). The minimum number of stages T min according to the cross-layer model is much lower compared to the one related to the Goodman model. To illustrate this, when N=3,T min for the Goodman model is equal to 4600 while it is 3700 for the cross-layer model. This difference becomes considerable with the increase of the number of users. Indeed, when N=4, the minimum number of stages for the Goodman EE is 14,300 while it is equal to 10,900 for the cross-layer approach.

Fig. 4
figure 4

Improvement of the social welfare in FRG versus the NE as a function of the number of stages of the game T (L=5)

We are interested in plotting the minimum number of stages as a function of the consumed power b and the packet arrival rate q according to both EE models. Results, obtained by averaging over channel realizations, are drawn in Figs. 5 and 6. According to Fig. 5, we stress that T min increases with the number of users while it decreases with the spreading factor. It is clear that for any values of N and L, it exists a consumed power b≠0 for which T min is less than T min when b=0. Thus, a good choice of the fixed consumed power leads to a lower minimum number of stages for the cross-layer model compared to the Goodman model.

Fig. 5
figure 5

Existence of T min for the cross-layer model (b≠0) lower than T min of Goodman model (b=0)

Fig. 6
figure 6

Lower values of T min of the cross-layer model when comparing to Goodman model (L=5)

In Fig. 6, we highlight that the minimum number of stages is an increasing function of the packet arrival rate q according to the cross-layer model while it is a constant function for the Goodman model since the latter does not take into account the packet arrival process. One can confirm that the minimum number of stages is an increase function of the number of users as deduced previously. Simulations show that it exists a packet arrival rate q 0 before which T min of the cross-layer model is much lower than T min of the Goodman model for different number of users. Simulations show that q 0≈0.6 and for qq 0,T min of the cross-layer model converges to T min corresponding to the Goodman model. It is important to highlight that when N=3 and qq 0,T min of the cross-layer model takes higher values than T min corresponding to the Goodman model but values are quite similar. With the increase of the number of users, the difference between the minimum number of stages for both models becomes noticeable. According to Figs. 5 and 6, one can conclude that the cross-layer model can be exploited for short games.

For the DRG model, we plot in the first step the improvement of the social welfare (ω DRG ) versus the one-shot game (ω NE ) for Goodman and cross-layer models (b=0 and b=5×10−3, respectively) as a function of the spectral efficiency η=N/L. We simulated our algorithm by averaging over channel gains for different number of users. Results are given in Fig. 7. It is important to highlight that the DRG social welfare reaches higher values than the NE social welfare (ω DRG >ω NE ). For low values of the spectral efficiency, the social welfare ratio is quite similar for both models while the difference becomes noticeable when the spectral efficiency takes higher values. The social welfare ratio increases with the number of users for both EE models. For each model, when N takes high values, the social welfare ratios become closer (for the cross-layer model, the curves corresponding to N=3 and N=4 are closer than with the curve of N=2).

Fig. 7
figure 7

Improvement of the social welfare in DRG versus the NE for Goodman and cross-layer models as a function of the spectral efficiency η for different number of users N

For this reason, we studied the variation of λ max as a function of η and q for both EE models and for different number of users. Results are given in Figs. 8 and 9. According to Fig. 8, we deduce how λ max decreases with the number of users for both EE models. In addition, we stress that the values reached by λ max becomes closer when N takes higher values. This can explain Fig. 7.

Fig. 8
figure 8

Variation of λ max for Goodman and cross-layer models as a function of the spectral efficiency η with different number of users N

Fig. 9
figure 9

Variation of λ max as a function of the packet arrival rate q (L=2)

The study of the variation of λ max versus the packet arrival rate q (in Fig. 9) shows that the maximum discount factor λ max decreases with the number of users and with the packet arrival rate q as well. Simulations show that it exists a packet arrival rate q 1 before which the λ max corresponding to the cross-layer model takes higher values than the maximum discount factor of the Goodman model for different numbers of users. We notice that starting from q 1, the maximum discount factor of the cross-layer model converges to λ max corresponding to the Goodman model.

In a second step, we plotted in Fig. 10 the variation of the DRG social welfare as a function of λλ max. We notice that ω DRG is an increase function of λ. Thus, when λ=λ max,ω DRG reaches highest value. However, we stress that ω DRG decreases with the number of users especially for the Goodman model while it is quite similar for the cross-layer model. This confirms that the proposed new OP is still quite efficient and can be utilized for games with high number of users.

Fig. 10
figure 10

Increase of ω DRG as a function of λ (L=2)

Finally, we plot for both RG models (FRG and DRG) in Figs. 11 and 12 the social welfare when using the cross-layer approach against the constant power b for two different values of the packet arrival rate q (0.5 and 0.7). The considered system is composed of two users and the spreading factor L is fixed to 4. The idea consists in studying the efficiency of the cross-layer approach regarding the Goodman power control algorithm. Accordingly, for each packet arrival rate, we plot the social welfare with the cross-layer approach (powers at the equilibrium are determined normally according to q) and the social welfare with the cross-layer power control but when powers at the equilibrium are determined by the Goodman algorithm (p[q→1]). Indeed, the packet arrival rate is assumed constant in the Goodman model and equal to 1 (packets arrive with probability q=1). For both RG models, we stress that the cross-layer power control approach outperforms the Goodman algorithm for both values of the packet arrival rate q. Important (relative) gains are reached. To illustrate this, for q=0.5 and b=0.045 W the relative gain is higher than 50 % in the FRG and the DRG as well. Therefore, we conclude that the OP with the cross-layer approach provides better performances and is more efficient than the OP with the Goodman power control approach.

Fig. 11
figure 11

Variations of the FRG social welfare against b for q=0.5 and q=0.7: the cross-layer power control approach outperforms the Goodman algorithm

Fig. 12
figure 12

Plotting the DRG social welfare against b for q=0.5 and q=0.7: the cross-layer approach improves the power control when compared to the Goodman algorithm

6 Conclusions

In this paper, we have investigated RG for distributed power control in a MAC system. As the NE is not always energy-efficient, the NB solution might be a possible efficient solution since it is Pareto-efficient. However, the latter, in general, requires global CSI at each transmitter node. Thus, we were motivated to investigate using the repeated game formulation and develop a new OP that simultaneously is both more efficient than the NE and achievable with only individual CSI being required at the transmitter. Also, we consider a new EE metric taking into account the presence of a queue at the transmitter with an arbitrary packet arrivals.

Cooperation plans are proposed where the new OP is considered and closed-form expressions of the minimum number of stages for the FRG and the maximum discount factor for the DRG have been established. The study of the social welfare (sum of utilities of all the users) shows that considerable gains are reached compared to the NE (for the FRG and DRG). Moreover, our model proves that even with a high number of users, the FRG can always be played with a minimum number of stages shorter than when using the Goodman model. In addition, the social welfare in the DRG decreases slightly with the number of users with the cross-layer approach while it decreases considerably with the Goodman model. Finally, the comparison of the cross-layer algorithm versus the Goodman algorithm shows that in real systems with random packet arrivals, the cross-layer power control algorithm outperforms the Goodman algorithm. Thus, the new OP with the cross-layer approach is more efficient. An interesting extension to this work would be to consider the interference channel instead of the MAC channel and generalize the framework applied here. Another possible extension would be to consider the multi-carrier case and the resulting repeated g ame.

7 Appendix

7.1 Proof of λ max

7.1.1 Determination of the maximal utility

Let us determine the maximal utility that a player can get and which is denoted as follows:

$$ \begin{array}{lcl} {\bar{u}_{i} }&= &{\max_{\mathbf{p}_{-i}}\max_{p_{i}}u_{i}({p_{i},\mathbf{p}_{-i}})}.\\ \end{array} $$
((27))

We denote \(\dot {p}_{i}\) the power maximizing the utility function u i and which is the solution of the following equation:

$$ {b\frac{\gamma_{i}}{p_{i}}\Phi^{\prime}(\gamma_{i})+q\left(\frac{1-\Phi(\gamma_{i})}{f(\gamma_{i})}\right)^{2} \left[\,f(\gamma_{i})-\gamma_{i} f^{\prime}(\gamma_{i})\right]=0}, $$
((28))

with \({\gamma _{i}^{\prime } = {\frac {\mathrm {d} \gamma _{i}}{\mathrm {d} p_{i}}} = {\frac {\gamma _{i}}{p_{i}}}}, {f^{\prime } = {\frac {\mathrm {d} f}{\mathrm {d} \gamma _{i}} }}\) and \({\Phi ^{\prime } = {\frac {\mathrm {d} \Phi }{\mathrm {d} \gamma _{i}}}}\). Therefore, the expression of the maximum utility function writes as:

$$ \dot{u}_{i}({\dot{p}_{i},\mathbf{p}_{-i}}) = {\frac{Rq(1-\phi(\dot{\gamma}_{i}))}{{b+{\frac{\dot{p}_{i}q(1\ -\ \phi(\dot{\gamma}_{i}))}{f(\dot{\gamma}_{i})}}}}}, $$
((29))

with:

$$ {\dot{\gamma}_{i} = \frac{\dot{p}_{i}|g_{i}|^{2}}{\sigma^{2}+\sum_{j \neq i}{p_{j} |g_{j}|^{2}}}}. $$
((30))

We have to study then the behavior of \(\dot {u}_{i}({\dot {p}_{i},\mathbf {p}_{-i}})\) regarding p j for ji, and then we determine the sign of \({\frac {\partial \dot {u}_{i}({\dot {p}_{i},\mathbf {p}_{-i}})}{\partial p_{j}}}\) which is given by:

$$ {\frac{\partial \dot{u}_{i}({\dot{p}_{i},\mathbf{p}_{-i}})}{\partial p_{j}}} = {Rq\frac{{\frac{-b\partial \phi(\dot{\gamma}_{i})}{\partial p_{j}}+ \dot{p}_{i} q \left(\frac{(1\ -\ \phi(\dot{\gamma}_{i}))}{f(\dot{\gamma}_{i})}\right)^{2}\frac{\partial f(\dot{\gamma}_{i})}{{\partial p_{j}}}}}{{\left(b+\frac{\dot{p}_{i} q(1\ -\ \phi(\dot{\gamma}_{i}))}{f(\dot{\gamma}_{i})}\right)^{2}}}}. $$
((31))

We are interested to study the sign of the numerator:

$$ \begin{aligned} \frac{-b\partial \phi(\dot{\gamma}_{i})}{\partial p_{j}}+ \dot{p}_{i} q \left(\frac{(1-\phi(\dot{\gamma}_{i}))}{f(\dot{\gamma}_{i})}\right)^{2}\frac{\partial f(\dot{\gamma}_{i})}{\partial p_{j}} = \\ \left(\frac{-b\partial \phi(\dot{\gamma}_{i})}{\partial \dot{\gamma}_{i}}+ \dot{p}_{i} q \left(\frac{(1-\phi(\dot{\gamma}_{i}))}{f(\dot{\gamma}_{i})}\right)^{2}\frac{\partial f(\dot{\gamma}_{i})}{\partial \dot{\gamma}_{i}}\right)\frac{\partial \dot{\gamma}_{i}}{\partial p_{j}}, \end{aligned} $$
((32))

with:

$$ \frac{\partial \dot{\gamma}_{i}}{\partial p_{j}} = \frac{-\dot{p}_{i}|g_{i}|^{2}|g_{j}|^{2}}{\left(\sigma^{2} +\sum_{j \neq i}{p_{j}} |g_{j}|^{2}\right)^{2}}<0. $$
((33))

The next step would be to determine the sign of the expression \({\frac {-b\partial \phi (\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}} \allowbreak + \dot {p}_{i} q \left (\frac {(1\ -\ \phi (\dot {\gamma }_{i}))}{f(\dot {\gamma }_{i})}\right)^{2}\frac {\partial f(\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}}\). It is obvious that \({\dot {p}_{i} q \left (\frac {(1\ -\ \phi (\dot {\gamma }_{i}))}{f(\dot {\gamma }_{i})}\right)^{2}\frac {\partial f(\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}>0}\) since f is an increasing function of the SINR. Therefore, we need to determine the sign of \({\frac {\partial \phi (\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}}\). We have:

$$ \begin{array}{lcl} {\frac{\partial \phi(\gamma_{i})}{\partial \gamma_{i}}} & = & {\frac{\partial ((1\ -\ f(\gamma_{i}))\Pi(\gamma_{i}))}{\partial \gamma_{i}}}\\ & = & {-\frac{\partial f(\gamma_{i})}{\partial \gamma_{i}}\Pi(\gamma_{i})+(1-f(\gamma_{i}))\frac{\partial \Pi(\gamma_{i})}{\partial \gamma_{i}}}. \end{array} $$
((34))

The sign of the first term is negative while the sign of the second term is the same as Π(γ i )/ γ i since (1−f(γ i ))>0 and we have:

$$ {\frac{\partial \Pi(\gamma_{i})}{\partial \gamma_{i}}} = {\frac{\partial \rho(\gamma_{i})}{\partial \gamma_{i}}\frac{\partial \Pi(\gamma_{i}))}{\partial \rho}}. $$
((35))

However \(\rho (\gamma _{i})={\frac {q(1\ -\ f(\gamma _{i}))}{(1\ -\ q)f(\gamma _{i})}}\) and then:

$$ {\frac{\partial \rho(\gamma_{i})}{\partial \gamma_{i}}} = {\frac{-q}{(1-q)f^{2}(\gamma_{i})}\frac{\partial f(\gamma_{i})}{\partial \gamma_{i}}<0}. $$
((36))

As shown in [13], we have:

$$ \Pi(\gamma_{i}) = \frac{\rho^{K}}{1+\rho+\rho^{2}+\ldots+\rho^{K}}. $$
((37))

The latter quantity can be expressed as:

$$ \frac{1}{\Pi(\gamma_{i})} = 1+\frac{1}{\rho}+\frac{1}{\rho^{2}}+\ldots+\frac{1}{\rho^{K}}. $$
((38))

Consequently, we have:

$$ \frac{\partial \Pi(\gamma_{i})}{\partial \rho} = \Pi^{2}(\gamma_{i})\left[\frac{1}{\rho^{2}}+\frac{2}{\rho^{3}}+\ldots+\frac{K}{\rho^{K+1}}\right]>0. $$
((39))

Therefore, \({\frac {\partial \Pi (\gamma _{i})}{\partial \gamma _{i}}<0}\) and hence \({\frac {\partial \phi (\gamma _{i})}{\partial \gamma _{i}}}<0\). In particular, we have \({\frac {\partial \phi (\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}}<0\). Thus, we have \({\left (\frac {-b\partial \phi (\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}+ \dot {p}_{i} q \left (\frac {(1\ -\ \phi (\dot {\gamma }_{i}))}{f(\dot {\gamma }_{i})}\right)^{2}\frac {\partial f(\dot {\gamma }_{i})}{\partial \dot {\gamma }_{i}}\right)}>0\) and finally \({\frac {\partial \dot {u}_{i}({\dot {p}_{i},\mathbf {p}_{-i}})}{\partial p_{j}}}<0\). We deduce then that \(\dot {u}_{i}\) is a decreasing function of p j . It reaches its maximum when p j =0 and it is minimum when \(p_{j}=p_{j}^{\max }\) (for all ji). When substituting p j =0 in the SINR expression, this allows the determination of the optimal power:

$$ \begin{aligned} b\frac{|g_{i}|^{2}}{\sigma^{2}}\Phi^{\prime}(\gamma_{i}(p_{i}))&+ q\left(\frac{(1-\Phi(\gamma_{i}(p_{i})))}{f(\gamma_{i}(p_{i}))}\right)^{2} \\ &\left[\,f(\gamma_{i}(p_{i}))\right.-\gamma_{i} f^{\prime}\left.(\gamma_{i}(p_{i}))\right]=0, \end{aligned} $$
((40))

with: \({\gamma _{i} = \frac {p_{i}|g_{i}|^{2}}{\sigma ^{2}}}\).

The latter equation is a function of the SINR. We determine then the solution in terms of SINR which we denote \(\bar {\gamma }_{i}\) and for which the optimal power is \({\bar {p}_{i}=\frac {\bar {\gamma }_{i}\sigma ^{2}}{|g_{i}|^{2}}}\). This SINR exists due to the quasi-concavity of u i in (p i ,p i ) [13, 14]. Then, we have:

$$ \bar{u}_{i} = \max_{\mathbf{p}}u_{i}(\mathbf{p}) = {\frac{Rq(1-\phi(\bar{\gamma}_{i}))}{{b+\frac{\bar{\gamma}_{i}\sigma^{2}}{|g_{i}|^{2}} \frac{q(1\ -\ \phi(\bar{\gamma}_{i}))}{f(\bar{\gamma}_{i})}}}}. $$
((41))

7.1.2 Determination of λ max

The SINR \(\tilde {\gamma }_{i}\) refers to the SINR when playing the new OP while \(\gamma _{i}^{\ast }, \bar {\gamma }_{i}\), and \(\widehat {\gamma _{i}}\) are the SINRs at the NE, at the maximal utility and at the utility min-max respectively. In order to simplify expressions, we define the following notations:

$$\begin{array}{lcl} A & = & Rq(1-\phi(\bar{\gamma}_{i}))\\ B & = & {\frac{q(1\ -\ \phi(\bar{\gamma}_{i}))}{f(\bar{\gamma}_{i})}}\\ C & = & Rq(1-\phi(\widehat{\gamma}_{i}))\\ D & = & {\frac{q(1\ -\ \phi(\widehat{\gamma}_{i}))}{f(\widehat{\gamma}_{i})}}\\ E & = & Rq(1-\phi(\gamma_{i}^{\ast}))\\ F & = & {\frac{q(1\ -\ \phi(\gamma_{i}^{\ast}))}{f(\gamma_{i}^{\ast})}}\\ G & = & Rq(1-\phi(\tilde{\gamma}_{i}))\\ H & = & {\frac{q(1\ -\ \phi(\tilde{\gamma}_{i}))}{f(\tilde{\gamma}_{i})}} \end{array} $$

At a stage t, the equilibrium condition is [18]:

$$ \begin{array}{c} \lambda \bar{u}_{i}(\mathbf{p}(t)) + \sum_{s\geq t+1}{\lambda (1-\lambda)^{s-t}\mathbb{E}_{g}\left[u_{i}^{\ast}(\mathbf{p}(s))\right]} \\ \leq \lambda \tilde{u}_{i}(\mathbf{p}(t)) + \sum_{s\geq t+1}{\lambda (1-\lambda)^{s-t}\mathbb{E}_{g}\left[\tilde{u}_{i}(\mathbf{p}(s))\right]} \end{array} $$
((42))

Knowing that \(\sum _{s\geq t+1}{(1-\lambda)^{s-t}} = (1-\lambda)/\lambda \), we have:

$$ \lambda \bar{u}_{i} + (1-\lambda)\mathbb{E}_{g}\!\left[u_{i}^{\ast}\right] \leq \lambda \tilde{u}_{i}+(1-\lambda)\mathbb{E}_{g}\!\left[\tilde{u}_{i}\right] $$
((43))
$$ \begin{array}{c} {}\Longleftrightarrow \lambda {\frac{A|g_{i}|^{2}}{{b|g_{i}|^{2}+\bar{\gamma}_{i}\sigma^{2}B}}} + (1-\lambda)\mathbb{E}_{g}\left[{\frac{E|g_{i}|^{2}}{{b|g_{i}|^{2}+\gamma_{i}^{\ast}\left(\sigma^{2}+\sum_{j \neq i}{p_{j}^{\ast} |g_{j}|^{2}}\right)F}}}\right] \\ \leq \lambda {\frac{G|g_{i}|^{2}}{{b|g_{i}|^{2}+\tilde{\alpha}H}}}+(1-\lambda)\mathbb{E}_{g}\left[ {\frac{G|g_{i}|^{2}}{{b|g_{i}|^{2}+\tilde{\alpha}H}}}\right] \end{array} $$
((44))
$$ \begin{array}{c} \Rightarrow \lambda \left[{\frac{A\nu_{i}^{\max}}{{b\nu_{i}^{\min}+\bar{\gamma}_{i}\sigma^{2}B}} - \frac{G\nu_{i}^{\max}}{{b\nu_{i}^{\min}+\tilde{\alpha}H}}} \right] \\ \leq (1-\lambda)\left[{\frac{G\nu_{i}^{\min}}{{b\nu_{i}^{\max}+\tilde{\alpha}H}} - \frac{E\nu_{i}^{\min}}{{b\nu_{i}^{\max}+\gamma_{i}^{\ast}\left(\sigma^{2}+\sum_{j \neq i}{p_{j}^{\ast} \nu_{i}^{\max}}\right)F}}}\right]. \end{array} $$
((45))

Let Ψ and Γ define the following quantities:

$$\begin{array}{lcl} \Gamma & = & {\frac{A\nu_{i}^{\max}}{{b\nu_{i}^{\min}\ +\ \bar{\gamma}_{i}\sigma^{2}B}} - \frac{G\nu_{i}^{\max}}{{b\nu_{i}^{\min}\ +\ \tilde{\alpha}H}}}\\ \Psi & = & {\frac{G\nu_{i}^{\min}}{{b\nu_{i}^{\max}\ +\ \tilde{\alpha}H}}-\frac{E\nu_{i}^{\min}}{{b\nu_{i}^{\max}\ +\ \gamma_{i}^{\ast}\left(\sigma^{2}\ +\ \sum_{j \neq i}{p_{j}^{\ast} \nu_{i}^{\max}}\right)F}}} \end{array} $$

Thus:

$$ {{\lambda_{\max} = \frac{\Psi}{\Gamma+\Psi}}}. $$
((46))

References

  1. D Lister, in Proc. IEEE Internat. Conf. on Comm. Workshops (ICC Workshops 2009). An Operators View on Green Radio (1st Int. Workshop on Green Comm., 2009).

  2. J Palicot, C Roland, On the Use of Cognitive Radio for Decreasing the Electromagnetic Radiations, URSI 05, XXVIII General Assembly (2005).

  3. GreenTouch, Communications Turns Totally Green (Press Release, 2010).

  4. Y Yu, VK Prasanna, B Krishnamachari, Energy minimization for real-time data gathering in wireless sensor networks. IEEE Trans. Wirel. Commun. 5(11), 3087–3096 (2006).

    Article  Google Scholar 

  5. S Bandyopadhyay, EJ Coyle, in Twenty-Second Annual Joint Conference of the IEEE Computer and Communications, 3. An Energy Efficient Hierarchical Clustering Algorithm for Wireless Sensor Networks, INFOCOM 2003 (IEEE Societies, 2003), pp. 1713–1723.

  6. M Cardei, MT Thai, Y Li, W Wu, in 24th Annual Joint Conference of the IEEE Computer and Communications Societies, 3. Energy-Efficient Target Coverage in Wireless Sensor Networks, INFOCOM 2005 (Proceedings IEEE, 2005), pp. 1976–1984.

  7. A Zappone, E Jorswieck, Energy Efficiency in Wireless Networks via Fractional Programming Theory. Foundations and Trends in Commun Inf Theory. 11(3–4), 185–396 (2015).

    Article  Google Scholar 

  8. C Isheden, Z Chong, E Jorswieck, G Fettweis, Framework for Link-Level Energy Efficiency Optimization with Informed Transmitter. IEEE Trans. Wirel. Commun. 11(8), 2946–2957 (2012).

    Google Scholar 

  9. S Lasaulce, Y Hayel, RE Azouzi, M Debbah, Introducing hierarchy in energy games. IEEE Trans. Wirel. Commun. 8(7), 3833–3843 (2009).

    Article  Google Scholar 

  10. DJ Goodman, NB Mandayam, Power control for wireless data. IEEE Pers. Commun. 7(2), 48–54 (2000).

    Article  Google Scholar 

  11. SM Betz, HV Poor, Energy efficient communications in CDMA networks: a game theoretic analysis considering operating costs. IEEE Trans. Signal Proc. 56(10), 5181–5190 (2008).

    Article  MathSciNet  Google Scholar 

  12. A Zappone, Z Chong, E Jorswieck, S Buzzi, Energy-aware competitive power control in relay-assisted interference wireless networks. IEEE Trans. Wirel. Commun. 12(4), 1860–1871 (2013).

    Article  Google Scholar 

  13. VS Varma, S Lasaulce, Y Hayel, SE Elayoubi, M Debbah, Cross-layer design for green power control. IEEE Int. Conf. Commun. (ICC), 4021–4026 (2012).

  14. VS Varma, S Lasaulce, Y Hayel, SE Elayoubi, A Cross-Layer Approach for Distributed Energy-Efficient Power Control in Interference Networks. IEEE Trans. Veh. Technol (2014).

  15. M Mhiri, K Cheikhrouhou, A Samet, F Mériaux, S Lasaulce, in 6th International Conference on Network Games, Control and Optimization (NETGCOOP). Energy-Efficient Spectrum Sharing in Relay-Assisted Cognitive Radio Systems, (2012), pp. 86–91.

  16. S Lasaulce, M Debbah, E Altman, Methodologies for analyzing equilibria in wireless games: a look at pure, mixed, and correlated equilibria. IEEE Signal Process. Mag. 26(5), 41–52 (2009).

    Article  Google Scholar 

  17. S Lasaulce, H Tembine, Game Theory and Learning for Wireless Networks: Fundamentals and Applications (Academic Press, Elsevier, 2011).

  18. M Le Treust, S Lasaulce, A repeated game formulation of energy-efficient decentralized power control. IEEE Trans. Wirel. Commun. 9(9), 2860–2869 (2010).

    Article  Google Scholar 

  19. M Mhiri, VS Varma, M Le Treust, S Lasaulce, A Samet, in First International Black Sea Conference on Communications and Networking (BlackSeaCom). On the Benefits of Repeated Game Models for Green Cross-Layer Power Control in Small Cells, (2013), pp. 137–141.

  20. M Abidi, VT Vakili, in IEEE 18th International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC). A Game Theoretic Approach for SINR-Constrained Power Control in 3G Cellular CDMA Communication Systems, (2007), pp. 1–5.

  21. E Hossain, D Niyato, Z Han, Dynamic Spectrum Access and Management in Cognitive Radio Networks (United States of America by Cambridge University Press, New York, 2009).

    Book  Google Scholar 

  22. EG Larsson, EA Jorswieck, Competition versus cooperation on the MISO interference channel. IEEE J. Selected Areas in Commun. 26(7), 1059–1069 (2008).

    Article  Google Scholar 

  23. J Friedman, A non-cooperative equilibrium for supergames. Rev. Econ. Stud. 38(1), 1–12 (1971).

    Article  MATH  Google Scholar 

  24. S Hart, Robert Aumann’s game and economic theory. Scand. J. Econ. 108(2), 185–211 (2006).

    Article  Google Scholar 

  25. S Sorin, in Handbook of game theory, 1, ed. by RJ Aumann, S Hart. Repeated games with complete information (Elsevier Science Publishers, 1992), pp. 72–107.

  26. RJ Aumann, LS Shapley, Long-Term Competition-A Game-Theoretic Analysis. preprint, 1–26 (1976).

  27. Y Xu, J Wang, Q Wu, A Anpalagan, YD Yao, Opportunistic spectrum access in unknown dynamic environment: A game-theoretic stochastic learning solution. IEEE Trans. Wirel. Commun. 11(4), 1380–1391 (2012).

    Article  Google Scholar 

  28. H Li, Multi-agent Q-learning for Aloha-like spectrum access in cognitive radio systems. EURASIP J. Wirel. Commun. Netw. 2010:, 1–15 (2010).

    Article  Google Scholar 

  29. Y Song, SHY Wong, KW Lee, in MobiCom’11 Proceedings of the 17th Annual International Conference on Mobile Computing and Networking. Optimal Gateway Selection in Multidomain Wireless Networks: A Potential Game Perspective, (2011), pp. 325–336.

  30. EV Belmega, S Lasaulce, in VALUETOOLS’09 Proceedings of the 4th International ICST Conference on Performance Evaluation Methodologies and Tools. An Information-Theoretic Look at MIMO Energy-Efficient Communications, (2009).

  31. EV Belmega, S Lasaulce, Energy-efficient precoding for multiple-antenna terminals. IEEE Trans. Signal Proc. 59(1), 329–340 (2011).

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mariem Mhiri.

Additional information

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mhiri, M., Varma, V.S., Cheikhrouhou, K. et al. Cross-layer distributed power control: a repeated game formulation to improve the sum energy efficiency. J Wireless Com Network 2015, 257 (2015). https://doi.org/10.1186/s13638-015-0486-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13638-015-0486-z

Keywords