Adaptive access and rate control of CSMA for energy, rate, and delay optimization

Khodaian, Mahdi; Pérez, Jesús; Khalaj, Babak H; Crespo, Pedro M

doi:10.1186/1687-1499-2012-27

Research
Open access
Published: 30 January 2012

Adaptive access and rate control of CSMA for energy, rate, and delay optimization

Mahdi Khodaian¹,
Jesús Pérez²,
Babak H Khalaj¹ &
…
Pedro M Crespo³

EURASIP Journal on Wireless Communications and Networking volume 2012, Article number: 27 (2012) Cite this article

2564 Accesses
Metrics details

Abstract

In this article, we present a cross-layer adaptive algorithm that dynamically maximizes the average utility function. A per stage utility function is defined for each link of a carrier sense multiple access-based wireless network as a weighted concave function of energy consumption, smoothed rate, and smoothed queue size. Hence, by selecting weights we can control the trade-off among them. Using dynamic programming, the utility function is maximized by dynamically adapting channel access, modulation, and coding according to the queue size and quality of the time-varying channel. We show that the optimal transmission policy has a threshold structure versus the channel state where the optimal decision is to transmit when the wireless channel state is better than a threshold. We also provide a queue management scheme where arrival rate is controlled based on the link state. Numerical results show characteristics of the proposed adaptation scheme and highlight the trade-off among energy consumption, smoothed data rate, and link delay.

1. Introduction

In wireless networks, mobile devices are usually battery powered with a limited amount of energy. Therefore, minimization of energy consumption while maintaining the quality of service in the network is crucial. This must be accomplished by adapting the transmission parameters to the system dynamics and to the time-varying channel of the links. In this article, we present a cross-layer adaptive algorithm that dynamically maximizes the average utility function of a carrier sense multiple access (CSMA)-based wireless link.

Benefits of such adaptation schemes are shown in some prior works in terms of energy efficiency [1–8]. In such works various control algorithms have been proposed that trade-off among different goals such as energy consumption, average delay, packet dropping probability and bit error rate, and dynamically adapt the transmission parameters to the channel and system state. The aforementioned works assume point-to-point links with dedicated channels. However, in data transmission networks, where data are generated at random time instances, random access schemes are used to efficiently exploit channel resources. In such systems, there are more users than available channels, and at any given time only a subset of users can access the channels. Therefore, the optimality of channel access decision is crucial in random access networks. Random access is widely used in ad hoc networks as it can be implemented in a distributed manner. Wireless local area networks (WLAN) and practical personal or sensor networks usually use random access control in their ad hoc operation mode [9, 10]. On the other hand, it is shown recently that CSMA protocols can achieve maximum stable throughput [11] while keeping bounded queuing delay [12], and it can achieve a collision free WLAN [13].

Optimization of random access networks was first proposed in order to achieve single hop proportional fairness for slotted ALOHA networks [14]. Different types of fairness are also considered and random access control is modeled as a utility maximization problem in [15]. In addition, the cross-layer optimization problem of random access control and transmission control protocol is solved as a network utility maximization problem [16]. Newton-like algorithms are also provided for energy and throughput optimization with end-to-end delay constraint in multi hop random access network [17]. However, in the aforementioned articles static transmission probability was used and opportunity of time varying and adaptive control was ignored.

On the other hand, queue-based random access algorithms were studied in [18], where access probabilities are assumed to be adapted based on queue sizes. Stability of the proposed algorithms was verified and their delay performance was shown to surpass fixed optimization algorithms. Also a heuristic differential queue-based scheduling algorithm is proposed in [19] which shows superior performance compared to 802.11 through experimental results. However, such queue-based algorithms are inappropriate for fading channels and prioritize links with low channel quality, which results in low energy efficiency [20].

In this article, we propose cross-layer adaptive algorithms; derived from dynamic programming, for distributed optimization of the links in CSMA-based wireless networks operating in mobile environments. As a performance metric, we define the per stage utility of the link as a weighted concave function of energy consumption, smoothed data rate, and smoothed queue size in the link, where the weights are assigned based on the desired tradeoff among them. The algorithms maximize the average utility by dynamically adapting the channel access decision and transmit data rate (by selecting different modulation and coding schemes) according to the queue size of the link and the availability and quality of the time-varying channel (channel state is assumed to be known at the transmitter). Both, finite-time horizon (FTH) and infinite-time horizon (ITH) problems are considered. In the first case, the utility sum is maximized for a finite time period, whereas in the second case, the long-term average utility is maximized.

We consider a mobile environment with frequency-flat time-varying channel response. This requires suitable models of the wireless channel dynamics. Here, we use finite-state Markov chains (FSMC) to model channel dynamics, such that channel time-correlation at network links is partially exploited by the proposed algorithms. Although the physical wireless channel is inherently non-Markovian, it has been shown that stationary Markov chains can capture the essence of the channel dynamics [21]. Many transmission adaptation algorithms are based on first-order Markov channel models [1, 2]. Here, we consider first- and second-order Markov chains to model characteristics of network links.

The numerical simulations show the benefits of the proposed adaptation algorithms in terms of energy efficiency, and highlight the trade-off among energy consumption, smoothed data rate, and delay in links of a CSMA network. They also show that the use of suitable Markov model for the wireless channel improves performance of the adaptation algorithm, mainly for slow fading channels. Algorithms based on uncorrelated, first- and second-order Markov models are considered and their performance is compared through simulations.

The rest of the article is organized as follows. Section 2 presents the system model and in particular it describes the model of the network links as well as wireless channel models. In Section 3, per stage utility of the links is defined. Consequently, the utility sum maximization for a finite time period is formulated as an optimal finite-horizon control problem. Similarly, the long-term average utility maximization is formulated as an optimal infinite-horizon control problem. Section 4 uses dynamic programming to compute the optimal adaptation policies for the problems formulated in Section 3. We have investigated structural properties of the optimal solution in Section 5. Numerical results and comparisons are described in Section 6. Finally, Section 7 concludes the article.

2. System model

In this section, we describe the model of the random access links as well as wireless channel models.

2.1. Link model

We consider an ad hoc network where links use CSMA protocol similar to the one provided in [22] which prevents collision among links and also resolves hidden and exposed node problems which exist in wireless networks [23]. As shown in Figure 1, we assume a slotted transmission model where each timeslot, of duration T_s, contains both a data slot and a number of control mini slots. When the link has a packet to transmit, it should wait for a random value of W control mini-slots, and if no other link has reserved the channel earlier, it will send a short request to send packet to reserve the channel. Then, the potential receiver which also perceives that the channel is idle will response with a clear to send (CTS) packet that allows the transmitter to transmit and informs possible interfering nodes that the channel will be used. Once the transmitter receives the CTS, it sends its packet in the data slot.

Timeslot k is defined as the time interval [(k - 1)T_s, kT_s). We use I_k to denote the channel access, where I_k = 1 indicates that the link has decided to access the channel at the k th timeslot. The control policy adapts I_k in each slot based on the system and channel state. Also B_k = 1 indicates that the link should delay its transmission because the channel is already occupied by another link. We model B_k as a Bernoulli process where P_B Pr{B_k = 1} is the channel occupancy probability. The Bernoulli distribution is widely used to model the statistics of B_k in CSMA networks [24].

The link has a queue of maximum size L. Let q_k denote the number of packets in the queue at the k th timeslot, which is assumed to be known at the transmitter. Obviously, I_k = 0 when q_k = 0. r_k denotes the controlled number of packets that arrive the queue in slot k, which we will call arrival rate hereafter. The value of r_k should be chosen both to provide suitable rate for source data and to prevent delay due to backlog through adapting source rate to the link state [25]. To avoid buffer overflow the arrival rate is constrained by r_k ≤ (L - q_k ). The queue update equation is

q_{k + 1} = q_{k} - \min {q_{k}, C_{k}} I_{k} (1 - B_{k}) + r_{k}

(1)

Where C_k indicates the maximum number of packets that can be transmitted during the k th data slot. C_k depends on the channel state, and it is assumed to be known at the transmitter at the beginning of each timeslot. We call the data that the physical layer transmits in one time slot a frame and the link consumes a constant energy e for transmission of frame in the data slot. Thus, the energy consumed in the k th timeslot will be E_k = eI_k (1 - B_k ).

We also consider the exponentially weighted moving average (EWMA) of the queue occupancy ${\bar{q}}_{k}$ and of the arrival rate ${\bar{r}}_{k}$ as the link state variables which are defined as follows

{\bar{q}}_{k + 1} = θ_{q} {\bar{q}}_{k} + (1 - θ_{q}) q_{k + 1}; θ_{q} \in (0, 1)

(2)

{\bar{r}}_{k + 1} = θ_{r} {\bar{r}}_{k} + (1 - θ_{r}) r_{k}; θ_{r} \in (0, 1)

(3)

Note that ${\bar{q}}_{k}$ and ${\bar{r}}_{k}$ can be viewed as "smoothed" measures of the delay and data rate in the link. The parameters θ_q and θ_r determines the time scale over which the smoothing is performed. The smaller the value of θ_r or θ_q , the shorter the time period of moving average (smoothing). Values of θ_r and θ_q are determined based on the tolerance of the applications to the delay and data rate variations in the link. Random early detection protocol has used the EWMA of the delay ( ${\bar{q}}_{k}$ ) as a criterion for congestion control [26]. In addition, the EWMA of the rate (or smoothed rate), ${\bar{r}}_{k}$ , has been used in [27, 28] as a measure of the quality of service. EWMA is also used as a metric in statistical quality control [29].

2.2. Channel model

We consider a frequency-flat block-fading channel, where the channel remains constant during each timeslot, and can change for consecutive timeslots. Therefore, we assume that the duration of each timeslot (T_s ) is less than the coherence time of the channel. Hence, channel responses at different timeslots can be correlated. The channel power gain at the k th timeslot is denoted by γ_k . Since we assume constant transmit power, the received signal-to-noise ratio (SNR) in the link for the k th timeslot will be proportional to γ_k . The fading range 0 ≤ γ is partitioned into M disjoint regions so that the j th region is defined as R_j = {γ : A_j ≤ γ < A_j+1}, where A₁ = 0 and A_M+1= ∞. The channel for the k th timeslot is in state j if γ_k ∈ R_j . Also the values of A_j are selected according to the adaptive modulation and coding as follows. Consider that transmitter has a set of modulation and coding schemes {Q₁, Q₂, ... , Q_M } to select from in each time slot. We select A_j ; j = 2, ... , M such that if channel is in state j, transmitter can use Q_j and ensures that the frames transmitted with this scheme have error probability less than FER_th which is a target threshold for frame error rate (FER).

Let $C = {Ĉ_{j} | j = 1, . . ., M}$ denote the set of number of transmit packets associated with the set of channel states, if γ_k ∈ R_j then $C_{k} = Ĉ_{j}$ where C_k is the number of packets that can be transmitted in the k th timeslot. Note that packet error rate will be below the same threshold, i.e., PER_th = FER_th, since (a) adaptive algorithm applies different Q_i schemes so that transmitter ensures the same error threshold for all frames, and (b) if a frame transmission was unsuccessful all packets in the frame will be lost. Therefore, the ratio of the lost packets to the total number of packets equals the ratio of the erroneous frames to the total number of frames, regardless of the channel state.

PE R_{th} = \sum_{C_{j} = 1}^{M} (Prob {packet in a frame of size Ĉ_{j}} \times FER (Ĉ_{j})) = FE R_{th}

Subsequently, we consider three models for the random process C_k , with diverse degrees of complexity.

1. Uncorrelated model

In this model, the channel response at different timeslots are assumed uncorrelated so, where P_j is the probability of the channel state R_j . This simple model may be accurate for fading channels that exhibit high time-variability. It is also the fitting model when there is no prior information about the channel time correlation.

2. First-order markov model

To model the time correlation of the channel we use an M- state FSMC [30] with time discretized to T_s and transition probabilities as

Accordingly, the random process C_k will be modeled with the same M- state FSMC so:

\Pr {C_{k + 1} = Ĉ_{j} | C_{k} = Ĉ_{i}} = P_{i, j}

(4)

The transition probabilities depend on the normalized Doppler frequency f_dT_s which determines the rate of variation of the channel with respect to the timeslot duration, where f_d is the channel Doppler frequency. Although the physical wireless channel is inherently non-Markovian, it has been shown that an FSMC can capture the essence of the channel dynamics when the number of regions/states (M) is low and the channel fades slow enough (see for example [21] and references therein). Note that the uncorrelated model can be viewed as a particular case of FSMC where P_i,j = P_j , ∀i.

3. Second-order Markov model

In order to model dynamics of C_k more accurately, we also consider second-order FSMC channel models. They are more accurate than the first-order FSMC since C_k+1depends on both C_k and C_k-1.

P_{l, i, j} = \Pr {C_{k + 1} = Ĉ_{j} | C_{k - 1} = Ĉ_{l}, C_{k} = Ĉ_{i}}

(5)

In this article, we use the so-called Cartesian product method [21] for the second-order models. We will investigate the effect of the FSMC order on the performance of the resulting algorithm through numerical results. Note that the formulation of the first-order Markov model can be considered as a special case of the second-order model with P_i,j = P_l,i,j for any l.

3. Problem formulation

We consider a wireless link in a CSMA network which desires to optimize its transmission rate, energy consumption, and delay. We distinguish two dynamic optimization problems: FTH and ITH problems. In the FTH problem, the performance of the link is optimized over a finite number of timeslots, whereas in the ITH problem the link performance is optimized considering an infinite number of timeslots. Next, they are formulated as dynamic programming problems.

3.1. Finite time horizon

We define a utility maximization problem over N timeslots or stages as follows:

\max_{{u_{k}}} \underset{k = 1, ... N}{E_{{C_{k}}}} [g_{N + 1} (s_{N + 1}) + \sum_{k = 1}^{N} g (s_{k}, μ_{k})],

(6)

where the expectation is taken over the random process C_k . The function g(s_k , μ_k ) is the utility per stage and is a measure of the quality of service of the link at each timeslot. It depends on the action vector μ_k = (I_k , r_k ) and on the system state vector. We consider a second-order Markov model for C_k and include component C_k-1in the state vector $s_{k} = (q_{k}, {\bar{q}}_{k}, {\bar{r}}_{k}, C_{k}, C_{k - 1})$ . Note that the first-order model can be considered as a special case with $s_{k} = (q_{k}, {\bar{q}}_{k}, {\bar{r}}_{k}, C_{k})$ . Considering φ(·) as the state update function we can write: s_k+1= φ (s_k , μ_k , C_k+1, B_k ). In (6) g_N+1is the final stage utility which depends only on the final state of the system, s_N+1and it can include some limitations or penalties on the final state of the system.

Here we consider a special format for utility per stage function in order to clarify how it controls system performance:

\begin{gathered} g (s_{k}, μ_{k}) = E_{B_{k}} [U ({\bar{r}}_{k}) - α V ({\bar{q}}_{k}) - β E_{k}] = U ({\bar{r}}_{k}) - α V ({\bar{q}}_{k}) - \\ β e I_{k} (1 - P_{B}), k = 1, . . ., N \end{gathered}

(7)

where U(·), and -V(·) are suitable continuous, concave functions, and parameters α and β control the tradeoff between rate, energy, and delay in the utility function. A similar formulation for per stage utility is used in [27, 28] for multi-period utility maximization while queue management and thus queue sizes were not considered.

The number of packets remaining in the queue at the final stage can be penalized with a price of η as follows:

g_{N + 1} (s_{N + 1}) = U ({\bar{r}}_{N + 1}) - α V ({\bar{q}}_{N + 1}) - η q_{N + 1}

(8)

3.2. Infinite time horizon

In this case we maximize the average utility per stage which is defined by

\max_{{μ_{k}}} \lim_{N \to \infty} E_{{C_{k}}} [\frac{1}{N} \sum_{k = 1}^{N} g (s_{k}, μ_{k})]

(9)

where the action and state vectors as well as the per stage utility function are defined similar to the FTH problem. We consider both the first- and second-order models for the channel state by applying appropriate format of s_k .

4. Optimal adaptive control

To maximize FTH or ITH utility functions the controller should decide optimal actions $μ_{k}^{*} (s_{k})$ at the beginning of each timeslot as a function of the system state s_k . Note that the decision must be causal since future system states are unknown due to the randomness of the channel state (C_k ) and occupancy (B_k ). In this section, by using the DP algorithm [31], we derive algorithms that compute the optimal control functions for the FTH and ITH problems. It is important to remark that the resulting optimal control functions are computed and stored offline. Then, they will be used online to dynamically adapt the actions to the system state. As described earlier, the system state definition can support uncorrelated, first- and second-order channel models so we do not limit the solution to any specific channel model.

4.1. Per stage adaptation to maximize FTH utility

The optimal control policy is the sequence of control functions (one for each timeslot) $π^{*} = {μ_{1}^{*} (s_{1}), μ_{2}^{*} (s_{2}), . . ., μ_{N}^{*} (s_{N})}$ that maximize (6). Note that the control functions provide the optimal action for each of the possible system states at different stages. Using the DP algorithm, the optimal policy π* is obtained from the following backward recursion for k = N, N - 1, ..., 1:

J_{(N + 1)} (s_{N + 1}) = g_{N + 1} (s_{N + 1})

(10)

J_{k} (s_{k}) = \max_{μ_{k}} \{g (s_{k}, μ_{k}) + E_{C_{k + 1}, B_{k}} [J_{k + 1} (s_{k + 1})]\}

(11)

μ_{k}^{*} (s_{k}) = \underset{μ_{k}}{\arg \max} \{g (s_{k}, μ_{k}) + E_{C_{k + 1}, B_{k}} [J_{k + 1} (s_{k + 1})]\}

(12)

The function J_k (s_k ) is the maximum expected accumulative utility, achieved under optimal decision, when the system is in state s_k at the k th stage. Thus, J₁ (s₁) is the expected total utility for N stages when the initial state is s₁.

The application of the DP algorithm requires computation of function J_k (s_k ) for all possible system states (s_k ) at each stage and necessitates the system state space to be finite. Since the state components ${\bar{r}}_{k}$ , and ${\bar{q}}_{k}$ can take values from continuous spaces, we discretize them using finite grids ${{\bar{q}}_{m}^{d} | m = 1, 2, . . . M_{q}}$ , and ${{\bar{r}}_{m}^{d} | m = 1, 2, . . ., M_{r}}$ . Then, we can express each non-grid value as a linear interpolation of the nearby grid values:

\bar{q} = \sum_{m = 1}^{M_{q}} w_{q}^{m} (\bar{q}) {\bar{q}}_{m}^{d}

(13)

\bar{r} = \sum_{m = 1}^{M_{r}} w_{r}^{m} (\bar{r}) {\bar{r}}_{m}^{d}

(14)

where $w_{q}^{m}$ and $w_{r}^{m}$ are non-negative weights and $\sum_{m = 1}^{M_{q}} w_{q}^{m} (\bar{q}) = \sum_{m = 1}^{M_{r}} w_{r}^{m} (\bar{r}) = 1$ . It can be shown that if Lipschitz condition holds for the functions $w_{q}^{m} (\bar{q})$ , $w_{r}^{m} (\bar{r})$ , and g(s_k , μ_k ), and for the state update functions (1-3), the DP solution of the discretized problem converges to the optimal policy for the original continuous problem, as the density of the grid increases [32]. For the problem in hand the utility and the state update functions are continuous and thus satisfy the Lipschitz condition. We select $w_{q}^{m}$ and $w_{r}^{m}$ to be suitable continuous functions of the state variables which are chosen on the basis of geometric considerations as suggested in [31] and each state will be described by two nearby discrete states:

w_{x}^{m} (\bar{x}) = \{\begin{gathered} \begin{matrix} (\bar{x} - {\bar{x}}_{m + 1}^{d}) / ({\bar{x}}_{m}^{d} - {\bar{x}}_{m - 1}^{d}), & {\bar{x}}_{m - 1}^{d} < \bar{x} < {\bar{x}}_{m}^{d} \\ ({\bar{x}}_{m + 1}^{d} - \bar{x}) / ({\bar{x}}_{m + 1}^{d} - {\bar{x}}_{m}^{d}), & {\bar{x}}_{m}^{d} < \bar{x} < {\bar{x}}_{m + 1}^{d} \end{matrix} \\ 0, otherwise \end{gathered}

(15)

The DP algorithm for the discretized state space $s_{k}^{d} = (q_{k}, {\bar{q}}_{k}^{d}, {\bar{r}}_{k}^{d}, C_{k}, C_{k - 1})$ and k = N, N - 1, ... , 1 will be

J_{(N + 1)} (s_{N + 1}^{d}) = g_{N + 1} (s_{N + 1}^{d})

(16)

J_{k} (s_{k}^{d}) = \max_{μ_{k}} \{g (s_{k}, μ_{k}) + E_{C_{k + 1}, B_{k}} [{\tilde{J}}_{k + 1} (s_{k + 1})]\}

(17)

{\tilde{μ}}_{k}^{*} (s_{k}^{d}) = \arg \max_{μ_{k}} \{g (s_{k}, μ_{k}) + E_{C_{k + 1}, B_{k}} [{\tilde{J}}_{k + 1} (s_{k + 1})]\}

(18)

where ${\tilde{J}}_{k + 1} (s_{k + 1})$ is the estimation of $J_{k + 1} (s_{k + 1})$ by its values at discretized states and is given by:

{\tilde{J}}_{k + 1} (s_{k + 1}) = \sum_{m = 1}^{M_{q}} \sum_{l = 1}^{M_{r}} w_{q}^{m} ({\bar{q}}_{k + 1}) w_{r}^{l} ({\bar{r}}_{k + 1}) J_{k + 1} (s_{k + 1}^{d})

(19)

In the above equation, we have $s_{k + 1}^{d} = (q_{k + 1}, {\bar{q}}_{m}^{d}, {\bar{r}}_{l}^{d}, C_{k + 1}, C_{k})$ and $s_{k + 1} = (q_{k + 1}, {\bar{q}}_{k + 1}, {\bar{r}}_{k + 1}, C_{k + 1}, C_{k})$ . Furthermore q_k+1, ${\bar{q}}_{k + 1}$ and ${\bar{r}}_{k + 1}$ are given by (1), (2), and (3), respectively. We consider the second-order Markov model by using both C_k-1and C_k in the state vector. The other channel models can be considered as its special case. The solution provided in Equations (17)-(19) is valid for any concave and continuous function of g.

Next we replace g(s_k , μ_k ) in (17) and (18) with its format provided in (7) and calculate the expectation, $E_{C_{k + 1}, B_{k}} [\cdot]$ , using the channel transition probabilities, $\Pr \{C_{k + 1} = Ĉ_{j} | C_{k}, C_{k - 1}\}$ and channel occupancy probability, Pr{B_k = 1}. The expected accumulative utility and the optimal control functions for k = N, N - 1, ... , 1 will be

\begin{gathered} J_{k} (s_{k}^{d}) = U ({\bar{r}}_{k}^{d}) - α V ({\bar{q}}_{k}^{d}) + \max_{μ_{k}} {- β e I_{k} (1 - P_{B}) + \sum_{j = 1}^{M} \sum_{i = 0}^{1} \Pr {B_{k} = i} \Pr {C_{k + 1} = \\ Ĉ_{j} | C_{k}, C_{k - 1}} {\tilde{J}}_{k + 1} (s_{k + 1})} \end{gathered}

(20)

\begin{gathered} {\bar{μ}}_{k}^{*} (s_{k}^{d}) = \\ \arg \max_{μ_{k}} \{- β e I_{k} (1 - P_{B}) + \sum_{j = 1}^{M} \sum_{i = 0}^{1} \Pr {B_{k} = i} \Pr {C_{k + 1} = Ĉ_{j} | C_{k}, C_{k - 1}} {\tilde{J}}_{k + 1} (s_{k + 1})\} \end{gathered}

(21)

Since ${\bar{r}}_{k}^{d}$ and ${\bar{q}}_{k}^{d}$ are independent of the decision in the k th timeslot, $U ({\bar{r}}_{k}^{d}) - α V ({\bar{q}}_{k}^{d})$ do not affect the maximization in (21). Also the summations in (20) and (21) are over all M channel states and two possible channel occupancy conditions.

The discrete DP algorithm can be executed offline and the resulting optimal policy can be stored in a look-up table available at the transmitter. Then, it will be used online to dynamically adapt the action to the system state.

4.2. State-based adaptive control to optimize average utility per stage

To solve the ITH problem of (9) we first define the average utility per stage when using policy π and starting from the initial state s as

J_{π} (s) = \lim_{N \to \infty} E [\frac{1}{N} \sum_{k = 1}^{N} g (s_{k}) | (s_{1} = s), π]

(22)

We denote the optimal policy as π* which produces the maximum average utility per stage J*. Both π* and J* are independent of the initial state since the influence of the utility of the early stages on the average utility reduces to 0 as N → ∞. Moreover, since the utility per stage, the transition probabilities (4), and the state update Equations (1)-(3) are all stationary, the optimal policy will be stationary (does not change from stage to stage). Therefore, it is a single function, μ*(s), that maps the system states to actions regardless of the stage.

J*, together with the so-called relative value function h*(s), should satisfy the following Bellman's fixed point equation [31] for every state:

J^{*} + h^{*} (s) = \max_{μ} {g (s, μ) + E [h^{*} (s^{+})]}

(23)

where s⁺ indicates the successor state of the current state s. Considering φ(·) as the state update function s⁺ = φ (s, μ, C⁺, B). The expectation in Equation (23) is over the random processes {B_k } and {C_k }.

We use a modified relative value iteration algorithm to solve the ITH problem [31]. First, we define a variant of the Bellman operator over any function f as

B_{τ} f (s) = \max_{μ} {g (s, μ) + τ E [f (s^{+})]}

(24)

where parameter τ ∈ (0, 1) is a scalar. Then, the following iterative algorithm is used in order to calculate h⁽ⁿ⁺¹⁾(s) for all states of the state space in the iteration (n + 1):

h^{(n + 1)} (s) = (1 - τ) h^{(n)} (s) + B_{τ} h^{(n)} (s) - B_{τ} h^{(n)} (s^{'}),

(25)

where s' is some fixed state. We initialize this algorithm with h⁽⁰⁾(s) = g(s, I = 0, r = 0). Convergence of (25) is guaranteed since queue and channel states are recurrent [31]. The decision will also be updated and will finally converge to the optimal decision as n → ∞:

μ^{(n)} (s) = \arg \max_{μ} \{g (s, μ) + τ E [h^{(n)} (s^{+})]\},

(26)

The practical application of (24) requires the state space to be discrete, so we use the same discretization procedure as in Section 4.1. This results in the following modified Bellman operator:

B_{τ}^{d} f (s^{d}) = \max_{μ} \{g (s^{d}, μ) + τ E [\sum_{m = 1}^{M_{q}} \sum_{l = 1}^{M_{r}} w_{q}^{m} (q^{+}) w_{r}^{l} (r^{+}) f (s^{+ d})]\}

(27)

Therefore, we apply (27) and compute h⁽ⁿ⁾(s^d ) for all possible discrete states. For the uncorrelated and first-order channel models there are (L × M_q × M_r × M) discrete states and for the second-order channel model this number should be multiplied by M.

5. Structural properties of the optimal solution

In the previous section, we provided DP algorithms that can be applied to find optimal decisions through numerical calculations. In this section, we investigate some structural properties of the solution. We use the following practical assumptions throughout this section.

Assumption 1: Per stage utility function has a format of (7) and (8) with U(·), and V(·) as increasing functions.

Assumption 2: Consider $(C^{-} = Ĉ_{a^{-}}, C = Ĉ_{a})$ as the channel state in the previous and current slot, respectively, and $(Ĉ_{b^{-}}, Ĉ_{b})$ as other possible channel states in these two slots, where $Ĉ_{a} \leq Ĉ_{b}$ and $Ĉ_{a^{-}} \leq Ĉ_{b^{-}}$ . We assume that there exists a j such that the following inequality holds for channel transition probabilities:

\sum_{i = 1}^{j} P_{b^{-}, b, i} \leq \sum_{i = 1}^{j} P_{a^{-}, a, i}

where $P_{a^{-}, a, i}$ is the probability of going from channel states $(Ĉ_{a^{-}}, Ĉ_{a})$ to the next state $C^{+} = Ĉ_{i}$ as defined for second order Markov model.

Assumption 2 is valid in practice for Markov channels since $(Ĉ_{a^{-}}, Ĉ_{a})$ is supposed to be lower than and $(Ĉ_{b^{-}}, Ĉ_{b})$ and each side of the inequality calculates the probability of going to the first j states with lowest rates. For example, if $P_{b^{-}, b, 1} \leq P_{a^{-}, a, 1}$ then the inequality will be true for j = 1 for and assumption 2 holds. If the inequality turns out to be true for any value of j then the assumption is correct. Based on this assumption we provide the following lemma:

Lemma 1: If f(C) and g(C) are two increasing functions, f(C) ≤ g(C), C⁺ is the next channel state, and similar to Assumption 2 $Ĉ_{a} \leq Ĉ_{b}$ and $(Ĉ_{a^{-}} \leq Ĉ_{b^{-}})$ then we have

E {f (C^{+}) | C^{-} = Ĉ_{a^{-}}, C = Ĉ_{a}} \leq E {g (C^{+}) | C^{-} = Ĉ_{b^{-}}, C = Ĉ_{b}}

(28)

Proof is provided in the Appendix.

5.1. Structural properties of FTH solution

The following theorem indicates monotonicity of J_k (s_k ) versus the state variables.

Theorem 1: J_k (s_k ) is a decreasing function of q_k and ${\bar{q}}_{k}$ , and an increasing function of ${\bar{r}}_{k}$ and C_k for all values of k.

Proof: In order to prove the theorem we show through induction that for k = N + 1, ..., 1 we have J_k (s_k + Δ) ≤ J_k (s_k ) for any vector Δ that increase q_k and ${\bar{q}}_{k}$ , and decrease ${\bar{r}}_{k}$ and C_k .

Based on Equation (11) for optimal decision in the k th stage, we define G_k (s_k , μ_k ) as:

G_{k} (s_{k}, μ_{k}) = g (s_{k}, μ_{k}) + E [J_{k + 1} (s_{k + 1})]

(29)

Thus, $J_{k} (s_{k}) = G_{k} (s_{k}, μ_{k}^{*})$ where $μ_{k}^{*}$ is optimal decision for state s_k . Also we define Δ = (δ₁, δ₂, - δ₃, -δ₄, -δ₅) for any value of δ_i ≥ 0, i = 1, ... , 5 such that $s_{k} + Δ = (q_{k} + δ_{1}, {\bar{q}}_{k} + δ_{2}, {\bar{r}}_{k} - δ_{3}, C_{k} - δ_{4}, C_{k - 1} - δ_{5})$ is an element of the state space.

For k = N + 1 we have $J_{N + 1} (s_{N + 1}) = U ({\bar{r}}_{N + 1}) - γ V({\bar{q}}_{N + 1}) - η q_{N + 1}$ and using assumption 1 it is clear that J_N+1(s_N+1+ Δ) ≤ J_N+1(s_N+1). Assuming J_k+1(s_k+1) is a monotonic function we show J_k (s_k ) is also monotonic for k = N, ... , 1 which completes the proof.

We define $μ_{k, Δ}^{*}$ as optimal decision for state s_k + Δ in stage k, so we can write $J_{k} (s_{k} + Δ) = G_{k} (s_{k} + Δ, μ_{k, Δ}^{*})$ , however $μ_{k, Δ}^{*}$ is not an optimal decision for state s_k so we have:

G_{k} (s_{k}, μ_{k, Δ}^{*}) \leq G_{k} (s_{k}, μ_{k}^{*}) = J_{k} (s_{k})

(30)

Using Assumption 1 it is clear that g is a monotonic function of the state variables

g (s_{k} + Δ, μ_{k, Δ}^{*}) \leq g (s_{k}, μ_{k, Δ}^{*})

(31)

We consider φ (·) as the state update function and define two possible next states $s_{k + 1, Δ}^{*} = φ (s_{k} + Δ, μ_{k, Δ}^{*}, C_{k + 1}, B_{k})$ and $s_{k + 1}^{#} = φ (s_{k}, μ_{k, Δ}^{*}, C_{k + 1}, B_{k})$ . For known values of C_k+1and B_k we can use (1)-(3) and easily show that $s_{k + 1,}^{*_{Δ}} = s_{k + 1}^{#} + Δ^{'}$ in which $Δ^{'} = (δ_{1}^{'}, δ_{2}^{'}, - δ_{3}^{'}, - δ_{4}^{'}, - δ_{5}^{'})$ for some $δ_{i}^{'} \geq 0$ . Thus $J_{k + 1} (s_{k + 1, Δ}^{*}) \leq J_{k + 1} (s_{k + 1}^{#})$ .

We define $f (C_{k + 1}) ≜ E_{B} [J_{k + 1} (s_{k + 1, Δ}^{*})]$ and $g (C_{k + 1}) ≜ E_{B} [J_{k + 1} (s_{k + 1}^{#})]$ since B is independent of the system state thus we have f(C_k+1) ≤ g (C_k+1) and since J_k+1(·) is an increasing function of C, then f(·) and g(·) are increasing functions. Applying Lemma 1 with $Ĉ_{a^{-}} = (C_{k - 1} - δ_{5})$ , $Ĉ_{a} = (C_{k} - δ_{4})$ , $Ĉ_{b^{-}} = C_{k - 1}$ , and $Ĉ_{b} = C_{k}$ we find that

E_{C^{+}} {E_{B} [J_{K + 1} (s_{k + 1, Δ}^{*})]} \leq E_{C^{+}} {E_{B} [J_{k + 1} (s_{k + 1}^{#})]}

(32)

Combining (31), (32) and considering definition of G_k in (29) we get

J_{k} (s_{k} + Δ) = G_{k} (s_{k} + Δ, μ_{k, Δ}^{*}) \leq G_{k} (s_{k}, μ_{k, Δ}^{*})

(33)

Equation (30) together with (33) prove the theorem by showing: J_k (s_k + Δ) ≤ J_k (s_k ).

■

Assuming uncorrelated channel model the following theorem indicates the "threshold structure" of the optimal transmission policy versus the channel state.

Theorem 2: If the optimal access decision in state $s_{k} = (q_{k}, {\bar{q}}_{k}, {\bar{r}}_{k}, C_{k})$ is $I_{k}^{*} (s_{k}) = 1$ , then for another possible state $s_{k}^{'} = (q_{k}, {\bar{q}}_{k}, {\bar{r}}_{k}, C_{k}^{'})$ in the same slot with improved channel state $C_{k}^{'} \geq C_{k}$ we have $I_{k}^{*} (s_{k}^{'}) = 1$ .

Proof: Assume $μ_{k}^{*} (s_{k}) = (I_{k} = 1, r_{k})$ but $μ_{k}^{*} (s_{k}^{'}) = (I_{k}^{'} = 0, r_{k}^{'})$ as the optimal decision for s_k and $s_{k}^{'}$ , respectively. According to the definition of G_k in the proof of Theorem 1, $μ_{k}^{*} (s_{k})$ maximizes G_k (s_k , μ_k ) and we have

G_{k} (s_{k}, μ_{k}^{*} (s_{k}^{'})) \leq G_{k} (s_{k}, μ_{k}^{*} (s_{k}))

(34)

On the other hand since s_k and $s_{k}^{'}$ differs only in the channel state, we have $g (s_{k}, μ_{k}^{*} (s_{k}^{'})) = g (s_{k}^{'}, μ_{k}^{*} (s_{k}^{'}))$ and by using $(I_{k}^{'} = 0, r_{k}^{'})$ for both states, queue size will modify similarly for s_k , and $s_{k}^{'}$ which results in the same next state, $s_{k + 1} = s_{k + 1}^{'}$ . Also for uncorrelated channel model the averaging over next channel state does not depend on the current state, thus $E_{C^{+}} {E_{B} [J_{k + 1} (s_{k + 1})]} = E_{C^{+}} {E_{B} [J_{k + 1} (s_{k + 1}^{'})]}$ and

G_{k} (s_{k}, μ_{k}^{*} (s_{k}^{'})) = G_{k} (s_{k}^{'}, μ_{k}^{*} (s_{k}^{'}))

(35)

By applying decision $μ_{k}^{*} (s_{k}) = (I_{k} = 1, r_{k})$ and since transmission with better channel state will decrease q and $\bar{q}$ which increase J_k according to Theorem 1, we have

G_{k} (s_{k}, μ_{k}^{*} (s_{k})) \leq G_{k} (s_{k}^{'}, μ_{k}^{*} (s_{k}))

(36)

Combining (34), (35), and (36) results in

G_{k} (s_{k}^{'}, μ_{k}^{*} (s_{k}^{'})) \leq G_{k} (s_{k}^{'}, μ_{k}^{*} (s_{k}))

which is in contrast to optimality of the $μ_{k}^{*} (s_{k}^{'}) = (I_{k}^{'} = 0, r_{k}^{'})$ . Thus, we should have $μ_{k}^{*} (s_{k}^{'}) = (I_{k}^{'} = 1, r_{k}^{'})$ .

■

Note that Theorem 2 may be incorrect when channel state is time correlated. For example, consider two possible channel states C_k and $C_{k}^{'}$ , with $C_{k} < C_{k}^{'}$ and assume that optimal decision is to transmit for a state with C_k . Also assume that probability of going from $C_{k}^{'}$ to a better channel state and from C_k to a worse channel state is high. So, we can argue heuristically that in this condition it may be optimal to transmit data when channel is in state C_k but not to transmit when it is in state $C_{k}^{'}$ .

5.2. Structural properties of ITH solution

We provide structural properties of ITH solution in this section through the following theorems. First we show that relative value function, h*(s), is a monotonic function in Theorem 3 and then prove the threshold structure of access decision versus channel state in Theorem 4.

Theorem 3: h*(s), is a decreasing function of q and $\bar{q}$ , and an increasing function of $\bar{r}$ and C.

Proof: We define Δ = (δ₁, δ₂, -δ₃, -δ₄, -δ₅) with δ_i ≥ 0 and show that h*(s + Δ) ≤ h*(s). We also define G_f (s, μ(s)) on function f as

G_{f} (s, μ) = g (s, μ) + τ E [f (s^{+})]

(37)

Assuming μ* as the decision that maximizes G_f and according to the Bellman equation (24) we have

B_{τ} f (s) = G_{f} (s, μ^{*})

Taking into account h*(s) = lim_k→∞h⁽ⁿ⁾(s), we prove through induction for every iteration n, h⁽ⁿ⁾(s + Δ) ≤ h⁽ⁿ⁾(s). For n = 0 we define h⁽⁰⁾(s) = g(s, I = 0, r = 0) which according to Assumption 1 it is clear that h⁽⁰⁾ (s + Δ) ≤ h⁽⁰⁾ (s). We assume that h⁽ⁿ⁾(s) is monotonic and show h⁽ⁿ⁺¹⁾(s) is also monotonic. First, we show that B _τh⁽ⁿ⁾(s) is monotonic. Using (26) for states s and s + Δ and assuming μ* and $μ_{Δ}^{*}$ , respectively, as maximizing actions we have

{G_{h}}^{(n)} (s, μ_{Δ}^{*}) \leq B_{τ} h^{(n)} (s) = {G_{h}}^{(n)} (s, μ^{*})

(38)

From definition of g it is clear that $g (s + Δ, μ_{Δ}^{*}) \leq g (s, μ_{Δ}^{*})$ . We can use the similar approach as the proof of Theorem 1 and apply Lemma 1 to show that

E_{C} + {E_{B} [h^{(n)} ({(s + Δ)}^{+}) | μ_{Δ}^{*}]} \leq E_{C} + {E_{B} [h^{(n)} (s^{+}) | μ_{Δ}^{*}]}

(39)

which results in

B_{τ} h^{(n)} (s + Δ) = G_{h} (n) (s + Δ, μ_{Δ}^{*}) \leq G_{h} (n) (s, μ_{Δ}^{*})

(40)

Combining (38) and (40) we find that B _τh⁽ⁿ⁾(s + Δ) ≤ B _τh⁽ⁿ⁾(s). Using Equation (25) and taking into account that B _τh⁽ⁿ⁾(s') is independent of the state vector, it can be easily shown that h^{(n + 1)}(s) is a monotonic function.

■

Assuming uncorrelated channel model the following theorem indicates existence of a threshold for channel state that the link should decide to transmit when channel state is better than or equal to that threshold.

Theorem 4: There exists a threshold, C_th, that for $s_{th} = (q = C_{th}, \bar{q}, \bar{r}, C_{th})$ with any $\bar{q}$ and $\bar{r}$ , we have I*(s_th ) = 1. Also for any s with C ≥ C_th and q ≥ C_th we have I*(s) = 1.

Proof: Assume in timeslot k we have C_k = C^max and q_k = C^max, transmission at this time has the energy cost of β e but it will reduce q by C^max which will reduce $\bar{q}$ by θ_qC^max and also will reduce the future costs related to the queue size. However, transmission of theses C^max packets at any later time slot requires the same amount of energy. Thus, it is better to transmit these packets at state s_th to reduce the queue size as early as possible and reduce the future costs related to the queue size. We conclude that: "if C = C^max and q = C^max then I*(s) = 1" which proves existence of C_th.

In order to prove the second part of the theorem we assume $s = (q \geq C_{th}, \bar{q}, \bar{r}, C \geq C_{th})$ and consider optimal decisions μ*(s_th) = (I*(s_th), r*(s_th)) and μ*(s) = (I*(s), r*(s)) for states s_th and , respectively. If I(s) = 0 we can show similar to the proof of Theorem 2 that it cannot be an optimal transmission policy.

■

6. Numerical results

For numerical analysis of the adaptive control algorithms provided in Section 4 we consider a lightweight sensor in a wireless network that may transmit its status using few bits. In each timeslot the sensor may send its own packet or forward packets of other sensors. We assume a Rayleigh flat fading channel, and use a set of simple Modulation and Coding schemes. Note that our adaptive algorithm only requires the FSMC model which can be found for many practical fading channels [21] and do not depend on Rayleigh fading assumption or Modulation schemes. However, in this section we consider the following types of modulations joint with Reed-Solomon (RS) coding:

Q₁: No transmission since link is in deep fade.
Q₂: BPSK with RS (63,47).
Q₃: QPSK with RS (127,94).
Q₄: 16-QAM with RS (255, 188).

Note that in each time slot one frame will be transmitted and the time duration of the frames is identical for different schemes. Figure 2 illustrates FER of the aforementioned schemes. Setting 0.01 as the FER threshold, we find SNR thresholds A_j for the fading regions which ensure the required FER limit for (Q₁, Q₂, Q₃, Q₄) as {0, 3.8, 7.77, 33.1, ∞}. For example, we will use Q₃ while SNR is between 7.77 (8.9 dB) and 33.1 (15.2 dB). Assuming a Rayleigh fading channel, we use the Markov model proposed in [21, 30] to obtain the transition probabilities for a given normalized Doppler frequency. Unless otherwise indicated we assume the average SNR at the receiver is $\bar{SNR} = 10 dB$ and normalized Doppler frequency is f_dT_s = 0.02. Each frame that will be transmitted in the data slot contains a coded block. Considering the packet length of 47 bits, 0, 1, 2, or 4 packets can be transmitted in a frame based on the channel state so $C = \{0, 1, 2, 4\}$ .

Regarding the per stage utility function (7) we use $U ({\bar{r}}_{k}) = \log (ε + {\bar{r}}_{k})$ , and $V ({\bar{q}}_{k}) = {({\bar{q}}_{k})}^{2}$ to avoid very small rates and large queue sizes. Logarithmic utility is used to provide proportional fairness in the network [14] and prevent selfish rate maximization of the link. Also it provides fairness among multiple flows over a single link [27]. Recently, it has been shown, based on experimental results and Weber-Fechner psychophysical law, that user experience and satisfaction follows logarithm laws, and quality of experience (QoE) versus rate is formulated as QoE(r) = log (ar + b) [33]. In the utility function, energy and queue sizes are used with negative weights to minimize energy consumption and delay. Remember that the energy consumed at the k th timeslot is given by E_k = eI_k (1 - B_k ). Therefore

g (s_{k}, μ_{k}) = \log (ε + {\bar{r}}_{k}) - α {({\bar{q}}_{k})}^{2} - β e I_{k} (1 - P_{B})

(41)

We also use the following parameters for simulations unless otherwise indicated: θ_q = θ_r = 0.7, L = 12, α = 0.005, βe = 1, , P_B = 0.1, ε = 0.001. Using these parameters, Figure 3 shows the selected per stage utility which is an increasing function of ${\bar{r}}_{k}$ and decreasing function of ${\bar{q}}_{k}$ . Our selected parameters result in a utility function which is negative; however, behavior of this function versus system state is such that by maximizing it we will maximize rate while minimizing energy and delay.

As described in Section 4, continuous state variables, ${\bar{r}}_{k}$ and ${\bar{q}}_{k}$ , should be discretized in order to achieve a finite state system and dynamic programming solution. We set M_r = 21, and M_q = 13 for discretization. It is shown in our simulations that enhancement achieved by selecting greater values for M_r , and M_q is insignificant. Also the maximum queue size is assumed to be L = 12 and the number of arrival packets at each stage r_k is limited by 4.

6.1. FTH results

As indicated earlier for FTH problem we use (8) as the final stage utility function with η = 5 as the price for packets remaining in the queue where $U ({\bar{r}}_{k}) = \log (ε + {\bar{r}}_{k})$ , and $V ({\bar{q}}_{k}) = {({\bar{q}}_{k})}^{2}$ . We assume that the initial state of the link is ${\bar{r}}_{1} = 0.1$ , ${\bar{q}}_{1} = q_{1} = 0$ and consider a flat fading Rayleigh channel. The recursive algorithm (16)-(18) is used to obtain the optimal control policy (over the discretized state space) for different values of N. The transition probabilities of the Markov models are computed as described earlier.

The optimal control policies are then used in Monte Carlo simulations, over the channel response γ_k and the channel occupation B_k processes, to maximize J₁(s₁), the sum of the utilities of the N stages. Figure 4 illustrates the J₁(s₁), as a function of the number of timeslots N, for different channel correlation models and two values of average SNR. This figure shows that the performance is enhanced by exploiting the channel correlation through the FSMC models, mainly for large values of N. It also shows that the use of second-order FSMC is not worthwhile in these cases. For N > 50, J₁ (s₁) varies almost as a linear function of N where the slope depends on the channel correlation model and the average SNR.

We have also investigated the dependence of performance on the initial state of the link. Figure 5 illustrates the average utility, J₁(s₁)/N, versus the initial EWMA rate ${\bar{r}}_{1}$ for different values of N. As expected the higher initial EWMA rate, the higher average utility. It also shows that sensitivity to the initial state decreases as the number of slots increases.

Size of the grid used for discretization, M_q and M_r , can affect performance of the system. In Table 1 we provide FTH performance of the algorithms that have used different values of M_q and M_r for N = 40. We can see that enhancement achieved by selecting values greater than M_r = 21, and M_q = 13 is negligible.

Table 1 Performance versus discretization grid size

Full size table

6.2. ITH results

We use the modified relative value iteration algorithm (25), with τ = 0.9, in order to find the optimal control policies for the infinite time horizon problems. Unless otherwise indicated, in the following results we have considered the first-order FSMC channel model. In each iteration the algorithm computes new values of I⁽ⁿ⁾(s) and r⁽ⁿ⁾(s) for all possible states and finally it converges to the optimal control policy (for the discretized problem) μ*(s) = (I*(s), r*(s)). Figure 6 illustrates the convergence of the iterative algorithm, by showing the percentage of decisions, (I⁽ⁿ⁾(s), r⁽ⁿ⁾(s)). which are modified in each iteration in comparison with the previous iteration. Apart from the optimal control policy, the algorithm also provides the optimal relative value function, h*(s). Figure 7a,b illustrates r*(s), and h*(s) versus some elements of the state vector while fixing the others. They show interesting properties of the optimal policy and relative value function with respect to the system state. For example, Figure 7a shows that the arrival rate should be reduced as the channel goes to the fade state or as the EWMA of rate increases. Figure 7b indicates that the relative utility function decreases as the queue size increases or the channel goes to the fade state.

Figure 8 demonstrates the optimal actions for a particular realization of the channel process in a period of 200 timeslots. Note that when the channel goes to a deep fade during timeslots 231 to 282 the link does not access the channel (I_k = 0) so there is not energy consumption in this period (E_k = 0). Also, new packet arrivals are reduced to prevent high queue backlog but kept at a minimum rate to prevent $\log (ε + {\bar{r}}_{k})$ from very negative values. After the deep fade finishes the link starts to transmit backlogged packets while keeping slow arrival rate until timeslot 288.

Based on the selected format of the per stage utility (7), we can reduce the energy consumption by increasing β. However, this is achieved at the cost of reducing the transmission rate and increasing the delay as shown in Figure 9. In other words, the figure shows the tradeoff between energy, rate, and delay as a function of β. Here, the average delay is calculated using the little's low: $\bar{D} = \sum_{k} q_{k} / \sum_{k} r_{k}$ [34].

Figure 10 shows the performance of the optimal policies, obtained from different channel correlation models, as a function of the time variability of the channel. In particular, it shows the resulting average utility per stage, as a function of the normalized Doppler frequency f_dT_s, for the first- and second-order FSMC models and different values of the EWMA parameters for packet arrivals and queue occupancy. It shows that average utility is higher for fading channels with higher f_dT_s since channel remains for a short time in deep fades. For θ = θ_q = θ_r = 0.7, which corresponds to larger averaging time of rate and queue size, both channel models exhibit similar performance. However, for θ = θ_q = θ_r = 0.3 we see that the more accurate second-order FSMC model enhances the performance of the link compared to the first-order FSMC model.

7. Conclusions

We addressed the problem of optimal channel access and rate adaptation in the links of CSMA wireless networks. We defined a utility function that trades off the energy consumption and the average packet transmission rate and delay. By using dynamic programming, we derive algorithms and optimal policies that maximize the average utility by adapting the arrival packet rate and channel access as functions of the queue occupancy, channel state, and smoothed rate. The optimal policies can be computed and stored offline. Then, they can be used online for dynamic access control and queue management of the link. The proposed algorithms exploit the time correlation of the channel by means of different FSMC models. Both FTH and ITH problems were addressed. In the first case, the average utility is optimized for a finite time period, whereas in the second case, the long-term average utility is maximized. Structural properties of the optimal solution are investigated and it is shown that optimal transmission policy has a threshold structure versus the channel state. For the ITH problem we proved the existence of a channel state that the link should always transmit when the channel is in that state or in a better one. Numerical results show that the overall performance of the link can be enhanced by increasing the order of the FSMC channel model. However, it increases the complexity of the algorithms and the memory required to store the optimal policies.

Appendix

Proof of Lemma 1

The difference between right- and left-hand of inequality (28) can be calculated using the channel transition probabilities:

D ≜ E {g (C^{+}) | Ĉ_{b^{-}}, Ĉ_{b}} - E {f (C^{+}) | Ĉ_{a^{-}}, Ĉ_{a}} = \sum_{i = 1}^{M} P_{b^{-}, b, i} g (Ĉ_{i}) - P_{a^{-}, a, i} f (Ĉ_{i})

We partition the summation and rewrite it as

\begin{gathered} D = \sum_{i = 1}^{j} P_{b^{-}, b, i} [g (Ĉ_{i}) - f (Ĉ_{i})] - (P_{a^{-}, a, i} - P_{b^{-}, b, i}) f (Ĉ_{i}) \\ + \sum_{i = j + 1}^{M} (P_{b^{-}, b, i} - P_{a^{-}, a, i}) g (Ĉ_{i}) + P_{a^{-}, a, i} [g (Ĉ_{i}) - f (Ĉ_{i})] \\ \geq \sum_{i = j + 1}^{M} (P_{b^{-}, b, i} - P_{a^{-}, a, i}) g (Ĉ_{i}) - \sum_{i = 1}^{j} (P_{a^{-}, a, i} - P_{b^{-}, b, i}) f (Ĉ_{i}) \\ \geq g (Ĉ_{j}) \sum_{i = j + 1}^{M} (P_{b^{-}, b, i} - P_{a^{-}, a, i}) - f (Ĉ_{j}) \sum_{i = 1}^{j} (P_{a^{-}, a, i} - P_{b^{-}, b, i}) \end{gathered}

the first inequality is a result of $g (Ĉ_{i}) - f (Ĉ_{i}) \geq 0$ and the second one considers $g (Ĉ_{j}) \leq g (Ĉ_{i})$ , i = j + 1, ... , M, and $f (Ĉ_{j}) \geq f (Ĉ_{i}) i = 1, . . ., j$ . Since $\sum_{i = 1}^{M} P_{b^{-}, b, i} = \sum_{i = 1}^{M} P_{a^{-}, a, i} = 1$ we have $\sum_{i = 1}^{i} P_{a^{-}, a, i} - P_{b^{-}, b, i} = \sum_{i = j + 1}^{M} P_{b^{-}, b, i} - P_{a^{-}, a, i}$ thus

D \geq [g (Ĉ_{j}) - f (Ĉ_{j})] \sum_{i = 1}^{j} (P_{a^{-}, a, i} - P_{b^{-}, b, i}) \geq 0

where the second inequality is a result of Assumption 2. ■

References

Karmokar AK, Djonin DV, Bhargava VK: Optimal and suboptimal packet scheduling over time-varying fading channels. IEEE Trans Wirel Commun 2006, 5(2):446-457.
Article Google Scholar
Djonin DV, Krishnamurthy V: MIMO transmission control in fading channels--a constrained Markov decision process formulation with monotone randomized policies. IEEE Trans Signal Process 2007, 55(10):5069-5083.
Article MathSciNet Google Scholar
Wang H, Mandayam NB: A simple packet-transmission scheme for wireless data over fading channels. IEEE Trans Commun 2004, 52(7):1055-1059. 10.1109/TCOMM.2004.831354
Article Google Scholar
Uysal-Biyikoglu E, Prabhakar B, Gamal AE: Energy-efficient packet transmission over a wireless link. IEEE/ACM Trans Netw 2002, 10(4):487-499. 10.1109/TNET.2002.801419
Article Google Scholar
Wang H, Mandayam NB: Opportunistic file transfer over a fading channel under energy and delay constraints. IEEE Trans Commun 2005, 53(4):632. 10.1109/TCOMM.2005.844934
Article Google Scholar
Berry R, Gallager R: Communication over fading channels with delay constraints. IEEE Trans Inf Theory 2002, 48(5):1135-1149. 10.1109/18.995554
Article MathSciNet Google Scholar
Goyal M, Kumar A, Sharma V: Power constrained and delay optimal policies for scheduling transmission over a fading channel. In Proc INFOCOM. San francisco, USA; 2003:311-320.
Google Scholar
Rajan D, Subharwal A, Aazhang B: Delay and rate constrained transmission policies over wireless channels. Proc IEEE GLOBECOM Conference 2001, 806-810.
Google Scholar
IEEE: Wireless LAN medium access control (MAC) and physical layer (PHY) specifications. IEEE standard 802.11 2006.
Google Scholar
IEEE: Wireless medium access control (MAC) and physical layer (PHY) specifications for low-rate wireless personal area networks (WPANs). In IEEE Std 802.15.4 Proceedings of ACM Sigmetrics. Seattle, WA, USA; 2006. S Rajagopalan, D Shah, J Shin, Network adiabatic theorem: an efficient randomized protocol for contention resolution 2009, pp. 133-144
Google Scholar
Jiang L, Leconte M, Ni J, Srikant R, Walrand J: Fast mixing of parallel Glauber dynamics and low-delay CSMA scheduling. 2010.
Google Scholar
Barcelo J, Bellalta B, Cano C, Sfairopoulou A, Oliver M, Verma K: Towards a collision-free WLAN: dynamic parameter adjustment in CSMA/E2CA. EURASIP J Wirel Commun Netw 2011. doi:10.1155/2011/708617
Google Scholar
Kar K, Sarkar S, Tassiulas L: Achieving proportional fairness using local information in Aloha networks. IEEE Trans Autom Control 2004, 49(10):1858-1862. 10.1109/TAC.2004.835596
Article MathSciNet Google Scholar
Mohsenian-Rad AH, Huang J, Chiang M, Wong VWS: Utility-optimal random access: optimal performance without frequent explicit message passing. IEEE Trans Wirel Commun 2009, 8(2):898-911.
Article Google Scholar
Wang X, Kar K: Cross-layer rate control in multi-hop wireless networks with random access. IEEE J Sel Areas Commun 2006, 24(8):1548-1559.
Article Google Scholar
Khodaian M, Khalaj BH: Delay constrained utility maximization in multihop random access networks. IET Commun 2010, 4(16):1908-1918. 10.1049/iet-com.2009.0622
Article MathSciNet Google Scholar
Liu J, Stoylar A, Chiang M, Poor HV: Queue based random access in wireless networks: optimality and stability. IEEE Trans Inf Theory 2009, 55(9):4087-4098.
Article Google Scholar
Warrier A, Janakiraman S, Ha S, Rhee I: DiffQ: practical differential backlog congestion control for wireless networks. In Proceedings of IEEE INFOCOM. Rio de Janeiro, Brazil; 2009:262-270.
Google Scholar
Nardelli B, Lee J, Lee K, Yi Y, Chong S, Knightly E, Chiang M: Experimental evaluation of optimal CSMA. In Proceedings of IEEE INFOCOM. Shanghai, China; 2011:1188-1196.
Google Scholar
Sadeghi P, Kennedy RA, Rapajic PB, Shams R: Finite state Markov modeling of fading channels. IEEE Signal Process Mag 2008, 57: 57-80.
Article Google Scholar
Ni J, Tan B, Srikant R: Q-CSMA: queue-length based CSMA/CA algorithms for achieving maximum throughput and low delay in wireless networks. In Proceedings of IEEE INFOCOM Mini-Conference. San Diego, CA, USA; 2010:1-5.
Google Scholar
Bharghavan V, Demers A, Shenker S, Zhang L: MACAW: a media access protocol for wireless LAN's. In Proceedings of ACM SIGCOMM. London, UK; 1994:212-225.
Google Scholar
Bianchi G: Performance analysis of the IEEE 802.11 distributed coordination function. IEEE J Sel Areas Commun 2000, 18(3):535-547. 10.1109/49.840210
Article Google Scholar
Vandalore B, Feng W, Jain R, Fahmy S: A survey of application layer techniques for adaptive streaming of multimedia. Real-Time Imag 2001, 7(3):221-235. 10.1006/rtim.2001.0224
Article Google Scholar
Floyd S, Jacobson V: Random early detection gateways for congestion avoidance. IEEE/ACM Trans Netw 1993, 1(4):397-413. 10.1109/90.251892
Article Google Scholar
ONeill D, Akuiyibo E, Boyd SP, Goldsmith AJ: Optimizing adaptive modulation in wireless networks via multi-period network utility maximization. In IEEE International Conference on Communications. Cape Town, South Africa; 2010:1-5.
Google Scholar
Akuiyibo E, Boyd SP: Adaptive modulation with smoothed flow utility. EURASIP J Wirel Commun Netw 2010. doi:10.1155/2010/815213
Google Scholar
Montgomery DC: Introduction to Statistical Quality Control. 3rd edition. John Wiley & Sons, New York; 1996.
Google Scholar
Wang HS, Moayeri N: Finite-state markov channel--a useful model for radio communication channels. IEEE Trans Veh Technol 1995, 44: 163-171. 10.1109/25.350282
Article Google Scholar
Bertsekas DP: Dynamic Programming and Optimal Control. Volume I. 3rd edition. Athena Scientific, Belmont; 2005.
Google Scholar
Bertsekas DP: Convergence of discretization procedures in dynamic programming. IEEE Trans Autom Control 1975, 20: 415-419. 10.1109/TAC.1975.1100984
Article MathSciNet Google Scholar
Reichl P, Tuffin B, Schatz R: Logarithmic laws in service quality perception: where microeconomics meets psychophysics and quality of experience. Telecommun Syst 2011., 47: doi:10.1007/s11235-011-9503-7
Google Scholar
Bertsekas D, Gallager R: Data Networks. 2nd edition. Prentice Hall, Englewood Cliffs, NJ; 1992.
Google Scholar

Download references

Acknowledgements

This study was supported in part by the Spanish Government, Ministerio de Ciencia e Innovación (MICINN), under projects COMONSENS (CSD2008-00010, CONSOLIDER-INGENIO 2010 program) and COSIMA (TEC2010-19545-C04-03), in part by Iran Telecommunication Research Center under contract 6947/500, and in part by Iran National Science Foundation under grant number 87041174. This study was completed while M. Khodaian was at CEIT and TECNUN (University of Navarra).

Author information

Authors and Affiliations

Department of Electrical Engineering, Sharif University of Technology, Tehran, Iran
Mahdi Khodaian & Babak H Khalaj
Department of Communication Engineering, University of Cantabria, Santander, Spain
Jesús Pérez
CEIT and TECNUN (University of Navarra), 20009, San Sebastian, Spain
Pedro M Crespo

Authors

Mahdi Khodaian
View author publications
You can also search for this author in PubMed Google Scholar
Jesús Pérez
View author publications
You can also search for this author in PubMed Google Scholar
Babak H Khalaj
View author publications
You can also search for this author in PubMed Google Scholar
Pedro M Crespo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jesús Pérez.

Additional information

Competing interests

The authors declare that they have no competing interests

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Khodaian, M., Pérez, J., Khalaj, B.H. et al. Adaptive access and rate control of CSMA for energy, rate, and delay optimization. J Wireless Com Network 2012, 27 (2012). https://doi.org/10.1186/1687-1499-2012-27

Download citation

Received: 08 September 2011
Accepted: 30 January 2012
Published: 30 January 2012
DOI: https://doi.org/10.1186/1687-1499-2012-27

Adaptive access and rate control of CSMA for energy, rate, and delay optimization

Abstract

1. Introduction

2. System model

2.1. Link model

2.2. Channel model

1. Uncorrelated model

2. First-order markov model

3. Second-order Markov model

3. Problem formulation

3.1. Finite time horizon

3.2. Infinite time horizon

4. Optimal adaptive control

4.1. Per stage adaptation to maximize FTH utility

4.2. State-based adaptive control to optimize average utility per stage

5. Structural properties of the optimal solution

5.1. Structural properties of FTH solution

5.2. Structural properties of ITH solution

6. Numerical results

6.1. FTH results

6.2. ITH results

7. Conclusions

Appendix

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords