Dynamic power control for energy harvesting wireless multimedia sensor networks

Koulali, Mohammed-Amine; Kobbane, Abdellatif; El Koutbi, Mohammed; Tembine, Hamidou; Ben-Othman, Jalel

doi:10.1186/1687-1499-2012-158

Research
Open access
Published: 01 May 2012

Dynamic power control for energy harvesting wireless multimedia sensor networks

Mohammed-Amine Koulali^1,2,
Abdellatif Kobbane¹,
Mohammed El Koutbi¹,
Hamidou Tembine³ &
…
Jalel Ben-Othman⁴

EURASIP Journal on Wireless Communications and Networking volume 2012, Article number: 158 (2012) Cite this article

6125 Accesses
14 Citations
Metrics details

Abstract

Optimization of energy usage in wireless sensor networks (WSN) has been an active research field for the last decades and various approaches have been explored. In fact, A well designed energy consumption model is the foundation for developing and evaluating a power management scheme in network of energy constrained devices such as: WSN. We are interested in developing optimal centralized power control policies for energy harvesting wireless multimedia sensor networks (WMSN) equipped with photovoltaic cells. We propose a new complete information Markov decision process model to characterize sensor's battery discharge/recharge process and inspect the structural properties of optimal transmit policies.

1 Introduction

The recent technological advances in the fields of micro-electronic, wireless communication along with reduction of production costs have motivated the development of a novel generation of wireless networks. wireless sensor networks (WSN) are articulated over a set of miniaturized battery powered devices (sensors) with communication capabilities and are expected to become highly integrated into our daily activities. This class of networks is perceived as an evolution of AdHoc networks with specific energy and computation limitations. Also, the increasing availability, at low cost, of multimedia devices (cameras, microphones,...) has triggered the emergence of multimedia wireless sensor networks (WMSN) [1, 2]. With the diversity of their application domains, ranging from healthcare and intelligent patient monitoring to disaster relief and industrial process supervision through intrusion detection and border protection, WMSN hold a promising future [3]. It is noteworthy that the volume and the nature of carried multimedia content, mainly composed of images and/or video streams, impose severe requirements on sensor's residual energy and available bandwidth.

The energy scarcity represents one of the major limitations of WMSN, indeed, post-deployment replacement of the sensors batteries is generally not practical or even impossible. Therefore, a proper management strategy of the residual energy happens to be a crucial prerequisite to any large scale WMSN deployment. In order to preserve the sensors energy, an optimal choice of transmit powers and an efficient scavenging mechanism of energy from the deployment environment along with an adequate topological placement of the sensors are necessary.

A variety of topologies have been proposed for WMSN deployment, notably: single-tier flat, single-tier clustered and multi-tier (see Figure 1). Introducing hierarchy in WMSN benefits at various levels: indeed, since sensors forward exclusively packets produced within their cluster, the communication overhead is reduced and consequently the network lifetime is prolonged. Also, resourced devices (multi-processing hubs) realize heavy computations and aggregation of data reported by their cluster sensors, reducing by the way the energy consumption resulting from relaying redundant data.

Energy harvesting [4–7] in the context of WSN received increasing attention from research community. Indeed, enabling sensors to replenish their energy reserves, extends the WMSN's deployment lifetime and enlarges their application domains. Energy harvesting sources are various and encompass solar, wind and vibratory sources. In this work we will focus our attention on sensors equipped with photovoltaic cells that realize solar energy transformation into electric energy needed to recharge their batteries.

Energy consumption optimization remains a crucial issue even for energy harvesting WMSN. In fact, the harvested energy should be exploited optimally to cope with energy sources periodicity (day/night cycles) and unpredictability (wind activity/inactivity periods). Since, most of energy consumption is incurred at transceiver level, a balance should be found between conflicting objectives: maximizing the achieved throughput while reducing energy consumption and consequently extending the sensor's battery lifetime. Optimal energy management policies for energy harvesting sensors are considered in [8]. The discounted throughput is maximized over an infinite horizon, where queuing for data is also considered. In [9], the authors consider a binary power control problem: at each slot the wireless device could either transmit at a constant power or remain silent. The authors consider only the single user case and the optimal transmission policy is shown to be of a threshold type for the soft and strict delay constraint cases. The authors of [10] presented a decentralized power control with stochastic channel variation scheme. The proposed scheme considers a cost function that accounts for the QoS of each user and the interference to other users. The single user optimal policy is generalized to the multiple users scenario for the ergodic regime where the spreading factor and the number of users grow infinitely but their ratio remains constant. In [11], the authors consider a single transmitter-receiver scenario where the transmitter has a finite buffer, and solved the problem of dynamically assigning rates/powers to packets in order to minimize the long-term average transmission energy subject to an upper bound on the buffer overflow probability. The problem is formulated as a constrained Markov decision process (CMDP) and an analytical solution is given and proved to be monotone in queue length. The authors of [12] use an evolutionary game theoretic formulation to characterize the equilibrium policy for power allocation under channel un-certainty and delayed imperfect payoffs. A heterogonous learning framework that accounts for user and technology specificities is proposed. The authors of [13] address the problem of network resource allocation for energy-harvesting sensor platforms with time-varying battery recharging rates. They propose a joint approach that combines QuickFix for getting the optimal sampling rate and SnapIt that adapts the sampling rate with the objective of maintaining the battery at a target level. The considered networks are characterized by a special directed acyclic network graph (DAG) structure and the choice of a given rate implies a specific transmit power. In [14], an energy harvesting body sensor network formed by sensors with a two-state energy harvester device is considered. The authors develop policies based on the energy-error probability tradeoff to maximize successful transmission probability while minimizing probability of running out of energy. The developed strategies exploit the knowledge of the current energy level and the process governing event generation and battery recharge to select the appropriate transmission mode. The problem of throughput optimal energy allocation is studied for energy harvesting systems in a time constrained slotted setting in [15]. The structural properties of the optimal power allocation policy are obtained through dynamic programming and convex optimization. The optimal use of the harvested energy for different energy profiles and storage capabilities is discussed in [16] with the outage probability considered as a performance metric. The authors developed a discrete time Markov model of the evolution of battery and transmission state and provide optimal transmission strategy that minimizes outage probability.

Our objective is to define an optimal energy management policy for the centralized dynamic power control problem for energy harvesting WMSN. Thus, at each time slot, the transmission powers of all the sensors are fixed by the base station to maximize the expected system's throughput under minimum energy consumption.

The added value of our work covers the following points:

We formalize the centralized power control problem in the context of hierarchical energy harvesting WMSN as a complete information MDP.
A stochastic model for the discharge/recharge battery process for energy harvesting WMSN is provided.
We consider a novel utility function that balances the sojourn in a each battery state along with achieved throughput for a given transmit policy.
The structural properties of centrally computed optimal transmit policies are inspected.

This article is organized as follows: In Section 2, we give a mathematical formulation for dynamic power control problem in the context of energy harvesting WMSN. Optimal transmission policies are treated in Sections 3 and 4 for the single and multiple sensors scenarios. Then we present numerical simulation results in Section 5. Finally, we conclude the article and announce our future works in Section 6.

2 System model

We consider a WMSN formed by a set of sensors $N = {1, \dots, n}$ , under the authority of a single base station or gateway. Each sensor is equipped by its own battery and uses power to communicate with the base station either directly or through multimedia processing hubs. We discretize each sensor's battery capacity to several intervals that give a more coarse-grained description of the battery state, e.g., full, medium, discharged. The sensors are equipped with photovoltaic cells that make them capable of harvesting solar energy while undergoing the discharge process. The residual energy of the battery dictates available transmission powers and the achieved throughput is affected by the chosen transmit powers of other sensors. Time is discrete and at each time slot t, each sensor knows its own battery state, whereas the battery state of the other sensors and their actions remain unknown.

The formulated problem fits within the MDP framework with full information and infinite planning horizon, where the base station will compute and provide each sensor with its optimal transmission strategy, given the fact that it has access to all sensors' information i.e environment, battery and radio channel status. In order to be aware of each sensor battery level, we assume that time is slotted into virtual slots that encompasses several physical slots. the initial physical slots will be affected to the sensors to communicate their battery levels to the base station in a TDMA fashion. The remaining slots will serve for data exchange with interference possibility. We use three bits to code the energy level of each battery and thus, we could represent up to eight battery states.

2.1 Mathematical formulation

Denote by $w = {w_{t}}_{t \geq t_{0}}$ the set of environment states at each time slot post t₀. Thus, the power allocation dynamic problem for a sensor j could be modeled by an MDP:

Ω = {X^{j}, {(A^{j} (w, s_{j}))}_{s_{j} \in X^{j}, j \in N, w}, q^{j}, λ}

(1)

Where:

$X^{j} = {0, 1, 2, \dots, m - 1}$ is a finite set of states of sensor's j battery. A state of the battery represents some interval of percentage of the remaining energy. The energy level of the battery increases (respectively decreases) sequentially. Initially, the sensor's battery is in its highest state (m - 1), as sensors perform their affected tasks (event detection, packets forwarding,...) they consume energy and their batteries energy levels decrease sequentially. The harvested solar energy will be converted into electrical energy and will increase the sensors batteries residual energy level sequentially.
$\forall s_{j, t} \in X^{j}, A^{j} (w_{t}, s_{j, t}) = {p_{0}, \dots, p_{s_{j, t}}}$ is a finite set of available transmission powers for sensor j. This set satisfies A^j(w_t, s_{j, t}- 1) ⊂ A^j(w_t, s_{j, t}): more powers are available at higher states. A sensor makes a decision on its transmit power $p_{t}^{j} \in A^{j} (w_{t}, s_{j, t})$ based on its remaining energy s_{j, t}at time t.
q^jis the state transition probability of sensor's j battery. Given the state of environment w_t, the state s_{j, t}of the battery of j, the state c_{j, t}of the radio channel in the vicinity of j, and the actions of the others $p_{t}^{- j} = (p_{t}^{1}, p_{t}^{2}, \dots, p_{t}^{j - 1}, p_{t}^{j + 1}, \dots, p_{t}^{n})$ the new states are (s_{j, t+1}, c_{j, t+1}) with the probability $q^{j} ((s_{j, t + 1}, c_{j, t + 1}) | (s_{j, t}, c_{j, t}), p_{t}^{j}, p_{t}^{- j})$ .
The discount factor λ indicates for a user the decay in the gain value with the evolution of the time.

Let η_t = (w_t, s_{1, t}, c_{1, t},..., s_{n, t}, c_{n, t}) be the state profile of all the system at time t. The action profile of the system is: $p_{t} = (p_{t}^{1}, \dots, p_{t}^{n})$ , where $\forall j \in N, p_{t}^{j} \in A^{j} (w_{t}, s_{j, t})$ . When w_t = 0, the SINR of sensor j is null. For w_t ≠0, the SINR of sensor j is given by:

SIN R_{j} (η_{t}, p_{t}) = \frac{p_{t}^{j} h_{j} (w_{t}, s_{j, t}, c_{j, t})}{N_{0} + \sum_{i \neq j} p_{t}^{i} h_{i} (w_{t}, s_{i, t}, c_{i, t})}

(2)

where $h_{j} (w_{t}, s_{j, t}, c_{j, t}) p_{t}^{j}$ represents the power received at the base station or the multimedia processing hub given that states are s_{j, t}(respectively c_{j, t}) for the battery (respectively the radio channel in the vicinity) of sensor $j . p_{t}^{j}$ is the power level chosen by j and h_j (w_t, s_{j, t}, c_{j, t}) is a function of the channel state and others exogenous characteristics, N₀ is the variance of the noise. The throughput of sensor j at time t is an increasing function of the SINR_j (η_t, p_t).

\{\begin{matrix} Th p_{j} (η_{t}, p_{t}) = f^{j} (SIN R_{j} (η_{t}, p_{t})) \\ f^{j} (0) = 0 \end{matrix}

(3)

In the rest of the article, we consider f ^j to be Shannon capacity [17] and thus:

f^{j} (SIN R_{j} (η_{t}, p_{t})) = log (1 + SIN R_{j} (η_{t}, p_{t}))

(4)

2.2 Stochastic battery model for sensors with energy harvesting capabilities

The sensors harvest energy through a photovoltaic cell and use the scavenged power to recharge their batteries. Thus, the sensor battery will move from state s_{j, t}to state s_{j, t}+1 with probability $q^{j} (s_{j, t} + 1 | s_{j, t}, p_{t}^{j}) = p_{harvest}$ . The transmit power choice of j determines the transitions to the next state. Thus, when the sensor j selects a transmit power $p_{t}^{j} \in A^{j} (w_{t}, s_{j, t})$ , the new state of the battery is s_{j, t+1}with probability $q^{j} (s_{j, t + 1} | s_{j, t}, p_{t}^{j}), q^{j} (s_{j, t + 1} | s_{j, t}, p_{t}^{j}) = 0$ if s_{j, t+1 ∉ {}s_{j, t}+ 1, s_{j, t}, s_{j, t} -1_}. If the energy harvesting process is frozen for a long period of time (due to cloudy weather for example), the state 0 is reached (the battery is completely empty) and the sensor is considered to be out of service. Figure 2 shows the state transition probabilities of sensor's j battery.

The probability to move to the lower adjacent state increases with the energy consumption i.e $\forall p_{t}^{j}, p_{t}^{' j} \in A^{j} (w_{t}, s_{j, t})$ :

p_{t}^{j} > p_{t}^{' j} \Rightarrow q^{j} (s_{j, t} - 1 | s_{j, t}, p_{t}^{j}) > q^{j} (s_{j, t} - 1 | s_{j, t}, p_{t}^{' j})

(5)

At each time slot t, depending on the remaining energy off the battery, the sensor j makes a decision on its transmit power $p_{t}^{j}$ . Denote by the policy $d_{j}^{\infty} = (d_{0}^{j}, d_{1}^{j}, \dots, d_{m - 1}^{j})$ the collection of decision rules of that sensor under infinite planning horizon, where $\forall s_{i} \in X^{j}, d_{i}^{j} \in Δ (A^{j} (s_{i}))$ and Δ(A^j (s_i)) stands for the space of probability distribution over sensor's j action space for the battery state s_i.

The sojourn time $T^{j} (l, d_{j}^{\infty})$ in the state l under transmit policy $d_{j}^{\infty}$ can be expressed as:

\begin{aligned} T^{j} (l, d_{j}^{\infty}, t_{0}) & = \underset{t}{arg min} {t > t_{0} | s_{j, t} = l - 1, s_{j, t_{0}} = l, d_{j}^{\infty}} \\ = 1 + q^{j} (l | l, p_{t_{0}}^{j}) T^{j} (l, d_{j}^{\infty}, t_{0} + 1) . \end{aligned}

When $p_{t}^{j}$ depends only on the state of the battery but not on the time (stationary policy), the sojourn becomes:

T^{j} (l, d_{j}^{\infty}) = \frac{1}{q^{j} (l - 1 | l, d_{j}^{\infty}) + q^{j} (l + 1 | l, d_{j}^{\infty})}

(6)

3 Optimal policy for a single sensor

When considering a single sensor, the interferences are omitted from the SINR expression that becomes equivalent to the signal to noise ratio (SNR): $SN R_{j} (η_{t}, p_{t}) = \frac{p_{t}^{j} h_{j} (w_{t}, s_{j, t}, c_{j, t})}{N_{0}}$ and the sensor receives an immediate reward $r_{t} (s_{j, t}, p_{t}^{j})$ for choosing transmit power $p_{t}^{j}$ at slot t. The immediate reward reflects a balance between maximizing the expected sojourn in a given battery state and the corresponding achieved throughput for the chosen transmit power:

r_{t} (s_{j, t}, p_{t}^{j}) = (1 - \frac{1}{T^{j} (s_{j, t}, p_{t}^{j})}) log (1 + \frac{p_{t}^{j} h^{j}}{N_{0}})

(7)

Let ind: $X^{j} \to ℝ$ be a non-decreasing function on $X^{j}$ . We consider transition probabilities that adhere to the following expression:

q^{j} (s_{j, t}^{'} | s_{j, t}, p_{t}^{j}) = \{\begin{matrix} 1 - \frac{p_{t}^{j}}{p w (s_{j, t})} - \frac{P_{harvest}}{ind (s_{j, t})} & s_{j, t}^{'} = s_{j, t} \\ \frac{p_{t}^{j}}{p w (s_{j, t})} - \frac{(ind (s_{j, t}) - 1) p_{harvest}}{ind (s_{j, t})} & s_{_{j, t}}^{'} = s_{j, t} - 1 \\ _{P_{harvest}} & s_{j, t}^{'} = s_{j, t} + 1 \\ _{0} & else . \end{matrix}

(8)

In the discounted total reward problem the gains on the first stages are more important than the future ones. In particular, a gain acquired at time n is assumed to have a present value λⁿ r(s_j, p_j) where 0 < λ < 1 is a discount factor. Under an infinite planning horizon and due to its elegant theory, the ease in which it allows inclusion of constraints, and its facility for sensitivity analysis, linear programming formulation [18] is adequate to solve our MDP. We randomly choose a set of real constants ${α (s_{j})}_{s_{j} \in X^{j}}$ to be a distribution probability over the states of sensor's j battery. Therefore, the set of α (s_j) should respect the following constraint: $\sum_{s_{j} \in X^{j}} α (s_{j}) = 1$ . We also consider for every state s_j and available action P^j∈A^j (s_j) the linear program variable $x (s_{j}, p^{j}) = \sum_{n = 0}^{\infty} λ^{n} Prob (s_{j}, p^{j})$ that indicates the expected discounted time of the sensor's battery being in state s_j and making decision p^j. The linear program equivalent to the λ discounted MDP Ω is described by:

\{\begin{matrix} Maximize \sum_{s_{j} \in X^{j}} \sum_{p^{j} \in A (s_{j})} r (s_{j}, p^{j}) x (s_{j}, p^{j}) \\ Subject to \\ \sum_{p^{j} \in A^{j} (s_{j})} x (s_{j}^{'}, p^{j}) - \sum_{s_{j} \in X^{j}} \sum_{p^{j} \in A^{j} (s_{j})} λ q^{j} (s_{j}^{'} | s_{j}, p^{j}) x (s_{j}, p^{j}) = α (s_{j}^{'}) \\ \forall s_{j} \in X^{j}, \forall p^{j} \in A^{j} (s_{j}), x (s_{j}, p^{j}) \geq 0 \\ \sum_{{s^{'}}_{j} \in X^{j}} α (s_{j}^{'}) = 1 \\ 0 < λ < 1 . \end{matrix}

(9)

The solution of this linear program (LP) is obtained through application of the simplex method. After solving the LP above, we recover the optimal decision rules of the associated MDP by applying the rule [19]:

A discounted MDP has always a deterministic optimal decision rule [20] that we select based on the following criterion:
$\forall p^{l} \in A^{j} (s_{k}), d_{k}^{j} = p^{l} \Rightarrow x (s_{k}, p^{l}) > 0, s_{k} \in X^{j} .$
(10)

We argue that there exists structured decision rules for the MDP Ω.

Proposition 1. There exists optimal monotone non-decreasing decision rules on $X^{j}$ for the MDP Ω.

The detailed proof of Proposition 1 is given below:

Proof. Let $Q (k, l, p_{t}^{j}) = \sum_{i = k}^{i = m - 1} q^{j} (s_{i, t} | s_{l, t}, p_{t}^{j})$ , and $p_{t}^{j}, p_{t}^{' j}$ two transmit powers such as $p_{t}^{j} < p_{t}^{' j}$ :

(1)
The immediate reward r^j for sensor j is non-decreasing on $X^{j}$ for all $p_{t}^{' j} \in A^{j}$ .
(2)
The immediate reward r^j for sensor j is superadditive.
$\begin{gathered} r (s_{l, t} + 1, p_{t}^{' j}) + r (s_{l, t}, p_{t}^{j}) - r (s_{l, t} + 1, p_{t}^{j}) - r (s_{l, t}, p_{t}^{' j}) = \\ (\frac{1}{T^{j} (s_{l, t}, p_{t}^{' j})} - \frac{1}{T^{j} (s_{l, t} + 1, p_{t}^{' j})}) log (1 + \frac{p_{t}^{' j} h^{j}}{N_{0}}) + (\frac{1}{T^{j} (s_{l, t} + 1, p_{t}^{j})} - \frac{1}{T^{j} (s_{l, t}, p_{t}^{j})}) log (1 + \frac{p_{t}^{j} h^{j}}{N_{0}}) \\ = g (p_{t}^{' j}) - g (p_{t}^{j}) \end{gathered}$

Where $g : p_{t}^{j} \mapsto \frac{pw (s_{l, t} + 1) - pw (s_{l, t})}{pw (s_{l, t}) pw (s_{l, t} + 1)} p_{t}^{j} + \frac{ind (s_{l, t} + l) - ind (s_{l, t})}{ind (s_{l, t}) ind (s_{l, t} + 1)} p_{harvest}$ . Since the functions pw and ind are monotone non-decreasing we conclude that r is a superadditive function.

(3)
$Q (k, l, p_{t}^{j})$ is non-decreasing on $X^{j} : Let (s_{k}, s_{l}) \in X^{j} \times X^{j}$ ,
$\begin{gathered} s_{k} > s_{l} + 1 : Δ Q (k, l, p_{t}^{j}) = Q (k, l + 1, p_{t}^{j}) - Q (k, l, p_{t}^{j}) = Q (k, l + 1, p_{t}^{j}) = p_{harvest} \geq 0 . \\ s_{k} = s_{l} + 1 : \end{gathered}$
$\begin{aligned} Δ Q (k, l, p_{t}^{j}) & = Q (k, l + 1, p_{t}^{j}) - Q (k, l, p_{t}^{j}) \\ = \sum_{i = k}^{i = m - 1} q^{j} (s_{i, t} | s_{l, t} + 1, p_{t}^{j}) - q^{j} (s_{i, t} | s_{l, t}, p_{t}^{j}) \\ = q^{j} (s_{l, t} + 1 | s_{l, t} + 1) + p_{harvest} - p_{harvest} \geq 0 . \end{aligned}$

$s_{k} = s_{l} : q^{j} (s_{l, t} | s_{l, t}, p_{t}^{j})$ is a monotone non-decreasing function on X^j , thus:

\begin{aligned} Δ Q (k, l, p_{t}^{j}) & = \sum_{i = k}^{i = m - 1} q^{j} (s_{i, t} | s_{l, t} + 1, p_{t}^{j}) - q^{j} (s_{i, t} | s_{l, t}, p_{t}^{j}) \\ = q^{j} (s_{l, t} + 1 | s_{l, t} + 1, p_{t}^{j}) + q^{j} (s_{l, t} | s_{l, t} + 1, p_{t}^{j}) + p_{harvest} - q^{j} (s_{l, t} | s_{l, t}, p_{t}^{j}) - p_{harvest} \\ \geq q^{j} (s_{l, t} | s_{l, t} + 1, p_{t}^{j}) \geq 0 . \end{aligned}

s_k ≤ s_l - 1:

\begin{aligned} Δ Q (k, l, p_{t}^{j}) & = \sum_{i = k}^{i = m - 1} q^{j} (s_{i, t} | s_{l, t} + 1, p_{t}^{j}) - q^{j} (s_{i, t} | s_{l, t}, p_{t}^{j}) \\ = 1 - 1 = 0 . \end{aligned}

(4)
$Q (k, l, p_{t}^{j})$ is globally superadditive on $X^{j} \times A^{j}$ , denote by $ϑ = Q (k, l + 1, p_{t}^{' j}) + Q (k, l, p_{t}^{j}) - Q (k, l + 1, p_{t}^{j}) - Q (k, l, p_{t}^{' j})$ :

s_k > s_l + 1: ϑ = P_harvest - P_harvest = 0.

s_k = s_l + 1:

\begin{aligned} ϑ & = p_{harvest} + q^{j} (s_{l + 1} | {s_{l}}_{+ 1}, p_{t}^{' j}) + p_{harvest} - q^{j} (s_{l + 1} | s_{l + 1}, p_{t}^{j}) - p_{harvest} - p_{harvest} \\ \leq 0 . \end{aligned}

\begin{gathered} s_{k} = s_{l} : ϑ = 1 + q^{j} (s_{l} | s_{l}, p_{t}^{j}) + p_{harvest} - 1 - q^{j} (s_{l} | s_{l}, p_{t}^{' j}) - p_{harvest} \geq 0 \\ s_{k} \leq s_{l} - 1 : ϑ = 1 - 1 = 0 . \end{gathered}

We conclude by virtue of Theorem 6.11.6 of [20] that deterministic optimal monotone non-decreasing policies over the set X^j exists for the MDP (1).

4 Optimality for all the sensors batteries lifetime

Under the assumption that each sensor uses CDMA with mutual orthogonal codes to communicate with the base station or the multimedia processing hubs, sensors transmissions do not interfere. Therefore, the overall policy $(d_{1}^{\infty *}, \dots, d_{n}^{\infty *})$ realized when every sensor adopts its optimal transmit power happens to be the optimal transmit policy for the overall system noted d^∞*.

For the general case: non-orthogonal codes are used and sensors transmissions do interfere. The WMSN is modeled by the MDP: $Ω^{+} = {X, {(A (w_{t}, S_{t}))}_{S_{t} \in X, w}, Q, λ}$ with full information. We extend the previously formulated mathematical model to account for multiple sensors and denote the augmented states and actions spaces respectively $X = \prod_{k = 1}^{n} X^{k}$ and $A = \prod_{k = 1}^{n} A^{k}$ . The transition probability from state S_tto S_t+1for the power profile P_tis given by the formula:

Q (S_{t + 1} | S_{t}, P_{t}) = \prod_{k = 1}^{n} q (s_{k, t + 1} | s_{k, t}, p_{t}^{k}), (s_{k, t + 1} | s_{k, t}) \in X^{k} \times X^{k} .

(11)

With each sensor utility expressed as follows:

r_{t} (s_{j, t}, p_{t}^{j}) = (1 - \frac{1}{T^{j} (s_{j, t}, p_{t}^{j})}) \times log (1 + \frac{p_{t}^{j} h^{j}}{N_{0} + \sum_{k \neq j} p_{t}^{k} h^{k}})

(12)

The immediate reward for the network becomes:

R_{t} (S_{t}, P_{t}) = \sum_{j = 1}^{n} r_{t} (s_{j, t}, p_{t}^{j})

(13)

We reconsider the LP in (9) for the augmented MDP:

\{\begin{matrix} Maximize \sum_{S \in X} \sum_{a \in A (S)} R (S, a) x (S, a) \\ Subject to \\ \sum_{a \in A (S)} x (S^{'}, a) - \sum_{S \in X} \sum_{a \in A (S)} λ Q (S^{'} | S, a) x (S, a) = α (S^{'}) . \\ \forall S \in X, \forall a \in A (S), x (S, a) \geq 0 \\ \sum_{S^{'} \in X} α (S^{'}) = 1 \\ 0 < λ < 1 . \end{matrix}

5 Numerical investigations

We discretize each battery residual energy capacity to five states: near full, high, medium, low and discharged. The states set is: {0, 1, 2, 3, 4} and the transmit power panel for each state is detailed in Table 1:

Table 1 Transmit power panel

Full size table

Our objective is to characterize the optimal transmit policy for a single sensor with a discount factor λ = 0.6. We solve the associated LP through the simplex algorithm to obtain the optimal policy summarized below:

σ_{j}^{*} = (0 \to P_{0}, 1 \to P_{0}, 2 \to P_{2}, 3 \to P_{3}, 4 \to P_{4}) .

Table 2 describes the optimal transmit policy for a network formed by three sensor with three states {0, 1, 2} for a λ = 0.6 discounted Ω⁺ under infinite horizon planning. We notice that sensors tend to use their lowest available transmission powers as using higher ones result in reduced throughput due to interference and rapid depletion of their batteries.

Table 2 Optimal transmit power

Full size table

6 Concluding remarks

In this article we considered the problem of dynamic centralized power allocation for energy harvesting WMSN. We focus on solar powered sensors and provide a stochastic model for the associated battery discharge/recharge process. The dynamic power control problem was formulated as a MDP and the structural properties of optimal transmission policies established. We plan, in a near future, to generalize our approach for the decentralized case with partial channel information using stochastic game theory.

References

Akyildiz I, Melodia T, Chowdhury K: A survey on wireless multimedia sensor networks. Comput Netw 2007, 51(4):921-960. 10.1016/j.comnet.2006.10.002
Article Google Scholar
Almalkawi I, Guerrero Zapata M, Al-Karaki J, Morillo-Pozo J: Wireless multimedia sensor networks: current trends and future directions. Sensors 2010, 10(7):6662-6717. 10.3390/s100706662
Article Google Scholar
Fowler K: The future of sensors and sensor networks survey results projecting the next 5 years. In Proc Sensors Applications Symposium, 2009 (SAS 2009). New Orleans, LA, USA; 2009:1-6.
Chapter Google Scholar
Sudevalayam S, Kulkarni P: Energy harvesting sensor nodes: survey and implications. IEEE Commun Surv Tutor 2010, PP(99):1-19.
Google Scholar
Seah W, Eu Z, Tan H: Wireless sensor networks powered by ambient energy harvesting (WSN-HEAP)-Survey and challenges. In Proc 1st International Conference on Wireless Communication, Vehicular Technology, Information Theory and Aerospace & Electronic Systems Technology, 2009. Wireless VITAE 2009. Aalborg, Danemark; 2009:1-5.
Chapter Google Scholar
Kansal A, Srivastava M: An environmental energy harvesting framework for sensor networks. Proceedings of the 2003 international symposium on Low power electronics and design 2003, 481-486.
Google Scholar
Kansal A, Hsu J, Zahedi S, Srivastava M: Power management in energy harvesting sensor networks. ACM Trans Embed Comput Syst (TECS) 2007, 6(4):32-66. 10.1145/1274858.1274870
Article Google Scholar
Sharma V, Mukherji U, Joseph V, Gupta S: Optimal energy management policies for energy harvesting sensor nodes. IEEE Trans Wirel Commun 2010, 9(4):1326-1336.
Article Google Scholar
Wang H, Mandayam N, Goodman DJ, Lig-das P: Dynamic power control under energy and delay constraints. In Proc Global Telecommunications Conference, 2001 (GLOBECOM '01). San Antonio, TX, USA; 2001:1287-1291.
Google Scholar
Chamberland J, Veeravalli V: Decentralized dynamic power control for cellular CDMA systems. IEEE Trans Wirel Commun 2003, 2(3):549-559. 10.1109/TWC.2003.811186
Article Google Scholar
Ata B: Dynamic power control in a wireless static channel subject to a quality-of-service con-straint. Operat Res 2005, 53(5):842-851. 10.1287/opre.1040.0188
Article MathSciNet MATH Google Scholar
Tembine H, Kobbane A, El Koutbi M: Robust power allocation games under channel uncertainty and time delays. In Proc Wireless Days (WD) (2010 IFIP). Venice, Italy; 2010:1-5.
Chapter Google Scholar
Liu R, Sinha P, Koksal C: Joint energy management and resource allocation in rechargeable sensor networks. INFOCOM, 2010 Proceedings IEEE 2010, 1-9.
Google Scholar
Seyedi A, Sikdar B: Energy efficient transmission strategies for body sensor networks with energy harvesting. IEEE Trans Commun 2010, 58(7):2116-2126.
Article Google Scholar
Ho C, Zhang R: Optimal energy allocation for wireless communications powered by energy harvesters. IEEE International Symposium on Information Theory Proceedings (ISIT) 2010, 2368-2372.
Google Scholar
Medepally B, Mehta N, Murthy C: Implications of energy profile and storage on energy harvesting sensor link performance. Global Telecommunications Conference, 2009. GLOBECOM 2009. IEEE 2009, 1-6.
Chapter Google Scholar
Fallgren M: On the complexity of maximizing the minimum Shannon capacity in wireless networks by joint channel assignment and power allocation. In Proc 18th IEEE International Workshop on Quality of Service (IWQoS 2010). Beijing, China; 2010:1-7.
Chapter Google Scholar
Schrijver A: Theory of Linear and Integer Programming. Wiley, New York; 1986.
MATH Google Scholar
Maros I: Computational Techniques of the Simplex Method. Volume 61. Springer, New York; 2003.
MATH Google Scholar
Puterman M: Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York; 1994.
Book MATH Google Scholar

Download references

Author information

Authors and Affiliations

Laboratoire des Systèmes d'Information Mobiles et Embarqués (SIME)/Mobile Intelligent System research group, Mohammed V-Souissi University, ENSIAS, Madinat Al Irfane, BP 713, Agdal, Rabat, Morocco
Mohammed-Amine Koulali, Abdellatif Kobbane & Mohammed El Koutbi
Ecole Nationale des Sciences Appliquées d'Oujda (ENSAO), Mohammed I University, Oujda, Morocco
Mohammed-Amine Koulali
Department Telecommunications, Supelec,3, rue Joliot-Curie 91192, Gif Sur Yvette, Cedex, France
Hamidou Tembine
University of Paris 13, 99 Avenue Jean-Baptiste Clement, 93430, Villetaneuse, France
Jalel Ben-Othman

Authors

Mohammed-Amine Koulali
View author publications
You can also search for this author in PubMed Google Scholar
Abdellatif Kobbane
View author publications
You can also search for this author in PubMed Google Scholar
Mohammed El Koutbi
View author publications
You can also search for this author in PubMed Google Scholar
Hamidou Tembine
View author publications
You can also search for this author in PubMed Google Scholar
Jalel Ben-Othman
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mohammed-Amine Koulali.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Koulali, MA., Kobbane, A., El Koutbi, M. et al. Dynamic power control for energy harvesting wireless multimedia sensor networks. J Wireless Com Network 2012, 158 (2012). https://doi.org/10.1186/1687-1499-2012-158

Download citation

Received: 01 October 2011
Accepted: 01 May 2012
Published: 01 May 2012
DOI: https://doi.org/10.1186/1687-1499-2012-158

Dynamic power control for energy harvesting wireless multimedia sensor networks

Abstract

1 Introduction

2 System model

2.1 Mathematical formulation

2.2 Stochastic battery model for sensors with energy harvesting capabilities

3 Optimal policy for a single sensor

4 Optimality for all the sensors batteries lifetime

5 Numerical investigations

6 Concluding remarks

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

About this article

Cite this article

Share this article

Keywords