Adaptive Power Allocation in Wireless Sensor Networks with Spatially Correlated Data and Analog Modulation: Perfect and Imperfect CSI

. We address the problem of power allocation in a wireless sensor network where distributed sensors amplify and forward their partial and noisy observations of a Gaussian random source to a remote fusion center (FC). The FC reconstructs the source based on linear minimum mean-squared error (LMMSE) estimation rule. Motivated by the availability of limited energy in the sensor networks, we undertake the design of power allocation based on minimization of the reconstruction distortion subject to a constraint on the network transmit power. The design is based on the following two cases: (i) exact knowledge of the channel gains and (ii) the estimates of the channel gains. We show that the distortion can be represented as a convex function of the transmit powers of the sensors. Moreover, we show that the power allocation based on this distortion function does not bear any closed form solution. To this end, we propose a novel design based on the successive approximation of the LMMSE distortion, which turns out to be simple, computationally e ﬃ cient, and exhibits excellent convergence properties. The simulation examples illustrate that the proposed design holds considerable performance gain compared to a uniform power allocation scheme.


Introduction
Wireless sensor networking is an emerging technology which finds application in many fields including environment and habitat monitoring, health care, automation, military applications such as battlefield monitoring and surveillance, and underwater wireless sensor networks (UWSNs) for marine environment monitoring [1,2].A wireless sensor network (WSN) consists of spatially distributed sensors that cooperatively monitor physical or environmental conditions, for example, temperature, vibration, pressure, motion, or pollutants.
We consider a system in star topology where sensors amplify and transmit their noisy observations of a common source, via some orthogonal multiple access scheme such as frequency division multiple-access (FDMA), to a central processing unit called fusion center (FC) which reconstructs the source in a way that the overall distortion (e.g., mean squared error) be minimized.Conceptually, the system is similar to the CEO problem [3,4].
The sensors in the network have partial and spatially correlated observations of the underlying source.The correlation exists where sensors measure data in same geographical location, for example, acoustic sensors that are sensing a common event produce measurements that are correlated.In addition, observation noise and communication channel may not have same conditions across the sensors.Therefore, transmission of the observations based on uniform power allocation is not an optimal strategy.
In this paper, we study the problem of adaptive power allocation given a network power constraint with the objective to minimize the reconstruction MSE.The optimal power allocations are jointly determined at the FC which are then conveyed to the individual sensors via feedback channels.The communication channels from the sensors to the FC experience independent flat fading.The channel from the sensor to the FC is usually estimated using some training sequence.The receiver noise and the limited available power means that the channel estimation always incurs some estimation error.Consequently, the design of 2 EURASIP Journal on Wireless Communications and Networking power allocation scheme should also take into account the channel estimation errors [5,6].In this paper, first we design the power allocation scheme based on perfect knowledge of the channel state information (CSI) and subsequently, in the design, we incorporate the effect of imperfect CSI.
In a sensor network measuring a memoryless Gaussian source uncoded transmission, that is, amplify and forward (AF), outperforms the separate coding and transmission over the multiple-access channel [7][8][9].Motivated by this result, Vuran et al. in [10] considered the estimation of a random source with distributed sensors and suggested a sensor selection procedure which exploits the spatial correlation to minimize the estimation error (based on the LMMSE estimation criterion).The sensor selection procedure suggests that the sensors with high correlation with the source and low cross-correlations should be selected.The procedure does not take into account the fact that even if a sensor has high correlation with the source and low cross-correlations with the other sensors, it can still be a bad selection in terms of energy efficiency if its observation noise is high and/or the communication channel to the FC is bad.A recent related work appears in [11].Bahceci and Khandani in [12] proposed a power allocation scheme where each sensor observes a separate source albeit correlated.Reference [13] presented a power scheduling scheme for sensor networks to detect a source based on the binary hypothesis testing rule which exploits the correlation in the observation noises at the sensors.Other works like [14][15][16][17][18] proposed power allocation schemes for parameter estimation in wireless sensor networks without considering the spatial correlation.In this paper, we present a novel framework which incorporates adaptive power allocation (APA) in the network by taking into account the spatial correlation and cross-correlations of the observations, observation quality, and communication channel to the FC.The power allocation design also takes into account the channel estimation errors.
We assume that the FC reconstructs the underlying source using linear minimum mean squared error (LMMSE) estimation rule.The power allocation design is based on minimization of the reconstruction distortion subject to a constraint on the total transmit power of the sensors.Due to the spatial correlation among the sensor observations, the design of the power allocation scheme based on the given optimization problem presents a unique challenge because the LMMSE estimation/reconstruction error of the underlying source contains nonlinearly coupled optimization variables.Herein, first we prove that the estimation distortion can be represented as a convex function of the sensor transmit powers, then we show that the power allocation design based on this distortion function turns out to be complicated and does not bear a closed solution.Subsequently, we propose a novel design based on the successive approximation of the LMMSE estimation distortion.The resulting power allocation algorithm is simple, computationally efficient, and exhibits excellent convergence properties.The proposed designs hold considerable performance gain compared to a uniform power allocation scheme.To the best of our knowledge, in the present literature, there is no such work on the design of power allocation for the sensor network under consideration which jointly exploits spatial correlation, observation noises, channel gains, and their estimation errors.
The rest of the paper is organized as follows.Section 2 describes the system set-up.The power allocation problems and their solutions are presented in Sections 3 and 4, respectively for perfect and imperfect knowledge of the CSI.Section 5 evaluates performance of the power allocation designs.Section 6 concludes the work.

System Model
Consider the system model shown in Figure 1 in which N spatially distributed sensors observe an unknown zeromean real Gaussian random source s ∼ N (0, σ 2 s ), and communicate with the fusion center (FC) via orthogonal multiple-access channels.Each sensor has a partial and noisy observation of the source, and sends an amplified version of it to the FC.The FC collects the signals from all sensors and reconstructs the source according to a given fidelity criterion, for example, minimum mean-squared estimation error.The s i ∼ N (0, σ 2 si ), and n i ∼ N (0, σ 2 ni ), respectively, denote the partial observation of the source s and the noise corrupting this observation such that the noisy observation at sensor i is ( The estimation of the source is done on a sample by sample basis, and its procedure is same for all samples.Therefore, for clarity, in the subsequent formulation we drop the time index.We assume that the sensors amplify and forward their observations to the FC via orthogonal channels where each channel experiences flat fading independent over time and across sensors.The optimality of the AF scheme is established for the Gaussian network with nonorthogonal multiple-access channel from the sensors to the FC [7].However, for the network with orthogonal multiple-access channel it has been shown in [19,20] that the separate source channel coding outperforms the AF scheme.The optimality of the coded source-channel communication in general requires coding over long block lengths and will require some data processing at the sensors.This will increase the power consumption at the sensors and will lead to longer processing delays which may not be tolerable in many applications.Therefore, due to simplicity, low latency, and ease of implementation, in this paper we adopt the AF transmission strategy.
The received signal at FC from sensor i is where P i is a amplifying factor and w i ∼ CN (0, σ 2 wi ) is a circularly-symmetric Gaussian receiver noise.The fading channels {h i } N i=1 between the sensors and the FC are h i ∼ CN (0, σ 2 hi ), for all i with gain factors {g i = |h i |} N i=1 which are Rayleigh distributed.Noting that h i = g i e jθh j , we can write (2) as where the exponential term e − jθh j can be merged into the variable w i without changing its statistical properties-due to the circular-symmetry property of w i [21].Since the underlying source s and the noisy observation x i = s i + n i are real-valued, therefore, we only need to consider the component of the noise w i which is in-phase with the observation x i , that is, where w i ∼ N (0, σ 2 wi ) and σ 2 wi = 0.5σ 2 wi .For the analysis in this work, we assume that the observation noise n i , ∀i (similarly the receiver noise w i , ∀i) is independent across the sensors and is also independent of w i , ∀i (n i , ∀i).Moreover, we assume that the source s, the observation s i at sensor i, the observation s j at sensor j, the observation noise n i at the sensor, and the receiver noise w i at the FC are jointly Gaussian across sensors (∀i and ∀ j) with zero mean and covariance (Λ s,si,sj ,ni,wi ) specified by We also assume that the samples of s, s i , n i , and w i are individually independent in time.
In (5), the correlation coefficient ρ si represents the correlation between s and s i and the coefficient ρ i j denotes the correlation between s i and s j .The values of these correlation coefficients depend on the distance of the sensors w.r.t. the position of the source s and w.r.t. each other, respectively, and can be characterized as follows: Cov S, S i σ s σ si = e −(dsi/θ1) θ 2 , which is a power exponential model for correlation [10,22].In (6), d si is the distance between the source s and sensor i, and d i j is the distance between the sensors i and j.
The parameter θ 1 > 0 controls how fast the correlation decays with distance and is called range parameter.The other parameter θ 2 is called a smoothness or roughness parameter which is 0 < θ 2 ≤ 2. Equation (6) shows that the correlation decays with distance with limiting values of 1 and 0 as d si (d i j ) → 0 and d si (d i j ) → ∞, respectively.Therefore, the correlation changes with the change in the elative positions of the source and the sensors.The change may happen due to movement of either the sensors or the source or both, for example, an animal may kick a sensor node to a different location.We assume that the relative positions of the sensors with respect to each other and the underlying source are perfectly known.Moreover, we assume that the positions remain unchanged for at least one estimation cycle.
Based on the correlation model, therefore, we can say that the FC in essence is interested to reconstruct the source s which is located at a specific location by collecting observations from spatially distributed sensors where correlation of the observations with the source and among the sensors, respectively, depends on the spatial location of the sensors w.r.t. the source and w.r.t. each other.Note that the location of the source and the sensors can be in two or three dimensional space.
The equation ( 4) can be written equivalently in matrixvector notation as follows: where [• • • ] T denotes the matrix-vector transpose operation.At the FC, the optimal estimator in minimum meansquared error (MMSE) sense is the conditional mean of s given the observation z, that is, , where E denotes the mathematical expectation.Under the jointly Gaussian assumption of the s and z, the conditional mean estimator turns out to be linear and is called linear minimum meansquared error estimator (LMMSEE).Therefore, we seek the estimate of the source like s = N i=1 a i z i = a T z which can be written as [23] where a T = [a 1 , . . ., a N ] = c sz C −1 z is a row-vector of LMMSEE weighting coefficients.The resultant distortion of the estimate s in comparison to the original signal s is measured by the mean-squared error and is given by [23] where The estimation distortion D can also be written as follows: which is obtained by using the Woodbury identity for matrix inversion [24] (see Appendix A).Let Y = HC −1 w H, where ).Now we can write (10) as Theorem 1.The estimation distortion function in (11) is convex over P i , for i = 1, . . ., N.
Let p = [P 1 , . . ., P N ] T be a vector of the transmit power of the sensors.The proof of the theorem consists in showing that the Hessian of the distortion function in (11) with respect to p is positive semidefinite.To this end, a detailed proof is given in Appendix B.
Remark 1.The reconstruction distortion is upper bounded by the variance of the source σ 2 s , and lower bounded by the variances of the observation noises and spatial correlation and cross-correlation values as given by The lower-bound distortion is achieved when the observations of the sensors are received at the FC via ideal communication channels which can be verified by setting g 2 i P i /σ 2 wi = ∞, for all i in (11).Note that it is not possible to achieve distortion less than The achieved distortion is equal to the upper bound value (i.e., D = σ 2 s ) when either no signal is received at the FC from the sensors or the observations are uncorrelated with the source s or both, which can be verified from (11).

Power Allocation with Perfect CSI
In this section, we assume that the channel state information (CSI), that is, the channel gains {h i } N i=1 are perfectly known at the FC.The case of imperfect CSI is considered in the next section.

Minimization of the Distortion Subject to the Power
Constraint.We base our adaptive power allocation design on the following optimization problem.
Prob.Minimize the distortion D(P 1 , . . ., P N ) subject to where i denotes the total power in the transmitted signal of sensor i.The sum power constraint in (13) enables a fair comparison between the networks of different sizes.Moreover, for a sensor network which forms part of a bigger network where each subnetwork performs different sensing task but share the same frequency band to transmit observations, to limit the interference between the subnetworks, the total power emitted from each subnetwork is upper bounded.Furthermore, recent studies have shown that the ICT (Information and Communication Technology) power consumption is a significant contributor to the global warming [25].Therefore, in the context of sensor networks, putting cap on the total power consumption conserves energy and limits the contribution to the global warming.
Since the optimization problem in ( 13) is convex (the objective is convex and the constraints are linear), therefore we can use the Lagrangian method of multipliers to find the optimal P i 's [26].The Lagrangian cost function is where λ and {μ i } N i=1 are dual variables or Lagrange multipliers.The associated Karush-Kuhn-Tucker (KKT) conditions are where J i is a diagonal matrix with unity at (i, i)th place and all other elements are equal to zero.The expression in ( 15) is a complicated function of the optimization variables.Therefore, a closed form solution for this problem is not tractable.However, we can resort 1: Initialize λ [0] and P i [0] for i = 1, . . ., N 2: Set κ = 0 3: While (|D [κ] − D [κ−1] | ≥ ) do where κ denotes the while loop iteration index 4: κ = κ + 1 5: Find P i [κ] , i = 1, . . ., N, by numerically solving (15), for example, using bisectional search method 6: Update λ using gradient method as follows: to numerical methods (e.g., bisectional search over P i 's in (15) and gradient method to update λ) to find the optimal P i , i = 1, . . ., N in an iterative manner as outlined under Algorithm 1. From (17), note that the active sensors P i > 0 have corresponding Lagrangian multipliers μ i = 0.The sensors with P i = 0 are removed from the system.The parameter α [κ] in * denotes the step-size.Since the objective function of the optimization problem is convex and bounded and the constraints are linear, therefore the algorithm can achieve convergence to the absolute minimum (the KKT point) of the problem provided the step-size α [κ]  is selected properly [27].Unfortunately, Algorithm 1 will be computationally quite expensive (unless the network size is small) due to (i) a number of matrix inversions involved in (15) while numerically searching for P i , i = 1, . . ., N in each iteration; (ii) the dependence of the convergence properties on the step-size α [κ]  [27,28].
In the sequel, based on a successive approximation (SA) principle, we present a novel quasianalytical solution of the optimization problem which is simple and the associated algorithm is computationally efficient compared to Algorithm 1, exhibits remarkable convergence properties, and achieves distortion very close to the global optimum of the Algorithm 1 with no appreciable performance gap.This so-called SA-based design can be viewed as the joint optimization of the transmit powers and the modified LMMSE coefficients as we will see in the subsequent development.
According to the idea of successive approximation, a modified function is constructed from the given function in some special way [29][30][31].Then that modified function is solved iteratively/successively to find the solution for the underlying problem.The solution obtained by the SA approach can be viewed as quasianalytical solution.We apply the idea of successive approximation to the reconstruction distortion function and solve the problem of power allocation in the sensor network.To this end, at the FC, to form the estimate s = N i=1 a i z i of the source s and to characterize the resultant mean-squared distortion D, we proceed as follows.
We can write the distortion D as and by solving ∂D/∂a i = 0, we get the following expression for the LMMSE weighting coefficients: The variables β i and γ i are, respectively, defined as follows: where 20) and ( 21), the distortion in (18) simplifies to Equation ( 21) forms a set of N coupled equations which constitute the Wiener-Hopf equation for the LMMSE filter coefficients (β i , for all i).If we know the transmit powers then for given covariance Λ s,si,sj ,ni,wi and the channel gains {g i } N i=1 , we can find the coefficients {β i } N i=1 by solving (21).For the solution, it is convenient to employ the matrixvector form as follows: where [R] i j = γ j g j P j σ si σ sj ρ i j for i / = j, and [R] i j = 1 for i = j.Note that [R] i j denotes the (i, j)th element of the matrix R.However, the point here is that we do not know the transmit powers {P i } N i=1 .The following subsection presents an alternative solution to the problem of optimizing the transmit powers of the sensors under the network-wide power constraint such that the distortion D be minimized.Therein, to derive an algorithm for the solution of P i , the underlying idea is to assume β as constant.Based on this assumption, we derive an iterative algorithm which computes β using the values of {P n } N n=1 from the previous iteration.This successive approximation (SA) of the distortion function in (22) makes the solution of the power allocation problem simple and easy to compute as will be seen in the ensuing development.Note that the resulting design for power allocation can be viewed as a joint optimization of {β i } N i=1 and {P i } N i=1 .

Minimization of the Distortion Subject to the Power
Constraint-SA.Herein, we solve the optimization problem in (13) based on the distortion function D in ( 22) and using the successive-approximation principle outlined in the preceding subsection.For given β i ≥ 0, it is easy to verify that the distortion function is convex with respect to the optimization variables P i , i = 1, . . ., N. Therefore, the KKT conditions are sufficient for optimality [26] which are given as follows: Solving (24) for active sensor i (i.e., P i > 0, μ i = 0 from (26)), we get for i = 1, . . ., N, where ζ i : = g 2 i /σ 2 wi denotes the channel SNR for sensor i and (x) + : = max(x, 0).Based on (27), following observations are in order.
(1) There exists a cut-off value i , the power allocation policy follows waterfilling on channel SNR, that is, P i increases with increasing ζ i ; and for ζ i > ζ (o)  i , the power allocation is according to inversion in the channel SNR, that is, increasing ζ i decreases P i .
(2) The sensors with higher observation noise variances are given less power.For sensor i, in the limiting case The sensors with weak correlation with the source are allotted less power.For instance, if ρ si → 0 then P i → 0.
Combining the aforementioned points we can see that the final power allocation policy for the sensors depends on the spatial correlations, variance of the observation noises, and the channel SNRs.Moreover, we see that depending on the values of these system parameters some of the sensors may be switched-off altogether.For sensor i to be active, following condition must hold: which stems from the fact that P i > 0 if sensor i is active.Let K denotes the set of active sensors defined as follows: Since the problem is convex, the minimum of the objective function occurs at the sum power constraint boundary, that is, the constraint is active.Therefore, the transmit powers P k , k ∈ K must satisfy the power constraint with equality, that is, k∈K P k σ 2 k = P tot , which gives Based on the solution from ( 27) through ( 30), Algorithm 2 can be proposed which iteratively optimizes the transmit powers {P i } N i=1 and the variables {β i } N i=1 , while minimizing the reconstruction distortion subject to the power constraint.If during iterations any sensor does not fulfill the condition in (28), it is switched off and the algorithm continues with the remaining sensors until the convergence criterion is fulfilled.Regarding the convergence properties of the algorithm, consider the following.
(i) Since in each iteration (successive approximation) we are minimizing a convex function over the convex-set of the transmit powers {P i | N i=1 P i σ 2 i ≤ P tot and P i ≥ 0, i = 1, . . ., N}, therefore the optimality of the transmit powers {P i } N i=1 in each approximation (for given {β i } N i=1 ) combined with the optimality of {β i } N i=1 for given {P i } N i=1 as per (23) means that the algorithm achieves monotonic decrease in the distortion, that is, and consequently it does converge to a unique minimum point.
(ii) The algorithm consistently arrives at the same combination of the transmit power tuple (P 1 , . . ., P N ) and achieves the same minimum distortion for a wide range of different initialization points.In other words, we can say that the algorithm exhibits start point independence (for a wide range of initialization points).Moreover, the algorithm asymptotically ) do where κ denotes the while loop iteration index 5: κ = κ + 1 6: For i = 1, . . ., N determine transmit power as follows: 7: if Condition in ( 28) is true then 8: Determine P i [κ] from (27) 9: else 10: P i [κ] = 0 11: end if 12: For i = 1, . . ., N update β [κ]  i from (23) 13: Calculate D [κ]  14: end while Algorithm 2 achieves the lower-bound distortion D 0 with increasing transmit power P tot .In Section 5, we illustrate the monotonic decrease, start point insensitivity, and the asymptotic convergence to D 0 with several simulation examples.There we also show that the convergence may be achieved in as few as two or three iterations.
(iii) We have shown that the original problem is convex and therefore the objective function has a global minima under the power constraint which can be achieved by Algorithm 1.Now the question is how closely does the successive approximation-based algorithm converge to the global minimum value?
The simulation examples in Section 5 show that the distortion achieved by both algorithms are extremely close and the performance gap between the fulloptimization and the successive approximation based algorithms is virtually negligible.
It is a quite remarkable that the algorithm exhibits such excellent convergence properties which illustrates that the proposed successive approximation strategy works quite well.Finally, compared to the power allocation Algorithm 1, the ease of computation and simplicity of the design based on the successive approximation principle can be appreciated from the simple and elegant structure of ( 27)- (30).

Power Allocation with Imperfect CSI
Heretofore, we have assumed perfect knowledge of the channel gains {h i } N i=1 .However, in practice, we have estimates { h i } N i=1 of the actual channel gains.One way to estimate the channel is by a training sequence whereby each sensor transmits a known sequence of data symbols called pilots.Then based on the received data, the FC estimates the channel.Let t i denote the pilot symbol transmitted by sensor i in the channel estimation phase.The corresponding received signal is r i = h i t i + w i and based on which the LMMSE estimate h i of h i is where (• • • ) * denotes the complex conjugate operation.The variance of the estimation error Δh i : wherein |t i | 2 is power of the transmitted pilot.Note that the variance of channel estimation error is finite for finite |t i | 2 and σ 2 wi .The actual channel can be represented as a sum of the estimate and the estimation error, that is, where Δh i ∼ CN (0, δ 2 i ).Such an approach to model the channel estimation error can be viewed as the Bayesian approach [6].
One way to design the power-scheduling scheme is by replacing h i and g i , respectively, by h i and g i in the formulations of the foregoing section.This constitutes a naive-approach because it ignores the error in the channel estimate.An alternative design originates by substituting (34) in (2) as follows: in which u i can be viewed as total receiver noise corresponding to sensor i with E {si,ni,wi,Δhi} [u i ] = 0, wi , for all i, and E {si,sj ,ni,nj ,wi,w j ,Δhi,Δhj } [u i u * j ] = 0, for all i / = j.Noting that h i = g i e jθ h i , we can write where the exponential term e − jθ h i can be absorbed into u i , that is, into the Gaussian variables Δh i and w i without changing their statistical properties-thanks to their circular symmetry.Since the underlying source s and the observation s i + n i are real-valued, as a consequence only the part of the noise u i in-phase with the sensor observation is relevant for estimation of the source s.Therefore, we can write where . Following a procedure similar to Section 3, it can be shown that the mean-squared reconstruction distortion of the estimate s = N i=1 a i z i with respect to s is given by where β j γ j g j P j σ si σ sj ρ i j , (39) The solution of the optimization problem in (13) with the objective to minimize the distortion D defined in (38) subject to the constraint on the total network power can be obtained by using the method of Lagrangian multipliers and is outlined as follows.
(1) The power allotted to sensor k is for k = 1, . . ., N, where ζ k : = g 2 k /σ 2 wk defines the channel SNR based on the channel estimate.
(2) For sensor k to be active, that is, P k > 0, the following condition: must hold, otherwise it is switched-off.
(3) The index-set K of the active sensors is which is determined such that the power constraint be satisfied with equality.
Based on (40)-(43), the power allocation for the sensors can be obtained using the procedure outlined under Algorithm 2. (The power allocation design under Algorithm 1 can similarly be extended to the imperfect CSI case.)Note that the convergence properties of the algorithm with perfect CSI also applies to the imperfect CSI case.Moreover, the above power allocation design exhibits robustness to the channel estimation errors compared to the naive approach as shown in the subsequent section.

Performance Evaluation and Discussion
Through simulation examples, this section corroborates the analytical findings and illustrates the effectiveness of the proposed adaptive power allocation (APA) designs under Algorithms 1 and 2 for the perfect and imperfect CSI cases.We assume without any loss of generality that σ 2 s = {σ 2  si } N i=1 = 1.In the simulations, the distortion is calculated from 10 5 realizations of the underlying source, partial observations, and observation and receiver noises according to the covariance Λ s,si,sj ,ni,wi .The simulation examples focus on the successive approximation-based power allocation design unless stated otherwise.In the figures, log(• • • ) denotes the logarithm with base 10.

Spatial Correlation.
In order to show the efficacy of our design, we compare its performance with a uniform power allocation-based design.In the figures, the designs are, respectively, denoted as APA (Adaptive Power Allocation) and UPA (Uniform Power Allocation).Moreover, {P i } N i=1 = P u = P tot /N for the UPA design.
We consider two sensor networks, respectively, comprising N = 3 and N = 500 sensors which are uniformly distributed in a 100 × 100 grid with the source s at its center.Figure 2 plots the distortion achieved by the SAbased APA design and compares it with the UPA design.The distortion is averaged over 10 4 independent realizations (drawn from a uniform distribution) of the sensors deployment.The figure shows that our proposed design outperforms the UPA scheme and the achieved distortion monotonically approaches the lower-bound distortion value D 0 with increasing P tot .For given θ 1 , we can observe that the distortion decreases with increasing the number of sensors and the performance gap between the APA and UPA designs also increases.Moreover, we can see that increasing the value of θ 1 decreases the distortion.This is because, for given deployment, the spatial correlation of the sensors with the source (and with each other) improves with increasing θ 1 [c.f. ( 6)].Note that at low value of θ 1 , the distortion is high and the performance gap between the APA and UPA designs is very small.However, as the value of θ 1 increases the distortion decreases and the performance gap between the APA and UPA increases.However, we can see that with increasing the value of θ 1 further the performance gap starts decreasing.This is because for very small value of θ 1 the sensors have very low correlation with the source and for very large value of θ 1 the correlation is high for all sensors, and in these extreme cases the UPA scheme is as good as the APA scheme.
Next, for the sake of illustration, we focus on the network with three sensors, that is, N = 3, and we consider the following two examples:  The simulations in the sequel are based on these two examples.We have taken these examples for purely illustrative purpose and they in no way limit the generality of our designs.For zero-observation noise variances, the spatial correlation lower bounds the reconstruction distortion at D 0 (0.4038, 0.1796), where first value is for Ex1 and the second value is for Ex2.The distortion in the subsequent simulation examples cannot be below this value no matter how high the transmit power becomes.Figure 3 shows that the proposed design gives reconstruction distortion which is less than that achieved by the uniform power allocation.This superior performance originates from the reason that the proposed design assigns all or more power to the sensor(s) with better correlation properties.This is contrary to the UPA scheme which gives equal importance to all sensors regardless of the correlation structure and thereby wasting power.For both examples, note that the achieved distortion decreases monotonically with increasing P tot but is never less than the lower-bound value D 0 (0.4086, 0.1875).
For Ex1 and Ex2, Figure 4 shows that with different initial values of {P k } N k=1 , Algorithm 2 converges to the same distortion value (0.5817 for Ex1 and 0.2646 for Ex2).Moreover, at the convergence, the power distribution among the sensors is 10 log (P 1 , P 2 , P 3 ) Ex1 = (13.8913,0, 8.5279) and 10 log (P 1 , P 2 , P 3 ) Ex2 = (19.8403,0, 5.5743), respectively, for Ex1 and Ex2 in all cases irrespective of the initialization point of the algorithm.Note that in one iteration the distortion reaches fairly close to the minimum value.Nevertheless, after the second or third iteration there is virtually no appreciable change in these values.This observation extends to all simulation examples presented herein.
Figure 5 compares performance of the proposed APA designs under Algorithms 1 and 2 which shows that the distortion curves produced by the two algorithms are extremely close and the performance gap is negligible.This is quite remarkable result especially when viewed in combination with the simplicity and computational efficiency of Algorithm 2 compared to Algorithm 1.
The simulation examples in the sequel only treat the APA design based on the successive approximation (SA) without including comparison with the APA design under Algorithm 1 and the UPA scheme.Nevertheless, in all the instances, the SA-based design closely achieves the performance of the design in Algorithm 1 and outperforms the UPA scheme except in a symmetric case where both designs (APA and UPA) converge.In the symmetric case, the correlations, variances of the observation and the receiver noises, and the channel gains are same across all sensors.Figure 6 shows that for given P tot the distortion decreases with increase in the so-called channel SNR and vice versa.Note that in each case the achieved distortion monotonically approaches the lower-bound value (D 0 = 0.4086) with increasing P tot .Figure 6(b) shows how the total power P tot is distributed among the sensors in C w5 and C w6 .The figure shows that in C w5 , the sensors with better correlation properties are given more power, which is due to the fact that, in this case, the system is symmetric with respect to all other system parameters.However, the case C w6 is different where the channel SNRs are not same across the sensors.For this case the figure shows that the power allocation policy follows sensor selection and waterfilling with respect to the SNR until the next sensor is turned on, after which point the power allocation is according to the channel inversion.These examples show that the power allocation policy is jointly determined by the spatial correlation values, variance of the observation noises, and the channel SNRs.n1 , σ 2 n2 , σ 2 n3 } = {1, 0.01, 0.1} and D 0 = 0.6160.Figure 7 shows that increasing the variances of the observation noise at the sensors, increases the reconstruction distortion.Moreover, in each case, the achieved distortion D decreases monotonically with the transmit power P tot and approaches D 0 at high transmit power.8 illustrate the achieved distortion when we have (i) perfect knowledge of the CSI, that is, 0 and (ii) the estimates of the channels with estimation error variances δ 2 = 0.01 and δ 2 = 0.1.The distortion is averaged over 2 × 10 6 realizations of the source and the corresponding partial observations, the observation and receiver noises, and the channel gains.From the figure we can see that the channel errors increase the reconstruction distortion.Moreover, the proposed design (Rob in the figure) shows robustness to the channel estimation errors compared to the naive approach (Nav in the figure).

Conclusions
In this paper we have investigated the joint adaptive power allocation design for the sensors to transmit their observations to the FC with the goal to reconstruct the underlying source with minimum distortion subject to a constraint on the total network power.We demonstrated the effectiveness of our proposed design with a few simple examples where we vary one parameter while keeping all others the same across sensors.In practice, the correlation, observation noises, and the SNRs of the communication channels may simultaneously vary across sensors.Consequently, the power allocation policy is a nontrivial function of all these factors.Our proposed design incorporates them in (quasi)analytical expressions.We showed that the power allocation problem is convex which does not bear closed-form analytical solution.We showed that the problem can be solved by an iterative numerical procedure which is computationally expensive unless the network size is small.Then, based on the successive approximation principle, we proposed a novel power allocation design which is simple and easy to compute, and exhibits excellent convergence properties.We demonstrated that our proposed design outperforms a scheme based on the uniform power allocation except in a symmetric case where both designs converge.The performance gain may be large for a relatively large sensor network with high heterogeneity in the spatial correlation, observation quality, and the channel gains.In the proposed design, we also incorporated the case when imperfect knowledge of the channel gains is available.We demonstrated that the proposed design exhibits robustness to the channel estimation errors relative to a naive approach.The future work may consider the extension to the nonorthogonal channels from the sensors to the FC.Moreover, optimizing the performance such as reconstruction distortion or the power consumption over a priori given network life can be investigated.We only considered the power required for transmitting the observations to the FC, where as the future work may also include the power consumed in the sensing, preparing the observation for transmission and in the receiving processes.

Appendices
A. Proof of Reconstruction Distortion in (10) From ( 9

B. Proof of Theorem 1
The vector of the transmit powers is p = [P 1 , . . ., P N ] T and we know that the estimation distortion is given by In moving from (B.4) to (B.5) we have used the invariance property of Tr under cyclic permutations, that is Tr(ABC) = Tr(BCA) = Tr(CAB).[24].
In order to prove that the function D is convex over p , we derive its Hessian and then show that it is positive semidefinite.To this end, first we calculate the first-order derivative of D with respect to each component of p , that is, ∂D/∂P k for k = 1, . . ., N as follows: , where e k = (e 1 , . . ., e N ) T with e i = 1 for i = k and e i = 0 for i / = k, i = 1, . . ., N [24].Now we take the derivative of (B.6) with respect to P l for l = 1, . . ., N as follows: We know that the product of the positive semidefinite matrices is also a positive semidefinite matrix and the Hadamard product of positive semidefinite matrices is also a positive semidefinite matrix.Therefore, we can conclude that the Hessian ∂ 2 D/∂p 2 is also positive semidefinite which establish, the convexity of the function in (11) with respect to the transmit powers P i , i = 1, . . ., N.

Figure 5 :
Figure 5: Performance comparison of the APA designs.

2 wl 1 + 2 wl
+ I) −1 e k e T k C(YC + I) Tr C(YC + I) −1 e k e T k C(YC + I) −1 e l e T l C(YC + I) −C(YC+I) −1 e l e T l C(YC+I) −1 e k e T k C(YC+I) Tr e T k C(YC+I) −1 e l e T l C(YC+I) −1 C(YC+I) −1 e k +e T l C(YC+I) −1 e k e T k C(YC+I) −1 C(YC+I) −1 e l YC+ I) −1 e l e T l C(YC+I) −1 C(YC+I) −1 e k + e T l C(YC+I) −1 e k e T k C(YC+I) −1 C(YC+I) −1 e l (B.7)where we have used ∂UV/∂z = U∂V/∂z + ∂U/∂zV and Tr(ABC) = Tr(BCA).Now let R = diag (g 2 1 /σ 2 w1 , . . ., g 2 N /σ 2 wN ) T and Y = (YC + I) −1 then the Hessian of D can be written in the following form: where A•B denotes the Hadamard or Schur product of A and B. Note that R, C, C, and Y are positive semidefinite matrices. ).