Outage minimization for parallel fading channels with limited feedback
EURASIP Journal on Wireless Communications and Networking volume 2012, Article number: 352 (2012)
Abstract
We address an optimal power allocation problem for minimizing the outage probability for M parallel block-Nakagami-fading channels under a long-term average sum transmit power constraint with finite-rate feedback of channel state information (CSI). A simulation-based optimization technique, the simultaneous perturbation stochastic approximation (SPSA) algorithm, is first employed to numerically derive a locally optimal power codebook. Because SPSA has high computational complexity and a long convergence time, we make an ordering assumption on the power codebook entries, derive effective hyperplane-based approximations to the channel quantization regions, and present a number of low-complexity suboptimal quantized power codebook design algorithms. Unlike existing work on outage minimization for multiple-input multiple-output (MIMO) channels with limited feedback, we do not assume that identical transmission power is used for all channels within each channel quantization region. We also do not, in general, resort to a Gaussian approximation for the instantaneous mutual information, as is done in many existing works. Based on our power ordering assumption and hyperplane-based approximations, we show that allocating identical power to all channels within a given channel quantization region in the limited feedback scenario is asymptotically optimal only at high average power (or average signal-to-noise ratio (SNR)) for the Rayleigh fading case, whereas for the general Nakagami case, the transmit power allocation for an individual channel within each quantized region is asymptotically proportional to the corresponding Nakagami fading parameter (severity of fading). We also present a novel diversity order result for the outage probability for the Nakagami fading case.
Finally, we derive a suitable Gaussian-approximation-based low-complexity power allocation scheme for a large number of parallel channels, which has important applications in wideband slow-fading orthogonal frequency-division multiplexing (OFDM) systems. Extensive numerical results illustrate that only a few bits of feedback substantially close the gap in outage performance between the limited feedback case and the case of full instantaneous CSI at the transmitter.
Introduction
Determining the information-theoretic capacity of block-fading wireless channels has been an important area of research over the past decade. Various notions of capacity for single-user fading channels include ergodic capacity [1], delay-limited capacity [2] and capacity versus outage probability [3]. For delay-sensitive traffic such as voice and video, the latter two notions are particularly important. The outage probability is the probability that the capacity of a wireless channel falls below a required rate threshold. In [3], optimal power allocation for outage minimization in the case of parallel fading channels (single user) was obtained under the assumption of full channel state information (CSI) at the transmitter. However, full CSI at the transmitter is hard to obtain in practice due to limited bandwidth in the feedback channel from the receiver to the transmitter, and it is more common to have full CSI at the receiver. This has motivated researchers over the last decade to analyze the performance of wireless systems with various forms of partial CSI at the transmitter (CSIT), such as noisy CSIT, statistical CSIT and quantized CSIT. In particular, the idea of Grassmannian line packing was used to design optimal beamforming codebooks for MIMO systems in [4], whereas in a related study [5], the authors derived a lower bound on the outage performance of a multiple-antenna system using beamforming based on quantized CSIT. More recently, in [6], maximization of the expected rate over a single-input single-output slowly fading channel was investigated using optimized discrete rate and power control with quantized CSIT. A general framework for power allocation in Gaussian vector channels with l_{ p } norm constraints on the eigenvalues of the MIMO channel matrix was investigated in [7]. The authors of [6] have also investigated the diversity-multiplexing tradeoff in MIMO channels with quantized CSIT in [8] (see also [9]).
A number of recent articles have investigated outage minimization for fading channels with limited feedback for MIMO or multi-antenna systems. Such studies include [9–13]. In particular, [12] looks at outage minimization with a finite-rate power codebook for MIMO systems. The key finding of that article (see also [8]) is that the optimal power codebook has a circular structure, in that the same transmit power is allocated to the outage region and the best channel region. In order to design the optimal power codebook, however, it assumes that the same transmit power (as a function of the entire channel matrix) is used on all transmit antennas. This allows the authors of [12] to reduce the finite-rate power codebook design problem to an equivalent scalar quantization problem. Even then, finding the cumulative distribution function of the equivalent scalar random variable requires computing multidimensional probability integrals, which is computationally complex. Furthermore, the optimal power codebook entries are found via generic gradient search techniques, which can take an unreasonably long time to converge. Using a similar setting, the same authors have investigated the outage diversity behavior for multiple-antenna systems with quantized CSIT in [13] (see also [10]). In [11], the problem of outage minimization using quantized CSIT is investigated for the fading relay channel, and [14] also studied the outage minimization problem for cooperative amplify-and-forward systems. In [9], a Gaussian approximation is used to capture the probability distribution of the mutual information for a MIMO system in order to study the outage behavior. Finally, many of the above results only apply to Rayleigh fading channels (where the MIMO channel matrix is assumed to have complex circularly symmetric Gaussian distributed entries).
Note, however, that the circular nature of the optimal power codebook and some of the useful approximations developed in [10] for an asymptotically large number of channel feedback bits are also relevant for our study, and we duly acknowledge this fact. Our focus, however, is on designing practical low-complexity but suboptimal algorithms for the quantized power codebook, and on deriving theoretical properties of these power allocation schemes in order to justify the various approximations used in designing the suboptimal schemes.
In this article, we look at an M-parallel fading channel system as introduced in [15], where one codeword spans M subchannels in one fading block and each block undergoes the same CSI, and we aim to minimize the outage probability under a sum (across all channels) long-term average power constraint with quantized CSIT. Technologically, parallel fading channels constitute a useful and fundamental communication framework for various applications, for example, multiple-antenna systems after singular value decomposition or an OFDM system with frequency-selective fading [15]. Due to the unavailability of full CSIT in our framework, our model is better suited to the case of multicarrier OFDM systems, with M parallel subchannels located at non-adjacent carrier frequencies. The concept of parallel channels also extends to multiple transmission time slots [16] and to diversity available through cooperative communications such as multiple relays. Our results in this article are applicable to all these scenarios.
Our main contributions can be summarized as follows:

We first formulate the above-described optimization problem and provide a simulation-based iterative optimization algorithm, the simultaneous perturbation stochastic approximation (SPSA) algorithm, to numerically solve for the joint optimization of locally optimal channel partitions and quantized power allocation.

Based on a power ordering assumption and a hyperplane-based approximation to the basic rate-achieving mutual information curve in the vector channel space, we derive a number of low-complexity suboptimal finite-rate power codebook design algorithms for outage minimization with quantized channel information, without assuming identical transmission power per channel or, in general, using a Gaussian approximation for the instantaneous mutual information.

We show that in the high average power (or average SNR) regime, it is asymptotically optimal to allocate transmit power proportional to the Nakagami m fading parameter in the individual channels within each quantized region. In the Rayleigh fading case, this corresponds to allocating the same power across all channels within each quantized region (but only in the high average power regime).

We also derive a novel diversity order result for the outage probability in the Nakagami fading case.

Finally, we investigate the suitability of a Gaussian approximation scheme for the instantaneous mutual information in the case of a large number of independent (but not identically distributed) parallel channels, which is applicable to a slow fading broadband frequency selective channel or to a flat fast fading channel[17, 18]. Note that as we will show later, although the Gaussian approximation is seen to perform poorly for a small number of parallel channels, it performs efficiently for a large number of channels (e.g., M ≥ 16), thus having important practical applications to such broadband multicarrier systems.
The organization of the article is as follows. Section ‘Channel model and outage minimization’ presents the fading channel model and the typical outage problem based on full CSIT. Section ‘Optimum quantized power control with finite-rate feedback’ presents the outage minimization problem with quantized CSIT, followed by the modified problem formulation using the power ordering and hyperplane-based approximation. Various suboptimal algorithms are then presented for finding the power codebook in the high average power regime, along with their associated theoretical properties. A new result on the diversity order for the outage probability is then presented for the Nakagami fading case using our power allocation algorithm based on the power ordering and hyperplane-based approximation. Section ‘Large number of channels analysis’ presents a Gaussian-approximation-based suboptimal algorithm applicable to the case of a large number of independent parallel channels. Section ‘Numerical results’ presents an extensive set of numerical results illustrating the efficiency of our algorithms, measured by how close their outage performance is to that of the optimal power allocation solution based on full CSIT. Finally, Section ‘Conclusions’ presents some concluding remarks and ideas for future extensions of this study.
Channel model and outage minimization
We consider an M-parallel flat-fading channel model, where a transmitted codeword spans M subchannels in one fading block. For each fading block, the received signal y_{ i } of each subchannel i, i ∈ {1,2,…,M}, can be represented as^{a}:
y_{i}=\sqrt{h_{i}}\,x_{i}+w_{i},\quad i=1,\dots ,M, (1)
where h_{ i } is the channel power gain and x_{ i } is the channel input symbol. The noise sequences w_{1},…,w_{ M } are independent and identically distributed (i.i.d.) Gaussian random variables with zero mean and unit variance. It is assumed that the components of the channel power gain vector h = (h_{1},…,h_{ M }) are mutually independent, individually i.i.d. across fading blocks and ergodic, and that fading is sufficiently slow so that the input symbols transmitted over the same fading block experience the same channel state. It is also assumed that the fading block length N → ∞ so that information-theoretic results can be applied. The individual fading distributions need not be identical. However, they (and hence the joint channel fading distribution) are assumed to be continuous.
Given a channel realization h, and assuming the availability of full channel state information (CSI) at the transmitter and receiver, denote the corresponding power allocation to the M subchannels by the vector p(h) = (p_{1}(h),…,p_{ M }(h)). Then the maximum mutual information of an M-parallel channel is given by [19],
r(\mathbf{h},\mathbf{p}(\mathbf{h}))=\frac{1}{M}\sum _{i=1}^{M}\frac{1}{2}\log \left(1+h_{i}p_{i}(\mathbf{h})\right), (2)
where the rate unit is nats per real dimension. Note that in (2), as in [15], the capacity is averaged over the parallel channels.
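Thus, the outage probability, defined as the probability that the instantaneous mutual information of the channel is less than a prespecified transmission rate r_{0} (nats/channel use), can be expressed as
P_{\text{out}}=\text{Pr}\left(\frac{1}{M}\sum _{i=1}^{M}\frac{1}{2}\log \left(1+h_{i}p_{i}(\mathbf{h})\right)<r_{0}\right). (3)
Under a long-term average power constraint defined by E[<p(h)>] ≤ P_{av} (where <x> denotes the arithmetic mean of the vector x with length M, namely, \langle \mathbf{x}\rangle =\frac{1}{M}\sum _{i=1}^{M}x_{i}), the outage minimization problem can be described as
\min_{\mathbf{p}(\mathbf{h})\ge 0}\ \text{Pr}\left(\frac{1}{M}\sum _{i=1}^{M}\frac{1}{2}\log \left(1+h_{i}p_{i}(\mathbf{h})\right)<r_{0}\right)\quad \text{s.t.}\quad E[\langle \mathbf{p}(\mathbf{h})\rangle ]\le P_{\text{av}}. (4)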
The optimal power allocation with full CSI at the transmitter for this problem can be found explicitly by using convex optimization techniques and was presented in Proposition 4 of [3]. Readers are referred to [3] for further details. Note that here P_{av} can effectively be thought of as the transmitter-side signal-to-noise ratio (since the noise variance has been normalized to unity). In the following, we address the optimal power allocation problem for outage minimization where only partial or limited CSI is available at the transmitter. For the purpose of analysis, we assume that each channel gain h_{ i } is gamma distributed (Nakagami fading) with mean \frac{1}{{\lambda}_{i}}, whose probability density function (pdf) is given by
f(h_{i})=\frac{{({m}_{i}{\lambda}_{i})}^{{m}_{i}}{h}_{i}^{{m}_{i}-1}}{\mathrm{\Gamma}({m}_{i})}{e}^{-{m}_{i}{\lambda}_{i}{h}_{i}},\quad h_{i}\ge 0, (5)
where Γ(.) is the gamma function (\mathrm{\Gamma}(s)={\int}_{0}^{\infty}{t}^{s-1}{e}^{-t}\,dt) and the constant m_{ i } ≥ 0.5 is called the fading parameter. Larger values of the fading parameter m_{ i } imply less severe fading environments. When m_{ i } = 1, the above distribution reduces to an exponential distribution (corresponding to Rayleigh fading), and the non-fading case corresponds to m_{ i } = ∞.
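As an illustration of the channel model and the outage definition, the following minimal Python sketch draws gamma-distributed power gains with shape m_{ i } and mean 1/λ_{ i }, and estimates the outage probability by Monte Carlo simulation. It assumes a fixed power vector that does not depend on h (a simplification of our own, not the optimal allocation of [3]); the function names and the trial count are illustrative choices.

```python
import math
import random

def sample_gains(m, lam, rng):
    """Draw one gain vector h with h_i ~ Gamma(shape=m_i, scale=1/(m_i*lam_i)),
    so that E[h_i] = 1/lam_i (Nakagami fading in the power domain)."""
    return [rng.gammavariate(mi, 1.0 / (mi * li)) for mi, li in zip(m, lam)]

def mutual_information(h, p):
    """Average mutual information over the M subchannels, in nats per real dimension."""
    return sum(0.5 * math.log(1.0 + hi * pi) for hi, pi in zip(h, p)) / len(h)

def outage_probability(m, lam, p, r0, trials=20000, seed=0):
    """Monte Carlo estimate of Pr(mutual information < r0) for a FIXED power
    vector p (assumption: no adaptation to the channel state)."""
    rng = random.Random(seed)
    outages = 0
    for _ in range(trials):
        h = sample_gains(m, lam, rng)
        if mutual_information(h, p) < r0:
            outages += 1
    return outages / trials
```

For a single Rayleigh channel (M = 1, m_{1} = 1, λ_{1} = 1) with power p, the estimate can be checked against the closed form 1 − exp(−(e^{2r_{0}} − 1)/p).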
Optimum quantized power control with finiterate feedback
It is well known that perfect CSI at both transmitter and receiver is hard to achieve in a practical system, due to bandwidth constraints on the receiver-to-transmitter feedback link as well as considerable communication overhead. In this section, we consider designing a power allocation procedure for M-parallel flat-fading channels based on quantized vector CSI h = (h_{1},…,h_{ M }) (in M dimensions) acquired via a no-delay and error-free feedback link with limited rate from the receiver to the transmitter.
Optimal power allocation with limited feedback strategy
We assume that the receiver can perfectly estimate the full CSI. Given B bits of feedback, a power codebook \mathcal{P}=\{{\mathbf{p}}_{1},\dots ,{\mathbf{p}}_{L}\}, where P_{ j } = {p_{1j},…,p_{ Mj }}, j = 1,…,L, of cardinality L = 2^{B}, is designed offline purely on the basis of the statistics of h. Note that the power levels for different channels here are distinct, as opposed to [9, 12] where the same transmit power was allocated to all transmit antennas in the MIMO setting. This codebook is known a priori by both the transmitter and the receiver. Given a channel realization h,

First, the receiver applies a deterministic mapping, denoted as I, from the current instantaneous h into one of L integer indices [9], where the mapping I partitions the entire M-dimensional space of h into L regions {\mathcal{R}}_{1},{\mathcal{R}}_{2},\dots ,{\mathcal{R}}_{L}, given as I(\mathbf{h})=j\ \text{if}\ \mathbf{h}\in {\mathcal{R}}_{j},\ j=1,\dots ,L.

Second, the receiver sends the corresponding index j = I(h) to the transmitter via the feedback link.

Then, the j th entry of the power codebook \mathcal{P}, i.e., P_{ j }, will be employed by the transmitter for transmission.
Therefore the key steps involved in the limited feedback design problem constitute obtaining (offline) the jointly optimal CSI partitions and power codebook design. Our objective is to design efficient algorithms for solving this joint optimization problem of the channel partition regions and the power codebook, so as to minimize the outage probability while satisfying a long term average power constraint.
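Let \text{Pr}({\mathcal{R}}_{j}) and E[\bullet \mid {\mathcal{R}}_{j}] denote \text{Pr}(\mathbf{h}\in {\mathcal{R}}_{j}) (the probability that h falls in the region {\mathcal{R}}_{j}) and E[\bullet \mid \mathbf{h}\in {\mathcal{R}}_{j}], respectively. Define the indicator function x_{ j }, j = 1,…,L as
x_{j}=\begin{cases}1,& \frac{1}{M}\sum _{i=1}^{M}\frac{1}{2}\log (1+h_{i}p_{ij})<r_{0},\\ 0,& \text{otherwise}.\end{cases} (6)
Then the outage minimization problem (4) with limited feedback can be formulated as
\min_{{\mathbf{p}}_{j}\ge 0,\ {\mathcal{R}}_{j},\ j=1,\dots ,L}\ \sum _{j=1}^{L}E[x_{j}\mid {\mathcal{R}}_{j}]\,\text{Pr}({\mathcal{R}}_{j})\quad \text{s.t.}\quad \sum _{j=1}^{L}E[{{\mathbf{p}}_{j}}^{\mathrm{\Sigma}}\mid {\mathcal{R}}_{j}]\,\text{Pr}({\mathcal{R}}_{j})\le P_{\text{av}}, (7)
where {{\mathbf{p}}_{j}}^{\mathrm{\Sigma}}=\frac{1}{M}\sum _{i=1}^{M}p_{ij}, i.e., the average of all the elements in the vector P_{ j }. It can be easily verified that the optimal solution of the above problem satisfies the long-term average power constraint with equality.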
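The dual problem of (7) is expressed as
\max_{\lambda \ge 0}\ g(\lambda ), (8)
where λ is the nonnegative Lagrange multiplier associated with the long-term average power constraint in Problem (7), and the Lagrange dual function g(λ) is defined as
g(\lambda )=\min_{{\mathbf{p}}_{j}\ge 0,\ {\mathcal{R}}_{j},\ j=1,\dots ,L}\ \sum _{j=1}^{L}E[x_{j}+\lambda {{\mathbf{p}}_{j}}^{\mathrm{\Sigma}}\mid {\mathcal{R}}_{j}]\,\text{Pr}({\mathcal{R}}_{j})-\lambda P_{\text{av}}. (9)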
With a fixed λ, we can employ an iterative simulation-based optimization algorithm called the simultaneous perturbation stochastic approximation (SPSA) algorithm to find the optimal power codebook of problem (9). A step-by-step guide to implementing SPSA can be found in [20]; applied to our problem, it can be summarized in the following steps.
Step 1 Initialization and coefficient selection: Set the counter index k = 0. Pick an initial guess of the power codebook {\widehat{\mathcal{P}}}_{0} and nonnegative coefficients a, c, A, α and γ in the SPSA gain sequences a_{k}=\frac{a}{{(A+k+1)}^{\alpha}} and c_{k}=\frac{c}{{(k+1)}^{\gamma}}. For guidelines on choosing these coefficients, see [20].
Step 2 Generation of simultaneous perturbation vector: Generate a p-dimensional (p = ML) random perturbation vector Δ_{ k }, where the components of Δ_{ k } are i.i.d. Bernoulli ±1 distributed with probability \frac{1}{2} for each outcome.
Step 3 Loss function evaluations: Obtain two measurements of the loss function \mathcal{L}(\cdot ) based on the simultaneous perturbation around the current power codebook {\widehat{\mathcal{P}}}_{k}: \mathcal{L}({\widehat{\mathcal{P}}}_{k}+c_{k}{\mathrm{\Delta}}_{k}) and \mathcal{L}({\widehat{\mathcal{P}}}_{k}-c_{k}{\mathrm{\Delta}}_{k}), with c_{ k } and Δ_{ k } from Steps 1 and 2.
Step 4 Gradient approximation: Generate the simultaneous perturbation approximation to the unknown gradient, {\hat{g}}_{k}({\widehat{\mathcal{P}}}_{k}), given as
{\hat{g}}_{k}({\widehat{\mathcal{P}}}_{k})=\frac{\mathcal{L}({\widehat{\mathcal{P}}}_{k}+c_{k}{\mathrm{\Delta}}_{k})-\mathcal{L}({\widehat{\mathcal{P}}}_{k}-c_{k}{\mathrm{\Delta}}_{k})}{2c_{k}}{\left[{\mathrm{\Delta}}_{k1}^{-1},{\mathrm{\Delta}}_{k2}^{-1},\dots ,{\mathrm{\Delta}}_{kp}^{-1}\right]}^{T},
where Δ_{ ki } is the i th component of the Δ_{ k } vector.
Step 5 Updating power codebook: Use the recursion
{\widehat{\mathcal{P}}}_{k+1}={\widehat{\mathcal{P}}}_{k}-a_{k}{\hat{g}}_{k}({\widehat{\mathcal{P}}}_{k})
to update {\widehat{\mathcal{P}}}_{k} to a new value {\widehat{\mathcal{P}}}_{k+1}.
Step 6 Iteration or termination: Return to Step 2 with k + 1 replacing k. Terminate the algorithm if there is little change over several successive iterations or the maximum allowable number of iterations has been reached.
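The six steps above can be sketched in a few lines of Python. This is a generic SPSA loop under stated assumptions, not the implementation used for the results in this article: the loss is any user-supplied function of a flattened parameter vector (here a toy quadratic stands in for the Monte Carlo objective of problem (9)), termination uses a fixed iteration count only, and the default gain exponents α = 0.602 and γ = 0.101 are commonly cited practical values rather than values taken from this article.

```python
import random

def spsa_minimize(loss, theta0, a=0.2, c=0.1, A=10.0, alpha=0.602, gamma=0.101,
                  iterations=500, seed=0):
    """Generic SPSA loop following Steps 1-6; theta is the flattened codebook."""
    rng = random.Random(seed)
    theta = list(theta0)
    for k in range(iterations):
        # Step 1: gain sequences a_k and c_k.
        a_k = a / (A + k + 1) ** alpha
        c_k = c / (k + 1) ** gamma
        # Step 2: i.i.d. Bernoulli +/-1 perturbation vector.
        delta = [rng.choice((-1.0, 1.0)) for _ in theta]
        # Step 3: two loss measurements around the current iterate.
        plus = [t + c_k * d for t, d in zip(theta, delta)]
        minus = [t - c_k * d for t, d in zip(theta, delta)]
        diff = loss(plus) - loss(minus)
        # Step 4: simultaneous perturbation gradient estimate.
        grad = [diff / (2.0 * c_k * d) for d in delta]
        # Step 5: update the iterate.
        theta = [t - a_k * g for t, g in zip(theta, grad)]
    # Step 6: here termination is simply the iteration budget.
    return theta
```

Note that only two loss evaluations are needed per iteration regardless of the dimension p = ML, which is the main appeal of SPSA when each loss evaluation is itself a simulation.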
Note that Step 3 of SPSA, which involves calculating a loss function for a given power codebook, uses the objective function of problem (9) as the loss function. Given a power codebook, we use the nearest neighbor condition of a generalized Lloyd algorithm with the Lagrangian distortion d(\mathbf{h},{\mathbf{p}}_{j})=x_{j}+\lambda {{\mathbf{p}}_{j}}^{\mathrm{\Sigma}} to generate the optimal partition regions [21], given as, for j = 1,…,L,
{\mathcal{R}}_{j}=\left\{\mathbf{h}:\ x_{j}+\lambda {{\mathbf{p}}_{j}}^{\mathrm{\Sigma}}\le x_{l}+\lambda {{\mathbf{p}}_{l}}^{\mathrm{\Sigma}},\ \forall l\ne j\right\}. (10)
Therefore, with a given power codebook and the resulting quantization regions, we can numerically calculate the loss function. We repeatedly apply Step 2 to Step 5 of SPSA until the resulting outage probability converges within a prespecified accuracy (Step 6 of SPSA). After that, we solve the dual problem for the optimal λ by using a subgradient-based search method, i.e., updating λ until convergence using {\lambda}^{l+1}={\left[{\lambda}^{l}-{\alpha}^{l}\left(P_{\text{av}}-\sum _{j=1}^{L}E\left[{{\mathbf{p}}_{j}}^{\mathrm{\Sigma}}\mid {\mathcal{R}}_{j}\right]\text{Pr}({\mathcal{R}}_{j})\right)\right]}^{+}, where l is the iteration number and α^{l} is a positive scalar step size for the l th iteration satisfying \sum _{l}{\alpha}^{l}=\infty and \sum _{l}{({\alpha}^{l})}^{2}<\infty. Since problem (7) is, in general, not convex, the solution we obtain here is only locally optimal.
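The nearest neighbor rule and the multiplier update can be sketched as follows. This is a minimal Python illustration under our own naming: `assign_region` returns a 0-based codebook index (the article's regions are 1-based), ties are broken toward the lowest index, and `dual_step` takes the current average power as an input that would in practice be estimated from training samples.

```python
import math

def in_outage(h, p, r0):
    """Indicator x_j: 1.0 if the mutual information under power vector p is below r0."""
    mi = sum(0.5 * math.log(1.0 + hi * pi) for hi, pi in zip(h, p)) / len(h)
    return 1.0 if mi < r0 else 0.0

def assign_region(h, codebook, lam, r0):
    """Nearest neighbor condition: pick the entry minimizing the Lagrangian
    distortion x_j + lam * mean(p_j). Returns a 0-based index."""
    costs = [in_outage(h, p, r0) + lam * sum(p) / len(p) for p in codebook]
    return min(range(len(codebook)), key=costs.__getitem__)

def dual_step(lam, step, p_av, avg_power):
    """Projected subgradient update of the multiplier; avg_power is the
    average transmit power measured under the current codebook and regions."""
    return max(0.0, lam - step * (p_av - avg_power))
```

With a codebook such as [[4, 4], [1, 1], [0, 0]], good channels map to the low-power entry, weaker channels to the high-power entry, and channels for which even the largest power cannot avoid outage map to the zero-power entry, since spending power there only adds to the distortion.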
Figure 1 gives an example of the structure of a locally optimal channel quantization region obtained by using SPSA with a given power codebook for M = 2 channels with 2 bits of feedback (i.e., four quantization regions, where each color stands for a particular region). A sufficiently large number of training samples (channel realizations for h_{1}, h_{2}) drawn from empirical distributions is used to generate the optimal partition regions according to (10). From Figure 1, we can see that, in general, it is difficult to compute the surface area (or volumes in higher-dimensional space) of these regions, which have irregular shapes. Although we can use SPSA to numerically obtain a locally optimal power codebook and partition regions, it takes a very long time to converge and is computationally highly complex, especially when the number of feedback bits or the number of channels is large. In the following few sections, we therefore focus on designing suboptimal algorithms by introducing appropriate assumptions and approximations to the quantized regions and power codebook.
Power ordering assumption and hyperplane approximation (POHPA)
Let P(h) represent the optimal power allocation strategy, which maps the channel realization h to a power level in \mathcal{P}. Without loss of generality, we assume that the power levels are such that {{\mathbf{p}}_{1}}^{\mathrm{\Sigma}}>\cdots >{{\mathbf{p}}_{L}}^{\mathrm{\Sigma}}, corresponding to the partition {\mathcal{R}}_{1},{\mathcal{R}}_{2},\dots ,{\mathcal{R}}_{L}. We then have the following result, which generalizes the circular nature of the quantized channel regions presented in [9, 12] for a scalar power allocation scenario to the parallel channels case with a vector power allocation.
Lemma 1
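Let P^{∗}(h) denote the minimum power level required to have no outage, i.e., \frac{1}{M}\sum _{i=1}^{M}\frac{1}{2}\log \left(1+h_{i}p_{i}^{\ast}(\mathbf{h})\right)=r_{0}. The optimal solution satisfies:
\mathbf{P}(\mathbf{h})=\begin{cases}{\mathbf{p}}_{j},& \text{if}\ {{\mathbf{p}}_{j+1}}^{\mathrm{\Sigma}}<{({\mathbf{p}}^{\ast}(\mathbf{h}))}^{\mathrm{\Sigma}}\le {{\mathbf{p}}_{j}}^{\mathrm{\Sigma}},\ j=1,\dots ,L-1,\\ {\mathbf{p}}_{L},& \text{otherwise}.\end{cases} (11)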
Proof
The proof is similar to that in [9]. However, since it generalizes the result for a scalar power allocation in [9] to the vector power allocation case of this article, we provide a sketch of the proof (see Appendix 1). □
If the same transmit power is allocated to all channels, i.e., p_{1j}=\cdots =p_{Mj}={{\mathbf{p}}_{j}}^{\mathrm{\Sigma}}, the result of Lemma 1 reduces to the case of [9, 12]. From Lemma 1, we also have that there is no outage in the first L−1 regions and outage only occurs in the last region {\mathcal{R}}_{L}. In the optimal partition, a channel realization h = {h_{1},…,h_{ M }} either belongs to the region {\mathcal{R}}_{j}, where j ∈ {1,…,L} is the maximum index that can guarantee zero outage for it, or belongs to {\mathcal{R}}_{L}. The region {\mathcal{R}}_{L} consists of two parts: \{\mathbf{h}\mid {({\mathbf{p}}^{\ast}(\mathbf{h}))}^{\mathrm{\Sigma}}>{{\mathbf{p}}_{1}}^{\mathrm{\Sigma}}\} (outage) and \{\mathbf{h}\mid {({\mathbf{p}}^{\ast}(\mathbf{h}))}^{\mathrm{\Sigma}}\le {{\mathbf{p}}_{L}}^{\mathrm{\Sigma}}\}, denoted as {\mathcal{R}}_{L,1} and {\mathcal{R}}_{L,2}, respectively.
From Lemma 1, the boundary between {\mathcal{R}}_{j-1} and {\mathcal{R}}_{j}, j=2,\dots ,L-1, is a hypersurface denoted as g(h_{1},…,h_{M−1},P_{ j }), which is obtained by solving for h_{ M } from the equation r(\mathbf{h},{\mathbf{p}}_{j})=\frac{1}{M}\sum _{i=1}^{M}\frac{1}{2}\log (1+h_{i}p_{ij})=r_{0}, namely,
h_{M}=g(h_{1},\dots ,h_{M-1},{\mathbf{p}}_{j})=\frac{1}{p_{Mj}}\left[\frac{k+1}{\prod _{i=1}^{M-1}(1+h_{i}p_{ij})}-1\right], (12)
where k={e}^{2Mr_{0}}-1. The boundaries between {\mathcal{R}}_{L} and {\mathcal{R}}_{1}, {\mathcal{R}}_{L-1} are given by g(h_{1},…,h_{M−1},P_{1}) and g(h_{1},…,h_{M−1},P_{ L }), respectively. Let {r_{i 1},…,r_{ iL }} represent the quantization thresholds on the h_{ i } axes (i = 1,…,M). From (12), it can be easily verified that r_{ij}=\frac{k}{p_{ij}}, i\in \{1,2,\dots ,M\}, j\in \{1,2,\dots ,L\}. Therefore, if we assume that the power levels in the power codebook are in descending order, i.e., P_{1}>⋯>P_{ L }, which means p_{i 1}>⋯>p_{ iL }, i = 1,…,M, and also implies {{\mathbf{p}}_{1}}^{\mathrm{\Sigma}}>\cdots >{{\mathbf{p}}_{L}}^{\mathrm{\Sigma}}, we obtain r_{i 1}<⋯<r_{ iL }, i = 1,…,M, which gives a simple partition structure allowing easy numerical computation of the surface area (or volumes in higher dimensions) of the quantized regions. We call this the power ordering (PO) assumption. Figure 2 gives an example of the optimal quantization structure with the PO assumption for the M = 2 channels case. With this PO assumption, the area below the hypersurface g(h_{1},…,h_{M−1},P_{1}) defines the outage region {\mathcal{R}}_{L,1}.
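The boundary hypersurface and the axis thresholds follow directly from setting the product of the terms (1 + h_{ i }p_{ ij }) equal to e^{2Mr_{0}}. The short Python sketch below, with function names of our own choosing, solves for h_{ M } given the other gains and computes the thresholds k/p_{ ij }; the returned boundary value can be negative, which simply means that the first M−1 gains already guarantee the target rate.

```python
import math

def boundary_h_last(h_rest, p, r0):
    """Solve r(h, p) = r0 for h_M given h_1..h_{M-1} and the power vector p.
    A negative result means no outage for any h_M >= 0."""
    M = len(p)
    k = math.exp(2.0 * M * r0) - 1.0
    prod = 1.0
    for hi, pi in zip(h_rest, p[:-1]):
        prod *= 1.0 + hi * pi
    return ((k + 1.0) / prod - 1.0) / p[-1]

def axis_thresholds(p, r0):
    """Intercepts of the boundary with each h_i axis: r_i = k / p_i."""
    k = math.exp(2.0 * len(p) * r0) - 1.0
    return [k / pi for pi in p]
```

Because each threshold is inversely proportional to the corresponding power, a descending ordering of the powers across codebook entries yields an ascending ordering of the thresholds on every axis.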
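Denoting F(P_{ j }) as the probability that the channel state (h_{1},…,h_{ M }) lies below g(h_{1},…,h_{M−1},P_{ j }), j = 1,…,L, we have
F({\mathbf{p}}_{j})=\int \cdots \int _{\{\mathbf{h}\ge 0:\ h_{M}<g(h_{1},\dots ,h_{M-1},{\mathbf{p}}_{j})\}}f(h_{1},\dots ,h_{M})\,dh_{1}\cdots dh_{M}, (13)
where f(h_{1},…,h_{ M }) is the pdf of the channel vector, given by f(h_{1},\dots ,h_{M})=\prod _{i=1}^{M}{({m}_{i}{\lambda}_{i})}^{{m}_{i}}\frac{{h}_{i}^{{m}_{i}-1}}{\mathrm{\Gamma}({m}_{i})}{e}^{-{m}_{i}{\lambda}_{i}{h}_{i}} for Nakagami fading (due to independence among the parallel channels). Thus, the probability that the channel realization \mathbf{h}\in {\mathcal{R}}_{j} is F(P_{j + 1})−F(P_{ j }) for j = 1,…,L − 1, and 1−F(P_{ L }) + F(P_{1}) for j = L. The outage minimization problem with limited feedback (7) can thus be simplified as
\min_{{\mathbf{p}}_{1}\ge \cdots \ge {\mathbf{p}}_{L}\ge 0}\ F({\mathbf{p}}_{1})\quad \text{s.t.}\quad \sum _{j=1}^{L-1}{{\mathbf{p}}_{j}}^{\mathrm{\Sigma}}\left[F({\mathbf{p}}_{j+1})-F({\mathbf{p}}_{j})\right]+{{\mathbf{p}}_{L}}^{\mathrm{\Sigma}}\left[1-F({\mathbf{p}}_{L})+F({\mathbf{p}}_{1})\right]\le P_{\text{av}}. (14)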
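To make the bookkeeping of the region probabilities concrete, the following Python sketch computes Pr(R_{ j }) and the resulting long-term average power from a list of values F(P_{1}),…,F(P_{ L }). The values of F are treated as given inputs (how they are computed is a separate matter), the list is 0-based, and the function names are ours.

```python
def region_probabilities(F):
    """Given F[j] = F(p_{j+1}) for j = 0..L-1 (nondecreasing under the power
    ordering assumption), return [Pr(R_1), ..., Pr(R_L)]."""
    L = len(F)
    probs = [F[j + 1] - F[j] for j in range(L - 1)]
    # Last region: outage part F(p_1) plus best-channel part 1 - F(p_L).
    probs.append(1.0 - F[L - 1] + F[0])
    return probs

def average_power(mean_powers, F):
    """Long-term average power: sum over regions of (mean power of p_j) * Pr(R_j)."""
    return sum(q * pr for q, pr in zip(mean_powers, region_probabilities(F)))
```

For example, with F = [0.1, 0.3, 0.6] the region probabilities are 0.2, 0.3 and 0.5, and the outage probability is the first entry, 0.1.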
Problem (14) is in general a nonlinear nonconvex optimization problem. Since g(h_{1},…,h_{M−1},P_{ j }), j = 1,…,L, is highly nonlinear, it is hard to obtain a closed-form expression for F(P_{ j }). Although one can use numerical integration to calculate F(P_{ j }) and use randomized search techniques to find the optimum solution of problem (14), the associated computational complexity increases exponentially with the number of feedback bits and channels. Next, we employ another approach by deriving an approximation for g(h_{1},…,h_{M−1},P_{ j }) such that an analytical (approximate) closed-form expression for F(P_{ j }) can be easily obtained (unlike [9], where a Gaussian distribution was used to approximate the distribution of the mutual information in order to evaluate an analytical expression for F(P_{ j })), thus significantly reducing the computational complexity of solving problem (14). Then, based on the power allocation obtained using this approximation, one can use Monte Carlo simulations to evaluate the “real outage” (the corresponding outage probability performance given by F(P_{1})). More details on this can be found in the Section ‘Numerical results’.
From (12), the projection of the hypersurface h_{ M } = g(h_{1},…,h_{M−1},P_{ j }), j = 1,…,L, on any arbitrary two-channel coordinate plane, i.e., h_{ n } versus h_{ m }, n,m ∈ {1,2,…,M}, n ≠ m, is a curve expressed as
h_{n}=\frac{1}{p_{nj}}\left[\frac{k+1}{1+h_{m}p_{mj}}-1\right]. (15)
It is easy to verify that the above curve is convex by showing that the second derivative of h_{ n } with respect to h_{ m } is strictly positive. This curve intersects the h_{ n } axis and the h_{ m } axis at the quantization thresholds r_{ nj } and r_{ mj }, respectively. We can approximate the curve (15) by a straight line {r}_{nj}^{\prime}{r}_{mj}^{\prime}, as displayed in Figure 3, which is parallel to the line r_{ nj }r_{ mj } and tangent to the curve (15) at the point ‘a’. The straight line intersects the h_{ n } axis and the h_{ m } axis at the points {r}_{nj}^{\prime} and {r}_{mj}^{\prime}, respectively. The line {r}_{nj}^{\prime}{r}_{mj}^{\prime} is expressed as
h_{n}p_{nj}+h_{m}p_{mj}=K, (16)
where K=2(\sqrt{k+1}-1), {r}_{nj}^{\prime}=\frac{K}{p_{nj}}, {r}_{mj}^{\prime}=\frac{K}{p_{mj}} (new quantization thresholds), and point ‘a’ is \left(\frac{K}{2p_{mj}},\frac{K}{2p_{nj}}\right). We call this approximation the hyperplane approximation (HPA). Note that we could also use the straight line r_{ nj }r_{ mj } for the approximation, but simulations demonstrate that the {r}_{nj}^{\prime}{r}_{mj}^{\prime} approximation is always better than r_{ nj }r_{ mj }. For clarity, Figure 4 gives an example of the HPA in three-dimensional (3D) space.
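The value of K and the coordinates of the tangent point follow from a short calculation, sketched here for completeness using the pairwise curve between channels n and m with all other gains set to zero:

```latex
% Pairwise boundary curve and its slope
h_n = \frac{1}{p_{nj}}\left[\frac{k+1}{1+h_m p_{mj}} - 1\right],
\qquad
\frac{dh_n}{dh_m} = -\frac{(k+1)\,p_{mj}}{p_{nj}\,(1+h_m p_{mj})^{2}} .
% The chord joining the intercepts k/p_{nj} and k/p_{mj} has slope -p_{mj}/p_{nj}.
% Equating the two slopes gives the tangent point:
(1+h_m p_{mj})^{2} = k+1
\;\Longrightarrow\;
h_m = \frac{\sqrt{k+1}-1}{p_{mj}} = \frac{K}{2p_{mj}},
\qquad
h_n = \frac{\sqrt{k+1}-1}{p_{nj}} = \frac{K}{2p_{nj}} .
% The line through this point with slope -p_{mj}/p_{nj} is
h_n p_{nj} + h_m p_{mj} = 2\left(\sqrt{k+1}-1\right) = K .
```

Since the curve is convex, the tangent line lies on or below it in this plane, so in the two-channel case the approximate region under the line is contained in the exact region under the curve.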
Thus, with PO and HPA (POHPA), the boundaries between {\mathcal{R}}_{L,1} and {\mathcal{R}}_{1}, {\mathcal{R}}_{1} and {\mathcal{R}}_{2}, …, {\mathcal{R}}_{L-1} and {\mathcal{R}}_{L,2} can be approximated as
h_{M}=D(h_{1},\dots ,h_{M-1},{\mathbf{p}}_{j})=\frac{1}{p_{Mj}}\left(K-\sum _{i=1}^{M-1}h_{i}p_{ij}\right),\quad j=1,\dots ,L. (17)
Any channel vector below D(h_{1},…,h_{M−1},P_{1}) is said to be in outage. Since D(h_{1},…,h_{M−1},P_{ j }) is linear, an analytical closed-form approximation for F(P_{ j }) can be obtained, which is denoted as {F}^{\prime}({\mathbf{p}}_{j}). In this case, by definition we have
{F}^{\prime}({\mathbf{p}}_{j})=\text{Pr}\left(\sum _{i=1}^{M}h_{i}p_{ij}<K\right). (18)
Here \sum _{i=1}^{M}h_{i}p_{ij} is a weighted sum of independent gamma random variables, and {F}^{\prime}({\mathbf{p}}_{j}) is the cumulative distribution function (cdf) of \sum _{i=1}^{M}h_{i}p_{ij} evaluated at K. Thus a closed-form expression for (18) can be obtained by using either of the following two equivalent results, which differ in their analytical derivations.

1)
Multiple infinite series representation: This analytical expression was derived in [22],
{F}^{\prime}({\mathbf{p}}_{j})=\frac{1}{\mathrm{\Gamma}\left(1+\sum _{i=1}^{M}{m}_{i}\right)}\left[\prod _{i=1}^{M}{\left(\frac{{m}_{i}{\lambda}_{i}K}{p_{ij}}\right)}^{{m}_{i}}\right]{\mathrm{\Phi}}_{2}^{(M)}\left({m}_{1},\dots ,{m}_{M};1+\sum _{i=1}^{M}{m}_{i};-\frac{{m}_{1}{\lambda}_{1}K}{p_{1j}},\dots ,-\frac{{m}_{M}{\lambda}_{M}K}{p_{Mj}}\right), (19)
where {\mathrm{\Phi}}_{2}^{(M)}(\dots ) is the confluent Lauricella multivariate hypergeometric function, involving multiple infinite sums [22]:
{\mathrm{\Phi}}_{2}^{(M)}({b}_{1},\dots ,{b}_{M};c;{x}_{1},\dots ,{x}_{M})=\sum _{{n}_{1},\dots ,{n}_{M}=0}^{\infty}\frac{{({b}_{1})}_{{n}_{1}}\cdots {({b}_{M})}_{{n}_{M}}}{{(c)}_{{n}_{\tau}}}\frac{{x}_{1}^{{n}_{1}}}{{n}_{1}!}\cdots \frac{{x}_{M}^{{n}_{M}}}{{n}_{M}!}, (20)
where {n}_{\tau}=\sum _{i=1}^{M}{n}_{i} and the Pochhammer symbol is defined as {(\alpha )}_{k}=\frac{\mathrm{\Gamma}(\alpha +k)}{\mathrm{\Gamma}(\alpha )} [22]. Equation (19) can be calculated numerically. However, as M becomes large, computation of the multiple infinite sum may become prohibitively expensive.

2)
Single infinite series representation: The second result provides a simpler expression for (18) involving only a single infinite sum [23], which was proposed by Moschopoulos [24].
\begin{array}{l}{F}^{\prime}\left({\mathbf{p}}_{j}\right)=\prod _{i=1}^{M}{\left(\frac{{\beta}_{1}}{{\beta}_{\mathit{\text{ij}}}}\right)}^{{m}_{i}}\sum _{n=0}^{\infty}\frac{{\delta}_{n}\gamma \left(\rho +n,\frac{K}{{\beta}_{1}}\right)}{\mathrm{\Gamma}(\rho +n)},\end{array}(21)
where \rho =\sum _{i=1}^{M}{m}_{i}, {\beta}_{ij}=\frac{{p}_{ij}}{{m}_{i}{\lambda}_{i}}, {\beta}_{1}=\min ({\beta}_{ij}), γ(·,·) is the lower incomplete gamma function (\gamma (s,x)={\int}_{0}^{x}{t}^{s-1}{e}^{-t}dt), and the coefficients δ_{n} are obtained recursively by
Special cases

If \rho =\sum _{i=1}^{M}{m}_{i} is an integer, (21) can be further simplified as [23]
\begin{array}{ll}{F}^{\prime}({\mathbf{p}}_{j})& =\prod _{i=1}^{M}{\left(\frac{{\beta}_{1}}{{\beta}_{ij}}\right)}^{{m}_{i}}\sum _{n=0}^{\infty}{\delta}_{n}\\ & \times \left\{1-{e}^{-\frac{K}{{\beta}_{1}}}\sum _{l=0}^{\rho +n-1}\frac{{\left(\frac{K}{{\beta}_{1}}\right)}^{l}}{l!}\right\}.\end{array}(23)
If M = 2, let β_{2} = max(β_{ij}) and let m_{θ} be the fading parameter corresponding to β_{2}. Then we have
\begin{array}{ll}{F}^{\prime}({\mathbf{p}}_{j})& ={\left(\frac{{\beta}_{1}}{{\beta}_{2}}\right)}^{{m}_{\theta}}\sum _{n=0}^{\infty}\frac{{({m}_{\theta})}_{n}{\left(1-\frac{{\beta}_{1}}{{\beta}_{2}}\right)}^{n}}{n!}\\ & \times \frac{\gamma \left(\rho +n,\frac{K}{{\beta}_{1}}\right)}{\Gamma (\rho +n)},\end{array}(24)
where (m_{θ})_{n} denotes the Pochhammer symbol.
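The single-series form (21) lends itself to direct numerical evaluation. The following is a minimal sketch under stated assumptions, not the authors' implementation. The function names `reg_lower_gamma` and `outage_cdf` are hypothetical. Because the recursion for δ_{n} is not reproduced in the text above, the code assumes the standard recursion from Moschopoulos [24], namely δ_{0} = 1 and δ_{n+1} = (1/(n+1)) Σ_{i=1}^{n+1} g_{i} δ_{n+1−i} with g_{i} = Σ_{k} m_{k}(1 − β_{1}/β_{kj})^{i}. The series is truncated at a fixed number of terms, which is adequate only when the β_{ij} are not too widely spread.

```python
import math

def reg_lower_gamma(s, x, terms=500):
    """Regularized lower incomplete gamma, gamma(s, x) / Gamma(s), via its power series."""
    if x <= 0.0:
        return 0.0
    total = 0.0
    for k in range(terms):
        # k-th term: x^(s+k) * e^(-x) / Gamma(s+k+1), evaluated in the log domain
        term = math.exp((s + k) * math.log(x) - x - math.lgamma(s + k + 1.0))
        total += term
        # terms decrease once k > x, so stop when they no longer contribute
        if k > x and term <= 1e-16 * total:
            break
    return total

def outage_cdf(p, m, lam, K, n_terms=200):
    """Truncated single-series evaluation of (21) for one power vector p = (p_1j, ..., p_Mj).

    m and lam hold the fading parameters m_i and the inverse mean gains lambda_i.
    The delta_n recursion is the assumed Moschopoulos form described in the lead-in.
    """
    beta = [p_i / (m_i * l_i) for p_i, m_i, l_i in zip(p, m, lam)]
    beta1 = min(beta)
    rho = sum(m)
    # g[i-1] = sum_k m_k * (1 - beta1 / beta_k)^i
    g = [sum(m_k * (1.0 - beta1 / b_k) ** i for m_k, b_k in zip(m, beta))
         for i in range(1, n_terms + 1)]
    delta = [1.0]
    for n in range(n_terms - 1):
        acc = sum(g[i - 1] * delta[n + 1 - i] for i in range(1, n + 2))
        delta.append(acc / (n + 1))
    prefactor = math.prod((beta1 / b_k) ** m_k for m_k, b_k in zip(m, beta))
    series = sum(d * reg_lower_gamma(rho + n, K / beta1) for n, d in enumerate(delta))
    return prefactor * series
```

For two Rayleigh channels (m_{1} = m_{2} = 1) the result can be checked against the elementary closed form for the sum of two independent exponential variables, which is what the test below does.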
With POHPA, Problem (14) can be approximated as
It is not hard to verify that Problem (25) is still nonconvex. However, we can employ the Karush-Kuhn-Tucker (KKT) necessary conditions to obtain locally optimal solutions.
Remark 1
Note that KKT necessary conditions usually require regularity of a local optimum, which amounts to (in the context of Problem (25)) linear independence of the gradients of the active inequality constraints evaluated at the local optimum (see Proposition 3.3.1, p. 310 in [25]). In Problem (25), if a local optimum of the power vector satisfies P_{1} > ⋯ > P_{L−1} > P_{L} > 0, then the only active inequality constraint is the average power constraint, in which case the linear independence property is trivially satisfied. In the case where the local optimum has P_{L} = 0, it can be easily shown by simple linear algebra that the gradients corresponding to the two active inequality constraints (P_{L} = 0 and the average power constraint) satisfy the linear independence condition.
Since regularity of a local optimum is thus established, one can now use KKT necessary conditions to obtain the following important result that can be used to design locally optimal quantized power codebooks:
Theorem 1
Suppose {\{{p}_{1j}^{\ast},\dots ,{p}_{Mj}^{\ast}\}}_{j=1}^{L} is an optimum of Problem (25). Then we have
where \frac{\partial {F}^{\prime}({P}_{1})}{\partial {p}_{M1}^{\ast}}=-\frac{\mu \left({F}^{\prime}({P}_{2})-{F}^{\prime}({P}_{1})\right)}{1-\mu \sum _{i=1}^{M}({p}_{i1}-{p}_{iL})}, \frac{\partial {F}^{\prime}({P}_{j})}{\partial {p}_{Mj}^{\ast}}=-\frac{{F}^{\prime}({P}_{j+1})-{F}^{\prime}({P}_{j})}{\sum _{i=1}^{M}({p}_{i,j-1}-{p}_{ij})}, j = 2,…,L−1, and \frac{\partial {F}^{\prime}({P}_{L})}{\partial {p}_{ML}^{\ast}}=-\frac{1-{F}^{\prime}({P}_{L})+{F}^{\prime}({P}_{1})}{\sum _{i=1}^{M}({p}_{i,L-1}-{p}_{iL})}.
Proof
See Appendix 2. □
Combining the above result with the average power constraint in (25), we have the following system of (ML + 1) nonlinear equations.
A solution to (27) provides a locally optimal power allocation policy {\{{\mathbf{p}}_{j}^{\ast}\}}_{j=1}^{L}. For small values of L and M, the above system of nonlinear equations can be solved with standard optimization software. However, the complexity of solving this system is still too high for moderately large numbers of feedback bits and channels. Therefore, we consider several low-complexity suboptimal schemes suited to the special cases of high or low P_{av}, as described below.
High average power approximation (HP_{av}A)
In the high average power (or average SNR) regime, the following result allows us to simplify the computation of the quantized power codebook. It also shows that, under our hyperplane-based approximations, it is in general not optimal to allocate identical power to the individual channels within each quantized region.
Theorem 2
For arbitrary M, at high average power (as P_{av} → ∞), the multiple infinite series representation (19) of {F}^{\prime}({P}_{j}), j = 1,…,L, can be further approximated as
and a locally optimum power allocation scheme for (25) satisfies the following approximate relationship:
Proof
See Appendix 3. □

Equation (29) implies that in the high P_{av} regime, for each quantization region, the power allocated to each channel asymptotically depends only on the severity of fading (represented by the parameter m).
Special cases:

Identical fading parameters: If m_{1} =⋯= m_{ M }, from (29), we have
{{p}_{1j}}^{\ast}\approx \cdots \approx {{p}_{\mathit{\text{Mj}}}}^{\ast},\phantom{\rule{1em}{0ex}}j=1,\dots ,L(30)
which means that, at high P_{av}, with identical fading parameters for all channels, the power assigned to each channel within each quantization region is asymptotically equal. We call this solution equal power per channel (EPPC).

Rayleigh fading: (m_{1} = ⋯= m_{ M }= 1), from Theorem 2, (28) reduces to
{F}^{\prime}\left({\mathbf{p}}_{j}\right)\approx \frac{1}{M!}\prod _{i=1}^{M}\left(\frac{{\lambda}_{i}K}{{p}_{\mathit{\text{ij}}}}\right)(31)
and (29) reduces to EPPC.
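To get a feel for how tight the Rayleigh approximation (31) is, it can be compared with the exact cdf of a weighted sum of two independent exponential gains. The sketch below is illustrative only: the function names are hypothetical, and the exact expression used is the elementary two-term formula for two exponentials with distinct scales, not a formula taken from the article.

```python
import math

def rayleigh_high_power_outage(p, lam, K):
    """Approximation (31): (1/M!) * prod_i (lambda_i * K / p_ij)."""
    prod = 1.0
    for p_i, l_i in zip(p, lam):
        prod *= l_i * K / p_i
    return prod / math.factorial(len(p))

def exact_two_channel_rayleigh(p, lam, K):
    """Exact cdf at K of h_1*p_1 + h_2*p_2 for independent exponential gains
    with means 1/lambda_i. Requires distinct scales b_i = p_i / lambda_i."""
    b1, b2 = p[0] / lam[0], p[1] / lam[1]
    survival = (b1 * math.exp(-K / b1) - b2 * math.exp(-K / b2)) / (b1 - b2)
    return 1.0 - survival

# High power: the approximation is within about one percent of the exact value.
approx_hi = rayleigh_high_power_outage([100.0, 200.0], [1.0, 1.0], 1.0)
exact_hi = exact_two_channel_rayleigh([100.0, 200.0], [1.0, 1.0], 1.0)

# Moderate power: the approximation noticeably overestimates the outage.
approx_lo = rayleigh_high_power_outage([1.0, 2.0], [1.0, 1.0], 1.0)
exact_lo = exact_two_channel_rayleigh([1.0, 2.0], [1.0, 1.0], 1.0)
```

The comparison suggests that (31) should be trusted only when K/p_{ij} is small for every channel, which is the regime in which Theorem 2 is stated.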
For the general case (29), without loss of generality, by letting {p}_{ij}=\frac{{m}_{i}}{{m}_{M}}{p}_{Mj}, \forall i\in \{1,2,\dots ,M-1\}, and denoting p_{Mj} as p_{j} for simplicity, the system (27) of ML + 1 equations can be reduced to a system of only L + 1 equations:
where{P}_{\text{av}}^{\prime}={P}_{\text{av}}\frac{M{m}_{M}}{\sum _{i=1}^{M}{m}_{i}}.
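The reduction to a scalar problem amounts to two small mappings: expanding a scalar level p_{j} into per-channel powers, and rescaling the average power budget. A minimal sketch follows; the helper names `pfppc_powers` and `scaled_average_power` are hypothetical and chosen only for illustration.

```python
def pfppc_powers(p_ref, m):
    """Per-channel powers p_ij = (m_i / m_M) * p_j, where p_ref plays the role of p_j = p_Mj."""
    m_last = m[-1]
    return [m_i / m_last * p_ref for m_i in m]

def scaled_average_power(p_av, m):
    """Rescaled budget P'_av = P_av * M * m_M / sum_i m_i."""
    return p_av * len(m) * m[-1] / sum(m)

# Three channels with fading parameters 0.5, 1 and 2; scalar level p_j = 4.
powers = pfppc_powers(4.0, [0.5, 1.0, 2.0])
budget = scaled_average_power(7.0, [0.5, 1.0, 2.0])
```

With identical fading parameters the mapping returns the same power on every channel, which recovers the EPPC special case.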
Thus HP_{av}A reduces the M-dimensional vector quantization problem to a one-dimensional scalar quantization problem, as illustrated in Figure 5, with corresponding quantization thresholds r_{1},…,r_{L}, where {r}_{j}={r}_{Mj}^{\prime}=K/{p}_{j}, and thereby reduces the complexity considerably. We call this suboptimal scheme the proportional to fading parameter per channel (PFPPC) scheme. For small values of L, the above nonlinear equations (where one can evaluate F^{′}(p_{j}) using the single infinite series representation (21)) can be solved with standard optimization software.
For large values of L (e.g., L ≥ 16 or B ≥ 4), one can use the so-called equal average power per region (EPPR) approximation for such a scalar quantization problem by using the mean value theorem [10]. This essentially implies that when L goes to infinity, the total average power assigned to each quantization region is asymptotically equal, and the performance using this approximation is close to optimum for a large number of feedback bits. In this case, we need to solve the following set of L equations instead of (32)
which can be carried out by an iterative algorithm employing the standard bisection search method. We call this algorithm 'PFPPC+EPPR'.
Let {P}_{\text{tot}}^{j} denote the total average power allocated to region {\mathcal{R}}_{j} in the PFPPC case. Then the average power constraint in (32) can be rewritten as
Since p_{1} > p_{2} >⋯> p_{ L },\sum _{j=1}^{L}{P}_{\text{tot}}^{j}={{P}^{\prime}}_{\text{av}}, it follows that
Thus, in the high P_{av} regime (P_{av} → ∞), {r}_{1}=\frac{K}{{p}_{1}}\to 0, and we have the following result, which indicates that the total power allocated to the outage region is asymptotically (as P_{av} → ∞) negligible, allowing us to further simplify the quantized power codebook design method:
Lemma 2
In the high P_{av} regime,\underset{{r}_{1}\to 0}{lim}{P}_{\text{tot}}^{L,1}=0, if\sum _{j=1}^{L}{\rho}^{j}>1, where\rho =\sum _{i=1}^{M}{m}_{i}.
Proof
See Appendix 4. □
Therefore another effective scheme for large L is to employ, in addition to PFPPC, what we call the zero power in outage region (ZPiOR) approximation (PFPPC+ZPiOR), by setting the power level p_{L} = 0, which results in {r}_{L}=\frac{K}{{p}_{L}}\to \infty, as shown in Figure 6. The total average power allocated to the outage region {\mathcal{R}}_{L,1} is then {P}_{\text{tot}}^{L,1}={p}_{L}^{\ast}{F}^{\prime}({p}_{1})=0.
Remark 2
Note that if ρ ≥ 1, the condition \sum _{j=1}^{L}{\rho}^{j}>1 is clearly satisfied for any L ≥ 2. For 0.5 < ρ < 1 (which is the case of no diversity with M = 1, i.e., the single channel case), one can show that there exists a finite L for which the condition \sum _{j=1}^{L}{\rho}^{j}>1 is satisfied. This is easily seen by noting that, for ρ < 1, the condition \sum _{j=1}^{L}{\rho}^{j}>1 is equivalent to ρ^{L+1} < 2ρ−1. It is interesting to note, however, that when ρ = 0.5 (which is the case of a single Nakagami channel with m = 0.5, the worst possible fading parameter), there is no finite value of L that achieves \sum _{j=1}^{L}{\rho}^{j}>1. Thus at high P_{av}, it is near optimal to allocate zero power to the outage region as long as ρ ≥ 1 with any L ≥ 2, or for a single channel with 0.5 < m < 1 and a sufficiently large L. For a single channel with m = 0.5, it seems that even at high P_{av}, one needs to allocate nonzero power to the outage region.
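The condition in Remark 2 is easy to check numerically. The following sketch (the function name `min_regions_for_zpior` is hypothetical) returns the smallest L for which \sum _{j=1}^{L}{\rho}^{j}>1 holds, or `None` when no finite L exists, which is the case for ρ ≤ 0.5 because the geometric sum is then bounded by ρ/(1−ρ) ≤ 1.

```python
def min_regions_for_zpior(rho, max_regions=10_000):
    """Smallest L with sum_{j=1}^{L} rho^j > 1, or None if no such L is found."""
    if rho <= 0.5:
        # The geometric sum never exceeds rho / (1 - rho) <= 1 in this range.
        return None
    total = 0.0
    for num_regions in range(1, max_regions + 1):
        total += rho ** num_regions
        if total > 1.0:
            return num_regions
    return None
```

For example, ρ = 0.75 gives L = 2 and ρ = 0.6 gives L = 3, in agreement with the equivalent test ρ^{L+1} < 2ρ−1 (0.75^{3} ≈ 0.42 < 0.5 and 0.6^{4} ≈ 0.13 < 0.2).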
Therefore the performance of the ZPiOR approximation (PFPPC+ZPiOR) becomes asymptotically (as L → ∞) close to that of the PFPPC scheme, except for the single channel case with m = 0.5, where one can use the EPPR approximation instead to reduce complexity.
In this case, (32) can be simplified as
which can be easily solved by using a standard bisection method. In fact, numerical studies illustrate (as we will see later) that the ZPiOR approximation has near-optimum performance (for Problem (25)) for a large number of quantization regions. Thus, the ZPiOR approximation achieves a better complexity-performance tradeoff than PFPPC+EPPR.
Remark 3
For the low P_{av} scenario, we can apply the ZPiOR approximation as well. This is because it is easy to verify that
then when the average power is small (P_{av} → 0), p_{ iL }→ 0,i = 1,…,M as well, and the corresponding quantization threshold r_{ iL }→ ∞. In this case, the region{\mathcal{R}}_{L} only includes{\mathcal{R}}_{L,1} (the outage region) and the corresponding power level P_{ L }= 0, thus making the ZPiOR approximation applicable. A similar observation was also made in[9].
Asymptotic behavior of outage probability
Here we briefly comment on the diversity behavior of the outage minimization algorithm using POHPA for Nakagami-m fading channels. Define the diversity gain d as
Then we have the following important approximation for d, which generalizes the existing diversity order results for outage probability (e.g.,[13]), that are valid for Rayleigh fading channels only.
Theorem 3
For an arbitrary M, with log_{2}L bits of quantized feedback, using the optimal power allocation employing the POHPA approximation, we have
The diversity order can be approximated as
Proof
See Appendix 5. □
Special case: Note that for the Rayleigh fading case where m_{i} = 1, ∀i = 1,2,…,M, (40) becomes d\approx \sum _{j=1}^{L}{M}^{j}, which is consistent with similar results in [9, 10].
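As a quick illustration of how fast the approximate diversity order grows with feedback in the Rayleigh special case, the sum d ≈ \sum _{j=1}^{L}{M}^{j} can be tabulated for a given number of feedback bits B, using L = 2^{B} as in Theorem 3. The function name below is hypothetical, and only the Rayleigh expression stated above is used.

```python
def rayleigh_diversity_order(num_channels, feedback_bits):
    """Approximate diversity order sum_{j=1}^{L} M^j with L = 2**feedback_bits (Rayleigh case)."""
    num_regions = 2 ** feedback_bits
    return sum(num_channels ** j for j in range(1, num_regions + 1))

# Two channels: no feedback, 1 bit, 2 bits.
orders = [rayleigh_diversity_order(2, bits) for bits in (0, 1, 2)]
```

With zero feedback bits (L = 1) the expression reduces to d ≈ M, and each extra bit doubles the number of terms in the sum.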
Remark 4
It is possible that the result in Theorem 3 may hold with equality, rather than being an approximation for the diversity order. However, due to the various levels of approximations involved in deriving this, we are unable to prove an exact equality at this stage. This will involve computing orders of approximation errors and showing that the error goes to zero as P_{av} goes to infinity. We leave this for future study.
Large number of channels analysis
The previous algorithms can be effectively applied to find locally optimal solutions, or suitable approximations to them, for a moderate number of parallel channels, such as M < 10. Once M ≥ 10, these algorithms become computationally demanding. Given that practical wideband slow-fading multicarrier systems such as OFDM (with a large number of subcarriers) can be modeled as an asymptotically large number of i.i.d. parallel channels [18], one needs outage-minimizing power allocation algorithms with limited feedback for large M. Below we provide such an algorithm using a Gaussian approximation for large M at high P_{av}.
Remark 5
According to [18], the mutual independence of a large number of parallel channels can be justified by the assumptions that the number of independent propagation paths in wideband slow-fading channel models increases linearly with the bandwidth, and that the carrier frequencies of the parallel channels are sufficiently separated so that the effect of the multipath spread is essentially eliminated. Even if adjacent subcarriers are correlated, with the subcarrier grouping technique [26] we can still have a large number of parallel independent subchannels (each comprising a number of subcarriers), such as the 32 subchannels stated in [26].
Note that in high average power, we have
where f_{i} = h_{i}λ_{i}, \frac{1}{{\lambda}_{i}} is the mean of the channel gain h_{i}, and under the Nakagami fading model, the pdf of f_{i} is \frac{{({m}_{i})}^{{m}_{i}}}{\Gamma ({m}_{i})}{f}_{i}^{{m}_{i}-1}{e}^{-{m}_{i}{f}_{i}}, \forall i.
Thus F(P_{ j }),j = 1,…,L for M channels can be approximated as
where {s}_{j}=2{r}_{0}-\frac{1}{M}\sum _{i=1}^{M}\log \left(\frac{{p}_{ij}}{{\lambda}_{i}}\right)={c}^{\prime}-\frac{1}{M}\sum _{i=1}^{M}\log ({p}_{ij}), {c}^{\prime}=2{r}_{0}+\frac{1}{M}\sum _{i=1}^{M}\log ({\lambda}_{i}), and the function V(·) denotes the cdf of \frac{1}{M}\sum _{i=1}^{M}\log ({f}_{i}). It is easy to show that the pdf of z_{i} = log(f_{i}) is {f}_{{z}_{i}}=\frac{{({m}_{i})}^{{m}_{i}}}{\Gamma ({m}_{i})}{e}^{-{m}_{i}{e}^{{z}_{i}}}{e}^{{m}_{i}{z}_{i}}. Denote its mean and variance by E[z_{i}] and Var[z_{i}], respectively. For the Rayleigh fading case, the pdf of {z}_{i}=\log ({f}_{i}) is {e}^{-{e}^{{z}_{i}}}{e}^{{z}_{i}}, which is the well-known Gumbel distribution with mean E[z_{i}] = −r, where r is the Euler-Mascheroni constant (r = 0.5772156649…), and variance \text{Var}[{z}_{i}]=\frac{{\pi}^{2}}{6}.
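The Rayleigh-case moments quoted above can be sanity-checked by simulation. The sketch below draws unit-mean exponential samples f_{i} (Rayleigh fading, m_{i} = 1), takes logarithms, and compares the sample mean and variance with −0.5772… and π²/6. The function name, sample size, and seed are arbitrary illustrative choices.

```python
import math
import random

def log_exponential_moments(num_samples=200_000, seed=0):
    """Sample mean and variance of z = log(f) with f exponential of unit mean."""
    rng = random.Random(seed)
    # Guard against the (practically impossible) draw f == 0 before taking the log.
    zs = [math.log(max(rng.expovariate(1.0), 1e-300)) for _ in range(num_samples)]
    mean = sum(zs) / num_samples
    var = sum((z - mean) ** 2 for z in zs) / (num_samples - 1)
    return mean, var

sample_mean, sample_var = log_exponential_moments()
target_mean = -0.5772156649015329   # minus the Euler-Mascheroni constant
target_var = math.pi ** 2 / 6.0
```

The agreement is only statistical, so any comparison should allow a tolerance consistent with the sample size.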
Note that for large M, if m_{1} = ⋯ = m_{M}, or in the special case of Rayleigh fading (m_{i} = 1, ∀i), the z_{i} are i.i.d. with finite mean and variance, so the Central Limit Theorem (CLT) directly applies and one can use a Gaussian approximation for the pdf of \frac{1}{M}\sum _{i=1}^{M}{z}_{i}. However, in the general case where the fading parameters m_{i} differ across channels, the z_{i}, i = 1,2,…,M, are independent but not necessarily identically distributed. In this case, it is important to prove that {z_{i} − E[z_{i}]} satisfies the Lindeberg condition (for a statement of this condition, see p. 262, [27]), so that a generalized CLT can be applied and a Gaussian approximation can be used for the instantaneous mutual information over parallel fading channels. Indeed, we can analytically prove the following Lemma:
Lemma 3
The sequence {z_{ i }− E[z_{ i }]} satisfies the Lindeberg condition.
Proof
The proof of this can be found in Appendix 6. □
Therefore when the number of channels M → ∞, the cdf of\frac{1}{M}\sum _{i=1}^{M}{z}_{i} can still be approximated (by applying Theorem 3, Chap. VIII.4 in[27]) by a Gaussian cdf with mean and variance given by
Thus, we have
The original problem (14) for a large number of channels case can be approximated as
Using the KKT necessary conditions, we again get
Note that
where f(s)=\frac{1}{\sigma \sqrt{2\pi}}{e}^{-\frac{{(s-\mu )}^{2}}{2{\sigma}^{2}}}. It is easily seen that the above expression for \frac{\partial V({s}_{j})}{\partial {p}_{ij}} is a monotonically increasing function of p_{ij} for all i = 1,2,…,M and j = 1,2,…,L. Therefore, from (46), we have
The above result implies that at a local optimum, using the Gaussian approximation, the power levels (for each quantization region) are identical for all channels, which coincides with the EPPC scheme. With a slight abuse of notation, denote p_{j} = p_{ij}, j = 1,…,L. Then we have s_{j} = c^{′} − log(p_{j}), and the vector quantization problem (45) can be converted into the scalar quantization problem below with quantization thresholds s_{1},…,s_{L}:
After employing the corresponding KKT necessary optimality conditions and simplifying, we have the system of L nonlinear equations below:
When L is not large, one can solve the above equations using optimization tools such as 1stOpt. When L is large (roughly L ≥ 16), we can also use the EPPR approximation or the ZPiOR approximation to solve them, as discussed in the section on high P_{av} approximations. Table 1 below shows the applicability of the various algorithms discussed so far according to different ranges of M and L at high P_{av}, where “GA” denotes the Gaussian approximation based algorithms.
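To make the Gaussian approximation concrete, the sketch below evaluates the approximate outage V(s_{j}) for Rayleigh fading with equal power p_{j} on all channels. It is an illustration under stated assumptions rather than the article's algorithm: the function name is hypothetical, and the mean and variance of \frac{1}{M}\sum _{i=1}^{M}{z}_{i} are taken as −r and π²/(6M), which follow from the i.i.d. Rayleigh moments quoted earlier but are not copied from the article's own expressions.

```python
import math

EULER_GAMMA = 0.5772156649015329

def ga_outage_rayleigh(p, lam, r0):
    """Gaussian-approximation outage V(s) for Rayleigh fading with equal power p per channel.

    lam holds the inverse mean gains lambda_i and r0 is the target rate in nats.
    """
    num_channels = len(lam)
    # s = 2*r0 - (1/M) * sum_i log(p / lambda_i), i.e. s = c' - log(p)
    s = 2.0 * r0 - sum(math.log(p / l_i) for l_i in lam) / num_channels
    mu = -EULER_GAMMA                                   # assumed mean of the averaged log-gain
    sigma = math.pi / math.sqrt(6.0 * num_channels)     # assumed standard deviation
    return 0.5 * (1.0 + math.erf((s - mu) / (sigma * math.sqrt(2.0))))

# Sixteen channels with unit mean gain and r0 = 0.25 nats per channel use.
lam16 = [1.0] * 16
outage_low_power = ga_outage_rayleigh(1.0, lam16, 0.25)
outage_high_power = ga_outage_rayleigh(10.0, lam16, 0.25)
```

Because s_{j} decreases with log(p_{j}), the approximate outage falls monotonically as the power level grows, and it equals one half exactly when s_{j} coincides with the assumed mean.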
Numerical results
To numerically illustrate the performance of the designed power allocation strategies, we consider M parallel (independent) Nakagami block-fading channels, which characterize a multicarrier OFDM system with M parallel subchannels located at nonadjacent carrier frequencies. The mean value of the gamma distributed fading gain for each channel is assumed to be inversely proportional to the square of the wireless propagation distance d, and the required transmission rate is taken to be r_{0} = 0.25 nats per channel use. Outage performance with full CSI at the transmitter is obtained with the optimal power allocation results presented in [3]. It should be noted that the results illustrate the “real outage” performance of the proposed algorithms, where the power codebook designed via the algorithms is used to obtain the average outage probability over a large number of Monte Carlo simulated channel realizations. As a result, the average power required for a given real outage may not be strictly the same as the original average power for which the power codebook was designed. However, for a given algorithm, the graphs can and should be used to determine the minimum outage probability obtainable for a given average power and vice versa.
Experiment 1
The first experiment examines the performance of POHPA. Figure 7 compares the outage performance of SPSA, an exhaustive search over the space of all possible power allocation policies implementing the power ordering (PO) approximation only, and POHPA with 1 bit of feedback for the two-channel case (d_{1} = 40 m, d_{2} = 60 m, m_{1} = m_{2} = 0.5). It can be observed that when P_{av} is small, the three methods differ negligibly in performance, while when P_{av} is large, SPSA is slightly better than PO and POHPA. Figure 8 shows the outage performance of SPSA and POHPA for a higher dimensional case (four channels, d_{1} = 30 m, d_{2} = 40 m, d_{3} = 60 m, d_{4} = 70 m, m_{1} = m_{2} = m_{3} = m_{4} = 1). Again, it can be seen clearly that with an identical number of feedback bits, the outage probability gap between SPSA and POHPA gradually increases as P_{av} increases. From Figures 7 and 8, it seems that in a higher dimensional space (larger number of channels), with the same number of feedback bits, the outage probability gap between the two methods is bigger than in a low dimensional space, especially at high P_{av}. With the same value of M, this gap seems to decrease as the number of feedback bits increases, as shown in Figure 8. This can be explained by the fact that, to achieve a fixed outage probability, a larger number of feedback bits requires less P_{av}.
Experiment 2
This experiment tests the performance of the suboptimal scheme HP_{av}A (namely, PFPPC). Figure 9 compares the outage performance of the PFPPC approximation with its optimal case (POHPA) for two channels (d_{1} = 40 m, d_{2} = 60 m) with 1 bit of feedback (m_{1} = m_{2} = 0.5). The striking observation in Figure 9 is that when P_{av} ≤ 32 dB, POHPA only slightly outperforms PFPPC, while when P_{av} > 32 dB, the performance curves of PFPPC and POHPA almost overlap (i.e., the performance of PFPPC is very close to its optimum), indicating that PFPPC is an efficient near-optimal scheme for POHPA, especially at high P_{av}. In addition, Figure 10 illustrates the efficiency of using the PFPPC+EPPR and PFPPC+ZPiOR schemes for HP_{av}A (PFPPC) with a large number of feedback bits. As shown in Figure 10, with 4 bits of feedback (16 regions), PFPPC with the ZPiOR approximation (PFPPC+ZPiOR) achieves almost equivalent performance to PFPPC with the EPPR approximation (PFPPC+EPPR), and both schemes are very close to their optimal case (PFPPC). This result illustrates that ZPiOR can be combined with PFPPC as a computationally simpler alternative to PFPPC+EPPR for a large number of feedback bits.
Experiment 3
The third simulation, as illustrated in Figure 11 for the four-channel case (d = [30 m, 40 m, 60 m, 70 m] and m_{1} = m_{2} = m_{3} = m_{4} = 0.5), studies the effect of increasing the number of feedback bits on the outage performance using the proposed schemes. For comparison, the performance of the optimal power control policy with full CSI [3] is also shown. Instead of comparing the performance with the POHPA scheme, we plot the outage probabilities of its computationally efficient near-optimal schemes (PFPPC and its variants). With a small number of feedback bits (1 bit and 2 bits), PFPPC can be implemented by using the optimization software 1stOpt, and with 4 bits of feedback (L = 16), we plot the performance of PFPPC+ZPiOR instead. The important observation from this figure is that introducing one extra bit of feedback substantially reduces the gap with the full CSI performance, and only a few bits of feedback can eliminate most of that gap. For example, at an outage probability of 10^{−2}, with 4 bits of feedback, there is only around 2.5 dB power loss compared to the full CSI case. This confirms that power allocation with limited feedback (with only a few feedback bits) can provide a dramatic performance advantage over no CSI (channel nonadaptive power allocation across all channels).
Figure 12 depicts the diversity behavior of the proposed outage minimization scheme POHPA using the derived bound given in (39) for four channels with m_{1} = m_{2} = m_{3} = m_{4} = 0.5. At high P_{av}, the outage performance of the PFPPC scheme is very close to optimum (POHPA), so here we plot the performance of the computationally efficient PFPPC scheme instead of POHPA. We also use (28) to approximate the outage expression when the outage probability is ≤ 10^{−10}. As we can see from Figure 12, the derived bound captures the slope of the outage behavior at high P_{av} extremely well.
Experiment 4
Figure 13 shows the effect of the fading parameter m on the outage performance. It depicts the outage performance with 4 bits of feedback over six Nakagami fading channels with different values of the fading parameter: m = 0.5, m = 1, and m = 2 (here we use an identical fading parameter for each channel, i.e., m_{1} = ⋯ = m_{6} = m, and d = [20,30,40,60,70,80]). As m increases, i.e., as the fading severity decreases, significant performance gains are observed. To achieve a target outage probability of 10^{−2}, 4 bits of feedback with m = 2 provides around 2.65 and 1.18 dB improvements over 4 bits of feedback with m = 0.5 and m = 1, respectively, as measured by the SNR gap with respect to their respective full CSI performances.
Experiment 5
Figure 14 compares the outage performance of the PFPPC scheme and the Gaussian approximation (GA) for the four-channel case (m_{1} = m_{2} = m_{3} = m_{4} = 0.5). It can be seen very clearly that the PFPPC scheme outperforms GA, with the benefit of the PFPPC scheme becoming more pronounced as P_{av} increases. For instance, with the same number of feedback bits, at an outage probability of 10^{−3}, PFPPC with 1 bit requires roughly 7.8 dB less power than GA does, and PFPPC with 2 and 4 bits of feedback provides around 5.6 and 2.3 dB power savings over GA, respectively. Even with only 1 bit of feedback, PFPPC can achieve better performance than GA with 2 bits of feedback at high P_{av}. These results indicate that POHPA (with the PFPPC approximation) can achieve a remarkable performance advantage over GA, especially at high P_{av}. However, when the number of channels is large, POHPA becomes computationally prohibitive. In this case, GA is an efficient alternative, which is consistent with similar observations (for MIMO settings) in [28].
Figure 15 illustrates the outage probability over a large number of channels (16 channels) using GA, with the distances d_{1},…,d_{16} drawn randomly (with a uniform distribution) from the range [20 m, 100 m] and identical fading parameters across channels (m_{1} = ⋯ = m_{16} = m), for m = 0.5 and m = 2, respectively. We again see that only a few bits of feedback are required to close the gap with the full CSI performance. For instance, to achieve a target outage probability of 10^{−2} with m = 2, the power consumption gap between 10 bits of feedback (less than one bit per channel) and its full CSI based counterpart is only about 2.4 dB, while with m = 0.5, the gap is 2.8 dB. On the other hand, as m decreases, i.e., as the fading severity increases, the outage performance of the limited feedback schemes deteriorates as expected.
Conclusions
In this article, we have derived a simulation-based optimization algorithm using SPSA and presented various low-complexity suboptimal outage minimization algorithms via optimal power allocation with finite-rate or quantized channel feedback for M parallel block-fading channels under a long-term average power constraint. Numerical results illustrate the effectiveness of these algorithms via their outage performance in comparison with that of the optimal power allocation with full CSI, and show that only 4 bits of feedback substantially close the gap with the outage performance of the full CSI algorithm for M = 4 or M = 6. For a large number of channels (M = 16), our Gaussian approximation based algorithm performs approximately within 2.8 dB (SNR gap) of the full CSI based algorithm at an outage probability of 10^{−2} with less than 1 bit of (broadcast) feedback per channel when all channels undergo severe Nakagami fading with identical fading parameter m = 0.5. Future study includes extension of these results to correlated fading channels, consideration of noisy or erroneous feedback as investigated in [29], and quantized CSIT based power allocation for more general optimization problems such as the service-outage based power and rate allocation in [15].
Endnote
^{a}A flat fading channel can be expressed as a complex-baseband model. However, according to Appendix B.4.2 of ([19], pp. 527–528), one way to derive the capacity of a complex-baseband channel is to think of each use of the complex channel as two uses of a real channel. Thus, we only need to consider a real-baseband model, i.e., (1), and then multiply the maximum mutual information by 1/2, with unit nats per real dimension, as in (2).
Appendix 1
Proof of Lemma 1
Proof
The proof is similar to [9, 12]. For all j, 1 ≤ j ≤ L−1, P(h) = P_{j} if \mathbf{h}\in {\mathcal{R}}_{j}. Let {\mathcal{R}}_{j}^{\ast} be the set of all h such that {\mathbf{p}}_{j+1}^{\Sigma}<{({\mathbf{p}}^{\ast}(\mathbf{h}))}^{\Sigma}\le {\mathbf{p}}_{j}^{\Sigma}; we need to prove that {\mathcal{R}}_{j}^{\ast}={\mathcal{R}}_{j}. Assume the contrary, that {\mathcal{R}}_{j}^{\ast}\setminus {\mathcal{R}}_{j} is a nonempty set (∖ denotes the set subtraction operation), i.e., if \mathbf{h}\in {\mathcal{R}}_{j}^{\ast}\setminus {\mathcal{R}}_{j}, then \mathbf{h}\in {\mathcal{R}}_{j}^{\ast} and \mathbf{h}\notin {\mathcal{R}}_{j}. We can partition the set {\mathcal{R}}_{j}^{\ast}\setminus {\mathcal{R}}_{j} into two subsets {\mathcal{R}}_{j}^{-}={\mathcal{R}}_{j}^{\ast}\setminus {\mathcal{R}}_{j}\bigcap \bigcup _{k=1}^{j-1}{\mathcal{R}}_{k} and {\mathcal{R}}_{j}^{+}={\mathcal{R}}_{j}^{\ast}\setminus {\mathcal{R}}_{j}\bigcap \bigcup _{k=j+1}^{L}{\mathcal{R}}_{k}. If the set {\mathcal{R}}_{j}^{-} has nonzero probability, then we can construct a new scheme by assigning all elements of this set to {\mathcal{R}}_{j} instead. Since \forall \mathbf{h}\in {\mathcal{R}}_{j}^{-}, {({\mathbf{p}}^{\ast}(\mathbf{h}))}^{\Sigma}\le {\mathbf{p}}_{j}^{\Sigma}, such a rearrangement achieves the same outage probability but with less average power due to {\mathbf{p}}_{j}^{\Sigma}<{\mathbf{p}}_{k}^{\Sigma}, 1 ≤ k ≤ j−1, which contradicts the optimality of the optimal solution \mathcal{P} and \mathcal{R}.
On the other hand, the set {\mathcal{R}}_{j}^{+} is also an empty set; otherwise, we can easily see that this set is in outage (since \forall \mathbf{h}\in {\mathcal{R}}_{j}^{+}, {({\mathbf{p}}^{\ast}(\mathbf{h}))}^{\Sigma}>{\mathbf{p}}_{j+1}^{\Sigma}), so we would have a larger overall outage probability, which is also a contradiction. Therefore, we have {\mathcal{R}}_{j}^{\ast}\subseteq {\mathcal{R}}_{j}. We can similarly prove {\mathcal{R}}_{L}^{\ast}\subseteq {\mathcal{R}}_{L} as in [9]; the details are omitted due to space limitations. Since \bigcup _{j=1}^{L}{\mathcal{R}}_{j}^{\ast}=\bigcup _{j=1}^{L}{\mathcal{R}}_{j}, we can conclude that {\mathcal{R}}_{j}^{\ast}={\mathcal{R}}_{j}, \forall j. □
Appendix 2
Proof of Theorem 1
Proof
We introduce μ as the Lagrange multiplier associated with the average power constraint. The Lagrangian can be written as
Setting the first-order partial derivatives to zero (i.e., \frac{\partial J}{\partial {p}_{ij}^{\ast}}=0, i=1,\dots ,M, for j = 1,…,L) produces
Since μ ≠ 0 (note that otherwise \frac{\partial {F}^{\prime}({\mathbf{p}}_{1})}{\partial {p}_{i1}^{\ast}}=0, i=1,\dots ,M, and since {F}^{\prime}({\mathbf{p}}_{1}) is monotonically decreasing in p_{i1}, \frac{\partial {F}^{\prime}({\mathbf{p}}_{1})}{\partial {p}_{i1}^{\ast}}=0 implies {p}_{i1}^{\ast}=\infty, corresponding to infinite average power, which is impossible) and {F}^{\prime}({\mathbf{p}}_{2})\ne {F}^{\prime}({\mathbf{p}}_{1}), we have \left(1-\mu \sum _{i=1}^{M}{p}_{i1}+\mu \sum _{i=1}^{M}{p}_{iL}\right)\ne 0. Simplifying (52), we have
Therefore, finally, we have
This completes the proof. □
Appendix 3
Proof of Theorem 2
Proof
In the multiple infinite series representation (19), for a sufficiently high P_{av}, we have \left|\frac{{m}_{i}{\lambda}_{i}K}{{p}_{ij}}\right|<1, \forall i,j. Thus from [30], the conditions for the convergence of the power series (20) are satisfied.
From (19), we have
whereT({n}_{1},\dots ,{n}_{M})=\frac{\prod _{i=1}^{M}{\left({m}_{i}\right)}_{{n}_{i}}{({m}_{i}{\lambda}_{i}K)}^{{n}_{i}}\frac{1}{{n}_{i}!}}{{\left(1+\sum _{i=1}^{M}{m}_{i}\right)}_{{n}_{\tau}}} and\mathbb{Z} is the set of nonnegative integers.
Since in the high P_{av} regime, using the approximation\frac{1}{\prod _{i=1}^{M}{p}_{\mathit{\text{ij}}}^{{n}_{i}}}\approx 0 for\sum _{i=1}^{M}{n}_{i}\ge 1, we have
From (56), we have, for i = 1,…,M
Finally, by substituting (57) in Theorem 1, we have
which completes the proof of Theorem 2. □
Appendix 4
Proof of Lemma 2
Proof
In[24], Moschopoulos justified the uniform convergence of the single infinite series in (21). With PFPPC, (21) can be rewritten as
where\rho =\sum _{i=1}^{M}{m}_{i}\ge 0.5\phantom{\rule{1em}{0ex}}(M\ge 1,{m}_{i}\ge 0.5) and the coefficients δ_{ n }are obtained recursively by
In the high P_{av} regime (P_{av} → ∞), {r}_{1}=\frac{K}{{p}_{1}}\to 0, so with PFPPC, the total average power allocated to the outage region {\mathcal{R}}_{L,1} is
We have
where C=K{m}_{M}\max ({\lambda}_{i})\prod _{i=1}^{M}{\left(\frac{{\lambda}_{i}}{\max ({\lambda}_{i})}\right)}^{{m}_{i}} and {C}^{\prime}=C\frac{{({m}_{M}\max ({\lambda}_{i}))}^{\rho -1}}{\Gamma (\rho )}, and the last equality follows from the fact that when n ≥ 1, the individual terms go to zero for any ρ, as r_{1} → 0.
From the proof of Theorem 3 (see below), we have