 Research
 Open access
 Published:
Joint source and relay design for twohop amplifyandforward relay networks with QoS constraints
EURASIP Journal on Wireless Communications and Networking volume 2013, Article number: 108 (2013)
Abstract
In this paper, we consider the joint design of source precoding matrix and the relay precoding matrix in a twohop multipleinput multipleoutput relay network. The goal is to find a pair of matrices in order to minimize the power consumption and at the same time meet preselected quality of service constraints that are defined as the mean square error of each data stream. Using majorization theory, we simplify the matrixvalued optimization problem into a scalarvalued one. We then propose a lower bound and an upper bound of the original problem, both in convex forms. Specifically, the latter is solved by a multilevel waterfilling algorithm that is much efficient than directly applying the interior point method. Numerical examples corroborate the proposed studies and also demonstrate the tightness of both bounds to the original problem.
1 Introduction
Relay networks have recently attracted much attention because of their promising capability in achieving reliable communication and wide coverage for the next generation of wireless systems [1, 2]. Different types of relaying strategies, e.g., amplify and forward (AF), decode and forward (DF), and compress and forward (CF) were introduced in [3–5], respectively. DF and CF decode data before retransmission and are, thus, also known as regenerative strategies; AF relay amplifies the received data only and is known as nonregenerative strategy. The computational simplicity in AF relay makes it highly attractive and a strong candidate for the realtime application. On the other side, multiinput multioutput (MIMO) technique [6] can enhance the data transmission rate by introducing the spatial diversity gain. Therefore, combining relaying and MIMO becomes a natural way to further advance the future wireless communication systems.
Most of the existing works focus on the AF MIMO relay networks, where a certain performance criterion is optimized subject to power constraints at both source and relay. For example, the mutual information and the total mean square error (MSE) criteria are selected as objective functions in [7, 8], respectively. The authors in [9] investigated the diversity multiplexing tradeoff for MIMO relays. Moreover, there are some works on beamforming design for special types of AF MIMO relays; for instance, the authors in [10] considered codebook design for halfduplex AF MIMO relays, while the author in [11] considered beamforming design for rather vast types of MIMO relays. Applying the majorization theory [12], the authors in [13] proposed a unified framework for most performance criteria. The extension of [13] to the multiple relay case was introduced in [14].
All of the mentioned methods above considered enhancing the system performance by maximizing or minimizing a certain objective function constraint on power consumption at some or all nodes. Although this model improves the system performance, it does not guarantee a certain quality of service (QoS) requirement for an individual user. The importance of considering QoS becomes more obvious in practical applications supporting several types of service each with a different reliability requirement. One of the pioneering works considering QoS in MIMO pointtopoint systems is [15], where the authors optimized the transmission power subject to predefined sets of QoS, e.g., individual MSE, signaltonoise ratio (SNR), and bit error rate. In [16], the authors investigated the optimization of source beamforming and relay weighting matrix in order to minimize the total power subject to a given set of QoS for multiinput singleoutput broadcast channel. On the aspect of relay network, AF MIMO relay network with QoS constraints has been investigated in [17]. Applying majorization theory, the author in [17] proposed a unified framework for the design of the optimal structure of the source precoding and relay amplifying matrices. The author applied a successive geometric programming method to obtain the optimal power loading among data streams. Unfortunately, the computational complexity of the proposed solution in [17] compromises its suitability for practical implementation. With similar assumptions, the authors in [18] considered a simplified version of the problem in which only the relay power is minimized. Then, the minimization is executed over a convex lower bound of the objective function. In a rather more general setup, the authors in [19] studied the joint relay and source power minimization and applied the majorization theory to reduce the problem to a scalar one. Then, using QoS convex relaxation, an upper bound and a lower bound on the optimal results are presented.
In this paper, building upon the results in [19], we take a specific look into a dualhop^{a} AF MIMO relay network. We first jointly design the source and the relay precoding matrices such that the overall transmission power is minimized subject to a given set of QoS constraints. Applying the majorization theory, we reduce the original matrixvalued problem to a scalarvalued one and then propose two new convex optimization problems whose objective values serve as the lower bound and the upper bound of the original problem. While both new problems can be handled by the existing convex optimization tools, e.g., CVX[20], we specifically design a multilevel waterfilling (MLWF) algorithm to solve the upper bound problem that can further reduce the computational complexity. Compared with the successive geometric programming approach developed in [17], the MLWF algorithm does not require any optimization tool and thus is easier to implement for practical relay systems. Numerical results corroborate the proposed studies and clearly demonstrate the tightness of the proposed lower bound and upper bound, especially over low MSE region.
The rest of this paper is arranged as follows: Section 2 presents the system model and formulates the optimization problem in matrix form. In Section 3, the optimization is simplified to a scalarvalued problem from the majorization theory. Two suboptimal problems whose objectives serve as the upper bound and the lower bound of the original problems are derived in the same section. In Section 4, the upper bound problem is solved from a multilevel waterfilling algorithm coupled with decomposition methods. The simulation results are presented in Section 5, and conclusions are drawn in Section 6.
1.1 Notations
Vectors and matrices are boldface small and capital letters, respectively; the transpose, complex conjugate, Hermitian, inverse, and pseudoinverse of A are denoted by A^{T}, A^{∗}, A^{H}, A^{−1}, and A^{†}, respectively; ∥ a ∥ denotes the Euclidean norm of the vector A; diag{a} is the diagonal matrix with diagonal elements given by A, while diag{a} is a vector with entries taken from the diagonal elements of A ; I is the identity matrix; and E {·} is the statistical expectation. Moreover, basic notations and definitions of majorization theory can be found in Appendix 1.
2 System model
As shown in Figure 1, we considrer a threenode multiantenna relay network that consists of a source node, a relay node, and a destination node, equipped with M, N, and P antennas, respectively. We assume that the direct link between the source and the destination is weak and, therefore, can be neglected. Denote the baseband MIMO channels between the source and relay as the N × M matrix H_{ s }, while that between the relay and destination as the P × N matrix H_{ r }. We further assume that the channel state information is perfectly known at all nodes. Suppose that L data streams, denoted by x, are precoded by an M × L precoding matrix B at source node. We require L ≤ min{M,N,P} so that the data streams can be detected with linear method at the destination. With an N × N precoding matrix F at relay, the received signal at destination is as follows:
where v_{ r } and v_{ d } are additive white complex Gaussian noise at relay and destination, respectively, i.e., {\mathbf{v}}_{r}\in \mathcal{C}\mathcal{N}(0,{\sigma}_{r}^{2}{I}_{N}) and {\mathbf{v}}_{d}\in \mathcal{C}\mathcal{N}(0,{\sigma}_{d}^{2}{I}_{N}). Without loss of generality, we set {\sigma}_{d}^{2}={\sigma}_{r}^{2}=1. Since the correlation and power can be designed through B, the data streams from the source can be assumed independent from each other, i.e., E{xx^{H}} = I.
We consider the minimum mean square error detection at destination with a P × L decoding matrix T. The estimated data can be expressed as follows:
with the error matrix
The MSE is defined as t r(C). For the given B and F, one can easily find the optimal T from the standard approach [21]:^{b}
and the corresponding error matrix is
As in [15], the QoS measurement here is taken as the MSE of each individual data stream. Let us denote the QoS vector as ρ = [ρ_{1},…,ρ_{ L }]^{T}, i.e., the MSE of the i th data stream is required to be smaller than ρ_{ i }. Note that ρ_{ i }<1 is necessary to avoid trivial solutions.
From (5), the QoS constraints are stated as follows:
where ‘ ≤’ denotes the elementwise operation if used between vectors.
The average power consumed by the source is computed from
while that spent by the relay is
3 Optimization problem
The goal is to find the optimal B and F to minimize the overall power spent by the source and relay and at the same time meet the QoS requirements.
Define
The optimization problem is then expressed as follows:
Unfortunately, the problem is nonconvex and cannot be solved in an efficient way.
3.1 Equivalent problem
Let us define a new optimization:
where ≻^{w} means weak majorization, and the details can be found in Appendix 1.
Theorem 1. Problems (P1) and (P2) are equivalent.
Proof. The idea is to prove that for each feasible point of (P1), there is a corresponding feasible point in (P2) that yields the same objective value and vice versa.
(P1) →(P2): For any feasible B in (P1), construct a new matrix \stackrel{~}{\mathbf{B}}=\mathbf{B}\mathbf{Q}, where Q is the unitary eigenmatrix of M^{H}R M. Then, {\stackrel{~}{\mathbf{M}}}^{H}\mathbf{R}\stackrel{~}{\mathbf{M}} is a diagonal matrix. It can be readily checked that the objective value of (P2) with (\stackrel{~}{\mathbf{B}},\mathbf{F}) is the same as the objective value of (P1) with (B,F). Moreover, since \mathbf{I}+{\stackrel{~}{\mathbf{M}}}^{H}\mathbf{R}\stackrel{~}{\mathbf{M}} is a diagonal matrix, there is
where \mathbf{Z}\triangleq {(\mathbf{I}+{\mathbf{M}}^{H}\mathbf{R}\mathbf{M})}^{1}. From Z_{ i i }≤ ρ_{ i }, one can conclude that diag{Z}≻^{w}ρ. We further know from Lemma 2 in Appendix 1 that λ {Z} ≻^{w}diag{Z}. Therefore, λ{Z}≻^{w} ρ and we achieve the conclusion that for any feasible point (B,F) in (P1), there is always a corresponding feasible point (\stackrel{~}{\mathbf{B}},\mathbf{F}) in (P2) that gives the same objective value.
(P2) →(P1): Define \stackrel{~}{\mathbf{Z}}={(\mathbf{I}+{\stackrel{~}{\mathbf{M}}}^{H}\mathbf{R}\stackrel{~}{\mathbf{M}})}^{1} and assume that (\stackrel{~}{\mathbf{B}},\mathbf{F}) is a feasible point of (P2). From (P2), we know that \mathit{\text{diag}}\left\{\stackrel{~}{\mathbf{Z}}\right\}=\mathit{\lambda}\left\{\stackrel{~}{\mathbf{Z}}\right\}{\succ}^{w}\mathit{\rho} holds. From Lemma 3 in Appendix 1, we know that there exists a vector c satisfying both \mathit{\lambda}\left\{\stackrel{~}{\mathbf{Z}}\right\}\succ \mathbf{c} and c ≤ ρ. From Lemma 2, we know that for each \mathbf{c}\prec \mathit{\lambda}\left\{\stackrel{~}{\mathbf{Z}}\right\}, there exists a matrix W with diag{W}=c and \mathit{\lambda}\left\{\mathbf{W}\right\}=\mathit{\lambda}\left\{\stackrel{~}{\mathbf{Z}}\right\}. Let \mathbf{W}={\mathbf{Q}}^{H}\stackrel{~}{\mathbf{Z}}\mathbf{Q} and define \mathbf{B}=\stackrel{~}{\mathbf{B}Q}. Then, diag{(I+M^{H}R M)^{− 1}} = diag {W} ≤ρ. Moreover, the objective function of (P1) with (B,F) is the same as the objective function of (P2). □
3.2 Processing matrix variables
Based on Theorem 1, we can solve (P2) instead of (P1).
Define two new positive semidefinite matrices as well as their singular value decomposition (SVD) as follows:
where U_{ X } and U_{ Y } are the N × L and P × L orthonormal matrices, respectively. Throughout this paper, we always sort the singular values and eigenvalues in an increasing order. Note that it is still possible that Λ_{ X } or Λ_{ Y } contains some zero entries.
The matrices {\mathbf{H}}_{s}\stackrel{~}{\mathbf{B}} and H_{ r }F can be represented as follows:
where V_{ X } can be any L × L unitary matrix, and V_{ Y } can be any N × L orthonormal matrices. These two matrices will be designed later to fulfill the optimality requirement.
Let
be the SVD of H_{ s } and H_{ r } respectively. We further express the singular matrices as {\mathbf{U}}_{s}\triangleq [{\mathbf{U}}_{s,2},{\mathbf{U}}_{s,1}] and {\mathbf{U}}_{r}\triangleq [{\mathbf{U}}_{r,2},{\mathbf{U}}_{r,1}], in which U_{s,1} and U_{r,1} contain L first columns.
One can rewrite (15) and (16) as follows:
There are three cases to be discussed:

If N = M = L, then \stackrel{~}{\mathbf{B}} can be reexpressed from (18) as
\begin{array}{l}\stackrel{~}{\mathbf{B}}={\mathbf{V}}_{s}{\mathit{\Sigma}}_{s}^{1}{\mathbf{U}}_{s,1}^{H}{\mathbf{U}}_{X}{\mathbf{\Lambda}}_{X}^{\frac{1}{2}}{\mathbf{V}}_{X}^{H}.\end{array}(19) 
If M > N = L, (18) can be simplified as
\begin{array}{l}\left[\begin{array}{ll}{\mathit{\Sigma}}_{s}^{1}& {\mathbf{0}}_{L\times (ML)}\end{array}\right]{\mathbf{V}}_{s}^{H}\stackrel{~}{\mathbf{B}}={\mathbf{U}}_{s,1}^{H}{\mathbf{U}}_{X}{\mathbf{\Lambda}}_{X}^{\frac{1}{2}}{\mathbf{V}}_{X}^{H}.\end{array}(20)The solution for \stackrel{~}{\mathbf{B}} is not unique, and the one which minimizes the objective function should be chosen. Basically, we need to solve
\begin{array}{ll}\underset{\stackrel{~}{\mathbf{B}}}{\mathrm{min}}& \text{tr}\left(\stackrel{~}{\mathbf{B}}{\stackrel{~}{\mathbf{B}}}^{H}\right)\phantom{\rule{2em}{0ex}}\\ \text{subject to}& \left[\begin{array}{ll}{\mathit{\Sigma}}_{s}^{1}& {\mathbf{0}}_{L\times (ML)}\end{array}\right]{\mathbf{V}}_{s}^{H}\stackrel{~}{\mathbf{B}}\phantom{\rule{0.3em}{0ex}}=\phantom{\rule{0.3em}{0ex}}{\mathbf{U}}_{s,1}^{H}{\mathbf{U}}_{X}{\mathbf{\Lambda}}_{X}^{\frac{1}{2}}{\mathbf{V}}_{X}^{H}.\phantom{\rule{2em}{0ex}}\end{array}(21)Note that the second term in the objective function is not included in (21) since it will be taken cared by F. From Lemma 9 in [14], the optimal structure of \stackrel{~}{\mathbf{B}} is as follows:
\begin{array}{l}\stackrel{~}{\mathbf{B}}={\mathbf{V}}_{s}{\left[\begin{array}{ll}{\mathit{\Sigma}}_{s}^{1}& {\mathbf{0}}_{L\times (ML)}\end{array}\right]}^{T}{\mathbf{U}}_{s,1}^{H}{\mathbf{U}}_{X}{\mathbf{\Lambda}}_{X}^{\frac{1}{2}}{\mathbf{V}}_{X}^{H}.\end{array}(22) 
If M > L and N > L, then (20) holds if and only if {\mathbf{U}}_{s,2}^{H}{\mathbf{U}}_{X}={\mathbf{0}}_{\left(\right(NL)\times N)}. In this case, (22) is still the optimal structure for \stackrel{~}{\mathbf{B}}. Please refer to [14] for the detailed derivation.
Following the same procedure, the optimal structure of F can be computed as follows:
We then proceed to the optimization over new variables U_{ X }, V_{ X }, U_{ Y }, V_{ Y }, Λ_{ X }, and Λ_{ Y }. Substituting (22) and (23) into (P2) yields the new objective function
where the pseudoinverses of {\mathbf{\Lambda}}_{s}^{\u2020} and {\mathbf{\Lambda}}_{r}^{\u2020} are \left[{\mathit{\Sigma}}_{s}^{1}\right. 0_{L× (N − L)}]^{T} and {\left[{\mathit{\Sigma}}_{r}^{1}{\mathbf{0}}_{L\times (ML)}\right]}^{T}, respectively.
We apply the inversion lemma [22],
for the constraint and obtain
where G is defined as the corresponding term. Substituting (22) and (23) into G yields
Remark 1. Representing the problem in terms of these new variables decreases the dependency between the objective function and constraints and thus facilitates the optimization procedure. For example, U_{ Y } is only involved in objective function, but not in the constraints. Hence, we can adjust U_{ Y } to change the objective without affecting the constraint. Reversely, V_{ X } and V_{ Y } only affect the constraint, but not the objective function.
Remark 2. On the other hand, the diagonality constraint in (P2) can be satisfied by only adjusting V_{ X } so that the other constraint and the objective functions will not be affected.
3.3 Simplification
In the following subsections, we derive the optimum structures for V_{ X }, V_{ Y }, U_{ X }, and U_{ Y }, respectively.
3.3.1 Optimal V_{ X }and V_{ Y }
The functionality of the optimal V_{ X } is to make G diagonal. Moreover, if there is a V_{ Y } such that all components in diag{I−G} are minimized simultaneously, then this V_{ Y } must be optimal because it provides the largest possible freedom to optimize over the rest variables, i.e., U_{ X }, U_{ Y }, Λ_{ X }, and Λ_{ Y }. Applying (9.H.2 in [12]) on G, we obtain
where
is the multiplication of the diagonal matrices in (27). According to the definition of submajorization and supermajorization in Appendix 1, (28) can also be written as follows:
and is equivalent to
where 1 is an all one vector. Moreover, (31) can be reexpressed as follows:
Since one can always find V_{ X } that makes G diagonal, there is λ{I − G} = diag{I − G}. Moreover, from Lemma 2, we can rewrite (32) as follows:
which indicates that \mathit{\text{diag}}\{\mathbf{I}\widehat{\mathbf{G}}\} is a simultaneous lower bound for all the elements in diag{I−G}. Therefore, \mathbf{G}=\widehat{\mathbf{G}} must hold at the optimal point, and this can be achieved if V_{ Y } = U_{ X } and V_{ X } = I.
3.3.2 Optimal U_{ X }and U_{ Y }
With the optimal V_{ Y }, the constraint is independent from both U_{ Y } and U_{ X }. Therefore, one can find the optimal structure of U_{ X } and U_{ Y } purely from
We need the following matrix inequality (9.H.1.h in [12]) to proceed: Given two L × L positive semidefinite matrices A_{1} and A_{2} with eigenvalues λ_{ l }(A_{1}) and λ_{ l }(A_{2}) arranged in the increasing order, there is
Then, the first part in (34) can be lower bounded by the following:
Obviously, the minimum value can be achieved when U_{ X } = U_{ s }, since the diagonal elements of Λ_{ s } and Λ_{ X } are all arranged in increasing order. Similar discussion holds for the second term in (34), namely
and the minimum is achieved when U_{ Y } = U_{r,1}.
Substituting the optimal U_{ X }, U_{ Y }, V_{ X }, and V_{ Y } into the objective function and the constraints of (P2), we obtain the following:
Define a_{ i } and b_{ i } as the i th diagonal entries of {\left({\mathbf{\Lambda}}_{s}^{\u2020}\right)}^{2} and {\left({\mathbf{\Lambda}}_{r}^{\u2020}\right)}^{2}, respectively. Further, define x_{ i } and y_{ i } as the i th diagonal entries of Λ_{ X } and Λ_{ Y }, respectively. Then, problem (P2) is converted to a scalar form:
Unfortunately, the constraint (40) is nonconvex, and the problem cannot be solved efficiently. We then propose the following two convex bounds for each summand on the lefthand side of (40):
Replacing the corresponding term in (40) by the righthand side (RHS) in (41) or (42) while keeping the same objective function, we can obtain the upper or the lower bound of the original problem, respectively. Both the lower bound and the upper bound problems can be solved by the existing convex optimization tools based on the interior point method, e.g., CVX, [20, 23].
4 Algorithm for the upper bound problem
Power minimization is ensured by considering the minimization of the upper bound of the objective value. Therefore, we take a detailed look into the upper bound problem and target at a more efficient solution. The upper bound problem is restated as follows:
It is hard to obtain the closed form solution even with the KarushKuhnTucker (KKT) conditions. Observing the symmetry, we can apply the primal decomposition method [23] to break down the problem into two simpler subproblems.
4.1 The decomposition method
Problem (43) can be decomposed into two parallel subproblems together with a master problem connecting as a bridge [23]. Defining a new auxiliary vector t=[t_{1},t_{2},…,t_{ N }]^{T}, the subproblems are as follows:
and the master problem is
In many cases, the subproblems do not have closedform solution. Therefore, searching the optimum T for the master problem can be iteratively done from the subgradient method. Note that the convergence of the decomposed problem to the global optimal point can be guaranteed since (43) is convex.
4.2 Solving the subproblems
The Lagrangian corresponding to Φ_{ x }(t) is as follows:
and the corresponding KKT conditions are the following:
For simplicity, we define {\stackrel{~}{\mu}}_{k}=\sum _{i=k}^{N}{\mu}_{i}. Multiplying both sides of (46) by x_{ k } and combining with (49) give the following:
After a straightforward calculation, we obtain the following:
Based on (52), the KKT conditions are reduced to the following:
Next, we propose an efficient algorithm that could find the solution that satisfies (53), (54), and (55) simultaneously.
Algorithm 1 Multilevel waterfilling algorithm
The standard waterfilling algorithm embedded in step 3 can be found in Appendix 2. For an explicit demonstration, we present the algorithm in a diagram in Figure 2.
We next need to prove the optimality of the above MLWF. Since the problem is convex, the output of the algorithm is the optimal solution if and only if all of the KKT conditions are satisfied. The conditions (55) are simultaneously satisfied from the step 4 in the algorithm. Condition (53) is itself satisfied by the nature of the waterfilling algorithm, that is the water levels are always nonnegative. The above MLWF algorithm satisfies (54) as well, as seen from the following lemma.
Lemma 1. Successive water levels achieved by the proposed MLWF algorithm are ordered decreasingly.
Proof. The algorithm has two loops: one is the inner loop ‘steps 3 → 4 → 5 → 3’ and the other is the outer loop ‘steps 3 → 4 → 5 → 6 → 3.’ Each time when the inner loop finishes, one water level \stackrel{~}{\mu}(L,H) will be achieved. Then, we proceed to compute other water levels. In the inner loop, we first adopt the water level given by standard waterfilling algorithm. Our hypothesis is that the aforementioned water level satisfies all of the conditions (55). Then, we check our hypothesis by searching whether there is a k=l_{0} for which the corresponding condition in (55) is violated.
Assume l_{0} with 1 ≤ L < l_{0}< H ≤ N is the point at which the inner loop finishes, i.e., \stackrel{~}{\mu}(L,{l}_{0}) satisfies all the conditions from L to l_{0}. At this point, \sum _{i=L}^{{l}_{0}}\frac{1}{{x}_{i}+1}=\sum _{i=L}^{{l}_{0}}{\rho}_{i} is true due to the fact that the constraint holds with equality in the standard waterfilling algorithm. Moreover, \stackrel{~}{\mu}(L,H)<\stackrel{~}{\mu}(L,{l}_{0}) holds since \stackrel{~}{\mu}(L,H) is not large enough to satisfy (55) with k = l_{0}, though it satisfies all of the conditions from k = H to k = l_{0}+1. On the other hand, we have
since, based on our assumption, conditions (55) with index greater than l_{0} are satisfied with the water level introduced by \stackrel{~}{\mu}(L,H). From (56) and \stackrel{~}{\mu}(L,H)<\stackrel{~}{\mu}(L,{l}_{0}), we conclude that
The search for the next water level (of course in the following inner loop), is between H = N and L = l_{0} + 1. We then only need to prove \stackrel{~}{\mu}(l,{l}_{0}+1)<\stackrel{~}{\mu}(1,{l}_{0}),\phantom{\rule{1em}{0ex}}{l}_{0}+1\le l\le H. First, we know \stackrel{~}{\mu}(L,l) satisfies
From the last inner loop, we know that applying \stackrel{~}{\mu}(L,l) on the condition \sum _{i=L}^{{l}_{0}}\frac{1}{{x}_{i}+1}\le \sum _{i=L}^{{l}_{0}}{\rho}_{i}, does not work, i. e.,
Therefore, using \stackrel{~}{\mu}(L,l) in both (58) and (59) yields \sum _{i={l}_{0}+1}^{l}\frac{1}{{x}_{i}+1}<\sum _{i={l}_{0}+1}^{l}{\rho}_{i}. On the other hand, \stackrel{~}{\mu}({l}_{0}+1,l) gives \sum _{i={l}_{0}+1}^{l}\frac{1}{{x}_{i}+1}=\sum _{i={l}_{0}+1}^{l}{\rho}_{i}. Therefore, we infer \stackrel{~}{\mu}({l}_{0}+1,l)<\stackrel{~}{\mu}(L,l). Together with (57), we conclude \stackrel{~}{\mu}({l}_{0}+1,l)<\stackrel{~}{\mu}(L,{l}_{0}). □
Meanwhile, the subproblem Φ_{ y }(t) can be solved with the same MLWF, which will not be restated here.
Algorithm 2 The master algorithm
This algorithm is presented by a diagram in Figure 3.
5 Simulation results
In this section, we numerically examine the performance of the proposed method. For all examples, we assume that the channel matrices (H_{ r } and H_{ s }) have independent and identically distributed Gaussian entries with zero mean and variance 1. We consider 6 × 6 ×6 MIMO relay system, and 1000 MonteCarlo runs are taken for average. The upper bound optimization will be solved both by the proposed decomposition algorithms and by the CVX convex optimization toolbox [20], while the lowerbounded optimization will only be solved by CVX. Moreover, we also compare the proposed solutions with a suboptimal method and a trivial method listed in the following:

1.
Diagonalization method. We reduce the problem into a scalar problem using SVD of channel matrices. If we consider the structure of B = U _{ s }Λ _{ B } and \mathbf{F}={\mathbf{U}}_{r}{\mathbf{\Lambda}}_{F}{\mathbf{V}}_{s}^{H}, we can reduce the problem into a simple scalar problem. In this method we basically choose the precoding matrices to match the channel matrices. The optimization problem of (10) is simplified to the following:
\begin{array}{ll}\text{(P4):}\underset{{x}_{i},{y}_{i}}{\mathrm{min}}& \phantom{\rule{1em}{0ex}}\sum _{i=1}^{L}{a}_{i}{x}_{i}+{b}_{i}{y}_{i}\phantom{\rule{2em}{0ex}}\\ \text{subject to}& \phantom{\rule{1em}{0ex}}\frac{{y}_{i}+{x}_{i}+1}{{y}_{i}+{x}_{i}+{y}_{i}{x}_{i}+1}\le {\rho}_{i}\phantom{\rule{1em}{0ex}}\forall i\phantom{\rule{2em}{0ex}}\\ \phantom{\rule{1em}{0ex}}{x}_{i}\ge 0,{y}_{i}\ge 0,\phantom{\rule{1em}{0ex}}\forall i,\phantom{\rule{2em}{0ex}}\end{array}(60)where a_{ i }, b_{ i }, x_{ i }, and y_{ i } are the i th diagonal entries of {\left({\mathbf{\Lambda}}_{s}^{\u2020}\right)}^{2}, {\left({\mathbf{\Lambda}}_{r}^{\u2020}\right)}^{2}, Λ_{ B }, and Λ_{ F }, respectively. Note that the problem in (60) is a special case of (P3) in (39).

2.
Naive method. We consider a solution that satisfies all of the constraints in (43) with equality. In this method, x and y are given by the following:
\mathrm{x}=\mathrm{y}=2/\mathit{\rho}\mathbf{1}.
In the first example, we take equal QoS requirements for all data streams. The total consumed powers versus MSE are depicted in Figure 4 for the upper bound with the proposed method, the upper bound with CVX, the lower bound, and the suboptimal diagonalization method with CVX, as well as the naive method. It can be observed that the performance of the decomposition method is exactly the same as the one obtained from CVX. We can also observe that the lower and the upper bounds are quite tight at high SNR when they meet each other. The numerical results from the diagonalization method depicts weaker performance than the proposed method with roughly 2 dB more power consumption. The naive method, on the other hand, has the poorest performance and requires more power than the proposed method (roughly 3 dB).
The performances with unequal QoS constraints are presented in Figure 5, where the QoS vector is chosen as ρ = ρ[1, 1.5, 2, 2.5, 3, 3]^{T}. One may notice that all curves are about 2 dB higher than those in Figure 4.
This is because the smallest QoS constraint drags down the overall performance and thus demands for more power. Once again, we find that the upper bound and the lower bound are quite close to each other especially at high SNR values. Meanwhile, the proposed method coincides with the upper bound from the CVX. Moreover, the advantage of the proposed method over the suboptimal diagonalization method and the naive method is fairly explicit.
We also measure the average running time for the proposed method and the upper bound with CVX, measured in MATLAB. The results are shown in Table 1.
Remark 3. Note that resorting to simulation cannot serve as a rigorous measurement of the complexity analysis. Nevertheless, the large difference in the running time fairly indicates the much higher efficiency of the proposed decomposition method over the regular convex optimization tool.
The effect of the number of antennas on each node is also investigated here. We set the QoS requirements for all data streams as 0.2 and vary the number of antennas from 2 to 9. The results are displayed in Figure 6.
6 Conclusion
In this paper, we considered the joint design of source precoding matrix and relay precoding matrix in a twohop AF MIMO relay network. We minimized the power consumption subject to a set of predefined QoS constraints of each data stream. Using matrix calculus and majorization theory, we simplified the original matrixvalued problem to a relatively simpler scalar one and proposed two bounding problems that are convex and can be solved efficiently. We specifically designed a primary decomposition method to solve the upper bound problem that has less complexity than directly applying the interior point method. Numerical examples are provided to corroborate the proposed studies.
Appendix 1
Basics of majorization theory
Here, we briefly introduce the basics of majorization theory while the more comprehensive discussion can be found in [12].
Definition 1. If for any vector \mathrm{x}=[{x}_{1},{x}_{2},\dots ,{x}_{n}]\in {\mathbb{R}}^{n} , let
represents the elements of x in an increasing order. Similarly, assume
represents the elements of x in a decreasing order.
Definition 2. For any \mathrm{x},\mathrm{y}\in {\mathbb{R}}^{n} , x is majorized by y if
and is denoted by x≺y or y≻x.
Definition 3. [[12]] For any\mathrm{x},\mathrm{y}\in {\mathbb{R}}^{n}, x is weakly supermajorized by y if
We denote this with x ≺ ^{w} y (or equivalently y ≻ ^{w} x). Also, x is weakly submajorized by y if
We denote this with x≺ _{ w } y (or equivalently y≻ _{ w } x).
Lemma 2. (9.B.1 in [[12]]) Let M be an n×n Hermitian matrix with a vector diag{M} denoting its diagonal elements and let vector λ{M} contain its eigenvalues. Then,
where\stackrel{\u0304}{\mathit{\text{diag}}}{\left\{\mathrm{M}\right\}}_{i}=\text{mean}\left(\mathit{\text{diag}}\right\{\mathrm{M}\left\}\right).
Reversely, given vectors a and b with a≺b, then there exists an n × n Hermitian matrix M whose diagonal elements are a and eigenvalues are b.
Lemma 3. (5.A.9 in [[12]]) For any a and b satisfying a≺^{w}b, there exists a vector x such that
Appendix 2
Standard waterfilling algorithm
A standard waterfilling algorithm will be used to solve the following convex problem which appeared in our MLWF algorithm:
The waterfilling algorithm that yields the optimum x_{ i }’s is given by the following:

Input: The number of all positive eigenvalues N, all inverse eigenvalues {\left\{{a}_{i}\right\}}_{i}^{N}, and a positive feasible initial vector for ρ_{ i }.

Output: Allocated powers {\left\{{x}_{i}\right\}}_{i}^{N} and the optimum water level {μ}.

1.
Sort all of the eigenvalues in the increasing order and a _{N + 1} = ∞. Set L = N.

2.
If a _{ L } = a _{L+1}, then L = L − 1. Set μ = a _{ N }

3.
If \mu \le \frac{\sum _{i=1}^{N}{a}_{i}^{1/2}}{{(\rho (NL\left)\right)}^{2}}, then the optimum solution is \mu =\frac{\sum _{i=1}^{N}{a}_{i}^{1/2}}{{(\rho (NL\left)\right)}^{2}} and {x}_{i}={\mu}^{1/2}{a}_{i}^{1/2}1, or else, go to the next step.

4.
Set L=L−1 and μ=a _{ L } go back to step 3.
Endnotes
^{a}Dualhop relay network is of particular importance and has been the most frequently studied type in the past decades.
^{b}We assume perfect channel knowledge in this paper, while the channel estimation can be performed by the method in [24].
References
Laneman JN, Wornell GW: Distributed space time block coded protocols for exploiting cooperative diversity in wireless networks. IEEE Trans. Inform. Theory 2003, 49: 24152425. 10.1109/TIT.2003.817829
Laneman JN, Tse DNC, Wornell GW: Cooperative diversity in wireless networks: efficient protocols and outage behavior. IEEE Trans. Inform. Theory 2004, 50: 30623080. 10.1109/TIT.2004.838089
Van De Meulen EC: Threeterminal communication channels. Adv. Appl. Prob 1971, 3: 120154. 10.2307/1426331
Information transmission through a channel with relay, Technical Report B767, The Aloha System, University of Hawaii, Honolulu. 1976.
Cover T, El Gamal A: Capacity theorems for the relay channel. IEEE Trans. Inform. Theory 1979, 25: 572584. 10.1109/TIT.1979.1056084
Telatar IE: Capacity of multiantenna Gaussian channels. Eur. Trans. Telecom 1999, 10: 585595. 10.1002/ett.4460100604
Tang X, Hua Y: Optimal design of nonregenerative MIMO wireless relays. IEEE Trans. Wireless Commun 2007, 6(4):13981407.
Hammerstrom I, Wittneben A: Power allocation schemes for amplifyandforward MIMOOFDM relay links. IEEE Trans. Wireless Commun 2007, 6(8):27982802.
Yang S, Belfiore JC: Towards the optimal amplifyandforward cooperative diversity scheme. IEEE Trans. Inform. Theory 2007, 53(9):3114126.
Khoshnevis B, Yu W, Adve R: Grassmannian beamforming for MIMO amplifyandforward relaying. IEEE J. Sel. Areas Commun 2008, 26: 1397407.
Hua Y: An overview of beamforming and power allocation for MIMO relays. In Proceedings of IEEE Military Communications Conference. San Jose, CA; 2010:375380.
Marshall AW, Olkin I: Inequalities: Theory of Majorization and Its Applications. Academic, New York; 1979.
Rong Y, Tang X, Hua Y: A unified framework for optimizing linear nonregenerative multicarrier MIMO relay communication systems. IEEE Trans. Signal Process 2009, 57: 48374851.
Rong Y, Hua Y: Optimality of diagonalization of multihop MIMO relays. IEEE Trans. Wireless Commun 2009, 8: 60686077.
Palomar DP, Lagunas MA, Cioffi JM: Optimum linear joint transmitreceive processing for MIMO channels with QoS constraints. IEEE Trans. on Signal Processing 2004, 52(5):11791197. 10.1109/TSP.2004.826164
Zhang R, Chai CC, Liang Y: Joint beamforming power control for multiantenna relay broadcast channel with QoS constraints. IEEE Trans. Sig. Proc 2009, 57(2):726737.
Y Rong: Multihop nonregenerative MIMO relaysQoS considerations. IEEE Trans. Sig. Proc 2011, 59(1):290303.
Fu Y, Yang L, Zhu W, Liu C: Optimum linear design of twohop MIMO relay network with QoS requirements. IEEE Trans. Sig. Proc 2011, 59(5):22572269.
Mohammadi J, Gao F, Rong Y: Design of amplify and forward MIMO relay networks with QoS constraint. In Proceedings of the Global Telecommunications Conference (GLOBECOM 2010). Miami, USA; 2010:610.
Grant M, Boyd S: CVX: Matlab software for disciplined convex programming, version 2.0 beta. 2012.http://cvxr.com/cvx
Kay SM: Fundumentals of Statistical Signal Processing: Estimation Theory. PrenticeHall, Englewood Cliffs; 1993.
Petersen KB, Pedersen MS: The Matrix Cookbook. 2007.
Boyd S, Vandenberghe L: Convex Optimization. Cambridge University Press, New York; 2004. Available (online) at http://www.stanford.edu/~boyd/cvxbook.html
Gao F, Cui T, Nallanathan A: On channel estimation and optimal training design for amplify and forward relay network. IEEE Trans. Wireless Commun 2008, 7(5):19071916.
Acknowledgements
This work was supported in part by the National Basic Research Program of China (973 Program) under grants 2013CB336600 and 2012CB316102, the National Natural Science Foundation of China under grant 61201187, and the Tsinghua University Initiative Scientific Research Program under grant 20121088074. Jafar Mohammadi’s work was also supported by Helmholtz Research School on Security Technologies. Y. Rong’s work was supported under Australian Research Council’s Discovery Projects funding scheme (project numbers DP110100736, DP110102076).
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Mohammadi, J., Gao, F., Rong, Y. et al. Joint source and relay design for twohop amplifyandforward relay networks with QoS constraints. J Wireless Com Network 2013, 108 (2013). https://doi.org/10.1186/168714992013108
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/168714992013108