Energy efficiency analysis of one-way and two-way relay systems

Sun, Can; Yang, Chenyang

doi:10.1186/1687-1499-2012-46

Research
Open access
Published: 14 February 2012


EURASIP Journal on Wireless Communications and Networking volume 2012, Article number: 46 (2012)


Abstract

Relaying is generally regarded as a low-energy technique because a long-distance transmission is divided into several short-distance transmissions. When the power consumptions (PCs) other than that consumed by transmitting information bits are taken into account, however, relaying may not be energy efficient. In this article, we study the energy efficiencies (EEs) of one-way relay transmission (OWRT) and two-way relay transmission (TWRT) by comparing them with direct transmission (DT). We consider a system where two source nodes transmit to each other with the assistance of a half-duplex amplify-and-forward relay node. We first find the maximum EEs of DT, OWRT, and TWRT by optimizing the transmission time and the transmit powers at each node. Then we compare the maximum EEs of the three strategies, and analyze the impact of circuit PCs and data amount. Analytical and simulation results show that relaying is not always more energy efficient than DT. Moreover, TWRT is not always more energy efficient than OWRT, even though it is more spectrally efficient. The EE of TWRT is higher than those of DT and OWRT in symmetric systems where the circuit PCs at each node are identical and the numbers of bits to be transmitted in two directions are equal. In asymmetric systems, however, OWRT may provide higher EE than TWRT when the numbers of bits in two directions differ significantly.

1 Introduction

Since the explosive growth of wireless services is sharply increasing their contributions to the carbon footprint and the operating costs, energy efficiency (EE) has drawn more and more attention recently as a new design goal for various wireless communication systems [1–3], compared with spectral efficiency (SE) that has been the design focus for decades.

A widely used performance metric for EE is the number of transmitted bits per unit of energy. When only transmit power is taken into account, the EE monotonically decreases as the SE increases [4], at least for point-to-point transmission in an additive white Gaussian noise (AWGN) channel. In that case, minimizing the transmit power maximizes the EE [5]. In practical systems, however, not only the power for transmitting information bits but also various signaling and circuits contribute to the system energy consumption (EC), which fundamentally changes the relationship between the SE and EE. Specifically, when the circuit power consumption (PC) is considered, the optimization problem that minimizes the overall transmit power does not necessarily lead to an energy efficient design [2].

Relaying is viewed as an energy saving technique because it can reduce the transmit power by breaking one long range transmission into several short range transmissions [3]. In fact, relaying has been extensively studied from another viewpoint, i.e., it is able to extend the coverage, enhance the reliability as well as the capacity of wireless systems [6]. One-way relay transmission (OWRT) can reduce the one-hop communication distance and provide spatial diversity, but its SE will also reduce to 1/2 of that of direct transmission (DT) when practical half-duplex relay is applied [7]. Fortunately, two-way relay transmission (TWRT) can recover the SE loss when properly designed [8–10]. However, it is not well-understood whether these relay strategies are energy efficient, when various energy costs in addition to transmit power are considered.

Considering both the transmit power and the receiver processing power, the EE of decode-and-forward (DF) OWRT systems was studied with single-antenna and multi-antenna nodes in [11, 12], respectively. In [13], after accounting for the energy cost of acquiring channel information, relay selection for an OWRT system with multiple DF relays was optimized to maximize the EE. In [14], the EE of DF OWRT was compared with that of DT, where the result shows that OWRT is more energy efficient when the distance between source and destination is large; otherwise DT is better. In [15, 16], the EEs of OWRT and base station cooperation transmission were compared, where the overall energy costs including those from manufacture and deployment were considered. In [17], TWRT was shown to be more energy efficient than OWRT via simulations, where only transmit power was considered in the EC model. In [5], the EE of TWRT was compared with those of OWRT and DT, with optimized relay position and transmit power at each node. It shows that when the relay is placed at the midpoint of two source nodes, TWRT consumes less energy than OWRT and DT. Again, only transmit power was considered in the EC model. When we take into account the energy costs other than that contributed by the transmit power, how does relaying compare with DT? Will TWRT still be more energy efficient than OWRT?

In this article, we analyze the EEs of TWRT, OWRT, and DT by studying a simple amplify-and-forward (AF) relay system. In the literature, there are other relay protocols such as DF and compress-and-forward (CF) that provide higher rate regions than AF. However, AF is also widely considered in practice [6], and is superior to DF in outage performance for TWRT when the channel gains from two source nodes to the relay node are symmetric [18]. Moreover, the system models differ a lot among the relay protocols. In order to analyze the maximal EE, we need to find the relationship between end-to-end data rate and transmit power. With the AF protocol, we can obtain the data rate-transmit power relationship by deriving the signal-to-noise ratio (SNR) at the destination. With the DF protocol, the end-to-end data rate is quite different: it is modeled as the lower of the achievable data rates in the two hops. With CF, the case is even more complicated, since its transmission and processing procedure is usually very complex and therefore hard to analyze. Here we focus on AF relaying as a starting point, while the EEs of other relay protocols will be considered in future studies.

We consider a delay-constrained system, where B bits of message should be transmitted as a block within a duration T. This model is widely used for applications with strict delay constraints on data delivery, e.g., Voice-over-IP and sensor networks, where the message is generated periodically and must be transmitted with a hard deadline [19–21]. Note that the energy consumed by transmitting information decreases as the transmission duration increases [4], but the energy consumed by circuits increases with the duration. Therefore, in such a system we can adjust the transmission duration to reduce the overall EC as long as the transmission duration is shorter than the block length T. In other words, the system may transmit the B bits in a shorter duration than T and then switch to an idle status until the next block [21]. During the idle status, a part of the transceiver hardware can be shut down, which can be exploited to improve the EE.

Specifically, we first maximize the EEs of TWRT, OWRT, and DT by optimizing transmission time and transmit powers, respectively, for the three strategies. We then compare the optimized EEs of TWRT with those of OWRT and DT. We show that when all the three strategies operate with optimized transmission time and power, relaying is not always more energy efficient than DT. Moreover, TWRT is not always more energy efficient than OWRT if the numbers of bits to be transmitted in two directions are unequal, or the circuit PCs at each node are different.

The rest of this article is organized as follows. The system model and the ECs of the three transmit strategies are described in Sections 2 and 3, respectively. Then the EEs of different strategies are optimized in Section 4. In Section 5, the optimized EEs are compared under various circuit PCs and numbers of transmitted bits. Simulation results are given in Section 6. Section 7 concludes the article.

2 System model

Consider a system consisting of two source nodes $A$ and $B$, and an AF half-duplex relay node (RN) $R$, each equipped with a single antenna. We consider a delay-constrained system, where the information bits are generated periodically and must be transmitted in a block within a hard deadline T. In each block, nodes $A$ and $B$ intend to transmit B_ab and B_ba bits to each other, respectively, with bandwidth W. In practice, the information bits to be transmitted in each block compose a packet or a frame, depending on application scenarios. In the following, we use the term "packet size" to refer to the amount of data in each block, i.e., B_ab and B_ba.

The channels among the three nodes are assumed to be frequency-flat fading channels, denoted h_ab, h_ar, and h_br, respectively, as shown in Figure 1. We assume perfect channel knowledge at each node. The noise power N₀ is assumed to be identical at each node.

To reduce the EC, the system may not use the entire duration T for transmission in each block. After B_ab and B_ba bits have been transmitted, the nodes can operate at an idle status until the next block. In other words, each node has three modes: transmission, reception, and idle. The PCs in these modes are, respectively, denoted as P^t/ϵ + P^ct, P^cr, and P^ci, where P^t is the transmit power, ϵ ∈ (0, 1] denotes the power amplifier efficiency, and P^ct, P^cr, and P^ci are, respectively, the circuit PCs in transmission, reception, and idle modes.

The circuit PCs P^ct and P^cr consist of two parts: the power consumed by baseband processing and by radio frequency (RF) circuits. The PC of the RF circuit is usually assumed independent of data rate [6, 21], while there are different assumptions for the PC of the baseband processing circuit. In systems with low complexity baseband processing, the baseband PC can be neglected compared with the RF PC [6, 21]. Otherwise, the baseband PC is not negligible and increases with data rate [22]. In this article, we consider the first case, where P^ct and P^cr only consist of RF PC and are modeled as constants independent of data rate. Modeling P^ct and P^cr as functions of data rate leads to a different optimization problem, which will be considered in our future study.

The PC in idle mode P^ci is modeled as a constant, with P^ci ≤ P^ct and P^ci ≤ P^cr. Subscripts (·)_a, (·)_b, and (·)_r will be used to denote the PCs at different nodes.

3 Energy consumptions of three transmit strategies

We consider three transmit strategies, DT, OWRT, and TWRT, to complete the bidirectional communication between the two source nodes. In the following, we respectively introduce their ECs.

3.1 Direct transmission

In DT, nodes $A$ and $B$ transmit to each other without the assistance of RN. The transmission procedure is shown in Figure 2a. During each block, the system first allocates a duration T_ab for the transmission from node $A$ to $B$, where node $A$ is in transmit mode and node $B$ is in receive mode. Then the system allocates a duration T_ba for the transmission from node $B$ to $A$, where node $A$ is in receive mode and node $B$ is in transmit mode. After the B_ab and B_ba bits are transmitted, the system turns into idle status during T - T_ab - T_ba, where both nodes $A$ and $B$ are in idle mode. The EC of DT can be obtained as

\begin{align} E_{D} & = T_{a b} (P_{a}^{t} / ε + P_{a}^{c t} + P_{b}^{c r}) + T_{b a} (P_{b}^{t} / ε + P_{b}^{c t} + P_{a}^{c r}) \\ & \quad + (T - T_{a b} - T_{b a}) (P_{a}^{c i} + P_{b}^{c i}) \\ & = T_{a b} (P_{a}^{t} / ε + P_{D}^{c 1} - P_{D}^{c i}) + T_{b a} (P_{b}^{t} / ε + P_{D}^{c 2} - P_{D}^{c i}) + T P_{D}^{c i}, \end{align}

(1)

where $P_{D}^{c 1} ≜ P_{a}^{c t} + P_{b}^{c r}$ and $P_{D}^{c 2} ≜ P_{b}^{c t} + P_{a}^{c r}$ are, respectively, the total circuit PCs in $A \to B$ and $B \to A$ transmission, and $P_{D}^{c i} ≜ P_{a}^{c i} + P_{b}^{c i}$ is the total circuit PC in idle duration.

Given T_ab and T_ba, nodes $A$ and $B$ should, respectively, transmit with data rates of B_ab/T_ab and B_ba/T_ba bits-per-second (bps) to exchange the B_ab and B_ba bits of message, which are given by the Shannon capacity formula as

\frac{B_{a b}}{T_{a b}} = W {log}_{2} (1 + \frac{P_{a}^{t} {|h_{a b}|}^{2}}{N_{0}}), \frac{B_{b a}}{T_{b a}} = W {log}_{2} (1 + \frac{P_{b}^{t} {|h_{a b}|}^{2}}{N_{0}}) .

(2)

Since Shannon capacity formula represents the maximum achievable data rates under given transmit powers, the transmit power derived via this formula is the minimum transmit power that can support the required data rates. As a result, we can analyze the maximal EE for a given SE. We will also use the Shannon capacity formula to represent the relationship between data rates and transmit powers in OWRT and TWRT cases later.

3.2 One-way relay transmission

In OWRT, each of the $A \to B$ and $B \to A$ transmission is divided into two hops, thus the bidirectional transmission needs four phases, as shown in Figure 2b. For example, in $A \to B$ transmission, node $A$ transmits to RN in the first phase, and RN transmits to node $B$ in the second phase. With the AF relay protocol, the two phases in each direction employ identical time duration. For simplifying the analysis, we do not consider the direct link in OWRT. Although this will degrade the performance of OWRT, we will show later that it does not affect our comparison results for the EE.

The system allocates a duration T_ab for $A \to B$ transmission. During the first half of T_ab, node $A$ transmits to RN, and thus node $A$ is in transmit mode, node $R$ is in receive mode, and node $B$ is idle. During the second half of T_ab, RN forwards the information to node $B$, and thus node $R$ is in transmit mode, node $B$ is in receive mode, and node $A$ is idle. Then, the system allocates a duration T_ba for $B \to A$ transmission. Finally, the system turns into idle status during T - T_ab - T_ba after the bidirectional transmission. The EC of OWRT can be obtained as

\begin{align} E_{O} & = \frac{T_{a b}}{2} (P_{a}^{t} / ε + P_{a}^{c t} + P_{r}^{c r} + P_{b}^{c i} + P_{r 1}^{t} / ε + P_{r}^{c t} + P_{b}^{c r} + P_{a}^{c i}) \\ + \frac{T_{b a}}{2} (P_{b}^{t} / ε + P_{b}^{c t} + P_{r}^{c r} + P_{a}^{c i} + P_{r 2}^{t} / ε + P_{r}^{c t} + P_{a}^{c r} + P_{b}^{c i}) \\ + (T - T_{a b} - T_{b a}) (P_{a}^{c i} + P_{b}^{c i} + P_{r}^{c i}) \\ = T_{a b} (\frac{P_{a}^{t} + P_{r 1}^{t}}{2 ε} + P_{O}^{c 1} - P_{O}^{c i}) + T_{b a} (\frac{P_{b}^{t} + P_{r 2}^{t}}{2 ε} + P_{O}^{c 2} - P_{O}^{c i}) + T P_{O}^{c i}, \end{align}

(3)

where $P_{r 1}^{t}$ and $P_{r 2}^{t}$ are, respectively, the relay transmit powers in $A \to B$ and $B \to A$ links, $P_{O}^{c 1} ≜ (P_{a}^{c t} + P_{r}^{c r} + P_{b}^{c i} + P_{r}^{c t} + P_{b}^{c r} + P_{a}^{c i}) / 2$ and $P_{O}^{c 2} ≜ (P_{b}^{c t} + P_{r}^{c r} + P_{a}^{c i} + P_{r}^{c t} + P_{a}^{c r} + P_{b}^{c i}) / 2$ are, respectively, the overall circuit PCs in $A \to B$ and $B \to A$ transmission, and $P_{O}^{c i} ≜ P_{a}^{c i} + P_{b}^{c i} + P_{r}^{c i}$ is the overall circuit PC in idle duration where all three nodes operate in idle mode.

The required bidirectional data rates can be obtained from the capacity formula and the expression of SNR for OWRT derived in [23], which are respectively,

\frac{B_{a b}}{T_{a b}} = \frac{W}{2} {log}_{2} (1 + \frac{P_{a}^{t} P_{r 1}^{t} {|h_{a r}|}^{2} {|h_{b r}|}^{2}}{{|h_{a r}|}^{2} P_{a}^{t} N_{0} + {|h_{b r}|}^{2} P_{r 1}^{t} N_{0} + N_{0}^{2}}),

(4)

\frac{B_{b a}}{T_{b a}} = \frac{W}{2} {log}_{2} (1 + \frac{P_{b}^{t} P_{r 2}^{t} {|h_{b r}|}^{2} {|h_{a r}|}^{2}}{{|h_{b r}|}^{2} P_{b}^{t} N_{0} + {|h_{a r}|}^{2} P_{r 2}^{t} N_{0} + N_{0}^{2}}),

(5)

where the factor 1/ 2 is due to the two-phase transmission in each direction.
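As a minimal numerical sketch of (4) and (5), the following Python functions evaluate the end-to-end AF SNR and the resulting OWRT rate. The function and parameter names (`af_one_way_snr`, `owrt_rate`, `g_sr`, `g_rd`) are illustrative choices rather than notation from the article, with `g_sr` and `g_rd` standing for the squared channel magnitudes of the source-to-relay and relay-to-destination hops. The numbers in the example are arbitrary.

```python
import math

def af_one_way_snr(p_src, p_relay, g_sr, g_rd, n0):
    """End-to-end SNR of a two-hop AF link: the fraction inside the log of (4)/(5)."""
    num = p_src * p_relay * g_sr * g_rd
    den = g_sr * p_src * n0 + g_rd * p_relay * n0 + n0 ** 2
    return num / den

def owrt_rate(p_src, p_relay, g_sr, g_rd, n0, bandwidth):
    """Rate in bit/s; the factor 1/2 reflects the two phases per direction."""
    snr = af_one_way_snr(p_src, p_relay, g_sr, g_rd, n0)
    return 0.5 * bandwidth * math.log2(1.0 + snr)

# Arbitrary illustrative values (not taken from the article).
example_rate = owrt_rate(p_src=0.1, p_relay=0.1, g_sr=1e-9, g_rd=1e-9,
                         n0=1e-13, bandwidth=1e6)
print(f"OWRT rate: {example_rate / 1e6:.3f} Mbit/s")
```

For the A to B direction one would pass `g_sr = |h_ar|^2` and `g_rd = |h_br|^2`; for the B to A direction the two gains swap roles, which is the only difference between (4) and (5).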

3.3 Two-way relay transmission

In TWRT, the bidirectional transmission is completed in two phases, as shown in Figure 2c. In the first phase, both nodes $A$ and $B$ transmit to RN, where nodes $A$ and $B$ are in transmit mode and node $R$ is in receive mode. In the second phase, RN broadcasts its received signal to nodes $A$ and $B$, where node $R$ is in transmit mode, and nodes $A$ and $B$ are in receive mode. After receiving the superimposed signal, each of the source nodes $A$ and $B$ removes its own transmitted signal via self-interference cancelation [8], and obtains its desired signal sent from the other source node. The two phases employ identical durations as in OWRT.

The system allocates duration T_TWR to the bidirectional transmission, and then turns into idle status during T - T_TWR. The EC of TWRT is obtained as

\begin{align} E_{T} & = \frac{T_{TWR}}{2} (P_{a}^{t} / ε + P_{b}^{t} / ε + P_{a}^{c t} + P_{b}^{c t} + P_{r}^{c r}) + \frac{T_{TWR}}{2} (P_{r}^{t} / ε + P_{r}^{c t} + P_{a}^{c r} + P_{b}^{c r}) \\ + (T - T_{TWR}) (P_{a}^{c i} + P_{b}^{c i} + P_{r}^{c i}) \\ = T_{TWR} (\frac{P_{a}^{t} + P_{b}^{t} + P_{r}^{t}}{2 ε} + P_{T}^{c} - P_{T}^{c i}) + T P_{T}^{c i}, \end{align}

(6)

where $P_{T}^{c} ≜ (P_{a}^{c t} + P_{b}^{c t} + P_{r}^{c r} + P_{r}^{c t} + P_{a}^{c r} + P_{b}^{c r}) / 2$ and $P_{T}^{c i} ≜ P_{a}^{c i} + P_{b}^{c i} + P_{r}^{c i}$ are the overall circuit PCs in the bidirectional transmission duration and the idle duration, respectively.

The required bidirectional data rates can be obtained from the capacity formula and the SNR expression of TWRT derived in [23], which are respectively,

\frac{B_{a b}}{T_{TWR}} = \frac{W}{2} {log}_{2} (1 + \frac{P_{a}^{t} P_{r}^{t} {|h_{a r}|}^{2} {|h_{b r}|}^{2}}{{|h_{a r}|}^{2} P_{a}^{t} N_{0} + {|h_{b r}|}^{2} P_{b}^{t} N_{0} + {|h_{b r}|}^{2} P_{r}^{t} N_{0} + N_{0}^{2}}),

(7)

\frac{B_{b a}}{T_{TWR}} = \frac{W}{2} {log}_{2} (1 + \frac{P_{b}^{t} P_{r}^{t} {|h_{b r}|}^{2} {|h_{a r}|}^{2}}{{|h_{a r}|}^{2} P_{a}^{t} N_{0} + {|h_{b r}|}^{2} P_{b}^{t} N_{0} + {|h_{a r}|}^{2} P_{r}^{t} N_{0} + N_{0}^{2}}),

(8)

where the factor 1/ 2 is due to the two-phase transmission.

4 Energy efficiency optimization for three transmit strategies

In this section, we optimize the EEs for DT, OWRT, and TWRT. The EE is defined as the number of bits transmitted in two directions per unit of energy, i.e.,

η_{EE} = \frac{B_{a b} + B_{b a}}{E},

(9)

where E is the EC per block, which equals E_D, E_O, or E_T for DT, OWRT, or TWRT, respectively.

To guarantee a fair comparison, we maximize the EEs of DT, OWRT, and TWRT with the same packet sizes B_ab and B_ba. From the definition of η_EE, we see that EE maximization is equivalent to EC minimization for a given pair of B_ab and B_ba. Consequently, we will minimize the EC per block for different strategies by optimizing transmission time and power of each node.

We consider that the transmission time should not exceed the duration of a block T, and the transmit power of each node should not exceed the maximum transmit power $P_{max}^{t}$. Note that the system may not be able to transmit B_ab and B_ba bits within the duration T even if the maximum transmit power is used. In this case an outage occurs. Since we assume perfect channel knowledge at each node, the nodes can estimate the transmit power and the transmission time required for each block, which depend on the channel distribution and packet sizes B_ab and B_ba. Once the channel statistics and the packet sizes are given, the outage probability is fixed. In practice, the packet sizes B_ab and B_ba can be pre-determined according to the quality of service (QoS) requirements, channel environment, and the acceptable outage probability. We will use Monte-Carlo simulation to find the maximal B_ab and B_ba that keep the outage probability below a threshold, e.g., 10%. Then, we only need to consider the EE optimization when the packet sizes are smaller than the maximum B_ab and B_ba.

4.1 Direct transmission

As shown in (1), the EC of DT is a function of the transmit powers $P_{a}^{t}$ and $P_{b}^{t}$ as well as the transmission time T_ab and T_ba. The EC can be minimized by jointly optimizing the transmit powers and transmission time as follows,

\begin{matrix} min_{T_{a b}, T_{b a}, P_{a}^{t}, P_{b}^{t}} & T_{a b} (\frac{P_{a}^{t}}{ε} + P_{D}^{c 1} - P_{D}^{c i}) + T_{b a} (\frac{P_{b}^{t}}{ε} + P_{D}^{c 2} - P_{D}^{c i}) + T P_{D}^{c i} \\ s . t . & T_{a b} + T_{b a} \leq T, P_{a}^{t} \leq P_{max}^{t}, P_{b}^{t} \leq P_{max}^{t} . \end{matrix}

(10)

To solve this joint optimization problem, we first express the transmit powers $P_{a}^{t}$ and $P_{b}^{t}$ as functions of the transmission time T_ab and T_ba by using (2), which are respectively,

P_{a}^{t} = \frac{N_{0}}{{|h_{a b}|}^{2}} (2^{\frac{B_{a b}}{W T_{a b}}} - 1), P_{b}^{t} = \frac{N_{0}}{{|h_{a b}|}^{2}} (2^{\frac{B_{b a}}{W T_{b a}}} - 1) .

(11)

By substituting (11) into both the objective function and the constraints of (10), the problem (10) can be reformulated as follows,

\begin{matrix} min_{T_{a b}, T_{b a}} & T_{a b} [\frac{N_{0} (2^{\frac{B_{a b}}{W T_{a b}}} - 1)}{ε {|h_{a b}|}^{2}} + P_{D}^{c 1} - P_{D}^{c i}] + T_{b a} [\frac{N_{0} (2^{\frac{B_{b a}}{W T_{b a}}} - 1)}{ε {|h_{a b}|}^{2}} + P_{D}^{c 2} - P_{D}^{c i}] + T P_{D}^{c i} \\ s.t. & T_{a b} + T_{b a} \leq T, T_{a b} \geq T_{min 1}, T_{b a} \geq T_{min 2}, \end{matrix}

(12)

where

T_{min 1} = \frac{B_{a b}}{W {log}_{2} (1 + \frac{P_{max}^{t} {|h_{a b}|}^{2}}{N_{0}})}, T_{min 2} = \frac{B_{b a}}{W {log}_{2} (1 + \frac{P_{max}^{t} {|h_{a b}|}^{2}}{N_{0}})} .

(13)

The minimum value constraints on T_ab and T_ba are due to the transmit power constraints, without which the data rates B_ab/T_ab and B_ba/T_ba will be too high to be supported even with the maximal transmit powers.

Note that the problem in (12) is equivalent to the joint optimization problem in (10), where now only the transmission time needs to be optimized. In the objective function of the problem in (12), the first term is a function of T_ab and not related to T_ba. It is easy to show that its second order derivative with respect to T_ab is positive. Thus it is a convex function of T_ab. Similarly, the second term in the objective function is a convex function of T_ba. The last term is independent of the transmission time. Therefore, the objective function is convex with respect to T_ab and T_ba. All the constraints in (12) are also convex.^a Then the problem can be solved by using efficient convex optimization techniques, such as the gradient descent algorithm [24].
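The following Python sketch illustrates one way problem (12) could be solved numerically. It is not the authors' implementation: instead of gradient descent it uses a simple ternary search, which suffices because each term is a convex function of a single variable. If the two independently optimized durations violate T_ab + T_ba ≤ T, the search is repeated along the line T_ab + T_ba = T, where the objective is still convex. All function names and the numerical values are assumptions made for illustration; `gain` stands for |h_ab|².

```python
import math

def ternary_min(f, lo, hi, iters=200):
    """Minimize a unimodal (here: convex) scalar function on [lo, hi]."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) <= f(m2):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)

def dt_term(t, bits, gain, n0, w, eps, p_circ):
    """One bracketed term of (12): t * (P^t/eps + circuit PC above idle)."""
    p_tx = n0 / gain * (2.0 ** (bits / (w * t)) - 1.0)  # transmit power from (11)
    return t * (p_tx / eps + p_circ)

def optimize_dt(b_ab, b_ba, gain, n0, w, eps, pc1, pc2, pci, t_block, p_max):
    """Return (T_ab, T_ba, E_D) minimizing (12), or None if an outage occurs."""
    cap = w * math.log2(1.0 + p_max * gain / n0)
    t_min1, t_min2 = b_ab / cap, b_ba / cap  # lower bounds from (13)
    if t_min1 + t_min2 > t_block:
        return None  # bits cannot be delivered within the block
    f1 = lambda t: dt_term(t, b_ab, gain, n0, w, eps, pc1 - pci)
    f2 = lambda t: dt_term(t, b_ba, gain, n0, w, eps, pc2 - pci)
    t_ab = ternary_min(f1, t_min1, t_block)
    t_ba = ternary_min(f2, t_min2, t_block)
    if t_ab + t_ba > t_block:
        # Sum constraint is active: search along T_ab + T_ba = T.
        t_ab = ternary_min(lambda t: f1(t) + f2(t_block - t),
                           t_min1, t_block - t_min2)
        t_ba = t_block - t_ab
    return t_ab, t_ba, f1(t_ab) + f2(t_ba) + t_block * pci

# Arbitrary illustrative values (not from the article).
print(optimize_dt(b_ab=1e5, b_ba=1e5, gain=1e-10, n0=1e-13, w=1e6, eps=0.35,
                  pc1=0.2, pc2=0.2, pci=0.01, t_block=0.1, p_max=1.0))
```

With these example numbers the optimal durations are well below the block length, which reflects the trade-off described earlier: longer transmission lowers the transmit energy but raises the circuit energy.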

4.2 One-way relay transmission

Similar to the DT case, we first express the transmit powers as functions of the transmission time using (4) and (5). Then the joint optimization of transmit power and transmission time can be solved with two steps: first find the optimal transmit powers as functions of the transmission time, then optimize the transmission time to minimize the EC.

For a given T_ab, both $P_{a}^{t}$ and $P_{r 1}^{t}$ can be obtained from (4), where multiple feasible solutions exist. In order to minimize the EC, we find the transmit powers that minimize the sum power as follows,

\begin{matrix} min_{P_{a}^{t}, P_{r 1}^{t}} & P_{a}^{t} + P_{r 1}^{t} \\ s.t. & P_{a}^{t} \leq P_{max}^{t}, P_{r 1}^{t} \leq P_{max}^{t}, and (4) . \end{matrix}

(14)

To ensure that all the constraints in (14) can be satisfied, the data rate B_ab/T_ab should not exceed the maximum data rate supported by the maximum transmit power. This turns into a minimum value constraint for the transmission time, which is

T_{a b} \geq B_{a b} / [\frac{W}{2} {log}_{2} (1 + \frac{{(P_{max}^{t})}^{2} {|h_{a r}|}^{2} {|h_{b r}|}^{2}}{{|h_{a r}|}^{2} P_{max}^{t} N_{0} + {|h_{b r}|}^{2} P_{max}^{t} N_{0} + N_{0}^{2}})] ≜ T_{min 1} .

(15)

Denote the minimum value of $P_{a}^{t} + P_{r 1}^{t}$ as P_min1(T_ab), where T_ab ≥ T_min1. It can be derived as a piecewise function as follows (see Appendix 1),

P_{min 1} (T_{a b}) = \{\begin{matrix} \frac{C_{1} {|h_{b r}|}^{2} P_{max}^{t} N_{0} + C_{1} N_{0}^{2}}{({|h_{a r}|}^{2} {|h_{b r}|}^{2} P_{max}^{t} - C_{1} {|h_{a r}|}^{2} N_{0})} + P_{max}^{t}, & T_{min 1} \leq T_{a b} < T_{d 1} \\ C_{1} N_{0} (\frac{1}{{|h_{b r}|}^{2}} + \frac{1}{{|h_{a r}|}^{2}}) + \frac{2 \sqrt{C_{1}^{2} + C_{1}} N_{0}}{|h_{a r} h_{b r}|}, & T_{a b} \geq T_{d 1} \end{matrix}

(16)

or,

P_{min 1} (T_{a b}) = \{\begin{matrix} P_{max}^{t} + \frac{C_{1} {|h_{a r}|}^{2} P_{max}^{t} N_{0} + C_{1} N_{0}^{2}}{({|h_{a r}|}^{2} {|h_{b r}|}^{2} P_{max}^{t} - C_{1} {|h_{b r}|}^{2} N_{0})}, & T_{min 1} \leq T_{a b} < T_{d 2} \\ C_{1} N_{0} (\frac{1}{{|h_{b r}|}^{2}} + \frac{1}{{|h_{a r}|}^{2}}) + \frac{2 \sqrt{C_{1}^{2} + C_{1}} N_{0}}{|h_{a r} h_{b r}|}, & T_{a b} \geq T_{d 2} \end{matrix}

(17)

where $C_{1} ≜ 2^{2 B_{a b} / (T_{a b} W)} - 1$, and the demarcation points T_{d 1} and T_{d 2} are defined in Appendix 1. If T_{d 1} ≥ T_{d 2}, P_min1(T_ab) follows (16); otherwise, it follows (17).

The piecewise function can be explained as follows. When T_ab is large, the data rate is low and both $P_{a}^{t}$ and $P_{r 1}^{t}$ are below their maximum value, so P_min1(T_ab) follows the second part in (16) or (17). As T_ab decreases, one of $P_{a}^{t}$ and $P_{r 1}^{t}$ will reach its maximum value. When T_ab = T_{d 1}, we have $P_{r 1}^{t} = P_{max}^{t}$, and when T_ab = T_{d 2}, $P_{a}^{t} = P_{max}^{t}$. If T_{d 1} ≥ T_{d 2}, $P_{r 1}^{t}$ reaches its maximum value first, and P_min1(T_ab) follows the first part in (16). Otherwise, $P_{a}^{t}$ reaches its maximum value first, and P_min1(T_ab) follows the first part in (17). When T_ab decreases to T_min1, both $P_{a}^{t}$ and $P_{r 1}^{t}$ reach the maximum value. For simplicity, we refer to the first part in (16) or (17) as the "one-max" interval, because one of the nodes uses its maximum transmit power. We refer to the second part in (16) or (17) as the "non-max" interval, since neither of the nodes uses its maximum transmit power.
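The piecewise behavior of P_min1(T_ab) can be sketched in Python as below. This is an illustrative reading of (16) and (17), not code from the article. Because the demarcation points T_{d 1} and T_{d 2} are defined in Appendix 1, the sketch avoids them: it first computes the individual powers of the non-max solution (these per-node expressions are derived here, under the assumption that they split the second branch of (16) as written in the comments) and then clamps whichever node exceeds the limit, solving (4) for the other. The names `owrt_min_sum_power`, `g_ar`, and `g_br` are hypothetical, with `g_ar = |h_ar|^2` and `g_br = |h_br|^2`.

```python
import math

def owrt_min_sum_power(t_ab, bits, w, g_ar, g_br, n0, p_max):
    """Minimum P_a^t + P_r1^t satisfying (4) for duration t_ab; None if infeasible."""
    c1 = 2.0 ** (2.0 * bits / (t_ab * w)) - 1.0
    root = math.sqrt((c1 * c1 + c1) / (g_ar * g_br))
    # Non-max candidate (assumed split of the second branch of (16)/(17)):
    # p_a + p_r = c1*n0*(1/g_ar + 1/g_br) + 2*n0*sqrt(c1^2 + c1)/|h_ar h_br|
    p_a = n0 * (c1 / g_ar + root)
    p_r = n0 * (c1 / g_br + root)
    if p_a <= p_max and p_r <= p_max:
        return p_a + p_r                      # "non-max" interval
    if p_a > p_max and p_r > p_max:
        return None                           # rate too high for the power limit
    if p_r > p_max:
        # Relay at maximum power; source power solved from (4): first branch of (16).
        den = g_ar * g_br * p_max - c1 * g_ar * n0
        other = c1 * (g_br * p_max * n0 + n0 ** 2) / den if den > 0 else math.inf
    else:
        # Source at maximum power; relay power solved from (4): first branch of (17).
        den = g_ar * g_br * p_max - c1 * g_br * n0
        other = c1 * (g_ar * p_max * n0 + n0 ** 2) / den if den > 0 else math.inf
    return p_max + other if other <= p_max else None  # "one-max" interval

# Normalized toy numbers chosen so that C_1 = 3 (not from the article).
print(owrt_min_sum_power(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 10.0))    # non-max branch
print(owrt_min_sum_power(1.0, 1.0, 1.0, 1.0, 0.25, 1.0, 17.0))   # one-max branch
```

Returning `None` corresponds to T_ab < T_min1, where the required rate cannot be supported even when both nodes transmit at maximum power.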

For a given T_ba, we can also find the values of $P_{b}^{t}$ and $P_{r 2}^{t}$ that minimize their sum. Following an analogous procedure, the minimum value of $P_{b}^{t} + P_{r 2}^{t}$, denoted as P_min2(T_ba), can be derived as a piecewise function of the transmission time T_ba as follows,

P_{min 2} (T_{b a}) = \{\begin{matrix} \frac{C_{2} {|h_{a r}|}^{2} P_{max}^{t} N_{0} + C_{2} N_{0}^{2}}{({|h_{a r}|}^{2} {|h_{b r}|}^{2} P_{max}^{t} - C_{2} {|h_{b r}|}^{2} N_{0})} + P_{max}^{t}, & T_{min 2} \leq T_{b a} < T_{d 3} \\ C_{2} N_{0} (\frac{1}{{|h_{b r}|}^{2}} + \frac{1}{{|h_{a r}|}^{2}}) + \frac{2 \sqrt{C_{2}^{2} + C_{2}} N_{0}}{|h_{a r} h_{b r}|}, & T_{b a} \geq T_{d 3} \end{matrix}

(18)

or,

P_{min 2} (T_{b a}) = \{\begin{matrix} P_{max}^{t} + \frac{C_{2} {|h_{b r}|}^{2} P_{max}^{t} N_{0} + C_{2} N_{0}^{2}}{({|h_{a r}|}^{2} {|h_{b r}|}^{2} P_{max}^{t} - C_{2} {|h_{a r}|}^{2} N_{0})}, & T_{min 2} \leq T_{b a} < T_{d 4} \\ C_{2} N_{0} (\frac{1}{{|h_{b r}|}^{2}} + \frac{1}{{|h_{a r}|}^{2}}) + \frac{2 \sqrt{C_{2}^{2} + C_{2}} N_{0}}{|h_{a r} h_{b r}|}, & T_{b a} \geq T_{d 4} \end{matrix}

(19)

where $C_{2} ≜ 2^{2 B_{b a} / (T_{b a} W)} - 1$, and the demarcation points T_{d 3} and T_{d 4} can be derived in the same way as T_{d 1} and T_{d 2} in P_min1(T_ab). If T_{d 3} ≥ T_{d 4}, P_min2(T_ba) follows (18); otherwise, it follows (19). The minimum value constraint for T_ba, i.e., T_ba ≥ T_min2, is also due to the maximum transmit power constraint, like that for T_ab in (15), and T_min2 can be derived in the same way as T_min1.

Then the optimization problem that minimizes the EC can be formulated as follows,

\begin{matrix} min_{T_{a b}, T_{b a}} & T_{a b} (\frac{P_{min 1} (T_{a b})}{2 ε} + P_{O}^{c 1} - P_{O}^{c i}) + T_{b a} (\frac{P_{min 2} (T_{b a})}{2 ε} + P_{O}^{c 2} - P_{O}^{c i}) + T P_{O}^{c i} \\ s.t. & T_{a b} + T_{b a} \leq T, T_{a b} \geq T_{min 1}, T_{b a} \geq T_{min 2} . \end{matrix}

(20)

We can show that the first term in the objective function is a quasi-convex function of T_ab (see Appendix 2). Similarly, the second term is a quasi-convex function of T_ba. The last term is a constant. However, the sum of two quasi-convex functions may not be quasi-convex. Therefore, we solve this problem using the following approach.

First, we assume that the optimal solution for (20) satisfies $T_{a b}^{opt} + T_{b a}^{opt} < T$. In this case, the first constraint in (20) can be omitted. Since the second constraint is only related to T_ab, and the last constraint is only related to T_ba, the joint optimization problem can be decoupled into two subproblems, i.e., optimizing T_ab to minimize the first term in the objective function with the constraint T_ab ≥ T_min1, and optimizing T_ba to minimize the second term in the objective function with the constraint T_ba ≥ T_min2. Because the first two terms in the objective function are, respectively, quasi-convex functions of T_ab and T_ba, both subproblems can be solved via quasi-convex optimization techniques such as the bisection algorithm [24].

If the optimized T_ab and T_ba from the two subproblems satisfy $T_{a b}^{opt} + T_{b a}^{opt} < T$, then our assumption holds, and we obtain the optimal transmission time. Otherwise, the optimal solution for (20) must satisfy $T_{a b}^{opt} + T_{b a}^{opt} = T$. In this case, we only need to find the optimal $T_{a b}^{opt}$ by a scalar search, and the optimal $T_{b a}^{opt}$ can be obtained as $T_{b a}^{opt} = T - T_{a b}^{opt}$.
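The two-step procedure above can be sketched generically in Python. This is only an outline under stated assumptions: `f1` and `f2` are hypothetical callables standing for the first and second terms of the objective in (20), each assumed unimodal without flat plateaus so that a ternary search can stand in for the bisection algorithm mentioned in the text, and the boundary case uses a plain grid search as the scalar search because the sum of two quasi-convex functions need not be unimodal.

```python
import math

def ternary_min(f, lo, hi, iters=200):
    """Minimize a unimodal scalar function on [lo, hi] (assumes no flat plateaus)."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) <= f(m2):
            hi = m2
        else:
            lo = m1
    return 0.5 * (lo + hi)

def solve_two_direction(f1, f2, t_min1, t_min2, t_block, grid=10001):
    """Return (T_ab, T_ba) for a problem shaped like (20), or None on outage."""
    if t_min1 + t_min2 > t_block:
        return None
    # Step 1: drop the sum constraint and solve the two decoupled subproblems.
    t1 = ternary_min(f1, t_min1, t_block - t_min2)
    t2 = ternary_min(f2, t_min2, t_block - t_min1)
    if t1 + t2 < t_block:
        return t1, t2
    # Step 2: the sum constraint is active, so T_ba = T - T_ab; scan T_ab on a grid.
    span = (t_block - t_min2) - t_min1
    best_t, best_val = t_min1, math.inf
    for k in range(grid):
        t = t_min1 + span * k / (grid - 1)
        val = f1(t) + f2(t_block - t)
        if val < best_val:
            best_t, best_val = t, val
    return best_t, t_block - best_t

# Toy unimodal terms used only to exercise the procedure (not the article's terms).
toy1 = lambda t: (t - 0.3) ** 2
toy2 = lambda t: (t - 0.2) ** 2
print(solve_two_direction(toy1, toy2, 0.05, 0.05, 1.0))   # decoupled case
print(solve_two_direction(toy1, toy2, 0.05, 0.05, 0.4))   # boundary case
```

In an actual evaluation, `f1` and `f2` would wrap P_min1 and P_min2 together with the circuit PC terms of (20); the grid resolution trades accuracy against run time.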

4.3 Two-way relay transmission

Analogous to the previous sections, we first derive the transmit powers as functions of the transmission time.

For a given T_TWR, we can find $P_{a}^{t}, P_{b}^{t}$ , and $P_{r}^{t}$ from (7) and (8), where multiple feasible solutions exist. To minimize the EC, again we find $P_{a}^{t}, P_{b}^{t}$ , and $P_{r}^{t}$ that minimize their summation from the following problem,

\begin{matrix} min_{P_{a}^{t}, P_{b}^{t}, P_{r}^{t}} & P_{a}^{t} + P_{b}^{t} + P_{r}^{t} \\ s . t . & P_{a}^{t} \leq P_{max}^{t}, P_{b}^{t} \leq P_{max}^{t}, P_{r}^{t} \leq P_{max}^{t}, (7) and (8). \end{matrix}

(21)

Following a similar derivation as in the case of OWRT, the minimum value of $P_{a}^{t} + P_{b}^{t} + P_{r}^{t}$ can be obtained as a piecewise function of the transmission time T_TWR, which is denoted as P_min(T_TWR).

When T_TWR is large, the data rates B_ab/T_TWR and B_ba/T_TWR are low, and all transmit powers are below their maximum values. The optimal transmit powers are derived with a method similar to that in Appendix 1 as follows,

P_{a}^{t - opt} = \frac{C_{1} N_{0}}{{|h_{a r}|}^{2}} + \frac{N_{0} (C_{1}^{2} + C_{1} + C_{1} C_{2})}{|h_{a r} h_{b r}| \sqrt{(C_{1} + C_{2}) (C_{1} + C_{2} + 1)}},

(22a)

P_{b}^{t - opt} = \frac{C_{2} N_{0}}{{|h_{b r}|}^{2}} + \frac{N_{0} (C_{2}^{2} + C_{2} + C_{1} C_{2})}{|h_{a r} h_{b r}| \sqrt{(C_{1} + C_{2}) (C_{1} + C_{2} + 1)}},

(22b)

P_{r}^{t - opt} = \frac{C_{1} N_{0}}{{|h_{b r}|}^{2}} + \frac{C_{2} N_{0}}{{|h_{a r}|}^{2}} + \frac{N_{0} \sqrt{(C_{1} + C_{2}) (C_{1} + C_{2} + 1)}}{|h_{a r} h_{b r}|} .

(22c)

where $C_{1} ≜ 2^{\frac{2 B_{a b}}{W T_{TWR}}} - 1$ and $C_{2} ≜ 2^{\frac{2 B_{b a}}{W T_{TWR}}} - 1$ . The corresponding P_min(T_TWR) is the sum of (22a), (22b), and (22c).
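As a quick consistency check, the Python sketch below evaluates the non-max powers (22a) to (22c) and substitutes them back into the SNR expressions inside (7) and (8); the resulting SNRs should equal C_1 and C_2. The function names and the normalized test values are illustrative assumptions, with `g_ar = |h_ar|^2` and `g_br = |h_br|^2`, and at least one of the packet sizes is assumed to be positive.

```python
import math

def twrt_nonmax_powers(t_twr, b_ab, b_ba, w, g_ar, g_br, n0):
    """Transmit powers (22a), (22b), (22c) when no power limit is active."""
    c1 = 2.0 ** (2.0 * b_ab / (w * t_twr)) - 1.0
    c2 = 2.0 ** (2.0 * b_ba / (w * t_twr)) - 1.0
    s = math.sqrt((c1 + c2) * (c1 + c2 + 1.0))
    h = math.sqrt(g_ar * g_br)  # |h_ar h_br|
    p_a = c1 * n0 / g_ar + n0 * (c1 * c1 + c1 + c1 * c2) / (h * s)
    p_b = c2 * n0 / g_br + n0 * (c2 * c2 + c2 + c1 * c2) / (h * s)
    p_r = c1 * n0 / g_br + c2 * n0 / g_ar + n0 * s / h
    return p_a, p_b, p_r

def twrt_snrs(p_a, p_b, p_r, g_ar, g_br, n0):
    """SNRs inside the logs of (7) (A to B) and (8) (B to A)."""
    common = g_ar * p_a * n0 + g_br * p_b * n0 + n0 ** 2
    snr_ab = p_a * p_r * g_ar * g_br / (common + g_br * p_r * n0)
    snr_ba = p_b * p_r * g_ar * g_br / (common + g_ar * p_r * n0)
    return snr_ab, snr_ba

# Normalized toy values giving C_1 = 3 and C_2 = 1 (not from the article).
powers = twrt_nonmax_powers(t_twr=1.0, b_ab=1.0, b_ba=0.5, w=1.0,
                            g_ar=2.0, g_br=0.5, n0=1.0)
print(powers, twrt_snrs(*powers, g_ar=2.0, g_br=0.5, n0=1.0))
```

Setting `b_ba` to zero makes `p_b` vanish and reduces `p_a + p_r` to the non-max branch of the OWRT expression, which is a useful sanity check on the formulas.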

When T_TWR decreases, the data rates increase, so $P_{a}^{t - opt}, P_{b}^{t - opt}$, and $P_{r}^{t - opt}$ increase until one of them reaches the maximum value $P_{max}^{t}$. By setting (22a), (22b), and (22c) to be $P_{max}^{t}$, respectively, we can obtain T_TWR = T_{d 1} when $P_{a}^{t - opt} = P_{max}^{t}$, T_TWR = T_{d 2} when $P_{b}^{t - opt} = P_{max}^{t}$, and T_TWR = T_{d 3} when $P_{r}^{t - opt} = P_{max}^{t}$. Without loss of generality, we assume that T_{d 1} ≥ T_{d 2} and T_{d 1} ≥ T_{d 3} (similar results can be obtained for other cases). In this case, $P_{a}^{t - opt}$ reaches the maximum value first, i.e., node $A$ transmits with the maximum transmit power. By substituting $P_{a}^{t} = P_{max}^{t}$ into (7) and (8), we have

P_{a}^{t - opt} = P_{max}^{t},

(23a)

P_{b}^{t - opt} = \frac{C_{1} C_{2} N_{0}^{2} ({|h_{a r}|}^{2} - {|h_{b r}|}^{2}) + C_{2} {|h_{a r}|}^{2} {|h_{b r}|}^{2} P_{max}^{t} N_{0}}{C_{1} {|h_{a r}|}^{2} {|h_{b r}|}^{2} P_{max}^{t} N_{0}},

(23b)

P_{r}^{t - opt} = \frac{C_{1} C_{2} N_{0}^{2} ({|h_{a r}|}^{2} - {|h_{b r}|}^{2}) + P_{max}^{t} N_{0} {|h_{a r}|}^{2} (C_{1} {|h_{a r}|}^{2} + C_{2} {|h_{b r}|}^{2}) + C_{1} {|h_{a r}|}^{2} N_{0}^{2}}{{|h_{a r}|}^{2} {|h_{b r}|}^{2} ({|h_{a r}|}^{2} P_{max}^{t} - C_{1} N_{0})} .

(23c)

The corresponding P_min(T_TWR) can be obtained by adding (23a), (23b), and (23c).

When T_TWR decreases further, the data rates increase further, and $P_{b}^{t - opt}$ and $P_{r}^{t - opt}$ in (23) increase until one of them reaches its maximum value. Without loss of generality, assume that $P_{b}^{t - opt}$ in (23b) reaches $P_{max}^{t}$ first. The corresponding value of T_TWR is denoted as T_min, which can be obtained by setting (23b) to $P_{max}^{t}$ . Then both nodes $A$ and $B$ transmit with the maximum power. Substituting $P_{a}^{t} = P_{b}^{t} = P_{max}^{t}$ into (7) and (8), we would need to find a single $P_{r}^{t}$ that satisfies two equations, which has no solution. Therefore, T_min is the minimum value of T_TWR due to the maximum transmit power constraint. Finally, the minimal sum transmit power is obtained as

P_{min} (T_{TWR}) = \begin{cases} (23a) + (23b) + (23c), & T_{min} \leq T_{TWR} < T_{d 1}, \\ (22a) + (22b) + (22c), & T_{TWR} \geq T_{d 1}, \end{cases}

(24)

where its first and second parts are, respectively, referred to as the "one-max" and "non-max" intervals for simplicity, as in the case of OWRT.

Then the optimization problem that minimizes the EC can be formulated as

\begin{matrix} min_{T_{TWR}} & T_{TWR} (\frac{P_{min} (T_{TWR})}{2 ε} + P_{T}^{c} - P_{T}^{c i}) + T P_{T}^{c i} \\ s.t. & T_{min} \leq T_{TWR} \leq T . \end{matrix}

(25)

Using a method similar to that in Appendix 2, we can prove that the objective function is a quasi-convex function of T_TWR. Therefore, efficient quasi-convex optimization techniques [24] can be applied to solve the problem.

5 Energy efficiency analysis

In this section, we compare the EEs of different transmit strategies, and analyze the impact of various channels and system settings.

From the objective functions in (20) and (25), we can see that the expressions of the ECs of OWRT and TWRT are quite complex because the minimal sum transmit powers are piecewise functions with very complicated expressions, i.e., (16), (17), (18), (19), and (24). To gain useful insight into the EE comparison, we consider the following two approximations.

Approximation 1: In the piecewise functions of P_min1(T_ab), P_min2(T_ba), and P_min(T_TWR), we only consider the "non-max" interval, where none of the nodes achieves its maximum transmit power.

We take the function P_min1(T_ab) in (16) as an example to explain the approximation. In the "non-max" interval, as the transmission time T_ab decreases, the transmit powers at node $A$ and the RN, i.e., $P_{a}^{t}$ and $P_{r 1}^{t}$ , both increase to support the increased data rate B_ab/T_ab. In the "one-max" interval, $P_{r 1}^{t}$ has reached its maximum value. As T_ab decreases, only $P_{a}^{t}$ can increase to support the increased data rate, thus $P_{a}^{t}$ grows much faster than in the "non-max" interval and approaches its maximum value rapidly. Therefore, the range (T_min1, T_{d 1}) of the "one-max" interval is very short, and in most cases the optimized $T_{a b}^{opt} \notin (T_{min 1}, T_{d 1})$ . Instead, $T_{a b}^{opt} \in (T_{d 1}, + \infty)$ . Based on this observation, we only consider the "non-max" interval in the range (T_{d 1}, +∞).

Since we only consider the case where none of the nodes reaches its maximal transmit power, we do not need to consider the maximum transmit power constraints. Therefore, it is not necessary to consider the corresponding minimum value constraints on the transmission time in this section.

Approximation 2: In the expressions of P_min1(T_ab), P_min2(T_ba), and P_min(T_TWR), we respectively consider that

2^{\frac{2 B_{a b}}{W T_{a b}}} - 1 \approx 2^{\frac{2 B_{a b}}{W T_{a b}}}, 2^{\frac{2 B_{b a}}{W T_{b a}}} - 1 \approx 2^{\frac{2 B_{b a}}{W T_{b a}}},

(26a)

2^{\frac{2 B_{a b}}{W T_{TWR}}} + 2^{\frac{2 B_{b a}}{W T_{TWR}}} - 2 \approx 2^{\frac{2 B_{a b}}{W T_{TWR}}} + 2^{\frac{2 B_{b a}}{W T_{TWR}}} - 1 .

(26b)

We take (26a) as an example to explain the approximation, which affects the values of the transmit powers P_min1(T_ab) and P_min2(T_ba) in OWRT. When the SEs in two directions, i.e., B_ab/(WT_ab) and B_ba/(WT_ba), are high, it is easy to see that the approximations in (26a) are accurate. On the other hand, when the SEs are low, the transmit powers P_min1(T_ab) and P_min2(T_ba) are much lower than the circuit PC. Then the approximations on transmit powers have little impact on the analysis of EC.

By applying these approximations, the ECs of OWRT and TWRT can be simplified as

\begin{align} E_{O} & \approx T_{a b} [\frac{N_{0}}{2 ε {|h_{e}|}^{2}} (2^{\frac{2 B_{a b}}{W T_{a b}}} - 1) + P_{O}^{c 1} - P_{O}^{c i}] \\ + T_{b a} [\frac{N_{0}}{2 ε {|h_{e}|}^{2}} (2^{\frac{2 B_{b a}}{W T_{b a}}} - 1) + P_{O}^{c 2} - P_{O}^{c i}] + T P_{O}^{c i}, \end{align}

(27)

E_{T} \approx T_{TWR} [\frac{N_{0}}{2 ε {|h_{e}|}^{2}} (2^{\frac{2 B_{a b}}{W T_{TWR}}} + 2^{\frac{2 B_{b a}}{W T_{TWR}}} - 2) + P_{T}^{c} - P_{T}^{c i}] + T P_{T}^{c i},

(28)

where $|h_{e}| ≜ 1 / (\frac{1}{|h_{a r}|} + \frac{1}{|h_{b r}|})$ can be viewed as an equivalent channel gain between two source nodes due to the usage of the relay.
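To give a feel for the accuracy of the approximated TWRT transmit power, the sketch below compares the sum of (22a), (22b), and (22c), written in a closed form obtained by adding the three expressions, with the approximation N_0 (C_1 + C_2)/|h_e|² used in the simplified EC. The function names, the default unit parameters, and the choice of equal load in both directions are assumptions made for illustration only.

```python
import math

def pmin_exact(C1, C2, N0, h_ar, h_br):
    """Sum of (22a)-(22c), collected into one expression."""
    S = C1 + C2
    return (S * N0 * (1.0 / h_ar**2 + 1.0 / h_br**2)
            + 2.0 * N0 * math.sqrt(S * (S + 1.0)) / abs(h_ar * h_br))

def pmin_approx(C1, C2, N0, h_ar, h_br):
    """Approximated sum power: (N0 / |h_e|^2) * (C1 + C2)."""
    h_e = 1.0 / (1.0 / abs(h_ar) + 1.0 / abs(h_br))
    return N0 * (C1 + C2) / h_e**2

def relative_gap(se_per_direction, N0=1.0, h_ar=1.0, h_br=1.0):
    """Relative error of the approximation when B_ab = B_ba.

    se_per_direction is B/(W*T_TWR) for one direction (illustrative input).
    """
    C = 2.0 ** (2.0 * se_per_direction) - 1.0
    exact = pmin_exact(C, C, N0, h_ar, h_br)
    return (exact - pmin_approx(C, C, N0, h_ar, h_br)) / exact
```

Under these assumptions the gap is below 1% at 2 bit/s/Hz per direction and grows as the SE falls, which is the regime where the text argues that the circuit PC dominates.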

For the convenience of comparison, we rewrite the EC of DT in the same form as follows,

\begin{align} E_{D} & = T_{a b} [\frac{N_{0}}{ε {|h_{a b}|}^{2}} (2^{\frac{B_{a b}}{W T_{a b}}} - 1) + P_{D}^{c 1} - P_{D}^{c i}] \\ + T_{b a} [\frac{N_{0}}{ε {|h_{a b}|}^{2}} (2^{\frac{B_{b a}}{W T_{b a}}} - 1) + P_{D}^{c 2} - P_{D}^{c i}] + T P_{D}^{c i} . \end{align}

(29)

5.1 Baseline case

As a baseline for further analysis, we first consider the case where all the circuit PCs are zero and the packet sizes in two directions are symmetric, i.e., P^ct = P^cr = P^ci = 0 and $B_{a b} = B_{b a} ≜ B$ . Then the ECs of OWRT, TWRT, and DT shown in (27), (28), and (29) are decreasing functions of the transmission time. As a result, the system will use the entire duration T for transmission. Due to the symmetric packet sizes, the optimal values of T_ab and T_ba are identical in DT and OWRT. This means that the optimal transmission times in DT and OWRT are $T_{a b}^{opt} = T_{b a}^{opt} = T / 2$ , and that in TWRT is $T_{TWR}^{opt} = T$ . After substituting the optimal transmission times into (27), (28), and (29), the minimum ECs can be obtained as

E_{D}^{min} = \frac{N_{0} T}{ε} \frac{(2^{\frac{2 B}{W T}} - 1)}{{|h_{a b}|}^{2}}, E_{O}^{min} \approx \frac{N_{0} T}{ε} \frac{(2^{\frac{4 B}{W T}} - 1)}{2 {|h_{e}|}^{2}}, E_{T}^{min} \approx \frac{N_{0} T}{ε} \frac{(2^{\frac{2 B}{W T}} - 1)}{{|h_{e}|}^{2}},

(30)

from which we can see that the optimal EE, $η_{EE}^{opt} = \frac{2 B}{E^{min}}$ , is a decreasing function of the packet size B in the three strategies. This implies that the maximal EE is achieved when B approaches zero.

Now, we compare the EEs of the three strategies. First, (30) shows that $E_{O}^{min} / E_{T}^{min} = (2^{\frac{2 B}{W T}} + 1) / 2 \geq 1$ , which means that TWRT is more energy efficient than OWRT.

Second, we see that $E_{D}^{min} / E_{T}^{min} = {|h_{e}|}^{2} / {|h_{a b}|}^{2}$ , i.e., the EE comparison between TWRT and DT depends on the equivalent channel gain |h_e| and the direct link channel gain |h_ab|. If |h_e| > |h_ab|, TWRT is more energy efficient; otherwise, DT is more energy efficient. To gain further insight into this comparison, we consider an AWGN channel,^b where |h_ab|² is normalized to 1 and the distances from the RN to nodes $A$ and $B$ are, respectively, d and 1 - d. Then ${|h_{a r}|}^{2} = {(\frac{1}{d})}^{α}$ and ${|h_{b r}|}^{2} = {(\frac{1}{1 - d})}^{α}$ , where α is the path loss attenuation factor. Then the equivalent channel gain becomes

|h_{e}| = 1 / (\frac{1}{|h_{a r}|} + \frac{1}{|h_{b r}|}) = \frac{1}{d^{α / 2} + {(1 - d)}^{α / 2}},

(31)

which is related to the RN position. To maximize |h_e|, the optimal relay position is the midpoint of the two source nodes, i.e., d = 0.5. In this case, |h_e| = 2^{α/2}/2. When α > 2, which is true in most practical channel environments, |h_e| = 2^{α/2}/2 > |h_ab| = 1, and TWRT is more energy efficient than DT.
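The relay position argument can be checked numerically with a short sketch of (31). The function names and the grid search are illustrative choices, not part of the original analysis; the direct link gain is normalized to 1 as in the text.

```python
def equivalent_gain(d, alpha):
    """|h_e| from (31) for a relay at normalized distance d from node A."""
    return 1.0 / (d ** (alpha / 2.0) + (1.0 - d) ** (alpha / 2.0))

def best_relay_position(alpha, steps=1000):
    """Grid search over 0 < d < 1 for the position that maximizes |h_e|."""
    grid = [i / steps for i in range(1, steps)]
    return max(grid, key=lambda d: equivalent_gain(d, alpha))
```

For α = 4 the midpoint gives |h_e| = 2, which exceeds the normalized direct gain, while for α = 2 the expression in (31) equals 1 for every d, so relaying brings no gain over the direct link in this idealized setting.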

Third, for DT and OWRT we have

E_{D}^{min} / E_{O}^{min} = \frac{{|h_{e}|}^{2}}{{|h_{a b}|}^{2}} \frac{2 (2^{\frac{2 B}{W T}} - 1)}{2^{\frac{4 B}{W T}} - 1} = \frac{{|h_{e}|}^{2}}{{|h_{a b}|}^{2}} \frac{2}{2^{\frac{2 B}{W T}} + 1} .

(32)

If |h_e| ≤ |h_ab|, since $\frac{2}{2^{\frac{2 B}{W T}} + 1} \leq 1$ , we have $E_{D}^{min} / E_{O}^{min} \leq 1$ , i.e., DT is more energy efficient than OWRT.

If |h_e| > |h_ab|, the comparison result depends on the packet size B. When $B \to 0$ , $\frac{2}{2^{\frac{2 B}{W T}} + 1} \to 1$ , then $E_{D}^{min} / E_{O}^{min} \to {|h_{e}|}^{2} / {|h_{a b}|}^{2} > 1$ . It means that in the low traffic region, OWRT is more energy efficient. When $B \to \infty$ , $\frac{2}{2^{\frac{2 B}{W T}} + 1} \to 0$ , then $E_{D}^{min} / E_{O}^{min} \to 0 < 1$ . It means that in the high traffic region, DT is more energy efficient. An intuitive explanation is as follows. On one hand, OWRT needs two phases for transmission in each direction, thus the data rate in each phase should be twice that in DT, which requires more transmit power. On the other hand, OWRT has a higher equivalent channel gain, which reduces the required transmit power. In the low traffic region, doubling the low data rate has little impact on the transmit power, and thus OWRT is more energy efficient due to its higher equivalent channel gain.
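The traffic level at which DT overtakes OWRT follows from setting (32) to 1. The sketch below evaluates the ratio and the resulting crossover point; the function names are illustrative, and the result applies only to the zero circuit power baseline with the deterministic channel gains assumed in this subsection.

```python
import math

def ec_ratio_dt_over_owrt(se, gain_ratio):
    """E_D^min / E_O^min from (32).

    se         : bidirectional SE 2B/(W*T) in bit/s/Hz
    gain_ratio : |h_e|^2 / |h_ab|^2
    """
    return gain_ratio * 2.0 / (2.0 ** se + 1.0)

def crossover_se(gain_ratio):
    """SE at which (32) equals 1; returns None if DT is never worse."""
    if gain_ratio <= 1.0:
        return None
    return math.log2(2.0 * gain_ratio - 1.0)
```

For example, a relay at the midpoint with α = 4 gives a gain ratio of 4, so OWRT consumes less energy than DT only while 2B/(WT) stays below log2(7) bit/s/Hz.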

Here we argue that even if OWRT exploits the direct link between $A$ and $B$ for spatial diversity, the conclusion will still be the same. With the direct link, the equivalent channel gain can be improved. However, the improvement is rather limited in most cases, because the signal attenuation between the two source nodes is much larger than that between the source nodes and the RN. Furthermore, OWRT has 1/2 spectral efficiency loss with respect to DT and TWRT, which cannot be recovered from the SNR gain.

5.2 Impact of circuit power consumption

In this subsection we assume symmetric packet sizes, i.e., B_ab = B_ba = B, but consider the non-zero circuit PCs in practical systems. Then the ECs in (27), (28), and (29) are no longer monotonically decreasing functions of the transmission time. As the transmission time increases, the transmit energy decreases since the required data rate reduces; the circuit energy, however, increases linearly. We take TWRT as an example to analyze the EE.

The optimal transmission time in TWRT can be obtained by taking the derivative of E_T in (28) with respect to T_TWR and setting it to zero, which is

\frac{d E_{T}}{d T_{TWR}} \approx \frac{d}{d T_{TWR}} \{T_{TWR} [\frac{N_{0}}{ε {|h_{e}|}^{2}} (2^{\frac{2 B}{W T_{TWR}}} - 1) + P_{T}^{c} - P_{T}^{c i}] + T P_{T}^{c i}\}

(33a)

= [\frac{N_{0} (2^{\frac{2 B}{W T_{TWR}}} - 1)}{ε {|h_{e}|}^{2}} + P_{T}^{c} - P_{T}^{c i}] - \frac{N_{0} ln 2}{ε {|h_{e}|}^{2}} 2^{\frac{2 B}{W T_{TWR}}} \frac{2 B}{W T_{TWR}}

(33b)

≜ [\frac{N_{0} (2^{η_{SE - T}} - 1)}{ε {|h_{e}|}^{2}} + P_{T}^{c} - P_{T}^{c i}] - \frac{N_{0} \ln 2}{ε {|h_{e}|}^{2}} 2^{η_{SE - T}} η_{SE - T} = 0 |_{η_{SE - T} = η_{SE - T}^{opt}},

(33c)

where $η_{SE - T} ≜ \frac{2 B}{W T_{TWR}}$ is the bidirectional SE of TWRT.

Although it is difficult to obtain a closed form solution of the optimal T_TWR, some observations can be obtained from (33). The optimal SE that minimizes the EC should satisfy (33c), from which we can see that $η_{SE - T}^{opt}$ does not depend on the packet size B. Therefore, the optimal transmission time $T_{TWR}^{opt} = \frac{2 B}{W η_{SE - T}^{opt}}$ increases linearly with B. Considering that T_TWR should not exceed the time duration of a block T, we obtain the following observation.

Observation 1: In high traffic region, $T_{TWR}^{opt} = T$ . In low traffic region where $\frac{2 B}{W η_{SE - T}^{opt}} \leq T$ , the optimal transmission time $T_{TWR}^{opt} = \frac{2 B}{W η_{SE - T}^{opt}}$ increases linearly with the packet size B.
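Since (33c) has no closed form solution, a numerical root finder is one way to obtain the optimal SE. The sketch below uses bisection, which is our choice rather than a method stated in the article, and lumps the constants into K = N_0/(ε|h_e|²) and delta_p = P_T^c - P_T^ci; both names are hypothetical. It assumes K > 0 and delta_p > 0, in which case the left-hand side of (33c) is positive at zero SE and strictly decreasing.

```python
import math

def stationarity(eta, K, delta_p):
    """Left-hand side of (33c) as a function of the SE eta."""
    return K * (2.0 ** eta - 1.0) + delta_p - K * math.log(2.0) * 2.0 ** eta * eta

def optimal_se(K, delta_p, tol=1e-12):
    """Bisection for the root of (33c); assumes K > 0 and delta_p > 0."""
    lo, hi = 0.0, 1.0
    while stationarity(hi, K, delta_p) > 0.0:  # widen until the sign flips
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if stationarity(mid, K, delta_p) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def optimal_time(B, W, T, K, delta_p):
    """Observation 1: T_opt = min(T, 2B / (W * eta_opt))."""
    return min(T, 2.0 * B / (W * optimal_se(K, delta_p)))
```

The packet size B enters only through `optimal_time`, never through `optimal_se`, which mirrors the statement that the optimal SE does not depend on B. For K = delta_p = 1 the root is 1/ln 2, which makes a convenient sanity check.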

In the high traffic region, the transmission time is $T_{TWR}^{opt} = T$ , so the bidirectional SE $\frac{2 B}{W T}$ increases linearly with the packet size B, and the transmit energy increases exponentially with B according to the capacity formula. In this case, the transmit EC is much larger than the circuit EC, thus the EE will be almost the same as that in the zero circuit PC scenario.

In the low traffic region, when the system transmits with the optimal transmission time $T_{TWR}^{opt} = \frac{2 B}{W η_{SE - T}^{opt}}$ , the expression in (33b) equals zero. Then we have

\begin{align} T_{TWR}^{opt} [\frac{N_{0} (2^{\frac{2 B}{W T_{TWR}^{opt}}} - 1)}{ε {|h_{e}|}^{2}} + P_{T}^{c} - P_{T}^{c i}] & = \frac{2 B N_{0}}{ε {|h_{e}|}^{2} W} (ln 2) 2^{\frac{2 B}{W T_{TWR}^{opt}}} \\ & = \frac{2 B N_{0}}{ε {|h_{e}|}^{2} W} (ln 2) 2^{η_{SE - T}^{opt}}, \end{align}

where the first equality comes from the fact that (33b) equals zero, and the second equality comes from $T_{TWR}^{opt} = \frac{2 B}{W η_{SE - T}^{opt}}$ .

By substituting B_ab = B_ba = B and $T_{TWR} = T_{TWR}^{opt}$ into the EC of TWRT in (28), and then applying the equality above, the minimum EC of TWRT can be obtained as

E_{T}^{min} = \frac{2 B N_{0}}{ε {|h_{e}|}^{2} W} (ln 2) 2^{η_{SE - T}^{opt}} + T P_{T}^{c i},

(34)

and the optimal EE of TWRT is given by

η_{EE - T}^{opt} = \frac{2 B}{\frac{2 B N_{0}}{ε {|h_{e}|}^{2} W} (ln 2) 2^{η_{SE - T}^{opt}} + T P_{T}^{c i}},

(35)

from which we can obtain the following observation.

Observation 2: In low traffic region, if the circuit PC in idle mode $P_{T}^{c i} = 0$ , we have $η_{EE - T}^{opt} = \frac{ε {|h_{e}|}^{2} W}{N_{0} (ln 2) 2^{η_{SE - T}^{opt}}}$ . Since we have shown that $η_{SE - T}^{opt}$ does not depend on the packet size B, $η_{EE - T}^{opt}$ also does not change with B in this case. If $P_{T}^{c i} \neq 0$ , $lim_{B \to 0} η_{EE - T}^{opt} = 0$ since a large portion of energy is consumed in the idle duration.

Note that although $lim_{B \to 0} η_{EE - T}^{opt} = 0$ due to the non-zero idle mode circuit PC, this observation does not mean that the idle duration is unnecessary. If the system transmits with the entire duration T, where $T > T_{TWR}^{opt}$ , it can save the EC in idle mode, but it wastes more EC in transmission mode because it does not transmit with the optimal transmission time. Finally, more energy will be consumed and the EE will be reduced. We will show this impact later in simulations.
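The two cases of Observation 2 can be illustrated by evaluating (35) directly. In the sketch below, K = N_0/(ε|h_e|²) is a lumped constant and eta_opt is the root of (33c), which the caller is assumed to supply; all names are illustrative. The expression is valid only in the low traffic region, i.e., when 2B/(W eta_opt) does not exceed T.

```python
import math

def ee_twrt_low_traffic(B, W, T, K, eta_opt, p_idle):
    """Optimal EE of TWRT from (35), with K = N0 / (eps * |h_e|^2).

    eta_opt is the solution of (33c); p_idle is the idle mode circuit PC.
    """
    active_energy = 2.0 * B * K / W * math.log(2.0) * 2.0 ** eta_opt
    return 2.0 * B / (active_energy + T * p_idle)
```

With `p_idle = 0` the packet size cancels and the EE is flat in B, while any positive `p_idle` adds the fixed term T P_T^ci to the denominator, so the EE tends to zero as B shrinks.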

Observation 2 shows that if $P_{T}^{c i} = 0$ , $η_{EE - T}^{opt}$ does not change with B in the low traffic region, where $\frac{2 B}{W η_{SE - T}^{opt}} \leq T$ , i.e., $B \leq T W η_{SE - T}^{opt} / 2$ . In other words, EE is insensitive to the packet size when $B \in (0, T W η_{SE - T}^{opt} / 2)$ . We can show that such a region becomes wider as the circuit power $P_{T}^{c}$ increases. By taking the derivative with respect to $P_{T}^{c}$ on both sides of (33c), we obtain

1 - \frac{N_{0} {(ln 2)}^{2}}{ε {|h_{e}|}^{2}} 2^{η_{SE - T}^{opt}} η_{SE - T}^{opt} \frac{d η_{SE - T}^{opt}}{d P_{T}^{c}} = 0,

(36)

from which we can see that $\frac{d η_{SE - T}^{opt}}{d P_{T}^{c}} = \frac{ε {|h_{e}|}^{2}}{N_{0} {(ln 2)}^{2} 2^{η_{SE - T}^{opt}} η_{SE - T}^{opt}} \geq 0$ , i.e., as the circuit power $P_{T}^{c}$ increases, $η_{SE - T}^{opt}$ increases, and then the region $(0, T W η_{SE - T}^{opt} / 2)$ extends.

Following an analogous procedure, we can obtain the same results as Observations 1 and 2 for DT and OWRT. The optimal EEs of DT and OWRT in the low traffic region can be obtained as

η_{EE - D}^{opt} = \frac{2 B}{\frac{B N_{0}}{ε {|h_{a b}|}^{2} W} (ln 2) (2^{η_{S E - D 1}^{opt}} + 2^{η_{S E - D 2}^{opt}}) + T P_{D}^{c i}},

(37)

η_{EE - O}^{opt} = \frac{2 B}{\frac{B N_{0}}{ε {|h_{e}|}^{2} W} (ln 2) (2^{2 η_{S E - O 1}^{opt}} + 2^{2 η_{S E - O 2}^{opt}}) + T P_{O}^{c i}},

(38)

where $η_{S E - D 1}^{opt}$ and $η_{S E - D 2}^{opt}$ are the optimal SEs in the $A \to B$ and $B \to A$ directions in DT, and $η_{S E - O 1}^{opt}$ and $η_{S E - O 2}^{opt}$ are those in OWRT. None of them depends on the packet size B. We omit the detailed derivations for conciseness.

Since it is difficult to derive closed form expressions for the optimal transmission time and the optimal SEs, there are also no closed form expressions for the optimal EEs. We will use simulations to compare the EEs of DT, OWRT, and TWRT under non-zero circuit PCs.

5.3 Impact of unequal data amounts in two directions

In this section, we assume that the circuit PCs are identical at each node, and consider that the packet sizes in two directions differ. Define B_ab = βB_s and B_ba = (1 - β)B_s, where B_s is the overall number of bits to be transmitted in two directions, and β is a factor reflecting the traffic asymmetry. We will show that once B_s is given, the minimum ECs of DT and OWRT are independent of β, but the EC of TWRT is minimized when β = 0.5. In other words, asymmetric packet sizes in two directions only reduce the EE of TWRT.

Proposition 1. The minimum EC of OWRT does not depend on β.

Proof. Since we assume $P_{O}^{c 1} = P_{O}^{c 2} ≜ P_{O}^{c}$ , the EC of OWRT in (27) can be rewritten as

\begin{align} E_{O} & = T_{a b} [\frac{N_{0} (2^{\frac{2 β B_{s}}{W T_{a b}}} - 1)}{2 ε {|h_{e}|}^{2}} + P_{O}^{c} - P_{O}^{c i}] \\ + T_{b a} [\frac{N_{0} (2^{\frac{2 (1 - β) B_{s}}{W T_{b a}}} - 1)}{2 ε {|h_{e}|}^{2}} + P_{O}^{c} - P_{O}^{c i}] + T P_{O}^{c i} . \end{align}

(39)

To minimize the EC, the optimal transmit time should satisfy that (see Appendix 3),

\frac{β B_{s}}{T_{a b}^{opt}} = \frac{(1 - β) B_{s}}{T_{b a}^{opt}} ≜ R_{O},

(40)

i.e., the data rates in the two directions are identical, where R_O is not a function of β. Then the minimum E_O can be obtained as follows by substituting (40) into (39),

\begin{align} E_{O}^{min} & = \frac{β B_{s}}{R_{O}} [\frac{N_{0} (2^{\frac{2 R_{O}}{W}} - 1)}{2 ε {|h_{e}|}^{2}} + P_{O}^{c} - P_{O}^{c i}] + \frac{(1 - β) B_{s}}{R_{O}} [\frac{N_{0} (2^{\frac{2 R_{O}}{W}} - 1)}{2 ε {|h_{e}|}^{2}} + P_{O}^{c} - P_{O}^{c i}] + T P_{O}^{c i} \\ = \frac{B_{s}}{R_{O}} [\frac{N_{0} (2^{\frac{2 R_{O}}{W}} - 1)}{2 ε {|h_{e}|}^{2}} + P_{O}^{c} - P_{O}^{c i}] + T P_{O}^{c i}, \end{align}

(41)

which is not a function of β.

This proposition is easy to understand intuitively: with the optimized transmission time, the OWRT system transmits with the same data rate in each direction, so each bit is transmitted with identical data rate R_O and thus with identical time duration 1/R_O. Therefore, the energy consumed by each bit is identical no matter in which direction it is transmitted. Then the minimum EC only depends on the overall number of transmitted bits B_s.

The minimum EC of DT, $E_{D}^{min}$ , can be obtained in a similar way, and it also does not depend on β. We do not show the results for conciseness.
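Proposition 1 can be checked numerically by minimizing the two direction terms of (39) for several values of β. The sketch below is an illustration under stated assumptions: the idle term T P_O^ci is dropped because it does not depend on β, the block duration is taken to be long enough that T_ab + T_ba ≤ T is not binding, the constants are lumped into K = N_0/(ε|h_e|²) and delta_p = P_O^c - P_O^ci, and a golden-section search (our choice of solver) is used for each one-dimensional minimization.

```python
import math

def golden_min(fun, lo, hi, iters=200):
    """Golden-section search for the minimizer of a unimodal function."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    for _ in range(iters):
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if fun(c) < fun(d):
            b = d
        else:
            a = c
    return 0.5 * (a + b)

def owrt_min_ec(beta, B_s, W, K, delta_p):
    """Minimum of (39) without the idle term, for 0 < beta < 1."""
    def direction_energy(bits):
        def cost(t):
            return t * (0.5 * K * (2.0 ** (2.0 * bits / (W * t)) - 1.0) + delta_p)
        # Search range corresponds to an SE exponent between 0.002 and 40.
        t_opt = golden_min(cost, 0.05 * bits / W, 1e3 * bits / W)
        return cost(t_opt)
    return direction_energy(beta * B_s) + direction_energy((1.0 - beta) * B_s)
```

Each direction term scales linearly with its number of bits at the optimum, so the sum depends only on B_s. With K = 1 and delta_p = 0.5 the minimum works out to e ln 2 per bit, which gives a closed form value to compare against.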

Proposition 2. The minimum EC of TWRT is a function of β, and its minimum value is achieved when β = 0.5.

Proof. The EC of TWRT in (28) can be rewritten as,

\begin{align} E_{T} & = T_{TWR} [\frac{N_{0} (2^{\frac{2 β B_{s}}{W T_{TWR}}} - 1)}{2 ε {|h_{e}|}^{2}} + \frac{P_{T}^{c} - P_{T}^{c i}}{2}] \\ + T_{TWR} [\frac{N_{0} (2^{\frac{2 (1 - β) B_{s}}{W T_{TWR}}} - 1)}{2 ε {|h_{e}|}^{2}} + \frac{P_{T}^{c} - P_{T}^{c i}}{2}] + T P_{T}^{c i} . \end{align}

(42)

If the transmission time in two directions could be different,^c the EC becomes

\begin{align} E_{T 1} & = T_{TWR1} [\frac{N_{0} (2^{\frac{2 β B_{s}}{W T_{TWR1}}} - 1)}{2 ε {|h_{e}|}^{2}} + \frac{P_{T}^{c} - P_{T}^{c i}}{2}] \\ + T_{TWR2} [\frac{N_{0} (2^{\frac{2 (1 - β) B_{s}}{W T_{TWR2}}} - 1)}{2 ε {|h_{e}|}^{2}} + \frac{P_{T}^{c} - P_{T}^{c i}}{2}] + T P_{T}^{c i} . \end{align}

(43)

Note that the only difference between E_T and E_{T 1} is the transmission time in their first and second terms. With fewer constraints on the transmission time, the minimum value of E_{T 1} achieved by optimizing T_TWR1 and T_TWR2 is a lower bound of the minimum value of E_T achieved by optimizing T_TWR, i.e., $E_{T}^{min} = min_{T_{TWR}} (E_{T}) \geq min_{T_{TWR1}, T_{TWR2}} (E_{T 1}) = E_{T 1}^{min}$ .

Following a procedure analogous to the analysis of the OWRT system, we can show that $E_{T 1}^{min}$ is not a function of β. Moreover, using a method similar to that in Appendix 3, we can prove that the optimal T_TWR1 and T_TWR2 that minimize (43) satisfy $\frac{β B_{s}}{T_{TWR1}^{opt}} = \frac{(1 - β) B_{s}}{T_{TWR2}^{opt}}$ . It suggests that $T_{TWR1}^{opt} = T_{TWR2}^{opt}$ only when β = 0.5. In this case, by choosing $T_{TWR} = T_{TWR1}^{opt} = T_{TWR2}^{opt}$ , E_T in (42) equals $E_{T 1}^{min}$ . Therefore, only when β = 0.5 does $E_{T}^{min}$ equal its lower bound $E_{T 1}^{min}$ . Then Proposition 2 is true.
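Proposition 2 can be illustrated in the same numerical spirit by minimizing (42) over the single shared transmission time for different β. As before, this is a sketch under assumptions: the idle term is dropped, the constraint T_TWR ≤ T is taken to be inactive, K = N_0/(ε|h_e|²) and delta_p = P_T^c - P_T^ci are lumped constants with hypothetical names, and golden-section search is our choice of solver.

```python
import math

def golden_min(fun, lo, hi, iters=200):
    """Golden-section search for the minimizer of a unimodal function."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    for _ in range(iters):
        c = b - inv_phi * (b - a)
        d = a + inv_phi * (b - a)
        if fun(c) < fun(d):
            b = d
        else:
            a = c
    return 0.5 * (a + b)

def twrt_min_ec(beta, B_s, W, K, delta_p):
    """Minimum of (42) over one shared T_TWR, without the idle term."""
    def cost(t):
        up = 2.0 ** (2.0 * beta * B_s / (W * t)) - 1.0
        down = 2.0 ** (2.0 * (1.0 - beta) * B_s / (W * t)) - 1.0
        return t * (0.5 * K * (up + down) + delta_p)
    # Search range keeps the exponents between 0 and 40.
    t_opt = golden_min(cost, 0.05 * B_s / W, 1e3 * B_s / W)
    return cost(t_opt)
```

Evaluating `twrt_min_ec` over a range of β shows a curve that is symmetric about β = 0.5 and lowest there, in line with the proposition.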

6 Simulation results

In this section, we evaluate the EEs of the three transmission strategies, DT, OWRT, and TWRT, and validate previous analysis via simulations.

Simulation parameter settings are summarized in Table 1, where we consider that three nodes are located on a straight line, and the RN is at the midpoint of two source nodes. In this case, the equivalent channel gain in relaying achieves the maximal value. The small scale fading channels are independent and identically distributed (i.i.d.) Rayleigh block fading, which remain constant during one block but are independent from one block to another. All the results are averaged over 500 channel realizations.

Table 1 List of important parameters


Increases in the distance D, the noise power N₀, and the attenuation factor α all result in higher required transmit power. Since their impacts are similar, we only show the impact of α. Because an increase of the block duration T is equivalent to a reduction of the number of transmitted bits per unit of time, we set T as a constant and change the values of B_ab and B_ba.

From [6, 21], the circuit PCs in practical systems usually range from dozens to hundreds of mW. Therefore, we set the circuit PCs in this range in the simulations. The power amplifier efficiency ε is set as 0.35 [21].

6.1 Baseline case

We first compare the EEs of different strategies in the baseline case where the circuit PCs are zero and the packet sizes B_ab = B_ba.

To show the EEs in different channel conditions, we set the attenuation factor α as 2 or 4. Since we are more interested in comparing the EEs rather than showing their absolute values, we normalize the EEs by the maximum EE of the DT system for each α. The normalized EE is shown in Figure 3, and the corresponding outage probability is shown in Figure 4. The x-axis is the overall number of transmitted bits in two directions normalized by the block duration and bandwidth, i.e., (B_ab + B_ba)/(TW), which can be viewed as the average bidirectional SE per block.^d

In Figure 3, because of the normalization, the EE curves of DT under different α overlap. It shows that the spectral efficient strategy TWRT is also energy efficient with respect to OWRT. When the attenuation factor is large, i.e., α = 4, the EE of TWRT is higher than that of DT, while when α = 2 the result is just the opposite. The comparison between DT and OWRT depends both on the packet size and the channel condition. When α = 2, DT always outperforms OWRT. When α = 4, OWRT is superior to DT in low traffic region, but is inferior to DT in high traffic region. All these results agree well with our analysis.

Figure 4 shows that when α = 2 the outage probabilities of DT, OWRT, and TWRT are zero for the considered packet sizes. When α = 4, the outage probabilities all increase. We see that TWRT offers the lowest outage probability, and thus can support a larger packet size given the same outage probability.

Since we only consider the case where the outage probability is lower than an acceptable threshold, say 10%, the EE curves of OWRT or DT when α = 4 are only plotted for the scenarios where (B_ab + B_ba)/(TW) is lower than 4 or 4.4 bits/s/Hz in Figure 3. In the following sections, we use the same method to determine the maximal packet sizes for DT, OWRT, and TWRT, which ensure that the outage probability is lower than 10%.

6.2 Non-zero circuit power consumption

In Figure 5, we take TWRT as an example to show the impact of different circuit powers. We present the maximal EEs, which are achieved by the optimized transmission time and transmit power, i.e., there may be an idle duration in each block. For comparison, we provide the baseline case again where the circuit PCs are zero. To show the necessity of the transmission time optimization, we also show the EE for a system that transmits over the entire block duration (i.e., there is no idle duration).

As expected, the non-zero circuit PC reduces the EE. It shows that the circuit PC only affects the EE in the low traffic region, i.e., in the low SE region. In the high SE region, since the transmit PC is much higher than the circuit PC, the EEs are almost the same for different circuit PCs. That is to say, the high and low SE regions are, respectively, "transmit power dominant" and "circuit power dominant".

When we assume the circuit PC in idle mode P^ci = 0, i.e., there exists an idle duration but its PC can be ignored, the EE does not change with SE in the "circuit power dominant" region. As the circuit PCs in the transmit and receive modes, P^ct and P^cr, increase, this region becomes wider.

When P^ci ≠ 0, the EE reduces to zero as the packet size decreases. Comparing the lowest two curves where P^ci = 10 mW, we can see that the EE will decrease if we do not consider the idle duration, i.e., do not optimize the transmission time. Moreover, it is shown that when the PC in idle mode is not negligible, there is a non-zero optimal packet size that maximizes the maximal EE.

All these results agree with our earlier analysis. We do not show the results of OWRT and DT, which are similar to those of TWRT.

In Figure 6, we compare the EEs of different strategies with equal circuit PC at each node, where α = 4. It shows that the EE of TWRT is always higher than that of OWRT. Since the path loss is severe, TWRT outperforms DT. OWRT is superior to DT in the low traffic region, but becomes inferior in the high traffic region. These results are the same as those in the zero circuit PC scenario.

From Figure 6, we see that the idle mode circuit power P^ci only affects the energy efficiencies in the low traffic region, and the comparison result among different strategies does not change whether P^ci is zero or not. Since the different EE curves are more distinguishable when the circuit power in idle mode is zero, in the following we set the circuit power in idle mode P^ci = 0 mW. Note that the circuit powers in transmit and receive modes, P^ct and P^cr, are still non-zero.

In Figure 7, we compare the EEs with unequal circuit PCs at each node. We set the circuit PCs as $p_{b}^{c t} = k_{b} p_{a}^{c t}, p_{b}^{c r} = k_{b} p_{a}^{c r}$ , where k_b ≥ 1, which means that node $B$ consumes more circuit power than node $A$ . We also set $p_{r}^{c t} = k_{r} p_{a}^{c t}, p_{r}^{c r} = k_{r} p_{a}^{c r}$ , where k_r ≥ 1 or k_r ≤ 1, which reflects the cases where the RN consumes more or less circuit power than node $A$ , depending on specific application scenarios.

It is easy to understand that if the circuit PC at the RN is high, the advantage of relay transmission over direct transmission shrinks and vice versa. Therefore, we focus on the comparison between OWRT and TWRT in Figure 7. We plot the performance gain of the maximal EE of TWRT over that of OWRT, i.e., $\frac{max (η_{EE - T}^{opt})}{max (η_{EE - O}^{opt})}$ , in order to observe whether TWRT is more energy efficient than OWRT, and how much performance gain TWRT can achieve.

From the simulation results in Figure 7, we can see that as k_b increases, i.e., the difference of the circuit PCs at the two source nodes becomes larger, the benefit of TWRT over OWRT shrinks. OWRT even becomes more energy efficient than TWRT when the relay circuit PC is low.

6.3 Unequal bidirectional packet sizes

Finally, we compare the maximal EEs with unequal bidirectional packet sizes, which are shown in Figure 8. It shows that the EEs of DT and OWRT do not depend on the ratio B_ab/B_ba, but the EE of TWRT reduces as the difference between B_ab and B_ba increases, and may even become lower than those of OWRT and DT.

Note that in all the simulations, we did not consider the Approximations 1 and 2 employed in the beginning of Section 5. We can see that the analytical results using those approximations agree well with the simulation results. This validates the previous theoretical analysis.

7 Conclusion

In this article, we studied the energy efficiencies of OWRT and TWRT, and compared them with that of direct transmission. We first found the maximal energy efficiencies of the three strategies by jointly optimizing the bidirectional transmission time and the transmit power. We then compared their maximal energy efficiencies with either zero or non-zero circuit power consumptions, and revealed the mechanisms to improve the energy efficiency of the three transmission strategies under different scenarios.

Analytical and simulation results showed that in symmetric systems with equal circuit power at each node and equal packet sizes in two directions, the spectral efficient two-way relaying is also more energy efficient than one-way relaying, but two-way relaying only provides higher energy efficiency than direct transmission when the path loss attenuation is large. In asymmetric systems where the circuit power consumptions at each node are different or the bidirectional packet sizes are unequal, the advantage of two-way relaying diminishes because it cannot simultaneously minimize the energy consumed by the transmissions in two directions. One-way relaying may offer higher energy efficiency, depending on the difference between the amounts of data in two directions. Compared with the joint transmit power and transmission time optimization, only optimizing the transmit power has a loss in EE when the packet size is small. All the comparison results reveal that relaying is not always more energy efficient than direct transmission, and that two-way relaying does not always offer higher energy efficiency than one-way relaying. To save energy, a system should choose the most suitable transmission strategy considering its required amount of data to be transmitted, channel statistics, hardware circuit powers, and so on.

We also showed the relationship between the energy efficiency and the spectral efficiency, i.e., the required amount of data normalized by bandwidth and time duration, for all three transmission strategies, which is largely dependent on the circuit power consumption. With zero circuit power, the energy efficiency achieves its maximum value as the spectral efficiency approaches zero. With non-zero circuit powers in the transmit and receive durations but negligible circuit powers in the idle duration, the energy efficiency does not change with the spectral efficiency in the low traffic region but reduces sharply in the high traffic region. With non-zero circuit powers in all the transmit, receive, and idle modes, there exists a non-zero optimal spectral efficiency that maximizes the maximal energy efficiency.

Appendix 1: Solution of optimization problem (14)

From (4), the transmit power at node $A$ can be expressed as a function of the transmit power at the RN in $A \to B$ link as

P_{a}^{t} = \frac{C_{1} {|h_{b r}|}^{2} P_{r 1}^{t} N_{0} + C_{1} N_{0}^{2}}{{|h_{a r}|}^{2} {|h_{b r}|}^{2} P_{r 1}^{t} - C_{1} {|h_{a r}|}^{2} N_{0}} ≜ f (P_{r 1}^{t}),

(44)

where $C_{1} ≜ 2^{2 B_{a b} / (T_{a b} W)} - 1$ .

By substituting (44) into both the objective function and the constraints of (14), the optimization problem can be rewritten as

\begin{matrix} min_{P_{r 1}^{t}} & f (P_{r 1}^{t}) + P_{r 1}^{t} \\ s.t . & f (P_{r 1}^{t}) \leq P_{max}^{t}, P_{r 1}^{t} \leq P_{max}^{t}, \end{matrix}

(45)

which only depends on $P_{r 1}^{t}$ .

It is easy to show that the objective function is convex by taking its second order derivative with respect to $P_{r 1}^{t}$ , which is positive wherever the denominator of (44) is positive, i.e., wherever $P_{a}^{t} > 0$ . Without the two constraints in this problem, the optimal $P_{r 1}^{t}$ can be obtained by setting the first order derivative of the objective function with respect to $P_{r 1}^{t}$ to zero, which gives

P_{r 1}^{t - o p t} = \frac{C_{1} N_{0}}{{|h_{b r}|}^{2}} + \frac{\sqrt{C_{1}^{2} + C_{1}} N_{0}}{|h_{a r} h_{b r}|} .

(46)

Then the corresponding optimal transmit power at node A can be obtained by substituting (46) into (44),

P_{a}^{t - opt} = f (P_{r 1}^{t - opt}) = \frac{C_{1} N_{0}}{{|h_{a r}|}^{2}} + \frac{\sqrt{C_{1}^{2} + C_{1}} N_{0}}{|h_{a r} h_{b r}|} .

(47)

We can see that both $P_{r 1}^{t - opt}$ and $P_{a}^{t - opt}$ are increasing functions of $C_{1} = 2^{2 B_{a b} / (T_{a b} W)} - 1$ , and thus decreasing functions of T_ab. Therefore, when T_ab is large enough, both $P_{r 1}^{t - opt}$ and $P_{a}^{t - opt} = f (P_{r 1}^{t - opt})$ satisfy the two constraints in (45), and (46) and (47) are the optimal solutions of problem (14).
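As a quick numerical sanity check of (44), (46) and (47), the following Python sketch evaluates the closed-form powers, confirms that they satisfy (44), and confirms that nearby relay powers give a larger sum power. The function names, the shorthand `g_ar` = |h_ar|² and `g_br` = |h_br|², and the normalized parameter values are illustrative assumptions, not values from the article.

```python
import math

def f(p_r, c1, g_ar, g_br, n0):
    """Eq. (44): transmit power at node A required for relay power p_r."""
    return (c1 * g_br * p_r * n0 + c1 * n0**2) / (g_ar * g_br * p_r - c1 * g_ar * n0)

def unconstrained_optimum(b_ab, t_ab, w, g_ar, g_br, n0):
    """Eqs. (46) and (47), ignoring the maximum-power constraints."""
    c1 = 2.0 ** (2.0 * b_ab / (t_ab * w)) - 1.0
    cross = math.sqrt(c1**2 + c1) * n0 / math.sqrt(g_ar * g_br)
    p_r = c1 * n0 / g_br + cross   # (46)
    p_a = c1 * n0 / g_ar + cross   # (47)
    return c1, p_a, p_r

# Hypothetical normalized values (assumptions): 2*B_ab/(T_ab*W) = 2, so C1 = 3.
G_AR, G_BR, N0 = 1.0, 4.0, 1.0
c1, p_a, p_r = unconstrained_optimum(b_ab=2.0, t_ab=1.0, w=2.0,
                                     g_ar=G_AR, g_br=G_BR, n0=N0)

def sum_power(p):
    """Objective of (45) as a function of the relay power alone."""
    return f(p, c1, G_AR, G_BR, N0) + p

print(p_a, p_r, sum_power(p_r))
```

Perturbing `p_r` by a small amount in either direction and re-evaluating `sum_power` is a simple way to see the convexity argument at work for a particular channel pair.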

As T_ab decreases, both $P_{r 1}^{t - opt}$ and $P_{a}^{t - opt}$ increase until one of them reaches its maximum value. By setting (46) and (47) equal to $P_{max}^{t}$ , respectively, we can derive the demarcation point T_ab = T_{d 1} where $P_{r 1}^{t - opt}$ reaches its maximal value, and the demarcation point T_ab = T_{d 2} where $P_{a}^{t - opt}$ reaches its maximal value. The derived T_{d 1} and T_{d 2} are given by

T_{d 1} = \frac{2 B_{a b}}{W {log}_{2} (1 + \frac{{|h_{b r}|}^{2} [N_{0} + 2 P_{max}^{t} {|h_{a r}|}^{2} - \sqrt{N_{0}^{2} + 4 P_{max}^{t} N_{0} {|h_{a r}|}^{2} + 4 {(P_{max}^{t})}^{2} {|h_{a r} h_{b r}|}^{2}}]}{2 N_{0} ({|h_{a r}|}^{2} - {|h_{b r}|}^{2})})},

(48)

T_{d 2} = \frac{2 B_{a b}}{W {log}_{2} (1 + \frac{{|h_{a r}|}^{2} [N_{0} + 2 P_{max}^{t} {|h_{b r}|}^{2} - \sqrt{N_{0}^{2} + 4 P_{max}^{t} N_{0} {|h_{b r}|}^{2} + 4 {(P_{max}^{t})}^{2} {|h_{a r} h_{b r}|}^{2}}]}{2 N_{0} ({|h_{b r}|}^{2} - {|h_{a r}|}^{2})})} .

(49)
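The demarcation point can be checked numerically. The sketch below evaluates the closed form (48) for T_{d 1} and verifies that plugging it back into (46) returns the maximum transmit power. The function names, the shorthand `g_ar` = |h_ar|² and `g_br` = |h_br|², and the normalized values are assumptions made for illustration; the closed form requires |h_ar|² ≠ |h_br|².

```python
import math

def p_r1_opt(t_ab, b_ab, w, g_ar, g_br, n0):
    """Eq. (46): unconstrained optimal relay power for transmission time t_ab."""
    c1 = 2.0 ** (2.0 * b_ab / (t_ab * w)) - 1.0
    return c1 * n0 / g_br + math.sqrt(c1**2 + c1) * n0 / math.sqrt(g_ar * g_br)

def t_d1(b_ab, w, g_ar, g_br, n0, p_max):
    """Eq. (48): time at which the relay power in (46) reaches p_max.

    Assumes g_ar != g_br, otherwise the closed form divides by zero.
    """
    root = math.sqrt(n0**2 + 4.0 * p_max * n0 * g_ar
                     + 4.0 * p_max**2 * g_ar * g_br)
    c1 = g_br * (n0 + 2.0 * p_max * g_ar - root) / (2.0 * n0 * (g_ar - g_br))
    return 2.0 * b_ab / (w * math.log2(1.0 + c1))

# Hypothetical normalized parameters (assumptions, not from the article).
params = dict(b_ab=2.0, w=2.0, g_ar=1.0, g_br=4.0, n0=1.0)
P_MAX = 10.0
td1 = t_d1(p_max=P_MAX, **params)
print(td1, p_r1_opt(td1, **params))
```

For any T_ab larger than the returned value, the relay power from (46) stays below the maximum, which matches the monotonicity argument in the text.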

If T_{d 1} ≥ T_{d 2}, then as T_ab decreases, $P_{r 1}^{t - opt}$ reaches its maximal value first, and we have

P_{r 1}^{t - opt} = P_{max}^{t} .

(50)

The corresponding $P_{a}^{t - opt}$ can be obtained by substituting (50) into (44), which is

P_{a}^{t - opt} = \frac{C_{1} {|h_{b r}|}^{2} P_{max}^{t} N_{0} + C_{1} N_{0}^{2}}{{|h_{a r}|}^{2} {|h_{b r}|}^{2} P_{max}^{t} - C_{1} {|h_{a r}|}^{2} N_{0}} .

(51)

If T_{d 1} < T_{d 2}, then as T_ab decreases, $P_{a}^{t - opt}$ reaches its maximal value first, and we have

P_{a}^{t - opt} = P_{max}^{t} .

(52)

The corresponding $P_{r 1}^{t - opt}$ can be derived using (44) by substituting (52),

P_{r 1}^{t - opt} = \frac{C_{1} {|h_{a r}|}^{2} P_{max}^{t} N_{0} + C_{1} N_{0}^{2}}{{|h_{a r}|}^{2} {|h_{b r}|}^{2} P_{max}^{t} - C_{1} {|h_{b r}|}^{2} N_{0}} .

(53)

By adding (46) and (47), (50) and (51), and (52) and (53), we can obtain the expressions of $P_{min 1} (T_{a b}) = min (P_{a}^{t} + P_{r 1}^{t})$ in (16) and (17).
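The three cases above can be collected into one routine. The following Python sketch is one possible way to do this, not the authors' implementation: instead of comparing T_ab with T_{d 1} and T_{d 2}, it checks directly which unconstrained power exceeds the limit, which selects the same case because the one-dimensional problem (45) is convex. The function name, the `None` return for an infeasible block, and the normalized values are assumptions.

```python
import math

def solve_problem_14(t_ab, b_ab, w, g_ar, g_br, n0, p_max):
    """Return (P_a, P_r1) minimizing P_a + P_r1, or None if infeasible.

    g_ar and g_br stand for |h_ar|^2 and |h_br|^2 (shorthand assumed here).
    """
    c1 = 2.0 ** (2.0 * b_ab / (t_ab * w)) - 1.0
    cross = math.sqrt(c1**2 + c1) * n0 / math.sqrt(g_ar * g_br)
    p_r = c1 * n0 / g_br + cross            # (46)
    p_a = c1 * n0 / g_ar + cross            # (47)
    if p_r <= p_max and p_a <= p_max:
        return p_a, p_r                     # neither node saturates
    if p_r > p_max:
        # Relay saturates first: (50) and (51).
        den = g_ar * g_br * p_max - c1 * g_ar * n0
        if den <= 0.0:
            return None
        p_a = (c1 * g_br * p_max * n0 + c1 * n0**2) / den
        return (p_a, p_max) if p_a <= p_max else None
    # Node A saturates first: (52) and (53).
    den = g_ar * g_br * p_max - c1 * g_br * n0
    if den <= 0.0:
        return None
    p_r = (c1 * g_ar * p_max * n0 + c1 * n0**2) / den
    return (p_max, p_r) if p_r <= p_max else None

# Hypothetical normalized parameters (assumptions): C1 = 3 for these values.
base = dict(t_ab=1.0, b_ab=2.0, w=2.0, g_ar=1.0, g_br=4.0, n0=1.0)
print(solve_problem_14(p_max=10.0, **base))  # unconstrained case
print(solve_problem_14(p_max=4.0, **base))   # node A at its maximum power
print(solve_problem_14(p_max=3.0, **base))   # infeasible block
```

Summing the two returned powers gives the value of P_min1(T_ab) for that transmission time under the stated assumptions.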

Appendix 2: Proof of quasi-convexity of the objective function in (20)

We consider the case where P_min1(T_ab) follows (16); the conclusion is the same if it follows (17). Since P_min1(T_ab) is a piecewise function of T_ab, $T_{a b} (\frac{P_{min 1} (T_{a b})}{2 ε} + P_{O}^{c 1} - P_{O}^{c i})$ is also a piecewise function. For simplicity, we define

T_{a b} (\frac{P_{min 1} (T_{a b})}{2 ε} + P_{O}^{c 1} - P_{O}^{c i}) = \left\{ \begin{matrix} f_{l} (T_{a b}), & T_{min 1} \leq T_{a b} < T_{d 1} \\ f_{r} (T_{a b}), & T_{a b} \geq T_{d 1} . \end{matrix} \right.

(54)

By taking the second order derivative of f_l(T_ab), we have ${f_{l}}^{''} (T_{a b}) \geq 0$ when T_min1 ≤ T_ab < T_{d 1}. Therefore, f_l(T_ab) is a convex function in the range T_min1 ≤ T_ab < T_{d 1}.

Then we will show that f_r(T_ab) is a quasi-convex function in the range T_ab ≥ T_{d 1}, for which we use the following lemma.

Lemma 1. Suppose that a function f(x) is second order differentiable in (x_L, x_R), $lim_{x \to x_{L}} f^{'} (x) < 0, lim_{x \to x_{R}} f^{'} (x) > 0$ , and f"(x) has only one zero point in (x_L, x_R). Then f(x) is a quasi-convex function on (x_L, x_R).

Proof. Since f(x) is second order differentiable, f'(x) is continuous on (x_L, x_R). Considering that $lim_{x \to x_{L}} f^{'} (x) < 0, lim_{x \to x_{R}} f^{'} (x) > 0$ , f'(x) has at least one zero point in (x_L, x_R). We then show that f'(x) can have only one zero point.

Assume that f'(x) has three or more zero points such that f'(a) = f'(b) = f'(c) = 0. According to Rolle's theorem, there exists a point x₁ ∈ (a, b) such that f"(x₁) = 0, and also a point x₂ ∈ (b, c) such that f"(x₂) = 0. This conflicts with the assumption that f"(x) only has one zero point.

Assume that f'(x) has two zero points such that f'(a) = f'(b) = 0, a, b ∈ (x_L, x_R). According to Rolle's theorem, there is a point x₁ ∈ (a, b) which satisfies f"(x₁) = 0. Without loss of generality, we assume that f'(x₁) > 0. Considering that $lim_{x \to x_{R}} f^{'} (x) > 0$ , and in (x₁, x_R), f'(x) only has one zero point f'(b) = 0, therefore, f'(b) = 0 is the minimum value of f'(x) in (x₁, x_R), and thus f"(b) = 0. Then we have two zero points for f"(x), which conflicts with the assumption that f"(x) only has one zero point.

Consequently, f'(x) can have only one zero point. Denote it by x_M, i.e., f'(x_M) = 0. Then in (x_L, x_M), f'(x) < 0 and f(x) is non-increasing, while in (x_M, x_R), f'(x) > 0 and f(x) is non-decreasing, which means that f(x) is a quasi-convex function in (x_L, x_R) [24].

By taking the derivative of f_r(T_ab), we find that $lim_{T_{a b} \to 0} f_{r}^{'} (T_{a b}) = - \infty$ , and $lim_{T_{a b} \to \infty} f_{r}^{'} (T_{a b}) = P_{O}^{c 1} - P_{O}^{c i} \geq 0$ since the circuit PC in the idle mode is lower than that in the transmit or receive mode. We also find that

f_{r}^{″} (T_{a b}) = k_{1} [k_{2} + k_{3} g (T_{a b})],

(55)

where $k_{1} = \frac{2 {(ln 2)}^{2} B_{a b}^{2} N_{0}}{(W^{2} T_{a b}^{3} ε)} 2^{\frac{2 B_{a b}}{(W T_{a b})}} > 0$ , $k_{2} = \frac{1}{{|h_{a r}|}^{2}} + \frac{1}{{|h_{b r}|}^{2}} > 0$ , $k_{3} = \frac{1}{2 |h_{a r} h_{b r}|} > 0$ , k₂ and k₃ do not depend on T_ab, and g(T_ab) is given by

g (T_{a b}) = \frac{4 {(2^{\frac{2 B_{a b}}{W T_{a b}}} - 1)}^{2} + 2 (2^{\frac{2 B_{a b}}{W T_{a b}}} - 1) - 1}{(2^{\frac{2 B_{a b}}{W T_{a b}}} - 1) \sqrt{(2^{\frac{2 B_{a b}}{W T_{a b}}} - 1) 2^{\frac{2 B_{a b}}{W T_{a b}}}}} .

(56)

We can easily obtain that $lim_{T_{a b} \to 0} g (T_{a b}) = 4, lim_{T_{a b} \to + \infty} g (T_{a b}) = - \infty$ and g'(T_ab) < 0 for T_ab > 0. Then g(T_ab) strictly monotonically decreases from 4 to -∞ when T_ab > 0. Since k₁, k₂ and k₃ are all positive, $f_{r}^{″} (T_{a b})$ in (55) has only one zero point, located where g(T_ab) = -k₂/k₃. According to Lemma 1, f_r(T_ab) is a quasi-convex function on (0, + ∞), and thus a quasi-convex function on [T_{d 1}, + ∞).
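The behaviour of g(T_ab) can be inspected numerically. The sketch below evaluates (56) on a geometric grid of transmission times and counts the sign changes of the bracket k₂ + k₃ g(T_ab) from (55). The grid, the normalized values of B_ab, W and the channel gains, and all variable names are assumptions chosen for illustration.

```python
import math

def g(t_ab, b_ab, w):
    """Eq. (56), written with x = 2^(2 B_ab / (W T_ab)) - 1."""
    x = 2.0 ** (2.0 * b_ab / (w * t_ab)) - 1.0
    return (4.0 * x**2 + 2.0 * x - 1.0) / (x * math.sqrt(x * (x + 1.0)))

# Hypothetical normalized parameters (assumptions, not from the article).
B_AB, W = 1.0, 1.0
G_AR, G_BR = 1.0, 4.0          # |h_ar|^2 and |h_br|^2

grid = [0.2 * 1.5**k for k in range(15)]     # T_ab from 0.2 to about 58
g_vals = [g(t, B_AB, W) for t in grid]

k2 = 1.0 / G_AR + 1.0 / G_BR
k3 = 1.0 / (2.0 * math.sqrt(G_AR * G_BR))
bracket = [k2 + k3 * v for v in g_vals]      # sign of f_r'' since k1 > 0
sign_changes = sum(1 for u, v in zip(bracket, bracket[1:]) if (u > 0) != (v > 0))

print(g_vals[0], g_vals[-1], sign_changes)
```

On this grid the values start just below 4, fall monotonically, and the bracket changes sign once, which is the single zero of the second derivative that Lemma 1 relies on.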

Based on the expression of T_{d 1} derived in Appendix 1, we can obtain that $lim_{T_{a b} \to T_{d 1}} f_{l}^{'} (T_{a b}) = lim_{T_{a b} \to T_{d 1}} f_{r}^{'} (T_{a b}) ≜ δ$ . If $δ \leq 0, T_{a b} (\frac{P_{min 1} (T_{a b})}{2 ε} + P_{O}^{c 1} - P_{O}^{c i}) = f_{l} (T_{a b})$ monotonically decreases in [T_min1, T_{d 1}) due to the convexity of f_l(T_ab), while $T_{a b} (\frac{P_{min 1} (T_{a b})}{2 ε} + P_{O}^{c 1} - P_{O}^{c i}) = f_{r} (T_{a b})$ first decreases and then increases in [T_{d 1},+ ∞) due to the quasi-convexity of f_r(T_ab). Therefore, $T_{a b} (\frac{P_{min 1} (T_{a b})}{2 ε} + P_{O}^{c 1} - P_{O}^{c i})$ is quasi-convex in [T_min1, + ∞). If δ > 0, the same is true: f_r(T_ab) is non-decreasing in [T_{d 1}, + ∞) because its derivative is already positive at T_{d 1}, while the convex f_l(T_ab) either increases throughout [T_min1, T_{d 1}) or first decreases and then increases.

Appendix 3: Derivation of the optimal transmission time

Recall that in Approximation 1, we only consider the case where none of the nodes achieves its maximal transmit power and thus we can ignore the minimum value constraints on transmission time. Then the optimization problem of the transmission time is given by

\begin{matrix} min_{T_{a b}, T_{b a}} & T_{a b} [\frac{N_{0} (2^{\frac{2 β B_{s}}{W T_{a b}}} - 1)}{2 ε {|h_{e}|}^{2}} + P_{O}^{c} - P_{O}^{c i}] + T_{b a} [\frac{N_{0} (2^{\frac{2 (1 - β) B_{s}}{W T_{b a}}} - 1)}{2 ε {|h_{e}|}^{2}} + P_{O}^{c} - P_{O}^{c i}] + T P_{O}^{c i} \\ s.t. & T_{a b} + T_{b a} \leq T \end{matrix}

(57)

This is a convex problem, where the optimal T_ab and T_ba should satisfy the following Karush-Kuhn-Tucker (KKT) conditions,

λ (T_{a b}^{opt} + T_{b a}^{opt} - T) = 0,

(58a)

\frac{N_{0} (2^{\frac{2 β B_{s}}{W T_{a b}^{opt}}} - 1)}{2 ε {|h_{e}|}^{2}} + P_{O}^{c} - P_{O}^{c i} - \frac{N_{0}}{2 ε {|h_{e}|}^{2}} 2^{\frac{2 β B_{s}}{W T_{a b}^{opt}}} (ln 2) \frac{2 β B_{s}}{W T_{a b}^{opt}} = - λ,

(58b)

\frac{N_{0} (2^{\frac{2 (1 - β) B_{s}}{W T_{b a}^{opt}}} - 1)}{2 ε {|h_{e}|}^{2}} + P_{O}^{c} - P_{O}^{c i} - \frac{N_{0}}{2 ε {|h_{e}|}^{2}} 2^{\frac{2 (1 - β) B_{s}}{W T_{b a}^{opt}}} (ln 2) \frac{2 (1 - β) B_{s}}{W T_{b a}^{opt}} = - λ,

(58c)

where λ is the Lagrange multiplier.

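We can see that the left-hand sides of (58b) and (58c) are equal to each other, and that they are the same function evaluated at $β B_{s} / T_{a b}^{opt}$ and $(1 - β) B_{s} / T_{b a}^{opt}$ , respectively. Because this function is strictly decreasing in its argument, the optimal transmission time satisfies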

\frac{β B_{s}}{T_{a b}^{opt}} = \frac{(1 - β) B_{s}}{T_{b a}^{opt}} ≜ R_{O} .

(59)

Substituting (59) into the KKT conditions, it is easy to see that R_O is not a function of β.
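The independence of R_O from β can be illustrated with a small numerical sketch. The code below writes the left-hand side of (58b) and (58c) as a function of the rate, finds its root by bisection for the case λ = 0, and falls back to the rate B_s/T when the time constraint in (57) is active. That case split is this sketch's reading of the complementary slackness condition (58a), and the lumped constant `c` = N₀/(2ε|h_e|²), the symbol `delta_p` = P_O^c − P_O^ci, and the normalized values are assumptions.

```python
import math

def stationarity(r, c, delta_p, w):
    """Left-hand side of (58b)/(58c) as a function of the rate r (bits/time)."""
    e = 2.0 ** (2.0 * r / w)
    return c * (e - 1.0) + delta_p - c * e * math.log(2.0) * 2.0 * r / w

def unconstrained_rate(c, delta_p, w):
    """Root of the stationarity condition with lambda = 0, by bisection."""
    lo, hi = 0.0, w
    while stationarity(hi, c, delta_p, w) > 0.0:
        hi *= 2.0                      # expand until the sign flips
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if stationarity(mid, c, delta_p, w) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def optimal_times(beta, b_s, t_block, c, delta_p, w):
    """Return (T_ab, T_ba, R_O) satisfying (59) and T_ab + T_ba <= T."""
    r_o = max(unconstrained_rate(c, delta_p, w), b_s / t_block)
    return beta * b_s / r_o, (1.0 - beta) * b_s / r_o, r_o

# Hypothetical normalized parameters (assumptions, not from the article).
for beta in (0.3, 0.8):
    print(beta, optimal_times(beta, b_s=1.0, t_block=100.0,
                              c=1.0, delta_p=1.0, w=1.0))
```

Changing `beta` only redistributes the total transmission time between the two directions; the common rate returned as the third element stays the same.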

Endnotes

^aThe feasible region of the EE optimization problem may be empty, which implies an outage in that block, so no optimization is needed for it. A similar case also exists in the OWRT and TWRT optimization problems.

^bIt should be noted that AWGN channel is appropriate for modeling free space propagation where α = 2. We consider different path loss attenuation factors here, which may be an abuse of the terminology of "AWGN channel".

^cThis cannot happen in practice and is considered only for the proof.

^dThe average bidirectional SE per block takes into account the entire duration of a block, which includes not only the transmission time but also the idle duration.

References

[1] Correia L, Zeller D, Blume O, Ferling D, Jading Y, Godor I, Auer G, Perre L: Challenges and enabling technologies for energy aware mobile radio networks. IEEE Commun Mag 2010, 48(11):66-72.
[2] Li G, Xu Z, Xiong C, Yang C, Zhang S, Chen Y, Xu S: Energy-efficient wireless communications: tutorial, survey and open issues. IEEE Commun Mag 2011, 18(6):28-35.
[3] Han C, Harrold T, Armour S, Krikidis I, Videv S, Grant PM, Haas H, Thompson JS, Ku I, Wang C, Le T, Nakhai MR, Zhang J, Hanzo L: Green radio: radio techniques to enable energy-efficient wireless networks. IEEE Commun Mag 2011, 49(6):46-54.
[4] Chen Y, Zhang S, Xu S, Li G: Fundamental trade-offs on green wireless networks. IEEE Commun Mag 2011, 49(6):30-37.
[5] Li Y, Zhang X, Peng M, Wang W: Power provisioning and relay positioning for two-way relay channel with analog network coding. IEEE Signal Process Lett 2011, 18(9):517-520.
[6] Dohler M, Li Y: Cooperative Communications, Hardware, Channel & Phy. Wiley, UK; 2010.
[7] Laneman J, Tse D, Wornell G: Cooperative diversity in wireless networks: efficient protocols and outage behavior. IEEE Trans Inf Theory 2004, 50(12):3062-3080. 10.1109/TIT.2004.838089
[8] Rankov B, Wittneben A: Spectral efficient protocols for half-duplex fading relay channels. IEEE J Select Areas Commun 2007, 25(2):379-389.
[9] Oechtering T, Jorswieck E, Wyrembelski R, Boche H: On the optimal transmit strategy for the MIMO bidirectional broadcast channel. IEEE Trans Commun 2009, 57(12):3817-3826.
[10] Sun C, Li Y, Vucetic B, Yang C: Transceiver design for multi-user multi-antenna two-way relay channels. In Proceedings of IEEE Global Telecommunications Conference (GLOBECOM'10). Miami, Florida, USA; 2010:1-5.
[11] Bae C, Stark W: End-to-end energy-bandwidth tradeoff in multihop wireless networks. IEEE Trans Inf Theory 2009, 55(9):4051-4066.
[12] Chen C, Stark W, Chen S: Energy-bandwidth efficiency tradeoff in MIMO multi-hop wireless networks. IEEE J Select Areas Commun 2011, 29(8):1537-1546.
[13] Madan R, Mehta N, Molisch A, Zhang J: Energy-efficient cooperative relaying over fading channels with simple relay selection. IEEE Trans Wirel Commun 2008, 7(8):3013-3025.
[14] Wang S, Nie J: Energy efficiency optimization of cooperative communication in wireless sensor networks. EURASIP J Wirel Commun Netw 2010, 2010: 1-8.
[15] Cao D, Zhou S, Zhang C, Niu Z: Energy saving performance comparison of coordinated multi-point transmission and wireless relaying. In Proceedings of IEEE Global Telecommunications Conference (GLOBECOM'10). Miami, Florida, USA; 2010:1-5.
[16] Rost P: Opportunities, benefits, and constraints of relaying in mobile communication systems. Technische University Dresden; 2009.
[17] Xu H, Li B: XOR-assisted cooperative diversity in OFDMA wireless networks: Optimization framework and approximation algorithms. In Proceedings of IEEE International Conference on Computer Communications (INFOCOM'09). Rio de Janeiro, Brazil; 2009:2141-2149.
[18] Li Q, Ting S, Pandharipande A, Han Y: Adaptive two-way relaying and outage analysis. IEEE Trans Wirel Commun 2009, 8(6):3288-3299.
[19] Zafer M, Modiano E: Optimal rate control for delay-constrained data transmission over a wireless channel. IEEE Trans Inf Theory 2008, 54(9):4020-4039.
[20] Lee J, Jindal N: Energy-efficient scheduling of delay constrained traffic over fading channels. IEEE Trans Wirel Commun 2009, 8(4):1866-1875.
[21] Cui S, Goldsmith A, Bahai A: Energy-constrained modulation optimization. IEEE Trans Wirel Commun 2005, 4(5):2349-2360.
[22] Howard S, Schlegel C, Iniewski K: Error control coding in low-power wireless sensor networks: when is ECC energy-efficient? EURASIP J Wirel Commun Netw 2006, 2006: 1-14.
[23] Louie R, Li Y, Vucetic B: Practical physical layer network coding for two-way relay channels: performance analysis and comparison. IEEE Trans Wirel Commun 2010, 9(2):764-777.
[24] Boyd S, Vandenberghe L: Convex Optimization. Cambridge University Press, New York; 2004.

Acknowledgements

We would like to thank Prof. Andreas F. Molisch and Prof. Zixiang Xiong for the helpful discussions. This study was supported in part by the National Natural Science Foundation of China (NSFC) under Grant 61120106002 and in part by National Basic Research Program of China, 973 Program 2012CB316003.

Author information

Authors and Affiliations

School of Electronics and Information Engineering, Beihang University, Beijing, 100191, China
Can Sun & Chenyang Yang

Corresponding author

Correspondence to Can Sun.

Additional information

Competing interests

The authors declare that they have no competing interests.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

About this article

Cite this article

Sun, C., Yang, C. Energy efficiency analysis of one-way and two-way relay systems. J Wireless Com Network 2012, 46 (2012). https://doi.org/10.1186/1687-1499-2012-46

Received: 29 September 2011
Accepted: 14 February 2012
Published: 14 February 2012
DOI: https://doi.org/10.1186/1687-1499-2012-46
