# Energy efficiency analysis of one-way and two-way relay systems

- Can Sun
^{1}Email author and - Chenyang Yang
^{1}

**2012**:46

https://doi.org/10.1186/1687-1499-2012-46

© Sun and Yang; licensee Springer. 2012

**Received: **29 September 2011

**Accepted: **14 February 2012

**Published: **14 February 2012

## Abstract

Relaying is supposed to be a low energy consumption technique since the long distance transmission is divided into several short distance transmissions. When the power consumptions (PCs) other than that consumed by transmitting information bits is taken into account, however, relaying may not be energy efficient. In this article, we study the energy efficiencies (EEs) of one-way relay transmission (OWRT) and two-way relay transmission (TWRT) by comparing with direct transmission (DT). We consider a system where two source nodes transmit to each other with the assistance of a half-duplex amplify-and-forward relay node. We first find the maximum EEs of DT, OWRT, and TWRT by optimizing the transmission time and the transmit powers at each node. Then we compare the maximum EEs of the three strategies, and analyze the impact of circuit PCs and data amount. Analytical and simulation results show that relaying is not always more energy efficient than DT. Moreover, TWRT is not always more energy efficient than OWRT, despite that it is more spectral efficient. The EE of TWRT is higher than those of DT and OWRT in symmetric systems where the circuit PCs at each node are identical and the numbers of bits to be transmitted in two directions are equal. In asymmetric systems, however, OWRT may provide higher EE than TWRT when the numbers of bits in two directions differ significantly.

## 1 Introduction

Since the explosive growth of wireless services is sharply increasing their contributions to the carbon footprint and the operating costs, energy efficiency (EE) has drawn more and more attention recently as a new design goal for various wireless communication systems [1–3], compared with spectral efficiency (SE) that has been the design focus for decades.

A widely used performance metric for EE is the number of transmitted bits per unit of energy. When only transmit power is taken into account, the EE monotonically decreases with the increase of the SE [4] at least for point-to-point transmission in additive white Gaussian noise (AWGN) channel. In that case, when we minimize the transmit power, the EE will be maximized [5]. In practical systems, however, not only the power for transmitting information bits but also various signaling and circuits contribute to the system energy consumption (EC), which fundamentally change the relationship between the SE and EE. Specifically, when the circuit power consumption (PC) is considered, the optimization problem that minimizes the overall transmit power does not necessarily lead to an energy efficient design [2].

Relaying is viewed as an energy saving technique because it can reduce the transmit power by breaking one long range transmission into several short range transmissions [3]. In fact, relaying has been extensively studied from another viewpoint, i.e., it is able to extend the coverage, enhance the reliability as well as the capacity of wireless systems [6]. One-way relay transmission (OWRT) can reduce the one-hop communication distance and provide spatial diversity, but its SE will also reduce to 1/2 of that of direct transmission (DT) when practical half-duplex relay is applied [7]. Fortunately, two-way relay transmission (TWRT) can recover the SE loss when properly designed [8–10]. However, it is not well-understood whether these relay strategies are energy efficient, when various energy costs in addition to transmit power are considered.

Considering both the transmit power and the receiver processing power, the EE of decode-and-forward (DF) OWRT systems was studied with single-antenna and multi-antenna nodes in [11, 12], respectively. In [13], after accounting for the energy cost of acquiring channel information, relay selection for an OWRT system with multiple DF relays was optimized to maximize the EE. In [14], the EE of DF OWRT was compared with that of DT, where the result shows that OWRT is more energy efficient when the distance between source and destination is large, otherwise DT is better. In [15, 16], the EEs of OWRT and base station cooperation transmission were compared, where the overall energy costs including those from manufacture and deployment were considered. In [17], TWRT was shown to be more energy efficient than OWRT via simulations, where only transmit power was considered in the EC model. In [5], the EE of TWRT was compared with those of OWRT and DT, with optimized relay position and transmit power at each node. It shows that when the relay is placed at the midpoint of two source nodes, TWRT consumes less energy than OWRT and DT. Again, only transmit power was considered in the EC model. When we take into account the energy costs other than that contributed by the transmit power, what is the results of comparison between relaying and DT? Will TWRT still be more energy efficient than OWRT?

In this article, we analyze the EEs of TWRT, OWRT, and DT by studying a simple amplify-and-forward (AF) relay system. In literature, there are other relay protocols such as DF and compress-and-forward (CF) that provide higher rate regions than AF. However, AF is also widely considered in practice [6], and is superior to DF in outage performance for TWRT when the channel gains from two source nodes to the relay node are symmetric [18]. Moreover, the system models differ a lot among the relay protocols. In order to analyze the maximal EE, we need to find the relationship between end-to-end data rate and transmit power. With AF protocol, we can obtain the data rate-transmit power relationship by deriving the signal-to-noise ratio (SNR) at the destination. With DF protocol, the end-to-end data rate is quite different, which is modeled as the lower one of the achievable data rates in two hops. When considering CF, the case is even more complicated since its transmission and processing procedure is usually very complex, which is rather involved for analysis. Here we focus on AF relay as a good start, while the EEs of other relay protocols will be considered in future studies.

We consider a delay-constrained system, where *B* bits of message should be transmitted as a block within a duration *T*. This model is widely used for applications with strict delay constraints on data delivery, e.g., Voice-over-IP and sensor networks, where the message is generated periodically and must be transmitted with a hard deadline [19–21]. Note that the energy consumed by transmitting information decreases as the transmission duration increases [4], but the energy consumed by circuits increases with the duration. Therefore, in such a system we can adjust the transmission duration to reduce the overall EC as long as the transmission duration is shorter than the block length *T*. In other word, the system may transmit the *B* bits in a shorter duration than *T* and then switch to an idle status until the next block [21]. During the idle status, a part of the transceiver hardware can be shut down, which can be exploited to improve the EE.

Specifically, we first maximize the EEs of TWRT, OWRT, and DT by optimizing transmission time and transmit powers, respectively, for the three strategies. We then compare the optimized EEs of TWRT with those of OWRT and DT. We show that when all the three strategies operate with optimized transmission time and power, relaying is *not* always more energy efficient than DT. Moreover, TWRT is *not* always more energy efficient than OWRT if the numbers of bits to be transmitted in two directions are unequal, or the circuit PCs at each node are different.

The rest of this article is organized as follows. System model and the ECs of the three transmit strategies are, respectively, described in Sections 2 and 3. Then the EEs of different strategies are optimized in Section 4. In Section 5, the optimized EEs are compared under varies circuit PCs and numbers of transmitted bits. Simulation results are given in Section 6. Section 7 concludes the article.

## 2 System model

Consider a system consisting of two source nodes $A$ and $B$, and an AF half-duplex relay node (RN) ℝ, each equipped with a single antenna. We consider a delay constrained system, where the information bits are generated periodically and must be transmitted in a block within a hard deadline *T*. In each block, nodes *A* and *B*, respectively, intends to transmit *B*_{
ab
}and *B*_{
ba
}bits to each other with bandwidth *W.* In practice, the information bits to be transmitted in each block compose a packet or a frame, depending on application scenarios. In the following, we use the term "packet size" to refer the amount of data in each block, i.e., *B*_{
ab
}and *B*_{
ba
}.

*h*

_{ ab },

*h*

_{ ar }, and

*h*

_{ br }, as shown in Figure 1. We assume perfect channel knowledge at each node. The noise power

*N*

_{0}is assumed to be identical at each node.

To reduce the EC, the system may not use the entire duration *T* for transmission in each block. After *B*_{
ab
}and *B*_{
ba
}bits have been transmitted, the nodes can operate at an idle status until next block. In other word, each node has three modes: transmission, reception, and idle. The PCs in these modes are, respectively, denoted as *P*^{
t
}/ϵ + *P*^{
ct
}, *P*^{
cr
}, and *P*^{
ci
}, where *P*^{
t
}is the transmit power, ϵ ∈ (0, 1] denotes the power amplifier efficiency, *P*^{
ct
}, *P*^{
cr
}, and *P*^{
ci
}are, respectively, the circuit PCs in transmission, reception, and idle modes.

The circuit PCs in *P*^{
ct
}and *P*^{
cr
}consist of two parts: the power consumed by baseband processing and radio frequency (RF) circuits. The PC of RF circuit is usually assumed independent of data rate [6, 21], while there are different assumptions for the PC of baseband processing circuit. In systems with low complexity baseband processing, the baseband PC can be neglected compared with the RF PC [6, 21]. Otherwise, the baseband PC is not negligible and increases with data rate [22]. In this article, we consider the first case, where *P*^{
ct
}and *P*^{
cr
}only consist of RF PC, which are modeled as constants independent of data rate. Modeling *P*^{
ct
}and *P*^{
cr
}as functions of data rate leads to a different optimization problem, which will be considered in our future study.

The PC in idle mode *P*^{
ci
}is modeled as a constant, and *P*^{
ci
}≤ *P*^{
ct
}, *P*^{
ci
}≤ *P*^{
cr
}. Subscripts (·)_{
a
}, (·)_{
b
}, and (·)_{
r
}will be used to denote the PCs at different nodes.

## 3 Energy consumptions of three transmit strategies

We consider three transmit strategies, DT, OWRT, and TWRT, to complete the bidirectional communication between the two source nodes. In the following, we respectively introduce their ECs.

### 3.1 Direct transmission

*T*

_{ ab }for the transmission from node $A$ to $B$, where node $A$ is in transmit mode and node $B$ is in receive mode. Then the system allocates a duration

*T*

_{ ba }for the transmission from node $B$ to $A$, where node $A$ is in receive mode and node $B$ is in transmit mode. After the

*B*

_{ ab }and

*B*

_{ ba }bits are transmitted, the system turns into idle status during

*T*-

*T*

_{ ab }-

*T*

_{ ba }, where both nodes $A$ and $B$ are in idle mode. The EC of DT can be obtained as

where ${P}_{D}^{c1}\triangleq {P}_{a}^{ct}+{P}_{b}^{cr}$ and ${P}_{D}^{c2}\triangleq {P}_{b}^{ct}+{P}_{a}^{cr}$ are, respectively, the total circuit PCs in $A\to B$ and $B\to A$ transmission, and ${P}_{D}^{ci}\triangleq {P}_{a}^{ci}+{P}_{b}^{ci}$ is the total circuit PC in idle duration.

*T*

_{ ab }and

*T*

_{ ba }, nodes $A$ and $B$ should, respectively, transmit with data rates of

*B*

_{ ab }/

*T*

_{ ab }and

*B*

_{ ba }/

*T*

_{ ba }bits-per-second (bps) to exchange the

*B*

_{ ab }and

*B*

_{ ba }bits messages, which are given by Shannon capacity formula as

Since Shannon capacity formula represents the maximum achievable data rates under given transmit powers, the transmit power derived via this formula is the minimum transmit power that can support the required data rates. As a result, we can analyze the maximal EE for a given SE. We will also use the Shannon capacity formula to represent the relationship between data rates and transmit powers in OWRT and TWRT cases later.

### 3.2 One-way relay transmission

In OWRT, each of the $A\to B$ and $B\to A$ transmission is divided into two hops, thus the bidirectional transmission needs four phases, as shown in Figure 2b. For example, in $A\to B$ transmission, node $A$ transmits to RN in the first phase, and RN transmits to node $B$ in the second phase. With the AF relay protocol, the two phases in each direction employ identical time duration. For simplifying the analysis, we do not consider the direct link in OWRT. Although this will degrade the performance of OWRT, we will show later that it does not affect our comparison results for the EE.

*T*

_{ ab }for $A\to B$ transmission. During the first half of

*T*

_{ ab }, node $A$ transmits to RN, and thus node $A$ is in transmit mode, node ℝ is in receive mode, and node $B$ is idle. During the second half of

*T*

_{ ab }, RN forwards the information to node $B$, and thus node ℝ is in transmit mode, node $B$ is in receive mode, and node $A$ is idle. Then, the system allocates a duration

*T*

_{ ba }for $B\to A$ transmission. Finally, the system turns into idle status during

*T - T*

_{ ab }

*- T*

_{ ba }after the bidirectional transmission. The EC of OWRT can be obtained as

where ${P}_{r1}^{t}$ and ${P}_{r2}^{T}$ are, respectively, the relay transmit powers in $A\to B$ and $B\to A$ links, ${P}_{O}^{c1}\triangleq \left({P}_{a}^{ct}+{P}_{r}^{cr}+{P}_{b}^{ci}+{P}_{r}^{ct}+{P}_{b}^{cr}+{P}_{a}^{ci}\right)/2$ and ${P}_{O}^{c2}\triangleq \left({P}_{b}^{ct}+{P}_{r}^{cr}+{P}_{a}^{ci}+{P}_{r}^{ct}+{P}_{a}^{cr}+{P}_{b}^{ci}\right)/2$ are, respectively, the overall circuit PCs in $A\to B$ and $B\to A$ transmission, and ${P}_{O}^{ci}\triangleq {P}_{a}^{ci}+{P}_{b}^{ci}+{P}_{r}^{ci}$ is the overall circuit PC in idle duration where all three nodes operate in idle mode.

where the factor 1*/* 2 is due to the two-phase transmission in each direction.

### 3.3 Two-way relay transmission

In TWRT, the bidirectional transmission is completed in two phases, as shown in Figure 2c. In the first phase, both nodes $A$ and $B$ transmit to RN, where the nodes $A$ and $B$ are in transmit mode and the node ℝ is in receive mode. In the second phase, RN broadcasts its received signal to the nodes $A$ and $B$, where the node ℝ is in transmit mode, and the nodes $A$ and $B$ are in receive mode. After receiving the superimposed signal, each of the source nodes $A$ and $B$ removes its own transmitted signal via self-interference cancelation [8], and obtains its desired signal sent from the other source node. The two phases employ identical durations as in OWRT.

*T*

_{TWR}to the bidirectional transmission, and then turns into idle status during

*T*-

*T*

_{TWR}. The EC of TWRT is obtained as

where ${P}_{T}^{c}\triangleq \left({P}_{a}^{ct}+{P}_{b}^{ct}+{P}_{r}^{cr}+{P}_{r}^{ct}+{P}_{a}^{cr}+{P}_{b}^{cr}\right)/2$ and ${P}_{T}^{ci}\triangleq {P}_{a}^{ci}+{P}_{b}^{ci}+{P}_{r}^{ci}$ are the overall circuit PCs in the bidirectional transmission duration and the idle duration, respectively.

where the factor 1*/* 2 is due to the two-phase transmission.

## 4 Energy efficiency optimization for three transmit strategies

where *E* is the EC per block, which respectively equals to *E*_{
D
}, *E*_{
O
}or *E*_{
T
}in DT, OWRT, or TWRT.

To guarantee a fair comparison, we maximize the EEs of DT, OWRT, and TWRT with the same packet sizes *B*_{
ab
}and *B*_{
ba
}. From the definition of *η*_{EE}, we see that EE maximization is equivalent to EC minimization for a given pair of *B*_{
ab
}and *B*_{
ba
}. Consequently, we will minimize the EC per block for different strategies by optimizing transmission time and power of each node.

We consider that the transmission time should not exceed the duration of a block *T*, and the transmit power of each node should be less than the maximum transmit power ${P}_{\text{max}}^{t}$. Note that the system may not be able to transmit *B*_{
ab
}and *B*_{
ba
}bits within the duration *T* even if the maximum transmit power is used. In this case an outage occurs. Since we assume perfect channel knowledge at each node, the nodes can estimate the transmit power and the transmission time required for each block, which depend on the channel distribution and packet sizes *B*_{
ab
}and *B*_{
ba
}. Once the channel statistics and the packet sizes are given, the outage probability is fixed. In practice, the packet sizes *B*_{
ab
}and *B*_{
ba
}can be pre-determined according to the quality of service (QoS) requirements, channel environment, and the acceptable outage probability. We will use Monte-Carlo simulation to find the maximal *B*_{
ab
}and *B*_{
ba
}that ensure the outage probability to be lower than a threshold, e.g., 10%. Then, we only need to consider the EE optimization when the packet sizes are smaller than the maximum *B*_{
ab
}and *B*_{
ba
}.

### 4.1 Direct transmission

*T*

_{ ab }and

*T*

_{ ba }. The EC can be minimized by jointly optimizing the transmit powers and transmission time as follows,

*T*

_{ ab }and

*T*

_{ ba }by using (2), which are respectively,

The minimum value constraints on *T*_{
ab
}and *T*_{
ba
}are due to the transmit power constraints, without which the data rates *B*_{
ab
}*/T*_{
ab
}and *B*_{
ba
}*/T*_{
ba
}will be too high to be supported even with the maximal transmit powers.

Note that the problem in (12) is equivalent to the joint optimization problem in (10), where now only the transmission time needs to be optimized. In the objective function of the problem in (12), the first term is a function of *T*_{
ab
}and not related to *T*_{
ba
}. It is easy to show that its second order derivative with respect to *T*_{
ab
}is positive. Thus it is a convex function of *T*_{
ab
}. Similarly, the second term in the objective function is a convex function of *T*_{
ba
}. The last term is independent of the transmission time. Therefore, the objective function is convex with respect to *T*_{
ab
}and *T*_{
ba
}. All the constraints in (12) are also convex.^{a} Then the problem can be solved by using efficient convex optimization techniques, such as gradient descent algorithm [24].

### 4.2 One-way relay transmission

Similar to the DT case, we first express the transmit powers as functions of the transmission time using (4) and (5). Then the joint optimization of transmit power and transmission time can be solved with two steps: first find the optimal transmit powers as functions of the transmission time, then optimize the transmission time to minimize the EC.

*T*

_{ ab }, both ${P}_{a}^{t}$ and ${P}_{b}^{t}$ can be obtained from (4), where multiple feasible solutions exist. In order to minimize the EC, we find the transmit powers that minimize the sum power as follows,

*B*

_{ ab }

*/T*

_{ ab }should be less than the maximum data rate supported by the maximum transmit power. This turns into a minimum value constraint for the transmit time, which is

*P*

_{min1}(

*T*

_{ ab }), where

*T*

_{ ab }≥

*T*

_{min1}. It can be derived as a piecewise function as follows (see Appendix 1),

where ${C}_{1}\triangleq {2}^{2{B}_{ab}/\left({T}_{ab}W\right)}-1$, the demarcation points *T*_{d 1}and *T*_{d 2}are defined in Appendix 1. If *T*_{d 1}≥ *T*_{d 2}, *P*_{min1}(*T*_{
ab
}) follows (16), otherwise, it follows (17).

The piecewise function can be explained as follows. When *T*_{
ab
}is large, the data rate is low and both ${P}_{a}^{t}$ and ${P}_{r1}^{t}$ are below their maximum value, then *P*_{min1}(*T*_{
ab
}) follows the second part in (16) or (17). As *T*_{
ab
}decreases, one of ${P}_{a}^{t}$ and ${P}_{r1}^{t}$ will achieve its maximum value. When *T*_{
ab
}= *T*_{d 1}, we have ${P}_{r1}^{t}={P}_{\text{max}}^{t}$, and when *T*_{
ab
}= *T*_{d 2}, ${P}_{a}^{t}={P}_{\text{max}}^{t}$. If *T*_{d 1}≥ *T*_{d 2}, ${P}_{r1}^{t}$ achieves its maximum value first, *P*_{min1}(*T*_{
ab
}) follows the first part in (16). Otherwise, ${P}_{a}^{t}$ achieves its maximum value first, *P*_{min1}(*T*_{
ab
}) follows the first part in (17). When *T*_{
ab
}decreases to *T*_{min1}, both ${P}_{a}^{t}$ and ${P}_{r1}^{t}$ achieve the maximum value. For simplicity, we refer the first part in (16) or (17) as "one-max" interval, because one of the nodes uses its maximum transmit power. We refer the second part in (16) or (17) as "non-max" interval, since neither of the nodes uses its maximum transmit power.

*T*

_{ ba }, we can also find the values of ${P}_{b}^{t}$ and ${P}_{r2}^{t}$ that minimize their summation. Following an analogous procedure, the minimum value of ${P}_{b}^{t}+{P}_{r2}^{t}$ denoted as

*P*

_{min2}(

*T*

_{ ba }) can be derived as a piecewise function of transmission time

*T*

_{ ba }, which are respectively,

where ${C}_{2}\triangleq {2}^{2{B}_{ba}/\left({T}_{ba}W\right)}-1$, the demarcation points *T*_{d 3}and *T*_{d 4}can be derived similarly as *T*_{d 1}and *T*_{d 2}in *P*_{min1}(*T*_{
ab
}). If *T*_{d 3}≥ *T*_{d 4}, *P*_{min2}(*T*_{
ba
}) follows (18), otherwise, it follows (19). The minimum value constraint for *T*_{
ba
}, i.e., *T*_{
ba
}≥ *T*_{min2}, is also due to the maximum transmit power constraint like that for *T*_{
ab
}in (15), and *T*_{min2} can be derived similarly as *T*_{min1}.

We can show that the first term in the objective function is a quasi-convex function of *T*_{
ab
}(see Appendix 2). Similarly, the second term is a quasi-convex function of *T*_{
ba
}. The last term is a constant. However, the sum of two quasi-convex functions may not be quasi-convex. Therefore, we solve this problem using the following approach.

First, we assume that the optimal solution for (20) satisfies ${T}_{ab}^{\text{opt}}+{T}_{ba}^{\text{opt}}<T$. In this case, the first constraint in (20) can be omitted. Since the second constraint is only related to *T*_{
ab
}, and the last constraint is only related to *T*_{
ba
}, the joint optimization problem can be decoupled into two subproblems, i.e., optimizing *T*_{
ab
}to minimize the first term in objective function with the constraint *T*_{
ab
}≥ *T*_{min1}, and optimizing *T*_{
ba
}to minimize the second term in objective function with the constraint *T*_{
ba
}≥ *T*_{min2}. Because we have proved that the first two terms in the objective function are, respectively, quasi-convex functions with respect to *T*_{
ab
}and *T*_{
ba
}, both the two subproblems can be solved via quasi-convex optimization techniques such as bisection algorithm [24].

If the optimized *T*_{
ab
}and *T*_{
ba
}from the two subproblems satisfy ${T}_{ab}^{\text{opt}}+{T}_{ba}^{\text{opt}}<T$, then our assumption holds, and we obtain the optimal transmission time. Otherwise, the optimal solution for (20) must satisfy ${T}_{ab}^{\text{opt}}+{T}_{ba}^{\text{opt}}=T$. In this case, we only need to find the optimal ${T}_{ab}^{\text{opt}}$, where a scalar searching is applied, and the optimal ${T}_{ba}^{\text{opt}}$ can be obtained as ${T}_{ba}^{\text{opt}}=T-{T}_{ab}^{\text{opt}}$.

### 4.3 Two-way relay transmission

Analogous to the previous sections, we first derive the transmit powers as functions of the transmission time.

*T*

_{TWR}, we can find ${P}_{a}^{t},{P}_{b}^{t}$, and ${P}_{r}^{t}$ from (7) and (8), where multiple feasible solutions exist. To minimize the EC, again we find ${P}_{a}^{t},{P}_{b}^{t}$, and ${P}_{r}^{t}$ that minimize their summation from the following problem,

Following a similar derivation as in the case of OWRT, the minimum value of ${P}_{a}^{t}+{P}_{b}^{t}+{P}_{r}^{t}$ can be obtained as a piecewise function of the transmission time *T*_{TWR}, which is denoted as *P*_{min}(*T*_{TWR}).

*T*

_{TWR}is large, the data rates

*B*

_{ ab }

*/T*

_{TWR}and

*B*

_{ ba }

*/T*

_{TWR}are low, and all transmit powers are below their maximum values. The optimal transmit powers are derived with similar method in Appendix 1 as follows,

where ${C}_{1}\triangleq {2}^{\frac{2{B}_{ab}}{W{T}_{\text{TWR}}}}-1$ and ${C}_{2}\triangleq {2}^{\frac{2{B}_{ba}}{W{T}_{\text{TWR}}}}-1$. The corresponding *P*_{min}(*T*_{TWR}) is the sum of (22a), (22b), and (22c).

*T*

_{TWR}decreases, the data rates increases, then ${P}_{a}^{t-\text{opt}},{P}_{b}^{t-\text{opt}}$, and ${P}_{r}^{t-\text{opt}}$ increase until one of them achieves the maximum value ${P}_{\text{max}}^{t}$. By setting (22a), (22b), and (22c) to be ${P}_{\text{max}}^{t}$, respectively, we can obtain

*T*

_{TWR}=

*T*

_{d 1}when ${P}_{a}^{t-\text{opt}}={P}_{\text{max}}^{t}$,

*T*

_{TWR}= T

_{d 2}when ${P}_{b}^{t-\text{opt}}={P}_{\text{max}}^{t}$, and

*T*

_{TWR}=

*T*

_{d 3}when ${P}_{r}^{t-\text{opt}}={P}_{\text{max}}^{t}$. Without loss of generality, we assume that

*T*

_{d 1}≥

*T*

_{d 2}and

*T*

_{d 1}≥

*T*

_{d 3}(similar results can be obtained for other cases). In this case, ${P}_{a}^{t-\text{opt}}$ achieves the maximum value first, i.e., node $A$ transmits with the maximum transmit power. By substituting ${P}_{a}^{t}={P}_{\text{max}}^{t}$ into (7) and (8), we have

The corresponding *P*_{min}(*T*_{TWR}) can be obtained by adding (23a), (23b), and (23c).

*T*

_{TWR}further decreases, the data rates further increases, ${P}_{b}^{t-\text{opt}}$ and ${P}_{r}^{t-\text{opt}}$ in (23) increase until one of them achieves its maximum value. Without loss of generality, assume that ${P}_{b}^{t-\text{opt}}$ in (23b) achieves ${P}_{\text{max}}^{t}$ first. The corresponding value of

*T*

_{TWR}is denoted as

*T*

_{min}, which can be obtained by setting (23b) to be ${P}_{\text{max}}^{t}$. Then both nodes $A$ and $B$ transmit with the maximum power. Substituting ${P}_{a}^{t}={P}_{b}^{t}={P}_{\text{max}}^{t}$ into (7) and (8), we need to find one ${P}_{r}^{t}$ from two equations, which has no solution. Therefore,

*T*

_{min}is the minimum value of

*T*

_{TWR}due to the maximum transmit power constraint. Finally, the minimal sum transmit power is obtained as

where its first and second parts are, respectively, referred to as "one-max" and "non-max" interval for simplicity as that in the case of OWRT.

Using the similar method in Appendix 2, we can prove that the objective function is a quasi-convex function of *T*_{TWR}. Therefore, efficient quasi-convex optimization techniques [24] can be applied to solve the problem.

## 5 Energy efficiency analysis

In this section, we compare the EEs of different transmit strategies, and analyze the impact of various channels and system settings.

From the objective functions in (20) and (25), we can see that the expressions of the ECs of OWRT and TWRT are quite complex because the minimal sum transmit powers are piecewise functions with very complicated expressions, i.e., (16), (17), (18), (19), and (24). To gain useful insight into the EE comparison, we consider the following two approximations.

*Approximation 1: In the piecewise functions of P*_{min1}(*T*_{
ab
})*, P*_{min2}(*T*_{
ba
})*, and P*_{min}(*T*_{TWR}), *we only consider the "non-max" interval, where none of the nodes achieves its maximum transmit power.*

We take the function *P*_{min1}(*T*_{
ab
}) in (16) as an example to explain the approximation. In the "non-max" interval, as transmission time *T*_{
ab
}decreases, both transmit powers at nodes $A$ and $B$, i.e., ${P}_{a}^{t}$ and ${P}_{r1}^{t}$, increase for supporting the increased data rate *B*_{
ab
}/*T*_{
ab
}. In the "one-max" interval, ${P}_{r1}^{t}$ has achieved its maximum value. As *T*_{
ab
}decreases, only ${P}_{a}^{t}$ can increase to support the increased data rate, thus ${P}_{a}^{t}$ grows much faster than that in "non-max" interval and approaches its maximum value rapidly. Therefore, the range (*T*_{min1},*T*_{d 1}) of the "one-max" interval is very short, and in most cases the optimized ${T}_{ab}^{\text{opt}}\notin \left({T}_{\text{min}1},{T}_{d1}\right)$. Instead, $\begin{array}{c}{T}_{ab}^{\text{opt}}\in \left({T}_{d1},+\infty \right)\end{array}$. Based on this observation, we only consider the "non-max" interval in range (*T*_{d 1}, +∞).

Since we only consider the case where none of the nodes achieve its maximal transmit power, we do not need to consider the maximum transmit power constraints. Therefore it is not necessary to consider the corresponding minimum value constraints on the transmission time in this section.

*Approximation 2: In the expressions of P*

_{min1}(

*T*

_{ ab })

*, P*

_{min2}(

*T*

_{ ba })

*, and P*

_{min}(

*T*

_{TWR})

*, we respectively consider that*

We take (26a) as an example to explain the approximation, which affects the values of the transmit power *P*_{min1}(*T*_{
ab
}) and *P*_{min2}(*T*_{
ba
}) in OWRT. When the SEs in two directions, i.e., *B*_{
ab
}/(*WT*_{
ab
}) and *B*_{
ba
}/(*WT*_{
ba
}) are high, it is easy to see that the approximations in (26a) are accurate. On the other hand, when the SEs are low, the transmit powers *P*_{min1}(*T*_{
ab
}) and *P*_{min2}(*T*_{
ba
}) are much lower than the circuit PC. Then the approximations on transmit powers have little impact on the analysis of EC.

where $\left|{h}_{\text{e}}\right|\triangleq 1/\left(\frac{1}{\left|{h}_{ar}\right|}+\frac{1}{\left|{h}_{br}\right|}\right)$ can be viewed as an equivalent channel gain between two source nodes due to the usage of the relay.

### 5.1 Baseline case

*P*

^{ ct }=

*P*

^{ cr }=

*P*

^{ ci }= 0 and ${B}_{ab}={B}_{ba}\triangleq B$. Then the ECs of OWRT, TWRT, and DT shown in (27), (28), and (29) are decreasing functions of the transmission time. As a result, the system will use the entire duration

*T*for transmission. Due to the symmetric packet sizes, the optimal values of

*T*

_{ ab }and

*T*

_{ ba }are identical in DT and OWRT. This means that the optimal transmission time in DT and OWRT are ${T}_{ab}^{\text{opt}}={T}_{ba}^{\text{opt}}=T/2$, and that in TWRT is ${T}_{\text{TWR}}^{\text{opt}}=T$. After substituting the optimal transmission time into (27), (28), and (29), the minimum ECs can be obtained as

from which we can see that the optimal EE, ${\eta}_{\text{EE}}^{\text{opt}}=\frac{2B}{{E}^{\text{min}}}$, is a decreasing function of the packet size *B* in the three strategies. This implies that the maximal EE is achieved when *B* approaches zero.

Now, we compare the EEs of the three strategies. First, it shows from (30) that ${E}_{O}^{\text{min}}/{E}_{T}^{\text{min}}\ge 1$, which means that TWRT is more energy efficient than OWRT.

*h*

_{eff}| and the direct link channel gain |

*h*

_{ ab }|. If |

*h*

_{eff}| > |

*h*

_{ ab }|, TWRT is more energy efficient, otherwise, DT is more energy efficient. To gain further insight into this comparison, we consider an AWGN channel,

^{b}where |

*h*

_{ ab }|

^{2}is normalized as 1, the distance from the RN to nodes $A$ and $B$ are, respectively,

*d*and 1 -

*d*. Then ${\left|{h}_{ar}\right|}^{2}={\left(\frac{1}{d}\right)}^{\alpha}$ and ${\left|{h}_{br}\right|}^{2}={\left(\frac{1}{1-d}\right)}^{\alpha}$, where

*α*is the path loss attenuation factor. Then the equivalent channel gain becomes

which is related to the RN position. To maximize |*h*_{eff}|, the optimal relay position is the midpoint of the two source nodes, i.e., *d* = 0.5. In this case, |*h*_{eff}| = 2^{α/2}/2. When *α* > 2, which is true in most practical channel environments, |*h*_{eff}| = 2^{α/ 2}/2 > |*h*_{
ab
}| = 1, and TWRT is more energy efficient than DT.

If |*h*_{eff}| ≤ |*h*_{
ab
}|, $\text{since}\frac{2}{{2}^{\frac{2B}{WT}}+1}\le 1$ we have ${E}_{D}^{\text{min}}/{E}_{O}^{\text{min}}\le 1$, i.e., DT is more energy efficient than OWRT.

If |*h*_{eff}| > |*h*_{
ab
}|, the comparison result depends on the packet size *B.* When $B\to 0,\frac{2}{{2}^{\frac{2B}{WT}}+1}\to 1$, then ${E}_{D}^{\text{min}}/{E}_{O}^{\text{min}}\to {\left|{h}_{\text{e}}\right|}^{2}/{\left|{h}_{ab}\right|}^{2}\ge 1$. It means that in low traffic region, OWRT is more energy efficient. When $B\to \infty ,\frac{2}{{2}^{\frac{2B}{WT}}+1}\to 0$, then ${E}_{D}^{\text{min}}/{E}_{O}^{\text{min}}\to 0<1$. It means that in high traffic region, DT is more energy efficient. An intuitive explanation is as follows. On one hand, OWRT needs two-phase for transmission in each direction, thus the data rate in each phase should be twice of that in DT, which requires more transmit power. On the other hand, OWRT has higher equivalent channel gain, which reduces the required transmit power. In low traffic region, doubling the lower data rate has little impact on the transmit power, and thus OWRT is more energy efficient due to higher equivalent channel gain.

Here we argue that even if OWRT exploits the direct link between $A$ and $B$ for spatial diversity, the conclusion will still be the same. With the direct link, the equivalent channel gain can be improved. However, the improvement is rather limited in most cases, because the signal attenuation between the two source nodes is much larger than that between the source nodes and the RN. Furthermore, OWRT has 1/2 spectral efficiency loss with respect to DT and TWRT, which cannot be recovered from the SNR gain.

### 5.2 Impact of circuit power consumption

In this subsection we assume symmetric packet size, i.e., *B*_{
ab
}= *B*_{
ba
}= *B*, but consider the non-zero circuit PCs in practical systems. Then the ECs in (27), (28), and (29) are no longer monotonically decreasing functions of the transmission time. With the increase of the transmission time, the transmit energy decreases since the required data rate reduces, however, the circuit energy increases linearly. We take TWRT as an example to analyze the EE.

*E*

_{ T }in (28) with respect to

*T*

_{TWR}and setting it to be zero, which is

where ${\eta}_{\text{SE}-\text{T}}\triangleq \frac{2B}{W{T}_{\text{TWR}}}$ is the bidirectional SE of TWRT.

Although it is difficult to obtain a closed form solution of the optimal *T*_{TWR}, some observations can be obtained from (33). The optimal SE that minimizes the EC should satisfy (33c), from which we can see that ${\eta}_{\text{EE}-\text{T}}^{\text{opt}}$ does not depend on the packet size *B.* Therefore, the optimal transmission time ${T}_{\text{TWR}}^{\text{opt}}=\frac{2B}{W{\eta}_{\text{SE}-\text{T}}^{\text{opt}}}$ increases linearly with *B*. Considering that *T*_{TWR} should not exceed the time duration of a block *T*, we obtain the following observation.

*Observation 1:* In high traffic region, ${T}_{\text{TWR}}^{\text{opt}}=T$. In low traffic region where $\frac{2B}{W{\eta}_{\text{SE}-\text{T}}^{\text{opt}}}\le T$, the optimal transmission time ${T}_{\text{TWR}}^{\text{opt}}=\frac{2B}{W{\eta}_{\text{SE}-\text{T}}^{\text{opt}}}$ increases linearly with the packet size *B.*

In high traffic region, the transmission time ${T}_{\text{TWR}}^{\text{opt}}=T$, then the bidirectional $\text{SE}\frac{2B}{WT}$ increases linearly with the packet size *B*, thus the transmit energy increases exponentially with *B* according to the capacity formula. In this case, the transmit EC is much larger than the circuit EC, thus the EE will be almost the same as that in zero circuit PC scenario.

where the first equality comes from the fact that (33b) equals to zero, and the second equality comes from ${T}_{\text{TWR}}^{\text{opt}}=\frac{2B}{W{\eta}_{\text{SE}-\text{T}}^{\text{opt}}}$.

*B*

_{ ab }=

*B*

_{ ba }=

*B*and ${T}_{\text{TWR}}={T}_{\text{TWR}}^{\text{opt}}$ into the EC of TWRT in (28), and then substituting (34), the minimum EC of TWRT can be obtained as

from which we can obtain the following observation.

*Observation 2:* In low traffic region, if the circuit PC in idle mode ${P}_{T}^{ci}=0$, we have ${\eta}_{\text{EE}-\text{T}}^{\text{opt}}=\frac{\epsilon {\left|{h}_{\text{e}}\right|}^{2}W}{{N}_{0}\left(\text{ln}2\right){2}^{{\eta}_{\text{SE}-\text{T}}^{\text{opt}}}}$. Since we have shown that ${\eta}_{\text{SE}-\text{T}}^{\text{opt}}$ does not depend on the packet size *B*, ${\eta}_{\text{EE}-\text{T}}^{\text{opt}}$ also does not change with *B* in this case. If ${P}_{T}^{ci}\ne 0$, $\underset{B\to 0}{\text{lim}}{\eta}_{\text{EE}-\text{T}}^{\text{opt}}=0$ since a large portion of energy is consumed in the idle duration.

Note that although $\underset{B\to 0}{\text{lim}}{\eta}_{\text{EE}-\text{T}}^{\text{opt}}=0$ due to the non-zero idle mode circuit PC, this observation does not mean that the idle duration is unnecessary. If the system transmits with the entire duration *T*, where $T>{T}_{\text{TWR}}^{\text{opt}}$, it can save the EC in idle mode, but it wastes more EC in transmission mode because it does not transmit with the optimal transmission time. Finally, more energy will be consumed and the EE will be reduced. We will show this impact later in simulations.

*B*in low traffic region, where $\frac{2B}{W{\eta}_{\text{SE}-\text{T}}^{\text{opt}}}\le T$, i.e., $B\le TW{\eta}_{\text{SE}-\text{T}}^{\text{opt}}/2$. In other words, EE is insensitive to the packet size when $B\in \left(0,TW{\eta}_{\text{SE-T}}^{\text{opt}}/2\right)$. We can show that such a region becomes wider as the circuit power ${P}_{T}^{c}$ increases. By taking derivative with respect to ${P}_{T}^{c}$ at both side of (33c), we obtain

from which we can see that $\frac{d{\eta}_{\text{SE}-\text{T}}^{\text{opt}}}{d{P}_{T}^{c}}=\frac{\epsilon {\left|{h}_{\text{e}}\right|}^{2}}{{N}_{0}{\left(\text{ln}2\right)}^{2}{2}^{{\eta}_{\text{SE}-\text{T}}^{\text{opt}}}{\eta}_{\text{SE}-\text{T}}^{\text{opt}}}\ge 0$, i.e., as the circuit power ${P}_{T}^{c}$ increases, ${\eta}_{\text{SE}-\text{T}}^{\text{opt}}$ increases, and then the region $\left(0,TW{\eta}_{\text{SE}-\text{T}}^{\text{opt}}/2\right)$ extends.

where ${\eta}_{SE-D1}^{\text{opt}}$ and ${\eta}_{SE-D2}^{\text{opt}}$ are the optimal SEs in $A\to B$ and $B\to A$ directions in DT, ${\eta}_{SE-O1}^{\text{opt}}$ and ${\eta}_{SE-O2}^{\text{opt}}$ are those in OWRT, none of them depends on the packet size *B.* We omit the detailed derivations for concise.

Since it is difficult to derive closed form expressions for the optimal transmission time and the optimal SEs, there are also no closed form expressions for the optimal EEs. We will use simulations to compare the EEs of DT, OWRT, and TWRT under non-zero circuit PCs.

### 5.3 Impact of unequal data amounts in two directions

In this section, we assume that the circuit PCs are identical at each node, and consider that the packet sizes in two directions differ. Define *B*_{
ab
}= *βB*_{
s
}and *B*_{
ba
}= (1 - *β*)*B*_{
s
}, where *B*_{
s
}is the overall number of bits to be transmitted in two directions, and *β* is a factor to reflect the traffic asymmetry. We will show that once *B*_{
s
}is given, the minimum ECs of DT and OWRT are independent of *β*, but the EC of TWRT is minimized when *β* = 0.5. In other words, the asymmetric packet sizes in two directions only reduces the EE of TWRT.

**Proposition 1**. *The minimum EC of OWRT does not depend on β.*

*Proof.*Since, we assume ${P}_{O}^{c1}={P}_{O}^{c2}\triangleq {P}_{O}^{c}$, the EC of OWRT in (27) can be rewritten as

*R*

_{ O }is not a function of

*β.*Then the minimum

*E*

_{ O }can be obtained as follows by substituting (40) into (39),

which is not a function of *β.*

This proposition is easy to understand intuitively. Because with the optimized transmission time, the OWRT system transmits with the same data rate on each direction, and each bit is transmitted with identical data rate *R*_{
O
}and thus with identical time duration 1/*R*_{
O
}. Therefore, the energy consumed by each bit is identical no matter in which direction it is transmitted. Then the minimum EC only depends on the overall number of transmitted bits *B*_{
s
}.

The minimum EC of DT, ${E}_{D}^{\text{min}}$, can be obtained in a similar way, which also does not depend on *β.* We do not show the results for concise.

**Proposition 2**. *The minimum EC of TWRT is a function of β, and its minimum value is achieved when β* = 0.5.

*Proof.*The EC of TWRT in (28) can be rewritten as,

^{c}the EC becomes

Note that the only difference of *E*_{
T
}and *E*_{T 1}is the transmission time in their first and second terms. With less constraints on the transmission time, the minimum value of *E*_{T 1}achieved by optimizing *T*_{TWR1} and *T*_{TWR2} is a lower bound of the minimum value of *E*_{
T
}by optimizing *T*_{TWR}, i.e., ${E}_{T}^{\text{min}}=\underset{{T}_{\text{TWR}}}{\text{min}}\left({E}_{T}\right)\ge \underset{{T}_{\text{TWR1}},{T}_{\text{TWR2}}}{\text{min}}\left({E}_{T1}\right)={E}_{T1}^{\text{min}}$.

Following the analogous procedure as we analyze the OWRT system, we can show that ${E}_{T1}^{\text{min}}$ is not a function of *β.* Moreover, using similar method as in Appendix 3, we can prove that the optimal *T*_{TWR1} and *T*_{TWR2} that minimize (43) satisfy $\frac{\beta {B}_{s}}{{T}_{\text{TWR1}}^{\text{opt}}}=\frac{\left(1-\beta \right){B}_{s}}{{T}_{\text{TWR2}}^{\text{opt}}}$. It suggests that only when *β* = 0.5, ${T}_{\text{TWR1}}^{\text{opt}}={T}_{\text{TWR2}}^{\text{opt}}$. In this case, by choosing ${T}_{\text{TWR}}={T}_{\text{TWR1}}^{\text{opt}}={T}_{\text{TWR2}}^{\text{opt}}$, *E*_{
T
}in (42) equals to ${E}_{T1}^{\text{min}}$. Therefore, only when *β* = 0.5, ${E}_{T1}^{\text{min}}$ equals to its lower bound ${E}_{T1}^{\text{min}}$. Then proposition 2 is true.

## 6 Simulation results

In this section, we evaluate the EEs of the three transmission strategies, DT, OWRT, and TWRT, and validate previous analysis via simulations.

List of important parameters

Symbol | Definition | Simulation setting |
---|---|---|

| Distance between source nodes $A$ and $B$ | 100 m |

| Distance between $A$ and $B$ to relay node ℝ | 50 m |

PL | Path loss attenuation | 30 + 10log |

| Path loss attenuation factor | 2, 4 |

| Noise power at each node | -94 dBm |

| Bandwidth | 10MHz |

| Block duration | 5 ms |

| Packet sizes in two directions | ≥ 0 |

| Power amplifier efficiency | 0.35 |

${P}_{\text{max}}^{t}$ | Maximum transmit power | 40dBm |

${P}_{a}^{ct},{P}_{b}^{ct},{P}_{r}^{ct}$ | Circuit power in transmit mode at each node | From 0 to hundreds of mW |

${P}_{a}^{ct},{P}_{b}^{ct},{P}_{r}^{ct}$ | Circuit power in receive mode at each node | From 0 to hundreds of mW |

${P}_{a}^{ct},{P}_{b}^{ct},{P}_{r}^{ct}$ | Circuit power in idle mode at each node | From 0 to hundreds of mW |

The increase of distance *D*, noise power *N*_{0}, and attenuation factor *α* all result in higher required transmit power. Since their impacts are similar, we only show the impact of *α*. Because the increase of block duration *T* is equivalent to a reduction of the transmitted bits number per unit of time, we set *T* as a constant and change the values of *B*_{
ab
}and *B*_{
ba
}.

From [6, 21], the circuit PCs in practical systems usually range from dozens to hundreds of mW. Therefore, we set the circuit PCs in this range in the simulations. The power amplifier efficiency e is set as 0.35 [21].

### 6.1 Baseline case

We first compare the EEs of different strategies in the baseline case where the circuit PCs are zero and the packet sizes *B*_{
ab
}= *B*_{
ba
}.

*α*as 2 or 4. Since we are more interested in comparing the EEs rather than showing their absolute values, we normalize the EEs by the maximum EE of DT system for each

*α.*The normalized EE is shown in Figure 3, and the corresponding outage probability is shown in Figure 4. The

*x*-axis is the overall number of transmitted bits in two directions normalized by the block duration and bandwidth, i.e., (

*B*

_{ ab }+

*B*

_{ ba })/(

*TW*), which can be viewed as the average bidirectional SE per block.

^{d}

In Figure 3, because of the normalization, the EE curves of DT under different *α* overlap. It shows that the spectral efficient strategy TWRT is also energy efficient with respect to OWRT. When the attenuation factor is large, i.e., *α* = 4, the EE of TWRT is higher than that of DT, while when *α* = 2 the result is just the opposite. The comparison between DT and OWRT depends both on the packet size and the channel condition. When *α* = 2, DT always outperforms OWRT. When *α* = 4, OWRT is superior to DT in low traffic region, but is inferior to DT in high traffic region. All these results agree well with our analysis.

Figure 4 shows that when *α* = 2 the outage probabilities of DT, OWRT, and TWRT are zero for the considered packet sizes. When *α* = 4, the outage probabilities all increase. We see that TWRT offers lowest outage probability, and thus can support larger packet size given the same outage probability.

Since we only consider the case where the outage probability is lower than an acceptable threshold, say 10%, the EE curves of OWRT or DT when *α* = 4 is only plotted for the scenarios where (*B*_{
ab
}+ *B*_{
ba
})/(*TW*) is lower than 4 or 4.4 bits/s/Hz in Figure 3. In the following sections, we use the same method to determine the maximal packet sizes for DT, OWRT and TWRT, which ensure the outage probability to be lower than 10%.

### 6.2 Non-zero circuit power consumption

As expected, the non-zero circuit PC reduces the EE. It shows that the circuit PC only affects the EE in low traffic region, i.e., in low SE region. While in high SE region, since the transmit PC is much higher than the circuit PC, the EEs are almost the same for different circuit PCs. That is to say, the high and low SE regions are, respectively, "transmit power dominant" and "circuit power dominant".

When we assume the circuit PC in idle mode *P*^{
ci
}= 0, i.e., there exists an idle duration but its PC can be ignored, the EE does not change with SE in the "circuit power dominant" region. As the circuit PCs in the transmit and receive modes *P*^{
ct
}and *P*^{
cr
}increase, this region becomes wider.

When *P*^{
ci
}≠ 0, the EE reduces to zero as the packet size decreases. Comparing the lowest two curves where *P*^{
ci
}= 10 mW, we can see that the EE will decrease if we do not consider the idle duration, i.e., do not optimize the transmission time. Moreover, it is shown that when the PC in idle mode is not negligible, there is a non-zero optimal packet size that maximizes the maximal EE.

All these results agree with our earlier analytical analysis. We do not show the results of OWRT and DT, which are similar as those of TWRT.

*α*= 4. It shows that the EE of TWRT is always higher than that of OWRT. Since the path loss is severe, TWRT outperforms DT. OWRT is superior to DT in low traffic region, but becomes inferior in high traffic region. These results are the same as those in zero circuit PC scenario.

From Figure 6, we see that the idle mode circuit power *P*^{
ci
}only affects the energy efficiencies in low traffic region, and the comparison result among different strategies will not change no matter *P*^{
ci
}is zero or not. Since the different EE curves are more distinguishable when the circuit power in idle mode is zero, in the following we set the circuit power in idle status *P*^{
ci
}= 0 mW. Note that the circuit powers in transmit and receive modes *P*^{
ct
}and *P*^{
cr
}are still non-zero.

*k*

_{ b }≥ 1, which means that node $B$ consumes more circuit power than node $A$. We also set ${p}_{r}^{ct}={k}_{r}{p}_{a}^{ct},{p}_{r}^{cr}={k}_{r}{p}_{a}^{cr}$, where

*k*

_{ r }≥ 1 or

*k*

_{ r }≤ 1, which reflects the cases where the RN consumes more circuit power or less circuit power than node $A$ depending on specific application scenarios.

It is easy to understand that if the circuit PC at the RN is high, the advantage of relay transmission over direct transmission shrinks and vice versa. Therefore, we focus on the comparison between OWRT and TWRT in Figure 7. We plot the performance gain of the maximal EE of TWRT over that of OWRT, i.e., $\frac{\text{max}\left(\underset{\text{EE}-\text{T}}{\overset{\text{opt}}{\eta}}\right)}{\text{max}\left(\underset{\text{EE}-\text{O}}{\overset{\text{opt}}{\eta}}\right)}$, in order to observe whether TWRT is more energy efficient than OWRT, and how much performance gain TWRT can achieve.

From the simulation results in Figure 7, we can see that as *k*_{
b
}increases, i.e., the difference of the circuit PCs at the two source nodes becomes larger, the benefit of TWRT over OWRT shrinks. The OWRT even become more energy efficient than TWRT when the relay circuit PC is low.

### 6.3 Unequal bidirectional packet sizes

*B*

_{ ab }

*/B*

_{ ba }, but the EE of TWRT reduces as the difference between

*B*

_{ ab }and

*B*

_{ ba }increases, and may even become lower than those of OWRT and DT.

Note that in all the simulations, we did not consider the Approximations 1 and 2 employed in the beginning of Section 5. We can see that the analytical results using those approximations agree well with the simulation results. This validates the previous theoretical analysis.

## 7 Conclusion

In this article, we studied the energy efficiencies of OWRT and TWRT, and compared with direct transmission. We first found the maximal energy efficiencies of three strategies by jointly optimizing the bidirectional transmission time and the transmit power. We then compared their maximal energy efficiencies with either zero or non-zero circuit power consumptions, and reveal the mechanisms to improve the energy efficiency of the three transmission strategies under different scenarios.

Analytical and simulation results showed that in symmetric systems with equal circuit power at each node and equal packet sizes in two directions, the spectral efficient two-way relaying is also more energy efficient than one-way relaying, but two-way relaying only provides higher energy efficiency than direct transmission when the path loss attenuation is large. In asymmetric systems where the circuit power consumptions at each node are different or the bidirectional packet sizes are unequal, the advantage of two-way relaying diminishes because it can not simultaneously minimize the energy consumed by the transmissions in two directions. One-way relaying may offer higher energy efficiency, depending on the difference between the amount of data in two directions. Compared with the joint transmit power and transmission time optimization, only optimizing the transmit power has a loss in EE when the packet size is small. All the comparison results reveal that relaying is not always more energy efficient than direct transmission, and the two-way relaying does not not always offer higher energy efficiency than one-way relaying. To save the energy consumption, a system should choose the most suitable transmission strategy considering its required amount of data to be transmitted, channel statistics, hardware circuit powers, and so on.

We also showed the relationship between the energy efficiency and the spectral efficiency, i.e., the required amount of data normalized by bandwidth and time duration, for all the three transmission strategy, which is largely dependent on the circuit power consumption. With zero circuit power, the energy efficiency achieves its maximum value as the spectral efficiency approaches zero. With non-zero circuit powers in transmit and receive duration but negligible circuit powers in idle duration, energy efficiency does not change with spectral efficiency in low traffic region but reduce sharply in high traffic region. With non-zero circuit powers in all the transmit, receive and idle modes, there exists a non-zero optimal spectral efficiency that maximizes the maximal energy efficiency.

## Appendix 1: Solution of optimization problem (14)

where ${C}_{1}\triangleq {2}^{2{B}_{ab}/\left({T}_{ab}W\right)}-1$.

which only depends on ${P}_{r1}^{t}$.

We can see that both ${P}_{r1}^{t-\text{opt}}$ and ${P}_{a}^{t-\text{opt}}$ are increasing functions of ${C}_{1}={2}^{2{B}_{ab}/\left({T}_{ab}W\right)}-1$, thus are decreasing functions of *T*_{
ab
}. Therefore, when *T*_{
ab
}is high enough, both ${P}_{r1}^{t-\text{opt}}$ and ${P}_{a}^{t-\text{opt}}=f\left({P}_{r1}^{t-\text{opt}}\right)$ will satisfy the two constraints in (45). Then (46) and (47) are the optimal solutions of the problem (14).

*T*

_{ ab }decreases, both ${P}_{r1}^{t-\text{opt}}$ and ${P}_{a}^{t-\text{opt}}$ increase, until one of them achieve its maximum value. By substituting (46) and (47) into ${P}_{r1}^{t-\text{opt}}={P}_{\text{max}}^{t}$ and ${P}_{a}^{t-\text{opt}}={P}_{\text{max}}^{t}$, respectively, we can derive the corresponding demarcation point

*T*

_{ ab }=

*T*

_{d 1}where ${P}_{r1}^{t-\text{opt}}$ achieves its maximal value, and can also derive the corresponding

*T*

_{ ab }=

*T*

_{d 2}where ${P}_{a}^{t-\text{opt}}$ achieves its maximal value. The derived

*T*

_{d 1}and

*T*

_{d 2}are given by

*T*

_{d 1}≥

*T*

_{d 2}, as

*T*

_{ ab }decreases, ${P}_{r1}^{t-\text{opt}}$ achieves its maximal value first, then we have

*T*

_{d 1}<

*T*

_{d 2}, as

*T*

_{ ab }decreases, ${P}_{a}^{t-\text{opt}}$ achieves its maximal value first, then we have

By adding (46) and (47), (50) and (51), and (52) and (53), we can obtain the expressions of ${P}_{\text{min}1}\left({T}_{ab}\right)=\text{min}\left(\underset{a}{\overset{t}{P}}+{P}_{r1}^{t}\right)$ in (16) and (17).

## Appendix 2: Proof of quasi-convexity of the objective function in (20)

*P*

_{min1}(

*T*

_{ ab }) follows (16), the conclusion is the same if it follows (17). Since

*P*

_{min1}(

*T*

_{ ab }) is a piecewise function of

*T*

_{ ab }, ${T}_{ab}\left(\frac{{P}_{\text{min}1}\left({T}_{ab}\right)}{2\epsilon}+{P}_{O}^{c1}-{P}_{O}^{ci}\right)$ is also a piecewise function. For simplicity, we define

By taking the second order derivative of *f*_{
l
}(*T*_{
ab
}), we have ${{f}_{l}}^{\prime \prime}\left({T}_{ab}\right)\ge 0$ when *T*_{min1} ≤ *T* < *T*_{d 1}. Therefore, *f*_{
l
}(*T*_{
ab
}) is a convex function in the range *T*_{min1} ≤ *T* < *T*_{d 1}.

Then we will show that *f*_{
r
}(*T*_{
ab
}) is a quasi-convex function in the range *T* > *T*_{d 1}, where we will use the following lemma.

**Lemma 1**. *Suppose that a function f*(*x*) *is second order differentiable in* (*x*_{
L
}, *x*_{
R
}), $\underset{x\to {x}_{L}}{\text{lim}}{f}^{\prime}\left(x\right)<0,\underset{x\to {r}_{R}}{\text{lim}}{f}^{\prime}\left(x\right)>0$, *and f"*(*x*) *only has one zero point in* (*x*_{
L
}, *x*_{
R
}). *Then f*(*x*) *is a quasi-convex function on* (*x*_{
L
}, *x*_{
R
}).

*Proof*. Since *f*(*x*) is second order differentiable, *f'*(*x*) is continuous on (*x*_{
L
}, *x*_{
R
}). Considering that $\underset{x\to {x}_{L}}{\text{lim}}{f}^{\prime}\left(x\right)<0,\underset{x\to {r}_{R}}{\text{lim}}{f}^{\prime}\left(x\right)>0$ *f'*(*x*) at least has one zero point in (*x*_{
L
}, *x*_{
R
}). We then show that *f'*(*x*) can only has one zero point.

Assume that *f'*(*x*) has three or more zero points such that *f'*(*a*) = *f'*(*b*) = *f'*(*c*) = 0. According to Rolle's theorem, there exists a point *x*_{1} **∈** (*a, b*) such that *f"*(*x*_{1}) = 0, and also a point *x*_{2} ∈ (*b, c*) such that *f"*(*x*_{2}) = 0. This conflicts with the assumption that *f"*(*x*) only has one zero point.

Assume that *f'*(*x*) has two zero points such that *f'*(*a*) = *f'*(*b*) = 0, *a, b* ∈ (*x*_{
L
}, *x*_{
R
}). According to Rolle's theorem, there is a point *x*_{1} **∈** (*a, b*) which satisfies *f"*(*x*_{1}) = 0. Without loss of generality, we assume that *f'*(*x*_{1}) > 0. Considering that $\underset{x\to {x}_{R}}{\text{lim}}{f}^{\prime}\left(x\right)>0$, and in (*x*_{1}, *x*_{
R
}), *f'*(*x*) only has one zero point *f'*(*b*) = 0, therefore, *f*'(*b*) = 0 is the minimum value of *f'*(*x*) in (*x*_{1},