Distributed resource allocation for D2D communications underlaying cellular network based on Stackelberg game
EURASIP Journal on Wireless Communications and Networking volume 2022, Article number: 35 (2022)
Abstract
With the development of artificial intelligence, the large-scale access of highly heterogeneous intelligent equipment will bring unpredictable spectrum shortages and complex interference to traditional cellular networks. This situation can be alleviated by the device-to-device (D2D) technique, which improves spectrum efficiency by reusing cellular resources. In this paper, a resource allocation framework comprising channel allocation and power control based on a Stackelberg game is proposed for distributed interference coordination between D2D and cellular communications with a quality-of-service guarantee. First, the base station matches channels to D2D pairs on the basis of potential throughput gain. Second, interference from D2D pairs to cellular users is converted into a penalty for D2D pairs through the inter-layer price, and the system throughput optimization problem is decoupled into multiple subproblems that can be solved in a distributed and iterative manner at each D2D pair. Simulation results show that the two proposed distributed algorithms for channel allocation and power control perform well in convergence and overall system throughput.
Introduction
Recently, as wireless personal communications become ubiquitous, the number of intelligent devices, ranging from smartphones to medical implants, is exploding. The conflict between the dramatic increase in intelligent equipment and the limited spectrum is becoming prominent [1, 2]. Wireless personal communications cannot depend solely on unlicensed wireless technologies, e.g., WiFi, ZigBee, and low-power wide area networks [3, 4]. LTE-based wireless technology, as cellular network infrastructure, can provide data processing and cooperative task capabilities for artificial intelligence (AI). The device-to-device (D2D) technique realizes direct communication among nearby devices by reusing the licensed spectrum of cellular users (CUs) under the control of the communication system [5, 6]. D2D communications can be introduced to guarantee quality-of-service (QoS), reliable communications and cooperation among equipment in networks [7]. An intelligent device can use D2D communication technology for direct communication via an isotropic antenna at every base station. D2D provides a new way to achieve low-latency communications, large data transmission and massive access for AI terminals [8].
D2D communications improve the performance of traditional cellular networks in terms of spectral efficiency, overall throughput and energy efficiency [9, 10]. Despite the potential data rate gain, D2D communications also pose new challenges for interference management [11,12,13]. On the one hand, CUs experience inter-layer interference from D2D pairs. On the other hand, D2D pairs experience inter-layer interference from CUs and intra-layer interference from other D2D pairs reusing the same resource [14]. Channel allocation and power control are two well-known, effective methods for coordinating cellular and D2D communications [15, 16]. The two methods reduce interference from D2D communications to the cellular network and enhance the gain of D2D communications [17]. A great deal of work has recently appeared in the literature on the design of interference mitigation strategies based on channel allocation and power control.
Most studies have addressed only the resource allocation problem, without power control, to avoid interference. The resource allocation methods in [18] and [19] prevent serious interference by using the near-far principle: the base station (BS) always chooses CUs that are well isolated from a particular D2D pair to share resource blocks (RBs). These methods are simple but may result in poor performance when QoS is further considered. Li et al. propose a resource allocation solution based on adaptive antenna arrays and interference alignment to coordinate interference efficiently [20]. In these schemes, one D2D pair reuses the resource of one CU, and one CU shares its resource with one D2D pair. Evidently, the spectral efficiency of this one-to-one resource reuse model is limited. Many studies allow multiple D2D pairs to share the same resources with one CU, or one D2D pair to reuse the resources of multiple CUs [21, 22]. As research continues, an increasing number of researchers combine resource allocation and power control to improve the efficiency of resource reuse. Xu et al. studied the problem of pairing CUs and D2D users to share the same radio resources and proposed a reverse iterative combinatorial auction-based power allocation scheme to optimize the system throughput [23]. Wang et al. further analyze the relationship between the access probability and the channel gain of D2D pairs and propose a heuristic resource allocation scheme based on a greedy method [24]. A joint RB assignment and power allocation framework based on an interference graph is proposed in [25]. Similarly, in [26], the authors use hypergraph theory-based channel allocation to coordinate interference between D2D pairs and CUs. However, schemes that rely on centralized management are not suitable for large-scale networks [27]. In general, CUs and D2D users may act in their own self-interest to maximize their benefits in the network, which requires CUs, D2D users and the BS to solve distributed decision problems.
A distributed framework for resource management and interference coordination was designed in [28], in which the authors solve the resource allocation problem with a column generation algorithm. Several auction-based game resource allocation schemes have been developed [29]. A distributed algorithm using an auction game was adopted in [30] to maximize the data rate of the underlay-layer users while keeping interference in the macro-tier within an acceptable range. However, an auction game needs many rounds of negotiation between buyers and sellers, which may result in large control signaling overhead [31]. In game theory, fictitious prices can be used to coordinate and control the transmissions of network nodes, which resolves the conflict between overhead and gain.
In this study, we design a distributed channel allocation and power control strategy that addresses the inter-layer and intra-layer interference between users in order to maximize the overall system throughput and protect cellular communications. First, we design a low-complexity channel allocation method based on the potential throughput gain of D2D users. Next, for the power control problem of D2D pairs, we model the interaction between CUs and D2D pairs as a Stackelberg game, in which the BS prices the interference received from the D2D pairs. Under the restriction and penalty imposed by the price, competition among all users turns into cooperation.
System model and problem formulation
System model
We consider resource sharing in a single cellular network, where D2D communications reuse the uplink spectrum resources of cellular communications as shown in Fig. 1. The network is provided with a set \({\mathcal {K}}\) of K orthogonal channels, \({\mathcal {K}}=\{1,2, \ldots , K\}\), each of which is represented by \(k\in {\mathcal {K}}\). The C CUs are denoted by the set \({\mathcal {C}}\). We consider a fully loaded network where the channels in the network are occupied by an equal number of CUs, \(C=K\). For brevity, we refer to the CU occupying channel k as CU k. The N D2D pairs in the network are denoted by the set \({\mathcal {D}}\), \({\mathcal {D}}=\{1, 2, \ldots , N\}\), \(N>K\). Multiple D2D pairs can share the same channel, and each D2D pair can reuse at most one channel. For conciseness, \({\mathcal {D}}_k\), \({\mathcal {D}}_{k}=\left\{ 1,2, \ldots , D_{k}\right\}\), represents the set of D2D pairs reusing channel k. \(p_{i}^{k}\) represents the transmit power of D2D pair i on channel k. \(\mathrm {p}_{{\mathcal {D}}_{\mathrm {k}}}=\left\{ p_{1}^{k}, p_{2}^{k}, \ldots , p_{i}^{k}, \ldots , p_{D_{k}}^{k}\right\}\) represents the power allocation vector for \({\mathcal {D}}_k\). We denote the channel reuse indicator of the ith D2D link on channel k by \(x_{i k} \in \{0,1\}\), where \(x_{i k} =1\) if D2D pair i accesses channel k, and \(x_{i k} =0\) otherwise.
In uplink transmission, the BS suffers inter-layer interference from D2D transmitters, and D2D receivers suffer inter-layer interference from CUs and intra-layer interference from the D2D transmitters that reuse the same channel. Therefore, the received SINR of D2D pair i and CU k on channel k can be written as
where \(p_c\) denotes the transmit power of the CUs, \(g_{i i}^{k}\) is the channel gain from the ith D2D transmitter to the ith D2D receiver on channel k, \(g_{k i}\) is the channel gain from CU k to the ith D2D receiver, \(g_{j i}^{k}\) is the channel gain from the jth D2D transmitter to the ith D2D receiver on channel k, \(g_{i b}^{k}\) is the channel gain from the ith D2D transmitter to the BS on channel k, \(g_{k b}\) is the channel gain from CU k to the BS, and \(\sigma ^2\) represents the noise power.
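As a numerical illustration of these definitions, the following minimal sketch computes both SINRs for one channel. It assumes the usual form (desired received power divided by the sum of inter-layer interference, intra-layer interference and noise), which is consistent with the symbols defined above and with the CU SINR test used later in the channel allocation steps. The function and argument names are hypothetical.

```python
def d2d_sinr(i, powers, g_d2d, g_cu_to_d2d, p_c, noise):
    """SINR at the receiver of D2D pair i on one channel (assumed form).

    powers[j]      : transmit power of D2D pair j on this channel
    g_d2d[j][i]    : gain from D2D transmitter j to D2D receiver i
    g_cu_to_d2d[i] : gain from the CU on this channel to D2D receiver i
    """
    # Intra-layer interference from the other D2D transmitters on the channel.
    intra = sum(powers[j] * g_d2d[j][i] for j in range(len(powers)) if j != i)
    # Inter-layer interference from the CU that owns the channel.
    inter = p_c * g_cu_to_d2d[i]
    return powers[i] * g_d2d[i][i] / (inter + intra + noise)


def cu_sinr(powers, g_d2d_to_bs, g_cu_to_bs, p_c, noise):
    """SINR of the CU at the BS, with all co-channel D2D transmitters interfering."""
    interference = sum(p * g for p, g in zip(powers, g_d2d_to_bs))
    return p_c * g_cu_to_bs / (interference + noise)
```

All quantities are in linear units (not dB), so gains and powers can be multiplied directly.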
Problem formulation
Our objective is to maximize the total throughput of CUs and D2D pairs subject to a constraint to guarantee the performance of cellular transmissions. Mathematically, the overall throughput optimization problem for channel allocation and power control can be formulated as
where \(w_c\) is the bandwidth of a channel. Constraint (3a) limits the maximum transmit power of each D2D transmitter. Constraint (3b) guarantees the QoS of CUs, where \(Q_k\) is the interference tolerance level, which depends on the requirements and channel gain of the CU on channel k. Constraint (3c) ensures that at most one channel is assigned to each D2D pair.
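The three constraints can be checked mechanically for any candidate allocation. The sketch below is illustrative only: it assumes that constraint (3b) bounds the aggregate D2D interference received at the BS on channel k by \(Q_k\) in linear units, and the function name and data layout are hypothetical.

```python
def is_feasible(x, p, g_bs, q, p_max):
    """Check a candidate allocation against constraints (3a)-(3c).

    x[i][k]    : 1 if D2D pair i reuses channel k, else 0
    p[i][k]    : transmit power of D2D pair i on channel k
    g_bs[i][k] : gain from D2D transmitter i to the BS on channel k
    q[k]       : interference tolerance of the CU on channel k (linear units)
    """
    num_pairs, num_channels = len(x), len(q)
    for i in range(num_pairs):
        # (3c): each D2D pair reuses at most one channel.
        if sum(x[i]) > 1:
            return False
        # (3a): every transmit power lies in [0, p_max].
        if any(not 0.0 <= p[i][k] <= p_max for k in range(num_channels)):
            return False
    for k in range(num_channels):
        # (3b), assumed form: aggregate interference at the BS stays below Q_k.
        interference = sum(x[i][k] * p[i][k] * g_bs[i][k] for i in range(num_pairs))
        if interference > q[k]:
            return False
    return True
```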
Equation (3) is a typical mixed-integer nonlinear programming (MINLP) problem, which is usually intractable. The channel allocation matrix for D2D pairs, \(\mathrm {X}_{\mathrm {N} \times \mathrm {K}}=\left[ x_{i k}\right]\), leads to a combinatorial problem. Moreover, given the channel allocation, the objective function is non-concave in the power allocation vector. The optimal solution can be found through exhaustive search, which has extremely high complexity even in a modest-size network. One method to solve the problem is to relax the integer constraint to [0, 1] [31]. After relaxation, the data rates of D2D pairs and CUs can be calculated from the expected SINR. However, Eq. (3) is still not a convex optimization problem. Another solution is to transform the expected rate into a convex function through a logarithmic inversion of the variable \(x_{ik}\); the optimal solution then lies on the boundary of the feasible set [32]. However, given that the feasible set is no longer a polyhedron, it is difficult to find the optimal solution.
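To give a sense of the search space, note that each D2D pair either stays unassigned or picks one of the K channels, so there are \((K+1)^N\) candidate allocation matrices before any power optimization. For K = 10 and N = 20 this already exceeds \(10^{20}\). The short sketch below (hypothetical helper names) counts and, for tiny instances, enumerates the candidates.

```python
from itertools import product


def num_assignments(num_channels, num_pairs):
    """Number of channel allocation matrices satisfying constraint (3c)."""
    return (num_channels + 1) ** num_pairs


def enumerate_assignments(num_channels, num_pairs):
    """Yield every assignment as a tuple; None means the pair gets no channel."""
    options = [None] + list(range(num_channels))
    return product(options, repeat=num_pairs)
```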
Instead of centralized methods, we propose a distributed strategy with low coordination and communication overhead. We decouple the problem into two subproblems: channel allocation and power control. First, a heuristic algorithm based on the potential rate obtained when a single D2D pair and a CU act as reuse partners is adopted to allocate channels to D2D pairs. Subsequently, we introduce a price for channel resources to decouple the interference constraint and develop a Stackelberg game model to arrive at an optimal power allocation iteratively.
Channel allocation
When D2D pair i reuses the channel resources of CU k, the sum rate of D2D pair i and CU k is
where \(w_c\) is the channel bandwidth. In the optimization problem of overall throughput, the sum rate obtained by a single D2D pair and a CU acting as reuse partners is rarely considered. We optimize \(T(p_i^k)\) to find an approximately optimal reuse relationship between D2D pairs and CUs,
The SINR of CUs in a cellular network is usually required to exceed a certain threshold. Here, we set the SINR threshold of CUs in our model to \(\gamma _{c}^{t h}\). Then, the actual power constraint of D2D pairs in the channel allocation phase can be further expressed as
\(T\left( p_{i}^{k}\right)\) is a convex function of the variable \(p_i^k\), and Eq. (5) is a convex optimization problem. The optimal power for this optimization problem can be expressed as
At the same time, we obtain the throughput \({\widetilde{T}}(p_{i}^{k})\) corresponding to the optimal power. For channel k, admitting the D2D pair with the highest throughput \({\widetilde{T}}(p_{i}^{k})\) is more likely to improve the reuse efficiency of the system, provided that the SINR threshold of the CU is met. Therefore, the specific steps of channel allocation for D2D pairs in the system are as follows.
Step 1 Initialize \({\mathcal {D}}_{k}=\emptyset\) for all \(k \in {\mathcal {C}}\) and \(x_{i k}=0\) for all \(i \in {\mathcal {D}}\). Calculate the optimal power \({\widetilde{p}}_{i}^{k}\) of each D2D pair and the corresponding \({\widetilde{T}}(p_{i}^{k})\).
Step 2 \(\left( i^{*}, k^{*}\right) =\underset{k \in {\mathcal {C}}, i \in {\mathcal {D}}}{\arg \max } {\widetilde{T}}\left( p_{i}^{k}\right)\): find the D2D pair \(i^{*}\) and the channel \(k^{*}\) corresponding to the maximum throughput.
Step 3 If \(\frac{p_{c} g_{k^{*} b}}{\sum _{i \in {\mathcal {D}}_{k^{*}} \cup \left\{ i^{*}\right\} } {\widetilde{p}}_{i}^{k^{*}} g_{i b}^{k^{*}}+\sigma ^{2}} \ge \gamma _{c}^{t h}\), then \(x_{i^{*} k^{*}}=1\); remove D2D pair \(i^*\) from \({\mathcal {D}}\), \({\mathcal {D}}={\mathcal {D}} \backslash \left\{ i^{*}\right\}\), and add \(i^*\) to the set of D2D pairs reusing \(k^*\), \({\mathcal {D}}_{k^{*}}={\mathcal {D}}_{k^{*}} \cup \left\{ i^{*}\right\}\). Otherwise, only remove D2D pair \(i^*\) from \({\mathcal {D}}\).
Step 4 Check \({\mathcal {D}}\). Go to Step 5 if \({\mathcal {D}}\) is empty; otherwise, return to Step 2.
Step 5 Output \({\mathcal {D}}_k\) and \(\mathrm {X}_{N \times K}\).
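The five steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the pair sum rate is taken as \(w_c[\log_2(1+\gamma_i^k)+\log_2(1+\gamma_k)]\) with only pair i and CU k on the channel, the per-pair power is found by a simple grid search rather than a closed form, and all function names and the `gains[i][k] = (g_ii, g_ki, g_ib, g_kb)` layout are hypothetical.

```python
import math


def pair_sum_rate(p, g_ii, g_ki, g_ib, g_kb, p_c, noise, w_c=1.0):
    """Sum rate of one D2D pair and one CU sharing a channel (assumed form)."""
    d2d = math.log2(1.0 + p * g_ii / (p_c * g_ki + noise))
    cu = math.log2(1.0 + p_c * g_kb / (p * g_ib + noise))
    return w_c * (d2d + cu)


def best_single_power(g_ii, g_ki, g_ib, g_kb, p_c, noise, p_max, sinr_th, grid=200):
    """Grid search for the power maximizing the pair sum rate."""
    # Largest D2D power that keeps the CU SINR at or above the threshold.
    p_cap = (p_c * g_kb / sinr_th - noise) / g_ib
    p_hi = max(0.0, min(p_max, p_cap))
    rate = lambda p: pair_sum_rate(p, g_ii, g_ki, g_ib, g_kb, p_c, noise)
    best = max((p_hi * n / grid for n in range(grid + 1)), key=rate)
    return best, rate(best)


def allocate_channels(gains, p_c, noise, p_max, sinr_th):
    """Greedy channel allocation following Steps 1-5."""
    num_pairs, num_channels = len(gains), len(gains[0])
    # Step 1: empty sets, zero indicators, per-pair optimal power and rate.
    opt = {(i, k): best_single_power(*gains[i][k], p_c, noise, p_max, sinr_th)
           for i in range(num_pairs) for k in range(num_channels)}
    remaining = set(range(num_pairs))
    members = {k: [] for k in range(num_channels)}
    x = [[0] * num_channels for _ in range(num_pairs)]
    while remaining:  # Step 4: loop until every pair has been considered.
        # Step 2: pair and channel with the largest potential sum rate.
        i_s, k_s = max(((i, k) for i in sorted(remaining)
                        for k in range(num_channels)), key=lambda ik: opt[ik][1])
        # Step 3: admit the pair only if the CU SINR threshold still holds.
        interference = sum(opt[(j, k_s)][0] * gains[j][k_s][2]
                           for j in members[k_s] + [i_s])
        if p_c * gains[i_s][k_s][3] / (interference + noise) >= sinr_th:
            x[i_s][k_s] = 1
            members[k_s].append(i_s)
        remaining.discard(i_s)  # the pair leaves the candidate set either way
    return members, x  # Step 5
```

Note that, as written in Step 3, a pair rejected on its best channel is dropped rather than retried on another channel; the sketch keeps that behavior.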
Power control
We assume that the transmit power of CUs in the network is a fixed value \(p_c\) and that the channel allocation is determined by the heuristic algorithm described in the channel allocation section. In this section, we consider how to design a reasonable power control strategy for D2D pairs that maximizes system throughput under the QoS constraint of CUs. The channels occupied by CUs in the system are orthogonal. Therefore, the power control problem can be decoupled into K independent subproblems, each corresponding to one CU, and we only need to study the power control of one set of D2D pairs \({\mathcal {D}}_k\). Without interference coordination, D2D pairs will choose the maximum transmit power to maximize their revenue, while CUs will refuse to share channel resources with D2D pairs. Therefore, we propose a power allocation scheme based on a Stackelberg game with a price charging mechanism, in which the BS charges D2D pairs for the interference they cause to the BS on channel k.
A Stackelberg game is a strategic game that consists of a leader and multiple followers competing with one another. In our model, the leader first sets the price of interference for the channel, and then the D2D pairs, as followers, update their transmit power to maximize their utilities on the basis of the announced interference price.
Utility functions
The channel rate of D2D pair i and CU k on channel k can be obtained by
For the leader, let \(\mu _{k}\) denote the unit price for the interferences brought by D2D pairs to channel k, and \(\mu _{k} \ge 0\). The utility of the leader can be defined as its own throughput performance plus the gain earned from the followers. Mathematically, the utility function of the leader can be formulated as
The optimization problem of leader is to maximize the utility, which can be expressed as
where \(Q_k\) is the interference tolerance level, which is dependent on the channel condition of CU k. Equation (11a) protects cellular transmissions.
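For illustration, the leader's utility and its interference constraint can be evaluated as in the sketch below. It assumes the utility is the CU rate \(w_c\log_2(1+\gamma_k)\) plus the revenue \(\mu_k \sum_i p_i^k g_{ib}^k\) collected on the interference received at the BS, which matches the verbal definition above; the function names are hypothetical.

```python
import math


def leader_utility(mu, powers, g_bs, g_kb, p_c, noise, w_c=1.0):
    """Leader utility on one channel: CU rate plus interference revenue (assumed form)."""
    interference = sum(p * g for p, g in zip(powers, g_bs))
    rate = w_c * math.log2(1.0 + p_c * g_kb / (interference + noise))
    return rate + mu * interference


def within_tolerance(powers, g_bs, q_k):
    """Constraint (11a): aggregate D2D interference at the BS must not exceed Q_k."""
    return sum(p * g for p, g in zip(powers, g_bs)) <= q_k
```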
We define \(C_{D 2 D, k}^{i}\) as the sum rate of all D2D pairs in \({\mathcal {D}}_{k}\) except D2D pair i, as
Take the partial derivative of \(C_{D 2 D, k}^{i}\) with respect to \(p_i^k\) and let \(p_{i}^{k}=p_{i}^{k^*}\), where \(p_i^{k^*}\) is the value obtained in the previous power iteration. Then, we derive the intra-layer price \(c_i^k\) as
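One form consistent with these definitions is sketched below. It assumes the rate expression \(w_c\log_2(1+\gamma_j^k)\), the interference terms of the system model, and the sign convention that the price is the negative of the derivative (so that \(c_i^k \ge 0\)); the shorthand \(I_j^k\) for interference plus noise at receiver j is introduced here for convenience.

```latex
% Sum rate of the other D2D pairs on channel k, with I_j^k the
% interference-plus-noise power at D2D receiver j:
\[
C_{D2D,k}^{i} = \sum_{j \in \mathcal{D}_k,\, j \neq i} w_c \log_2\!\left(1 + \gamma_j^k\right),
\qquad
\gamma_j^k = \frac{p_j^k g_{jj}^k}{I_j^k},
\qquad
I_j^k = p_c g_{kj} + \sum_{l \in \mathcal{D}_k,\, l \neq j} p_l^k g_{lj}^k + \sigma^2 .
\]
% Only I_j^k depends on p_i^k, through the term p_i^k g_{ij}^k, so
\[
c_i^k = -\left.\frac{\partial C_{D2D,k}^{i}}{\partial p_i^k}\right|_{p_i^k = p_i^{k^*}}
      = \frac{w_c}{\ln 2} \sum_{j \in \mathcal{D}_k,\, j \neq i}
        \frac{\gamma_j^k \, g_{ij}^k}{\left(1 + \gamma_j^k\right) I_j^k} .
\]
```

Each summand is the marginal rate loss that one extra unit of power from pair i inflicts on pair j, so a pair that couples strongly into its neighbors faces a higher price.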
In the followers’ game, the players are the D2D pairs allocated to channel k, which are denoted by the set \({\mathcal {D}}_k\). As one of the followers, the utility function of D2D pair i is defined as
where the second term of the utility function is the cost of the inter-layer interference from D2D pair i to CU k, and the last term is the cost due to the intra-layer interference among D2D pairs.
On the basis of the utility function of D2D pairs, the optimization problem at each follower can be defined as
Let \(p_{i}^{k^{*}}\) be the optimal transmit power of D2D pair i on channel k, and \(\mathrm {p}_{{\mathcal {D}}_{\mathrm {k}}}^{*}=\left\{ p_{1}^{k^{*}}, p_{2}^{k^{*}}, \ldots , p_{i}^{k^{*}}, \ldots , p_{D_{k}}^{k^{*}}\right\}\) be the optimal power allocation vector for \({\mathcal {D}}_k\).
For the proposed Stackelberg game, the Stackelberg equilibrium (SE) is defined as follows:
Definition 1: Let \(\mu _{k}^*\) and \({\mathbf {p}}_{{\mathcal {D}}_k}^*\) be the optimal strategies of the leader and the followers in the Stackelberg game, respectively. The strategy pair \((\mu _{k}^*, {\mathbf {p}}_{{\mathcal {D}}_k}^*)\) is an SE if no unilateral deviation in strategy by the leader or a follower is profitable, that is,
Generally, the best responses (BRs) of the followers must be calculated first to obtain the SE, because the leader moves first and the followers move accordingly. Then, the leader derives its BR according to the followers’ best strategy. On the basis of the utility functions of the leader and followers, we can solve the problem through backward induction. Therefore, we start with the problem of the followers.
Analysis of followers’ game
For D2D pairs, on the one hand, a high transmit power can bring a high data rate. On the other hand, a high transmit power incurs a high cost due to the interference caused to other D2D pairs and the CU. Therefore, a trade-off exists between the data rate and the cost. The followers’ game can be written as a tuple, \({\mathcal {F}}\left\{ {\mathcal {D}}_{k},\left\{ p_{i}^{k}\right\} _{i \in {\mathcal {D}}_{k}},\left\{ U_{i}^{F}\left( c_{i}^{k}, {\mathbf {p}}_{{\mathcal {D}}_{k}}\right) \right\} _{i \in {\mathcal {D}}_{k}}\right\}\).
The logarithmic function grows more slowly than the cost. Thus, the objective function of the optimization problem in Eq. (15) is approximately concave with respect to \(p_i^k\), and its second derivative is less than zero. The BR of the power is derived by setting the first-order partial derivative of the objective function to zero, shown as
Given that the optimal power in Eq. (18) has a water-filling form as in [22], the followers’ game can reach an SE between D2D pairs through an iterative method. We define the function \(f: f\left( p_1^k, \ldots , p_{{\mathcal {D}}_k}^k ; \mu _k\right) =\left( p_1^{k^*}\left( {\mathbf {p}}_{1}^k\right) , \ldots , p_{{\mathcal {D}}_k}^{k^*}\left( {\mathbf {p}}_{{\mathcal {D}}_k}^k\right) \right)\). f describes the optimal transmit power of each D2D pair given that the powers of the other D2D pairs are fixed, so f is the BR function. We propose a synchronous iterative algorithm corresponding to the BR function, called the BR algorithm, in which all D2D links adjust their power according to the BR function, as follows:
The BR algorithm is described in Algorithm 1. By applying the maximum theorem to \(U_{i}^{F}\left( \mu _{k}, p_{i}^{k}\right)\), we find that the BR function is continuous [33]. If the BR algorithm converges, each D2D pair has adjusted its transmit power to the value that maximizes its utility. Thus, no D2D pair can increase its utility by adjusting only its own power; that is, the D2D pairs are at an SE. The BR algorithm never converges to a solution that is not an SE.
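A minimal sketch of such a synchronous BR iteration is given below. It is an illustration under assumptions, not a transcription of Algorithm 1: the follower utility is taken as \(w_c\log_2(1+\gamma_i^k)-\mu_k p_i^k g_{ib}^k-c_i^k p_i^k\), whose stationary point gives the water-filling update \(p_i^k=\left[\frac{w_c}{\ln 2\,(\mu_k g_{ib}^k+c_i^k)}-\frac{I_i^k}{g_{ii}^k}\right]_0^{p_{max}}\), with \(I_i^k\) the interference plus noise at receiver i. The intra-layer price is computed as the marginal rate loss of the other pairs, and all function names are hypothetical.

```python
import math

LN2 = math.log(2.0)


def noise_plus_interference(i, p, g, g_cu, p_c, noise):
    """Interference plus noise seen by D2D receiver i on the shared channel."""
    intra = sum(p[j] * g[j][i] for j in range(len(p)) if j != i)
    return p_c * g_cu[i] + intra + noise


def intra_layer_price(i, p, g, g_cu, p_c, noise, w_c):
    """Assumed price c_i: rate lost by the other pairs per unit power of pair i."""
    total = 0.0
    for j in range(len(p)):
        if j != i:
            i_j = noise_plus_interference(j, p, g, g_cu, p_c, noise)
            signal = p[j] * g[j][j]
            total += signal * g[i][j] / (i_j * (i_j + signal))
    return w_c / LN2 * total


def best_response(p, mu, g, g_cu, g_bs, p_c, noise, p_max, w_c=1.0):
    """One synchronous update: every pair reacts to the previous power vector."""
    updated = []
    for i in range(len(p)):
        unit_cost = mu * g_bs[i] + intra_layer_price(i, p, g, g_cu, p_c, noise, w_c)
        if unit_cost <= 0.0:
            updated.append(p_max)  # nothing to pay, so transmit at full power
            continue
        i_i = noise_plus_interference(i, p, g, g_cu, p_c, noise)
        water_level = w_c / (LN2 * unit_cost)
        # Water-filling form, clipped to the feasible power range.
        updated.append(min(max(water_level - i_i / g[i][i], 0.0), p_max))
    return updated


def br_algorithm(p0, mu, g, g_cu, g_bs, p_c, noise, p_max,
                 w_c=1.0, tol=1e-6, max_iter=500):
    """Iterate the BR map until the power vector stops changing."""
    p = list(p0)
    for iteration in range(1, max_iter + 1):
        nxt = best_response(p, mu, g, g_cu, g_bs, p_c, noise, p_max, w_c)
        if max(abs(a - b) for a, b in zip(p, nxt)) < tol:
            return nxt, iteration
        p = nxt
    return p, max_iter  # no convergence within max_iter rounds
```

Here `g[j][i]` is the gain from D2D transmitter j to D2D receiver i, `g_cu[i]` the gain from the CU to receiver i, and `g_bs[i]` the gain from transmitter i to the BS. Convergence is not guaranteed for arbitrary gains, which is why the loop is capped by `max_iter`.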
Analysis of leader’s game
The utility of the leader consists of two parts: the revenue from the data rate of CU k on channel k and the revenue from selling interference to D2D users. Therefore, the optimization problem can be written approximately as
To solve Eq. (20), we first formulate the Lagrangian function as
where \(\lambda\) is the Lagrange multiplier of the QoS constraint of total interferences.
On the basis of the Karush–Kuhn–Tucker (KKT) conditions of the Lagrangian problem, we have
Thus, the solution of the optimization problem in Eq. (20), which we call the analytical price, has the following form,
The inter-layer price \(\mu _{k}^{*}\) is the optimal price set by the leader for a given set of transmit powers; the BS updates the price and dynamically broadcasts it to the D2D pairs in \({\mathcal {D}}_{k}\).
We propose a Stackelberg game algorithm based on the analytical price, which is executed periodically to track the channel state changes caused by mobility. The iterative update process in each cycle is as follows.
Step 1 Initialize \(\Delta =1 \times 10^{-2}\) and \(t=0\); select the initial price \(\mu _{k}(t)\) randomly; and broadcast the price.
Step 2 Collect the information of the D2D and cellular links and execute the BR algorithm.
Step 3 Calculate the optimal price according to Eq. (24), and let \(\mu _{k}(t+1)=\mu _{k}^{*}\).
Step 4 If \(\left| \mu _{k}(t+1)-\mu _{k}(t)\right| \ge \Delta\), then broadcast the current price, let \(t=t+1\), and go back to Step 2; otherwise, go to Step 5.
Step 5 Broadcast the final price.
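The control flow of this price update cycle can be sketched as below. The sketch is deliberately generic: the followers' reaction and the analytical price rule are passed in as callables (`follower_response` and `price_rule`, both hypothetical names), since their closed forms are given by the equations referenced in the steps. The initial price range (0, 1) and the `max_rounds` safeguard are assumptions.

```python
import random


def stackelberg_price_iteration(follower_response, price_rule, delta=1e-2,
                                mu0=None, max_rounds=1000, rng=None):
    """Alternate follower power updates and leader price updates.

    follower_response(mu) -> list of D2D powers after the followers converge
    price_rule(powers)    -> leader's price for those powers
    """
    rng = rng or random.Random()
    # Step 1: random initial price unless one is supplied.
    mu = rng.uniform(0.0, 1.0) if mu0 is None else mu0
    for _ in range(max_rounds):
        powers = follower_response(mu)   # Step 2: followers run their BR iteration
        mu_next = price_rule(powers)     # Step 3: leader recomputes the price
        if abs(mu_next - mu) < delta:    # Step 4: price has settled
            return mu_next, powers       # Step 5: final price is broadcast
        mu = mu_next                     # otherwise broadcast and repeat
    return mu, follower_response(mu)
```

As a toy usage example, `stackelberg_price_iteration(lambda mu: [1 / (1 + mu)], lambda p: 0.5 * p[0], mu0=1.0)` settles near the fixed point of \(\mu = 0.5/(1+\mu)\); these two lambdas are placeholders, not the paper's response and price functions.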
The scheme for the interference management of D2D and cellular communications based on the heuristic channel allocation and the Stackelberg game power control described above is named Distributed Resource Allocation for D2D communications underlaying cellular network based on Stackelberg Game (DRASG). Specifically, given that the power control is updated based on the analytical price, we call it the DRASGAP algorithm.
The total complexity of DRASGAP can be derived from the complexity of each step in the two-stage algorithm of channel selection and power control. The complexity of the channel selection phase is \({\mathcal {O}}\left( K N\right)\). In power control, the upper bound of the complexity of the followers’ game is \({\mathcal {O}}(2^{D_{K}}{\log _{2} \varepsilon ^{-1}})\), and the upper bound of the complexity of the leader’s game based on the Lagrangian analytical price is \({\mathcal {O}}\left( \log _{2} \Delta ^{-1}\left( 2^{D_{K}} \times D_{K}\right) ^{2}\right)\). Therefore, the total complexity of DRASGAP is \({\mathcal {O}}\left( K N+{2^{D_{K}}}\log _{2} \varepsilon ^{-1} \times \log _{2} \Delta ^{-1}\left( 2^{D_{K}} \times D_{K}\right) ^{2}\right)\).
We also propose a method for obtaining the optimal price by decreasing the price to reduce computational complexity. The optimization problem of the leader is the utility maximization problem subject to interference constraints. Thus, the optimal price is dependent on the channel conditions, interferences and power constraints. If the leader sets the price extremely low, the followers will buy the interferences generated by \(p_{max}\), and the leader will then increase the price to further earn revenue. If the leader sets the price extremely high, the revenue from D2D will be zero. Given \(0 \le p_{i}^{k} \le p_{m a x}\), the optimal price is actually limited to a certain range, which is denoted as \(\mu _{k}^{l} \le {\hat{\mu }}_{k} \le \mu _{k}^{u}\). The upper bound \(\mu _{k}^{u}\) and lower bound \(\mu _{k}^{l}\) of the price can be obtained by Eq. (18) as
On the basis of the above analysis, we first divide the price range \(\left[ \mu _{k}^{l}, \mu _{k}^{u}\right]\) into intervals that are sufficiently small. Then, the leader calculates the corresponding price in each interval and measures the aggregate interference. The BS checks the prices in descending order and ultimately stops at the price that maximizes revenue while satisfying the interference constraint. The power control in this variant of our framework is updated on the basis of the decreasing price; thus, we call the algorithm the DRASGDP algorithm. The iterative update algorithm based on the decreasing price is shown in Algorithm 2.
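The descending price scan can be illustrated with the following sketch, which is not a transcription of Algorithm 2. It assumes a uniform price grid, a revenue of \(\mu_k\) times the aggregate interference at the BS, and that lowering the price never lowers the interference, so the scan can stop at the first violation of the tolerance. `follower_response` is a hypothetical callable returning the followers' converged powers for a given price.

```python
def decreasing_price_search(mu_low, mu_high, steps, follower_response, g_bs, q_k):
    """Scan prices from mu_high down to mu_low and keep the best feasible one.

    Returns (price, revenue, powers) or None if even mu_high is infeasible.
    """
    best = None
    for n in range(steps + 1):
        mu = mu_high - (mu_high - mu_low) * n / steps  # descending grid
        powers = follower_response(mu)
        interference = sum(p * g for p, g in zip(powers, g_bs))
        if interference > q_k:
            # Assumed monotonicity: cheaper prices only increase interference.
            break
        revenue = mu * interference
        if best is None or revenue > best[1]:
            best = (mu, revenue, powers)
    return best
```

A finer grid (larger `steps`) trades more follower evaluations for a price closer to the revenue-maximizing one.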
Results and discussion
Experimental method
We conduct comprehensive simulations to evaluate the performance of the proposed algorithms for distributed channel allocation and power control for D2D communications underlaying a cellular network. We consider a single-cell network with a radius of 500 m, where the BS is located at the center and D2D pairs are randomly distributed. The distance from each D2D transmitter to its receiver is less than 50 m. We assume that the bandwidth of each sub-band is 180 kHz. The main simulation parameters are listed in Table 1, unless otherwise specified. Simulations were performed on the MATLAB 7.0 platform, and 100 rounds were performed for each set of simulations to ensure the reliability of the results.
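The geometry of one simulation round can be reproduced with a short sketch such as the one below. It only covers node placement: the BS is at the origin, transmitters are drawn uniformly over the cell area, and each receiver is drawn uniformly within 50 m of its transmitter. The uniform-over-area choice and the function name are assumptions, and the path loss and fading models are omitted because they depend on the parameters in Table 1.

```python
import math
import random


def drop_d2d_pairs(num_pairs, cell_radius=500.0, max_link=50.0, rng=None):
    """Return a list of ((tx_x, tx_y), (rx_x, rx_y)) positions in meters."""
    rng = rng or random.Random()
    pairs = []
    for _ in range(num_pairs):
        # sqrt of a uniform sample gives points uniform over the disc area.
        r = cell_radius * math.sqrt(rng.random())
        a = rng.uniform(0.0, 2.0 * math.pi)
        tx = (r * math.cos(a), r * math.sin(a))
        # Receiver within max_link of its transmitter; it may fall just
        # outside the cell edge, which this sketch does not correct.
        d = max_link * math.sqrt(rng.random())
        b = rng.uniform(0.0, 2.0 * math.pi)
        rx = (tx[0] + d * math.cos(b), tx[1] + d * math.sin(b))
        pairs.append((tx, rx))
    return pairs
```

Averaging a metric over 100 independent calls with different seeds mirrors the 100 rounds per simulation set described above.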
Sum rate of D2D pairs
Figure 2 shows the sum rate of D2D pairs versus the interference tolerance level of CUs. The sum rates of D2D pairs under the two proposed resource allocation algorithms, DRASGAP and DRASGDP, increase with the interference tolerance level: when CUs accept higher interference, more D2D links can access the occupied channels. Moreover, Fig. 2 shows that DRASGAP performs slightly better than DRASGDP in terms of the sum rate of D2D pairs; on average, the former is 7.35% higher than the latter. The reason is that, in the first subgame stage of DRASGAP, each iteration aims to maximize the leader’s revenue and selects the optimal price for the power adjustment, which approaches the optimal power distribution under the interference tolerance quickly and accurately. We also compare the performance of the algorithms without the influence of intra-layer interference pricing, that is, with \(c_i^k=0\). The intra-layer interference price factor restrains the selfishness of D2D pairs and improves the sum rate of D2D communications.
Sum rate of CUs
We simulate the sum rate of CUs under different interference tolerance levels. Figure 3 shows that the sum rate of CUs decreases as the interference tolerance level increases, because more D2D pairs are able to access the channels. We compare the performance of our proposed algorithms with that of the limited-area scheme. In that scheme, D2D pairs are not permitted to access any channel when their transmitters are within a restricted area with radius D centered at the BS. The limited-area scheme is adopted only as a benchmark for the proposed algorithms; thus, the average sum rate of the CUs under the limited-area scheme is drawn as a horizontal line. Although the throughput of CUs can be protected by increasing the radius of the restricted area, Fig. 2 shows that our algorithms achieve a higher sum rate of D2D pairs when the interference tolerance level is high.
Total system throughput
We compare our algorithms with two similar resource allocation algorithms, PBRA [34] and WOA [35], in terms of total system throughput. As shown in Fig. 4, on the one hand, the proposed algorithms DRASGAP and DRASGDP are sensitive to the interference tolerance level of CUs, and their total throughput varies more across interference tolerance levels than that of WOA and PBRA. This is because the optimal or suboptimal power and price are obtained during the access process of D2D pairs in each sub-link of the two game stages, which finally reach the Nash equilibrium. On the other hand, the total system throughput of the algorithms proposed in this study is considerably higher than that of the two comparison schemes at the same interference tolerance level. In particular, when \(Q_k=0\), the throughput of DRASGAP is 39.1% higher than that of WOA and 112.4% higher than that of PBRA. The intra-layer interference between the D2D pairs in WOA and PBRA is reflected only in the calculation of the D2D data rate, whereas the proposed algorithms introduce the intra-layer price factor into the followers’ game and reduce the adverse impact of intra-layer interference on the system throughput.
Power convergence of D2D links
Figure 5 shows the transmit power of five D2D pairs on a specific channel over the iterations, which reflects the power convergence of D2D links. In this experiment, the initial price of the CU is set to 0.1, and \(Q_k=0\). The transmit powers of the D2D links converge quickly. The converged transmit power of each D2D link depends on its channel gain. In the initial phase, all D2D links transmit at high power because the initial price is low. However, not all D2D links can maintain high power because of the interference constraint of CUs. In subsequent iterations, the interference from D2D pairs to the CU is quantitatively converted into a penalty through the inter-layer price. Therefore, D2D links gradually reduce their transmit power to avoid a loss of revenue, and this process continues until all D2D transmit powers converge.
Convergence of system throughput
We compare the convergence of system throughput in DRASGAP and DRASGDP with different interference tolerance levels. Figure 6 shows the convergence trends of the two algorithms. We set a higher initial value of the inter-layer price, \(\mu _k=0.45\). DRASGAP converges faster than DRASGDP at the same interference level. This is because DRASGAP uses the current optimal price as the guide price in each iteration of the followers’ game, so the power of D2D pairs in DRASGAP converges rapidly to the value that maximizes system throughput. Nevertheless, both DRASGAP and DRASGDP converge well: their system throughputs are stable after 7 and 10 iterations, respectively, when \(Q_k=10\). In addition, the convergence performance is related to the interference tolerance level: the larger \(Q_k\) is, the slower the convergence.
Conclusions
In this study, a distributed framework for joint channel allocation and power control is proposed for D2D communications underlaying cellular networks, where each uplink channel can be shared by one CU and multiple D2D pairs. To maximize the system throughput and guarantee the QoS of CUs, a heuristic channel allocation scheme based on potential data rate gain is designed to find a suitable channel matching between D2D pairs and CUs. The power control problem is formulated as a pricing-based Stackelberg game, where the BS acts as the leader. The BS sets the price of interference on each subchannel to suppress the inter-layer interference caused by D2D communications. Simulation results demonstrate that the proposed algorithms perform well in overall system throughput and convergence.
Availability of data and materials
Not applicable.
Abbreviations
D2D: Device-to-device
QoS: Quality-of-service
CUs: Cellular users
BS: Base station
BR: Best response
SE: Stackelberg equilibrium
KKT: Karush–Kuhn–Tucker
References
M. Wollschlaeger, T. Sauter, J. Jasperneite, The future of industrial communication: automation networks in the era of the internet of things and industry 4.0. IEEE Ind. Electron. Mag. 11(1), 17–27 (2017). https://doi.org/10.1109/MIE.2017.2649104
S. Zhai, Z. Qian, B. Yang, X. Wang, Data reconstructing algorithm in unreliable links based on matrix completion for heterogeneous wireless sensor networks. Int. J. Pattern Recognit. Artif. Intell. 33(6), 1–21 (2019). https://doi.org/10.1142/S0218001419510121
B. Holfeld, D. Wieruch, T. Wirth, L. Thiele, A. Ashraf, J. ShehzadHuschke, I. Aktas, J. Ansari, Wireless communication for factory automation: an opportunity for LTE and 5G systems. IEEE Commun. Mag. 54(6), 36–43 (2016). https://doi.org/10.1109/MCOM.2016.7497764
M. Chen, J. Wan, S. Gonzalez, X. Liao, V.C.M. Leung, A survey of recent developments in home M2M networks. IEEE Commun. Surv. Tutor. 16(1), 98–114 (2014). https://doi.org/10.1109/SURV.2013.110113.00249
M. Bagaa, A. Ksentini, T. Taleb, R. Jantti, A. Chelli, I. Balasingham, An efficient D2D-based strategies for machine type communications in 5G mobile systems, in 2016 IEEE Wireless Communications and Networking Conference, pp. 1–6 (2016). https://doi.org/10.1109/WCNC.2016.7564812
M. Ahmed, Y. Li, M. Waqas, M. Sheraz, D. Jin, Z. Han, A survey on socially aware device-to-device communications. IEEE Commun. Surv. Tutor. 20(3), 2169–2197 (2018). https://doi.org/10.1109/COMST.2018.2820069
Z. Qian, C. Tian, Y. Guo, X. Wang, The key technology and development of intelligent and connected transportation system. J. Electron. Inf. Technol. 42(1), 2–19 (2020)
M. Noura, R. Nordin, A survey on interference management for devicetodevice (D2D) communication and its challenges in 5G networks. J. Netw. Comput. Appl. 12(3), 130–150 (2016). https://doi.org/10.1016/j.jnca.2016.04.021
X. Wang, Z. Qian, Y. Cong, X. Wang, Joint channel and power allocation based on stackelberg for D2D communications in cellular networks, in 2020 IEEE Globecom Workshops (GC Workshops), pp. 1–6 (2020). https://doi.org/10.1109/GCWkshps50303.2020.9367552
Y. Shen, C. Jiang, T.Q.S. Quek, Y. Ren, Devicetodeviceassisted communications in cellular networks: an energy efficient approach in downlink video sharing scenario. IEEE Trans. Wirel. Commun. 15(2), 1575–1587 (2016). https://doi.org/10.1109/TWC.2015.2492967
B. Zhang, Y. Wang, Q. Jin, M.A. Jianhua, Energyefficient architecture and technologies for device to device (D2D) based proximity service. China Commun. 12(12), 32–42 (2015)
P. Li, S. Guo, I. Stojmenovic, A truthful double auction for devicetodevice communications in cellular networks. IEEE J. Sel. Areas Commun. 34(1), 71–81 (2016). https://doi.org/10.1109/JSAC.2015.2452587
X. Li, J. Li, W. Liu, Y. Zhang, H. Shan, Groupsparsebased joint power and resource block allocation design of hybrid devicetodevice and LTEadvanced networks. IEEE J. Sel. Areas Commun. 34(1), 41–57 (2016). https://doi.org/10.1109/JSAC.2015.2452451
M. Lin, J. Ouyang, W.P. Zhu, Joint beamforming and power control for devicetodevice communications underlaying cellular networks. IEEE J. Sel. Areas Commun. 34(1), 138–150 (2016). https://doi.org/10.1109/JSAC.2015.2452491
N. Abedini, S. Tavildar, J. Li, T. Richardson, Distributed synchronization for devicetodevice communications in an LTE network. IEEE Trans. Wirel. Commun. 15(2), 1547–1561 (2016). https://doi.org/10.1109/TWC.2015.2492959
Y. Chen, B. Ai, Y. Niu, K. Guan, Z. Han, Resource allocation for devicetodevice communications underlaying heterogeneous cellular networks using coalitional games. IEEE Trans. Wirel. Commun. 17(6), 4163–4176 (2018). https://doi.org/10.1109/TWC.2018.2821151
K.J. Zou, M. Wang, K.W. Yang, J. Zhang, W. Sheng, Q. Chen, X. You, Proximity discovery for devicetodevice communications over a cellular network. IEEE Commun. Mag. 52(6), 98–107 (2014). https://doi.org/10.1109/MCOM.2014.6829951
H. Pang, W. Ping, X. Wang, F. Liu, N.N. Van, Joint mode selection and resource allocation using evolutionary algorithm for devicetodevice communication underlaying cellular networks. J. Commun. 8(11), 751–757 (2013). https://doi.org/10.12720/jcm.8.11.751757
T.D. Hoang, L.B. Le, T. LeNgoc, Energyefficient resource allocation for D2D communications in cellular networks. IEEE Trans. Veh. Technol. 65(9), 6972–6986 (2016). https://doi.org/10.1109/TVT.2015.2482388
Y. Li, Z. Kaleem, K. Chang, Interferenceaware resourcesharing scheme for multiple D2D group communications underlaying cellular networks. Wirel. Pers. Commun. 90(2), 749–768 (2016). https://doi.org/10.1007/s1127701632032
Y. Xiao, D. Niyato, K. Chen, Z. Han, Enhance devicetodevice communication with social awareness: a beliefbased stable marriage game framework. IEEE Wirel. Commun. 23(4), 36–44 (2016). https://doi.org/10.1109/MWC.2016.7553024
F. Wang, C. Xu, L. Song, Z. Han, Energyefficient resource allocation for devicetodevice underlay communication. IEEE Trans. Wirel. Commun. 14(4), 2082–2092 (2015). https://doi.org/10.1109/TWC.2014.2379653
C. Xu, L. Song, Z. Han, Q. Zhao, X. Wang, X. Cheng, B. Jiao, Efficiency resource allocation for devicetodevice underlay communication systems: a reverse iterative combinatorial auction based approach. IEEE J. Sel. Areas Commun. 31(9), 348–358 (2013). https://doi.org/10.1109/JSAC.2013.SUP.0513031
S. Wang, W. Guo, Z. Zhou, Y. Wu, X. Chu, Outage probability for multihop D2D communications with shortest path routing. IEEE Commun. Lett. 19(11), 1997–2000 (2015). https://doi.org/10.1109/LCOMM.2015.2475428
R. Zhang, X. Cheng, L. Yang, B. Jiao, Interference graphbased resource allocation (INGRA) for D2D communications underlaying cellular networks. IEEE Trans. Veh. Technol. 64(8), 3844–3850 (2015). https://doi.org/10.1109/TVT.2014.2356198
H. Zhang, L. Song, Z. Han, Radio resource allocation for devicetodevice underlay communication using hypergraph theory. IEEE Trans. Wirel. Commun. 15(7), 4852–4861 (2016). https://doi.org/10.1109/TWC.2016.2547862
A. Abrardo, M. Moretti, Distributed power allocation for D2D communications underlaying/overlaying of DMA cellular networks. IEEE Trans. Wirel. Commun. 16(3), 1466–1479 (2017). https://doi.org/10.1109/TWC.2016.2646360
D.H. Lee, K.W. Choi, W.S. Jeon, D.G. Jeong, Twostage semidistributed resource management for devicetodevice communication in cellular networks. IEEE Trans. Wirel. Commun. 13(4), 1908–1920 (2014). https://doi.org/10.1109/TWC.2014.022014.130480
M. Hasan, E. Hossain, Distributed resource allocation in d2denabled multitier cellular networks: an auction approach, in 2015 IEEE International Conference on Communications (ICC), pp. 2949–2954 (2015). https://doi.org/10.1109/ICC.2015.7248775
N.H. Almofari, S. Kishk, F.W. Zaki, Auction based algorithm for distributed resource allocation in multitierheterogeneous cellular networks, in 2016 11th International Conference on Computer Engineering Systems (ICCES), pp. 426–433 (2016). https://doi.org/10.1109/ICCES.2016.7822042
L. Song, D. Niyato, Z. Han, E. Hossain, Gametheoretic resource allocation methods for devicetodevice communication. IEEE Wirel. Commun. 21(3), 136–144 (2014). https://doi.org/10.1109/MWC.2014.6845058
Q. Ye, B. Rong, Y. Chen, M. AlShalash, C. Caramanis, J.G. Andrews, User association for load balancing in heterogeneous cellular networks. IEEE Trans. Wirel. Commun. 12(6), 2706–2716 (2013). https://doi.org/10.1109/TWC.2013.040413.120676
W. Yu, G. Ginis, J.M. Cioffi, Distributed multiuser power control for digital subscriber lines. IEEE J. Sel. Areas Commun. 20(5), 1105–1115 (2002). https://doi.org/10.1109/JSAC.2002.1007390
R. Yin, G. Yu, H. Zhang, Z. Zhang, G.Y. Li, Pricingbased interference coordination for D2D communications in cellular networks. IEEE Trans. Wirel. Commun. 14(3), 1519–1532 (2015). https://doi.org/10.1109/TWC.2014.2368151
Y. Sun, F. Wang, Z. Liu, Coalition formation game for resource allocation in D2D uplink underlaying cellular networks. IEEE Commun. Lett. 23(5), 888–891 (2019). https://doi.org/10.1109/LCOMM.2019.2908152
Acknowledgements
Not applicable.
Funding
This research is supported by the Science and Technology Research Project of the Education Department of Jilin Province, ‘Research on Resource Allocation Algorithm for D2D Communication underlaying Cellular Networks based on Game Theory and Convex Optimization’ (No. JJKH20220329KJ).
Author information
Contributions
XW proposed the decentralized framework of joint channel-power allocation for D2D communications and solved the system throughput optimization problem using the KKT conditions and a Stackelberg game. HP designed the experiments and obtained important data to support the conclusions. YS analyzed and interpreted the simulation data. All authors read and approved the final manuscript.
Ethics declarations
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wang, X., Pan, H. & Shi, Y. Distributed resource allocation for D2D communications underlaying cellular network based on Stackelberg game. J Wireless Com Network 2022, 35 (2022). https://doi.org/10.1186/s13638-021-02055-6
DOI: https://doi.org/10.1186/s13638-021-02055-6
Keywords
 Device-to-device (D2D)
 Stackelberg game
 Channel selection
 Power control
 Resource allocation