- Research
- Open Access

# Two novel price-based algorithms for spectrum sharing in cognitive radio networks

- Meng-Dung Weng
^{1}, - Bih-Hwang Lee
^{1}Email author and - Jhih-Ming Chen
^{2}

**2013**:265

https://doi.org/10.1186/1687-1499-2013-265

© Weng et al.; licensee Springer. 2013

**Received:**31 July 2013**Accepted:**5 November 2013**Published:**14 November 2013

## Abstract

Cognitive radio network is expected to use flexible radio frequency spectrum sharing techniques for achieving more efficient frequency spectrum usage. In this article, we consider the spectrum sharing problem that one primary user (PU) can share its frequency spectrum by renting this spectrum to multiple secondary users (SUs). The pricing scheme is a key issue for spectrum sharing in cognitive radio network. We first propose a nonlinear one-leader–multiple-follower (NLMF) sharing spectrum scheme as a multi-object optimization problem; the prices are offered by PU to SUs at the same time. This problem can be solved using particle swarm optimization (PSO); SUs gradually and iteratively adjust their strategies respectively based on the observations on their opponents' previous strategies until Nash equilibrium is completed. We then present a general nonlinear bilevel one-leader-multiple-follower (NBMF) optimization problem to further consider the revenue of the PU and a new optimal strategic pricing optimization technique which applies bilevel programming and swarm intelligence. A leader-follower game is formulated to obtain the Stackelberg-Nash equilibrium for spectrum sharing that considers not only revenue of a PU but also the SUs utility. We develop a swarm particle algorithm to iteratively solve the problem defined in the NBMF decision model for searching the strategic pricing optimization. The behaviors of two pricing models have been evaluated, and the performance results show that the proposed algorithms perform well to solve the spectrum sharing in a cognitive radio network.

## Keywords

- Spectrum sharing
- Cognitive radio
- Nash equilibrium
- Bilevel programming
- Swarm particle algorithm
- Strategic pricing optimization

## 1. Introduction

According to the regulations of the Federal Communication Commission (FCC) [1], a large portion of the unutilized priced frequency spectrum and the scarcity in spectrum resource should be used by providing tools to utilize spectrum holes [2]. Recently, cognitive radio (CR) provides great flexibility by extending software radio to improve spectrum utilization [2–6], which is now regarded as a hopeful wireless communication system. Primary users (licensed users) are willing to share frequency spectrum with secondary users (unlicensed users) which can adaptively adjust the transmission parameters to satisfy the requirements of quality of service (QoS) according to the environment information and opportunistically access those available frequency bands not occupied by primary users.

By this way, we can use the spectrum resource to enhance the system performance. We consider spectrum sharing as a spectrum trading process; therefore, price not only acts an important role in spectrum trading but also is an effective way to improve system utilization and performance to maximize the primary users' profits. The spectrum trading process indicates the values of both spectrum pricing and purchasing by allowing the spectrum trading between secondary users (SUs) and primary users (PUs). The price paid by SU to PU depends on the satisfaction to use that spectrum, while the price determines the PU's revenue.

In this article, we address the spectrum sharing problem in a cognitive radio environment, which consists of several SUs to compete with each other to demand opportunistic spectrum access to a single PU. We formulate this situation as an oligopoly market, where a few firms (i.e., SUs) compete with each other in terms of amount of commodity (i.e., the frequency spectrum) supplied to the market (i.e., PU) to gain the greatest profit.

A noncooperative game is used to analyze two situations, one case is that SUs receive the offered price by the PU and maximize their payoffs by the amount of their demand spectrum. Another case is that the main objection is to maximize the profit and revenue for all SUs and the PU, respectively. For these scenarios, we apply a dynamic game in which the selection of strategy by an SU is entirely determined by the pricing information obtained from the PU. Based on this information, each SU gradually adapts the size of its spectrum sharing which is controlled by the price function. To solve the corresponding optimization problem, we search the strategy space using swarm particle to identify the optimum behavior.

Compared with the current research on the spectrum sharing in a cognitive radio environment, the major contributions of this article are as follows. First, the idea is to bring together swarm particle algorithms and Nash strategy, and make the swarm particle algorithm to build the Nash equilibrium (NE). Nash-particle swarm optimization (PSO) is an alternative for multiple-objective optimizations as it is an optimization tool based on noncooperative game theory. Second, by applying bilevel techniques in the cognitive radio markets, we propose the concept and related definitions of nonlinear bilevel one-leader-multiple-follower (NBMF) decision problem and use swarm particle algorithm in designing the spectrum sharing scheme among PU and SUs for CR networks. We also build up a nonlinear NBMF decision model for strategic pricing problems, where the Stackelberg-Nash equilibrium is regarded as the solution.

The rest of this article is organized as follows. Section 2 addresses the related works on dynamic spectrum sharing in cognitive radio networks (CRN). Section 3 proposes the general NBMF decision model and introduces the Stackelberg-Nash equilibrium. We present the system model and describe the spectrum sharing and pricing strategy problem in Section 4. We develop Nash-PSO and NBMF-PSO algorithm as solutions to the NLMF, and the NBMF problems in Section 5. Then, we verify the effectiveness of the proposed algorithm to validate the NLMF and the NBMF decision model using simulation in Section 6, and finally we make conclusions in Section 7.

## 2. Background and related works

## 3. Mathematic descriptions of NBMF problems

### 3.1 Problem statements

The BLPP is regarded as an uncooperative, two-person game, as introduced by Von Stackelberg [42] in 1952. In the basic model, the decision variables are partitioned among two players who seek to optimize their individual utility functions. The bilevel programming techniques are mainly developed for solving decentralized management problems with decision makers in a hierarchical organization. A decision maker is known as the leader at the upper level, and it is known as the follower at the lower level. Each leader or follower optimizes his objective function with or without considering the objective of the other level, but the decision of each level affects the optimization of the other level.

Usually, in a real-world situation, there is more than one follower in the lower level. This type of the hierarchical structure is called a bilevel multi-follower (BLMF) decision making model. However, the different relationships among these followers might force the leader to use multiple different processes in deriving an optimal solution for leader decision making. Therefore, the leader's decision will be affected not only by the reactions of these followers, but also by the relationships among these followers. In general, there are three kinds of relationships among the followers: cooperative condition, uncooperative condition, and partial cooperative condition [43].

*N*followers in a bilevel decision system. Let

*x*and

*y*

_{ i }be the decision variables of the leader and the

*i*th follower, for

*i*= 1, 2, …,

*N*, respectively. We also assume that the objective functions of the leader and the

*i*th follower are

*F*(

*x, y*

_{ 1 }

*,…, y*

_{ N }) and

*f*

_{ i }(

*x, y*

_{ 1 }

*,…, y*

_{ N }), for

*i*= 1, 2, …,

*N*, respectively, while

*G*and

*g*are the vector valued functions of

*x*and the set of (

*y*

_{ 1 }

*,…, y*

_{ N }). The sets of

*X*and

*Y*represent the search spaces in the upper and lower bounds on the elements of the vectors

*x*and

*y*

_{ i }. An NBMF problem with one leader and

*N*followers is introduced with some related definitions and notations as defined in (1), where

- (a)Denote the constraint region of the NBMF problem by$\begin{array}{l}S=\left\{\left(x,{y}_{1},\dots ,{y}_{N}\right)\in X\times {Y}_{1}\times \dots \times {Y}_{N},G\left(x,{y}_{1},\dots ,{y}_{N}\right)\right.\\ \phantom{\rule{2em}{0ex}}\le 0,\left(\right)close="\}">g\left(x,{y}_{1},\dots ,{y}_{N}\right)\le 0,i=1,2,\dots ,N& .\end{array}$
- (b)Projection of
*S*onto the leader's decision space is defined as$\begin{array}{l}S\left(X\right)=\left\{x\in X:\exists y{}_{i}\in {Y}_{i},G\left(x,{y}_{1},\dots ,{y}_{N}\right)\le 0,g\left(x,{y}_{1},\dots ,{y}_{N}\right)\right.\\ \phantom{\rule{3.5em}{0ex}}\left(\right)close="\}">\le 0,i=1,2,\dots ,N& .\end{array}$ - (c)Denote the feasible region of the follower's problem for each fixed
*x*∈*S*(*X*) by$\begin{array}{l}{S}_{i}\left(x\right)=\left\{{y}_{i}\in {Y}_{i}:\left(x,{y}_{1},\dots ,{y}_{N}\right)\in S,g\left(x,{y}_{1},\dots ,{y}_{N}\right)\right.\\ \phantom{\rule{4em}{0ex}}\left(\right)close="\}">\le 0,i=1,2,\dots ,N& .\end{array}$ - (d)Let
*P*_{ i }(*x*) be the follower's rational reaction set for*x*∈*S*(*X*), which is defined as$\begin{array}{l}{P}_{i}\left(x\right)=\left\{{y}_{i}\in {Y}_{i}:{y}_{i}\in argmin\left[{f}_{i}\left(x,{\stackrel{\wedge}{y}}_{i}\right):{\stackrel{\wedge}{y}}_{i}\in {S}_{i}\left(x\right)\right]\right\},\\ \phantom{\rule{2em}{0ex}}i=1,2,\dots ,N,\end{array}$where$\begin{array}{l}argmin\left[{f}_{i}\left(x,{\stackrel{\wedge}{y}}_{i}\right):\stackrel{\wedge}{{y}_{i}}\in {S}_{i}\left(x\right)\right]\\ \phantom{\rule{2em}{0ex}}=\left[{y}_{i}\in {S}_{i}\left(x\right):{f}_{i}\left(x,{y}_{i}\right)\le {f}_{i}\left(x,{\stackrel{\wedge}{y}}_{i}\right),\stackrel{\wedge}{{y}_{i}}\in S\left(x\right)\right],\\ \phantom{\rule{1.5em}{0ex}}i=1,2,\dots ,N.\end{array}$The followers observe the leader's action and simultaneously react by selecting

*y*_{ i }from their feasible set to minimize their objective function. - (e)
Denote the inducible region by

*IR*= {(*x*,*y*_{1}, …*y*_{ N }) : (*x*,*y*_{1}, …,*y*_{ N }) ∈*S*,*y*_{ i }∈*P*_{ i }(*x*),*i*= 1, …,*N*}.

*P*(

*x*) defines the response, while the inducible region

*IR*represents the set, which the leader may optimize his objective. The leader's problem is then to optimize its objective function over the inducible region, the NBMF problem can be written as

### 3.2 Nash equilibrium and Stackelberg-Nash equilibrium

*N*objectives, a Nash strategy consists of

*N*players, each optimizes his own strategy. However, each player has to optimize his strategy, given that all the other strategies are fixed by the rest of the players. When no player can further improve his strategy, it means that the system has reached an equilibrium state called Nash equilibrium (NE). The bilevel programming is a multiple person noncooperative game with leader-follower strategy. In a game with

*N*followers,

*y*

_{ i }∈

*Y*

_{ i }is a strategy of follower, and

*y*

_{− i}= (

*y*

_{1}, …,

*y*

_{i − 1},

*y*

_{i + 1}, …,

*y*

_{ N }) ∈

*Y*

_{− i}is the set of others' strategy. If

*x*,

*y*

_{1}, …,

*y*

_{i − 1},

*y*

_{i + 1}, …,

*y*

_{ N }are revealed by the leader and the other followers, then the reaction ${y}_{i}^{*}$ of the

*i*th follower must be the optimal solution of the follower's objection function as shown in (2)

*x*is the Nash equilibrium of followers as shown in (3), for any $\left({y}_{1}^{*},\dots ,{y}_{i-1}^{*},{y}_{i},{y}_{i+1}^{*},\dots ,{y}_{N}^{*}\right)\in Y$ and

*i*= 1,2,…,

*N*. There is a unique Nash equilibrium that all the followers might make such equilibrium because no follower can further improve his own objective by altering his strategy unilaterally. The Stackelberg-Nash equilibrium of the bilevel programming is discussed by [44]. Let us define $\left({x}^{*},{y}_{1}^{*},\dots ,{y}_{i}^{*},\dots ,{y}_{N}^{*}\right)$ as the profile strategies with

*x** ∈

*X*, where $\left({y}_{1}^{*},\dots ,{y}_{i}^{*},\dots ,{y}_{N}^{*}\right)$ is a Nash equilibrium of followers with respect to

*x*

^{ * }, and the profile strategies $\left({x}^{*},{y}_{1}^{*},\dots ,{y}_{i}^{*},\dots ,{y}_{N}^{*}\right)$ are a Stackelberg-Nash equilibrium to the bilevel programming (1) if and only if (4) is satisfied for any $\stackrel{\wedge}{x}\in X$ and the Nash equilibrium $\left(\stackrel{\wedge}{{y}_{1}},\dots ,\stackrel{\wedge}{{y}_{i}},\dots {\stackrel{\wedge}{y}}_{N}\right)$ with respect to $\stackrel{\wedge}{x}$.

## 4. System model

*N*SUs as shown in Figure 1. When a frequency spectrum is unoccupied by its corresponding PU, it can sell portions of the available spectrum

*b*

_{i}(e.g., time slots in a time division multiple access (TDMA)-based wireless access system) to the SU

_{ i }(

*i*= 1, 2,…,

*N*) at the offered price

*p*. Both PU and SUs use adaptive modulation for wireless transmission. The spectrum demand of SUs depends on the transmission rate achieved due to the adaptive modulation in the allocated frequency spectrum and the price charged by PU. After obtaining the right to use the spectrum, SU uses an adaptive modulation to improve the performance.

_{ i },

*k*

_{ i }, can be obtained by (5) [46], where γ

_{ i }is the received signal-to-noise ratio (SNR) for SU

_{ i }; and $\mathit{BE}{R}_{i}^{\mathrm{tar}}$ is the bit error rate (BER) at the target level, which is to guarantee the required quality of transmission.

*P*(

**b**) be the price function, which can be obtained as shown in (6) [24], where

*b*

_{it}is the size of spectrum demanded by SU

_{ i }at time

*t*, and

**b**= {

*b*

_{ i }|

*i*= 1, …,

*N*} is the demand vector. Let us denote

*l*and

*q*as a fixed payment and the elasticity (i.e., slope) of the price function, respectively, while

*l*

_{ t }and

*q*

_{ t }are the values of

*l*and

*q*at time

*t*, respectively. Therefore, the price function is convex if

*l*

_{ t },

*q*

_{ t }, and

*α*are assumed to be positive and greater than one; moreover, the price function is linear if

*α*= 1. The item $\left({b}_{\mathit{it}}+{\displaystyle \sum _{j\ne i}{b}_{\mathit{jt}}}\right)$ is the total sharing spectrum at time

*t*. Consequently, the PU always determines the price function for the shared frequency spectrum in terms of the amount of spectrum demanded by SUs. The demand for the spectrum is larger, and the PU will charge a higher price in cognitive radio environment as

*α*= 1. Let us denote

*w*

_{ 1 }and

*w*

_{ 2 }to be the worth values of the spectrum for the PU and SU, respectively; then, it is necessary for the condition ${w}_{1}\times {\displaystyle \sum _{j\in \mathbf{b}}{b}_{j}}<P\left(\mathbf{b}\right)<{w}_{2}\times {\displaystyle \sum _{j\in \mathbf{b}}{b}_{j}}$ to ensure that the PU is willing to share the spectrum with SUs, while

*N*SUs are willing to buy the spectrum. In the NLMF mode, the price is determined by PU, and the values of

*w*

_{ 1 }and

*w*

_{ 2 }are set to 1 and 0, respectively. In the NBMF mode, the PU would never have predominated, and the part of its price will be transferred to the SUs. We get the values of

*w*

_{ 1 }and

*w*

_{ 2 }by estimating the values of

*l*

_{ t }and

*q*

_{ t }. Then, we can refer to this price and use the other spectrum directly by signing the contract without competition. A contract between PU and SUs ensures that PU will not deviate because of selfishness.

SUs define their spectrum demands by maximizing their payoff functions. Each SU's payoff function is defined as the subtraction between the earned revenue and the paid cost for sharing frequency spectrum with the PU. The revenue of SU_{
i
} can be obtained by *R*_{
i
} × *k*_{
i
} × *b*_{
i
}, while the cost of spectrum sharing is *b*_{
i
}*P*(**b**), where *R*_{
i
} is the user revenue of achievable per unit transmission rate. Let us define *U*_{
i
}(**b**) to be the profit function of each SU_{
i
}, which can be obtained by (7) [24].

*b*

_{ i }for

*i*= 1, …,

*N*. To obtain Nash equilibrium, we have to mathematically solve the marginal profit of SU

_{ i }. That is, we can derive the profit function with respect to the shared spectrum size and set it to zero as shown in (8). Each SU then solves its own equation to find the amount of its spectrum demand:

**b**

_{ −i }to be the set of strategies adopted by all SUs except SU

_{ i }, where

**b**

_{ −i }= {

**b**

_{ j }|

*j*=

*1*,

*2*, …,

*N*for

*j*≠

*i*} and

**b**=

**b**

_{ −i }∪ {

**b**

_{ i }}. BR

_{ i }is denoted as the BR function of SU

_{ i }

*,*given the size of the spectrum sharing by other SU's

**b**

_{ j }, which is defined as (9). The set ${\mathbf{b}}^{*}=\left\{{b}_{1}^{*},\dots ,{b}_{N}^{*}\right\}$ denotes the Nash equilibrium of any game if and only if (10) is satisfied, where ${\mathbf{b}}_{-i}^{*}$ denotes the set of BR

_{ j }for all SUs except SU

_{ i }:

*.*Subsequently, to relax the supposition, a dynamic game model is presented for which the information of the other SUs is unknown to a specific SU. In this scenario, we regard nonlinear one-leader-multiple-follower (NLMF) model as a nonlinear multiple-objective optimization problem, so it can be modeled as a dynamic game by Nash-PSO algorithms to adjust the requested spectrum size. The objective of the NLMF model is to maximize the profit function of all respective SUs, which is defined as (11). When the channel quality is getting better, the transmission rate can be higher; accordingly, the demand and the offered price for channels will increase. The value of

*b*

_{t max}cannot make the supply short of demand, while the value of

*b*

_{t min}is zero as the channel quality is very bad, which implies that the SU has no demand for the channel.

*l*

_{ t }and

*q*

_{ t }are introduced to measure the negative impact from SUs to PU and find a feasible pricing region to guarantee the primary service and satisfactory for SUs. If

*l*

_{ t }and

*q*

_{ t }are given among a predefined range, a feasible pricing region can be found to guarantee that it may not produce the negative values from the payoffs of the SUs, respectively:

For the spectrum trading, we consider two different pricing models, NLMF and NBMF, which are to hierarchically describe the strategic pricing problems in competitive cognitive radio markets. The NLMF model first considers the case that PU offers his price to SUs simultaneously. We use the Nash-PSO algorithm to build Nash equilibrium for multiple-objective optimization. In the NBMF model, SUs can observe the pricing strategy of PU and adapt their strategies accordingly from a bilevel angle. Therefore, we use an NBMF-PSO algorithm to estimate and adapt the strategies of SUs to achieve the best response and the strategy of PU to obtain the optimal solution. The Nash-PSO and NBMF-PSO algorithms will be discussed in the next section.

## 5. NASH-PSO and NBMF-PSO algorithms

*n*-dimensional space, where its

*m*th particle can be represented by a vector

*X*

_{ m }= (

*x*

_{m 1},

*x*

_{m 2},…,

*x*

_{ mn }), and it is treated as a potential solution that explores the search space by rate of position change called velocity, denoted by

*V*

_{ m }= (

*v*

_{m 1},

*v*

_{m 2}, …,

*v*

_{ mn }). Let pb

_{ m }be represented as the personal best position of any particle, i.e., pb

_{ m }= (pb

_{m 1}, pb

_{m 2}, …, pb

_{ mn }), which is in accord with the position in search space where particle had the lowest (or highest) value as determined by the fitness function. In addition, gb is defined to be the global best position of particle in swarm, which yields the best position of all particles in its neighborhood. The particle update method lies in accelerating each particle towards the optimum value based on its present velocity, its previous experience, and the experience of its neighbors within a reasonable time limit. Let us denote

*M*and

*w*to be the size of the swarm and the inertia weight used to balance the global and local search abilities, respectively. The position and velocity of each particle can be updated according to (15) and (16) [41], where

*m*(

*m*= 1, 2,…,

*M*) and

*d*(

*d*= 1, 2,…,

*n*) are for the

*m*th particle and the

*d*-dimensional vector, respectively, while

*c*

_{1}and c

_{2}are the positive constants;

*r*

_{1}and

*r*

_{2}are two uniformly distributed random numbers in the range [0,1]; and

*k*denotes the iteration number. For simplicity and immunity to the global optimum, the PSO algorithm is employed in this article to develop a Nash-PSO algorithm and an NBMF-PSO algorithm to reach an optimal solution for the NLMF and NBMF strategic pricing problem in an oligopoly market:

### 5.1 Nash-PSO algorithm

Nash presents a multiple-objective optimization problem which originated from the game theory and economics in 1950 [50, 51]. SUs define their spectrum demands from the single PU based on the NLMF model. It uses Nash-PSO algorithm to find the Nash equilibrium and maximizes its payoffs in a distributed fashion. In a practical cognitive radio environment, each SU has the knowledge of its payoffs and costs, but it does not know about the strategies and profits of the other SUs. The obtained profit of each SU is calculated based on the opponent's previous strategies about the optimal strategies which observe the pricing information from the PU; hence, we have to achieve the Nash equilibrium for each SU based on the interaction with the PU only. In this case, each SU can communicate with the PU to obtain the discriminated price function for different strategies. It is supposed if all SUs are intelligent, then they can apply the proposed approach to be aware of their opponent's payoff function and try to maximize their revenue by acknowledging the opponent's strategies. In the Nash-PSO algorithm, we first fix the *X* variables for leader (i.e., PU) and initiate a swarm to produce the followers' (i.e., SUs') decision variable (*Y*-particles), each of which has a velocity. Both their numbers are randomly distributed among a pre-defined range. The proposed Nash-PSO algorithm is an iterative algorithm which is to search the Nash equilibrium from the SUs by solving the NLMF model (11) as summarized in Algorithm 1. The global Nash equilibrium of the problem can be obtained if the iterations converge to a single point, because none of the players can gain more profit just by changing his strategic variable.

### 5.2 NBMF-PSO algorithm

**Important notations used in this article**

Notation | Meaning |
---|---|

| The number of particles for the leader |

| The number of particles for followers |

| The |

| The velocity of |

| The follower's choice for each |

| The |

| The velocity of |

pb | The best previously visited solution of |

pb | The best previously visited solution of |

| Current best solution for particle |

| Current best solution for particle |

| The |

| The |

${y}_{i}^{*}$ | The |

| Current iteration number for the upper-level problem |

| Current iteration number for the lower-level problem |

max_ | The predefined max iteration number for |

max_ | The predefined max iteration number for |

In this NBMF-PSO algorithm, we first initiate a swarm to produce the decision variable (*X*-particles) for PU and generate a population (*Y*-particles) for the followers, while the corresponding velocities are random numbers distributed among a pre-defined range. We then bring the *X*-particles to the lower-level problem of the NBMF model and use the Nash-PSO algorithm to generate the Nash equilibrium point from the followers by solving (14). After obtaining the best responses of the *Y*-particles from the followers, the leader's objective values for each decision variable of the *X*-particles can be calculated. To utilize the PSO strategy again, we obtain the leader's optimal strategy and find the solution changes for several consecutive generations which are smaller than a predefined value; hence, the Stackelberg-Nash equilibrium for the whole NBMF problem can be obtained. The detailed NBMF-PSO algorithm consists of two parts, Algorithm 2 and Algorithm 3, which generate the best response from SUs and the optimal strategies for PU, respectively, as specified as follows:

## 6. Performance evaluation

In this section, we present two different spectrum sharing models. We employ a strategic pricing problem in a CR market to test the NLMF model with Nash-PSO algorithm and the NBMF decision model with NBMF-PSO algorithm developed in this article. We consider the cognitive radio environment with one PU and two SUs sharing a frequency spectrum of 15 MHz. We use the same parameters and the same method as [24] compared with the Nash-PSO algorithm. We obtain the same result with [24] dynamic game.

### 6.1 The Nash-PSO algorithm

We consider a cognitive radio environment with one PU and three SUs sharing a frequency spectrum *B*^{tot} = 25 MHz. The target BER (BER^{tar}) is equal to 10^{−4} for three SUs. The revenue of an SU per unit transmission rate (*R*_{
i
}) is 12 for each user. For the price function of PU, we use *l* = 0, *q* = 1, *w*_{
1
} = 1, *w*_{
2
} = 0, *b*_{t min} = 0, and *b*_{t max} = 10.

_{1}(

*γ*

_{1}= 7 dB), SU

_{2}(

*γ*

_{2}= 8 dB), and SU

_{3}(

*γ*

_{3}= 9 dB), while Figure 4 uses the parameters SU

_{1}(

*γ*

_{1}= 8 dB), SU

_{2}(

*γ*

_{2}= 9 dB), and SU

_{3}(

*γ*

_{3}= 10 dB). The best response of each SU is a linear function of the other user's strategy. For the three user scenario, the best response function for each player is a plane, and the Nash equilibrium is located at the intersection point of three planes. For different channel quality, the Nash equilibrium will locate at the different places. SU can obtain higher transmission rate from the same spectrum size by adaptive modulation; hence, an SU with better spectral efficiency prefers to have a larger spectrum size to gain higher profit. In addition, the trajectory of spectrum sharing is shown for Nash-PSO best strategies in all iterations converging to the Nash equilibrium, which is considered to be the solution of the spectrum sharing scheme.

_{1}shares a larger spectrum size with the PU and achieves a higher revenue when its channel quality becomes better. In these figures, we considered a fixed channel quality for SU

_{2}(

*γ*

_{2}= 7 dB) and SU

_{3}(

*γ*

_{3}= 8 dB), while the quality of SU

_{1}changes from 5 to 11 dB. The size of shared spectrum and revenue proposed by SU

_{2}are higher than those proposed by SU

_{1}, as the channel quality of SU

_{1}is less than 7 dB. Similarly, the size of the shared spectrum and revenue proposed by SU

_{3}are higher than those proposed by SU

_{1}, as the channel quality of SU

_{1}is less than 8 dB. By improving the channel quality of SU

_{1}, the shared spectrum sizes and revenue offered by the SU

_{1}will be higher, as the channel quality of SU

_{1}is greater than 8 dB. The channel quality of an SU will impact the allocated spectrum size and revenue for the other SUs, which replies the impact of competitive strength among the SUs' strategies.

_{1}(

*γ*

_{1}= 8 dB), SU

_{2}(

*γ*

_{2}= 9 dB), and SU

_{3}(

*γ*

_{3}= 10 dB). The spectrum sharing gradually converges to the Nash equilibrium, where the Nash equilibrium point is at (1.7038, 4.0268, 6.5675).

### 6.2 The NBMF-PSO algorithm

*B*

^{tot}= 25 MHz. The BER

^{tar}is equal to 10

^{−4}for three SUs. The revenue of an SU per R

_{ i }is 12 for each user. As an example, we assume that PU is fixed at (0, 0), and SU

_{1}, SU

_{2}, and SU

_{3}are movable and begin at (0,102), (104, 0), and (0, −106), respectively. At

*t*= 0, SU

_{1}, SU

_{2}, and SU

_{3}move along a straight line at the same velocity of 0.085 m/s starting from (0, 102), (104, 0), and (0, −106) to (91.8, 102), (104, −91.8), and (−91.8, −106), respectively, by applying the NBMF-PSO algorithm as shown in Figure 8.

*σ*

^{2}= 10

^{−11}W is considered at the input of the receiver. For SU

_{ i }, its SNR can be calculated by (17), where

*P*

_{ i }is the transmit power of SU

_{ i }, and

*d*

_{ i }is the distance between SU

_{ i }and PU:

In the simulation, we consider an 18-interval dynamic game for simulations. To simplify the computation, the limit of the coefficients does not vary by different time slots. To design the values of *l*_{t min}, *l*_{t max}, *q*_{t min}, and *q*_{t max}, it is necessary to ensure that the PU is willing to share spectrum with the SUs, where we set *w*_{
1
} = 0.62, *w*_{
2
} = 0.72, *l*_{t min} = 0.154, *l*_{t max} = 0.679, *q*_{t min} = 0.523, *q*_{t max} = 0.627, *b*_{t min} = 0, and *b*_{t max} = 10 to satisfy this condition. However, this article does not discuss about the lendable time of the spectrum price, but it discusses the time variety of different channel qualities between the different prices of the NLMF and NBMF models. In the NBMF model, the price is $13.30 per unit spectrum as the channel quality is the best, while the price is $4.63 per unit spectrum as the channel quality is the worst.

When the cost of the spectrum offered by the PU is not higher than the revenue gained from the allocated spectrum, an SU is willing to stay the shared spectrum with the competition. Since the location of PU is fixed, the channel quality varies with time as SU_{1}, SU_{2}, and SU_{3} move, and its SNR and spectral efficiency vary as well. This variation in spectral efficiency affects the amount of spectrum demanded in each interval by SU_{1}, SU_{2}, and SU_{3.}

*l*= 0 and

*q*= 1 values, only to acquire the optimum utility function (11) of SUs. In the NBMF model, we further consider the profit of PU. By adapting the values of the coefficients

*l*and

*q*of the price function, the PU's slope of the price function can be changed, which helps PU achieve more demands from SUs in each interval of the dynamic spectrum sharing game. With the aim of maximizing both PU and SUs' profits, PU decides how many spectrums are to be lent based on the price each SU pays for the spectrum. The NBMF model obtains more size of sharing frequency than the NLMF model, and enhances the utility of the resource. Figure 9 shows the running results for

*l*

_{ t }and

*q*

_{ t }in the NBMF model.

*q*is small in the NBMF model, the demand of the sharing spectrum size becomes larger by each SU, and PU can gain higher price for the shared spectrum under more competition. The NBMF model performs better than NLMF model in the same situation by having larger size of sharing spectrum for each SU and higher unit price of the sharing spectrum for PU.

*b*

_{ i }for the spectrum demand. Once the spectrum demand is improved, the frequency allocation is enhanced for SUs, and the price may become higher for PU. Therefore, the NBMF model has larger revenue than the NLMF model.

Simulation results show that our proposed NLMF model utilizing swarm particle algorithm converges fast to the Nash equilibrium, and the NBMF model satisfies both groups of PU and SUs to enlarge the PU's revenue and acquire larger shared spectrum sizes for SUs. The NBMF model pays a lower price than the NLMF model at the same shared spectrum sizes. The shared spectrum sizes may be increased by improving the channel quality, and PU can offer larger spectrum sizes with higher price. PU provides reasonable price to SUs when the number of SUs increases.

## 7. Conclusions

Pricing is an important issue not only to maximize the revenue of PU, but also to allocate the radio spectrum sharing with SUs efficiently. In this article, we discussed the challenges in designing resource allocation and pricing in cognitive radio network. We have proposed a competitive spectrum sharing and pricing scheme based on noncooperative game for a cognitive radio network consisting of a PU and *N* SUs. We presented a dynamic game in which an SU adapts its spectrum sharing strategy by observing only the strategy which is a function of spectrum price offered by the PU. By analyzing the strategic pricing behavior of PU, we created an NLMF model and a specific NBMF decision model for oligopoly market in cognitive radio network. The NBMF model has applied NBMF-PSO algorithms to iteratively obtain the solution of this game for searching for the optimal solution of bilevel programming models.

Numerical studies were carried out to evaluate the performances of the two different pricing models. In the proposed NBMF model, PU can share more spectrum sizes with higher price for unit of shared frequency spectrum to SUs, while PU achieves higher revenue and SU acquires more size of the sharing spectrum by the same price. The NBMF model is more satisfactory for both groups of PU and SUs than NLMF, and enlarges the PU's revenue and provides reasonable price to SUs.

This article applies a strategic pricing problem in a cognitive radio market to help for both PU and SUs to make the strategic decisions by the Nash-PSO and NBMF-PSO algorithms. We combine swarm particle algorithms with Nash strategy based on noncooperative game theory to obtain the Nash equilibrium in multiple-objective optimization problems. By thinking of the gaming and bilevel relationship between PU and SUs, the NBMF decision model can better reflect the features of the real-world strategic pricing problems in the cognitive radio markets and format these problems more practically. The proposed NBMF-PSO algorithm is quite effective for solving the strategic pricing problems defined by the NBMF decision model. In the literature, no other algorithm exists hierarchically for the strategic pricing problems when both the gaming and bilevel relationships are considered between PU and SUs. The NBMF-PSO algorithm makes SUs using Nash-PSO algorithm to gain their rational reactions and reach the Nash equilibrium, and PU obtains the optimal pricing and acquires the highest profit. Further research work will focus on building the optimal strategic pricing models for the multiple PUs and multiple SUs in cognitive radio network.

## Declarations

### Acknowledgements

This study was supported in part by the National Science Council (NSC) of Taiwan under grant no. NSC 100-2221-E-011-077.

## Authors’ Affiliations

## References

- Commission FC:
*Spectrum Policy Task Force Report, FCC 02–155*. Washington DC: Federal Communications Commissions; 2002.Google Scholar - Haykin S: Cognitive radio: brain-empowered wireless communications.
*IEEE J. Select Areas. Comm.*2005, 23(2):201-220.View ArticleGoogle Scholar - Mitola J: Cognitive radio for flexible multimedia communications.
*Paper presented at the IEEE international workshop on mobile multimedia communications, San Diego, CA, USA, 15–17 Nov 1999*Google Scholar - Mitola J, Maguire GQ: Cognitive radio: making software radios more personal.
*IEEE Pers. Commun.*1999, 6: 13-18.View ArticleGoogle Scholar - Lu L, Zhou X, Onunkwo U, Ye Li G: Ten years of research in spectrum sensing and sharing in cognitive radio.
*EURASIP J. Wireless Commun. Netw.*2012, 2012: 1-16.View ArticleGoogle Scholar - Dashouk M, Alanyali M: A utility-based approach for secondary spectrum sharing.
*EURASIP J. Wireless Commun. Netw.*2011, 2011: 1-11.View ArticleGoogle Scholar - Akyildiz IF, Lee WY, Vuran MC, Mohanty S: NeXt generation/dynamic spectrum access/cognitive radio wireless networks: a survey.
*Comput. Netw.*2006, 50: 2127-2159.View ArticleMATHGoogle Scholar - Zhao Q, Sadler BM: A survey of dynamic spectrum access: signal processing, networking, and regulatory policy.
*IEEE Signal Proc. Mag.*2007, 24(3):79-89.View ArticleGoogle Scholar - Brik V, Rozner E, Banerjee S, Bahl P: DSAP: a protocol for coordinated spectrum access.
*Paper presented at first IEEE international symposium on new frontiers in dynamic spectrum access networks (DySPAN), Baltimore, MD, USA, 8–11 Nov 2005*Google Scholar - Zhao J, Zheng H, Yang GH: Distributed coordination in dynamic spectrum allocation networks.
*Paper presented at first IEEE international symposium on new frontiers in dynamic spectrum access networks (DySPAN), Baltimore, MD, USA, 8–11 Nov 2005*Google Scholar - Ren PY, Wang YC, Du QH, Xu J: A survey on dynamic spectrum access protocols for distributed cognitive wireless networks.
*EURASIP J. Wireless Commun. Netw.*2012, 2012: 1-21.View ArticleGoogle Scholar - Kim DI, Le L, Hossain E: Joint rate and power allocation for cognitive radios in dynamic spectrum access environment.
*IEEE Trans. Wireless Commun.*2008, 7(12):5517-5527.View ArticleGoogle Scholar - Aslam S, Lee K: Fair, efficient, and power-optimized spectrum sharing scheme for cognitive radio networks.
*EURASIP J. Wireless Commun. Netw.*2011, 2011: 1-11.View ArticleGoogle Scholar - Atakan B, Akan OB: BIOlogically-inspired spectrum sharing in cognitive radio networks. Paper presented at IEEE wireless communications and networking conference (WCNC), Kowloon, 11–15 Mar 2007Google Scholar
- Osborne MJ, Rubinstein A:
*A Course in Game Theory MIT Press*. Cambridge, MA: USA; 1994.MATHGoogle Scholar - Nie N, Comaniciu C: Adaptive channel allocation spectrum etiquette for cognitive radio networks.
*Paper presented at first IEEE international symposium on new frontiers in dynamic spectrum access networks, Baltimore, MD, USA, 8–11 Nov 2005*Google Scholar - Attar A, Nakhai MR, Aghvami AH: Cognitive radio game for secondary spectrum access problem.
*IEEE Trans. Wireless Commun.*2009, 8(4):2121-2131.View ArticleGoogle Scholar - Han Z, Pandana C, Ray Liu KJ: Distributive opportunistic spectrum access for cognitive radio using correlated equilibrium and no-regret learning.
*Paper presented at IEEE wireless communications and networking conference (WCNC), Kowloon, 11–15 Mar 2007*Google Scholar - Rodriguez V, Moessner K, Tafazolli R: Auction driven dynamic spectrum allocation: optimal bidding, pricing and service priorities for multi-rate, multi-class CDMA.
*IEEE PIMRC*2005, 3: 1850-1854.Google Scholar - Wang F, Krunz M, Cui S: Price-based spectrum management in cognitive radio networks.
*IEEE J. Sel. Areas Commun.*2008, 2: 74-87.Google Scholar - Huang J, Berry R, Honig ML: Auction-based spectrum sharing.
*ACM/Springer Mobile Netw. Apps.*2006, 11(3):405-418.View ArticleGoogle Scholar - Chang HB, Chen KC: Auction-based spectrum management of cognitive radio networks.
*IEEE Trans. Veh. Technol.*2010, 59(4):1923-1935.MathSciNetView ArticleGoogle Scholar - Wang X, Li Z, Xu P, Xu Y, Gao X, Chen H: Spectrum sharing in cognitive radio networks-an auction based approach.
*IEEE Trans. Syst. Man Cybern. B Cybern.*2010, 40(3):587-596.View ArticleGoogle Scholar - Niyato D, Hossain E: A game-theoretic approach to competitive spectrum sharing in cognitive radio networks.
*Paper presented at IEEE wireless communications and networking conference (WCNC), Kowloon, 11–15 Mar 2007*Google Scholar - Niyato D, Hossain E: Competitive pricing for spectrum sharing in cognitive radio networks: dynamic game, inefficiency of Nash equilibrium, and collusion.
*IEEE J. Sel. Areas Commun.*2008, 26(1):192-202.View ArticleGoogle Scholar - Niyato D, Hossain E: Market-equilibrium, competitive, and cooperative pricing for spectrum sharing in cognitive radio networks: analysis and comparison.
*IEEE Trans. Wireless Commun.*2008, 7(11):4273-4283.View ArticleGoogle Scholar - Chun SH, La RJ: Secondary spectrum trading—auction-based framework for spectrum allocation and profit sharing.
*IEEE/ACM Trans. Networking*2013, 21(1):176-189.View ArticleGoogle Scholar - Gao L, Wang X, Xu Y, Zhang Q: Spectrum trading in cognitive radio networks: a contract-theoretic modeling approach.
*IEEE J. Selected Areas Commun.*2011, 29(4):843-855.View ArticleGoogle Scholar - Niyato D, Hossain E: Hierarchical spectrum sharing in cognitive radio: a microeconomic approach.
*Paper presented at the IEEE wireless communications and networking conference (WCNC), Kowloon, 11–15 Mar 2007*Google Scholar - Yu H, Dang C, Wang S: Game theoretical analysis of buy-it-now price auctions.
*Int. J. Inf. Technol. Decis. Mak.*2006, 5: 557-581.View ArticleGoogle Scholar - Suh S, Kim T: Solving nonlinear bilevel programming models of equilibrium network design problem: a comparative review.
*Ann. Oper. Res.*1992, 34: 203-218.MathSciNetView ArticleMATHGoogle Scholar - Amat J, McCarl B: A representation and economic interpretation of a two-level programming problem.
*J. Oper. Res. Soc.*1981, 32: 783-792.MathSciNetView ArticleMATHGoogle Scholar - Gao Y, Zhang G, Lu J, Gao S: A bilevel model for railway train set organizing optimization.
*Paper presented at the international conference on intelligent systems and knowledge engineering (ISKE2007), Chengdu, China, 15–16 Oct 2007*Google Scholar - Zhang G, Zhang G, Gao Y, Lu J: Competitive strategic bidding optimization in electricity markets using bi-level programming and swarm technique.
*IEEE Trans. Ind. Electron.*2011, 58: 2138-2146.View ArticleGoogle Scholar - Colson B, Marcotte P, Savard G: Bilevel programming: a survey.
*Q. J. Oper. Res. (4OR)*2005, 3: 87-107.MathSciNetView ArticleMATHGoogle Scholar - Bard JF:
*Practical Bilevel Optimization: Algorithms and Applications*. Boston: Kluwer; 1998.View ArticleMATHGoogle Scholar - Bialas WF, Karwan MH: Two level linear programming.
*Manag. Sci.*1984, 30(8):1004-1020.MathSciNetView ArticleMATHGoogle Scholar - Hansen P, Jaumard B, Savard G: New branch-and-bound rules for linear bi-level programming.
*SIAM J. Sci. Statist. Comput.*1992, 13(5):1194-1217.MathSciNetView ArticleMATHGoogle Scholar - Calvete HI, Gale C, Mateo PM: A genetic algorithm for solving linear fractional bilevel problems.
*Ann. Oper. Res.*2009, 166: 39-56.MathSciNetView ArticleMATHGoogle Scholar - Gendreau M, Marcotte P, Savard G: A hybrid Tabu-ascent algorithm for the linear bilevel programming problem.
*J. Global Opt.*1996, 8(3):217-233.MathSciNetView ArticleMATHGoogle Scholar - Gao Y, Zhang G, Lu J: A particle swarm optimization based algorithm for fuzzy bilevel decision making.
*Paper presented at the international conference on fuzzy systems (FUZZ), Hong Kong, 1–6 Jun 2008*Google Scholar - von Stackelberg H:
*The Theory of the Market Economy*. London, UK: William Hodge; 1952.Google Scholar - Lu J, Shi C, Zhang G: On bilevel multi-follower decision making: General framework and solutions.
*Inform. Sci.*2006, 176(11):1607-1627.MathSciNetView ArticleMATHGoogle Scholar - Liu B: Stackelberg-Nash equilibrium for multilevel programming with multiple followers using genetic algorithms.
*Comput. Math. Appl.*1998, 36(7):79-89.MathSciNetView ArticleMATHGoogle Scholar - Oduguwa V, Roy R: Bi-level optimisation using genetic algorithm.
*Paper presented at the IEEE ICAIS, Divnomorskoe, Russia, 5–10 Sept 2002*Google Scholar - Goldsmith AJ, Chua S-G: Variable rate variable power MQAM for fading channels.
*IEEE Trans. Commun.*1997, 45(10):1218-1230.View ArticleGoogle Scholar - Kennedy J, Eberhart R: Particle swarm optimization.
*Proc. IEEE Int. Conf. Neural. Netw.*1995, 4: 1942-1947.View ArticleGoogle Scholar - Eberhart R, Kennedy J: A new optimizer using particle swarm theory.
*Paper presented at the 6th international symposium on micro machine human science, Nagoya, 4–6 Oct 1995*Google Scholar - Parsopoulos K, Vrahatis M: On the computation of all global minimizers through particle swarm optimization.
*IEEE Trans. Evol. Comput.*2004, 8(3):211-224.MathSciNetView ArticleGoogle Scholar - Nash J: Equilibrium points in n-person games.
*Proc. Natl. Acad. Sci. USA.*1950, 36: 48-49.MathSciNetView ArticleMATHGoogle Scholar - Nash J: Non-cooperative games.
*Ann. Math.*1951, 54(2):286-295.MathSciNetView ArticleMATHGoogle Scholar

## Copyright

This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.