Adaptive QoS provision for IEEE 802.16e BWA networks based on cross-layer design

This article proposes an integrated framework for adaptive QoS provision in IEEE 802.16e broadband wireless access networks based on cross-layer design. On one hand, an efficient admission control (AC) algorithm is proposed along with a semi-reservation scheme to guarantee the connection-level QoS. First, to guarantee the service continuity for handoff connections and resource efficiency, our semi-reservation scheme considers both users ’ handoff probability and average resource consumption together, which effectively avoids resource over-reservation and insufficient reservation. For AC, a new/handoff connection is accepted only when the target cell has enough resource to afford both instantaneous and average resource consumption to meet the average source rate request. On the other hand, a joint resource allocation and packet scheduling scheme is designed to provide packet-level QoS guarantee in term of “ QoS rate “ , which can ensure fairness for the services with identical priority level in case of bandwidth shortage. Particularly, an enhanced bandwidth request scheme is designed to reduce unnecessary BR delay and redundant signaling overhead caused by the existing one in IEEE 802.16e, which further improves the packet-level QoS performance and resource efficiency for uplink transmission. Simulation results show that the proposed approach not only balances the tradeoff among connection blocking rate, connection dropping rate, and connection failure rate, but also achieves low mean packet dropping rate (PDR), small deviation of PDR, and low QoS outage rate. Moreover, high resource efficiency is ensured.


Introduction
With explosive growth in the data service of Internet and multimedia applications, high-speed and high-quality wireless access is required for providing QoS guarantee for heterogeneous services in future mobile communication systems.As a promising solution for last-mile broadband wireless access (BWA) in metropolitan area, IEEE 802.16d/e [1,2] adopted adaptive modulation and coding (AMC) to maximize the system capacity under the bit error rate (BER) constraint over the error-prone wireless channel [3].Meanwhile, in the MAC layer, both connection-level and packet-level QoS requirements of heterogeneous services need to be well guaranteed regardless of the channel conditions, and fairness is another important issue to avoid the services with bad channel conditions or low priorities experiencing bandwidth starvation.Particularly, to the uplink transmission in IEEE 802.16e, the fixed/mobile subscriber station (SS) needs to send a bandwidth request (BR) message to base station (BS) for its uplink connection first before data transmission, which introduces additional access delay and signaling overhead for uplink transmission.These characteristics pose great challenge to balance the tradeoff between QoS provision and spectrum efficiency for uplink transmission.
Concerning the service connectivity of the network, the connection-level QoS requirements were achieved through admission control (AC) and resource reservation (RR) [4], whose performance can be evaluated by following metrics: handoff connection dropping rate (CDR), new connection blocking rate (CBR), ongoing connection failure rate (CFR).There are many tradeoffs among these metrics for designing AC and RR schemes.For AC, too stringent restrictions for accepting new/ handoff connections will reduce the CFR at the cost of deteriorating CBR, CDR, and resource utilization.Even though looser restrictions indicate lower CBR and CDR, too much accepted services may cause system overload, and CFR will greatly increase when the channel condition becomes seriously deteriorated.Since blocking a new connection is more acceptable than dropping a handoff connection from the user viewpoint, performing RR for handoff connections can effectively reduce the CDR.However, over-reservation will deteriorate the CBR and the resource utilization while insufficient reservation cannot achieve prospective CDR target.Therefore, a good AC and RR scheme should well balance these tradeoffs to guarantee the system stability.As for the AC schemes proposed for IEEE 802.16 BWA networks, the authors of [5,6] did not consider the handoff situation, which is a crucial characteristic of IEEE 802.16e.The authors of [7][8][9][10] took the handoff requirements in to account regardless of the channel condition.For the general AC schemes proposed in [4,[11][12][13][14][15][16][17][18], the time-variant channel conditions were not considered either.In [19], the authors modified the handoff-prioritized AC scheme considering AMC over the unreliable wireless channel, but few QoS-adaptive characteristics were discussed.
The packet-level QoS provision determined the quality of end users experience for multimedia applications [4].The performance of packet-level QoS provision is evaluated through the metrics including delay, delay jitter, BER, packet loss rate, etc., which is mainly determined by the bandwidth allocation (BA) and scheduling algorithm.In literature, the maximum channel to interference ratio (max C/I) algorithm in [20] was throughput-oriented without QoS consideration, while strict priority queue [21] was QoS-oriented regardless of channel conditions.To better exploit asynchronous variations of channel quality, the authors of [22] gave higher priority to the real-time packets only after their waiting period exceeds the emergency threshold.However, it does not fit well with the bust nature of heterogeneous traffics.Because when large real-time traffics enter the emergency status simultaneously with the bad channel conditions, packet dropping rate (PDR) tends to increase rapidly.Hou et al. [23] introduced the delay constraint into the proportional fairness formulation for QoS provision, but delay is not a proper metric to provide QoS satisfaction and service differentiation for non-real-time traffics.As a variation of modified largest weighted delay first (M-LWDF) [24], the algorithm in [25] considered the channel quality, QoS satisfaction, and service priority for BA.However, the QoS coefficients of various services are not analytically determined.Particularly, the packet-level QoS provision for uplink transmission is also influenced by BR mechanism.Unicast and multicast/broadcast pollings are the primary ways to request bandwidth, while piggyback is an optional way which will not be discussed.The problem of unicast polling is that it introduces constant delay for the delay-sensitive real-time connections.Multicast/broadcast polling provides a contention way to request bandwidth, which causes too much signaling overhead and BR delay for non-real-time services.Lee and Cho [26] reduced the BR delay and signaling overhead for VoIP connections, which did not consider other types of real-time traffics such as MPEG-based multimedia streaming.As for multicast/broadcast polling, the collision probability is a function of the number of BR messages and the contention period size.Oh and Kim [27] and Yan and Kuo [28] proposed two different models to find out the optimal contention period size.The performance of random access for BR was analyzed in [29][30][31].Oh and Kim [32] optimized the collision resolution algorithm for BR.However, they cannot eliminate the collisions caused by multicast/broadcast polling because of its contention-based access characteristic.However, the BR delay and the signaling overhead can be further reduced.
Motivated by these observations, we propose an integrated framework for adaptive QoS provision over IEEE 802.16eBWA networks based on cross-layer design, which is considered to be an efficient way to achieve efficient QoS guarantee and network resource management for wireless network [33,34].Our major contributions are a) Before accepting a new/handoff connection, the proposed AC algorithm considers whether there is enough bandwidth available to afford its average resource consumption and instantaneous resource consumption for QoS provision through cross-layer design method, which effectively avoids the system overload.So, the proposed AC scheme joint considers the types of service flows (SFs) QoS and MCS, thus embodies the idea of cross-layer design.b) Our semi-reservation scheme considers both users' handoff probability and average resource consumption together to perform RR, which effectively avoids resource over-reservation and insufficient reservation and ensures well the continuity of handoff connections as well as promises high spectrum efficiency.c) A joint resource allocation and packet scheduling scheme is designed to guarantee the packet-level QoS in term of "QoS rate", thus effectively avoids large realtime data being blocked in deteriorated channel condition.Particularly, when there is not enough bandwidth available to guarantee all "QoS rate" constraints, fairness is provided for the services with identical priority level."QoS rate" service model adopts cross-layer design method, since it considers both the bandwidth requirements in the MAC layer and the channel conditions in the physical layer.
d) An enhanced BR scheme is designed to reduce the unnecessary BR delay and the redundant signaling overhead caused by the existing one in IEEE 802.16e, which further improves the packet-level QoS performance and resource efficiency for uplink transmission.e) Performing adaptive QoS management to increase or decrease the average source rate based on load status and channel conditions, which enables more users to enter the network, as well as maintains the network stability and high spectral efficiency.
The rest of this article is organized as follows.Section 2 introduces the system model.Section 3 presents the proposed framework for adaptive QoS provision in detail.Section 4 evaluates the system performance through mathematical analysis.Section 5 analyzes the simulation results.Finally, conclusions are made.

QoS-adaptive service model
The MAC layer of IEEE 802.16e is connection-oriented, and a flexible QoS provision framework is designed.Each connection is associated with a unique SF characterizing by a set of QoS parameters such as delay/delay jitter, packet loss rate, minimum reserved rate, maximum sustained rate, etc., and a connection can be created, changed, and deleted through dynamic service addition, dynamic service change, and dynamic service deletion handshake transactions, respectively.Five types of SFs are defined in IEEE 802.16e for QoS differentiation: Unsolicited grant service (UGS), real-time polling service (rtPS), extended rtPS (ErtPS), non-real-time polling service (nrtPS), and best effort (BE).Their priorities from highest to lowest are: UGS, rtPS/ErtPS, nrtPS, and BE.Table 1 lists the characteristics of all SFs.
Considering the influences, i.e., user quantity, channel status (physical layer), service distribution, various QoS restrictions (QoS parameters in application layer), and resource allocation algorithm, that play on the system throughput, a reasonable cross-layer-based mathematical model (QoS-Adaptive Service Model) is proposed first to characterize the average system capacity and instantaneous capacity, which is the basis for RR and AC.
Let C m,x,y denote the yth connection belonging to the SF x in subscribe station (SS) m.For UGS, rtPS/ErtPS, nrtPS, and BE, the value of x equals 1, 2, 3, and 4, respectively.In this article, the traffic sources are considered to be rate adaptive, because different coding schemes are provided for multimedia services in application layer.We set G m,x,y service grades for the connection C m,x,y .Let R min m,x,y and R max m,x,y be the minimum rate and the maximum rate of the connection C m,x,y , respectively.For a connection at service grade g, its average required rate for QoS provision can be defined as It is obvious that the smaller g indicates lower average required rate for QoS provision, and vice versa.
For the connection C m,x,y , D m,x,y , W m,x,y , ψ m,x,y , and ω m,x,y , respectively, denote the tolerable delay, the waiting period of its packets before being transmitted, the packet error rate (PER) during transmission and the tolerable packet loss rate.A packet may be dropped when transmission error happens or its waiting period exceeds the tolerable delay.Thus, Equation 2 must be met to avoid the ongoing connection failure.
In the following section, we will find that the PER can be guaranteed through selecting proper modulation and coding scheme (MCS) based on the SINR knowledge.Thus, the resource allocation and scheduling algorithm should guarantee the maximum delay for a given outage probability.Particularly, reducing BR delay for uplink connections can help for reducing the PDR caused by delay variation.However, because of the burst nature of heterogeneous traffics, R avg m,x,y,g cannot accurately reflect the instantaneous rate requirements to provide QoS guarantee for the connection C m,x,y .Accordingly, based on cross-layer method, the term "QoS rate" is defined in Equation 3 for packet-level QoS provision (upper-layer), where R q m,x,y , R m,x,y , and R em m,x,y are the "QoS rate", the required rate to transmit the buffer data, and the rate to transmit the emergency data for the connection C m,x,y , respectively.The emergency data are the data whose waiting periods exceed the tunable delay threshold ξ m,x,y (0 <ξ m,x,y <D m,x,y ).Since both deteriorated channel condition and increased source rate may cause higher Pr {W m,x,y >D m,x,y }, smaller ξ m,x,y should be considered, and vice versa.And the "non-QoS rate" of the connection C m,x,y can be defined as R nq m,x,y = R m,x,y − R q m,x,y .Accordingly, we have R em m,x,y = 0 for the delay insensitive nrtPS/ BE connections, R q m,x,y = 0 for the BE connections without minimum rate requirement, and R nq m,x,y = 0 for UGS connections with fixed rate requirements.

Link adaptation model
This article considers the PHY layer of IEEE 802.16eBWA networks combining WirelessMAN-OFDM with AMC together for optimizing the system performance over the error-prone wireless channel.As a TDMAbased PHY technology, each frame of WirelessMAN-OFDM contains many transmission bursts from/to different SSs.The data rate and coding overhead for each burst are different, because different MCSs are chosen for the SSs for adapting to various detected signal-tonoise ratios (SNRs), and to meet the target BER accordingly.Since M-QAM modulation provides high spectrum efficiency while convolutional codes (CC) with bit interleaved coded modulation have strong forward error protection capability, they are chosen to form MCS compositions.The entire SINR range is divided into K + 1 non-overlapping consecutive partitions by the SINR boundary Particularly, because of unacceptable transmission error, no data are transmitted if the SINR is less than Г 1 .The MCS employed in this article is listed in Table 2.If SS m adopts MCS k, its average PER can be deduced as where L is the average packet length, h k is the number of error bits can be corrected by MCS k, and ε m is the BER constraint of SS m.
Adopting MCS k, the data rate from MAC layer viewpoint can be calculated as where B, PR k , Ω k , and CR k , respectively, denote the channel bandwidth, PHY transmission rate, the modulation level, and the CC code rate when MCS k is adopted.It is noted that the modulation levels of QPSK, QAM16, and QAM64 are 2, 4 and 6, respectively.
To analyze the system capacity over the time-variant wireless channel, we assume that both the path loss and shadowing are compensated by dynamically adjusting the transmission power.Thus, only the small-scale fading need to be considered.For SS m, the probability density function of its SNR g m under the Rayleigh fading environment is where γm is the average SNR of SS m.Accordingly, the probability of an SS adopting MCS k for transmission can be deduced as The average resource consumption (transmission time) for transmitting one bit can be deduced as Based on the QoS-adaptive characteristics in the MAC layer and the average resource consumption (transmission time) per bit in the PHY layer (Equation 8), we proceed to investigate cross-layer design for bandwidth resource management in the following section.

Cross-layer design for QoS-adaptive resource management
In the point-to-multipoint (PMP) mode of IEEE 802.16eBWA networks, BS is designed as a coordinator to perform QoS-adaptive resource management for its subordinate fixed/mobile SSs.The proposed adaptive QoS provision framework and the interaction between BS and SS are shown in Figure 1.At the connection level, the admission controller in BS restricts the number of new/handoff connections entering the target cell to avoid system overload, which ensures low CFR of ongoing connection.In addition, the RR executes the semi-reservation algorithm, which not only guarantees the service continuity for handoff connections, but also achieves high resource efficiency, that is because it effectively avoids resource over-reservation and insufficient reservation.Particularly, the SF managers in SS and BS communicate with each other to maintain the connections' survival as well as perform adaptive QoS adjustment.At the packet level, the resource dispenser in BS takes SS as the basic unit to perform resource allocation through cross-layer design idea, which considers both the "QoS rate" constraints in the MAC layer and the channel conditions in the PHY layer.The resource allocation result for downlink transmission is reflected in DL-MAP message, while the one for uplink transmission is figured out in UL-MAP message.Using the granted bandwidth for each SS, schedulers in BS and SS schedule the downlink and uplink data for transmission, respectively.Specifically, when an SS has data to send in the uplink, it needs to send a BR message to BS first.A BR generator is designed to execute the proposed BR scheme, which can help to reduce the BR delay and signaling overhead.Since the difference between uplink and downlink transmission mainly lies in whether a connection needs to request bandwidth before data transmission, for simplicity, we only discuss the uplink case for QoS provision in this article.Following sections will describe our proposed approach (PA) in detail.

The estimation of RR
We extend the probabilistic resource estimation and semi-reservation scheme [35] for reasonable RR considering the time variant channel conditions.For a mobile SS m managed by cell u, H m,u,v denotes the handoff probability from cell u to cell v, which can be calculated based on the current position, as well as the predicted moving speed and direction of mobile SS m.Let NC u denote the collection of the neighboring cells of cell u.
We have To reduce unnecessary RR, reservation threshold Δ is defined.Reservations are made only for the mobile SSs with handoff probabilities larger than Δ.Let Y m,x be the number of the connections belonging to SF x,y at service grade g.Suppose there are M SSs distributed in the whole network.In cell v, if H m,u,υ > Δ, the average reserved bandwidth for the connections belonging to SF x can be deduced as It is noted that in the above equation, RS v,x is the bandwidth co-reserved for the connections belonging to SF x other than for a specific connection or mobile SS.Accordingly, the total reserved bandwidth in cell v can be deduced as

Admission control
In this section, we discuss AC considering both instantaneous resource consumption and average resource consumption.Let AS v,x and PS v,x denote the instantaneous and average resource consumption of the connections belonging to SF x in cell v, respectively.k m is serial number of the selected MCS based on the instantaneous SNR of SS m.We have Thus, the total average and instantaneous resource consumption in cell v can be calculated as In case of bandwidth shortage, more new/handoff realtime connections can be accepted by decreasing the source rate of the ongoing connections which are not prioritize over them.Accordingly, the average resource PS v,x and instantaneous resource PS v,x must be reserved for ongoing connections after decreasing source rate for new/handoff connections belonging to SF x.Actually, since we satisfy bandwidth requirements (QoS) in upperlayer through source rate compression, i.e., decreasing transmission rate in data link layer via MCS, this proposed scheme embodies the idea of cross-layer design.In order to guarantee the minimum QoS requirements of ongoing connections, the average resource AS v,x and instantaneous resource PS v,x can be deduced as Since blocking a new connection is more acceptable than dropping an ongoing connection from the user viewpoint, the bandwidth reserved for handoff connection cannot be used for accepting new connection.Let TS v be the total available bandwidth in cell v.For a new connection meeting both inequalities in Equation 13, it will be accepted at its desired average source rate without source rate compression for other connections.In case of bandwidth shortage, a new connection is accepted at its minimum rate if the constraints in Equation 14 are met, which may causes the source rates of other connections being decreased.If neither Equations 13 nor 14 is met, the new connection will be rejected.
In our scheme, the handoff connection with higher priority may preempt the bandwidth reserved for the lower priority ones.Thus, the reserved bandwidth, which cannot be used by the handoff connections connection is accepted at its desired source rate if both inequalities in Equation 15 are met, which neither preempt the reserved bandwidth of the handoff connections belonging to other SFs, nor compress the sources rate of the ongoing connections.When there is not enough resource available, a handoff connection is accepted at its minimum rate.In this case, either reserved bandwidth preemption or the source rate degradation may happen.If neither Equations 15 nor 16 are met, the handoff connection will be dropped.

Adaptive QoS management
Accepting more new/handoff connection in case of bandwidth shortage is not the only reason to perform source rate compression.Since AC can keep AS v ≤ TS v for cell v, if the available bandwidth cannot afford all ongoing connections' average source rate and "QoS rate" requirements because of the deteriorated channel conditions, source rate compression will be performed to keep the system stable.In this case, either inequality in Equation 17 is met.
For source rate compression, the connections with lower priority are chose first.Among the connections with identical priority level, the connection whose master SS has the worst channel condition is chosen first.The selected connection can adapt to any coding scheme producing lower average source rate, and least number of degraded connections should be selected to reduce the signaling overhead.
If all "QoS rate" constraints of ongoing connections are guaranteed and there is still bandwidth left unused exempting the reserved bandwidth, we will increase ongoing connections' average source rate to improve the resource utilization and the service quality.Among the connections whose average source rates have been compressed, the one whose master SS has best channel condition will be chosen first.Then, for other connections, the one with highest priority level among the ones with best channel condition is chosen.The selected connection can adapt to its highest average source rate for reducing the signaling overhead as well as improving the system throughput.

Enhanced BR scheme
The term "QoS rate" is defined in Equation 3 to reflect the time-variant QoS requirement of the service because of its bursty characteristics.Based on this definition, a joint resource allocation and scheduling algorithm is designed to provide QoS guarantee based on "QoS rate" as well as fairness for the services with identical priority level in case of bandwidth shortage.Specifically, an enhanced BR mechanism is proposed, which reduces the number of bandwidth request messages by aggregating the nrtPS/BE connections in the same SS as one basic BR unit, as well as replaces the reactive unicast polling and multicast/broadcast polling with proactive unicast polling to reduce the BR delay and signaling overhead.
We enhance the BR scheme for IEEE 802.16eBWA networks in the following aspects: a) SS requests bandwidth only using unicast polling opportunity, which avoids the BR collisions caused by multicast/broadcast polling.b) Each uplink rtPS/ErtPS connection is taken as an individual BR unit (BRU) because of the stringent delay requirement, while all uplink nrtPS or BE connections in the same SS are aggregated as a BRU to reduce the signaling overhead for unicast polling.
c) The uplink protocol data units (PDUs) have two statuses: transmission-preparing (tp) and transmissionready (tr).The incoming uplink data are packed into the PDUs in tp status first.Once SS requests bandwidth for a BRU, the BR message takes the aggregated bandwidth requirement for all its PDUs to BS, and the PDUs of the BRU in tp status are transited to tr status accordingly.
d) The reserved bit in generic MAC header is defined as unicast polling index (UPI).When SS needs to be polled, UPI is set to 1; otherwise, it is set to 0.
Based on the BRU definition in (b), in BS, C m,x,y can also be used to denote the corresponding rtPS/ErtPS BRU, while C m,x,-1 is used to represent the nrtPS/BE BRU in SS m.For an uplink BRU C m,x,y , R tp m,x,y and R tr m,x,y represent the bandwidth requirement of its PDUs in tp and tr statuses, respectively.It is noted that only the PDUs in tr status can be transmitted out when there is uplink bandwidth available.To obtain the "QoS rate" of each uplink BRU in BS, we have R max m,x,y and R m,x,y = R tp m,x,y .Based on the definition in Equation 3, in BS, the uplink "QoS rate" of UGS/rtPS/ErtPS in SS m can be defined as R em m,x,y , and the "non-QoS rate" of SS m can be defined as Let h m,x,y be the tunable variable for the BRU C m,x,y to set UPI.If SS requests bandwidth for a BRU once new data come in, the bandwidth requirement can be reflected to BS in the shortest time at the cost of highest signaling overhead.We define the following rules to balance the tradeoff between the two issues: (1) when there is uplink bandwidth available, SS first requests bandwidth for the BRUs with expired unicast polling timer, then for the BRUs which UPIs have been set for; (2) if there are data PDU to be sent out, the SS sets UPI for BRU based on Equations 19 and 20 for rtPS/ErtPS and nrtPS/BE, respectively.Figure 2 depicts the operations of the enhanced BR scheme in SS. max Once BS receives an uplink PDU with UPI equaling one, in next frame, it will grant a unicast polling opportunity to the SS which sends the PDU.In addition, the SS whose BRU has expired unicast polling timer will also be granted a unicast polling opportunity in next frame.The operations of the enhanced BR scheme in BS are shown in Figure 3.

Joint BA and scheduling
BS follows strict priority to process the "QoS rate" requirements for its subordinated SSs, and the detailed resource allocation algorithm is designed based on Equation 21.
The channel condition is seriously deteriorated or the real-time traffic is boosted when X max < 2, which cause the available bandwidth cannot satisfy all "QoS rate" requirements of the real-time SF X max .In this case, BS will prior guarantee the emergency rate requirements other than "QoS rate" requirements.Even worse, if the available bandwidth cannot afford their emergency rate requirements, packet loss may happen.All SSs should share the packet loss to avoid the SSs with deteriorated channel condition suffering from more serious QoS degradation.So, BS chooses to serve the SS with the lowest satisfaction for emergency rate in recent S fames first.
Figure 2 The operations of the enhanced BR scheme in SS.
X max = 2 indicates that the available bandwidth cannot satisfy all "QoS rate" requirements of nrtPS.Let Th v,3 be the MAC throughput of the available bandwidth TS v,3 for nrtPS, thus G q m,v,3 satisfies (22).
To provide weighted fairness for the "QoS rate" requirements of nrtPS connections from MAC viewpoint, Equation 23 can be used to deduce the bandwidth granted to each SS for nrtPS connections. where Using the granted bandwidth of each SS, we perform packet scheduling for its connections considering "QoS rate" requirements first, and the scheduling rules are defined as: (1) for the connections belonging to different SF types, their packets are scheduled following order of strict priority; (2) for the connections belonging to the same SF type, the connection whose head-of-line packet has the longest waiting time will be served first.After QoS provision, SS applies round robin [36] to schedule packets based on the "non-QoS rate" requirements of its connections.

Mathematical analysis
In this section, we perform mathematical analysis for the PA from following viewpoints: connection-level QoS performance, queuing performance, and BR efficiency enhancement.

Connection-level QoS performance analysis
To simplify theoretical analysis for connection-level QoS provision, we assume (1) all connections in cell v belong to SF x, and they have the same minimum rate requirement R x min ; (2) the average SNR γm is identical for all SSs.So, ℙ m (k) in Equation 7and C avg m in Equation 8 can be simplified as ℙ(k) and ℂ avg , respectively.Set z m,v .The probability of s connections adopting MCS k for data transmission can be deduced as Accordingly, the characteristic function of the above equation is The average number of the connections adopting MCS k for data transmission is Suppose all the connections in cell v are at their lowest average source rate, the average resource consumption in cell v can be calculated as Figure 3 The operations of the enhanced BR scheme in BS.
To investigate the tradeoff among CBR, CDR, and CFR under the time-variant channel conditions, we study two extreme cases.One case is that all connections are in the best channel conditions, and we have , the value of M v is maximized, which implies that lower CBR and CDR are ensured.However, once the channel condition gets worse, PS v > TS v will be met.Since no rate compression can be performed, many connection may fail, which will result in system unstability.The average CFR in this case can be calculated as The other case is that only lowest transmission rate is available because of the deteriorated channel condition.When all bandwidth resources are used up, we have In this case, it is obvious that the lowest CFR is available at the cost of highest CDR and CBR, because from average viewpoint, there still be a lot of new/handoff connections can be accepted by the system, the number of which can be deduced as

Queuing performance analysis
In this section, we analyze the queuing performance for the network under the saturated status, in which all the available bandwidths are used up to guarantee all ongoing connections' average rate requirement for QoS guarantee.Since UGS connections always get fixed bandwidth for data transmission without BR, we only discuss other types of connections here.The uplink data access delay consists of BR delay and scheduling delay.By setting the BR delay equals 0, the analysis result for uplink transmission can be extend to the downlink transmission as well.
For an non-UGS uplink connection C m,x,y , suppose its data arrival follows Poisson process with rate l m,x,y packets per second, and the average length of the packet is L m,x,y .We have λ m,x,−1 = Y m, x y=1 λ m,x,y for nrtPS/BE BRU.Due to the effect of Equations 19 and 20, the average rate of the uplink data transmitting from tr status to tp status is q m,x,y bits per second.Therefore, the uplink transmission process of a BRU can be formulated as a twice queuing problem shown in Figure 4, which can be depicted by the two-dimensional Markov model shown in Figure 4.The steady-state equation in Figure 5 is obtained as Equation 30, in which μ m,x,y = q m,x,y /L m,x,y and I m,x,y = ℜ m,x,y /L m,x,y .

BR performance analysis
We first analyze the BR delay saved by our enhanced BR scheme.Let τ m,x,y be the unicast polling interval of the BRU of the uplink connection C m,x,y .Since queuing delay in tp buffer is identical with the BR delay of our enhanced BR scheme, the average BR delay of an rtPS/ ErtPS connection saved by our proposed BR scheme is Using multicast/broadcast polling, each nrtPS/BE connection is taken as a unit to request bandwidth.Suppose SS requests bandwidth for an nrtPS/BE connection when Equation 19 is met, the BR time of an nrtPS/BE , , m x y connection also follows exponential distribution with the mean value of 1/ς m,x,y , which is the collected effect of Equation 19 and BR retransmission in case of collision happening.F is the frame duration.In cell v, the average number of contention BR messages transmitted in one frame can be deduced as There are N BR opportunities for multicast/broadcast polling in one frame.The average collision probability is Before a contention BR message can successfully be received by BS, it may meet c times of collisions.The average collision time of a contention BR message is The time before a contention BR message can successfully be transmitted is deduced as where π m,x,y is the retransmission interval of an nrtPS/ BE connection C m,x,y .Compared with multicast/ broadcast polling, the BR delay saved by our scheme for nrtPS/BE connection is Considering the signaling overhead for BR, we first deduce the resource utilization of the contention period for multicast/broadcast polling as Figure 6 shows the numerical results of Equations 37 and 41 when T v is 80.We find that small contention size cause low BR resource utilization because of the high collision probability.Even though larger contention size ensures lower collision probability, more contention BR opportunities are left unused, which in turn reduce the BR resource utilization as well.Since the highest BR resource utilization of the contention period equals 37%, we can conclude that high signaling overhead is caused by multicast/broadcast polling.
Using multicast/broadcast polling, the average ratio of successful BR transmission has a tradeoff with BR delay [27,28].Thus, Equation 42 is defined to find out the optimal contention period size for multicast/broadcast polling.In Equation 42, larger θ v indicates higher successful BR transmission rate and smaller average delay induced by collision.Therefore, the optimal contention period size is obtained when θ v is maximized.
Assuming π m,x,y = π is met for all uplink nrtPS/BE connections, Equation 42 can be simplified as The maximum value of θ v is obtained when Δ v = 1/π.Using Equation 37, the optimal value of contention period size is deduced as Let M denote the total number of nrtPS/BE BRUs which may request bandwidth in one frame duration using our proposed BR scheme.We have Therefore, compared with the optimal case of multicast/broadcast polling, the BR signaling overhead saved by our proposed BR scheme is N o -M.

Following assumptions apply in the simulation:
(a) There are 50 cells in our simulation environment.The BS in each cell communicates with the BSs in its neighbor cells to exchange handoff-related information, and the bandwidth reserved for the handoff connections is refreshed every 3 s.The symbol rate in each cell is 20 MBd, and the frame duration is 1 ms.(d) The intervals of unicast polling timers for rtPS/ ErtPS BRUs equals 6 ms, while those for nrtPS and BE BRUs are 10 and 12 ms, respectively, and, the BR retransmission intervals of nrtPS and BE connections for multicast/broadcast polling are 6 and 8 ms, respectively.
Figure 7 shows comparison result of BR signaling overhead.As expected, our proposed enhanced BR scheme greatly reduces the signaling overhead compared with multicast/broadcast polling.
Using the proposed resource allocation and scheduling algorithm for packet-level QoS provision, our simulation compares the PA for AC and RR with the efficient AC scheme (EAC) [7] to evaluate the enhancement in CBR, CDR, CFR, and resource utilization with the growth of the connection arrival rate (CAR).Since the study of [7] assumes the system capacity is fixed, we consider three scenarios for performance comparison, which is, respectively, denoted as EAC1, EAC2, and EAC3.EAC1, EAC2, and EAC3 assume that the resource consumption for transmitting one bit in MAC layer equals 1/MR 1 , 1/  MR K , and ℂ avg , respectively.From the simulation results shown in Figure 8, we have the following conclusions: • EAC1 undervalues the average system capacity, and the number of the connections accepted by the system is reduced accordingly.Therefore, EAC1 provides the best system stability with lowest CFR at the cost of highest CBR and CDR, as well as lowest resource utilization.However, our PA can achieve almost the same CFR as EAC1 while maintaining much better CBR, CDR, and resource utilization.
• More connections can be accepted in EAC2, which overvalues the system capacity.Therefore, CBR, CDR, and resource utilization are improved.However, the system becomes more unstable because of the high CFR.Even though the CBR and CDR of our approach are higher compared with EAC2, their resource utilizations almost keep the same.We can find that accepting more new/handoff connections at the cost of losing ongoing connections do not help to improve the resource efficiency.
• Our approach outperforms EAC2 in the four performance metrics.EAC2 only considers the average resource consumption for AC, while our approach considers one more restriction: the practical symbol consumption.If only for this reason, more new/handoff connections should be accepted in EAC2, and lower CBR and CDR are attained accordingly.However, compared with statistic RR in [7], our approach alleviates both resource over-reservation and insufficient reservation, which causes more new/handoff connections can be accepted.
From the theoretical analysis and the simulation results, it is noted that our proposed algorithm well balances the tradeoff among CBR, CDR, and CFR, while achieving high spectrum efficiency.
To evaluate the performance of packet-level QoS provision and system throughput using our enhanced BR scheme as well as joint resource allocation and scheduling algorithm, the scheduling algorithms in [20][21][22] are coupled with the traditional BR scheme of IEEE 802.16e to perform uplink transmission, which are denoted as approach1, approach2, and approach3, respectively.The performance metrics are evaluated with the growth of average data arrival rate (DAR).Particularly, the highest average DAR will cause the network enter into the saturated status, while the lowest average DAR can use up all resources under the lowest transmission rate (i.e., using MCS 1 for data transmission).From the simulation results illustrated in Figure 9, we have the following conclusions: • The performance of packet-level QoS provision for rtPS/ErtPS is evaluated in terms of average PDR and maximum PDR variance.In Figure 9a,b, we find that our approach outperforms the others in the two metrics for real-time connections.The reason for ensuring lowest PDR is that our BR scheme reduces the BR delay, while our BA algorithm effectively avoids large real-time data being blocked under the deteriorated channel condition, which in turn reduces their scheduling delay.In addition, since fairness is ensured in case of bandwidth shortage to share the packet loss for the users with deteriorated channel conditions, smallest PDR variance is achieved.Since the connections with lower priority cannot use the bandwidth before all bandwidth requirements of high priority connections are satisfied in approach2, it works better than approach3, and approach1 produces highest PDR and largest packet dropping variance without QoS consideration.
• QoS outage rate is adopted to evaluate the QoS performance of nrtPS connections, which is the ratio of the "QoS rate" dissatisfaction times and the total BA times.Figure 9c shows that our approach provides the best QoS performance for nrtPS connections, because approach1 has no QoS consideration, approach2 causes the real-time connections preempting too much bandwidth for non-real-time connections, and approach3 does not consider QoS provision for nrtPS connections.
• Let G m be the total uplink bandwidth granted to SS m in each frame.Based on resource allocation results in BS, from MAC viewpoint, the system throughput of cell v can be deduced as Equation 46.It is obvious that the system throughput is a variable determined by the bandwidth requirement information of heterogeneous traffics, the QoS constraints, the channel conditions, and the resource allocation algorithm.Figure 9d illustrates the system throughput comparisons.When the bandwidth has not been used up under the low DAR, our approach achieves the highest throughput because the uplink bandwidth requirements can be reflected to BS more quickly.With the increase of the DAR, approach1 achieves the highest throughput because QoS provision compromise the system throughput gain of our approach, while approach2 provides the lowest throughput without considering the channel conditions.And, our approach works better than approach3 because we reduce the signaling overhead for BR.
From the simulation results in Figure 9, it is noted that our PA well balances the tradeoff between packetlevel QoS provision and spectrum efficiency.

Conclusions
Based on AMC in IEEE 802.16ePHY layer and flexible connection-oriented QoS provision in its MAC layer, this article investigates analytical integrated framework and adaptive QoS provision mechanism based on cross-layer design.First, we propose an integrated framework for adaptive QoS provision cross-layer-based design.Our major QoS provision mechanism concerns are about connection-level QoS provision through dynamic RR and AC, as well as packet-level QoS provision through joint resource allocation and packet scheduling.Second, to alleviate the resource over-reservation and insufficient reservation for handoff connections, we estimate the average reserved resource over the unreliable wireless channel considering the handoff probability.In addition, we perform AC based on both average resource consumption and practical resource consumption.Particularly, adaptive QoS management is used to perform average source rate compression for accepting more new/handoff connections, as well as average source rate increasing for improving the service quality and resource utilization.Finally, a joint resource allocation and packet scheduling algorithm is designed to provide packet-level QoS guarantee in term of "QoS rate", which provides fairness for the services with identical scheduling priority in case of bandwidth shortage.In addition, we enhance the BR mechanism to reduce the BR delay and signaling overhead, which belongs to packet-level QoS provision.The theoretical analyses and the simulation results show that our approach guarantees well the QoS requirements of heterogeneous services, as well as provides high spectrum efficiency.

Figure 4
Figure 4 Queuing model for uplink transmission.

Figure 5
Figure 5 State transition diagram of two-dimensional Markov model.
(b) In the initial status, there are 5,000 mobile SSs uniformly distributed over all cells.When an SS has intention to move from cell x to cell y, its handoff probability is a random value determined by its initial state.Under Rayleigh fading channel, all SSs have identical average SNR γm , which equals 13.8.(c) The new arriving connections are uniformly distributed in different mobile SSs.The probabilities of a new connection belonging to UGS, rtPS, ErtPS, nrtPS, and BE are 10, 25, 35, 20, and 10%, respectively, and, in unit of kb/s, the values of [R min m,x,y R max m,x,y ] for UGS, rtPS, ErtPS, nrtPS, and BE are [128, 128], [96, 386], [16, 64], [48, 128], and [0, 32], respectively.

Figure 6
Figure 6  The relationship between BR success rate and the resource utilization of the contention period.

Table 2
Modulation and coding schemes Figure 1 Proposed adaptive QoS provision framework.