 Research
 Open Access
 Published:
A risksensitive task offloading strategy for edge computing in industrial Internet of Things
EURASIP Journal on Wireless Communications and Networking volume 2021, Article number: 39 (2021)
Abstract
Edge computing has become one of the key enablers for ultrareliable and lowlatency communications in the industrial Internet of Things in the fifth generation communication systems and is also a promising technology in the future sixth generation communication systems. In this work, we consider the application of edge computing to smart factories for mission critical task offloading through wireless links. In such scenarios, although high endtoend delays from the generation to completion of tasks happen with low probability, they may incur severe casualties and property loss and should be seriously treated. Inspired by the risk management theory widely used in finance, we adopt the Conditional Value at Risk to capture the tail of the delay distribution. An upper bound of the Conditional Value at Risk is derived through analysis of the queues both at the devices and the edge computing servers. We aim to find out the optimal offloading policy taking into consideration both the average and the worstcase delay performance of the system. Given that the formulated optimization problem is a nonconvex mixed integer nonlinear programming problem, a decomposition into subproblems is performed and a twostage heuristic algorithm is proposed. The simulation results validate our analysis and indicate that the proposed algorithm can reduce the risk in both the queueing and endtoend delay.
Introduction
Intelligent factory automation is one of the typical applications envisioned in ultrareliable and lowlatency communications (URLLC) scenarios in the fifth generation (5G) and the coming sixth generation (6G) communications [1, 2]. In future smart factories, machines and sensors are seamlessly connected with each other through wireless links to conduct production tasks corporately. During the manufacturing process, a great number of operations of the machines and robots require complex control algorithms and intense data computation, such as traveling across zones to identify and pick up the objects and controlling the robotic arms to assemble components within a precise position alignment [3]. The limited builtin computing resources are not sufficient for the stringent latency requirements, so the tasks have to be offloaded to servers for processing [4]. Conventionally, the large volume of data generated at the local devices is uploaded to the cloud computing servers [5]. However, since the cloud computing servers are usually deployed remotely, the large roundtrip transmission latency as well as the possible network congestion makes it hard to meet the stringent endtoend delay requirements of the actuators and control units in the IIoT system. To overcome these difficulties, edge computing has emerged, where the servers are placed at the edge of the network to achieve a much lower transmission and processing latency [6].
There has been literature focusing on improving the service efficiency of the edge computing systems [7,8,9,10,11]. The authors in [7] adopt the Markov decision process (MDP) to minimize the average delay of a mobile edge computing system by deciding whether to compute locally or offload the tasks to the edge server. In [8], the authors define a delaybased Lyapunov function instead of the queue lengthbased one and minimize the average delay by optimizing resource scheduling under the Lyapunov optimization framework. Threetier multiserver mobile computing networks are investigated in [9], where a cooperative task offloading strategy is proposed based on the Alternating Direction Method of Multipliers (ADMM). In [10], an adaptive learningbased task offloading algorithm is proposed to minimize the average delay for vehicle edge computing systems. Most relatively, [11] integrates the fog computing to the cloudbased industrial Internet of Things (IIoT), where the task offloading, transmission and computing resource allocation schemes are jointly optimized to reduce the service response latency in the unreliable communication environment.
The aforementioned works have only focused on the average delay performance, neglecting the worstcase performance of the edge computing system. However, in the IIoT systems, the probability of an intense delay jitter usually matters much more than the average delay, since when the delay exceeds a certain threshold, severe accidents may incur such as the deadlock of the manufacture process, the damage to the machines and even casualties. Therefore, in such scenarios, not only the average delay performance, but also the potential hazard, i.e., the risk behind the tail distribution of the delay, should be carefully investigated. There have been some preliminary works to deal with the embedded risks in the edge computing systems [12,13,14]. In [12], the tail distribution of the task queue under a probability constraint imposed on the excess value is characterized by the extreme value theory [15], and an offloading strategy is designed to minimize the energy consumption. The authors of [13] also apply the extreme value theory to the edge computing systems, in order to investigate the extreme event of queue length violation in the computation phase. Besides, in [14], the authors focus on a vehicular edge computing network where vehicles either fetch images from cameras or acquire synthesized images from an edge computing server. A risksensitive learning [16]based task fetching and offloading strategy is proposed to minimize the risk behind the endtoend delay.
Different from the works mentioned above, we introduce the risk management theory [17], widely used in the field of finance, to the edge computing system in consideration of the uncertainty of the wireless channels. Value at risk (VaR) and Conditional Value at Risk (CVaR) are the two widely used tools to characterize risks. While VaR takes the Gaussian distribution as assumption and lacks convexity and subadditivity, which makes it inapplicable in many cases, CVaR is a coherent risk measure of any type of probability distribution and is much easier to handle in practice. Therefore, CVaR is employed in this work to model the risk of the task completion delay in the considered edge computingassisted IIoT system. We aim to minimize both the average delay and the CVaR by jointly designing the offloading and computing resource allocation strategy. The main contributions of this work are summarized as follows:

We focus on the hazard incurred by the intense delay jitter in the edge computingassisted IIoT scenario and introduce the risk management theory to the design of the offloading and computation resource allocation strategy.

A cascade queueing model is constructed to describe the endtoend delay property of the system. Due to the uncertainty of the wireless channel, the transmission time follows a general distribution, which makes the queueing model hard to analyze. By exploring the queueing theory and the risk management theory, we provide an upper bound for both the average endtoend delay and the CVaR.

A lowcomplexity risksensitive task offloading strategy is proposed, where both the average performance and the risk with respect to the endtoend delay are optimized simultaneously. The computation complexity of each procedure of the proposed algorithm is analyzed in details. Simulations under the practical wireless environment in the automated factory validate the effectiveness of the proposed strategy in controlling the risk behind the intense delay jitter.
The remainder of this paper is organized as follows. We introduce the system model and analyze both the average delay and the CVaR in Sect. 2. In Sect. 3, we formulate the offloading and computation resource allocation problem and propose a lowcomplexity heuristic algorithm. In Sect. 4, the numerical results are reported with discussions. Finally, Sect. 5 concludes the paper.
System model
Edge computing system
As shown in Fig. 1, we consider an edge computingassisted IIoT system that consists of a set of \({\mathcal {M}}=\{1,2,\ldots ,M\}\) IIoT devices and a set of \({\mathcal {N}}=\{1,2,\ldots ,N\}\) edge computing servers (ECS). Each IIoT device \(i\in {\mathcal {M}}\) randomly generates tasks of identical size of \(d_i\) bits, and we assume the task arrival process follows the Poisson distribution with average arrival rate \(\lambda _i\). We denote by \(\omega _i\) the computation intensity of the task of device i [18], i.e., the number of CPU cycles required to process per bit data. Then, the total CPU cycles needed for a task of device i, denoted by \(c_i\), can be calculated as \(c_i=\omega _i d_i\).
Owing to the insufficiency of computation capability, the IIoT devices offload their tasks to the ECSs through wireless links. Each ECS \(j\in {\mathcal {N}}\) is equipped with a CPU of \(N_j\) cores, which can work simultaneously and independently. We assume that the tasks of a device can only be offloaded to one ECS, while each core only processes the tasks from the same device, which means an ECS can receive tasks from multiple devices as long as the number of devices it serves does not exceed the number of the CPU cores [12]. Let \({\mathbf {X}}_{M \times N}=\left[ x_{ij} \right]\) as the offloading matrix, where \(x_{ij}, i\in {\mathbf {M}}, j\in {\mathbf {N}}\) is defined as follows:
Under this definition, we can redescribe the offloading scheme in the considered system mathematically as \(\sum _{j=1}^N{x_{ij}=1}\) and \(\sum _{i=1}^{M}{x_{ij} \le N_j}\) for \(\forall i\in {\mathbf {M}}, \forall j \in {\mathbf {N}}\).
Due to the massive deployment of devices in the complex industrial environment, it is impractical to obtain the instantaneous channel state information (CSI) accurately and timely. Therefore, in this work, we design a task offloading strategy based on statistics of the wireless links, i.e., the distribution of the channel gain. We assume blockingfading channels such that the channel gain remains unchanged during the execution of one task and varies independently between two executions following an identical distribution, which is known a priori. Denote by \(g_{ij}\) the channel gain from device i to ECS j, the transmission rate can be expressed as follows:
where B is the bandwidth, \(N_0\) is the noise power, \(p_i\) is the transmit power of device i and \(\Phi _{ij}\) is the path loss from device i to ECS j. Without loss of generality, we assume that the noise power at each ECS is identical, and each IIoT device has an orthogonal channel with the same bandwidth B.
Queueing model
In the considered edge computingassisted IIoT system, there are two kinds of queues: the queue at each device and the queue at each ECS, as depicted in Fig. 2. Without loss of generality, we assume that device i offloads its tasks to ECS j, and denote by \(Q^D_{ij}\) the queue formed at the device i. The arrival process of \(Q^D_{ij}\) follows the Poisson process, and the departure process is dependent on the transmission delay denoted by \(t^D_{ij}\), which is given by
Therefore, \(Q^D_i\) follows the M/G/1 model.
As for an ECS, tasks from each device connected to it form an independent queue. Denote by \(Q^S_{ij}\) the queue of tasks offloaded from device i to ECS j, then the arrival process of \(Q^S_{ij}\) is the same as the departure process of \(Q^D_{ij}\). We denote by the \(f_{ij}\) the computation frequency allocated to device i by ECS j, then the computation delay denoted by \(t^S_{ij}\), i.e., the time required to complete a task of device i, is calculated as \(t^S_{ij} = {c_i}/{f_{ij}}\). As a result, the service time of a task follows a deterministic distribution and thus \(Q^S_{ij}\) follows the G/D/1 model.
Based on the queueing analysis, when device i offloads its tasks to ECS j, the total delay denoted by \(t_{ij}\) is given by
where \(W^D_{ij}\) and \(W^S_{ij}\) are the queueing delay at the device i and the ECS j, respectively. We will analyze both the average performance and the CVaR of the total delay in the following.
Average delay
According to (4), the average delay can be calculated as follows
As for the queueing delay at device i, we denote by \(\mu _{ij}\) the service rate of \(Q^D_{ij}\), which can be calculated as the reciprocal of the average transmission time, i.e.,
where the expectation is taken over the probability distribution of the channel gain. According to [19], the average queueing delay at device i can be expressed as follows
where \(\rho _{ij}=\lambda _i/\mu _{ij}\).
To analyze the queueing delay in the G/D/1 queueing model of \(Q^S_{ij}\), we first give the following lemma [20].
Lemma 1
In the G/G/1 queueing model, let \(\lambda\), \(\mu\) and W be the arrival rate, service rate and queueing delay, respectively, then an upper bound of the average queueing delay is given by
where \(\rho =\lambda /\mu\), \(\sigma ^2_a\) is the variance of the interarrival time, and \(\sigma ^2_b\) is the variance of the service time.
According to Lemma 1, we obtain the following theorem characterizing the upper bound of the average queueing delay at a ECS.
Theorem 1
When device i offloads its tasks to ECS j, an upper bound of the average queueing delay at ECS j is given by
where \(\lambda ^S_{ij}\) is the arrival rate of \(Q^S_{ij}\), \({\rho ^S_{ij} }= \lambda ^S_{ij}/\mu _{ij}\) is the traffic intensity and \({\sigma ^S_{ij}}^2\) is the variance of the arrival interval of tasks offloaded from device i to ECS j.
Proof
G/D/1 model can be seen as a special case of G/G/1 model with the service time following a deterministic distribution, the variance of which is zero. By substituting \(\lambda =\lambda ^S_{ij}\), \(\sigma ^2_a = {\sigma ^S_{ij}}^2\), \(\sigma ^2_b = 0\) and \(\rho = \rho ^S_{ij}\) into (8), we get (9) and Theorem 1 is proved. \(\square\)
Note that, due to the cascaded structure between \(Q^D_{ij}\) and \(Q^S_{ij}\), the arrival rate \(\lambda ^S_{ij}\) of \(Q^S_{ij}\) is equal to the departure rate of \(Q^D_{ij}\), which can be evaluated from the analysis of the interdeparture time in [21]. Similarly, the variance of the interarrival time of \(Q^S_{ij}\), i.e., \({\sigma ^S_{ij}}^2\), can be derived from the variance of the interdeparture time of \(Q^D_{ij}\) as in [22].
Finally, combining (5), (6) and (9), the upper bound of the average total delay can be obtained in the following corollary.
Corollary 1
When device i offloads its tasks to ECS j, an upper bound of the average total delay \(E[t_{ij}]\) is given by
Since each IIoT device offloads its tasks to only one ECS, for each device i the task completion time denoted by \(t_i\) can be expressed as \(t_i=\sum ^N_{j=1}{x_{ij}t_{ij}}\), and correspondingly an upper bound of the average total delay of device i based on (10) is given by
where
Risk metric for delay
Risk in the considered edge computingassisted system is mainly reflected in the high latency happened with low probability. Specifically, we introduce CVaR as a measure of risk to characterize the tail distribution of the delay. Before we formally define CVaR, we first give the definition of VaR [23].
Definition 1
For a random variable X and a confidence level \(\alpha \in (0,1)\), the \(\alpha\)VaR of X is the \(\alpha\)percentile of the distribution of X, which can be expressed mathematically as follows
The CVaR measures the expected loss in the right tail given a particular threshold has been crossed and can also be considered as the average of potential loss that exceed the VaR. The definition of CVaR is given as follows [24].
Definition 2
For a random variable X and a confidence level \(\alpha \in (0,1)\), the \(\alpha\)CVaR of X is given by
To characterize the CVaR of the total delay, we first analyze the CVaR of each part of the total delay in (4). Recall that the service process of the queue at the device follows general distribution, so it is quite difficult to directly characterize the probability distribution of the waiting time. However, in the considered IIoT scenario, it is reasonable to take the heavy traffic assumption. According to [25], the cumulative distribution function (CDF) of \(W_{ij}^D\) can be approximated as
where \(V_{ij}\) is the variance of the service time of \(Q_{ij}^D\), i.e., the transmission time of a task. Based on (15), the CVaR of \(W_{ij}^D\) can be evaluated in the following theorem.
Theorem 2
For a confidence level \(\alpha \in (0,1)\), the \(\alpha\)CVaR of \(W_{ij}^D\) can be expressed as
Proof
According to the definition of VaR in Definition 1, we can obtain that
By substituting (17) to (14), the CVaR of \(W_{ij}^D\) can be calculated as follows.
\(\square\)
As for the transmission delay \(t_{ij}^D\), we define a auxiliary function
where \((x)^+= \max {(0,x)}\) and the expectation is taken over the distribution of the channel gain. According to [26], the CVaR of \(t_{ij}^D\) can finally be calculated as
Now we turn to the queue at the ECS. Similar to (15), the CDF of \(W_{ij}^S\) can be approximated as
and thus the CVaR of the queueing delay at the ECS can be evaluated in the following theorem.
Theorem 3
For a confidence level \(\alpha \in (0,1)\), the \(\alpha\)CVaR of \(W_{ij}^S\) can be expressed as
The proof of Theorem 3 is similar to Theorem 2 and is omitted here for brevity.
Since we assume constant computing capability at the ECS, the CVaR of the service time of \(Q_{ij}^S\) is at the same value as itself, i.e.,
With the CVaR of each part of the delay involved in the task offloading, we provide an upper bound of the CVaR of the total delay in the following theorem based on the convexity and subadditivity [27].
Theorem 4
For a confidence level \(\alpha \in (0,1)\), an upper bound of the \(\alpha\)CVaR of \(t_i\) is given by
where
Proof
According to the convexity, the \(\alpha\)CVaR of \(t_i\) satisfies the following Jensen inequality:
Furthermore, based on (4) and the subadditivity of the CVaR, we have the following inequality:
By combining (26) and (27), Theorem 4 is proved. \(\square\)
Problem formulation and solution
Problem formulation
In the design of the edge computingassisted IIoT system, not only the average latency but also risk behind the intense delay jitter should be carefully considered. Taking into account both the average delay performance and the risk, we set the objective of the task offloading problem as the weighted sum of the average delay and the CVaR, i.e., the meanrisk sum. We have shown that obtaining an explicit expression of both the two terms is often cumbersome, especially for the complex wireless environment in the automated factories. Therefore, the two upper bounds of the average total delay and the corresponding CVaR derived in the previous section are adopted instead. Furthermore, in the considered mission critical IIoT scenario, the performance of the whole system is usually determined by the device with the worst performance. As a result, we aim to minimize the maximum meanrisk sum among all the devices, which can be described as the following optimization problem:
where \({\mathbf {f}}=[f_{ij}],i\in {\mathcal {M}},j\in {\mathcal {N}}\) is the computation frequency allocation matrix, \(\beta \in (0,1)\) is the weight of the CVaR, also called the risksensitive parameter, and \(F_j\) is the overall computation frequency of ECS j. Constraint (28b) is used to guarantee that the tasks generated by the device can only be offloaded to one ECS. Constraint (28c) and (28d) indicate that the number of devices served by a ECS should not exceed the number of its CPU cores, and the sum of the computation frequency allocated to these devices should not exceed its overall computation frequency. Substituting (12), (18), (20), (22) and (23) to (28), we find that the optimization problem is a nonconvex mixed integer nonlinear problem (MINLP), which is NPhard [28]. To reduce the computation overhead, we propose a heuristic algorithm, which will be described in details in the following subsection.
Problem solving
Recall that the CVaR of the queueing delay at the device in (20) is in the form of an minimization problem. Since the optimization variable \(\gamma\) in (20) is independent of the optimization variables \({\mathbf {X}}\) and \({\mathbf {f}}\) in (28), we can solve (20) first and substitute its optimal solution to (28) for the subsequent problem solving.
To solve (20), we introduce an auxiliary variable \(z_ij=(t_{ij}^D\gamma )^+\), and problem (20) can be transformed to the following problem
Problem (29) is a stochastic optimization problem with the expectation taken over the channel gain \(g_{ij}\). To approximate the expectation, we sample the probability distribution of \(g_{ij}\) [26], and a transformed problem is obtained as follows:
where \(z_{ij}^k, k \in {\mathcal {K}}= \{1,2,\ldots ,K\}\) are the samples of \(z_{ij}\). Problem (30) is a linear optimization problem, and the optimal solution of which, denoted by \(U_{ij}\), can be obtained from the interior point method (IPM) [29]. There are \(K+1\) variables and 2K constraints in problem (30), so we can solve it at the time complexity of \(O(((3K+1)(K+1)^2+(3K+1)^{1.5}(K+1))\delta )\), where \(\delta\) is the number of decoded bits [30].
With all the derived average and CVaR terms, problem (28) can be reformulated as the following optimization problem:
It is obvious that the objective function of problem (31) contains the term \(x_{ij}/f_{ij}\), and thus, problem (31) is still a nonconvex MINLP. In order to reduce the computational complexity, we decompose the original problem into two subproblems.
First, we consider the following problem:
where
It is worthwhile to mention that problem (32) is a convex MINLP, which can generally be solved via an outer approximation algorithm or an extended cutting plane algorithm [31]. More specifically, problem (32) is in the form of a bottleneck generalized assignment problem [32], which can be solved through the algorithm proposed in [33] at the time complexity of \(O(MN\log {N}+\theta (NM+N^2))\), where \(\theta\) is the number of bits required to encode \(\max _{i,j}{V_{ij}}\).
After solving the optimal offloading matrix for problem (32) denoted by \(X^*= [x_{ij}^*], i\in {\mathcal {M}},j\in {\mathcal {N}}\), we substitute \(X^*\) to (28), and the second subproblem can be formulated as follows:
Problem (34) is a nonconvex optimization problem. To transform it to a convex problem, we introduce an auxiliary variable \({\mathbf {G}}=[1/f_{ij}],i\in {\mathcal {M}},j\in {\mathcal {N}}\), and an optimization problem equivalent to (34) is given by
The right side of inequality (35a) is used to maintain the stability of the queue at the ECS. Although problem (35) is a convex optimization problem, the objective function is in the form of the pointwise maximum of M meanrisk sums. To handle this, we optimize the epigraph of problem (35) and obtain the equivalent problem as follows:
Problem (36) is a convex nonlinear optimization problem, which can be solved by various algorithms such as IPM. Constraint (36a), (35a) and (35b) consists of M, \(\sum _{i=1}^M{\sum _{j=1}^N{x_{ij}^*}}\) and N individual constraints, respectively. As a result, problem (36) has \(L \triangleq M+N+\sum _{i=1}^M{\sum _{j=1}^N{x_{ij}^*}}\) constraints in all. According to [34], we can find an \(\epsilon\)optimal solution to problem (36) in \(O(\kappa \sqrt{L} \ln {\frac{L\mu _0}{\epsilon }})\) Newton iterations through the logarithmic barrier method [35], where \(\kappa\) is the selfconcordance factor, \(\mu _0\) is the initial barrier value and \(\epsilon\) is the accuracy parameter. Till now, both the offloading matrix \({\mathbf {X}}\) and the frequency allocation matrix \({\mathbf {f}}\) have been solved.
Results and discussion
In this section, we evaluate the proposed strategy through the numerical results. We consider a typical use case in the edge computingassisted IIoT system, i.e., the videooperated remote control use case, with a typical latency requirement of 10100 ms and payload size of 15150 kbytes [36]. In the simulation, we consider 8 IIoT devices offload their tasks to 2 ECSs. Without loss of generality, we set the data size to 0.5 Mbits, i.e., 62.5 kbytes, and the task computation intensity to 15 cycles/bit for each task of each device. Each ECS is equipped with a fourcore CPU. The task arrival rate of each device, i.e., the parameter of the Poisson process, is set to be uniformly distributed in (10, 30). The bandwidth of each wireless channel is 10 MHz, and the transmission power, noise power at the receiver and the path loss are all set to be identical for each device at 30 dBm, \(10^{9}\) W and 70 dB, respectively. To characterize the fading channel in the practical automated factory, we set the channel distribution as a mixture of Rayleigh and lognormal distribution, which has been confirmed by the measurements in the real industrial environment [37]. The parameter of the Rayleigh distribution is set to be uniformly distributed in (0.5, 1) for each IIoTECS pair, and correspondingly, the two parameters of the lognormal distribution are set to be uniformly distributed in (1, 2) and (0, 4), respectively. Finally, we set the confidence level to \(\alpha = 0.99\).
Our proposed strategy considers both the queueing effect and the risk behind the total delay and thus is denoted by queueingbased and risksensitive (QR) strategy in the following simulations. To evaluate the performance of the QR, we compare it with the following five strategies: (i) the queueingbased and risksensitive optimal (QROpt) strategy, i.e., the globally optimal solution to problem (28); (ii) the queueingbased and nonrisksensitive (QNR) strategy, which considers the queueing effect but only optimizes the average delay performance, i.e., the weight of the CVaR is set to \(\beta =0\); (iii) the queueingbased and nonrisksensitive optimal (QNROpt) strategy, i.e., the globally optimal solution that corresponds to the QNR case; (iv) the nonqueueingbased and risksensitive (NQR) strategy, which takes into account both the average delay and the CVaR, but does not consider the queueing effect; (v) the nonqueueingbased and nonrisksensitive (NQNR), which considers neither the queueing effect nor the risk. In the following simulations, we set the weight of the CVaR to \(\beta =2\) for risksensitive strategies.
We first investigate the complementary cumulative distribution function (CCDF) of the total delay under the six offloading strategies, since the CVaR captures the tail information of the delay distribution. As presented in Fig. 3, for the probability of ultrahigh delay, the curves of QR, QROpt and NQR are all lower than their corresponding nonrisksensitive strategies. This implies that by adding the CVaR to the optimization objective, the risk of high delay can be greatly reduced. On the other hand, we can see that for any value of the total delay, the CCDF under NQR and NQNR is greater than that under QR, QROpt, QNR and QNROpt, which means the nonqueueing strategies are more likely to arise a higher delay. This is reasonable, since the nonqueueing strategies neglect the queueing effect in the strategy design, which leads to a higher queueing delay. Furthermore, the curve of QR is close to that of QROpt and has nearly the same trend, which indicates that the proposed algorithm achieves nearoptimal performance with a great reduction in computation complexity.
In Figs. 4 and 5, we compare how the delay performance evolves with the computation frequency of the ECS under the six strategies. Specifically, Fig. 4 investigates relationship between the average delay and the computation frequency, and Fig. 5 focuses on the 99th percentile of the total delay. It can be seen that with the increasing computation frequency, the average total delay and the 99th percentile decreases for all the six strategies. The reason is that the higher the computation frequency, the more computation resources allocated to the IIoT devices and thus the lower the computation delay. Furthermore, note that the delay does not descend much when computation frequency is relatively high. This is because for high computation frequency, both the computation delay and the queueing delay at the ECS are relatively low, and thus, the total delay is mainly dependent on the queueing delay at the devices. We can also see that the queueing strategies outperform the corresponding nonqueueing strategies for both the average performance and the 99th percentile, which verifies the significance of the queueing analysis again. More importantly, the two figures verify the nearoptimality of the proposed algorithm and jointly indicate that the risksensitive strategies achieve nearly the same average total delay as the nonrisksensitive ones, but greatly reduce the 99th percentile of the total delay by incorporating the risk to the design of offloading strategy. In other words, the intense delay jitter can be effectively controlled under the risksensitive strategies only at the price of very little degradation on the average performance.
Finally, we investigate the effect of the task size on both the average delay and the 99th percentile under the QR and QNR strategies. As shown in Fig. 6, the 99th percentile is higher than the average total delay for both strategies, since the former characterize the worstcase delay. With the increase of the task size, both the average delay and the 99th percentile increase under both strategies. This is due to the fact that a larger task size leads to the higher transmission and computation delay. Furthermore, the 99th percentile of QR is always lower than that of QNR, while the two curves of the average delay almost coincide with each other. Note that the average delay under the QNR strategy is the lower bound of that under the QR. This implies that QR achieves nearly the same average performance as the QNR while simultaneously improving the worstcase performance with respect to the total delay.
Conclusions
In this work, we introduce the risk management theory to design of the edge computingassisted IIoT system. We explore the queueing theory and the properties of the CVaR to capture the tail distribution of the endtoend delay, and provide two upper bounds of the average total delay and the CVaR. A joint task offloading and computation resource allocation problem is formulated to simultaneously minimize the average total delay and the risk. Since the problem is a nonconvex MINLP, we decompose it into two subproblems and design a twostage heuristic algorithm. The computation complexity of each procedure of the proposed algorithm has been analyzed. Finally, simulations are performed under the practical channel model in the automated factories, and the results verify that the proposed strategy can effectively control the risk of intense delay jitter while guaranteeing the average delay performance.
Availability of data and materials
Not applicable.
Abbreviations
 URLLC:

Ultrareliable lowlatency communications
 5G:

Fifth generation
 6G:

Sixth generation
 MDP:

Markov decision process
 ADMM:

Alternating Direction Method of Multipliers
 IIoT:

Industrial Internet of Things
 VaR:

Value at risk
 CVaR:

Conditional value at risk
 ECS:

Edge computing server
 CDF:

Cumulative distribution function
 MINLP:

Mixed integer nonlinear problem
 IPM:

Interior point method
 QR:

Queueingbased and risksensitive
 QNR:

Queueingbased and nonrisksensitive
 QROpt:

Queueingbased and risksensitive optimal
 QNROpt:

Queueingbased and nonrisksensitive optimal
 NQR:

Nonqueueingbased and risksensitive
 NQNR:

Nonqueueingbased and nonrisksensitive
 CCDF:

Complementary cumulative distribution function
References
Z. Gu, C. She, W. Hardjawana, S. Lumb, D. McKechnie, T. Essery, B. Vucetic, Knowledgeassisted deep reinforcement learning in 5g scheduler design: From theoretical framework to implementation. arXiv preprint arXiv:2009.08346 (2020)
C. She, C. Sun, Z. Gu, Y. Li, C. Yang, H.V. Poor, B. Vucetic, A tutorial on ultrareliable and lowlatency communications in 6g: Integrating domain knowledge into deep learning. Proceedings of the IEEE. 2021. arXiv preprint arXiv:2009.06010 (to appear) (2020)
J. Wan, J. Yang, Z. Wang, Q. Hua, Artificial intelligence for cloudassisted smart factory. IEEE Access 6, 55419–55430 (2018)
M.S. Elbamby, C. Perfecto, C. Liu, J. Park, S. Samarakoon, X. Chen, M. Bennis, Wireless edge computing with latency and reliability guarantees. Proc. IEEE 107(8), 1717–1737 (2019)
T. Velte, A. Velte, R. Elsenpeter, Cloud Computing, a Practical Approach (McGrawHill Inc, USA, 2009)
W. Shi, J. Cao, Q. Zhang, Y. Li, L. Xu, Edge computing: vision and challenges. IEEE Internet Things J. 3(5), 637–646 (2016). https://doi.org/10.1109/JIOT.2016.2579198
J. Liu, Y. Mao, J. Zhang, K.B. Letaief, Delayoptimal computation task scheduling for mobileedge computing systems. In: 2016 IEEE International Symposium on Information Theory (ISIT), pp. 1451–1455 (2016)
Y. Zhang, P. Du, J. Wang, T. Ba, R. Ding, N. Xin, Resource scheduling for delay minimization in multiserver cellular edge computing systems. IEEE Access 7, 86265–86273 (2019)
Y. Wang, X. Tao, X. Zhang, P. Zhang, Y.T. Hou, Cooperative task offloading in threetier mobile computing networks: an admm framework. IEEE Trans. Veh. Technol. 68(3), 2763–2776 (2019). https://doi.org/10.1109/TVT.2019.2892176
Y. Sun, X. Guo, J. Song, S. Zhou, Z. Jiang, X. Liu, Z. Niu, Adaptive learningbased task offloading for vehicular edge computing systems. IEEE Trans. Veh. Technol. 68(4), 3061–3074 (2019)
C. Shi, Z. Ren, K. Yang, C. Chen, H. Zhang, Y. Xiao, X. Hou, Ultralow latency cloudfog computing for industrial internet of things. In: 2018 IEEE Wireless Communications and Networking Conference (WCNC), pp. 1–6 (2018)
C. Liu, M. Bennis, M. Debbah, H.V. Poor, Dynamic task offloading and resource allocation for ultrareliable lowlatency edge computing. IEEE Trans. Commun. 67(6), 4132–4150 (2019)
Y. Zhu, Y Hu., T. Yang, A. Schmeink, Reliabilityoptimal offloading in multiserver edge computing networks with transmissions carried by finite blocklength codes. In: 2019 IEEE International Conference on Communications Workshops (ICC Workshops), pp. 1–6 (2019). https://doi.org/10.1109/ICCW.2019.8757175
S. Batewela, C. Liu, M. Bennis, H.A. Suraweera, C.S. Hong, Risksensitive task fetching and offloading for vehicular edge computing. IEEE Commun. Lett. 24(3), 617–621 (2020). https://doi.org/10.1109/LCOMM.2019.2960777
L. De Haan, A. Ferreira, Extreme Value Theory: An Introduction (Springer, New York, 2007)
O. Mihatsch, R. Neuneier, Risksensitive reinforcement learning. Mach. Learn. 49(2–3), 267–290 (2002)
A.G. Hessami, (ed.): Perspectives on Risk, Assessment and Management Paradigms. Books, vol. 5615. IntechOpen, London (2019). https://doi.org/10.5772/intechopen.77127. https://ideas.repec.org/b/ito/pbooks/5615.html
C. You, K. Huang, H. Chae, B. Kim, Energyefficient resource allocation for mobileedge computation offloading. IEEE Trans. Wireless Commun. 16(3), 1397–1411 (2017). https://doi.org/10.1109/TWC.2016.2633522
D.P. Bertsekas, R.G. Gallager, P. Humblet, Data Networks, vol. 2 (PrenticeHall International, New Jersey, 1992)
L. Kleinrock, Queueing Systems, Volume 2: Computer Applications vol. 66. Wiley, New York (1976)
D.J. Bertsimas, D. Nakazato, The departure process from a gi/g/1 queue and its applications to the analysis of tandem queues (1990)
P.C. Yeh, J.F. Chang, Characterizing the departure process of a single server queue from the embedded Markov renewal process at departures. Queueing Syst. 35(1–4), 381–395 (2000)
P. Jorion, Value at risk (2000)
C. Acerbi, D. Tasche, On the coherence of expected shortfall. J. Bank. Finance 26(7), 1487–1503 (2002)
J.F. Kingman, On queues in heavy traffic. J. R. Stat. Soc.: Ser. B (Methodol.) 24(2), 383–392 (1962)
R.T. Rockafellar, S. Uryasev et al., Optimization of conditional valueatrisk. J. Risk 2, 21–42 (2000)
G.C. Pflug, Some remarks on the valueatrisk and the conditional valueatrisk. In: Probabilistic Constrained Optimization, pp. 272–281. Springer, Boston (2000)
D. Bertsimas, J.N. Tsitsiklis, Introduction to Linear Optimization, vol. 6 (Athena Scientific, Belmont, 1997).
S. Boyd, S.P. Boyd, L. Vandenberghe, Convex Optimization (Cambridge University Press, Cambridge, 2004).
P.M. Vaidya, An algorithm for linear programming which requires o (((m+ n) n 2+(m+ n) 1.5 n) l) arithmetic operations. Mathematical Programming 47(13), 175–201 (1990)
J. Kronqvist, D.E. Bernal, A. Lundell, I.E. Grossmann, A review and comparison of solvers for convex minlp. Optim. Eng. 20(2), 397–455 (2019)
J. Mazzola, A. Neebe, Bottleneck generalized assignment problems. Eng. Costs Prod. Econ. 14(1), 61–65 (1988)
S. Martello, P. Toth, The bottleneck generalized assignment problem. Eur. J. Oper. Res. 83(3), 621–638 (1995)
D. Den Hertog, Interior Point Approach to Linear, Quadratic and Convex Programming: Algorithms and Complexity, vol. 277 (Springer, Dordrecht, 2012)
D. Den Hertog, C. Roos, T. Terlaky, On the classical logarithmic barrier function method for a class of smooth convex programming problems. J. Optim. Theory Appl. 73(1), 1–25 (1992)
J. Yang, B. Ai, I. You, M. Imran, L. Wang, K. Guan, D. He, Z. Zhong, W. Keusgen, Ultrareliable communications for industrial internet of things: Design considerations and channel modeling. IEEE Netw. 33(4), 104–111 (2019). https://doi.org/10.1109/MNET.2019.1800455
T. Olofsson, A. Ahlen, M. Gidlund, Modeling of the fading statistics of wireless sensor network channels in industrial environments. IEEE Trans. Signal Process. 64(12), 3021–3034 (2016)
Acknowledgements
Not applicable.
Funding
This work was supported by the Shanghai Municipal Natural Science Foundation (No. 19ZR1404700). This work was also supported in part by the National Science Foundation of China under Grants 71731004.
Author information
Authors and Affiliations
Contributions
All the authors contributed to the system design, queueing and CVaR analysis, strategy design, simulations and the writing of this paper. All authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hao, X., Zhao, R., Yang, T. et al. A risksensitive task offloading strategy for edge computing in industrial Internet of Things. J Wireless Com Network 2021, 39 (2021). https://doi.org/10.1186/s13638021019235
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13638021019235
Keywords
 Ultrareliable and lowlatency communications
 Edge computing
 Industrial Internet of Things
 Risk management theory
 Conditional Value at Risk