Data analysis on video streaming QoE over mobile networks

Wang, Qingyong; Dai, Hong-Ning; Wu, Di; Xiao, Hong

doi:10.1186/s13638-018-1180-8

Research
Open access
Published: 09 July 2018

Data analysis on video streaming QoE over mobile networks

Qingyong Wang¹,
Hong-Ning Dai ORCID: orcid.org/0000-0001-6165-4196¹,
Di Wu^2,3 &
…
Hong Xiao⁴

EURASIP Journal on Wireless Communications and Networking volume 2018, Article number: 173 (2018) Cite this article

6235 Accesses
14 Citations
Metrics details

Abstract

One of recent proposals on standardizing quality of user experience (QoE) of video streaming over mobile network is video Mean Opinion Score (vMOS), which can model QoE of video streaming in 5 discrete grades. However, there are few studies on quantifying vMOS and investigating the relationship between vMOS and other quality of service (QoS) parameters. In this paper, we address this concern by proposing a novel data analytical framework based on video streaming QoE data. In particular, our analytical model consists of K-means clustering and logistic regression. This model integrates the benefits of both these two models. Moreover, we conduct extensive experiments on realistic dataset and verify the accuracy of our proposed model. The results show that our proposed framework outperforms other existing methods in terms of prediction accuracy. Moreover, our results also show that vMOS is essentially affected by many QoS parameters such as initial buffering latency, stalling ratio, and stalling times. Our results offer a number of insights in improving QoE of video streaming over mobile networks.

1 Introduction

Video streaming is becoming one of the most popular services over mobile networks. It is predicted in [1] that the traffic caused by video streaming will occupy more than 77% of all consumer Internet traffic by 2021, among which mobile video traffic will be more than 55% of all video traffic. The growing demands on video streaming over mobile networks inevitably lead to the challenges in optimizing network resource in order to improve the user perceptual experience. Many previous studies mainly focus on improving quality of service (QoS) of video streaming over mobile networks. Typical QoS measures include throughput, bandwidth, outage, jitter, and delay [2]. However, most of these QoS metrics fail to characterize user perceptual experience, which is also called quality of experience (QoE). It is more crucial to conduct video quality assessment from QoE than that from QoS [3, 4] because (i) enhancing QoS does not directly improve QoE [5] and (ii) only improving QoS sometimes significantly increases operating expenditure, consequently decreasing the profit of service providers [6].

Therefore, QoE improvement of video streaming over mobile networks has received extensive attention recently. In particular, the work of [7] investigates QoE-driven cross-layer optimization for video transmission in wireless networks. Ramamurthi et al. [8] propose a resource management scheme at network core in wireless networks to improve video QoE. The work of [9] presents a large-scale measurement-based study on the effects of Internet path selection in video QoE and offers several QoE enhancement schemes.

However, the prerequisite of QoE improvement of video streaming is to quantify QoE appropriately. Video QoE assessment schemes can be generally categorized into subjective tests, objective assessments, and data-driven analysis [3]. Compared with subjective tests and objective assessments, data-driven analysis is more promising due to the availability of massive datasets and the accuracy of characterizing user perception while overcoming the drawbacks of subjective tests and objective assessments (such as high cost and insufficient human visual system knowledge). In particular, the work of [6] proposes a data-driven model to quantify the metrics affecting video QoE. Jiang et al. [10] improve video QoE by exploiting data-driven QoE prediction. The work of [11] improves the video bitrate adaptation based on data-driven QoE prediction. Huang et al. [12] propose a dynamic adaptive streaming via HTTP to optimize user QoE. Ref. [13] presents a Multi-Constraint Quality-of-Experience (QoE) centric Routing (MCQR) scheme to improve user video streaming QoE over mobile networks. The work of [14] presents a queue-based model to analyze the video buffer using discrete-time analysis. Marai et al. [15] propose a client-server cooperation-based approach to achieve efficiency, fairness, and stability of video adaptive streaming.

In addition to the above efforts, there are also other solutions on standardizing QoE. One of recent video QoE measurement standards is U-vMOS (User/Unified/Ubiquitous video Mean Opinion Score)^{Footnote 1}, which was proposed by Huawei in 2016 [16]. The score of vMOS is essentially established according to Mean Opinion Score (MOS) standardized by International Telecommunication Union (ITU) [17] as shown in Table 1, where discrete grades from 1 to 5 represent bad, poor, fair, good, and excellent, respectively. It is shown in [16] that vMOS at video playback startup is mainly determined by three key factors: video quality, initial buffering delay, and video freezing duration, each of which is also affected by multiple QoS variables. Recently, Pan et al. [18] investigate machine learning-based bitrate estimation on YouTube video streaming based on Huawei’s vMOS assessment model. However, they just give a mathematical expression of vMOS based on their subjective estimations. The work of [19] presents using vMOS to investigate the impacts of several system parameters on long-term evolution (LTE) networks based on simulations. To the best of our knowledge, there is no data-driven QoE analysis on vMOS.

Table 1 Mean Opinion Score (MOS) evaluation scale

Full size table

Therefore, this paper aims to conduct data-driven QoE analysis on vMOS. In particular, we obtain a realistic dataset on video QoE based on SpeedVideo Global Operating Platform (SVGOP) established by Huawei. This dataset has the following unique characteristics: (1) heterogeneous data types, (2) positive/negative correlations, and (3) dependence of features; these characteristics result in the difficulties in analyzing video QoE data.

To address the above concerns, we propose a data-driven analysis framework to analyze the relationship between vMOS and other QoS parameters. Although our previous work [20] presented preliminary results on quantifying vMOS and other QoS parameters, this study is significantly different from our previous work in the following aspects: (1) we conduct a data preliminary analysis on vMOS data; this analysis has been ignored in our previous work; (2) we propose a novel analytical framework in this paper, which is significantly different from the previous work; and (3) experiment results have shown that our proposed model can improve the predication accuracy than our previous work.

In addition, this paper has the following research contributions in contrast to other existing studies: (i) our model consists of K-means clustering approach and logistic regression; the combination of these two approaches can greatly improve the predication accuracy; (ii) we have conducted extensive experiments on three training datasets and one testing dataset and the experiment results show that our proposed model outperforms other existing methods in terms of predication accuracy; and (iii) moreover, our results also imply that a small set of QoS parameters play an important role in determining vMOS.

The remainder of this paper is organized as follows. Section 2 describes the data used in this paper and identifies the challenges. We then present the overview of our method in Section 3. Section 4 presents the experimental results. Finally, we conclude this paper in Section 6.

2 Data description

We obtained the realistic datasets from SpeedVideo Global Operating Platform (SVGOP) established by Huawei^{Footnote 2}; SVGOP is a specific application of vMOS in mobile networks throughout the world. In particular, three datasets contain totally 89,266 samples with 11 features (i.e., QoS parameters) and 1 scoring factor vMOS. Table 2 summarizes the features. As shown in [16], vMOS is a function of the above QoS parameters. However, to the best of our knowledge, there is no data-driven analysis on the relation between vMOSand other QoS parameters.

Table 2 Description of dataset

Full size table

We first conduct a preliminary statistic analysis on the dataset. In particular, Fig. 1 displays the histogram of vMOS in different scales (ranging from 1 to 5), where the vertical axis represents the frequency of vMOS and the horizontal axis represents the evaluation scale of vMOS. We observe from Fig. 1 that most of vMOS values concentrate on the range from 3.5 to 4. When plotting the normal distribution curve (i.e., the red curve in Fig. 1), we can find that the median is about 4.0, implying that most of vMOS values are quite close to “good”.

We then investigate the correlations between vMOS and other QoS parameters. In particular, Fig. 2 shows a histogram of vMOS versus video total DL rate. It is shown in Fig. 2 that there is a positive correlation between vMOS and video total DL rate, implying that the higher the video total DL rate is, the higher the vMOS is.

In order to measure the correlation between vMOS and initial buffering latency, we plot vMOS against initial buffering latency in Fig. 3 where there is a negative correlation between vMOS and initial buffering latency. In other words, vMOS decreases with the increment of initial buffering latency, implying the worse QoE to users.

Figure 4 plots vMOS against stalling ratio. Essentially, stalling ratio is defined as $\text {Stalling ratio}=\frac {\text {Stalling duration}}{\text {Playing total duration}}$ according to the white paper of Huawei [16]. Since most of Stalling ratio values concentrate on some small values (i.e., less than 0.3), we also plot a subgraph in Fig. 4 to better present the results. We observe from Fig. 4 that there is a negative correlation between vMOS and stalling ratio. In other words, the higher the stalling ratio is, the lower the vMOS is.

Due to the space limitation, we do not show other correlation analytical results on other QoS parameters in this section. In summary, we find that the dataset has the following characteristics:

Heterogeneous data types. The preliminary results show that QoS parameters are in different types and different ranges. For example, the initial buffering latency is ranging from 500 ms (millisecond) to 30,000 ms while the average rate of play phase is ranging from 300 kbps (kilobit per second) to 16,000 kbps.
Positive/negative correlations of QoS parameters. As shown in the statistics results, we observe that there are positive or negative correlations between vMOS and other QoS parameters. For example, there is a positive correlation between vMOS and video total DL rate while there is a negative correlation between vMOS and initial buffering latency.
Dependence on QoS parameters. The preliminary statistics results also show that vMOS is essentially affected by multiple factors, such as average rate of playing phase, video total DL rate, end-to-end (E2E) round-trip time, and initial buffering latency.

The above characteristics result in the difficulties in analyzing video QoE data. To address the above challenges, we propose a novel data-driven QoE analysis framework (which will be described in details in Section 3).

3 QoE analysis framework

In order to address the above concerns, we propose a novel data-driven QoE analysis method, which consists of the following procedures: (1) data preprocessing and (2) data analysis with integration of K-means clustering and logistic regression. Figure 5 shows the flow chart of our proposed method.

3.1 Data preprocessing

We need to normalize the features since they are in different units. In particular, we make a conversion from the absolute value to the relative value. In particular, we choose the MAX-MIN scaling method to normalize the positive and negative values. More specifically, we have

Positive values:
$$ u_{ij}=\frac{x_{ij}-\min\left(x_{ij}\right)}{\max\left(x_{ij}\right)-\min\left(x_{ij}\right)}, $$
(1)
Negative values:
$$ u_{ij}=\frac{\max\left(x_{ij}\right)-x_{ij}}{\max\left(x_{ij}\right)-\min\left(x_{ij}\right)}, $$
(2)

where x_ij represents the original value, u_ij represents the value after normalization, min(·) is the minimum value, and max(·) is the maximum value.

3.2 K-means + LR method

In this paper, we use logistic regression (LR) mainly to predict whether a user’s QoE of video streaming is “good” or “bad” since it is hard to determine it based on a continuous vMOS value within [ 1,5]. However, logistic regression requires that the dependent variable is dichotomou (binary). Therefore, we first exploit K-means to categorize the sample datasets into two groups according to “good” or “bad”. We then use logistic regression to predict the QoE of video streaming.

3.2.1 K-means clustering

The main idea of K-means algorithm [21] is to find a partition such that squared error between the empirical mean of a cluster and the points in the cluster is minimized. In particular, given a dataset X={x_i}, i=1,2,...,m, we partition them into K disjoint clusters so that the sum of the intra-cluster variances is minimized. We denote the K disjoint clusters by C=C₁,C₂,...,C_k.

The goal of K-means is to minimize the within-cluster sum of variance over all clusters. In other words, we need to find

$$ \arg\!\min_{\mathbf{C}} \sum\limits^{K}_{k=1} \sum\limits_{x \in C_{i}} \lVert x_{i}-\mu_{k} \rVert^{2}, $$

(3)

where μ_k is the mean of cluster C_i.

This clustering process can be completed by alternating between assigning instances to their closest centers and recomputing the centers until a local minimum reaches.

3.2.2 Logistic regression

In this paper, we concern with a binary classification problem of categorizing QoE of a video streaming into two cases of “bad” and “good”. To solve this problem, we exploit logistic regression (LR). In particular, we denote the QoE by a binary-dependent variable. This variable only takes two values, either “0” or “1”. Specifically, we use “1” to represent “good” and “0” to represent “bad”. Without loss of generality, we classify a data sample as “bad” when its vMOS score is within [ 1,2] and “good” when its vMOS score is within [ 3,5], as shown in Table 1.

Recall that a dataset contains m samples, D=(X₁,y₁),(X₂,y₂),...,(X_m,y_m), where X_i=(x_i1,x_i2,...,x_ij) is the i−th input pattern with dimensionality j and y_i is a corresponding variable that takes a value of 0 or 1. The term y_i=0 indicates the i−th sample is bad and y_i=1 indicates the i−th sample is good. The vector X_i contains j influence features (for all n QoE features) for the i−th sample and x_ij denotes the value of feature j for the i−th sample. We denote the probability of vMOS being “good” by p and by 1−p of vMOS being “bad”. Then, the logit transform of probability p is as follow,

$$ \text{logit}(p)=\ln\left[\frac{p}{1-p}\right]=\beta_{0}+\beta_{1}x_{1}+\beta_{2}x_{2}+...+\beta_{n}x_{n}, $$

(4)

where β₀ is the offset and β_i (i=1,...,n) is the corresponding regression coefficient for each QoS parameter.

It is shown in Eq. (4) that the probability of occurrence denoted by p(x) can be expressed as a non-linear function of features,

$$ {}p(x)=\frac{1}{1+e^{-\text{logit}(p(x))}}=\frac{1}{1+e^{-(\beta_{0}+\beta_{1}x_{1}+\beta_{2}x_{2}+...+\beta_{n}x_{n})}}, $$

(5)

where p∈ [ 0,1]. Since the logistic regression model is non-linear, the maximum likelihood estimation method can be used to estimate the regression coefficient β_i (i=0,...,n).

4 Empirical study

As we summarize in Section 2, the dataset has the characteristics such as different types, positive and negative correlations, and dependence. Therefore, we need to use the proposed method to address these concerns. In particular, we describe the experiment settings in Section 4.1. We then show the intermediate results of K-means clustering and logistic regression in Section 4.2. We next compare our proposed method with other existing methods in terms of predication accuracy in Section 4.3. Finally, we conduct performance analysis of our proposed method in Section 4.4.

4.1 Experiment settings

We obtain three sample datasets from SpeedVideo Global Operating Platform of Huawei. Table 3 presents the meta data information of these datasets. In particular, dataset 1, dataset 2, and dataset 3 contain 30,000, 30,000, and 26,984 samples, respectively, each of which has 11 QoS parameters. Table 4 summarizes the 11 features where we denote each of these 11 features by variable x_i (i=1 to 11).

Table 3 Meta data information

Full size table

Table 4 Summary of features

Full size table

4.2 K-means LR method

4.2.1 K-means Analysis

As shown in Section 3, we exploit the proposed K-means logistic regression method to predict QoE. In particular, we classify each dataset into two classes: one class contains samples with QoE being “good” and another class contains samples with QoE being “bad”. The experimental results show that these two classes are fairly unbalanced in terms of class size. Thus, we choose K-means scheme to search the cut-off values. Specifically, we obtain classification standard values, 3.9, 3.9, and 3.93 for datasets 1, 2, and 3, respectively. Table 5 shows the K-means clustering results.

Table 5 K-means clustering analysis

Full size table

4.2.2 Logistics regression analysis

We next conduct logistics regression analysis. In particular, we give the regression expression on 11 features as follows,

$$\begin{array}{@{}rcl@{}} \text{logit}(p) & = &\beta_{0}+\beta_{1}x_{1}\,+\,\beta_{2}x_{2}\,+\,\beta_{3}x_{3}+\beta_{4}x_{4}\!+\beta_{5}x_{5}+\beta_{6}x_{6}\\ & & +\beta_{7}x_{7}+\beta_{8}x_{8}+\beta_{9}x_{9}+\beta_{10}x_{10}+\beta_{11}x_{11}, \end{array} $$

(6)

where x_i (i=1 to 11) corresponds to each of 11 features as given in Table 4 and β_i is the regression coefficient.

Table 6 lists regression results on the three datasets, where “Value” denotes the resulting value of each coefficient, “Wal” denotes the value of the Wald test under the significant level of “Sig” [22]. We observe from Table 6 that the lower “Sig” value (or the higher “Wal” value) of a coefficient indicates the higher impact of that coefficient on vMOS. In general, Sig >0.05, the coefficient is not statistically significant, implying that no correlation can be found between the QoS parameter and vMOS. In practice, there is no correlation between a QoS parameter and vMOS when Sig <0.05 according to the Wald test [23].

Table 6 Regression coefficients

Full size table

It is shown in Table 6 that coefficient β₆ (corresponding to initial buffering latency) has the most significant impact on vMOS. Similarly, we can find that coefficient β₃ (corresponding to video bitrate) also has a strong influence on vMOS because of high Wal value (i.e., 7.374).

4.3 Performance comparison

We next present the experimental results over the given datasets with performance comparison with conventional traditional methods including multivariate linear regression, logistic regression, support vector machines, K-nearest-neighbor and Naive Bayes [24].

We then evaluate the performance of our proposed method and other traditional methods. Table 7 presents performance comparison of our proposed K-means + LR method with other methods, where we choose the precision as the performance metrics (the higher precision implying the better performance). Among all the methods, our proposed K-means + LR method has the highest performance with precision 96.94, 97.13, and 97.54% on dataset 1, dataset 2, and dataset 3, respectively. This performance improvement may lie in the integration of K-means method and logistic regression method.

Table 7 Performance comparison

Full size table

4.4 Performance analysis of our method

4.4.1 Residual plots

We first use the residual error plots to evaluate the accuracy of the proposed model. In particular, we define Error=Prediction−Original, where Prediction is the prediction value and Original is the given value. Figure 6 shows the residual plots over the three datasets, in which blue points denote the “good” QoE values and blue points denote the “bad” QoE values. It is shown in Fig. 6 that there are more red points than blue points, implying that our model has a bias of regarding “good” QoE values as “’bad”.

4.4.2 Accuracy analysis

In order to assess the accuracy of prediction, we use Accuracy, which is defined as follows,

$$ \textsf{Accuracy}=\frac{(\textsf{TP}+\textsf{TN})}{(\textsf{TP}+\textsf{FP}+\textsf{TN}+\textsf{FN})}, $$

(7)

where TP is true positive, FP is false positive, FN is false negative, and TN is true negative.

We obtain another dataset from SVGOP. This dataset contains 2517 samples. This new dataset serves as the testing dataset. We next evaluate the accuracy of our proposed model over two datasets: training dataset (i.e., dataset 1) and testing dataset (i.e., the dataset containing 2517 samples). Figure 7 gives the histogram of the accuracy values of 11 QoS parameters over two aforementioned datasets. We observe from Fig. 7 that training dataset has the accuracy value close to that of testing dataset in most of QoS parameters while there are some gaps between several QoS parameters (e.g., playing total duration); the performance gaps may owe to the fact of the small sample size of testing dataset (2517 samples versus 30,000 samples in training dataset). Note that we have similar findings in dataset 2 and dataset 3. Without repetition, we do not show the results on dataset 2 and dataset 3 here.

We next use the deviance to evaluate the measure of the fitness of the data. In particular, the deviance denoted by D value is calculated by comparing a given model with the saturated model [24]. We define D value=|prediction−original|, where prediction and original denote the prediction value and the original value, respectively. Table 8 gives the deviance results over the three datasets and the corresponding rankings (sorting D value in ascending order). We observe from Table 8 that initial buffering latency, stalling times, and stalling ratio have the smaller D values than other QoS parameters, implying that they essentially have the dominant influence on vMOS. This result is consistent with the previous finding in [20].

Table 8 D value and ranking

Full size table

In summary, we can see from Table 8 that initial buffering latency, stalling times, and stalling ratio can significantly affect vMOS. In particular, initial buffering latency, stalling times, and stalling ratio are negatively correlated with vMOS. We have the following major findings: (1) vMOS is affected by multiple QoS parameters together. Essentially, vMOS is affected by 11 QoS parameters as given in Table 4. (2) Small set of QoS parameters dominates the performance of vMOS. Interestingly, we observe that a small set of QoS parameters has the stronger influence on vMOS than other QoS parameters.

5 Discussion and future work

The analytical results essentially offer us some useful insights in improving video QoE, which may pave the way toward improving vMOS in mobile networks. We summarize several possible future directions as follows.

Concentrating on optimizing several dominating QoS parameters in vMOS. For example, we may focus on optimizing the network resource to reduce initial buffering latency, stalling times, and stalling ratio so that we can significantly improve the video QoE while maintaining relatively low operating expenditure. However, it is not an easy task to achieve this goal because the enhancement of these QoS parameters is also involved with many other technologies, such as cross-layer optimization and distributed resource allocation [10, 25, 26].
Identifying QoS bottlenecks. Determining QoS bottlenecks can help to enhance system performance and consequently improving vMOS. However, it is also difficult to identify the QoS bottlenecks since they are often affected by many factors. For example, video stalling is essentially caused by many factors, such as network congestion, network failure, device mobility, and radio spectrum scarcity. There is a challenge in identifying the causality of stalling. In the future, we may apply data-driven approach to identify the reason behind video stalling according to different scenarios.
Distributing the videos appropriately to improve vMOS. For example, we can distribute the most popular videos at the servers close to users so that we can significantly reduce the initial buffering latency. However, to determine the popularity of video streaming is challenging since it requires the extensive efforts in analyzing the massive video data [27].

6 Conclusions

In this paper, we propose a novel data analysis model to analyze video Mean Opinion Score (vMOS), which is an important measure of user quality of experience of video streaming. In particular, our proposed model is a combination of K-mean clustering method and logistic regression method, which can essentially improve the prediction accuracy than other existing methods. We conduct experiments over several realistic datasets. Extensive experiment results show that our proposed method outperforms other existing methods in terms of prediction accuracy. For example, our proposed method has the precision of 96.94, 97.13, and 97.54% on dataset 1, dataset 2, and dataset 3. Our results also show that a small set of QoS parameters play an important role in determining vMOS; this implies that we can concentrate on enhancing these key QoS parameters. It can be achieved by integrating cross-layer optimization and distributed resource allocation schemes together and mitigating QoS bottlenecks.

Our model has a broad range of applications. For example, it can be used to enhance the QoE of video service providers (such as Netflix and YouTube), video-centric mobile applications (including Facebook LIVE, Instagram LIVE, Snapchat, etc.), video game live streaming services (such as Twitch, Hitbox and NetEase Game Lives). Moreover, it can be used to improve the usability of video surveillance systems. For example, the quality of video streaming of video surveillance systems can be helpful in detecting dangers in advance.

Notes

For simplicity, we use vMOS to represent U-vMOS throughout this paper.
http://speedvideo.huawei.com/

Abbreviations

DL:: Download
E2E:: End-to-end
LR:: Logistic regression
QoS:: Quality of service
QoE:: Quality of experience
Sig:: Significance
SVGOP:: SpeedVideo global operating platform
SVM:: Support vector machine
vMOS:: video mean opinion score
Wal:: Wald test value

References

Cisco, Visual Network Index. “forecast and methodology, 2015-2020”. White Paper, 1–41 (2016).
M Venkataraman, M Chatterjee, Inferring video QoE in real time. IEEE Netw. 25(1), 4–13 (2011).
Article Google Scholar
Y Chen, K Wu, Q Zhang, From QoS to QoE: A tutorial on video quality assessment. IEEE Commun. Surv. Tutorials. 17(2), 1126–1165 (2015).
Article Google Scholar
H Nam, KH Kim, H Schulzrinne, in IEEE INFOCOM. QoE matters more than QoS: Why people stop watching cat videos (IEEENew Jersey, 2016), pp. 1–9.
Google Scholar
V Joseph, G de Veciana, in IEEE INFOCOM. NOVA: QoE-driven optimization of DASH-based video delivery in networks (IEEENew Jersey, 2014), pp. 82–90.
Google Scholar
A Balachandran, V Sekar, A Akella, S Seshan, I Stoica, H Zhang, in Proceedings of the ACM SIGCOMM Conference. Developing a Predictive Model of Quality of Experience for Internet Video (ACMNew York, 2013).
Google Scholar
S Thakolsri, W Kellerer, E Steinbach, in Proceedings of IEEE International Conference on Communications (ICC). QoE-Based Cross-Layer Optimization of Wireless Video with Unperceivable Temporal Video Quality Fluctuation (IEEENew Jersey, 2011).
Google Scholar
V Ramamurthi, O Oyman, J Foerster, in IEEE Global Communications Conference (GLOBECOM). Video-QoE aware resource management at network core (IEEENew Jersey, 2014), pp. 1418–1423.
Google Scholar
M Venkataraman, M Chatterjee, Effects of internet path selection on video-QoE: analysis and improvements. IEEE/ACM Trans. Networking (TON). 22(3), 689–702 (2014).
Article Google Scholar
J Jiang, V Sekar, H Milner, D Shepherd, I Stoica, H Zhang, in 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI). CFA: A Practical Prediction System for Video QoE Optimization (Berkeley, 2016), pp. 137–150.
Y Sun, X Yin, J Jiang, V Sekar, F Lin, N Wang, et al, in Proceedings of ACM SIGCOMM. CS2P: Improving video bitrate selection and adaptation with data-driven throughput prediction (ACMNew York, 2016), pp. 272–285.
Google Scholar
W Huang, Y Zhou, Xie X, D Wu, M Chen, E Ngai, Buffer State is Enough: Simplifying the Design of QoE-Aware HTTP Adaptive Video Streaming. IEEE Trans. Broadcast. 64(2), 1–12 (2018).
Article Google Scholar
C Lal, V Laxmi, MS Gaur, M Conti, Enhancing QoE for video streaming in MANETs via multi-constraint routing. Wirel. Netw. 24(1), 235–256 (2018).
Article Google Scholar
V Burger, T Zinner, L Dinh-Xuan, F Wamser, P Tran-Gia, A Generic Approach to Video Buffer Modeling Using Discrete-Time Analysis. ACM Trans Multimedia Comput Commun Appl. 14(2s), 33:1–33:23 (2018).
Article Google Scholar
OE Marai, T Taleb, M Menacer, M Koudil, On Improving Video Streaming Efficiency, Fairness, Stability, and Convergence Time Through Client-Server Cooperation. IEEE Trans. Broadcast. 64(1), 11–25 (2018).
Article Google Scholar
Huawei, “Requirements of Mobile Bearer Network for Mobile Video Service”. White Paper, 1–8 (2016).
International Telecommunication Union (ITU), “Subjective video quality assessment methods for multimedia applications”. ITU Recommendation:1–37.
W Pan, G Cheng, H Wu, Y Tang, in IEEE/ACM 24th International Symposium on Quality of Service (IWQoS). Towards QoE assessment of encrypted YouTube adaptive video streaming in mobile networks (IEEENew Jersey, 2016).
Google Scholar
CM Lentisco, L Bellido, QJCCE Pastor, JLA H, QoE-Based Analysis of DASH Streaming Parameters Over Mobile Broadcast Networks. IEEE Access. 5:, 20684–20694 (2017).
Article Google Scholar
Q Wang, HN Dai, H Wang, D Wu, in The 15th IEEE International Symposium on Parallel and Distributed Processing with Applications. Data-Driven QoE Analysis on Video Streaming in Mobile Networks (IEEENew Jersey, 2017).
Google Scholar
X Wu, V Kumar, JR Quinlan, J Ghosh, Q Yang, H Motoda, et al, Top 10 algorithms in data mining. Knowl. Inf. Syst. 14(1), 1–37 (2008).
Article Google Scholar
J Friedman, T Hastie, R Tibshirani, The elements of statistical learning. vol. 1. Springer series in statistics New York (2001).
RR Wilcox, Introduction to robust estimation and hypothesis testing, Academic press (Elsevier, MA, USA, 2011).
Google Scholar
E Alpaydin, Introduction to machine learning (MIT press, MA, USA, 2014).
MATH Google Scholar
Y Wang, X Lin, User-provided networking for QoE provisioning in mobile networks. IEEE Wirel. Commun. 22(4), 26–33 (2015).
Article Google Scholar
Z Su, Q Xu, Q Qi, Big data in mobile social networks: a QoE-oriented framework. IEEE Netw. 30(1), 52–57 (2016).
Article Google Scholar
Z Su, Q Xu, F Hou, Q Yang, Q Qi, Edge Caching for Layered Video Contents in Mobile Social Networks. IEEE Trans. Multimed. 19(10), 2210–2221 (2017).
Article Google Scholar

Download references

Acknowledgements

The authors would like to express their appreciation for Gordon K.-T. Hon for his thoughtful discussions.

Funding

The work described in this paper was partially supported by Macao Science and Technology Development Fund under Grant No. 0026/2018/A1, the National Key R&D Program of China under Grant No. 2016YFB0201900, the National Natural Science Foundation of China (NSFC) under Grant No. 61572538 and Grant No. 61672170, the Fundamental Research Funds for the Central Universities under Grant No. 17LGJC23, the NSFC-Guangdong Joint Fund under Grant No. U1401251 and Guangdong Science and Technology Plan with Grant No. 2015B090923004. All the funding bodies are involved with the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations

Faculty of Information Technology, Macau University of Science and Technology, Wai Long Avenida, Macau SAR, China
Qingyong Wang & Hong-Ning Dai
School of Data and Computer Science, Sun Yat-sen University, Guangzhou, 510006, China
Di Wu
Guangdong Province Key Laboratory of Big Data Analysis and Processing, Guangzhou, 510006, China
Di Wu
Faculty of Computer, Guangdong University of Technology, No. 100 Waihuan Xi Road, Guangzhou, 510006, China
Hong Xiao

Authors

Qingyong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Hong-Ning Dai
View author publications
You can also search for this author in PubMed Google Scholar
Di Wu
View author publications
You can also search for this author in PubMed Google Scholar
Hong Xiao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

QW proposed the idea, derived the results, and wrote the paper. H-ND supervised the work and revised versions. DW gave valuable suggestions on the motivation of conducting data analysis on video streaming over mobile networks. HX contributed to motivating, revising and proofreading of the article. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Hong-Ning Dai.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Wang, Q., Dai, HN., Wu, D. et al. Data analysis on video streaming QoE over mobile networks. J Wireless Com Network 2018, 173 (2018). https://doi.org/10.1186/s13638-018-1180-8

Download citation

Received: 06 November 2017
Accepted: 18 June 2018
Published: 09 July 2018
DOI: https://doi.org/10.1186/s13638-018-1180-8

Data analysis on video streaming QoE over mobile networks

Abstract

1 Introduction

2 Data description

3 QoE analysis framework

3.1 Data preprocessing

3.2 K-means + LR method

3.2.1 K-means clustering

3.2.2 Logistic regression

4 Empirical study

4.1 Experiment settings

4.2 K-means LR method

4.2.1 K-means Analysis

4.2.2 Logistics regression analysis

4.3 Performance comparison

4.4 Performance analysis of our method

4.4.1 Residual plots

4.4.2 Accuracy analysis

5 Discussion and future work

6 Conclusions

Notes

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords