A kind of effective data aggregating method based on compressive sensing for wireless sensor network

Zhang, De-gan; Zhang, Ting; Zhang, Jie; Dong, Yue; Zhang, Xiao-dan

doi:10.1186/s13638-018-1176-4

Research
Open access
Published: 19 June 2018

A kind of effective data aggregating method based on compressive sensing for wireless sensor network

De-gan Zhang^1,2,
Ting Zhang^1,2,
Jie Zhang³,
Yue Dong^1,2 &
…
Xiao-dan Zhang⁴

EURASIP Journal on Wireless Communications and Networking volume 2018, Article number: 159 (2018) Cite this article

3467 Accesses
140 Citations
Metrics details

Abstract

Wireless sensor network (WSN) in the Internet of Things consists of a large number of nodes. The proposal of compressive sensing technology provides a novel way for data aggregation in WSN. Based on the clustering structure of WSN, a kind of effective data aggregating method based on compressive sensing is proposed in this paper. The aggregating process is divided into two parts: in the cluster, the sink node sets the corresponding seed vector based on the distribution of network and then sends it to each cluster head. Cluster head can generate corresponding own random spacing sparse matrix based on its received seed vector and collect data through compressive sensing technology. Among clusters, clusters forward measurement values to the sink node along multi-hop routing tree. Performance analysis and comparison with the relative methods show that our method is effective and superior to other methods regardless of intra-cluster or inter-cluster on the total energy consumption of network.

1 Introduction

Data aggregating is an effective strategy to control energy consumption because the number of transmissions can be reduced after aggregation. Reference [1,2,3,4] strives for energy balancing to make the network lifetime maximum. The unbalanced consumption of energy is harmful to network safety and health [5,6,7,8,9]. If the sensor nodes of wireless sensor networks (WSNs) spend their energy in a relatively balanced way, the connectivity among sensor nodes and the sink nodes can be kept for a longer time, making the network segmentation to be postponed. Avalanched quantities of tiny sensor nodes establish WSNs in the Internet of Things. These nodes can monitor all kinds of object information around them in real-time. Since the energy of these sensor nodes is usually very limited, how to ensure complete data aggregating with the minimum energy consumption of nodes has been a very critical issue in WSNs [10,11,12].

In order to remove redundant portions of the collected data, and control the number of data nodes in WSNs, which can save the energy consumption of nodes, recently, many scholars proposed a compressive sensing (CS) technology, which can collect and reconstruct signal with high probability through sampling points less than the Nyquist sampling theorem [13,14,15,16,17,18]. According to the sparsity of the signal, compressive sensing technology can decrease the original signal from high dimensional to low dimensional on the nodes. It needn't aggregate the signal and recover it with high probability on the sink node. The proposal of compressive sensing has good performance on image processing and other applications [19,20,21,22,23,24,25,26,27].

Without using compressive sensing in data aggregation, nodes near the leaves forward a small amount of packets, but those which are close to the sink node need to forward a large number of packets [28,29,30]. With using compressive sensing in data aggregating, each node simply forwards M packets, so the total transmission number of the network with N nodes is MN. However, transmission quantity is still large. References [6,7,8,9,10] proposed a hybrid protocol. In this protocol, nodes near the leaves forward original data without using compressive sensing, and those which are close to the sink node use compressive sensing technology to transmit data. References [31,32,33,34,35] applied hybrid compressive sensing to the data aggregating and proposed a minimum energy aggregation tree. The previous work directly applies compressive sensing method to the route tree. Since clustering method has many advantages over the routing tree [36,37,38,39,40], compressive sensing method on clustering network is applied. Compared with routing tree data aggregating methods, clustering algorithm generally has a better communication load balance [41, 42]. In addition, previous works ignore the distribution of location information and node distribution, which can contribute that data aggregating consumes less energy in WSNs of the Internet of Things [43,44,45,46,47].

References [13, 14] proposed Toeplitz matrix and proved that it meets the restricted isometry property (RIP). Since the correlation of data collected in a single cluster is relatively large, the sparse matrix to the process of compressive sensing can be used. It can minimize the number of independent random variables, which can reduce the complexity of compressive sensing process, and improve the calculation speed in the meantime.

The literature [14,15,16,17,18] proposed Toeplitz random measurement matrix and proved it. The literature [17,18,19,20,21,22,23,24,25,26] proposed quasi-Toeplitz matrix, semi-Hadamard matrix, and chaos-Toeplitz matrix and proved that they met the condition of the RIP. Based on the former researches, some scholars [27,28,29,30,31,32,33,34,35] proposed random spacing sparse Toeplitz matrix optimized by singular value decomposition (SVD) and apply it in wireless sensor networks.

According to random space sparse matrix based on the Toeplitz matrix, the vector T₁ = [ϕ₁, ϕ₂, …, ϕ_N, ϕ_N + 1, …, ϕ_{N + M − 1}] contains all the elements of a Toeplitz matrix. They sparse T₁ with space △ = 2, the value of its element ϕ_i (i ∈ Λ, Λ is ⌈(N + M − 1)/△⌉ indexes randomly selected from 1~N + M − 1) is subject to independent and identically distributed, and the other elements are set to 0. Finally, the sparse vector is used to construct random spacing sparse Toeplitz matrix:

$$ {\phi}_{i+1,j+1}={\phi}_{i,j} $$

(1)

The Gaussian random matrix requires MN-independent random elements, the general Toeplitz matrix only needs M + N − 1, and the random space sparse Toeplitz matrix needs only ⌈(N + M − 1)/Δ⌉|_{Δ = 2, …, 16} independent random elements, so it is possible to further reduce complexity.

The innovation or contribution of this paper is as follows: based on the clustering structure of WSNs, a new data aggregating method based on sparse hybrid compressive sensing is proposed The aggregating process is divided into two parts: in the cluster, the sink node sets the corresponding seed vector based on the distribution of network and then sends it to each cluster head. Cluster head can generate corresponding own random spacing sparse matrix based on its received seed vector and collect data through compressive sensing technology. Among clusters, clusters forward measurement values to the sink node along the multi-hop routing tree which we built before. Performance analysis and comparison of the experimental results with the relative methods show that our method is effective and superior to other methods regardless of intra-cluster or inter-cluster on the total energy consumption of network and the lifetime of network.

2 Modeling based on hybrid compressive sensing for WSN

In the data aggregating process, first of all, the network is clustered. Each cluster has its cluster head, one sample is shown in Fig. 1. The measurement matrix of the entire network is generated by sink nodes according to the sparse seed vector and sends the sparse seed vector to each cluster head. So, the measurement matrix can be divided into many sub-matrices; each sub-matrix corresponds to a cluster. $ {\phi}^{H_i} $represents the i^th sub-matrix, CH_i represents its cluster head, and $ {x}^{H_i} $ represents data vector of this cluster. CH_ican calculate the measurement values $ {\phi}^{H_i}{x}^{H_i} $ of received data $ {x}^{H_i} $ based on its sub-matrix. When CH_i generates its M_i predicted values, it forwards data to the sink node along the backbone tree which connects clustered heads to the sink node.

Assume that all of the nodes are divided into four clusters (because the 5 or 6 or 7 or 8 or other clusters are the same as that of four clusters, we select four clusters as an example), which are connected through a backbone aggregation tree. Data vector x can be represented by $ {\left[{x}^{H_1}\kern0.5em {x}^{H_2}\kern0.5em {x}^{H_3}\kern0.5em {x}^{H_4}\right]}^T $. Matrix ϕ can be represented by $ \left[{\phi}^{H_1}\kern0.5em {\phi}^{H_2}\kern0.5em {\phi}^{H_3}\kern0.5em {\phi}^{H_4}\right] $. Generally, those assumptions mentioned in this paper are realistic, and their implications tell us that the truth is from the real scenarios of the applications, and the results can be tested the cases of the applications.

$$ y=\phi x=\left[{\phi}^{H_1}\kern0.5em {\phi}^{H_2}\kern0.5em {\phi}^{H_3}\kern0.5em {\phi}^{H_4}\right]\left(\begin{array}{c}{x}^{H_1}\\ {}{x}^{H_2}\\ {}{x}^{H_3}\\ {}{x}^{H_4}\end{array}\right)=\sum \limits_{i=1}^4{\phi}^{H_i}{x}^{H_i} $$

(2)

As shown in Formula (2), the predicted coefficient of measurement matrix is the sum of all the measured coefficients in the cluster. Therefore, in each round, the cluster head generates predicted coefficients; all cluster heads forward the received predicted coefficients to the sink node. When the sink node collected M rounds predicted value, it can recover the original data.

We define the compressive ratio as ρ = M/N, which means that the ratio is between the measurement value M in the process of compressive sensing and the length N of collected signal. It describes the compression efficiency of the entire network.

We define the relative reconstruction error as $ \varepsilon =\frac{{\left\Vert d-\overset{\Lambda}{d}\right\Vert}_2^2}{{\left\Vert d\right\Vert}_2^2} $, i.e., the ratio between the absolute error and the true value, where d is the true distance value of a certain node i and its cluster head node and $ \overset{\Lambda}{d} $ is the measurement distance value of a certain node i and its cluster head node.

3 Data aggregating method based on compressive sensing in WSNs

Although compressive sensing technology can effectively reduce the energy consumption of each node in the network, it is directly related to the measurement value M in compressive sensing. When the value of M is large, the energy consumption of nodes remains high. To solve this problem, a novel hybrid compressive sensing data aggregating method is proposed, which mainly consists of four parts: network clustering, building the appropriate inter-cluster routing tree, compressive sensing data aggregating in clusters, and cluster head transmitting data to the sink node. How to construct the routing tree and evolve the process of compressive sensing in clusters is shown below.

3.1 Network model

We make the following assumptions in the network (generally, those assumptions are realistic, and their implications tell us that the truth is from the real scenarios of the applications):

1)
N nodes randomly distribute in a circular perception area (the radius is L); the sink node is at the center of the sensing area (as shown in Fig. 1).
2)
The sink node has enough data space and the ability of process.
3)
The initial energy and the transmission rate of each sensor node are the same.
4)
Nodes can know its own location information using the relative locating technology.

Lemma 1: Suppose that nodes in the wireless sensor network are distributed randomly, data aggregating in the cluster uses sparse matrices. If the cluster head is at the center of this cluster, then nodes consume least energy for each measurement value aggregating process.

Proof Assume that the j^th cluster consists of m_j nodes; the sparse ratio of the measurement matrix in the process of compressive sensing is s. In each aggregating process, the average number $ {m}_j^{\hbox{'}} $ of nodes which involves in the aggregation of measurement values is:

$$ {m}_j^{\hbox{'}}=\sum \limits_{i=1}^{m_j}s\times 1={m}_js $$

(3)

Obviously, only $ {m}_j^{\hbox{'}} $ nodes need to forward their corresponding weights for each time. Therefore, cluster head node receives$ {m}_j^{\hbox{'}} $ packets. So, at every measurement, the average energy consumption in the j^thcluster is:

$$ {\displaystyle \begin{array}{l}{\overline{E}}_{\mathrm{intra}}^j=\sum \limits_{i=1}^{m_j^{\hbox{'}}}{E}_{Tx}^i\left(k,E\left({d}_i\right)\right)+{m}_j^{\hbox{'}}{E}_{Rx}(k)\\ {}\kern4em =k\sum \limits_{i=1}^{m_j^{\hbox{'}}}\left({E}_{\mathrm{ele}}+{\varepsilon}_{\mathrm{amp}}E\left({d}_i^2\right)\right)+{m}_j^{\hbox{'}}{kE}_{\mathrm{ele}}\\ {}\kern4em =2{m}_j^{\hbox{'}}{kE}_{\mathrm{ele}}+k{\varepsilon}_{\mathrm{amp}}\sum \limits_{i=1}^{m_j^{\hbox{'}}}E\left({d}_i^2\right)\end{array}} $$

(4)

where $ {E}_{Tx}^i\left(k,E\left({d}_i\right)\right) $ represents the energy consumption consumed by the i^th node when forwarding k bit data to its cluster head. E(d_i) represents the distance expectations from the i^th node to its cluster head. As shown in the formula above, the average energy consumption is decided by $ E\left({d}_i^2\right) $. Suppose that the cluster is square and its side length is b and the cluster head’s coordinate (x₀, y₀). We can use f(x, y) to represent the probability density function of the distance between child nodes to the cluster head:

$$ f\left(x,y\right)=\left\{\begin{array}{cc}\frac{1}{b^2}& x\in \left(-\frac{b}{2},\frac{b}{2}\right),y\in \left(-\frac{b}{2},\frac{b}{2}\right)\\ {}0& \mathrm{other}\end{array}\right. $$

(5)

then

$$ {\displaystyle \begin{array}{l}E\left({d}_i^2\right)=E\left({\left(x-{x}_0\right)}^2+{\left(y-{y}_0\right)}^2\right)\\ {}\kern5em ={\int}_{-\frac{b}{2}}^{\frac{b}{2}}{\int}_{-\frac{b}{2}}^{\frac{b}{2}}\frac{1}{b^2}\left({\left(x-{x}_0\right)}^2+{\left(y-{y}_0\right)}^2\right) dxdy\\ {}\kern5em =\frac{b^2}{6}+\left({x}_0^2+{y}_0^2\right)\ge \frac{b^2}{6}\end{array}} $$

(6)

is true if and only if x₀ = y₀ = 0, i.e., the cluster head node is at the center area of the cluster.

Assuming that the network is divided into N_c non-overlapping clusters, that means N_c nodes are selected as the cluster heads; the other nodes connect to the cluster head near to them.

We also assume that the node can adjust their own energy levels based on real transmission distance. Thus, the energy consumption from node n_i to node n_j is $ {P}_{ij}={d}_{ij}^{\alpha } $. The parameter α depends on the characteristics of the channel, which usually take between 2 and 4 as mentioned by References [13, 14]. Here, we choose α = 2, which is realistic for a typical WSN deployment [13,14,15,16]. Eventually, we use the normalized reconstruction error as the CS signal reconstruction error.

3.2 Establishment of inter-cluster routing tree

Hops are forwarded from current cluster head to other cluster head (NoH), i.e., the node determines the value based on its own communication radius and the distribution of cluster heads in the network.

Lemma 2: Suppose that cluster heads forward measurement values along the inter-cluster multi-hop shortest routing tree, so the energy consumption of inter-cluster will reach to the minimum value.

Proof The cluster head will get h − 1 data packets at each time collecting measurement values, and the definition of the energy consumption of inter-cluster is as follows:

$$ {\displaystyle \begin{array}{l}{E}_{\mathrm{inter}}=\sum \limits_{i=1}^h{E}_{Tx}^i\left(k,{d}_i\right)+\left(h-1\right){E}_{Rx}(k)\\ {}\kern4em =k\sum \limits_{i=1}^h\left({E}_{\mathrm{ele}}+{\varepsilon}_{\mathrm{amp}}{d}_i^2\right)+\left(h-1\right){kE}_{\mathrm{ele}}\\ {}\kern4em =\left(2h-1\right){kE}_{\mathrm{ele}}+k{\varepsilon}_{\mathrm{amp}}\sum \limits_{i=1}^h{d}_i^2\end{array}} $$

(7)

where d_i represents the transmission distance of the i^th data packet. The formula above shows that if h and k are constant, the final result is decided by $ \sum \limits_{i=1}^h{d}_i^2 $.

We propose an iterative algorithm to build distributed inter-cluster routing. Assuming that all cluster heads have the same transmission radius (R). Within the communication radius, cluster heads can communicate with each other. All cluster heads broadcast the hops from themselves to the sink node to their neighbors. The NoH of cluster head which contains the sink node in their communication radius is set as 1 at the first time of iterating. At the next iteration, these cluster heads broadcast their NoH to their neighbors and set the NoH of those cluster head nodes without NoH to be 2. After a series of iterations, it keeps choosing routing path until no cluster head is left. The algorithm can be abbreviated as the following steps:

3.3 Intra-cluster data aggregating based on compressive sensing

After building inter-cluster routing tree, we use compressive sensing technology to collect data in clusters. Since the data correlation of intra-cluster node is relatively large, we can reduce the measurement values by using random space sparse matrix. In traditional data aggregation methods based on compressive sensing, the measurement matrix required for compressive sensing process is generated by the cluster head. When data is collected, the cluster head needs to forward both data and measurement matrix to the sink node. Because the random space sparse matrix can be directly generated by the sink node by using a sparse seed vector, each cluster head can generate its corresponding sub-matrix by using the seed vector provided by the sink node. The steps of the method are as follows:

Step 1: The sink node forwards the seed vector U(u_i),{i = 1,2,…, N} with sparse space △ to every cluster head. Each cluster head determinating its position in the seed vector depends on its position on the backbone tree.
Step 2: Start from its position in the seed vector, the i^thcluster head node traverses forward N_i values depends on the number of its intra-cluster nodes N_i. Then, the cluster head gets its own new sparse seed vector and eventually generates its corresponding sub-matrix M_i × N_i.
Step 3: Non-CH (cluster head, CH) nodes forward their nodes to CH; CHs calculate the received data as M_i measurement values by using the formula y_i = φ_ix_i.
Step 4: CHs forward measurement values to the sink node along the generated forwarding path.
Step 5: The sink node generates the whole measurement matrix based on the whole seed vector U(u_i),{i = 1,2,…, N} and recovers the original data depends on received data y = [y₁,y₂,…,y_Nc] by using CS reconstruction algorithm.

4 Analysis of energy consumption in WSN

As stated in the above sections, non-CH nodes send their readings to their cluster heads. The energy consumption of intra-cluster defines as P_{intra-cluster}. In the next step, the cluster heads get their corresponding measurement values (y_i = φ_ix_i) based on intra-cluster node data and then send measurement values to the sink node. The energy consumption of intra-cluster node represents as P_toBS, and total energy consumption is expressed as follows:

$$ {P}_{\mathrm{total}}=\left({P}_{\mathrm{intra}\hbox{-} \mathrm{cluster}}+{P}_{\mathrm{toBS}}\right) $$

(8)

1)
Analysis of P_{intra-cluster}

We assume that WSN is divided into N_c clusters evenly, each cluster has the same number of nodes N/N_c, including a cluster head and N/N_c − 1 non-CH nodes. Then,

$$ {P}_{\mathrm{intra}\hbox{-} \mathrm{cluster}}={N}_c\left(\frac{N}{N_c}-1\right)E\left[{r}^{\alpha}\right] $$

(9)

where r is a random variable, which corresponds to the distance between a common node and its cluster head; α is the path loss exponent. In this paper, we set it as 2, so we can calculate the expectation E[r²]:

$$ E\left[{r}^2\right]=\iint \left({x}^2+{y}^2\right)\rho \left(x,y\right) dxdy=\iint {r}^{\hbox{'}2}\rho \left({r}^{\hbox{'}},\theta \right){r}^{\hbox{'}}{dr}^{\hbox{'}} d\theta $$

(10)

where ρ(r,θ) represents the distribution of nodes. We also assume that each cluster is a circular area of $ R=L/\sqrt{N_c} $ radius; the density of nodes in all clusters is distributed evenly. Therefore,

$$ E\left[{r}^2\right]=\frac{1}{\left(\pi {L}^2/{N}_c\right)}{\int}_{\theta =0}^{2\pi }{\int}_{r^{\hbox{'}}=0}^R{r}^{\hbox{'}3}{dr}^{\hbox{'}} d\theta =\frac{L^2}{2{N}_c} $$

(11)

Correspondingly,

$$ {P}_{\mathrm{intra}\hbox{-} \mathrm{cluster}}=\left(\frac{N}{N_c}-1\right)\frac{L^2}{2} $$

(12)

2)
Analysis of P_toBS

We define the energy consumption of inter-cluster transmission as follows:

$$ {P}_{\mathrm{toBS}}=\sum \limits_{i=1}^{N_c} NoH(i)\times {R}^2\times M(i) $$

(13)

where M(i) is the number of measurement values of the i^th cluster; R² is the energy consumption on each hop. In the case of the analysis, we assume that all cluster sizes are equal. According to the literature [18], the number of measurement values required for each cluster is linearly proportional to the number of nodes in each cluster. Therefore, Eq. (12) can be rewritten as follows:

$$ {P}_{\mathrm{toBS}}={R}^2\times \frac{M}{N_c}\sum \limits_{i=1}^{N_c} NoH(i) $$

(14)

As aforementioned, M represents the total number of measured values required in the network. N_c is the number of clusters. Formula (14) can be rewritten as follows:

$$ {P}_{\mathrm{toBS}}={NoH}_{\mathrm{ave}}\times {R}^2\times M $$

(15)

where NoH_ave is the average number of hops.

3)
Analysis of communication radius of cluster head

Communication radius of cluster head is closely related to the network energy consumption. In each routing path, the number of hops is closely related to the communication radius. If R is increased, the cluster head can forward data to more cluster heads, which means that the total number of hops will change with the communication radius R. In Figs. 2 and 3, we construct a network of 2000 nodes. The network is clustered by using a common method such as K-means or LEACH. We change different communication radius R = {10, 12, 14, 16, 18, 20} in order to change the number of clusters in the network. As shown in Figs. 2 and 3, the total hops will change after increasing or decreasing of radius R correspondingly. Generally, the units of the communication radius R used to measure the quantities are specified meter as m or the times of m, so we ignore the unit description for the following figures.

Figure 2 shows the comparison of the total hop change in the network when changing the communication range of the cluster head. Figure 3 shows the comparison of the total energy consumption of the network when changing the communication range of the cluster head. The figures above (a) uses K-means before data aggregating and (b) uses LEACH before data aggregating.

5 Description of the algorithm

Theorem 1 Assuming that wireless sensor network is clustering uniformly, the intra-cluster collect data by using compressive sensing technology, sparse matrix is selected as the measurement matrix, cluster head node is at the center of cluster, and inter-cluster forwards data along the shortest multi-hop routing tree. Then every time in the data aggregating, the total energy consumption of network is minimum.

Proof From the previous Lemma 1 and Lemma 2, the mean value of energy consumption in the wireless sensor network is as follows:

$$ {\displaystyle \begin{array}{l}{\overline{E}}_{\mathrm{total}}(h)=\sum \limits_{i=1}^h{\overline{E}}_{\mathrm{intra}}^i+{E}_{\mathrm{inter}}\\ {}=\kern0.5em {kE}_{\mathrm{ele}}\left(2+\frac{\varepsilon_{\mathrm{amp}}{b}^2}{6}\right)\sum \limits_{i=1}^h{m}_i^{\hbox{'}}+\\ {}\left(2h-1\right){kE}_{\mathrm{ele}}+k{\varepsilon}_{\mathrm{amp}}\sum \limits_{i=1}^h{d}_i^2\end{array}} $$

(16)

where $ {m}_i^{\hbox{'}} $ represents the average number of nodes within i^th cluster the first time to participate in a single measurement. In the case of uniform clustering, $ {m}_i^{\hbox{'}}={m}_j^{\hbox{'}} $ and $ {d}_i^2={d}_j^2\left(i,j=1,2,\dots, h,i\ne j\right) $ then $ \sum \limits_{i=1}^h{m}_i^{\hbox{'}} $ and $ \sum \limits_{i=1}^h{d}_i^2 $ reach the minimum. So, $ {\overline{E}}_{\mathrm{total}}(h) $ reaches the minimum.

This section presents a kind of data aggregating algorithm based on hybrid compressive sensing, which is different from the traditional hybrid compressive sensing data aggregation. The measurement matrix required for every cluster is generated by seed vector provided by the sink node; because all of the intra-cluster nodes have the same calculation process, the entire network has balanced energy consumption. The complete algorithm is described as follows:

1)
The network is clustered by using conventional clustering methods, such as LEACH and K-means.
2)
The aforementioned method is used to construct the inter-cluster multi-hop shortest routing tree between cluster heads and the sink node. Each cluster head can get its own NoH. As seen in Formula (13), if M and N_c are certain, the energy consumption of inter-cluster is only associated with NoH.
3)
The sink node generates a corresponding sparse seed vector U(u_i),{i = 1,2,…, N} according to the number of nodes in the network and send it to each cluster head.
4)
Each cluster head (assuming that i^th cluster head) using the received seed vector generates its measurement matrix M_i × N_i according to its location and the number of nodes in it.
5)
In the cluster, data is collected by using compressive sensing technology, then we can get M measurement values of the corresponding cluster head.
6)
Cluster heads forward M measurement values to the sink node along the inter-cluster multi-hop shortest routing tree. Based on Theorem 1, the total energy consumption of network during the data acquisition is minimum, so as to achieve the best performance; otherwise, we use machine learning approach to reconstruct signal and then ensure that the total energy consumption is minimum. Detailed machine learning approach can be found in our relative research works [7,8,9,10,11], because of the length limit of the paper, we ignore the detailed description.
7)
Since the measurement matrix used in each cluster is generated by the partial seed sparse vector U(u_i),{i = 1,2,…, N}, so the sink node may also generate a total block matrix as the recovery matrix. The sink node recovers the original data by using corresponding reconstruction algorithm.

Because the random space sparse matrix can be dynamically generated by a series of seed vectors, the measurement matrix required for the whole network can be determined by the sink node. On one hand, compared with the Gaussian random matrix, it reduces the number of independent variables; on the other hand, it avoids the problem that nodes cannot save the dynamic measurement matrix while routing path changes in the process of conventional hybrid compressive sensing.

6 Results and discussions

This section provides some simulations and evaluations of this proposed data aggregating method.

6.1 Performance of data aggregating based on random space sparse compressive sensing

We always assess the performance of methods by using the amount of data packet transmission collected by nodes in the network; the space here is △ = 2. We compare six schemes: (a) K-means clustering scheme based on random space sparse measurement matrix, (b) LEACH clustering scheme based on random space sparse measurement matrix, (c) K-means clustering scheme based on Gaussian measurement matrix, (d) LEACH clustering scheme based on Gaussian measurement matrix, (e) K-means clustering scheme without compressive sensing, and (f) LEACH clustering scheme without compressive sensing. The number of nodes is increased from 500 to 1500, the transmission radius nodes is 10, and the compressive ratio is ρ = M/N.

Figure 4 shows the comparison of data packet transmission of various programs when the compressive ratio is ρ = 0.2. Figure 5 shows the comparison of data packet transmission of various programs when the compressive ratio is ρ = 0.1. The two values of the compressive ratio ρ are used, which are ρ = 0.1 and ρ = 0.2; what value of ρ would be a realistic one will be based on the requirements of the realistic applications, if the high ratio is needed, then selecting high ratio, such as ρ = 0.2, which is related to the crucial importance and know the importance of the results.

Figure 6 shows the comparison of the tendency of the network lifecycle changes with the number of nodes when the compressive ratio is ρ = 0.1. Figure 7 shows the comparison of the tendency of the network lifecycle changes with the number of nodes when the compressive ratio is ρ = 0.2. It can be seen from the figures that the use of compressive sensing obviously prolongs the network’s lifecycle, while compared to the Gaussian random matrix, random space sparse matrix collects less data packets, thereby further increases the number of rounds of the network.

6.2 Simulation and analysis of energy consumption in network

We also deploy 2000 nodes, and L is 100. Firstly, the network is clustered by K-means or LEACH, then we get N_c clusters. We use our CS data aggregating method and calculate the energy consumption of the entire network. The sink node is set at the center of sensing field. Given the number of measurements M = 500, in order to meet the target error 0.1, we change the number of cluster head of the network by changing the transmission radius of nodes. We use the transmission radius R = [50, 30, 25, 22, 18, 14, 11] to represent the number of the cluster head N_c = [10, 50, 100, 200, 300, 400, 500].

First, the energy consumption of intra-cluster is simulated. We select random space sparse matrix and Gaussian matrix to do the comparison, and we also choose the different random space △. As shown in Figs. 8 and 9 ((a) represents as the program using K-means and random space sparse matrix, (b) as the program using LEACH and random space sparse matrix, (c) as the program using K-means and Gaussian random matrix, and (d) as the program using LEACH and Gaussian random matrix), they represent the total energy consumption of intra-cluster including cluster head. If the number of cluster increases, then the energy consumption of intra-cluster decreases. At this time, the transmission of data packet of inter-cluster consumes much energy. As can be seen from the figures, the random space sparse matrix consumes less than the Gaussian matrix due to a large number of zero element.

With the increase of the number of cluster heads, we represent (a) as the program using K-means and random space sparse matrix with △ = 2, (b) as the program using LEACH and random space sparse matrix with △ = 2, (c) as the program using K-means and Gaussian random matrix, and (d) as the program using LEACH and Gaussian random matrix.

In addition, with the increase of the number of clusters, we represent (a) as the program using K-means and random space sparse matrix with △ = 4, (b) as the program using LEACH and random space sparse matrix with △ = 4, (c) as the program using K-means and Gaussian random matrix, and (d) as the program using LEACH and Gaussian random matrix.

Figure 10 shows that the entire energy consumption of inter-cluster is decreased with the increasing of the number of clusters. We represent (a) as uniform clustering, (b) as LEACH, and (c) as K-means.

Figure 11 shows the trend of the total energy consumption of the network. It can be seen from the figure that the use of inter-cluster multi-hop routing significantly reduces the total energy consumption of the network when there are too many clusters. We represent (a) as the use of inter-cluster multi-hop routing and K-means, (b) as the use of inter-cluster multi-hop routing and LEACH, (c) as the only use of K-means, and (d) as the only use of LEACH.

In addition, we do the comparison experiments on the total consumption of network of WSN with the relative methods [19,20,21,22,23,24,25,26,27]. When we consider the total consumption of network, an abnormal situation occurs (in Fig. 12, the abnormal situation regarding new nodes added to the network in the actual applications because of the dynamic change of the network topology, such as some nodes go into the relative clusters or some nodes leave the relative clusters, the implication is that the WSN is self-organized based on the requirements of the relative applications), which has added new nodes in the data collection process. In order to consider the worst case, we assume that new nodes are added at the front end of the network. Fifty new nodes are joined in the network every 2 cycles. From the results of Fig. 12 (in Fig. 12, we represent (c) as the method of Reference [19], we represent (b) as the method of Reference [20], we represent (a) as the method of this paper), we can see that our method reduces the total energy consumption of network than that of other methods regardless of intra-cluster or inter-cluster.

Figure 13 shows the comparison of lifetime of network under different data collection methods. In Fig. 13, we represent (c) as the method of Reference [21], we represent (b) as the method of Reference [23], and we represent (a) as the method of this paper. From the results of Fig. 13, we can see that our method prolongs the lifetime of network than that of other methods.

The algorithm can be described as intra-cluster method based on existing methods and the inter-cluster aggregation based on minimum consumption. The common problem in clustering networks which is the energy balancing during the head selection is well considered by the machine learning process.

The WSNs will inevitably use clustering when the node number is large. It is not a fair comparison between the cluster and non-cluster structure in large-scale networks, so we adopt the overhead of normalized network transmission based on the relative weight.

In order to compare our work with other clustering methods and including the machine learning process, the cost of the algorithm (the bandwidth, energy consumption caused by the extra communication) is considered on the performance analysis as Fig. 14 ((a) as the method of this paper, (b) as the method of Reference [23], (c) as the method of Reference [21], and (d) as the method of Reference [24]).

From Fig. 14, we can see that optimized compressive sensing data collection program reduces the overhead of normalized network transmission than the un-optimized program.

7 Conclusions

A kind of effective data aggregating method based on compressive sensing in WSN is proposed. The method can effectively reduce the energy consumption of the network. The sink node forwards sparse seed to cluster heads. Within a cluster, the cluster head generates its required measurement matrix according to the received sparse seed and then produces the corresponding measurement values by using random space sparse compressive sensing. Cluster heads forward measurement values to the sink node along the inter-cluster multi-hop routing tree from one cluster to another. The sink node reconstructs the original signal by using the corresponding compressive sensing reconstruction algorithm. We analyze the energy consumption of the algorithm in the network, the relationship between the size of cluster head and the energy consumption of inter-cluster, and the relationship between the size of cluster head and the energy consumption of network. The experimental results show that this method can effectively reduce the energy consumption of the network.

References

C Luo, W F, J Sun, Efficient measurement generation and pervasive sparsity for compressive data gathering. IEEE Trans. Wirel. Commun. 9(12), 3728–3738 (2011)
Article Google Scholar
DG Zhang, G Li, K Zheng, An energy-balanced routing method based on forward-aware factor for wireless sensor network. IEEE Trans Ind Inf 10(1), 766–773 (2014)
Article Google Scholar
YY Xiao, Time-ordered collaborative filtering for news recommendation. China Commun 12(12), 53–62 (2015)
Article Google Scholar
DG Zhang, X Wang, XD Song, A novel approach to mapped correlation of ID for RFID anti-collision. IEEE Trans. Serv. Comput. 7(4), 741–748 (2014)
Article Google Scholar
XD Zhang, Design and implementation of embedded un-interruptible power supply system (EUPSS) for web-based mobile application. Enterp Inf Syst 6(4), 473–489 (2012)
Article Google Scholar
S Yi, J Heo, Y Cho, PEACH: power-efficient and adaptive clustering hierarchy protocol for wireless sensor networks. Comput. Commun. 30(14–15), 2842–2852 (2007)
Article Google Scholar
K Zheng, Novel quick start (QS) method for optimization of TCP. Wirel. Netw 22(1), 211–222 (2016)
Article Google Scholar
K Zheng, T Zhang, A novel multicast routing method with minimum transmission for WSN of cloud computing service. Soft. Comput. 19(7), 1817–1827 (2015)
Article Google Scholar
HL Niu, Novel PEECR-based clustering routing approach. Soft. Comput. 21(24), 7313–7323 (2017)
Article Google Scholar
DG Zhang, A new approach and system for attentive mobile learning based on seamless migration. Appl. Intell 36(1), 75–89 (2012)
Article Google Scholar
YN Zhu, A new constructing approach for a weighted topology of wireless sensor networks based on local-world theory for the Internet of Things (IOT). Comput Math Appl 64(5), 1044–1055 (2012)
Article MATH Google Scholar
XJ Kang, A novel image de-noising method based on spherical coordinates system. EURASIP J Adv Sig Process 2012(110), 1–10 (2012). https://doi.org/10.1186/1687-6180-2012-110
Google Scholar
J Haupt, WU Bajwa, Toeplitz compressed sensing matrices with applications to sparse channel estimation. IEEE Trans. Inf. Theory 56(11), 5862–5875 (2010)
Article MathSciNet MATH Google Scholar
C Zhang, HR Yang, Compressive sensing based on deterministic sparse Toeplitz measurement matrices with random pitch. Acta Automat. Sin. 38(8), 1362–1369 (2012)
Article MathSciNet Google Scholar
YP Liang, A kind of novel method of service-aware computing for uncertain mobile applications. Math Comput Model 57(3–4), 344–356 (2013)
Google Scholar
S Zhou, New mixed adaptive detection algorithm for moving target with big data. J Vibroengineering 18(7), 4705–4719 (2016)
Article Google Scholar
R Devore, Deterministic constructions of compressed sensing matrices. J. Complex. 23(4), 918–925 (2007)
Article MathSciNet MATH Google Scholar
R Calderbank, S Howard, Construction of a large class of deterministic sensing matrices that satisfy a statistical isometry property. IEEE J Sel Top Sig Proces 4(2), 358–374 (2009)
Article Google Scholar
B Malathi, Data collection based hybrid compressive sensing in wireless sensor networks. Int J Adv Inf Sci Technol (IJAIST) 4(2), 2319–2682 (2015)
Google Scholar
GK Nigam, Effective compressive sensing for clustering in wireless sensor networks. Ind J Sci Technol 9(38), 0974–5645 (2016)
Google Scholar
KA Shabna, Cluster method using hybrid compressive sensing for sensor network. Int J Modern Trends Eng Res (IJMTER) 2(5), 2349–9745 (2015)
Google Scholar
M Kumar, S Verma, Clustering approach to data aggregation in wireless sensor networks. 16th IEEE Int Conference Netw 1(1), 125–135 (2008)
MathSciNet Google Scholar
A Rajalakshmi, T Mohanraj, Efficient data transmission in wireless sensor networks using hybrid compressive sensing. J Recent Res Eng Technol 2(3), 2349–2252 (2015)
Google Scholar
P Sukumar, B Sowmya, Effective hybrid compressive sensors using wireless networks in clustering methods. Int J Comput Sci Trends Technol (IJCST) 4(2), 110–119 (2016)
Google Scholar
B Ameena, M Biradar, The hybrid compressive sensing data collection method in cluster structure for efficient data transmission in WSN. Int J Sci Res (IJSR) 4(6), 210–219 (2015)
Google Scholar
NV Deshmukh, AV Deorankar, Consuming less energy in hybrid compressive sensed WSN. Natl Conference Adv Computing, Commun Netw 1(1), 30–40 (2016)
Google Scholar
X Wang, XD Song, New medical image fusion approach with coding based on SCD in wireless sensor network. J Electr Eng Technol 10(6), 2384–2392 (2015)
Article Google Scholar
Z Ma, Shadow detection of moving objects based on multisource information in Internet of Things. J Exp Theor Artif Intell 29(3), 649–661 (2017)
Article Google Scholar
WB Li, Novel fusion computing method for bio-medical image of WSN based on spherical coordinate. J Vibroengineering 18(1), 522–538 (2016)
Google Scholar
XD Song, X Wang, New agent-based proactive migration method and system for big data environment (BDE). Eng. Comput. 32(8), 2443–2466 (2015)
Article Google Scholar
WB Li, Novel ID-based anti-collision approach for RFID. Enterp Inf Syst 10(7), 771–789 (2016)
Article Google Scholar
DL Donoho, Compressed sensing. IEEE Trans Inform Theory 52(4), 1289–1306 (2006)
Article MathSciNet MATH Google Scholar
S Zhou, New Dv-distance method based on path for wireless sensor network. Intell Autom Soft Comput 23(2), 219–225 (2017)
Article Google Scholar
X Wang, XD Song, New clustering routing method based on PECE for WSN. EURASIP J. Wirel. Commun. Netw. 2015(162), 1–13 (2015). https://doi.org/10.1186/%20s13638-015-0399-x
Google Scholar
S Liu, T Zhang, Novel unequal clustering routing protocol considering energy balancing based on network partition & distance for mobile education. J. Netw. Comput. Appl. 88(15), 1–9 (2017). https://doi.org/10.1016/j.jnca.2017.03.025
Google Scholar
X Zhang, X Zhang, G C, A micro-artificial bee colony based multicast routing in vehicular ad hoc networks. Ad Hoc Netw. 58(4), 213–221 (2017)
Article Google Scholar
KS He, YQ Li, CC Yin, A novel compressed sensing-based non-orthogonal multiple access scheme for massive MTC in 5G systems. EURASIP J. Wirel. Commun. Netw. 2018(81), 1–12 (2018)
Google Scholar
JF Wan, B Zeng, A scalable and quick-response software defined vehicular network assisted by mobile-edge computing. IEEE Commun. Mag. 55(7), 94–100 (2017)
Article Google Scholar
DY Jia, High-efficiency urban-traffic management in context-aware computing and 5G communication. IEEECommunications Mag 55(1), 34–40 (2017)
Google Scholar
QR Wang, P Deng, A survey on position-based routing for vehicular ad hoc networks. Telecommun. Syst. 62(1), 15–30 (2016)
Article Google Scholar
S Zhou, YM Tang, A low duty cycle efficient MAC protocol based on self-adaption and predictive strategy. Mob Netw Appl 2 (2017). https://doi.org/10.1007/s11036-017-0878-x
Z Ma, A novel compressive sensing method based on SVD sparse random measurement matrix in wireless sensor network. Eng. Comput. 33(8), 2448–2462 (2016)
Article Google Scholar
H LNiu, Novel positioning service computing method for WSN. Wirel. Pers. Commun. 92(4), 1747–1769 (2017)
Article Google Scholar
X Wang, A kind of novel VPF-based energy-balanced routing strategy for wireless mesh network. Int. J. Commun. Syst. 30(6), 1–15 (2017)
Article Google Scholar
XD Song, X Wang, Extended AODV routing method based on distributed minimum transmission (DMT) for WSN. Int J Electron Commun 69(1), 371–381 (2015)
Article Google Scholar
CP Zhao, A new medium access control protocol based on perceived data reliability and spatial correlation in wireless sensor network. Comput Electric Eng 38(3), 694–702 (2012)
Article Google Scholar
JQ Chen, GQ Mao, Capacity of cooperative vehicular networks with infrastructure support: multi-user case. IEEE Trans. Veh. Technol. 67(2), 1546–1560 (2018)
Article Google Scholar

Download references

Acknowledgements

This research work is supported by the National Natural Science Foundation of China (Grant No. 61571328), Tianjin Key Natural Science Foundation (No.13JCZDJC34600), CSC Foundation (No. 201308120010), Major projects of science and technology in Tianjin (No.15ZXDSGX 00050), training plan of Tianjin University Innovation Team (No.TD12-5016), major projects of science and technology for their services in Tianjin (No.16ZXFWGX00010, No.17YFZCGX00360), the Key Subject Foundation of Tianjin (15JCYBJC46500), and training plan of Tianjin 131 Innovation Talent Team (No.TD2015-23).

Funding

The work is partially supported by the following funding: training plan of Tianjin University Innovation Team (No.13-5025).

Availability of data and materials

The data will not be shared due to confidentiality matters.

Author information

Authors and Affiliations

Key Laboratory of Computer Vision and System, Tianjin University of Technology, Ministry of Education, Tianjin, 300384, China
De-gan Zhang, Ting Zhang & Yue Dong
Tianjin Key Lab of Intelligent Computing and Novel Software Technology, Tianjin University of Technology, Tianjin, China
De-gan Zhang, Ting Zhang & Yue Dong
Beijing No. 20 High School, Xiaoyingxilu, Haidian District, Beijing, 100085, China
Jie Zhang
Institute of Scientific and Technical Information of China, Beijing, 100038, China
Xiao-dan Zhang

Authors

De-gan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Ting Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jie Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yue Dong
View author publications
You can also search for this author in PubMed Google Scholar
Xiao-dan Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

D-gZ designed the algorithm. TZ wrote this paper. JZ did the experimental tests. YD optimized the algorithm and experiments. X-dZ checked the whole paper and figures. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Jie Zhang.

Ethics declarations

Authors’ information

De-gan Zhang, Ph.D., graduated from the Northeastern University, China. Now, he is a visiting professor of School of Electronic and Information Engineering, University of Sydney, Sydney, NSW 2006, Australia; professor of Tianjin Key Lab of Intelligent Computing and Novel software Technology, Key Lab of Computer Vision and System, Ministry of Education, Tianjin University of Technology, Tianjin, 300384, China. His research interest includes image processing, service computing, etc.

Ting Zhang, Ph.D., is a member (M) of IEEE in 2012. Now, she is a researcher at Tianjin University of Technology, Tianjin, 300384, China. Her research interest includes WSN, mobile computing, etc.

Jie Zhang (Beijing No.20 High School, Xiaoyingxilu, Haidian District, Beijing 100085, China). His research interest includes image processing, CRN, WSN, and IOT.

Yue Dong, Ph.D., is a researcher at Tianjin University of Technology, Tianjin, 300384, China. Her research interest includes WSN, etc.

Xiao-dan Zhang, Ph.D., is a member (M) of IEEE in 2012. Now, she is a researcher at Institute of Scientific and Technical Information of China, Beijing, 100038, China. Her research interest includes WSN, mobile computing, etc.

Ethics approval and consent to participate

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Zhang, Dg., Zhang, T., Zhang, J. et al. A kind of effective data aggregating method based on compressive sensing for wireless sensor network. J Wireless Com Network 2018, 159 (2018). https://doi.org/10.1186/s13638-018-1176-4

Download citation

Received: 22 February 2018
Accepted: 05 June 2018
Published: 19 June 2018
DOI: https://doi.org/10.1186/s13638-018-1176-4

A kind of effective data aggregating method based on compressive sensing for wireless sensor network

Abstract

1 Introduction

2 Modeling based on hybrid compressive sensing for WSN

3 Data aggregating method based on compressive sensing in WSNs

3.1 Network model

3.2 Establishment of inter-cluster routing tree

3.3 Intra-cluster data aggregating based on compressive sensing

4 Analysis of energy consumption in WSN

5 Description of the algorithm

6 Results and discussions

6.1 Performance of data aggregating based on random space sparse compressive sensing

6.2 Simulation and analysis of energy consumption in network

7 Conclusions

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Authors’ information

Ethics approval and consent to participate

Competing interests

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords