Hop-distance relationship analysis with quasi-UDG model for node localization in wireless sensor networks

Gao, Deyun; Chen, Ping; Foh, Chuan Heng; Niu, Yanchao

doi:10.1186/1687-1499-2011-99

Research
Open access
Published: 17 September 2011

Hop-distance relationship analysis with quasi-UDG model for node localization in wireless sensor networks

Deyun Gao¹,
Ping Chen²,
Chuan Heng Foh³ &
…
Yanchao Niu¹

EURASIP Journal on Wireless Communications and Networking volume 2011, Article number: 99 (2011) Cite this article

4221 Accesses
11 Citations
Metrics details

Abstract

In wireless sensor networks (WSNs), location information plays an important role in many fundamental services which includes geographic routing, target tracking, location-based coverage, topology control, and others. One promising approach in sensor network localization is the determination of location based on hop counts. A critical priori of this approach that directly influences the accuracy of location estimation is the hop-distance relationship. However, most of the related works on the hop-distance relationship assume the unit-disk graph (UDG) model that is unrealistic in a practical scenario. In this paper, we formulate the hop-distance relationship for quasi-UDG model in WSNs where sensor nodes are randomly and independently deployed in a circular region based on a Poisson point process. Different from the UDG model, quasi-UDG model has the non-uniformity property for connectivity. We derive an approximated recursive expression for the probability of the hop count with a given geographic distance. The border effect and dependence problem are also taken into consideration. Furthermore, we give the expressions describing the distribution of distance with known hop counts for inner nodes and those suffered from the border effect where we discover the insignificance of the border effect. The analytical results are validated by simulations showing the accuracy of the employed approximation. Besides, we demonstrate the localization application of the formulated relationship and show the accuracy improvement in the WSN localization.

1 Introduction

In recent years, wireless sensor networks (WSNs) which generally consist of a large number of small, inexpensive and energy efficient sensor nodes have become one of the most important and basic technologies for information access [1]. WSNs have been widely used in military, environment monitoring, medicine care, and transportation control. Spatial information is crucial for sensor data to be interpreted meaningfully in many domains such as environmental monitoring, smart building failure detection, and military target tracking. The location information of sensors also helps facilitate WSN operation such as routing to a geographic field of interests, measuring quality of coverage, and achieving traffic load balance. In many monitoring applications, the sensor nodes must be aware its location to explain 'what happens and where'.

While specialized localization devices exist such as GPS, given the large number of sensor nodes involved in building a single WSN, it is cost ineffective to equip every sensor node with such a sophisticated device. Therefore, seeking for an alternative localization technology in WSNs has become one major research in WSNs [2]. Over the past few years, many localization algorithms have been proposed to provide sensor localization [3]. These localization protocols can be divided into two categories: range-based and range-free. The former is defined by methods that use absolute point-to-point distance estimates (range) or angle estimates for computing locations. The latter makes no assumption about the availability or validity of such information. Recently, range-free localization methods have attracted much attention because no extra sophisticated device for distance measurement is needed for each sensor node. Despite the challenge in obtaining virtual coordinates purely based on radio connectivity information [4, 5], attempts have been made in developing a practical solution to achieve localization. A few representative protocols of this range-free scheme include DV-Hop [6], APIT [7], DRLS [8], MDS-MAP [9], and LS-SOM [10]. Most of the range-free localization schemes, such as DV-Hop, need to compute the average distance per hop to estimate a node's location. In other words, the performance of these localization schemes relies on the accuracy of the employed hop-distance relationship. Since the determination of an accurate hop-distance relationship depends on various complex factors such as node deployment, node density, and wireless communication technology that cannot be easily quantified, the deduction process is tedious and unlikely to produce an exact close form relationship using, say the geometric methods [11].

Due to lack of any predetermined infrastructure and self-organized nature, in most cases, the sensor nodes are randomly and independently deployed in a bounded area. For simplicity, the vast majority of studies based on the idealized unit-disk graph (UDG) network model, where any two sensors can directly communicate with each other if and only if their geographic distance is smaller than a predetermined radio range. Examples of these research include geo-routing protocols [12, 13], localization algorithms [8, 14], and topology control techniques [15, 16]. Similarly, most of the works related to the hop-distance relationship have been investigated assuming the UDG model [11, 17–23]. The probability that two randomly selected stations with a known distance can communicate in K or less hops with omnidirectional antennas has been analyzed by Chandler [17]. Bettestetter and Eberspacher, derived the probability of the distance of two randomly chosen nodes deployed in a rectangular region within one or two hops [18]. However, when the hop counts are larger than two, only simulation results are available. The distribution parameters are computed by the iterative formula which extends from [19] with a linear formation. Ekici et al. [20] studied the probability of the k-hop distance in two dimensional network based on the approximated Gaussian distribution. Dulman et al. [11] derived the relationship between the number of hops separating two nodes and the physical distance between them in one- and two-dimensional topologies considering the UDG model. In the study, the approximated approach based on a Markov Chain in two-dimensional case is rather complicated to compute. Zhao and Liang [21] collected the hop-distance joint distribution from Monte Carlo simulations in a circular region and proposed an attenuated Gaussian approximation for the conditional probability distribution function (pdf) of the Euclidean distance given a known hop count. Ta et al. [22] provided a recursive equation for the two randomly located sensor nodes that are k-hop neighbors given a known distance in homogeneous wireless sensor networks. Ma et al. [23] proposed a method to compute the conditional probability that a destination node has hop-count h with respect to a source node given that the distance between the source and the destination is d.

Despite the current efforts, no fixed communication range exists in actual network environment for the reasons such as multi-path fading and antenna issues. Therefore, a certain level of deviation occurs between the intended operation and actual operation in wireless sensor networks when the UDG model is assumed in a protocol design. To deal with this problem, a practical model called the quasi Unit-disk Graph (quasi-UDG) model is proposed recently [24]. The quasi-UDG model can be characterized by two parameters, the radio range R and the quasi-UDG factor α. For any two nodes in the quasi-UDG model, if their distance is longer than R, no direct communication link exists between the two. Otherwise, if their distance is between αR and R, a communication link exists with a probability of p_l , and p_l = 1 when their distance is shorter than αR. Given this newly proposed practical property of connectivity, it warrants an investigation of the hop-distance relationship with the quasi-UDG model for the range-free localization schemes to capture practical connectivity characteristics.

In this paper, we focus on exploiting the connectivity property of the quasi-UDG model and analyze the relationship between the hop counts separating two nodes and their geographic distance with a specific node density in a WSN. We seek approximation technique to provide a scalable solution for the two-dimensional case. We further demonstrate the application of the developed hop-distance relationship to a range-free localization scheme.

In our WSN setup, we consider that sensor nodes are deployed into a circular region S_b with the radius R_b , where the deployment position follows a Poisson point process with a certain density λ. We set $p_{l} = \frac{α}{1 - α} (\frac{R}{d} - 1)$ such that a longer distance between two nodes has a lower probability to form a direct communication link. With this setup, we formulate the probability that a pair of nodes with a known distance resulting a particular hop count. Additionally, we also develop the probability that a pair of nodes with a known distance gives a particular hop count. Finally, in our analysis, we present a quantitative evaluation for the border effect of geographic distance distribution with a given hop count.

The rest of this paper is organized as follows. In Section 2, we present our analytical model deriving an approximate recursive formula for the hop-distance relationship considering the quasi-UDG model. Section 3 extends our analytical model by taking the border effect and dependence problem into consideration. Section 4 formulates the probability distribution of distance with known hop counts. In Section 5, we demonstrate the use of our developed hop-distance relationship by applying the relationship to a least squares (LS) based localization algorithm. Finally, we report results in Section 6 and draw important conclusions in Section 7.

2 The probability of the hop count given a known distance

In general, the hop-distance relationship is influenced by the density of sensor nodes and their deployment strategy, as well as the radio communication characteristics. Considering the more practical quasi-UDG model, it is recognized that the formulation for the hop-distance relationship with the consideration of quasi-UDG model is tedious and unlikely to produce an exact close form. We seek approximation using a recursive approach to derive an approximated hop-distance relationship. In this section, we focus on analyzing the probability that a particular pair of sensor nodes forms a certain hop count with a known distance.

Suppose that N sensor nodes are deployed randomly in circular region S_b with a radius R_b . The number of nodes in any region is a Poisson random variable with an average node density of $λ = \frac{N}{S_{b}} = \frac{N}{(π R_{b}^{2})}$ . Assume that the communication range of a node is R, the communication model between any pair of nodes follows the quasi-UDG model with a factor of α where 0 < α < 1.

With the quasi-UDG model, the communication area between two nodes with the distance d can be further divided into three cases shown as follows.

If d ≤ αR, then the two nodes can communicate directly.
If αR < d ≤ R, then the two nodes can communicate with a probability p_l, which is set to (R/d - 1)α/(1 - α). It means that a longer distance between two nodes has a lower probability to form a direct communication link.
If d > R, then the two nodes cannot communicate directly.

The quasi-UDG model is illustrated with an example shown in Figure 1. In the figure, we assume that there are two nodes u and v, their distance is d_uv , and their communication probability is P. Let Φ _h (d) be the probability that a particular pair of nodes with d distance apart is h hops away from each other. In the following, we shall first derive Φ _h (d) for the case of h = 1 and then h ≥ 2.

2.1 The case of h= 1

For the case of h = 1, owing to the quasi-UDG model, Φ₁ (d) is obviously

Φ_{1} (d) = \{\begin{matrix} 1 \\ \frac{α}{1 - α} (\frac{R}{d} - 1) \\ 0 \end{matrix} \begin{matrix} d \leq α R \\ α R < d \leq R \\ d > R \end{matrix}

(1)

2.2 The case of h≥ 2

We first note that two nodes, named O₁ and O₂, have no direct link but may communicate through h - 1 relay nodes. This gives rise to two possibilities, where

O₂ is not the m-hop neighbor of O₁ if m < h.
Within the communication range of O₂, there is a least one (h - 1)-hop neighbor of O₁ that has a direct link with O₂.

For m < h, the probability, P_N , that O₂ is not the m-hop neighbor of O₁ can be obtained as

P_{N} = 1 - \sum_{m = 1}^{h - 1} Φ_{m} (d) .

(2)

We shall now consider the second possibility in the following. Considering two circles which one centered at O₁ having a radius of r and the other centered at O₂ having a radius of R. We denote the distance between the two centers as d and refer the common region of the two circles as S. The quantity P_r (S) is defined as the probability that in the area S, there is no (h - 1)-hop neighbor of O₁ that can communicate with O₂ directly. A differential increment of dr on r can obtain a differential incremental region of dS. Assume that the probability Φ_h(d) of any pair of nodes is independent and statistically identical, we have P_r (S + dS) = P_r (S)P_r (dS). In the following subsections, we calculate P_r (dS) based on three conditions, which are d > R, $\frac{1 + α}{2} R < d < R$ , and $α R < d \frac{1 + α}{2} R$ .

2.2.1 O₁falls outside the communication range of O₂where d> R

In Figure 2, we see that dS can be further divided into many differential regions rdrdθ. Since dr and dθ are infinitesimal, the probability that there exists more than one sensor node in the region rdrdθ can be ignored, and the probability that a single sensor node located within rdrdθ can be approximated as λrdrdθ.

We term the circular region centered in O₂ with the radius αR as $C (O_{2})$ , and the annulus region centered in O₂ with the larger radius R and the smaller one αR as $A (O_{2})$ . There are two cases needed to be taken into consideration, which are

When dS falls into $A (O_{2})$ as shown in Figure 2(a), r satisfies d - R ≤ r ≤ d - αR or d + αR ≤ r ≤ d - R. With the definition of the quasi-UDG model, every differential region rdrdθ of dS has a corresponding probability p_lto communicate with O₂. Therefore, P_r(dS) is given by (3) where
$P_{r} (d S) = 1 - 2 Φ_{h - 1} (r) λ r d r \int_{0}^{φ} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ .$
(3)

As illustrated in Figure 2(a), we can get the following relationship

φ = a r c c o s \frac{r^{2} + d^{2} - R^{2}}{2 r d}

(4)

l = \sqrt{r^{2} + d^{2} - 2 r d cos θ} .

(5)

When dS covers both $C (O_{2})$ and $A (O_{2})$ , r will be bounded by d - αR ≤ r < d + αR. The part rdrdθ that falls within $C (O_{2})$ is surely a one-hop neighbor of O₂. When that part falls within $A (O_{2})$ , it has a corresponding probability p_lthat it has a direct link with O₂. Then P_r(dS) can be determined by
$P_{r} (d S) = 1 - 2 Φ_{h - 1} (r) λ r d r [φ_{1} + \int_{φ 1}^{φ} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ]$
(6)

and

φ_{1} = a r c c o s \frac{r^{2} + d^{2} - {(α R)}^{2}}{2 r d} .

(7)

2.2.2 O₁falls within the communication range of O₁and d satisfies $\frac{1 + α}{2} R < d < R$

We use the foregoing strategy for this derivation. We notice that there are three cases needed to be treated individually which are given as follows.

If 0 < r < R - d, dS will be the annulus region and the entire section of dS will fall within $A (O_{2})$ , which gives
$P_{r} (d S) = 1 - 2 Φ_{h - 1} (r) λ r d r \int_{0}^{π} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ$
(8)
If R-d ≤ r < d-αR or d+αR ≤ r < R+d, dS will not be the annulus region but the entire section of dS will still fall within $A (O_{2})$ . Then we can obtain P_r(dS) by (3).
If d-αR ≤ r < d+αR, dS will cover both $C (O_{2})$ and $A (O_{2})$ . In this case, we can determine P_r(dS) by (6).

2.2.3 O₁falls within the communication range of O₂and d satisfies $α R < d \frac{1 + α}{2} R$

There are four cases needed to be considered when O₁ falls within the communication range of O₂ and d satisfying the condition $α R < d \frac{1 + α}{2} R$ , which are

If 0 < r < d-αR, dS will be the annulus region and the entire section of dS will fall within $C (O_{2})$ . Then we can determine P_r(dS) by (8).
If d-αR ≤ r < R-d, dS will still be the annulus region but it covers both $C (O_{2})$ and $A (O_{2})$ . Therefore, we have
$P_{r} (d S) = 1 - 2 Φ_{h - 1} (r) λ r d r [φ_{1} + \int_{φ_{1}}^{π} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ]$
(9)
If R-d ≤ r < d+αR, dS will not be will the annulus region and it covers both $C (O_{2})$ and $A (O_{2})$ . The probability P_r(dS) can be obtained by (6).
If d+αR ≤ r < R+d, dS will fall within the region $A (O_{2})$ , and hence we can compute P_r(dS) by (3).

2.3 Determination of Φ _h(d) for h≥ 2

Consider that P_r (dS) only depends on r with a specific d, we set P_r (dS) = 1 - g(r). From P_r (S + dS) = P_r (S)P_r (dS), the expression of P_r (S) can be obtained by the following linear differential equation where

P_{r} (S) = exp (- \int_{d - R}^{d + R} g (r) d r) .

(10)

Therefore, with (2) and (10), the probability Φ_h(d) with h ≥ 2 can be obtained as

\begin{align} Φ_{h} (d) & = P_{N} \times (1 - P_{r} (S)) & (1) \\ = (1 - \sum_{i = 1}^{h - 1} Φ_{i} (d)) (1 - exp (- 2 λ Ω (d))) & (2) \\ (3) \end{align}

(11)

where knowing d, Ω(d) can be determined by one of the following expressions, which are

For d > hR or d < αR :
$Ω (d) = 0;$
(12)
For R < d ≤ hR :
$\begin{align} Ω (d) & = \int_{d - R}^{d - α R} Φ_{h - 1} (r) r \int_{0}^{φ} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ d r & (1) \\ + \int_{d - α R}^{d + α R} Φ_{h - 1} (r) r (φ_{1} + \int_{φ_{1}}^{φ} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ) d r & (2) \\ + \int_{d + α R}^{d + R} Φ_{h - 1} (r) r \int_{0}^{φ} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ d r & (3) \\ (4) \end{align}$
(13)
For $\frac{1 + α}{2} R < d \leq R$ :
$\begin{align} Ω (d) & = \int_{0}^{R - d} Φ_{h - 1} (r) r \int_{0}^{π} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ d r & (1) \\ + \int_{R - d}^{d - α R} Φ_{h - 1} (r) r \int_{0}^{φ} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ d r & (2) \\ + \int_{d - α R}^{d + α R} Φ_{h - 1} (r) r (φ_{1} + \int_{φ_{1}}^{φ} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ) d r & (3) \\ + \int_{d + α R}^{d + R} Φ_{h - 1} (r) r \int_{0}^{φ} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ d r & (4) \\ (5) \end{align}$
(14)
For $α R < d \leq \frac{1 + α}{2} R$ :
$\begin{align} Ω (d) & = \int_{0}^{d - α R} Φ_{h - 1} (r) r \int_{0}^{π} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ d r & (1) \\ + \int_{d - α R}^{R - d} Φ_{h - 1} (r) r (φ_{1} + \int_{φ_{1}}^{π} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ) d r & (2) \\ + \int_{d - α R}^{d + α R} Φ_{h - 1} (r) r (φ_{1} + \int_{φ_{1}}^{φ} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ) d r & (3) \\ + \int_{d + α R}^{d + R} Φ_{h - 1} (r) r \int_{0}^{φ} \frac{α}{1 - α} (\frac{R}{l} - 1) d θ d r . & (4) \\ (5) \end{align}$
(15)

3 The border effect and dependence problem

In the above analysis, we do not consider borders of a WSN. However, in a realistic scenario, the deployment area of WSNs is finite and hence borders exist. It is known that the probability Φ_h(d) derived assuming that both involved nodes are not near the border of a WSN may give a slightly different result when one or both of them fall near the border. This is known as the border effect. One common handling of the border effect is to consider the toroidal distance metric in the simulation experiment where a node closed to the border can communicate directly with some nodes at the opposite border [25]. While this special setup eliminates the border effect, it creates discrepancy between the study and practical setups which may lead to a certain level of errors.

Clearly, nodes which are closer to the border cover smaller regions than those at least d away from the border, and therefore intuitively the quantity for Ω(d) should be smaller with the consideration of the border effect. Apparently, the border effect gives a different level of impacts in the measure of Φ_h(d) with a different distance between an involved node and the border. However, it is tedious to derive all cases considering the border effect. For simplicity, we take two key cases of the border effect into consideration. Assuming the center of deployment area is O, we consider two annulus near the border in the following.

The first annulus, called $A_{1} (o)$ , is between the circles with radius of R_b-R and R_b-αR.
The second annulus, called $A_{2} (o)$ , is between the circles with radius of R_b-R and R_b-αR.

We set an average metric ζ(h) which varies from 0 to 1 for each hop to determine the decrement of Ω(d). For the circle area with the radius R_b - R, which can be called $C (o)$ , we can set ζ(h) = 1 accordingly.

Another factor we have to consider is the dependence. The hop-distance relationship derived as aforesaid relies on an implicit independence assumption, that is the probability Φ_h(d) of any pair of nodes is independent and statistically identical. However as pointed in [22], the events that those nodes with the direct link to O₂ are h - 1 hops away from O₁ are not mutually independent for cases when h > 2, and the calculation of Φ_h-1(r) should include appropriate dependence conditions. For example, as shown in Figure 3, nodes O₁ and O₂ are d distance apart and h hops away from each other where h = 3. The probability that node M₁ is a 2-hop neighbor of node O₁ is the probability that there is at least one node located in the area S₁ offering packet relay between nodes O₁ and M₁. Here, the area S₁ is the intersect area between the circles with the centers O₁ and M₁. Similarly, the probability that node M₂ is a 2-hop neighbor of node O₁ is the probability that there is at least one node located in the area S₂ which can directly communicate with nodes O₁ and M₂. Here, the area S₂ is the intersect area between the circles with the centers O₁ and M₂. It is obvious in the figure that the areas S₁ and S₂ share a common area S₁₂ indicating that the calculated probabilities are not independent.

To include the impact of the dependence, we add a new factor, namely ξ(h), into the expression of Ω(d). Both factors ζ(h) and ξ(h) are added to allow Ω(d) to reflect a practical setup, and they can be estimated by statistical results via experiments. With the inclusion of ζ(h) and ξ(h) into the expression of ω(h), (11) becomes

Φ_{h} (d) = (1 - \sum_{i = 1}^{h - 1} Φ_{i} (d)) (1 - exp (- 2 λ ω (h) Ω (d))) .

(16)

4 Distance distribution with known hop counts

In this section, assume that sensor nodes are randomly deployed in a circular region, we derive equations to determine the probability density function of distance d with a known hop count $f_{H} (d)$ .

Theorem 4.1 The probability density function for the distance d between two nodes randomly deployed in a circular region with the radius R_bis $f_{D} (d)$ , where

f_{D} (d) = \frac{d}{π R_{b}^{4}} (4 R_{b}^{2} arccos (\frac{d}{2 R_{b}}) - d \sqrt{4 R_{b}^{2} - d^{2}}) .

(17)

We provide the proof of Theorem 4.1 in Appendix A. According to Theorem 4.1, we can obtain the probability density function of distance between any two nodes in the areas $C (o)$ , $A_{1} (o)$ , and $A_{2} (o)$ . Their probability density functions of distance are $f_{D_{c}} (d)$ , $f_{D_{A_{1}}} (d)$ , and $f_{D_{A_{2}}} (d)$ , respectively. We also term them as $f_{D *} (d)$ , in general, where the symbol * is appropriately substituted by either $A_{1}$ , $A_{2}$ or $C$ . Their expressions are given in (18), (19) and (20) in the following.

f_{D_{A_{1}}} (d) = \{\begin{matrix} \frac{2 d}{R_{b}^{2}} & 0 < d \leq α R \\ \frac{2 d}{π R R_{b}^{2} (1 - α) (2 R_{b}^{2} - α R - R)} (Λ (R_{b}, R_{b} - α R, d) - π {(R_{b} - R)}^{2}) & α R < d \leq R \\ \frac{2 d}{π R R_{b}^{2} (1 - α) (2 R_{b}^{2} - α R - R)} (Λ (R_{b}, R_{b} - α R, d) - Λ (R_{b}, R_{b} - R, d)) & R < d \leq 2 R_{b} - R \\ \frac{2 d}{π R R_{b}^{2} (1 - α) (2 R_{b}^{2} - α R - R)} Λ (R_{b}, R_{b} - α R, d) & 2 R_{b} - R < d \leq 2 R_{b} - α R \end{matrix}

(18)

f_{D_{A_{2}}} (d) = \{\begin{matrix} \frac{d}{π α R R_{b}^{2} (2 R_{b} - α R)} (4 R_{b}^{2} a r c c o s (\frac{d}{2 R_{b}}) - d \sqrt{4 R_{b}^{2} - d^{2}} - 2 π {(R_{b} - α R)}^{2}) & 0 < d \leq α R \\ \frac{d}{π α R R_{b}^{2} (2 R_{b} - α R)} (4 R_{b}^{2} a r c c o s (\frac{d}{2 R_{b}}) - d \sqrt{4 R_{b}^{2} - d^{2}} - 2 Λ (R_{b}, R_{b} - α R, d)) & α R < d \leq 2 R_{b} - α R \\ \frac{d}{π α R R_{b}^{2} (2 R_{b} - α R)} (4 R_{b}^{2} a r c c o s (\frac{d}{2 R_{b}}) - d \sqrt{4 R_{b}^{2} - d^{2}}) & 2 R_{b} - α R < d \leq 2 R_{b} \end{matrix}

(19)

\begin{gathered} f_{D_{C}} (d) = \frac{4 d}{π {(R_{b} - R)}^{2}} arccos \frac{d}{2 (R_{b} - R)} - \frac{4 d^{2}}{π {(R_{b} - R)}^{4}} \sqrt{4 {(R_{b} - R)}^{2} - d^{2}} \\ s . t . 0 < d \leq 2 \cdot (R_{b} - R) \end{gathered}

(20)

where Λ(R, r, d) is given by

\begin{align} Λ (R, r, d) = & R^{2} arccos \frac{R^{2} + d^{2} - r^{2}}{2 d R} + r^{2} a r c c o s \frac{r^{2} + d^{2} - R^{2}}{2 d r} & (1) \\ - \frac{1}{2} \sqrt{({(r + R)}^{2} - d^{2}) (d^{2} - {(R - r)}^{2})} . & (2) \\ (3) \end{align}

By the Bayes' formula, given $f_{D *} (d)$ and Φ_h(d), we can obtain the expression $f_{H *} (d)$ which is the probability density function of the geographical distance d when the hop count h is known to be H*. This expression is determined by

f_{H *} (d) = \frac{Φ_{h} (d) f_{D *} (d)}{\int_{r_{0}}^{h R} Φ_{h} (x) f_{D *} (x) d x}

(21)

where r₀ = 0 when h = 1, and r₀ = αR when h > 1.

5 Localization Applications

With the development of the hop-distance relationship for the quasi-UDG model, in this section, we show the application of this new relationship to a particular localization algorithm using LS based localization algorithms [26], and we call this newly designed localization algorithm enhance weighted least squares (EWLS).

In a particular localization scenario in WSNs, we assume that there is a number of nodes whose locations are known, and they shall be called anchor nodes. Other nodes that have no knowledge of their locations are called unknown nodes. Consider that an unknown node j can obtain the location x _i, hop h_ji and average hop-distance c_i of an anchor node i. The distance between nodes j and i can be calculated as d_ji = c_ih_ji . In our test scenario, we place an anchor node o in the center and add several other anchor nodes in the map.

We design a simple mechanism to compute the range of distance d_ji . Each anchor node i collects some information to other anchor node k, computes and ranks the average hop-distance c_i(k)= d_ik/h_ik, such as c_i(1)≥ c_i(2)≥ ⋯ ≥ c_i(n). We set the range of average hop-distance as

{\underset{}{c}}_{i} = \frac{\sum_{k = 1}^{n - 1} | | x_{i} - x_{(k)} ∥}{\sum_{k = 1}^{n - 1} h_{i (k)}} \leq c_{i} \leq \frac{\sum_{k = 2}^{n} | | x_{i} - x_{(k)} ∥}{\sum_{k = 2}^{n} h_{i (k)}} = {\bar{c}}_{i} .

(22)

Following that, the range of distance d_ji can be computed as $d_{j i}^{(M)} = {\bar{c}}_{i} \times h_{j i}$ and $d_{j i}^{(m)} = {\underset{}{c}}_{i} \times h_{j i}$ . With the range of distance d_ji , the variance v_h of the pdf $f_{H} (d)$ , we compute the weights, w_i , of measured distance d_ji as

w_{i} = \frac{1}{v_{h} \int_{d_{j i}^{(m)}}^{d_{j i}^{(M)}} f_{H} (x) d x} .

(23)

Finally, we set W = diag(w₁, ⋯, w_n ) and compute the location $\hat{x}$ of an unknown node using the following results, where

\hat{x} = {(A_{n}^{T} W A_{n})}^{- 1} A_{n}^{T} W b_{n}

(24)

and

\begin{gathered} A_{n} = 2 [\begin{matrix} x_{1} - Ω (x_{i}) & y_{1} - Ω (y_{i}) \\ ⋮ & ⋮ \\ x_{n} - Ω (x_{i}) & y_{n} - Ω (y_{i}) \end{matrix}] \\ b_{n} = [\begin{matrix} x_{1}^{2} - Ω (x_{i}^{2}) + y_{1}^{2} - Ω (y_{i}^{2}) + Ω (d_{i}^{2}) - d_{1}^{2} \\ ⋮ \\ x_{n}^{2} - Ω (x_{i}^{2}) + y_{n}^{2} - Ω (y_{i}^{2}) + Ω (d_{i}^{2}) - d_{n}^{2} \end{matrix}] \\ Ω (t) = \frac{\sum_{i = 1}^{n} t w_{i}}{\sum_{i = 1}^{n} w_{i}} . \end{gathered}

6 Result discussions

In this section, we compare the analytical and statistical results through simulation experiments to illustrate the performance of our proposed hop-distance model. To illustrate the benefit of applying our model to LS-based localization algorithms, we compared our enhanced algorithm of EWLS to two classical LS-based localization algorithms namely LS [26] and PDM [27].

6.1 Impacts of boarder effects and dependence

We first illustrate the impacts of the boarder effect and dependence problem. In the experiments, we gather statistics of the hop counts with corresponding distance information using Monte Carlo simulations. All the simulation data are collected from several scenarios where N sensor nodes are randomly deployed in a circular region of radius R_b , and the transmission range is set to R with the consideration of the quasi-UDG model. The parameters are set to N = 400, R_b = 200, R = 50, α = 0.75, and the result comparisons are listed in Table 1. Let o be the deployment center. The region where nodes are deployed away from the border is denoted as $C (o)$ , and we term $A_{1} (o)$ and $A_{2} (o)$ as the annulus regions in which the distances to o are within (R_b-R, R_b-αR] and (R_b-αR, R_b ], respectively.

Table 1 Comparisons between analytical and simulation results of Φ_h(d)

Full size table

In Table 1 we use cumulative absolute difference (CAD) to measure the sum of absolute differences between the analytical results and statistical data. We set $C A D = \sum_{d} | Φ_{h} (d) - S i m_{h} |$ , where Φ_h(d) and Sim_h are the probabilities of two nodes giving a hop count of h with a known distance of d obtained from the analysis and simulation, respectively. Moreover, we denote CAD* as the CAD measurement between analytical results without the border effect consideration and statistical data. For $A_{1} (o)$ and $A_{2} (o)$ , we can see that the CAD* of each hop is larger than that of CAD because of the impact of the border effect.

6.2 The validation of distribution of distance by a known hop count

We conduct simulation experiments with N = 400, R_b = 200, R = 50, α = 0.75 and present $f_{H *} (d)$ in Figures 4, 5 and 6 with the statistical data and our analytical results. In all three cases, we note that the numerical results of $f_{H *} (d)$ given in (21) show excellent agreement with the simulation results. This excellent agreement confirms the accuracy of our model for the estimation of the distance given a known hop count between two sensor nodes.

6.3 Localization accuracy comparisons

In the following, we conduct several simulation experiments to illustrate the performance of our proposed EWLS algorithm. In the simulation, N = 100 sensor nodes are randomly deployed in the circle $S_{b}$ with the radius R_b = 200. The number of anchor nodes is 16 and the communication range of each sensor node is R = 80. The factor α of the quasi-UDG model is set to 0.76. In Figure 7(a), even within the communication range R of node 1, the nodes 30, 38, 53, and 63 cannot communicate directly with node 1 due to the considered quasi-UDG model. With the network topology illustrated in Figure 7(a), we show the localization errors of EWLS, LS, and PDM in Figure 7. Apparently, the accuracy of EWLS is higher than that of the two classical algorithms where the average localization errors of EWLS, LS, and PDM are 0.26702R, 0.29728R, and 0.28462R, respectively. This confirms that when WSNs exhibit the quasi-UDG connectivity behavior, our new hop-distance relationship that captures the behavior offers an improved accuracy in localization.

In the following, we further compare the localization accuracy among EWLS, LS and PDM under various scenarios. In these simulation experiments, we set N = 400, and sensor nodes are deployed uniformly in the circle area with the radius R_b = 200. The connectivity of nodes follows the quasi-UDG model. The localization error is calculated as $ξ = \sum_{j} ∥ x_{j} - {\hat{x}}_{j} ∥ / (N - n)$ .

Firstly, we focus on the impact of the number of anchor nodes. The factor α of quasi-UDG model is set to 0.76 and the communication range R of each sensor node is set to 50. In Figure 8, we can see that the localization error ξ of all three algorithms decreases with the increase of number of anchor nodes. Among them, our proposed EWLS always offers the best performance.

Secondly, we investigate the impact of the parameter α of quasi-UDG model. In this scenario, we set the number of anchor nodes to 40 and the parameter α varies from 0.72 to 1. The localization error comparison is given in Figure 9. We observe that when the parameter α increases, the number of neighbor nodes increases and the number of hops between an unknown node and an anchor node decreases. Thus, the localization error decreases, and our proposed EWLS algorithm remains the best among all for all considered α values.

Last we study the impact of the communication range R of each sensor node. We set the parameter α of quasi-UDG model to 0.76 and set the number of anchor nodes to 40. Similarly, we compare the localization errors in Figure 10 with a range of R values. We observe that because the number of neighbor nodes of a node increases when its communication range increases, and number of hops between an unknown node and an anchor decreases which leads to a decrease in localization errors. Comparing the results for all algorithms, our proposed EWLS outperforms its peers.

7 Conclusions

The hop-distance relationship information can effectively improve the performance of the protocols for wireless sensor networks in many aspects. However, most studies focus on the UDG model which significantly deviates from the real world. In the paper, we presented an analytical modeling to formulate the hop-distance relationship considering the quasi-UDG model. Senor nodes are randomly distributed in a circular region according to a Poisson point process. The probability of a particular hop count given a known distance Ω_h(d) was studied, and the border effect and dependence problem was considered in our analysis. Precisely, we derived the probability density function of a random variable describing the distance between two arbitrary nodes with a given hop count. Simulation results confirmed that our analytical results gave excellent accuracy. From the results, we further illustrated impact of the border effect.

Furthermore, we demonstrated the application of our developed hop-distance relationship considering the quasi-UDG model in WSN localizations. We designed a LS-based localization algorithm using our developed relationship and compared its performance with other popular LS-based localization algorithms. We again confirmed that the explicit use of our developed relationship in the computation of localization algorithms improved the localization accuracy.

A Appendix

Suppose that a node x(x, y) is randomly deployed in a circular region with the radius R_b , the joint distribution f_x(x, y) can be obtained from

f_{x} (x, y) = \{\begin{matrix} \frac{1}{π R_{b}^{2}}, & x^{2} + y^{2} \leq R_{b}^{2} \\ 0, & e l s e w h e r e \end{matrix} .

(25)

As the nodes x₁(x₁, y₁) and x₂(x₂, y₂) are selected independently, the joint pdf of x₁ and x₂ is

f_{x_{1}, x_{2}} (x_{1}, y_{1}, x_{2}, y_{2}) = \{\begin{matrix} \frac{1}{{(π R_{b}^{2})}^{2}}, & x_{i}^{2} + y_{i}^{2} \leq R_{b}^{2}, i = 1, 2 \\ 0, & e 1 s e w h e r e \end{matrix} .

(26)

We set x_d = x₁ - x₂ and x_m = (x₁ + x₂)/2. The joint distribution of x_m and x_d can be obtained as

f_{x_{d}, x_{m}} (x_{d}, y_{d}, x_{m}, y_{m}) = \{\begin{matrix} \frac{1}{{(π R_{b}^{2})}^{2}} & x_{d}, x_{m} \in L_{1} \cap L_{2} \\ 0, & e l s e w h e r e \end{matrix}

(27)

where the constraints L₁ and L₂ are

\begin{gathered} L_{1} : {(x_{m} + x_{d} ∕ 2)}^{2} + {(y_{m} + y_{d} ∕ 2)}^{2} < R_{b}^{2} \\ L_{2} : {(x_{m} - x_{d} ∕ 2)}^{2} + {(y_{m} - y_{d} ∕ 2)}^{2} < R_{b}^{2} . \end{gathered}

(28)

We set the probability of the geographical distance $D$ between x₁ and x₂ less than d to be $P (D \leq d)$ , and the constraint L₃ can be expressed by $L_{3} : D^{2} = x_{d}^{2} + y_{d}^{2} \leq d^{2}$ , then we have

P (D \leq d) \underset{L 1 \cap L_{2} \cap L_{3}}{\int \int \int \int} f_{X_{d}, X_{m}} (x_{d}, y_{d}, x_{m}, y_{m}) d x_{m} d y_{m} d x_{d} d y_{d} .

(29)

With L₁ ∩ L₂, then x_m falls into the intersectional region of two circles with centers (x_d/ 2, y_d/ 2) and (-x_d/ 2, -y_d/ 2). The intersectional area is

2 R_{b}^{2} arccos (\frac{\sqrt{x_{d}^{2} + y_{d}^{2}}}{2 R_{b}}) - \sqrt{x_{d}^{2} + y_{d}^{2}} \times \sqrt{R_{b}^{2} - (\frac{x_{d}^{2} + y_{d}^{2}}{4})} .

(30)

Since ${f_{x}}_{d_{}, x_{m}} (x_{d}, y_{d}, x_{m}, y_{m})$ is constant, (29) can be rewritten as

P (D \leq d) = \frac{1}{π R^{4}} \int_{0}^{d} [4 R^{2} arccos (\frac{l}{2 R}) - l \sqrt{4 R^{2} - l^{2}}] l d l

(31)

Therefore, we have

f_{D} (d) = \frac{d}{π R^{4}} (4 R^{2} arccos (\frac{d}{2 R}) - d \sqrt{4 R^{2} - d^{2}})

(32)

where 0 < d < 2R_b .

References

Jennifer Yick DG, Biswanath M: Wireless sensor network survey. Comp Netw 2008, 52: 2292-2330. 10.1016/j.comnet.2008.04.002
Article Google Scholar
Niculescu D: Positioning in ad hoc sensor networks. IEEE Netw 2004, 18(4):24-29. 10.1109/MNET.2004.1316758
Article MathSciNet Google Scholar
Patwari N, Ash JN, Kyperountas S, Hero AO, Moses RL, Correal NS: Locating the nodes: cooperative localization in wireless sensor networks. IEEE Signal Process Mag 2005, 22(4):54-69.
Article Google Scholar
Breu H, Kirkpatrick DG: Unit Disk Graph Recognition is NP-hard. Computational Geometry 1998, 9(1-2):3-24. 10.1016/S0925-7721(97)00014-X
Article MathSciNet MATH Google Scholar
Kuhn F, Moscibroda T, Wattenhofer R: Unit disk graph approximation. Proc. Joint Workshop on Foundations of Mobile Computing, Philadelphia, PA, USA 2004, 17-23.
Google Scholar
Niculescu D, Nath B: DV based positioning in ad hoc networks. Telecommun Syst 2003, 22(14):267-280.
Article Google Scholar
He T, Huang C, Blum BM, Stankovic JA, Abdelzaher T: Range-free localization schemes for large scale sensor networks. Proc International Conference on Mobile Computing and Networking (MobiCom), California 2003, 81-95.
Google Scholar
Sheu J-P, Chen P-C, Hsu C-S: A distributed localization scheme for wireless sensor networks with improved grid-scan and vector-based refinement. IEEE Trans Mobile Comput 2008, 7(9):1110-1123.
Article Google Scholar
Shang Y, Ruml W, Zhang Y, Fromherz M: Localization from connectivity in sensor networks. IEEE Trans Parallel Distribu Syst 2004, 15(11):961-974. 10.1109/TPDS.2004.67
Article Google Scholar
Tinh PD, Kawai M: Distributed range-free localization algorithm based on self-organizing maps. EURASIP J Wireless Commun Netw 2010., 2010:
Google Scholar
Dulman S, Rossi M, Havinga P, Zorzi M: On the hop count statistics for randomly deployed wireless sensor networks. Int J Sensor Netw 2006, 1(1/2):89-102. 10.1504/IJSNET.2006.010837
Article Google Scholar
Flury R, Pemmaraju SV, Wattenhofer R, Zurich ZE: Greedy routing with bounded stretch. Proc IEEE International Conference on Computer Communications (INFOCOM), Rio de Janeiro, Brazil 2009, 1737-1745.
Google Scholar
Ruhrup S, Kalosha H, Nayak A, Stojmenovic I: Message-efficient beaconless georouting with guaranteed delivery in wireless sensor, ad hoc, and actuator networks. IEEE/ACM Trans Netw 2010, 18(1):95-108.
Article Google Scholar
Zhou Z, Peng Z, Cui J-H, Shi Z, Bagtzoglou A: Scalable localization with mobility prediction for underwater sensor networks. IEEE Trans Mobile Comput 2011, 10(3):335-348.
Article Google Scholar
Kadivar M, Shiri ME, Dehghan M: Distributed topology control algorithm based on one- and two-hop neighbors' information for ad hoc networks. Computer Communications 2009, 32(2):368-375. 10.1016/j.comcom.2008.11.014
Article Google Scholar
Khadar F, Simplot-Ryl D: Incremental power topology control protocol for wireless sensor networks. Proc IEEE International Symposium on Personal, Indoor and Mobile Radio Communicatoins (PIMRC), Tokyo, Japan 2009, 77-81.
Google Scholar
Chandler SAG: Calculation of number of relay hops required in randomly located radio network. Electronics Letters 1989, 25(24):1669-1671. 10.1049/el:19891119
Article Google Scholar
Bettstetter C, Eberspacher J: Hop distances in homogeneous ad hoc networks. Proc IEEE Vehicular Technology Conference (VTC 2003-Spring), Jeju Island, Korea 2003, 2286-2290.
Google Scholar
Li Z, Trappe W, Zhang Y, Nath B: Robust statistical methods for securing wireless localization in sensor networks. Proc International Symposium on Information Processing in Sensor Networks (IPSN), California 2005, 91-98.
Google Scholar
Ekici E, Mcnair J, Al-Abri D: A probabilistic approach to location verification in wireless sensor networks. Proc IEEE International Conference on Communications (ICC) 2006, 8: 3485-3490.
Google Scholar
Zhao L, Liang Q: Hop-distance estimation in wireless sensor networks with applications to resources allocation. EURASIP J Wireless Commun Netw 2007., 2007:
Google Scholar
Ta X, Mao G: BDO Anderson, Evaluation of the probability of k-hop connection in homogeneous wireless sensor networks. Proc Global Telecommuniations Conference (GLOBECOM), Washington 2007, 1279-1284.
Google Scholar
Ma D, Er MJ, Wang B, Lim HB: K-hop statistics in wireless sensor networks. Proc International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP), Melbourne, Austrilia 2009, 469-474.
Chapter Google Scholar
Chen J, Jiang A, Kanj IA, Xia G, Zhang F: Separability and topology control of quasi unit disk graphs. Proc IEEE International Conference on Computer Communications (INFOCOM), Alaska 2007, 2225-2233.
Google Scholar
Bettstetter C: On the minimum node degree and connectivity of a wireless multihop network. Proc International Symposium on Mobile Ad Hoc Networking and Computing (MobiHoc), Lausanne, Switzerland 2002, 80-91.
Google Scholar
Savvides A, Han C-C, Strivastava MB: Dynamic fine-grained localization in ad-hoc networks of sensors. Proc International Conference on Mobile Computing and Networking (MobiCom), Rome, Italy 2001, 166-179.
Google Scholar
Lim H, Hou JC: Localization for anisotropic sensor networks. Proc IEEE International Conference on Computer Communications (INFOCOM), Miami 2005, 138-149.
Google Scholar

Download references

Acknowledgements

The authors gratefully acknowledge the support of the Program of Introducing Talents of Discipline to Universities ("111 Project") under grant No. B08002, and the support of the National Natural Science Foundation of China (NSFC) under Grants No. 60802016, 60833002 and 60972010, the support by "the Fundamental Research Funds for the Central Universities" under grant No. 2009JBM007.

Author information

Authors and Affiliations

School of Electronics and Information Engineering, Beijing Jiaotong University, Beijing, 100044, PR China
Deyun Gao & Yanchao Niu
TEDA College, Nankai University, Tianjin, 300457, PR China
Ping Chen
School of Computer Engineering, Nanyang Technological University, 639798, Singapore
Chuan Heng Foh

Authors

Deyun Gao
View author publications
You can also search for this author in PubMed Google Scholar
Ping Chen
View author publications
You can also search for this author in PubMed Google Scholar
Chuan Heng Foh
View author publications
You can also search for this author in PubMed Google Scholar
Yanchao Niu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chuan Heng Foh.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Gao, D., Chen, P., Foh, C.H. et al. Hop-distance relationship analysis with quasi-UDG model for node localization in wireless sensor networks. J Wireless Com Network 2011, 99 (2011). https://doi.org/10.1186/1687-1499-2011-99

Download citation

Received: 31 December 2010
Accepted: 17 September 2011
Published: 17 September 2011
DOI: https://doi.org/10.1186/1687-1499-2011-99

Hop-distance relationship analysis with quasi-UDG model for node localization in wireless sensor networks

Abstract

1 Introduction

2 The probability of the hop count given a known distance

2.1 The case of h= 1

2.2 The case of h≥ 2

2.2.1 O1falls outside the communication range of O2where d> R

2.2.2 O1falls within the communication range of O1and d satisfies 1 + α 2 R<d<R

2.2.3 O1falls within the communication range of O2and d satisfies αR<d 1 + α 2 R

2.3 Determination of Φ h (d) for h≥ 2

3 The border effect and dependence problem

4 Distance distribution with known hop counts

5 Localization Applications

6 Result discussions

6.1 Impacts of boarder effects and dependence

6.2 The validation of distribution of distance by a known hop count

6.3 Localization accuracy comparisons

7 Conclusions

A Appendix

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

2.2.1 O₁falls outside the communication range of O₂where d> R

2.2.2 O₁falls within the communication range of O₁and d satisfies $\frac{1 + α}{2} R < d < R$

2.2.3 O₁falls within the communication range of O₂and d satisfies $α R < d \frac{1 + α}{2} R$

2.3 Determination of Φ _h(d) for h≥ 2