A practical approach for outdoors distributed target localization in wireless sensor networks

Béjar, Benjamín; Zazo, Santiago

doi:10.1186/1687-6180-2012-95

Research
Open access
Published: 01 May 2012

A practical approach for outdoors distributed target localization in wireless sensor networks

Benjamín Béjar¹ &
Santiago Zazo¹

EURASIP Journal on Advances in Signal Processing volume 2012, Article number: 95 (2012) Cite this article

3285 Accesses
8 Citations
Metrics details

Abstract

Wireless sensor networks are posed as the new communication paradigm where the use of small, low-complexity, and low-power devices is preferred over costly centralized systems. The spectra of potential applications of sensor networks is very wide, ranging from monitoring, surveillance, and localization, among others. Localization is a key application in sensor networks and the use of simple, efficient, and distributed algorithms is of paramount practical importance. Combining convex optimization tools with consensus algorithms we propose a distributed localization algorithm for scenarios where received signal strength indicator readings are used. We approach the localization problem by formulating an alternative problem that uses distance estimates locally computed at each node. The formulated problem is solved by a relaxed version using semidefinite relaxation technique. Conditions under which the relaxed problem yields to the same solution as the original problem are given and a distributed consensus-based implementation of the algorithm is proposed based on an augmented Lagrangian approach and primal-dual decomposition methods. Although suboptimal, the proposed approach is very suitable for its implementation in real sensor networks, i.e., it is scalable, robust against node failures and requires only local communication among neighboring nodes. Simulation results show that running an additional local search around the found solution can yield performance close to the maximum likelihood estimate.

1 Introduction

The deployment of a large number of scattered sensors in a certain area constitutes a very powerful tool for sensing and retrieving information from the environment (e.g., temperature, humidity, motion). The main features of wireless sensor networks (WSN) are that of a large number of low-cost nodes with limited computational and power resources. WSNs must also be scalable and robust against changes in topology (i.e., node failure or addition of new nodes), as well as energy efficient. These are the major design issues in WSNs that make the development of simple and efficient algorithms a challenging problem. These limitations also make centralized approaches not very suitable for being used in WSNs. Localization is a key task (often mandatory) in many applications [1] and therefore, distributed localization algorithms are of high practical importance.

There exist different measurement sources that can be fused in order to get an estimate of the target's position [1, 2], like time-of-arrival (TOA), time-difference of arrival (TDOA), angle of arrival (AOA) or received signal strength indicator (RSSI). In this article, we focus on single antenna nodes without tight synchronization abilities, which leads us to the use of RSSI measurements for the localization task. One of the main challenges when using RSSI measurements is that the mapping between the measurement and target's position is nonlinear and hence, finding a suitable solution becomes more challenging. Some approaches to deal with non-linearities are based on particle filtering principles [3]. In the context of WSN particle filtering approaches have also been proposed for localization and tracking using RSSI measurements [4–8]. In general, particle filtering approaches have shown very good performance when dealing with RSSI measurements but they are centralized and suffer from a high computational cost and hence, their applicability in a real scenario is questionable.

A recent approach based on convex optimization concepts has been proposed in [9, 10] for the node localization problem. In [9] a semidefinite relaxation approach is used to cast the localization problem into a semidefinite program (SDP) that can be solved efficiently via interior point methods, see [11] and references therein. The obtained position estimate through SDP is then further refined via an iterative algorithm. Although the proposed method provide near optimal results (i.e., close to the Cramer-Rao bound) they are centralized so their application to WSN may be limited. The problem of source localization using energy measurements has also been treated in [12], where a distributed algorithm based on projections onto convex sets is presented. The algorithm is shown to asymptotically approach the maximum likelihood (ML) estimate as the number of nodes increases when the target lies in the convex hull defined by the node's coordinates. In [13], an alternative approach is presented that can handle variations in the path-loss exponent. However, in both approaches no restrictions are imposed in the communication among nodes. In real applications this will cause rapid battery depletion if far away nodes are to communicate. Further, in both approaches the estimation is performed only by a subset of nodes that are selected according to their received signal to noise ratio. The main drawback is that such subset must be known to every node in the network. In a real scenario, the required signaling and routing overhead necessary for node coordination may limit their application.

In this contribution, we propose a distributed algorithm for localization in WSN's by fusing RSSI measurements. We approach the ML estimation problem by solving a simplified and more tractable problem which allows the use of convex optimization tools for its distributed solution. More precisely, we use an augmented Lagrangian approach with a primal-dual decomposition [14, 15]. The developed approach offers an advantage over centralized approaches as it is scalable, robust against changes in network's topology and requires only local communication among neighboring nodes. These are key properties very desirable in the context of WSN's.

The article is organized as follows: Section 2 introduces the localization problem and the underlying propagation model. In Section 3, we present the localization approach based on RSSI readings and its distributed implementation is presented in Section 4. Simulations are provided in Section 6, while some comments and concluding remarks are given in Section 7.

Notation

Bold lower- and upper-case letters denote vectors and matrices, respectively. For vector quantities the operator || · || denotes Euclidean norm while for matrices it refers to the Frobenius norm. The symbol 0 denotes a matrix of appropriate dimensions whose entries are all zeros. The symbol I is used to denote the identity matrix of appropriate dimensions. The optimal value of a variable x in an optimization problem is denoted by x*. The symbol ℝ ⁿ is used to denote the set of real n-dimensional vectors while $S_{+}^{n}$ is used to denote the set of symmetric, n × n positive semi-definite matrices.

2 Problem formulation and definitions

Consider a wireless sensor network, as the one depicted in Figure 1, consisting of M nodes randomly deployed on a certain area (in the same x-y plane). Nodes are static and able to communicate with adjacent nodes that lie within a given range for communications. Nodes are aware of their own location but not aware of the location of any other element in the network. Assume the presence of a target node that emits beacon frames that can be heard by all nodes in the network. The goal is to determine the location of the target node in the x-y plane.

For getting estimates of the target position, nodes employ RSSI measurements. The use of RSSI readings is of practical convenience when working with real hardware as they do not need tight synchronization requirements. We assume that the RSSI follows a linear relationship with the received power (we assume they are equal). Let denote r_m as the received power at node m. A common assumption, see [2] and references therein, is that the received power follows a lognormal distribution with a distance-dependent mean as

r_{m} = p_{m} - 10 α_{m} {log}_{10} (\frac{d_{m}}{d_{0}}) + n_{m},

(1)

where p_m is the received power (in dB) at reference distance d₀, α_m is the path-loss exponent, d_m is the true distance between the target and the m th node and $n_{m} ~ N (0, σ_{m}^{2})$ is a Gaussian random variable of zero mean and variance $σ_{m}^{2}$ . The received power r_m will be used to get an estimate of the true target position.

3 Localization strategies

In this section, we present different localization strategies using the received power (in dB) at the nodes. We first consider the (centralized) ML estimate and then we propose to use a suboptimal strategy based on local distance estimate at each node. We show that the proposed localization strategy can be implemented in a fully distributed way by only local communication among neighboring nodes.

3.1 ML estimation

Consider the presence of a centralized unit that gathers all measurements coming from the nodes. Let r = [r₁, . . . , r_M ]^T be a vector whose components are the different measurements taken by each sensor and denote x = [x_t y_t ]^T ∈ ℝ² as the target's position. The true distance between the target and the m th sensor can be then expressed as

d_{m} = | | x - c_{m} | |,

(2)

where c_m = [x_m y_m ]^⊤ ∈ ℝ² are the coordinates of node m with m = 1, . . . , M. The vector of measurements r can now be written as

r = [\begin{matrix} p_{1} - 10 α_{1} lo g_{10} (\frac{| | x - c_{1} | |}{d_{0}}) \\ ⋮ \\ p_{M} - 10 α_{M} lo g_{10} (\frac{| | x - c_{M} | |}{d_{0}}) \end{matrix}] + [\begin{matrix} n_{1} \\ ⋮ \\ n_{M} \end{matrix}] = μ (x) + n

(3)

where the vector $n ~ N (0, \sum)$ is jointly Gaussian with zero mean and covariance ∑. It is easy to see that r will follow a Gaussian distribution with mean μ(x) and covariance ∑, that is

p (r; x) = \frac{1}{{(2 π)}^{M / 2} \sqrt{det \sum}} exp [- 1 / 2 {(r - μ (x))}^{T} \sum^{- 1} (r - μ (x))]

(4)

where p(r ; x) is the probability density function of r with parameter x. The ML estimate of the target position is then

{\hat{x}}_{ML} = \underset{x}{arg max} p (r; x),

(5)

which is equivalent to maximizing the log of p(r ; x). Neglecting all terms that do not depend on x it is easy to see that

{\hat{x}}_{ML} = \underset{x}{arg min} μ {(x)}^{T} \sum^{- 1} μ (x) - 2 r^{T} \sum^{- 1} μ (x) .

(6)

The objective to be minimized in (6) is not convex and therefore, finding the global optimum is not an easy task. In Figure 2 (left), we have an illustration of how the objective in (6) looks like for a network of 20 nodes over a normalized square area. It is clear that the function is not convex and that several local minima and saddle point may exist. Instead of dealing directly with the ML estimate we propose to use a suboptimal approach that offers a reasonable good performance and that allows for its distributed computation.

3.2 Practical approach

In this section, we propose to estimate the target's position based on local distance estimates computed at each node. The use of local distance estimates allows the derivation of simple and distributed estimators of the target's position. We have that, for the propagation model (1), the ML estimate of the distance between the m th node and the target is given by

{\hat{d}}_{m} = d_{0} 10 (\frac{p_{m} - r_{m}}{10 α_{m}}) .

(7)

Taking the square at both sides of (2) and further developing, it is easy to see that the following equation must be satisfied

\begin{matrix} d_{1}^{2} = x^{T} x - 2 c_{1}^{T} x + | | c_{1} | |^{2} \\ ⋮ \\ d_{M}^{2} = x^{T} x - 2 c_{M}^{T} x + | | c_{M} | |^{2} \end{matrix}

(8)

Rearranging terms we can express (8) in a more compact form as

[\begin{matrix} d_{1}^{2} - | | c_{1} | |^{2} \\ ⋮ \\ d_{M}^{2} - | | c_{M} | |^{2} \end{matrix}] = (x^{T} x) \cdot 1 - 2 \underset{C}{\underset{⏟}{[\begin{matrix} c_{1}^{T} \\ ⋮ \\ c_{M}^{T} \end{matrix}]}} x,

(9)

where 1 is a M × 1 vector of all ones. However, we do not have the actual distances to the target but a noisy version of them as per (7). Define the vector $b = {[{| | c_{1} | |}^{2} - {\hat{d}}_{1}^{2}, \dots, {| | c_{M} | |}^{2} - {\hat{d}}_{M}^{2}]}^{T}$ and the vector-valued cost function f(x): ℝ² ↦ ℝ ^M as

f (x) = (x^{T} x) \cdot 1 - 2 C x + b .

(10)

We can then get an estimate of the target's position by minimizing the norm of (10). In order to incorporate robustness and make the localization task more applicable to realistic scenarios we propose to use a weighted version of the cost function (10). In a WSN it may happen that some of the nodes exhibit a bias in their measurements due to the presence of obstacles. Additionally, nodes do not have precise information about their own locations instead, some errors may be present. The incorporation of weights will mitigate the effects of biased nodes and uncertainties in nodes' positions. So we compute an estimate $\hat{x}$ of the true target's position x as the solution to the following non-linear (weighted) least-squares problem

\hat{x} = \underset{x}{minimize} ∥D f (x)∥,

(11)

where D = diag (γ₁, . . . , γ_M ) is a diagonal weighting matrix with γ_m≥ 0 for all m = 1, . . . , M. A proper choice for the weights would be inversely proportional to the variance of the measurements. As we are assuming the log-normal model for the measurements it is well known that the variance of the ML estimate (7) is proportional to the square of the true distance to be estimated [2, 16]. With this consideration in mind we may choose to weight our measurements inversely proportional to the measured distance, that is $γ_{m} = 1 / {\hat{d}}_{m}$ .

This problem has been studied in [17, 18] where a distributed version of the Gauss-Newton method can be used for its solution. In this study, we present a more flexible approach that uses concepts from convex optimization theory. The proposed approach has better convergence properties and also makes it straightforward to include additional constraints to the problem that may prevent it from instabilities. In order to proceed let's write problem (11) as the following equivalent problem

\hat{x} = \underset{x}{minimize} \sum_{m = 1}^{M} γ_{m} {(x^{T} x - 2 c_{m}^{T} x + b_{m})}^{2}

(12)

Note that, although (11) and (12) are equivalent problems (i.e., with the same solution), they are different because in the latter case we are minimizing the squared norm of Df(x). The minimization of the squared norm is motivated by the fact that it allows a simple distributed implementation as it can be guessed from the structure of (12). The use of the objective in (12) is well motivated by the fact that we get a smoother surface at the cost of introducing some bias with respect to the ML solution (see Figure 2 (right)). However, if the bias is small, we may still get to the ML estimate by performing a local search around the solution of (12). However, in order to use convex optimization methods we need problem (12) to be convex. Unfortunately, the objective function is not convex because we are adding the squares of quadratic convex but not necessarily positive functions [11]. It would be interesting to exploit some hidden convexity of the problem so that convex optimization methods can be applied.

A possible approach to make the problem convex is to use semidefinite relaxation technique. Let X = xx^Tand note that Tr (X) = ||x||², where Tr(·) is the trace operator. We can rewrite the problem as

\begin{aligned} \underset{X, x}{minimize} \sum_{m = 1}^{M} γ_{m} {(Tr (X) - 2 c_{m}^{T} x + b_{m})}^{2} \\ subject to X = x x^{T} \end{aligned}

(13)

We now have that the objective is convex as it is the composition of an affine function of X and x with a convex function [11]. However, the above problem is still non-convex due to the non-linear constraint X = xx^T. We can then relax the equality constraint by replacing it with a semidefinite constraint. As a result we end up with the following (convex) SDP

\begin{aligned} \underset{X, x}{minimize} \sum_{m = 1}^{M} γ_{m} {(Tr (X) - 2 c_{m}^{T} x + b_{m})}^{2} \\ subject to X - x x^{T} ≽ 0 \\ X \in S_{+}^{2} \end{aligned}

(14)

where $S_{+}^{2}$ is the set of 2 × 2 symmetric positive semi-definite matrices. As we are allowing for a larger feasible set, the optimal value of problem (14) would provide a lower bound on the optimal value of the original problem (12). However, if the optimal solution X* of (14) is of rank-one, we have that the semidefinite relaxation approach is not a relaxation at all and the found solution x* of (14) is also optimal for (12).

It would be interesting to give conditions under which (14) provides a rank-one solution so that the obtained solution is optimal for the original problem, too. To that end let define the matrix A as

A = [\begin{matrix} 2 \sum_{m} γ_{m} x_{m} & 2 \sum_{m} γ_{m} y_{m} & - \sum_{m} γ_{m} \\ 2 \sum_{m} γ_{m} x_{m}^{2} & 2 \sum_{m} γ_{m} y_{m} x_{m} & - \sum_{m} γ_{m} x_{m} \\ 2 \sum_{m} γ_{m} y_{m} x_{m} & 2 \sum_{m} γ_{m} y_{m}^{2} & - \sum_{m} γ_{m} y_{m} \end{matrix}]

(15)

and let the vector δ be given by

δ = [\begin{matrix} \sum_{m} γ_{m} b_{m} \\ \sum_{m} γ_{m} b_{m} x_{m} \\ \sum_{m} γ_{m} b_{m} y_{m} \end{matrix}]

(16)

We then define the following feasibility problem

\begin{matrix} find & {z, t} \\ subject to & {∥z∥}^{2} \leq t \\ A [\begin{matrix} z \\ t \end{matrix}] = δ \end{matrix}

(17)

with variables z ∈ ℝ² and t ∈ ℝ₊. The above problem is convex since it belongs to the class of second-order cone program (SOCP) [11]. Based on the feasibility problem (17) we can state the following result:

Proposition 1. Assume problem (12) has at least one strictly feasible point. If problem (17) is not feasible, then the optimal solution x* of the semidefinite relaxed problem (14) is also optimal for the original problem (12).

Proof. See the Appendix. □

Corollary 2. If matrix A is singular then, the solution X* of (14) is of rank one with X* = x*x*^Tand x* is also optimal for (12).

Proof. It follows directly from Proposition 1. If A is singular then, problem (17) is infeasible (because matrix A is not invertible) so that the relaxed problem is not a relaxation at all. □

It is worth to mention that the feasibility problem (17) can be easily checked without the requirement of an optimization solver. If matrix A is singular then, the problem is infeasible and we are done. If however, matrix A is full-rank, we compute A^{- 1}δ (which is unique) and check whether it satisfies the second order cone (SOC) constraint ||z|| ²≤ t. If the constraint is not met then we conclude that the problem is infeasible.

4 Distributed algorithm

One of the main advantages that offers the considered approach is that it allows for a distributed implementation. We assume that nodes communicate with their one-hop neighbors as dictated by the communication graph G(V, E), where V is the set of vertices and E is the set of edges of the graph. By only local communication exchange, nodes can agree to compute some desired (global) quantity using consensus algorithms [19].

Let's proceed by reformulating problem (14) into a SDP with a single matrix variable $Z \in S_{+}^{3}$ . For that purpose, let us formulate the following problem

\begin{matrix} \underset{Z}{minimize} \sum_{m = 1}^{M} γ_{m} {(Tr (M_{m} Z) + b_{m} - 1)}^{2} \\ subject to Z (3, 3) = 1 \\ Z \in S_{+}^{3} \end{matrix}

(18)

where $M_{m} = I - 2 [\begin{matrix} 0 & 0 \\ c_{m}^{T} & 0 \end{matrix}]$ and I is the identity matrix. With the above problem definition we have the following equivalence:

Lemma 1. The two problems (14) and (18) are equivalent. Further, if we denote Z* as the optimal solution to (18), then the optimal solution x* of (14) is given by x* = [Z*(3, 1), Z*(3, 2)]^T.

Proof. See the Appendix. □

Now that we have established the equivalence between problems (14) and (18) through Lemma 1 we show how to solve it in a distributed way. For that purpose we use the optimization framework for consensus-networked systems as proposed in [15] that uses an augmented Lagrangian approach [14]. The framework in [15] generalizes the previous work of [20] as it can also handle convex but not necessarily strictly convex objective functions. The augmented Lagrangian method adds a quadratic penalty term to the objective function that is zero at the optimal solution. The resulting problem is then equivalent to the original problem as both of them end up with the same solution. Augmented Lagrangian methods are also attractive because they offer better convergence properties than standard primal-dual decomposition methods. A detailed treatment about augmented Lagrangian methods and their properties can be found in [14].

In order to derive a distributed solution consider first the introduction of M new variables and a global consensus constraint into the problem as

\begin{matrix} \underset{Z, {Z_{m}}}{minimize} \sum_{m = 1}^{M} γ_{m} {(Tr (M_{m} Z_{m}) + b_{m} - 1)}^{2} \\ subject to Z (3, 3) = 1 \\ Z_{m} = Z \\ Z \in S_{+}^{3} \end{matrix}

(19)

The problem is now separable in the objective function (as it is the sum of M terms, each one dependent of one node) but we still have the coupling "consensus" constraint Z_m = Z. However, we do not need to impose that all nodes agree on the same quantity instead, we can only force nodes to agree with their one-hop neighbors. Let $N_{m}$ be the set of neighbors of node m, we can then reformulate the problem as

\begin{matrix} \underset{{Z_{m}}}{minimize} \sum_{m = 1}^{M} γ_{m} {(Tr (M_{m} Z_{m}) + b_{m} - 1)}^{2} \\ subject to Z_{m} (3, 3) = 1 \\ Z_{m} = Z_{j}, j \in N_{m} \\ Z_{m} \in S_{+}^{3} \end{matrix}

(20)

The two problems (19) and (20) are equivalent provided that the underlying graph is strongly connected [20]. We can now use the developed framework in [15] to derive an augmented Lagrangian method for the distributed solution of (20). Consider then, the introduction of the additional variables ${W_{m}}_{, j} \in S_{+}^{3}$ and formulate the equivalent problem

\begin{matrix} \underset{{Z_{m}}, {W_{m j}}}{minimize} \sum_{m = 1}^{M} γ_{m} {(Tr (M_{m} Z_{m}) + b_{m} - 1)}^{2} \\ subject to Z_{m} (3, 3) = 1 \\ Z_{m} = W_{m, j}, j \in N_{m} \\ Z_{j} = W_{m, j}, j \in N_{m} \\ Z_{m} \in S_{+}^{3} \end{matrix}

(21)

The penalized problem [14] can be written as

\begin{matrix} \underset{{Z_{m}}, {W_{m j}}}{minimize} \sum_{m = 1}^{M} γ_{m} {(Tr (M_{m} Z_{m}) + b_{m} - 1)}^{2} \\ + \frac{c}{2} \sum_{m = 1}^{M} \sum_{j \in N_{m}} (| | Z_{m} - W_{m, j} | |^{2} + | | Z_{j} - W_{m, j} | |^{2}) \\ subject to Z_{m} (3, 3) = 1 \\ Z_{m} = W_{m, j}, j \in N_{m} \\ Z_{m} = W_{m, j}, j \in N_{m} \\ Z_{m} \in S_{+}^{3} \end{matrix}

(22)

where c > 0 is a constant that controls the penalization of the disagreement among neighbors. In general, c could be a sequence as long as it is non-decreasing. The choice of c has a direct impact on the rate of convergence of the distributed algorithm [14]. There is no general rule to choose c and its value will vary depending on the problem at hand. It becomes clear from the formulation of problem (22) that the penalty term is zero at the optimum and, therefore the optimal solution to (22) is also optimal for (20). We can now find a solution of (22) by solving its dual problem. By relaxing the consensus constraint we form the partial augmented Lagrangian L_c as

\begin{aligned} L_{c} ({Γ_{m, j}}, {Φ_{m, j}}, {Z_{m}}, {W_{m, j}}) & = \sum_{m = 1}^{M} γ_{m} {(Tr (M_{m} Z_{m}) + b_{m} - 1)}^{2} \\ + \sum_{m = 1}^{M} \sum_{j \in N_{m}} Tr (Γ_{m, j} (Z_{m} - W_{m, j})) \\ + \sum_{m = 1}^{M} \sum_{j \in N_{m}} Tr (Φ_{m, j} (Z_{j} - W_{m, j})) \\ + \frac{c}{2} \sum_{m = 1}^{M} \sum_{j \in N_{m}} (| | Z_{m} - W_{m, j} | |^{2} + | | Z_{j} - W_{m, j} | |^{2}) \end{aligned}

(23)

where Γ_{m, j}and Φ_{m, j}are the Lagrange multipliers. We then have that the dual problem is given by

\begin{matrix} \underset{{Γ_{m, j}}, {Φ_{m, j}}}{maximize} inf_{{Z_{m}}, {W_{m, j}}} L_{c} ({Γ_{m, j}}, {Φ_{m, j}}, {Z_{m}}, {W_{m, j}}) \\ subject to Z_{m} (3, 3) = 1 \\ Z_{m} \in S_{+}^{3} \end{matrix}

(24)

The problem is now separable and strictly convex which allows its solution by alternating the minimization over Z_m and W_{m, j}as

\begin{gathered} \begin{matrix} Z_{m}^{(k + 1)} = \underset{Z m}{minimize} L_{c} ({Γ_{m, j}^{(k)}}, {Φ_{m, j}^{(k)}}, {Z_{m}}, {W_{m, j}^{(k)}}) \\ subject to Z_{m} (3,3) = 1 \\ Z_{m} \in S_{+}^{3} \end{matrix} \end{gathered}

(25)

W_{m, j}^{(k + 1)} = \underset{W_{m, j}}{minimize} L_{c} ({Γ_{m, j}^{(k)}}, {Φ_{m, j}^{(k)}}, {Z_{m}^{(k + 1)}}, {W_{m, j}})

(26)

and then performing an update of the Lagrange multipliers using a subgradient step

Γ_{m, j}^{(k + 1)} = Γ_{m, j}^{(k)} + c (Z_{m} - W_{m, j})

(27)

Φ_{m, j}^{(k + 1)} = Φ_{m, j}^{(k)} + c (Z_{j} - W_{m, j})

(28)

where the superscript ^(k)denotes the k th iteration. Following the same steps as in [15], it can be shown that

Γ_{m, j}^{(k + 1)} = - Φ_{m, j}^{(k + 1)}

(29)

W_{m, j}^{(k + 1)} = \frac{1}{2} (Z_{m}^{(k + 1)} + Z_{j}^{(k + 1)})

(30)

Let Δ_{m, j} = Γ_{m, j} = -Φ_{m, j} and define Ψ_m = Δ_{m, j} - Δ _{j, m}. Based on the above results it can be easily shown that the solution to problem (24) is obtained in a distributed way by alternating between the following two updates

\begin{matrix} Z_{m}^{(k + 1)} = \underset{Z_{m}}{minimize} γ_{m} {(Tr (M_{m} Z_{m}) + b_{m} - 1)}^{2} \\ + Tr (Ψ_{m} Z_{m}) + c \sum_{j \in N_{m}} | | Z_{m} - \frac{1}{2} (Z_{m}^{(k)} - Z_{j}^{(k)}) | |^{2} \\ subject to Z_{m} (3, 3) = 1 \\ Z_{m} \in S_{+}^{3} \end{matrix}

(31)

and

Ψ_{m}^{(k + 1)} = Ψ_{m}^{(k)} + c \sum_{j \in N_{m}} (Z_{m}^{(k + 1)} - Z_{j}^{(k + 1)})

(32)

with $Ψ_{m}^{(0)} = 0$ and m = 1, . . . , M.

The network will then operate as follows: At the beginning of the k th iteration, each node locally solves (31). Then, nodes broadcast the computed estimates $Z_{m}^{(k + 1)}$ to their neighbors. With the local estimates of the corresponding neighbors at hand, each node updates its multipliers as in (32). The process is repeated until all nodes converge to the same solution which, in turn would be the same as in the centralized case.

5 Approaching the ML estimate

So far, we have shown how to solve (11) by formulating the relaxed problem (14). We have also provided conditions under which the solutions to (11) and (14) coincide. Further, the solution can be computed in a distributed fashion using convex optimization tools. However, the performance of the followed approach in (11) is below that of the ML estimate (6). In order to come closer to the ML solution we could perform an additional local search that improves the obtained estimate through the solution of (14). The idea is to run a distributed optimization routine, taking the solution of (14) as the starting point, to solve for (6). If the previously computed estimate by solving (14) is close to the ML estimate we may converge to it by optimizing in the neighborhood of the solution of (14), otherwise we will converge to a local optima but still improving performance.

Observe that the ML estimate (6) can be cast into a non-linear least-squares problem of the form of (11). Assuming that ∑ is positive definite, we can write the ML estimation problem (6) as the following unconstrained optimization problem

{\hat{x}}_{ML} = \underset{x}{minimize} | | f_{ML} (x) | |^{2},

(33)

where f_ML(x) = S (r - μ (x)) and S is the Cholesky factorization of the inverse covariance matrix, i.e., S^TS = ∑^-1. A local minimum of the above non-linear least-squares problem (33) can be found using an iterative descent algorithm like the Gauss-Newton method [11, 21]. The standard (centralized) Gauss-Newton procedure is given in Algorithm 1, where h_gn represents the descent direction (i.e., direction that reduces the value of the cost function) and J^(k)= J(x^(k)) with J(x) ∈ ℝ^M×2representing the Jacobian matrix of $f_{ML} (x) = {[f_{1}^{ML} (x), \dots, f_{M}^{ML} (x)]}^{T}$ whose entries are given by

{[J (x)]}_{i j} = \frac{\partial f_{i}^{ML}}{\partial x_{j}} (x), i = 1, \dots, M, j = 1, 2 .

(34)

Algorithm 1 Gauss-Newton method

1: x⁽⁰⁾← x₀k = 0 {Initialization}

2: while ! found & k < k_maxdo

3: h_gn← - (J^(k)TJ^(k))^{- 1}J^(k)Tf_ML (x^(k)) {Descent direction}

4: if ∄ h_gnthen

5: found = true

6: end if

7: x^(k+1)← x^(k)+ h_gn {Update}

8: k ← k + 1

9: end while

If the covariance matrix ∑ has no special structure, then the problem (33) requires a central entity that gathers all the information coming from the nodes in order to solve it. However, it is reasonable to assume independence of the noise processes among the nodes so that ∑ has a diagonal structure, say $\sum = diag (σ_{1}^{2}, . . ., σ_{M}^{2})$ . In that case, matrix S = diag(1/σ₁, . . . , 1/σ_M ) is also diagonal and we can exploit the problem structure in order to find a distributed implementation of the Gauss-Newton procedure given in Algorithm 1. Note that for finding a distributed implementation of Algorithm 1, it suffices to find a way to compute the descent search direction h_gn is a distributed fashion. For such purpose, let first note that J(x) has a block-wise structure given by

J (x) = [\begin{matrix} \frac{- 10 α_{1}}{σ_{1} | | x - c_{1} | |^{2}} {(x - c_{1})}^{T} \\ ⋮ \\ \frac{- 10 α_{M}}{σ_{M} | | x - c_{M} | |^{2}} {(x - c_{M})}^{T} \end{matrix}] = [\begin{matrix} J_{1} (x) \\ ⋮ \\ J_{M} (x) \end{matrix}] .

(35)

Based on the block-wise structure of matrix J(x) it is easy to note that

J {(x)}^{T} J (x) = \sum_{m} J_{m} {(x)}^{T} J_{m} (x)

(36)

J {(x)}^{T} f_{ML} (x) = \sum_{m} J {(x)}^{T} f_{m}^{ML} (x)

(37)

and therefore, the above quantities can be computed in a distributed fashion by means of average consensus [19]. Once we have computed the products (36) and (37), it is straightforward to compute the descent search direction h_gn.

Based on these observations we propose here a fully distributed algorithm, shown as Algorithm 2, which asymptotically approaches the same result as in the centralized case using only local information and the exchange of low-volume intermediate results within each node's 1-hop neighborhood. We immediately note that the Steps 3-6 and 11-12 can all be performed locally by each node. The only communication occurs in the Steps 8 and 9 via standard average consensus Algorithms [19]. Seeing how $Δ_{n}^{(k)} \in R^{2 \times 2}$ and is symmetric, and $γ_{n}^{(k)} \in ℝ^{2}$ , we conclude that each consensus round requires a broadcast of only five real values.

Algorithm 2 Distributed Gauss-Newton localization

1: ${\hat{x}}^{(0)}$ ← same initial value ∀m ∈ M

2: for k = 0 to K - 1 do

3: $J_{m}^{(k)} \leftarrow \frac{- 10 α_{m}}{σ_{m} {∥{\hat{x}}^{(k)} - c_{n}∥}^{2}} {({\hat{x}}^{(k)} - c_{n})}^{T}$

4: $f_{m}^{ML} ({\hat{x}}^{(k)}) \leftarrow σ_{m}^{- 1} (r_{m} - p_{m} + 5 α_{m} {log}_{10} {∥{\hat{x}}^{(k)} - c_{n}∥}^{2} + 10 {log}_{10} d_{0})$

5: $Δ_{m}^{(k)} \leftarrow J_{m}^{(k) T} J_{m}^{(k)}$

6: $γ_{m}^{(k)} \leftarrow J_{m}^{(k) T} f_{m}^{ML} ({\hat{x}}^{(k)})$

7: begin consensus

8: $Δ_{*}^{(k)} \leftarrow \frac{1}{M} \sum_{m = 1}^{M} Δ_{m}^{(k)} = \frac{1}{M} J^{(k) T} J^{(k)}$

9: $γ_{*}^{(k)} \leftarrow \frac{1}{M} \sum_{m = 1}^{M} γ_{m}^{(k)} = \frac{1}{M} J^{(k) T} f_{ML} ({\hat{x}}^{(k)})$

10: end consensus

11: $h^{(k)} \leftarrow Δ_{*}^{{(k)}^{- 1}} γ_{*}^{(k)} = {(J^{(k) T} J^{(k)})}^{- 1} J^{(k) T} f_{ML} ({\hat{x}}^{(k)})$

12: ${\hat{x}}^{(k + 1)} \leftarrow {\hat{x}}^{(k)} + h^{(k)}$

13: end for

6 Numerical simulations

In this section, we provide several numerical examples in order to evaluate the performance of the proposed approach. For the simulations we consider a network of randomly deployed nodes over an area of 100 × 100 squared meters. We have used the same propagation model for all the nodes with reference power p_m = - 40 dB at reference distance d₀ = 1 m and path-loss exponent α_m = 2 for m = 1, . . . , M. We further assume that the noise processes are independent and identically distributed with $n_{m} ~ N (0, σ_{dB}^{2})$ for all m.

In Figure 3, we have simulated the distributed localization task using a network of 50 nodes (see Figure 1) with a randomly located target. We have performed distributed estimation of the target's position by the alternating between (31) and (32) with penalty parameter c = 0.05. We have plotted in Figure 3 the error between each node's local estimate and the centralized solution of the problem. As it can be appreciated, the distributed algorithm converges to the optimal centralized solution as the number of iterations increases.

We have also simulated the localization task over the same network of Figure 1. For the propagation model the measurement noise variance has been set to $σ_{dB}^{2} = 9$ . We have evaluated the performance of the proposed approach with and without weights (labeled SDP and wSDP, respectively) over 1000 random target locations (test points). We compare our approach with Multilateration localization approach [2]. As shown in [2], if one node is chosen as a reference, then the localization problem can be linearized. Therefore, it allows its distributed implementation by means of consensus since it can be cast into a least-squares problem where each node locally contributes to the global cost function. Note, however that nodes must first agree on a common reference. For the simulations, we have chosen the node with the closest distance estimate to be the reference. We also provide the performance of the ML estimate for comparison purposes. The empirical cumulative distribution function (CDF) of the localization error is represented in Figure 4. As it can be observed in Figure 4, the performance of the proposed scheme outperforms that of Multilateration. Further, we observe that the use of weights improves the localization accuracy of the algorithm so that we come closer to the ML estimate.

The performance of the algorithm has also been tested for different values of the measurement noise variance. The results are displayed in Figure 5 where the average error over 1000 locations is depicted as a function of the measurement noise standard deviation. Again, we can appreciate the performance improvement of the proposed approach compared to multilateration as can be seen in Figure 5. We have also displayed the results when combining the proposed distributed localization approach with a local search (wSDP+local). The local search is performed in a distributed fashion following the steps in Algorithm 2. As it can be observed the results of such combination provide close to ML performance. This implies that our method is capable of providing good estimates that could be used to run a local solver in order to come close to the ML estimate. Although not guaranteed to converge to the ML solution, the local search can only provide better estimates (in the ML sense).

7 Conclusions

We have presented a distributed localization approach over sensor networks using consensus and convex optimization. An alternative problem to the ML position estimation problem has been proposed based on local ML distance estimates at each node. In order to circumvent the non-convexity of the problem, semidefinite relaxation technique has been employed and conditions that guarantee zero gap between the relaxed and the original problem have been given. A distributed algorithm based on an augmented Lagrangian approach using primal-dual decompositions have been proposed and it has been shown to converge to the centralized solution. The approach is suitable for its real implementation in WSN as it is scalable, robust against changes in topology and energy efficient by the use of only local broadcast-type communication among nodes. Another interesting property of the proposed algorithm is that it allows the introduction of additional convex constraints to the localization problem in a straightforward manner.

The proposed algorithm is intended to be usable in real networks and its suitability in terms of accuracy would be determined by the application at hand. However, if higher accuracy is required, we could run an additional optimization step around the found solution. We have verified by means of simulations that the combination of our suboptimal method with a local search provides a localization error close to the ML estimate.

It is worth to mention that the proposed approach has a direct application to distributed tracking in WSN's as well. The tracking procedure would be based on the jointly estimated target's position. As all nodes share the same estimate, they could use that estimate to locally run a tracking filter in order to follow the movement of the target.

Appendix 1: Proof of Proposition 1

To show the validity of Proposition 1, consider first the Lagrangian of (14) which is given by

L (X, x) = \sum_{m = 1}^{M} γ_{m} {(Tr (X) - 2 c_{m}^{T} x + b_{m})}^{2} - Tr (Ψ (X - x x^{T}))

(38)

with Ψ ≽ 0 being the Lagrange multipliers. Since problem (14) is convex and there exists, by assumption, at least a strictly feasible point, Slater's constraint qualifications are satisfied and therefore, strong duality holds. Moreover, from duality theory we have that, at the optimum, the derivative of the Lagrangian with respect to X and x must be zero, that is

\nabla_{X} L (X, x) = 2 \sum_{m = 1}^{M} γ_{m} (Tr (X) - 2 c_{m}^{T} x + b_{m}) I - Ψ = 0

(39)

and

\nabla_{x} L (X, x) = - 4 \sum_{m = 1}^{M} γ_{m} (Tr (X) - 2 c_{m}^{T} x + b_{m}) c_{m} - 2 Ψ x = 0

(40)

From Equation (39) it becomes clear that Ψ must be a diagonal matrix. This fact, together with the complementary slackness condition

Tr (Ψ (X - x x^{T})) = 0

(41)

implies that the off-diagonal elements of X must equal those of xx^T. However, this does not necessarily mean that X is of rank one. From the complementary slackness condition (41) we have that X will equal xx^T whenever the constraint is active (i.e., Ψ ≠ 0). So by finding under which conditions Ψ ≠ 0 we will find the conditions that guarantee that the solution of (14) coincides with the solution of the original problem (12). For that purpose, if we set Ψ = 0 we have from (39) and (40) that

\sum_{m = 1}^{M} γ_{m} (Tr (X) - 2 c_{m}^{T} x + b_{m}) = 0

(42)

\sum_{m = 1}^{M} γ_{m} (Tr (X) - 2 c_{m}^{T} x + b_{m}) c_{m} = 0

(43)

Let t = Tr(X) and z = [z₁, z₂]^T = x, keeping in mind that c_m = [x_m , y_m ], we can rewrite the above equations as

2 (\sum_{m} γ_{m} x_{m}) z_{1} + 2 (\sum_{m} γ_{m} y_{m}) z_{2} - (\sum_{m} γ_{m}) t = \sum_{m} γ_{m} b_{m}

(44)

2 (\sum_{m} γ_{m} x_{m}^{2}) z_{1} + 2 (\sum_{m} γ_{m} y_{m} x_{m}) z_{2} - (\sum_{m} γ_{m} x_{m}) t = \sum_{m} γ_{m} b_{m} x_{m}

(45)

2 (\sum_{m} γ_{m} x_{m} y_{m}) z_{1} + 2 (\sum_{m} γ_{m} y_{m}^{2}) z_{2} - (\sum_{m} γ_{m} y_{m}) t = \sum_{m} γ_{m} b_{m} y_{m}

(46)

which can be expressed in a compact way as

A [\begin{matrix} z \\ t \end{matrix}] = δ

(47)

Therefore, Ψ will be equal to 0 only if (47) has a solution where x = z and Tr(X) = t. Additionally, we have that for the solution to be a feasible point it must be satisfied that Tr(X - xx^T) ≥ 0 which implies that Tr(X) ≥ ||x||² or, equivalently ||z||²≤ t. This implies that if problem (17) is infeasible, then Ψ ≠ 0 and hence, X = xx^T so that the solution of the relaxed problem (14) coincides with that of the original problem (12), which proves Proposition 1.

■

Appendix 2: Proof of Lemma 1

By the Schur's complement we have that

X - x x^{T} ≽ 0 \Leftrightarrow [\begin{matrix} X & x \\ x^{T} & 1 \end{matrix}] ≽ 0

(48)

Let introduce a new variable $Z = [\begin{matrix} X & x \\ x^{T} & 1 \end{matrix}]$ . We then have that Tr(X) = Tr(Z) - 1 and that

c_{m}^{T} x = [0^{T} 1] Z [\begin{matrix} c_{m} \\ 0 \end{matrix}] = Tr (Z [\begin{matrix} 0 & 0 \\ c_{m}^{T} & 0 \end{matrix}])

(49)

By rearranging terms and the previous conditions on Z we end up with problem (18) and the equivalence is established. The equivalence between the two solutions follows directly from the definition of Z.

■

References

Sayed AH, Tarighat A, Khajehnouri N: Network-based wireless location. IEEE Signal Process Mag 2005, 22(4):24-40.
Article Google Scholar
Patwari N, Ash JN, Kyperountas S, Hero AO, Moses RL, Correal NS: Locating the nodes. IEEE Signal Process Mag 2005, 22(4):54-68.
Article Google Scholar
Arulampalam MS, Maskell S, Gordon N, Clapp T: A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking. IEEE Trans Signal Process 2002, 50(2):174-188. 29 10.1109/78.978374
Article Google Scholar
Zàruba GV, Huber M, Kamangar FA, Chlamtac I: Indoor location tracking using RSSI readings from a single Wi-Fi access point. Wirel Netw 2007, 13(2):221-235. 10.1007/s11276-006-5064-1
Article Google Scholar
Tai Y, Bo Y: Collaborative target tracking in wireless sensor network. Proc 9th Int Conf Electronic Measurement & Instruments ICEMI'09, 16-19 Aug 2009; Beijing 2009, 2-1005-2-1010.
Google Scholar
Wu H, Tian G, Huang B: Multi-robot collaborative localization methods based on Wireless Sensor Network. Proc IEEE Int Conf Automation and Logistics ICAL,1–3Sep 2008; Qindao 2008, 2053-2058.
Chapter Google Scholar
Ren H, Meng MQH: Power adaptive localization algorithm for wireless sensor networks using particle filter. IEEE Trans Veh Technol 2009, 58(5):2498-2508.
Article Google Scholar
Aounallah F, Amara R, Alouane MTH: Particle filtering based on sign of innovation for tracking a jump Markovian motion in a binary WSN. Proc Third Int Conf Sensor Technologies and Applications SENSORCOMM'09, 18-23 Jun 2009; Athens 2009, 252-255.
Chapter Google Scholar
Lui KWK, Ma WK, So HC, Chan FKW: Semi-definite programming algorithms for sensor network node localization with uncertainties in anchor positions and/or propagation speed. IEEE Trans Signal Process 2009, 57(2):752-763.
Article MathSciNet Google Scholar
Luo Z, So AMC, Ye Y, Zhang S: Semidefinite relaxation of quadratic optimization problems. IEEE Signal Process Mag 2010, 27(3):20-34.
Article Google Scholar
Boyd S, Vandenberghe L: Convex Optimization. Cambridge University Press, New York; 2004.
Book MATH Google Scholar
Blatt D, Hero AO: Energy-based sensor network source localization via projection onto convex sets. IEEE Trans Signal Process 2006, 54(9):3614-3619.
Article Google Scholar
Shi Q, He C: A new incremental optimization algorithm for ML-based source localization in sensor networks. IEEE Signal Process Mag 2008, 15: 45-48.
Article Google Scholar
Bertsekas DP: Constrained optimization and Lagrange Multiplier Methods. Academic Press, New York; 1982.
MATH Google Scholar
Li J, Elhamifar E, Wang IJ, Vidal R: Consensus with robustness to outliers via distributed optimization. Proc 49th IEEE Conf Decision and Control (CDC), 15-17 Dec 2010; Atlanta 2010, 2111-2117.
Google Scholar
Patwari N, Hero IAO, Perkins M, Correal NS, O'Dea RJ: Relative location estimation in wireless sensor networks. IEEE Trans Signal Process 2003, 51(8):2137-2148. 10.1109/TSP.2003.814469
Article Google Scholar
Bejar B, Belanovic P, Zazo S: Distributed Gauss-Newton method for localization in Ad-Hoc networks. 43rd Asilomar Conference on Signals, Systems, and Computers, 7-10 Nov 2010; Pacific Grove 2010, 1452-1454.
Google Scholar
Bejar B, Belanovic P, Zazo S: Distributed consensus-based tracking in wireless sensor networks: a practical approach. Proceedings of the European Signal Processing Con- ference, EUSIPCO, 29 Aug - 1 Sep 2011; Barcelona 2011, 2019-2023.
Google Scholar
Olfati-Saber R, Murray R: Consensus problems in networks of agents with switching topology and time-delays. IEEE Trans Autom Control 2004, 49(9):1520-1533. 10.1109/TAC.2004.834113
Article MathSciNet Google Scholar
Rabbat MG, Nowak RD, Bucklew JA: Generalized consensus computation in networked systems with erasure links. Proc IEEE 6th Workshop Signal Process Adv Wirel Commun, 5 - 8 Jun 2005; New York 1088-1092.
Google Scholar
Madsen K, Nielsen HB, Tingleff O:Methods for Non-Linear Least Squares Problems. Richard Petersens Plads, Kgs. Lyngby; 2004. [http://www2.imm.dtu.dk/pubdb/p.php?660]
Google Scholar

Download references

Acknowledgements

The authors would like to thank the anonymous reviewers for their useful comments and suggestions that lead to a significant improvement in the clarity of exposition and motivation of the present study. This study was supported in part by the Spanish Ministry of Science and Innovation under the grant TEC2009-14219-C03-01; El Consejo Social de la UPM; the Spanish Ministry of Science and Innovation in the program CONSOLIDER-INGENIO 2010 under the grant CSD2008-00010 COMONSENS; the European Commission under the grant FP7-ICT-2009-4-248894-WHERE-2; the European Commission under the grant FP7-ICT-223994-N4C and the Spanish Ministry of Science and Innovation under the complementary action grant TEC 2008-04644-E; Spanish Ministry of Science and Innovation under the grant TEC2010-21217-C02-02-CR4HFDVL.

Author information

Authors and Affiliations

Universidad Politécnica de Madrid, Dpto. Señales, Sistemas y Radiocomunicaciones, E.T.S.I. de Telecomunicación C-303, 28040, Madrid, Spain
Benjamín Béjar & Santiago Zazo

Authors

Benjamín Béjar
View author publications
You can also search for this author in PubMed Google Scholar
Santiago Zazo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Benjamín Béjar.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

BB proposed and developed the solution to the problem, derived the distributed algorithm for its computation and run the numerical simulations. SZ conceived the study and helped in the draft of the manuscript. All authors read and approved the manuscript.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Béjar, B., Zazo, S. A practical approach for outdoors distributed target localization in wireless sensor networks. EURASIP J. Adv. Signal Process. 2012, 95 (2012). https://doi.org/10.1186/1687-6180-2012-95

Download citation

Received: 15 May 2011
Accepted: 01 May 2012
Published: 01 May 2012
DOI: https://doi.org/10.1186/1687-6180-2012-95

A practical approach for outdoors distributed target localization in wireless sensor networks

Abstract

1 Introduction

Notation

2 Problem formulation and definitions

3 Localization strategies

3.1 ML estimation

3.2 Practical approach

4 Distributed algorithm

5 Approaching the ML estimate

6 Numerical simulations

7 Conclusions

Appendix 1: Proof of Proposition 1

Appendix 2: Proof of Lemma 1

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors' contributions

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

About this article

Cite this article

Share this article

Keywords