Collaborative emitter tracking using Rao-Blackwellized random exchange diffusion particle filtering

Bruno, Marcelo G S; Dias, Stiven S

doi:10.1186/1687-6180-2014-19

Research
Open access
Published: 13 February 2014

Collaborative emitter tracking using Rao-Blackwellized random exchange diffusion particle filtering

Marcelo G S Bruno¹ &
Stiven S Dias^1,2

EURASIP Journal on Advances in Signal Processing volume 2014, Article number: 19 (2014) Cite this article

1953 Accesses
20 Citations
Metrics details

Abstract

We introduce in this paper the fully distributed, random exchange diffusion particle filter (ReDif-PF) to track a moving emitter using multiple received signal strength (RSS) sensors. We consider scenarios with both known and unknown sensor model parameters. In the unknown parameter case, a Rao-Blackwellized (RB) version of the random exchange diffusion particle filter, referred to as the RB ReDif-PF, is introduced. In a simulated scenario with a partially connected network, the proposed ReDif-PF outperformed a PF tracker that assimilates local neighboring measurements only and also outperformed a linearized random exchange distributed extended Kalman filter (ReDif-EKF). Furthermore, the novel ReDif-PF matched the tracking error performance of alternative suboptimal distributed PFs based respectively on iterative Markov chain move steps and selective average gossiping with an inter-node communication cost that is roughly two orders of magnitude lower than the corresponding cost for the Markov chain and selective gossip filters. Compared to a broadcast-based filter which exactly mimics the optimal centralized tracker or its equivalent (exact) consensus-based implementations, ReDif-PF showed a degradation in steady-state error performance. However, compared to the optimal consensus-based trackers, ReDif-PF is better suited for real-time applications since it does not require iterative inter-node communication between measurement arrivals.

1 Introduction

In several engineering applications, e.g., target tracking or fault detection, multiple agents[1] that are physically dispersed over remote nodes on a network cooperate to execute a global task, e.g., estimating a hidden signal or parameter, without relying on a global data fusion center. Each network node is normally equipped with one or more sensors that generate local measurements and can process those measurements independently of the rest of the network. At the same time, however, the network nodes are also able to communicate with each other in order to build in a collaborative fashion a joint estimate of the hidden signals or parameters of interest that depends both on local and remote measurements. Ideally, that joint estimate should be equal to or, at least approximate the optimal global estimate that would be generated by a centralized processor with access to all network measurements.

Most of the previous literature in distributed signal processing on networks is based on linear estimation methods. Specifically, distributed versions of the Kalman filter were proposed e.g., in[2–4] to track unknown state vectors in linear, Gaussian state-space models. In situations, however, where the state dynamic model or the sensor observation models are nonlinear, the posterior distribution of the states conditioned on the network measurements becomes non-Gaussian (even with Gaussian sensor noise) and, therefore, the linear minimum mean square error (LMMSE) estimate of the states provided, e.g., by an extended Kalman filter (EKF) may differ from the true minimum mean square error (MMSE) estimate given by the expected value of the state vector conditioned on the measurements. In this paper in particular, we focus specifically on an application where multiple passive received-signal-strength (RSS) sensors jointly track a moving emitter assuming, at each network node, nonlinear observation models with possibly unknown static parameters.

1.1 Distributed particle filtering

In nonlinear scenarios, an alternative to approximate the true MMSE estimate is to use a sequential Monte Carlo method like particle filters[5, 6]. Several distributed particle filters have been proposed recently, see a comprehensive review in[7], to handle nonlinear distributed estimation tasks. An important constraint in the design of a distributed estimation algorithm is, however, that most networks of practical interest are only partially connected, i.e., each node can only directly access neighboring nodes in its immediate vicinity according to the network topology. In particular, assuming conditional independence of the different sensor measurements given the state vector, a distributed particle filter (PF) normally requires the computation of a product of likelihood functions that depend on local data only[8]. To compute that product over the network in a fully distributed fashion and with local neighborhood inter-node communication only, previous references suggest using iterative average consensus[8], iterative Markov chain Monte Carlo move steps[9], or selective gossip algorithms[10]. Alternatively, we proposed in[11] to compute the likelihood product exactly in a finite number of iterations using either iterative minimum consensus[12] or flooding techniques[13]. However, both consensus or flooding-based solutions are very costly in terms of bandwidth requirements as they require multiple iterative inter-node communication between two consecutive sensor measurements. Previous works, e.g.[8, 14, 15], propose approximations aimed at reducing the communication cost, but, in all aforementioned schemes, processing and sensing at different time scales are still required.

1.2 Diffusion particle filtering

An alternative to circumvent the high communication cost of consensus algorithms is to use diffusion algorithms[16] which, contrary to the former, do not require multiple iterative inter-node communication between consecutive measurements. Diffusion algorithms are, however, suboptimal in the sense that they do not simulate at each time step the behavior of the optimal global estimator, but rather, at best, approximate the optimal global solution asymptotically over time.

In the distributed linear estimation literature, most diffusion schemes are based on convex combinations of Kalman filters, see e.g.,[3]. Kar et al. proposed in[2] a different approach based on random information dissemination. In a previous conference paper[17], we introduced the random exchange diffusion particle filter (ReDif-PF), which generalizes and extends the methodology in[2] to a PF framework by basically using random information dissemination to build at each network node different Monte Carlo representations of the posterior distribution of the states conditioned on random sets of measurements coming from the entire network. Reference[17] assumed, however, that the parameters of the sensor observation model were perfectly known. In this paper, we extend the algorithm to a scenario with unknown parameters and derive in detail a Rao-Blackwellized[18] version of the ReDif-PF. In the specific application under consideration in the paper, the unknown parameters are the sensor variances, but most of the methodology in the derivation of the RB ReDif-PF is general and could be easily adapted to other signal models and applications provided that, in a fully Bayesian framework, the dynamic posterior probability distribution of the unknown parameters conditioned on the observations and on the simulated particles is a conjugate prior[19] for the likelihood function of the measurements.

An abbreviated description of the RB ReDif-PF may be found also in the short paper[20]. This paper consolidates and extends both[17] and[20] including detailed derivations and additional simulation results and comparisons. We also detail approximate versions of the RB ReDif-PF where we use Gaussian mixture models (GMM)[21] and moment-matching techniques inspired by[22] to reduce communication requirements.

1.3 Paper outline

The paper is divided into six sections and three appendices. Section 1 is the introduction. Section 2 describes the state and sensor models. Section 3 describes the centralized PF and also briefly reviews the equivalent broadcast, consensus, and flooding implementations introduced in[11]. Section 4 derives the ReDif-PF algorithm considering alternate scenarios with both known and unknown parameters. In the unknown parameter case, we derive in detail the Rao-Blackwellized version of the ReDif-PF and introduce approximate versions thereof that enable significant reductions in communication cost. The performance of the proposed algorithms is evaluated with simulated data in a realistic scenario with 25 sensors in Section 5. We compare the ReDif-PF algorithm in the unknown parameter scenario to the optimal centralized PF and its equivalent consensus implementations. In the known parameter case, we also compare the proposed ReDif-PF tracker to the Markov chain Monte Carlo distributed particle filter (MCDPF) in[9], to a linearized random exchange distributed EKF, which is a variation of the algorithm proposed in[2], and to a distributed bootstrap particle filter based on selective gossip as proposed in[23]. Finally, we present our conclusions in Section 6.

Appendices 1 and 2 show the proof of some key results in the paper, and Appendix 3 describes the ReDif-EKF algorithm used for comparison purposes in Section 5.

2 Problem setup

For simplicity of notation, we use lowercase letters in this paper to denote both random variables/vectors and real-valued samples of random variables/vectors with the proper interpretation implicit in context.

Without loss of generality, we assume that the emitter trajectory is described by the white noise acceleration model[24]

x_{n + 1} = F x_{n} + u_{n}

(1)

where $x_{n} ≜ {[x_{n} {\dot{x}}_{n} y_{n} {\dot{y}}_{n}]}^{T}$ is the hidden state vector at time step n consisting of the positions and velocities of the target’s centroid respectively in dimensions x and y; F is the state transition matrix; and {u_n} is a sequence of independent, identically distributed (i.i.d.) zero-mean Gaussian vectors with covariance matrix Q. Matrices F and Q, parameterized by the sampling period T and the acceleration noise $σ_{accel}^{2}$ , are detailed in[11, 24].

2.1 Observation model

Let $N (m, σ^{2})$ denote the Gaussian probability distribution with mean m and variance σ² and denote by $I G (a, b)$ the inverse-gamma probability distribution with parameters a and b. The measurements z_r,0:n = {z_r,0,…,z_r,n} in decibels relative to one milliwatt (dBm) at the r th node of a network of R RSS sensors are modeled as

z_{r, n} = g_{r} (x_{n}) + \sqrt{σ_{r}^{2}} v_{r, n},

(2)

where $v_{r, n} \sim N (0, 1), σ_{r}^{2} \sim I G (α, β), \forall r \in R ≜ {1, \dots, R}$ , and $\{x_{0}, \{u_{n}\}, \{v_{r, n}\}, \{σ_{r}^{2}\}\}$ are mutually independent for all n ≥ 0 and for all $r \in R$ . The nonlinear function g_r(•) in (2) is in turn given by[25]

g_{r} (x) = P_{0} - 10 ζ_{r} log (\frac{∥Hx - x_{r}∥}{d_{0}}),

(3)

where x_r represents the r th sensor position, ||.|| is the Euclidean norm, (P₀,d₀, ζ_r) are known model parameters (see[25] for details), and H is a 2 × 4 projection matrix such that H(1,1) = H(2,3) = 1 and H(i,j) = 0 otherwise. We also denote by N_r the set of nodes in the neighborhood of node r. The real-valued constants {α,β} are the model’s hyper-parameters.

Note that in (2), we take a fully Bayesian approach and model the unknown sensor noise variances $\{σ_{r}^{2}\}, r \in R$ , as random variables that are mutually independent for s ≠ r and identically distributed a priori with an inverse-gamma distribution.

2.2 Problem statement and goals

Let z_1:R,0:n denote the set {z_r,t} for all network nodes r = 1,…,R and all time instants t = 0,…,n. Given z_1:R,0:n, we want to compute the MMSE estimate

{\hat{x}}_{n | n} = E \{x_{n} | z_{1 : R, 0 : n}\}

(4)

at each instant n ≥ 0, where E{x_n|z_1:R,0:n} denotes the conditional expectation of x_n given z_1:R,0:n.

In the sequel, we first describe in Section 3 a recursive, centralized PF algorithm that approximates the desired global MMSE in (4) at each time step n in a scenario with unknown sensor variance scales $\{σ_{r}^{2}\}$ . Next, we review in Section 3.1 two fully distributed algorithms that operate on a partially connected network and allow exact in-network computation of the state estimate in (4) without a global data fusion center and with inter-node communication limited to a node’s immediate neighborhood according to the network topology. The network connectivity is described by a graph $G = (R, E)$ where $R = \{1, \dots, R\}$ is the set of nodes and the graph has an edge $(u, v) \in E, (u, v) \in R \times R$ if and only if nodes u and v can communicate directly with each other. The particular network graph used in the simulation scenarios in this paper is described in detail in Section 5.

Finally, we introduce in Section 4 a novel diffusion-based algorithm, which is also fully distributed and relies on local inter-node communication only specified as before by the network graph but, rather than yielding an identical estimate (4) at each node, obtains at each node r a suboptimal estimate

{\hat{x}}_{r, n | n} = E \{x_{n} | Z_{r, 0 : n}\},

(5)

where $Z_{r, 0 : n}$ is a random subset of z_1:R,0:n, which is different at each node r and includes measurements coming from random locations in the entire network, as opposed to measurements coming only from node r and its neighborhood. Compared to the exact distributed implementations of the optimal global estimate in Section 3.1, the diffusion solution in Section 4, although suboptimal, is designed to have a much lower inter-node communication cost and, therefore, is better suited for real-time applications.

3 Centralized particle filter

In a centralized architecture, all nodes in the network transmit their local measurements to a data fusion center which then runs a particle filter that approximates the MMSE estimate of the unknown state vector at each time instant n as

E \{x_{n} | z_{1 : R, 0 : n}\} \approx \sum_{q = 1}^{Q} w_{n}^{(q)} x_{n}^{(q)},

(6)

where $\{x_{n}^{(q)}\}, q \in Q ≜ {1, \dots, Q}$ , with the corresponding importance weights $\{w_{n}^{(q)}\}$ is a properly weighted Monte Carlo set[5, 6] that represents the posterior probability density function (PDF) p(x_n|z_1:R,0:n) in the sense that the sum on the right-hand side of (6) converges, according to some statistical criterion, to the expectation on the left-hand side when Q → ∞. The random samples $\{x_{n}^{(q)}\}$ , also called particles, are sequentially generated according to a proposal probability distribution specified by a so-called importance PDF $π (x_{n} | x_{0 : n - 1}^{(q)}, z_{1 : R, 0 : n})$ . If the blind importance function[5]

π (x_{n} | x_{0 : n - 1}^{(q)}, z_{1 : R, 0 : n}) = p (x_{n} | x_{n - 1}^{(q)})

is used, then it turns out that the proper importance weights must be updated according to the recursion[6]

w_{n}^{(q)} \propto w_{n - 1}^{(q)} p (z_{1 : R, n} | x_{0 : n}^{(q)}, z_{1 : R, 0 : n - 1})

(7)

where ∝ denotes ‘proportional to,’ z_1:R,n is an alternative notation for the set {z_r,n}, $r \in R$ , and the proportionality constant on the right-hand side of (7) is chosen such that $\sum_{q = 1}^{Q} w_{n}^{(q)} = 1$ . From the mutual independence assumptions in the model in Section 2, it follows that

p (z_{1 : R, 0 : n} | x_{0 : n}, σ_{1 : R}^{2}) = \prod_{r = 1}^{R} p (z_{r, 0 : n} | x_{0 : n}, σ_{r}^{2})

(8)

and

\begin{align} p (σ_{1 : R}^{2} | x_{0 : n}) & = \prod_{r = 1}^{R} p (σ_{r}^{2} | x_{0 : n}) \\ = \prod_{r = 1}^{R} p (σ_{r}^{2}) . \end{align}

(9)

From (8) and (9), it can be shown then that (see the proof in[11])

p (z_{1 : R, n} | x_{0 : n}^{(q)}, z_{1 : R, 0 : n - 1}) = \prod_{r = 1}^{R} \underset{λ_{r, n}^{(q)} (x_{n}^{(q)})}{\underset{⏟}{p (z_{r, n} | x_{0 : n}^{(q)}, z_{r, 0 : n - 1})}} .

(10)

Substituting now (10) into (7), the centralized weight update rule reduces to

w_{n}^{(q)} \propto w_{n - 1}^{(q)} \prod_{r = 1}^{R} \underset{λ_{r, n}^{(q)} (x_{n}^{(q)})}{\underset{⏟}{p (z_{r, n} | x_{0 : n}^{(q)}, z_{r, 0 : n - 1})}} .

(11)

3.1 Equivalent distributed implementation of the centralized particle filter

Note that each factor $λ_{r, n}^{(q)} (x_{n}^{(q)})$ in the product on the right-hand side of (11) depends only on local observations. In a fully connected network, assuming that all nodes $r \in R$ start out at instant n - 1 with the same particles $\{x_{n - 1}^{(q)}\}$ , they can all synchronously draw[26] new particles $\{x_{n}^{(q)}\}$ according to $p (x_{n} | x_{n - 1}^{(q)})$ , locally compute their own local likelihood functions $λ_{r, n}^{(q)} (x_{n}^{(q)})$ , and then broadcast them to the entire network until all nodes have all the remote likelihood functions and can compute the product on the right-hand side of (11). Synchronous multinomial resampling according to the global weights followed by regularization may follow (see[11]) to mitigate particle degeneracy and impoverishment[5, 6]. The algorithm described in this paragraph is referred to as the decentralized particle filter (DcPF) in[11] and[27].

As mentioned, however, in Section 1, real-world networks are only partially connected and fully distributed computations of the product in (11) are needed. One possibility is to approximate the product using iterative average consensus[28] as proposed, e.g., in[8] and[29]. Alternatively, we introduced in[11] a fully distributed computation of the global weights in (11) using either iterative minimum consensus[12] or flooding[13]. Both algorithms assume only local communication between nodes in immediate neighborhoods and, to achieve an exact computation of the global weights, require only a finite number of iterative message exchanges between nodes in the time interval between two consecutive sensor measurements.

Let D denote the diameter of the network graph, i.e., the maximum number of hops between any two nodes and, as before, denote by R the number of nodes in the network. By running R × D consecutive minimum consensus iterations[12] for each particle q, it is possible (see details in[11]) to build an identical ordered list of likelihood functions $\{λ_{r, n}^{(q)} (x_{n}^{(q)})\}, r \in R$ , at all nodes. Each node can then locally compute the product of the likelihoods as in (10) and obtain identical, optimal global importance weights $\{w_{n}^{(q)}\}$ . We refer to that (communication-intensive) minimum-consensus-based distributed tracking algorithm as CbPFa.

A more efficient way, however, to compute the exact optimal global weights at each node is to flood[13] the local node likelihoods over the network. Flooding protocols allow one to (iteratively) broadcast values over a network relying on local neighborhood inter-node communication only. Given a partially connected sensor network, one can simultaneously flood the R distinct likelihoods over the network as follows. First of all, each node r maintains an ordered list of distinct likelihoods. A likelihood in turn is flagged to indicate that it has not been sent to node r neighbors yet. Initially, the node r stores its local flagged likelihood in its list. At a given iteration, node r sends its lowest flagged likelihood to all neighbors and then unflags it. Conversely, it receives remote likelihoods from nodes s ∈ N_r. If a received remote likelihood is not included in node r’s list yet, it is inserted with a flag in its list. This procedure is guaranteed to converge in a finite number of iterations as soon as each node has R distinct values in its ordered list of likelihoods. We refer to the flooding-based iterative tracker in this paper as the CbPFb algorithm.

Figure1 illustrates how the proposed flooding protocol iteratively creates at each node r an ordered list comprising all likelihoods across the network in a toy example with three nodes where node 1 is connected to node 2, node 2 is connected to nodes 1 and 3, and node 3 is connected to node 2 only. A star symbol is employed to indicate which likelihoods are flagged in the ordered list maintained by each node r at a given iteration j.

Although optimal in the sense of reproducing the centralized solution, the minimum consensus and flooding algorithms in[11] are still communication-intensive due to the requirement of iterative inter-node communication between sensor measurement arrivals. In the next sections, we describe an alternative fully distributed diffusion-based solution that drops this requirement and is the main topic of this paper.

4 Random exchange diffusion particle filter

In this section, we derive an alternative distributed PF based on random information dissemination that extends the methodology in[2] to a Monte Carlo framework. We also present a Rao-Blackwellized version of the proposed distributed PF in a scenario with unknown sensor parameters.

Let $Z_{s, 0 : n - 1}$ denote the set of all network measurements assimilated by node s up to instant n - 1. Next, let $\{x_{s, 0 : n - 1}^{(q)}\}$ with associated weights $\{w_{s, n - 1}^{(q)}\}, q \in Q$ , be a properly weighted set that represents the posterior PDF $p (x_{0 : n - 1} | Z_{s, 0 : n - 1})$ at node s. Assume now that at instant n - 1, node s sends its particles and weights to a neighboring node r that can assimilate at instant n the measurements $Z_{r, n} = \{z_{i, n}\}$ , i ∈ {r} ∪ N_r. At instant n, the new particle set at node r, $x_{r, 0 : n}^{(q)} = (x_{s, 0 : n - 1}^{(q)}, x_{r, n}^{(q)})$ with updated weights $w_{r, n}^{(q)}$ such that

\begin{align} x_{r, n}^{(q)} \sim p (x_{n} | x_{s, n - 1}^{(q)}) \end{align}

(12)

\begin{align} w_{r, n}^{(q)} \propto w_{s, n - 1}^{(q)} p (Z_{r, n} | x_{r, 0 : n}^{(q)}, Z_{s, 0 : n - 1}) \end{align}

(13)

is now a properly weighted set to represent the updated posterior $p (x_{0 : n} | Z_{r, n}, Z_{s, 0 : n - 1})$ , where ${Z_{r, n}, Z_{s, 0 : n - 1}}$ is redefined as $Z_{r, 0 : n}$ . Resampling from the particle weights followed by regularization may be added to combat particle degeneracy and restore particle diversity, i.e., for $q \in Q$ (see also[11]):

Draw l^(q) from {1,2,…,Q} with $Pr ({l^{(q)} = l}) = w_{r, n}^{(l)}$ , where P r(A) denotes the probability of an event A.
Make ${\bar{x}}_{r, n}^{(q)} = x_{r, n}^{(l^{(q)})} + h D_{n} x^{*}$ , where $x^{*} \sim N (0, I), D_{n} D_{n}^{T}$ is equal to the empirical covariance of the weighted particles $\{(x_{r, n}^{(q)}, w_{r, n}^{(q)})\}$ , and h > 0 is an empirically adjusted parameter.
Reset the particle weights $w_{r, n}^{(q)}$ to $\frac{1}{Q}$ and make $x_{r, n}^{(q)} = {\bar{x}}_{r, n}^{(q)}$ .

Random exchange protocol In order to build, at each instant n and at each node r, different Monte Carlo representations of the posterior distribution conditioned on different sets of observations $Z_{r, 0 : n}$ coming from random locations in the entire network, it suffices to implement a protocol where each node r, starting from instant zero, exchanges its particles and weights with a randomly chosen neighboring node s, propagates the received particles using the blind importance function as in (12), and then updates their weights as in (13).

Figure2 illustrates the evolution of the marginal posterior at each node - in a linear network containing three nodes running the random exchange protocol - over four time instants. Initially, each node r ∈ {1,2,3} has a posterior at instant zero conditioned on the measurements $Z_{r, 0} = {z_{i, 0}}, i \in {r} \cup N_{r}$ , in its vicinity only. At each time instant n ∈ {1,2,3}, network nodes perform the sequence of random exchanges as indicated in the rightmost column of Figure2 and, then, update the received posterior by assimilating measurements in their respective neighborhoods.

Note that in the linear network topology shown in Figure2, node 2 always performs two random exchanges at each time instant n. Generally speaking, however, at a given instant n, a node r exchanges its parameters at least one time with a randomly chosen neighbor s and, in the worst case, performs d(r) random exchanges between two measurement arrivals with nodes in its vicinity, where d(r) is the degree of node r, i.e., the number of neighbor nodes.

Unlike randomized gossip algorithms[30], this procedure diffuses information by randomly propagating posterior statistics across the network. More specifically, as the initial posterior statistics provided by a given node r₀ at time 0 follows a path $P ≜ {r_{0}, r_{1}, \dots, r_{n}}$ along the network, it assimilates the available measurements $Z_{r, n}$ in the neighborhood of each visited node $r \in P$ . Since, as illustrated in Figure2, the initial posteriors at each node follow different paths, the posterior available at node r_n at time n will be different from those in the remaining nodes. Thus, network nodes will provide different estimates conditioned on distinct sets of measurements.

4.1 ReDif-PF with known sensor variances

If the parameters of the sensor observation model at each node r are deterministic and perfectly known, then

p (Z_{r, n} | x_{r, 0 : n}^{(q)}, Z_{s, 0 : n - 1}) = \prod_{i \in {r} \cup N_{r}} p (z_{i, n} | x_{r, n}^{(q)}) .

(14)

At instant n, then, upon receiving $(w_{s, n - 1}^{(q)}, x_{s, n - 1}^{(q)}), q \in Q$ , from node s, the particle filter at node r samples as before

x_{r, n}^{(q)} \sim p (x_{n} | x_{s, n - 1}^{(q)}) q = 1, \dots, Q

and updates its weights as

w_{r, n}^{(q)} \propto w_{s, n - 1}^{(q)} \prod_{i \in {r} \cup N_{r}} p (z_{i, n} | x_{r, n}^{(q)}) q = 1, \dots, Q

(15)

where $z_{i} | x_{r, n}^{(q)} \sim N (g_{i} (x_{r, n}^{(q)}), σ_{i}^{2})$ .

Inter-node transmission requirements From the previous discussion, it follows that in the scenario with known variances at each instant n, it suffices for each node s to transmit to the chosen neighbor r the set of particles ${x_{s, n - 1}^{(q)}}$ (4Q real numbers for a four-dimensional state space) and the respective set of importance weights ${w_{s, n - 1}^{(q)}}$ (Q real numbers). In addition, node s also sends its scalar observation z_s,n and the known observation model parameters $(ζ_{s}, x_{s}, σ_{s}^{2})$ (see (3)) to all nodes i in the neighborhood of s.

4.2 Rao-Blackwellized ReDif-PF with unknown sensor variances

Let I G(σ²|α,β) denote the PDF of a continuous random variable σ² with an inverse-gamma distribution specified by the parameters α and β, i.e.[19],

IG (σ^{2} | α, β) = \frac{β^{α}}{Γ (α)} σ^{- 2 (α + 1)} exp (- \frac{β}{σ^{2}}) σ^{2} > 0

(16)

and zero otherwise. In (16), Γ(.) denotes the gamma function

Γ (α) = \int_{0}^{\infty} t^{α - 1} exp (- t) dt α > 0 .

Similarly, let also N(x|m,Σ) denote the PDF of a Gaussian random vector taking values in ℜ^L and with mean m and positive definite covariance matrix Σ, i.e.,

N (x | m, Σ) = \frac{1}{{(2 π)}^{\frac{L}{2}} {| Σ |}^{\frac{1}{2}}} exp [- \frac{{(x - m)}^{T} Σ^{- 1} (x - m)}{2}],

where |Σ| denotes the determinant of the matrix Σ and the superscript T denotes the transpose of a vector.

In the scenario with unknown sensor variances, it can be shown (see Appendix 2) that if at instant n - 1,

p (σ_{1 : R}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1}) = \prod_{i = 1}^{R} IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)}),

(17)

then

p (Z_{r, n} | x_{r, 0 : n}^{(q)}, Z_{s, 0 : n - 1}) = \prod_{i} \underset{{\bar{λ}}_{i, n}^{(q)} (x_{r, n}^{(q)})}{\underset{⏟}{p (z_{i, n} | x_{r, 0 : n}^{(q)}, Z_{s, 0 : n - 1})}},

(18)

where i ∈ {r} ∪ N_r, and each factor ${\bar{λ}}_{i, n}^{(q)} (x_{r, n}^{(q)})$ in the product on the right-hand side of (18) is computed by solving the integral

\begin{align} \int_{0}^{\infty} p (z_{i, n} | x_{r, n}^{(q)}, σ_{i}^{2}) p (σ_{i}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1}) d σ_{i}^{2} \\ = \int_{0}^{\infty} N (z_{i, n} | g_{i} (x_{r, n}^{(q)}), σ_{i}^{2}) IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)}) d σ_{i}^{2} \\ \propto \frac{{[β_{s, i, n - 1}^{(q)}]}^{α_{s, i, n - 1}}}{Γ (α_{s, i, n - 1})} \frac{Γ (α_{r, i, n})}{{[β_{r, i, n}^{(q)}]}^{α_{r, i, n}}}, \end{align}

(19)

where Γ(•), as before, denotes the gamma function

\begin{align} α_{r, i, n} = α_{s, i, n - 1} + \frac{1}{2} \end{align}

(20)

\begin{align} β_{r, i, n}^{(q)} = β_{s, i, n - 1}^{(q)} + \frac{1}{2} {[z_{i, n} - g_{i} (x_{r, n}^{(q)})]}^{2}, \end{align}

(21)

with g_i(•) calculated as in (3). Furthermore, at node r and instant n, the updated parameter posterior PDF

p (σ_{1 : R}^{2} | x_{r, 0 : n}^{(q)}, Z_{r, 0 : n}) = \prod_{i = 1}^{R} IG (σ_{i}^{2} | α_{r, i, n}, β_{r, i, n}^{(q)}),

(22)

where α_r,i,n and $β_{r, i, n}^{(q)}$ are updated as in (20) and (21) if i ∈ {r} ∪ N_r or, otherwise, are kept equal respectively to α_s,i,n-1 and $β_{s, i, n - 1}^{(q)}$ . If regularization is used to combat particle degeneracy, the posterior parameters ${β_{s, i, n - 1}^{(q)}}$ must be also resampled according to new weights $w_{r, n}^{(q)}$ updated as in (13) and a new set $\{β_{r, i, n}^{(q)}\}$ must be recalculated for i ∈ {r} ∪ N_r using (21) with the resampled ${β_{s, i, n - 1}^{(q)}}$ and the new moved particles ${x_{r, n}^{(q)}}$ . We follow, however, a different suboptimal strategy described in Section 4.3, which also allows a significant reduction in inter-node communication cost.

Inter-node transmission requirements In the unknown variance scenario, based on the previous discussion, at each instant n, a node s has to transmit to its (randomly chosen) neighboring node r its particle set $\{x_{s, n - 1}^{(q)}\}$ (4Q real numbers) plus the respective importance weights $\{w_{s, n - 1}^{(q)}\}$ (Q real numbers) and the set of hyper-parameters $\{(α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)})\}, i \in R, q \in Q$ (another R × (Q + 1) real numbers), which specify the posterior PDF $p (σ_{1 : R}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})$ . In addition, as before, node s also sends its scalar observation z_s,n and the observation model parameters (ζ_s,x_s) to all nodes i in the neighborhood of s.

4.3 Approximate RB ReDif-PF

Although the exact ReDif-PF algorithms in Sections 4.1 and 4.2 converge asymptotically to the state estimate in (5) as the number of particles Q goes to infinity, their inter-node communication cost is still relatively high. To reduce the communication burden, we propose two suboptimal approximations which are described in detail in the sequel.

GMM approximation of the marginal posterior of the states To circumvent the inconvenience of having to transmit, either in the known or unknown sensor parameter scenarios, Q particles and respective weights per node at each time step, we follow the lead in[21] and build a GMM representation of the marginal posterior $p (x_{n - 1} | Z_{s, 0 : n - 1})$ of the form

p (x_{n - 1} | Z_{s, 0 : n - 1}) \approx \sum_{k \in K} η_{s, n - 1}^{(k)} N (x_{n - 1} | μ_{s, n - 1}^{(k)}, Σ_{s, n - 1}^{(k)})

(23)

where $K = \{1, \dots, K\}$ and the parameters $η_{s, n - 1}^{(k)}, μ_{s, n - 1}^{(k)}$ , and $Σ_{s, n - 1}^{(k)}$ are obtained from the weighted particle set $\{x_{s, n - 1}^{(q)}, w_{s, n - 1}^{(q)}\}, q \in Q$ , at node s using the Expectation-Maximization (EM)[31] algorithm. Node s now transmits to node r only the parameters that specify the GMM model, i.e., 15K real numbers for a four-dimensional state vector, as opposed to 5Q real numbers, where typically Q >> K (in the simulations in Section 5 for example, K is either 1 or 2, whereas Q is 500). Node r then locally resamples Q new particles $x_{s, n - 1}^{(q)}$ according to the received GMM PDF and resets its importance weights $w_{s, n - 1}^{(q)}$ to 1/Q. Since resampling from the GMM approximation is used, we omit the regularization step mentioned in Section 4.

Approximation of the posterior distribution of the sensor variances In the particular situation where the sensor variances are unknown, in theory we should also locally resample the previous particle trajectories $x_{s, 0 : n - 2}^{(q)}$ jointly with $x_{s, n - 1}^{(q)}$ from some parametric approximation to $p (x_{0 : n - 1} | Z_{s, 0 : n - 1})$ and then recompute retroactively the posterior PDF’s $p (σ_{i}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})$ , i = 1,…,R for the resampled particle paths. To eliminate that curse of dimensionality, it is desirable to introduce a parametric approximation to $p (σ_{i}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})$ that eliminates the dependence of that function on the particle label q and the simulated sequence $x_{s, 0 : n - 1}^{(q)}$ .

Specifically, we follow the lead in[11, 22, 32], and, for each $i \in R$ , approximate the marginal posteriors $p (σ_{i}^{2} | x_{0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})$ for all particle labels q and all possible sequences $x_{0 : n - 1}^{(q)}$ by a new inverse gamma PDF with parameters ${\tilde{α}}_{s, i, n - 1}$ and ${\tilde{β}}_{s, i, n - 1}$ , independent of q and chosen such that the approximated PDF $IG (σ_{i}^{2} | {\tilde{α}}_{s, i, n - 1}, {\tilde{β}}_{s, i, n - 1})$ matches the first and second moments of

\begin{align} p (σ_{i}^{2} | Z_{s, 0 : n - 1}) & = \int [p (σ_{i}^{2} | x_{0 : n - 1}, Z_{s, 0 : n - 1}) \\ \times p (x_{0 : n - 1} | Z_{s, 0 : n - 1})] d x_{0 : n - 1} \end{align}

(24)

where the term on the left-hand side of (24) is the average (or expected value) of $p (σ_{i}^{2} | x_{0 : n - 1}, Z_{s, 0 : n - 1})$ over all possible realizations of x_0:n-1 conditioned on the observations $Z_{s, 0 : n - 1}$ . Assuming now that ${(w_{s, n - 1}^{(q)}, x_{s, 0 : n - 1}^{(q)})}, q \in Q$ , is a properly weight set available at node s at instant n - 1 to represent $p (x_{0 : n - 1} | Z_{s, 0 : n - 1})$ , we make the Monte Carlo approximation

p (σ_{i}^{2} | Z_{s, 0 : n - 1}) \approx \sum_{q = 1}^{Q} w_{s, n - 1}^{(q)} p (σ_{i}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1}) .

(25)

On the other hand, from the assumption that $p (σ_{1 : R}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})$ is a separable function factored as in (17), it follows that

p (σ_{i}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1}) = IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)})

and, therefore,

p (σ_{i}^{2} | Z_{s, 0 : n - 1}) \approx \sum_{q = 1}^{Q} w_{s, n - 1}^{(q)} IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)}) .

(26)

In the sequel, recall that if $σ^{2} \sim I G (α, β)$ , then the respective mean and variance of σ² are given by[19]

\begin{align} E \{σ^{2}\} = \frac{β}{α - 1}, α > 1 \end{align}

(27)

\begin{align} Var \{σ^{2}\} & = \frac{β^{2}}{{(α - 1)}^{2} (α - 2)}, α > 2 . \end{align}

(28)

Therefore, the parameters ${\tilde{α}}_{s, i, n - 1}$ and ${\tilde{β}}_{s, i, n - 1}$ such that $IG (σ_{i}^{2} | {\tilde{α}}_{s, i, n - 1}, {\tilde{β}}_{s, i, n - 1})$ matches the mean and variance associated with the PDF on the right-hand side of (26) are found, following the procedure in[11, 22, 32] by making

\begin{align} {\tilde{α}}_{s, i, n - 1} = 2 + {\hat{E}}_{s, n - 1}^{2} [σ_{i}^{2}] / {\hat{VAR}}_{s, n - 1} [σ_{i}^{2}] \end{align}

(29)

\begin{align} {\tilde{β}}_{s, i, n - 1} = ({\tilde{α}}_{s, i, n - 1} - 1) {\hat{E}}_{s, n - 1} [σ_{i}^{2}], \end{align}

(30)

where

\begin{align} {\hat{E}}_{s, n - 1} [σ_{i}^{2}] & = \frac{\sum_{q = 1}^{Q} w_{s, n - 1}^{(q)} β_{s, i, n - 1}^{(q)}}{α_{s, i, n - 1} - 1} \\ {\hat{VAR}}_{s, n - 1} [σ_{i}^{2}] & = \frac{\sum_{q = 1}^{Q} w_{s, n - 1}^{(q)} {(β_{s, i, n - 1}^{(q)})}^{2}}{(α_{s, i, n - 1} - 1) (α_{s, i, n - 1} - 2)} \\ - {\hat{E}}_{s, n - 1}^{2} [σ_{i}^{2}] . \end{align}

(31)

Replacing now $p (σ_{i}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})$ in (19) with

\tilde{p} (σ_{i}^{2} | Z_{s, 0 : n - 1}) = IG (σ_{i}^{2} | {\tilde{α}}_{s, i, n - 1}, {\tilde{β}}_{s, i, n - 1})

for all q ∈ {1,…,Q} and all possible sequences $x_{s, 0 : n - 1}^{(q)}$ , we get, at node r at instant n, new factors ${\tilde{λ}}_{i, n} (.)$ such that

\begin{array}{l} {\tilde{λ}}_{i, n} (x_{r, n}^{(q)}) & = \int_{0}^{\infty} p (z_{i, n} | x_{r, n}^{(q)}, σ_{i}^{2}) \tilde{p} (σ_{i}^{2} | Z_{s, 0 : n - 1}) d σ_{i}^{2} \\ = \int_{0}^{\infty} N (z_{i, n} | g_{i} (x_{r, n}^{(q)}), σ_{i}^{2}) IG (σ_{i}^{2} | {\tilde{α}}_{s, i, n - 1}, {\tilde{β}}_{s, i, n - 1}) d σ_{i}^{2} \\ \propto \frac{{[{\tilde{β}}_{s, i, n - 1}]}^{{\tilde{α}}_{s, i, n - 1}}}{Γ ({\tilde{α}}_{s, i, n - 1})} \frac{Γ (α_{r, i, n})}{{[β_{r, i, n}^{(q)}]}^{α_{r, i, n}}}, \end{array}

(32)

where

\begin{align} α_{r, i, n} = {\tilde{α}}_{s, i, n - 1} + \frac{1}{2} \end{align}

(33)

\begin{align} β_{r, i, n}^{(q)} = {\tilde{β}}_{s, i, n - 1} + \frac{1}{2} {[z_{i, n} - g_{i} (x_{r, n}^{(q)})]}^{2}, \end{align}

(34)

for all $q \in Q$ and all i ∈ {r} ∪ N_r. Otherwise, if i ∉ {r} ∪ N_r

\begin{align} α_{r, i, n} = {\tilde{α}}_{s, i, n - 1} \end{align}

(35)

\begin{align} β_{r, i, n}^{(q)} = {\tilde{β}}_{s, i, n - 1}, \end{align}

(36)

again for all $q \in Q$ . The modified importance weight update rule at node r at instant n now becomes

w_{r, n}^{(q)} \propto w_{s, n - 1}^{(q)} \prod_{i \in {r} \cup N_{r}} {\tilde{λ}}_{i, n} (x_{r, n}^{(q)}) q \in \{1, \dots, Q\} .

(37)

Inter-node communication cost By combining the GMM approximation and the moment-matching approximation described before, node s now transmits to its (randomly chosen) neighbor r only the GMM model parameters (15K real numbers as previously explained) plus 2R hyper-parameters $({\tilde{α}}_{s, i, n - 1}, {\tilde{β}}_{s, i, n - 1}), i \in R$ , as opposed to R × (Q + 1) hyper-parameters as before in the exact RB ReDif-PF algorithm.

Summary of the approximate RB-ReDif-PF Algorithm 1 summarizes the approximate RB ReDif-PF tracker at node r at instant n. In Algorithm 1, the symbol Θ_r,n denotes the set $\{(η_{r, n}^{(k)}, μ_{r, n}^{(k)}, Σ_{r, n}^{(k)}), ({\tilde{α}}_{r, i, n} {\tilde{β}}_{r, i, n})\}$ for $i \in R$ and $k \in K$ .

Algorithm 1 Approximate Rao-Blackwellized random exchange diffusion particle filter

4.4 Differences between ReDif-PF and the Markov chain distributed particle filter

An alternative and different approach to distributed particle filtering is the MCDPF algorithm introduced in[9]. MCDPF, like other previous work in the distributed PF literature, assumes conditional independence of the sensor observations given the target state and, therefore, should be compared to the proposed ReDif-PF algorithm in this paper in the known sensor parameters scenario of Section 4.1 as opposed to the more general Rao-Blackwellized version of ReDif-PF proposed for unknown sensor parameters in Section 4.2.

The main idea in MCDPF is to move each particle and its associated weight multiple times between nodes in the time interval between instants n and n + 1, according to a Markov chain with transition probabilities defined by the normalized adjacency matrix of the graph that defines the network topology. Each time a given particle $x_{n}^{⋆}$ visits a network node r, its weight is multiplied by the pseudo-likelihood $p {(z_{r, n} | x_{n}^{⋆})}^{1 / J ϕ (r)}$ where ϕ(r) is the long-term stationary probability of the state of the Markov chain specified by being equal to r, r = 1,…,R, and J is total number of Markov chain move steps between consecutive sensor measurements, which is set by the user. Since the number of visits to the node r divided by J converges to ϕ(r)[9] as J → ∞, it follows that if J is large enough so that particle $x_{n}^{⋆}$ not only visits all network nodes but also visits each node multiple times, then the aggregate update factor for its corresponding weight at the end of the random walk will approach

\prod_{r = 1}^{R} p (z_{r, n} | x_{n}^{⋆}),

(38)

which, under the assumption of conditional independence of the sensor measurements given the target state, is the exact update factor for the optimal global weight associated with particle $x_{n}^{⋆}$ . For a finite and especially low number of move steps, MCDPF is no longer optimal, meaning that the choice of the parameter J involves a tradeoff between inter-node communication cost and state estimation error.

Contrary to MCDPF, the proposed ReDif-PF does not attempt to compute the exact optimal global posterior PDF p(x_0:n|z_1:R,0:n) at all nodes r = 1,…,R at each instant n. Instead, as explained in previous sections, ReDif-PF builds at each node r and at each instant n a Monte Carlo representation of the posterior $p (x_{0 : n} | Z_{r, 0 : n})$ , where $Z_{r, 0 : n}$ is a random subset of z_1:R,0:n that changes from node to node. Such Monte Carlo representation is built in a way that between instants n and n + 1, each node makes only one request to exchange particles/weights (or equivalent parametric approximations of posterior distributions) with a randomly chosen neighbor, thus eliminating the need for multiple iterative inter-node communication between consecutive sensor measurements and resulting in a communication cost that is much lower than that of the MCDPF algorithm for a similar mean square state estimation error (see the numerical simulation results in Section 5.2).

Finally, we also note that compared to the non-iterative ReDif-PF, MCDPF is also computationally more intensive since each node r has to compute the local likelihoods $p (z_{r, n} | x_{n}^{(q)})$ for all its particles $x_{n}^{(q)}$ multiple (namely J) times between instants n and n + 1. We also illustrate that point in the numerical simulations of Section 5.2.

5 Simulation results

We assessed the performance of the proposed algorithms using 100 Monte Carlo runs with simulated data in three distinct scenarios assuming both unknown and known sensor variances. In all scenarios, we used R = 25 RSS sensors with parameters P₀ = 1 dBm, d₀ = 1 m, ζ_r = 3, $\forall r \in R$ , and $σ_{r}^{2}$ independently sampled at each node according to an distribution with mean 16. The nodes were deployed on a jittered grid within a square of size 100 m × 100 m. In the fully distributed algorithms, each node communicates with other nodes within a range of 40 m. All particle filters used Q = 500 particles.

Figure3 shows the sensor positions and two distinct realizations of the emitter trajectory generated for T = 1 s and $x_{0} = {[\begin{matrix} 25 m & 0.5 m/s & 35 m & 0.5 m/s \end{matrix}]}^{T}$ considering respectively σ_accel = 0.05 m/s² and σ_accel = 0.2 m/s². It also depicts the available network connections. The diameter of the sensor network is D = 5 hops and the minimum number of neighbors for any possible node is 3.

5.1 Scenario I: ReDif-PF vs. CbPF

In the first scenario, we assumed unknown sensor variances and evaluated the performance of the Rao-Blackwellized ReDif-PF and two consensus-based PF trackers using respectively iterative minimum consensus (CbPFa) and flooding (CbPFb) (see also[11]). The aforementioned algorithms were compared to the equivalent broadcast implementation of the optimal centralized PF tracker, referred to as DcPF in[11] and[27] and in Section 3.1 of this paper. We also assumed Gaussian priors with mean ${[\begin{matrix} x_{0} & y_{0} \end{matrix}]}^{T}$ and covariance matrix diag(20²,20²) for the emitter’s position in Cartesian coordinates and mean ${[\sqrt{{\dot{x}}_{0}^{2} + {\dot{y}}_{0}^{2}} arctan ({\dot{y}}_{0} / {\dot{x}}_{0})]}^{T}$ and covariance matrix diag(0.3²,(5π/180)²) for the emitter’s velocity in polar coordinates, where x₀ = 25 m, y₀ = 35 m, and ${\dot{x}}_{0} = {\dot{y}}_{0} = 0.5 m/s$ . In the initialization step, the realizations of the initial emitter velocity are sampled from the aforementioned Gaussian prior and, then, converted from polar to Cartesian coordinates.

Figure4 shows the evolution of the root mean square (RMS) error norm - averaged over all network nodes and Monte Carlo runs - of the emitter position estimates for the RB ReDif-PF and the CbPFa and CbPFb algorithms superimposed to the benchmark RMS error curve for the optimal DcPF algorithm. Furthermore, we also show in Figure4 the average RMS error norm for the non-cooperative (isolated node) trackers and for a local cooperation scheme. In the former, each node runs a regularized PF tracker (see[11]) which assimilates local measurements only, while in the latter, a node r incorporates all measurements $Z_{r, n}$ in its vicinity in the same way as in the ReDif-PF tracker, but it does not exchange its updated posterior with its neighbors. The bars shown in Figure4 represent the standard deviation of the error norm across all nodes in the network. There are no bars for the DcPF and CbPF algorithms since they provide the same state estimate at all nodes. The RMS error norm at time step 0 for all algorithms was calculated after the measurements z_1:R,0 were assimilated. We implemented the RB ReDif-PF in this scenario with the parametric approximations in Section 4.3 using only one Gaussian mode to represent $p (x_{n - 1} | Z_{s, 0 : n - 1})$ .

As expected, CbPFa and CbPFb match the performance of the DcPF tracker since both algorithms reproduce the optimal centralized PF tracker exactly, albeit with different communication and computational costs. On the other hand, as shown in Figure4, the RB ReDif-PF tracker has a performance degradation compared to DcPF. This result is again theoretically expected since, in the RB ReDif-PF algorithm, the posterior at each node assimilates just a subset of the available measurements z_1:R,n in the whole network at each time step n. However, ReDif-PF offers an improvement in error performance compared to the local cooperation scheme by better diffusing the information across the network. We also note from Figure4 that the standard deviation of the state estimate across the different network nodes is much lower in the ReDif-PF algorithm than in the local cooperation scheme. Note also that, as shown in Figure4, isolated nodes were not able to properly track the emitter in the evaluated scenario.

Finally, Figure5 shows the performance comparison between the ReDif-PF and consensus-based algorithms for σ_accel ∈ {0.05,0.1,0.2}.

As expected, as σ_accel increases, there is a deterioration in the RMS error performance. However, the ratio between the RMS error performance of the suboptimal ReDif-PF tracker and the benchmark optimal DcPF/CbPFb algorithms remains approximately constant (close to a factor of two) along the simulation period for all three different values of σ_accel employed.

Communication and computation cost Considering a four-byte and a one-byte network representation respectively for real and Boolean values, the total amount of bytes transmitted and received by all nodes over the network was recorded while running each tracker in Figure4. Table1 summarizes the communication cost for each algorithm in the first scenario (unknown sensor variances) in terms of average transmission (TX) and average reception (RX) rates per node and also quantifies the processing cost for each algorithm in terms of average duty cycle per node, measured in a Intel Core i5 machine with 4GB RAM. The duty cycle of a given node is defined as the ratio between the total node processing time and the simulation period 100 s. Finally, values in Table1 are averaged over all Monte Carlo simulations.

Table 1 Average communication and processing cost per node in the first scenario

Full size table

As shown in Table1, the RB ReDif-PF tracker with the parametric approximations in Section 4.3 using only one Gaussian mode has a communication cost based on TX rate that is approximately one order of magnitude lower than the flooding-based CbPFb’s communication requirements. Compared to the iterative minimum consensus solution (CbPFa), the average communication cost is reduced by two orders of magnitude.

5.2 Scenario II: ReDif-PF vs. ReDif-EKF

In the second scenario, the sensor variances are perfectly known and the ReDif-PF tracker is compared both to the optimal centralized PF and to a linearized random exchange extended Kalman filter (ReDif-EKF), which is summarized in Appendix 3. In the simulations, we assumed a non-informative prior for the sensor’s initial position that is uniform in the entire surveillance space. The actual initial position of the emitter was, however, sampled from a Gaussian distribution centered at (5 m,5 m) with standard deviation of 3 m in both dimensions. Figure6 shows a normalized contour map for the posterior PDF p(x₀,y₀|z_1:R,0) at instant 0 as a function of x₀ and y₀ assuming the aforementioned non-informative prior. As seen from Figure6, the initial posterior distribution of the target’s position is non-Gaussian.

Figure7 shows the evolution of the RMS error norm assuming known sensor variances respectively for the ReDif-PF algorithm in Section 4.1 with a two-Gaussian GMM parametric approximation and the ReDif-EKF algorithm in Appendix 3. We also show the RMS curve for the optimal centralized PF tracker as a benchmark. The plots in Figure7 show that, especially in the initial time steps, when the posterior distribution of the states is strongly non-Gaussian as suggested by Figure6, the fully distributed ReDif-PF outperforms its linearized counterpart, the ReDif-EKF. As the emitter moves away from the near field of the initial dominant sensor, the performance of the ReDif-EKF slowly improves and approaches that of the ReDif-PF, albeit still with a slight degradation towards the end of the simulation.

Communication and Computation Cost Table2 summarizes the communication and processing cost per node for each algorithm in the second scenario.

Table 2 Average communication and processing cost per node in the second scenario

Full size table

As expected, the DcPF algorithm assuming known sensor variances has the same communication requirements as in the scenario with unknown variances since DcPF locally computes the likelihood functions and then broadcasts them to the entire network. However, as shown in Table2, DcPF has a slightly lower processing cost when the sensor variances are known. The ReDif-PF tracker on the other hand outperformed the ReDif-EKF tracker in terms of the position RMS error at the expense of a greater communication and computational cost. However, as indicated in Table2, the communication requirements of the ReDif-PF and ReDif-EKF trackers still have the same order of magnitude.

5.3 Scenario III: ReDif-PF vs. MCDPF/selective gossip

In the third scenario, the ReDif-PF tracker is compared to two iterative algorithms from the literature - the MCDPF and the selective gossip from[9] and[23], respectively - assuming perfectly known sensor variances as in the second scenario and the same Gaussian priors for the emitter’s initial position and velocity used in the first scenario.

Figure8 shows the evolution of the RMS error norm assuming known sensor variances for the ReDif-PF algorithm in Section 4.1 with a single-mode GMM parametric approximation and the MCDPF algorithm in[9] for J ∈ {10,30,50,100} iterations.

Figure9 shows the evolution of the RMS error norm for the ReDif-PF algorithm in Section 4.1 with a single-mode GMM parametric approximation and the selective gossip algorithm in[23] using respectively J ∈ {1,000;2,000;4,000} iterations. More specifically, we first run J average gossip iterations considering only the particles in the top 10% bracket in terms of log-likelihood for each randomly selected pair of nodes at each iteration and, subsequently, we run J standard max gossip iterations for the averaged log-likelihood of the selected particle as proposed in[23] to ensure that all nodes have exactly the same weight update factors. Note that, since only one pair of nodes is active at each average gossip iteration and only 10% of the particles are being transmitted between the active nodes, the Selective Gossip algorithm has a lower inter-node communication cost than MCDPF even when a much larger number of iterations is used between consecutive sensor measurements.

Communication and computation cost Table3 summarizes the communication and processing cost per node for each algorithm in the third scenario.

Table 3 Average communication and processing cost per node in the third scenario

Full size table

The MCDPF and the selective gossip algorithms have a RMS error performance similar to the ReDif-PF algorithm for J = 30 and J = 4,000 iterations, respectively, at the expense of a communication cost approximately two orders of magnitude larger than that of the ReDif-PF tracker. Moreover, for a comparable RMS error, the measured ReDif-PF duty cycle is also approximately five and seven times lower than the duty cycle of the MCDPF and the selective gossip algorithms respectively. Note, however, that the selective gossip tracker converges to the same estimate at all nodes and the estimates at each node provided by the MCDPF tracker have a lower standard deviation than those provided by the ReDif-PF algorithm.

We also note from Table3 that with J = 100 Markov chain move steps between sensor measurements, the MCDPF RMS error approaches the error curve of the optimal flooding-based CbPFb tracker with a inter-node communication cost that is, however, roughly four times greater than that of the CbPFb algorithm.

6 Conclusions

We introduced in this paper a Rao-Blackwellized version of the random exchange diffusion particle filter which enables fully distributed tracking of hidden state vectors in cooperative sensor networks with unknown sensor parameters. Although the general structure of the algorithm can be generalized to arbitrary signal models, we specified the algorithm in this particular paper in an application where we track a moving emitter using multiple RSS sensors with unknown noise variances. The ReDif-PF tracker, introduced originally in a simpler version in[17], is based on random information dissemination and is well suited for real-time applications since, unlike consensus-based approaches, it does not require iterative inter-node communication between measurement arrivals.

The new Rao-Blackwellized version of the ReDif-PF was compared to an exact broadcast implementation of the optimal centralized PF solution, referred to as the DcPF algorithm, and to two equivalent, fully distributed PFs using respectively iterative minimum consensus (CbPFa) and flooding (CbPFb). As expected, due to its suboptimality, the ReDif-PF tracker showed a degradation in RMS error performance compared to both DcPF and the equivalent consensus implementations in our simulations, but required much lower communication bandwidth with savings of one order of magnitude compared to DcPF and CbPFb in terms of transmission rate, and two orders of magnitude compared to CbPFa. The communication cost savings in the RB ReDif-PF algorithm were possible due to suitable parametric approximations introduced in Section 4.3.

The RB ReDif-PF algorithm RMS error performance was also compared in the unknown variance scenario to a local cooperation scheme in which each node assimilates all available measurements in its neighborhood but does not exchange its posterior statistics with other nodes. By diffusing information over the network, the RB ReDif-PF tracker showed better error performance than the local cooperation scheme that uses local information only. Additionally, the standard deviation of the error norm considering all nodes in the network was much lower for RB ReDif-PF than in the local cooperation scheme, suggesting possible weak consensus.

Next, in a second scenario with perfectly known variances, we also compared a non-RB ReDif-PF tracker to its distributed linear filtering counterpart, the ReDif-EKF described in Appendix 3. Due to the non-Gaussianity of the posterior distribution of the states, the distributed PF solution outperformed the distributed EKF solution, albeit, as expected, at a greater computational and communication cost.

Finally, in a third scenario also with perfectly known variances, we compared the non-RB ReDif-PF tracker to two alternative distributed particle filters based respectively on iterative Markov chain move steps between sensor measurements as proposed in[9] and on iterative selective average gossiping as proposed in[23]. In our simulations, the novel ReDif-PF matched the RMS error performance with both the Markov chain and the selective gossip filters with an inter-node communication cost approximately two orders of magnitude lower and a required duty cycle that is reduced by a factor of 5 when compared to MCDPF and a factor of 7 when compared to the selective gossip scheme.

As future work, we plan to extend the ReDif-PF algorithm to perform joint detection and tracking-considering scenarios with probability of detection less than 1 and probability of false alarm greater than 0 as in[33]. We also plan to analyze the diffusion properties of ReDif-PF by investigating the long-term statistical properties of the sequence of visited nodes {r_n},n > 0, defined by the random exchange protocol starting from a random node r₀.

Appendix 1

In this appendix, we use an importance sampling methodology (see[5, 6]) to show that the augmented particle set $x_{r, 0 : n}^{(q)} = {(x_{s, 0 : n - 1}^{(q)}, x_{r, n}^{(q)})}$ , q = 1,…,Q with weights ${w_{r, n}^{(q)}}$ obtained according to (12) and (13) in Section 4 is a properly weighted set to represent the posterior PDF $p (x_{0 : n} | Z_{r, n}, Z_{s, 0 : n - 1})$ in the sense that for any measurable function h(•),

E \{h (x_{0 : n}) | Z_{r, n}, Z_{s, 0 : n - 1}\} \approx \sum_{q = 1}^{Q} w_{r, n}^{(q)} h (x_{r, 0 : n}^{(q)}) .

Specifically, let $\{x_{s, 0 : n - 1}^{(q)}\}$ with associated weights $\{w_{s, n - 1}^{(q)}\}, q \in Q$ , be a properly weighted set that represents the posterior PDF $p (x_{0 : n - 1} | Z_{s, 0 : n - 1})$ at node s. Assuming that the particle set $\{x_{s, 0 : n - 1}^{(q)}\}$ was sampled according to some proposal importance function $π (x_{0 : n - 1} | Z_{s, 0 : n - 1})$ , the proper weights $\{w_{s, n - 1}^{(q)}\}$ may be written as[5, 6]

w_{s, n - 1}^{(q)} = \frac{w (x_{s, 0 : n - 1}^{(q)})}{\sum_{j = 1}^{Q} w (x_{s, 0 : n - 1}^{(j)})} q \in Q,

(39)

where

w (x_{s, 0 : n - 1}^{(q)}) = \frac{p (x_{s, 0 : n - 1}^{(q)} | Z_{s, 0 : n - 1})}{π (x_{s, 0 : n - 1}^{(q)} | Z_{s, 0 : n - 1})} .

Assume next that node s sends its particle set and weights to a neighboring node r that can access at instant n the measurements $Z_{r, n} = \{z_{r, n}\} \cup {\{z_{i, n}\}}_{i \in N_{r}}$ . For any measurable function h(•), we note that

\begin{array}{l} E \{h (x_{0 : n}) | Z_{r, n}, Z_{s, 0 : n - 1}\} \\ = \frac{\int h (x_{0 : n}) \frac{p (x_{0 : n} | Z_{r, n}, Z_{s, 0 : n - 1})}{p (x_{n} | x_{n - 1}) π (x_{0 : n - 1} | Z_{s, 0 : n - 1})} p (x_{n} | x_{n - 1}) π (x_{0 : n - 1} | Z_{s, 0 : n - 1}) d x_{0 : n}}{\int \frac{p (x_{0 : n} | Z_{r, n}, Z_{s, 0 : n - 1})}{p (x_{n} | x_{n - 1}) π (x_{0 : n - 1} | Z_{s, 0 : n - 1})} p (x_{n} | x_{n - 1}) π (x_{0 : n - 1} | Z_{s, 0 : n - 1}) d x_{0 : n}} . \end{array}

(40)

Sampling now at node r new particles $x_{r, n}^{(q)} \sim p (x_{n} | x_{s, n - 1}^{(q)})$ and building the augmented particle trajectories $x_{r, 0 : n}^{(q)} = (x_{s, 0 : n - 1}^{(q)}, x_{r, n}^{(q)}) \sim p (x_{n} | x_{n - 1}) π (x_{0 : n - 1} | Z_{s, 0 : n - 1})$ the integral on the right-hand side of (40) can be approximated as

\begin{align} E \{h (x_{0 : n}) | Z_{r, n}, Z_{s, 0 : n - 1}\} & \approx \frac{\frac{1}{Q} \sum_{q = 1}^{Q} h (x_{r, 0 : n}^{(q)}) w (x_{r, 0 : n}^{(q)})}{\frac{1}{Q} \sum_{j = 1}^{Q} w (x_{r, 0 : n}^{(j)})} \\ = \sum_{q = 1}^{Q} w_{r, n}^{(q)} h (x_{r, 0 : n}^{(q)}) \end{align}

(41)

where

w_{r, n}^{(q)} = \frac{w (x_{r, 0 : n}^{(q)})}{\sum_{j = 1}^{Q} w (x_{r, 0 : n}^{(j)})}

(42)

and

\begin{array}{l} w (x_{0 : n}) & = \frac{p (x_{0 : n} | Z_{r, n}, Z_{s, 0 : n - 1})}{p (x_{n} | x_{n - 1}) π (x_{0 : n - 1} | Z_{s, 0 : n - 1})} \\ = \frac{p (Z_{r, n} | x_{0 : n}, Z_{s, 0 : n - 1}) p (x_{n} | x_{0 : n - 1}, Z_{s, 0 : n - 1})}{p (x_{n} | x_{n - 1}) p (Z_{r, n} | Z_{s, 0 : n - 1})} \\ \times w (x_{0 : n - 1}) . \end{array}

(43)

Substituting (43) into (42) and recalling from the model assumptions that $p (x_{n} | x_{0 : n - 1}, Z_{s, 0 : n - 1}) = p (x_{n} | x_{n - 1})$ we get the recursion

\begin{align} w_{r, n}^{(q)} & = \frac{w (x_{s, 0 : n - 1}^{(q)}) p (Z_{r, n} | x_{r, 0 : n}^{(q)}, Z_{s, 0 : n - 1})}{\sum_{j = 1}^{Q} w (x_{s, 0 : n - 1}^{(j)}) p (Z_{r, n} | x_{r, 0 : n}^{(j)}, Z_{s, 0 : n - 1})} \\ \propto \frac{w_{s, n - 1}^{(q)} p (Z_{r, n} | x_{r, 0 : n}^{(q)}, Z_{s, 0 : n - 1})}{\sum_{j = 1}^{Q} w_{s, n - 1}^{(j)} p (Z_{r, n} | x_{r, 0 : n}^{(j)}, Z_{s, 0 : n - 1})} \end{align}

(44)

where, as before,

w_{s, n - 1}^{(q)} = \frac{w (x_{s, 0 : n - 1}^{(q)})}{\sum_{j = 1}^{Q} w (x_{s, 0 : n - 1}^{(j)})} .

Appendix 2

Let, as before,

\begin{align} IG (σ^{2} | α, β) = \frac{β^{α}}{Γ (α)} σ^{- 2 (α + 1)} exp (- \frac{β}{σ^{2}}) \end{align}

(45)

\begin{align} N (z | m, σ^{2}) = \frac{1}{\sqrt{2 π} σ} exp [- \frac{{(z - m)}^{2}}{2 σ^{2}}] \end{align}

(46)

where σ² > 0 and m ∈ ℜ. After some algebraic calculations, it can be shown (see[19] and also[6, 11]) that

\frac{N (z | m, σ^{2}) IG (σ^{2} | α, β)}{\int_{0}^{\infty} N (z | m, σ^{2}) IG (σ^{2} | α, β) d σ^{2}} = IG (σ^{2} | \bar{α}, \bar{β})

(47)

where

\begin{align} \bar{α} = α + \frac{1}{2} \end{align}

(48)

\begin{align} \bar{β} = β + \frac{1}{2} {(z - m)}^{2} . \end{align}

(49)

Similarly, using the same algebraic procedure, it follows that (see[6, 19])

\int_{0}^{\infty} N (z | m, σ^{2}) IG (σ^{2} | α, β) d σ^{2} \propto \frac{β^{α}}{Γ (α)} \frac{Γ (\bar{α})}{{\bar{β}}^{\bar{α}}}

(50)

where $\bar{α}$ and $\bar{β}$ are given respectively by (48) and (49).

Assume now that at node s at instant n - 1, the joint posterior PDF $p (σ_{1 : R}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})$ is factored as

p (σ_{1 : R}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1}) = \prod_{i = 1}^{R} IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)}) .

(51)

In the sequel, assume that node s transmits to a neighboring node r its weighted particle set $\{(w_{s, n - 1}^{(q)}, x_{s, n - 1}^{(q)})\}$ and the corresponding parameters $\{α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)}\}$ , q = 1,…,Q, i = 1,…,R. At instant n, as explained in Section 4, node r samples a new set of particles

x_{r, n}^{(q)} \sim p (x_{n} | x_{s, n - 1}^{(q)})

(52)

and updates its weights as

\begin{array}{l} w_{r, n}^{(q)} & = w_{s, n - 1}^{(q)} p (Z_{r, n} | x_{r, n}^{(q)}, x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1}) \\ = w_{s, n - 1}^{(q)} \int_{0}^{\infty} \dots \int_{0}^{\infty} [p (Z_{r, n} | x_{r, n}^{(q)}, σ_{1 : R}^{2}) \\ \times p (σ_{1 : R}^{2} | x_{r, n}^{(q)}, x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})] d σ_{1 : R}^{2} \\ = \{\prod_{i \in {\tilde{N}}_{r}} \int_{0}^{\infty} [p (z_{i, n} | x_{r, n}^{(q)}, σ_{i}^{2}) \\ \times IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)})] d σ_{i}^{2}\} \\ \times \prod_{i \notin {\tilde{N}}_{r}} \underset{= 1}{\underset{⏟}{\int_{0}^{\infty} IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)}) d σ_{i}^{2}}}, \end{array}

(53)

where ${\tilde{N}}_{r}$ denotes {r} ∪ N_r and, as in Section 4, $Z_{r, n}$ is a notation for the set {z_i,n} for all $i \in {\tilde{N}}_{r}$ . In (53), we used the facts that

\begin{array}{l} p (Z_{r, n} | x_{r, n}^{(q)}, x_{s, 0 : n - 1}^{(q)}, σ_{1 : R}^{2}, Z_{s, 0 : n - 1}) & = p (Z_{r, n} | x_{r, n}^{(q)}, σ_{1 : R}^{2}) \\ = \prod_{i \in {r} \cup N_{r}} p (z_{i, n} | x_{r, n}^{(q)}, σ_{i}^{2}) \end{array}

(54)

and

\begin{array}{l} p (σ_{1 : R}^{2} | x_{r, n}^{(q)}, x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1}) \\ = \frac{p (x_{r, n}^{(q)} ∣ x_{s, 0 : n - 1}^{(q)}, σ_{1 : R}^{2}, Z_{s, 0 : n - 1}) p (σ_{1 : R}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})}{p (x_{r, n}^{(q)} ∣ x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})} \\ = \frac{p (x_{r, n}^{(q)} ∣ x_{s, n - 1}^{(q)}) p (σ_{1 : R}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1})}{p (x_{r, n}^{(q)} ∣ x_{s, n - 1}^{(q)})} \\ = p (σ_{1 : R}^{2} | x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1}), \end{array}

which, in turn, is assumed to be factored as in (51). On the other hand, using (50), it follows that for each i ∈ {r} ∪ N_r,

\begin{array}{l} \int p (z_{i, n} | x_{r, n}^{(q)}, σ_{i}^{2}) IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)}) d σ_{i}^{2} \\ = \int N (z_{i, n} | x_{r, n}^{(q)}, σ_{i}^{2}) IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)}) d σ_{i}^{2} \\ \propto \frac{{[β_{s, i, n - 1}^{(q)}]}^{α_{s, i, n - 1}}}{Γ (α_{s, i, n - 1})} \frac{Γ (α_{r, i, n})}{{[β_{r, i, n}^{(q)}]}^{α_{r, i, n}}} \end{array}

where, from (48) and (49), α_r,i,n and $β_{r, i, n}^{(q)}$ are given by (20) and (21) in Section 4.2.

Similarly, node r at instant n updates the posterior PDF of the unknown variances as

\begin{array}{l} p (σ_{1 : R}^{2} | x_{r, n}^{(q)}, x_{s, 0 : n - 1}^{(q)}, Z_{r, n}, Z_{s, 0 : n - 1}) \\ = C_{n} p (Z_{r, n} | x_{r, n}^{(q)}, x_{s, 0 : n - 1}^{(q)}, σ_{1 : R}^{2}, Z_{s, 0 : n - 1}) \\ \times p (σ_{1 : R}^{2} | x_{r, n}^{(q)}, x_{s, 0 : n - 1}^{(q)}, Z_{s, 0 : n - 1}) \\ = C_{n} [\prod_{i \in {\tilde{N}}_{r}} N (z_{i, n} | x_{r, n}^{(q)}, σ_{i}^{2}) IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)})] \\ \times [\prod_{i \notin {\tilde{N}}_{r}} IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)})] \\ = \prod_{i = 1}^{R} IG (σ_{i}^{2} | α_{r, i, n}, β_{r, i, n}^{(q)}) \end{array}

where

\begin{array}{l} C_{n} = {\{\prod_{i \in {\tilde{N}}_{r}} [\int_{0}^{\infty} N (z_{i, n} | x_{r, n}^{(q)}, σ_{i}^{2}) IG (σ_{i}^{2} | α_{s, i, n - 1}, β_{s, i, n - 1}^{(q)}) d σ_{i}^{2}]\}}^{- 1} \end{array}

is a normalization constant that does not depend on $σ_{1 : R}^{2}$ and, using (47), (48), and (49), for all i ∈ {r} ∪ N_r,

\begin{align} α_{r, i, n} = α_{s, i, n - 1} + \frac{1}{2} \end{align}

(55)

\begin{align} β_{r, i, n}^{(q)} = β_{s, i, n - 1}^{(q)} + \frac{1}{2} {[z_{i, n} - g_{i} (x_{i, n}^{(q)})]}^{2}, \end{align}

(56)

as in (20) and (21) in Section 4.2. Otherwise, if i ∉ {r} ∪ N_r, then

\begin{align} α_{r, i, n} = α_{s, i, n - 1} \end{align}

(57)

\begin{align} β_{r, i, n}^{(q)} = β_{s, i, n - 1}^{(q)} . \end{align}

(58)

Appendix 3

In a scenario with perfectly known sensor model parameters, assume that at instant n - 1, node s has a linear estimate ${\hat{x}}_{s, n - 1 | n - 1}$ of the hidden state x_n-1 based on the observations $Z_{s, 0 : n - 1}$ , which were assimilated by node s from instant zero up to instant n - 1.

In the sequel, as proposed in[2], assume that node s and a randomly chosen node r in the neighborhood of s exchange their respective estimates ${\hat{x}}_{s, n - 1 | n - 1}$ and ${\hat{x}}_{r, n - 1 | n - 1}$ , and the respective associated conditional covariance matrices, P_s,n-1|n-1 and P_r,n-1|n-1.

At instant n then, we may get a new linear estimate ${\hat{x}}_{r, n | n}$ at node r, with associated conditional covariance matrix P_r,n|n, propagating ${\hat{x}}_{s, n - 1 | n - 1}$ and P_s,n-1|n-1 using the usual extended Kalman filter recursions, but assimilating now only the local measurements {z_i,n}, i ∈ {r} ∪ N_r, also denoted $Z_{r, n}$ . Under that approach, ${\hat{x}}_{r, n | n}$ is now an approximate linear minimum mean square error estimate (see[6]) of the hidden state x_n at instant n given the new set of observations $Z_{r, 0 : n} = {Z_{r, n}, Z_{s, 0 : n - 1}}$ .

Specifically, for a more general state-space model of the form

\begin{align} x_{n + 1} = F_{n} x_{n} + G_{n} u_{n} n \geq 0 \end{align}

(59)

\begin{align} z_{r, n} = h_{r} (x_{n}) + v_{r, n} n \geq 0, r = 1, \dots, R \end{align}

(60)

with $E \{u_{n} u_{n}^{T}\} = Q_{n}$ and $E \{v_{r, n} v_{r, n}^{T}\} = R_{r, n}$ , the prediction step of the extended Kalman filter at node r at instant n, after parameter exchange, is given by

\begin{align} {\hat{x}}_{r, n | n - 1} = F_{n - 1} {\hat{x}}_{s, n - 1 | n - 1} . \\ P_{r, n | n - 1} = F_{n - 1} P_{s, n | n - 1} F_{n - 1}^{T} + G_{n - 1} Q_{n - 1} G_{n - 1}^{T} . \end{align}

On the other hand, making

H_{i, n} = \frac{\partial h_{i} (x)}{\partial x} ∣_{x = {\hat{x}}_{r, n ∣ n - 1}} i \in {r} \cup N_{r},

the updated step equations of the distributed EKF become

\begin{align} {(P_{r, n ∣ n})}^{- 1} & = {(P_{r, n ∣ n - 1})}^{- 1} + \sum_{i \in {r} \cup N_{r}} H_{i, n}^{T} R_{i, n}^{- 1} H_{i, n} . \\ {\hat{x}}_{r, n ∣ n} & = {\hat{x}}_{r, n ∣ n - 1} + P_{r, n ∣ n} \sum_{i \in {r} \cup N_{r}} [H_{i, n}^{T} R_{i, n}^{- 1} \\ \times (z_{i, n} - H_{i, n} {\hat{x}}_{r, n | n - 1})] . \end{align}

Note that in the updated step of the random exchange distributed EKF, node r must have access to the measurements {z_i,n} from its immediate neighbors and must also know their respective sensor covariance matrices {R_i,n} and the analytic expressions of the neighboring gradients $\{\frac{\partial h_{i} (.)}{\partial x}\}$ , which are then all evaluated locally at node r at the predicted estimate ${\hat{x}}_{r, n ∣ n - 1}$ . Alternatively, node r may transmit ${\hat{x}}_{r, n ∣ n - 1}$ to its neighbors, which then evaluate their respective gradients and transmit back the matrices {H_i,n} and {R_i,n} to node r.

References

Djurić PM, Beaudeau J, Bugallo MF: Non-centralized target tracking with mobile agents. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Prague: IEEE; 2011:5928-5931.
Chapter Google Scholar
Kar S, Moura JMF: Gossip and distributed Kalman filtering: weak consensus under weak detectability. IEEE Trans. Signal Process 2011, 59(4):1766-1784.
Article MathSciNet Google Scholar
Cattivelli FS, Sayed AH: Diffusion strategies for distributed Kalman filtering and smoothing. IEEE Trans. Automatic Control 2010, 55(9):2069-2084.
Article MathSciNet Google Scholar
Ribeiro A, Giannakis GB, Roumeliotis SI: SOI-KF: Distributed Kalman filtering with low-cost communications using the sign of innovations. IEEE Trans. on Signal Process 2006, 54(12):4782-4795.
Article Google Scholar
Doucet A, Godsill S, Andrieu C: On sequential Monte Carlo sampling methods for Bayesian filtering. Stat. Comput 2000, 10(3):197-208. 10.1023/A:1008935410038
Article Google Scholar
Bruno MGS: Sequential Monte Carlo methods for nonlinear discrete-time filtering. Synth. Lect. Signal Process 2013, 6(1):1-99. 10.2200/S00471ED1V01Y201303SPR011
Article Google Scholar
Hlinka O, Hlawatsch F, Djurić PM: Distributed particle filtering in agent networks: a survey, classification, and comparison. IEEE Signal Process. Mag 2013, 30(1):61-81.
Article Google Scholar
Hlinka O, Sluciak O, Hlawatsch F, Djurić PM, Rupp M: Likelihood consensus and its application to distributed particle filtering. IEEE Trans. Signal Process 2012, 60(8):4334-4349.
Article MathSciNet Google Scholar
Lee SH, West M: Markov chain distributed particle filters (MCDPF). In Proceedings of the 48th IEEE International Conference on Decision and Control. Shanghai: IEEE; 2009:5496-5501.
Google Scholar
Ustebay D, Coates M, Rabbat M: Distributed auxiliary particle filters using selective gossip. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Prague: IEEE; 2011:3296-3299.
Chapter Google Scholar
Dias SS, Bruno MGS: Cooperative target tracking using decentralized particle filtering and RSS sensors. IEEE Trans. Signal Process 2013, 61(14):32-3646.
Article MathSciNet Google Scholar
Yadav V, Salapaka MV: Distributed protocol for determining when averaging consensus is reached. In 45th Annual Allerton Conference. Allerton House - UIUC; 2007:715-720.
Google Scholar
Tsoumakos D, Roussopoulos N: A comparison of peer-to-peer search methods. In Proceedings of the WebDB. San Diego: Citeseer; 2013:61-66.
Google Scholar
Farahmand S, Roumeliotis SI: GB Giannakis Particle filter adaptation for distributed sensors via set membership. In Proceedings of the 2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP). Dallas: IEEE; 2010:3374-3377.
Chapter Google Scholar
Mohammadi A, Asif A: Consensus-based distributed unscented particle filter. In Proceedings of the 2011 IEEE Statistical Signal Processing Workshop (SSP). Nice: IEEE; 2011:237-240.
Chapter Google Scholar
Sayed AH, Tu S-Y, Chen J, Zhao X, Towfic ZJ: Diffusion strategies for adaptation and learning over networks: An examination of distributed strategies and network behavior. IEEE Signal Process. Mag 2013, 30(3):155-171.
Article Google Scholar
Dias SS, Bruno MGS: Distributed emitter tracking using random exchange diffusion particle filters. In Proceedings of the 16th International Conference on Information Fusion. Istanbul: IEEE; 2013.
Google Scholar
Casella G, Robert CP: Rao-blackwellisation of sampling schemes. Biometrika 1996, 83(1):81-94. 10.1093/biomet/83.1.81
Article MATH MathSciNet Google Scholar
Gelman A, Carlin JB, Stern HS, Rubin DB: Texts in Statistical Science - Bayesian Data Analysis. Florida: Chapman & Hall/CRC; 2003.
Google Scholar
Dias SS, Bruno MGS: A Rao-Blackwellized random exchange diffusion particle filter for distributed emitter tracking. In IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP). St. Martin: IEEE; 2013.
Google Scholar
Sheng X, Hu Y-H, Ramanathan P: Distributed particle filter with GMM approximation for multiple targets localization and tracking in wireless sensor network. In Proceedings of the 4th International Symposium on Information Processing in Sensor Networks, IPSN ’05. Los Angeles: IEEE Press; 2005:181-188.
Google Scholar
Bordin CJ, Bruno MGS: A particle filtering algorithm for cooperative blind equalization using VB parametric approximations. In Proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP). Dallas: IEEE; 2010:3834-3837.
Google Scholar
Üstebay D, Castro R, Rabbat M: Selective gossip. In Proceedings of the 3rd IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP). Aruba: IEEE; 2009:61-64.
Chapter Google Scholar
Bar-Shalom Y, Li X-R: Multitarget-Multisensor Tracking: Principles and Techniques. Storrs, CT: University of Connecticut; 1995.
Google Scholar
Patwari N, Hero III AO, Perkins M, Correal NS, O’dea RJ: Relative location estimation in wireless sensor networks. Proc. IEEE Trans. Signal Process 2003, 51(8):2137-2148. 10.1109/TSP.2003.814469
Article Google Scholar
Coates M: Distributed particle filters for sensor networks. In Proceedings of the 3rd International Symposium on Information Processing in Sensor Networks. New York: ACM; 2004:99-107.
Google Scholar
Bordin CJ, Bruno MGS: Cooperative blind equalization of frequency-selective channels in sensor networks using decentralized particle filtering. In Proceedings of the 42nd Asilomar Conference on Signals, Systems and Computers. Pacific Grove: IEEE; 2008:1198-1201.
Google Scholar
Xiao L, Boyd S: Fast linear iterations for distributed averaging. Syst. & Control Lett 2004, 53(1):65-78. 10.1016/j.sysconle.2004.02.022
Article MATH MathSciNet Google Scholar
Bordin CJ, Bruno MGS: Consensus-based distributed particle filtering algorithms for cooperative blind equalization in receiver networks. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Prague: IEEE; 2011:3968-3971.
Chapter Google Scholar
Boyd S, Ghosh A, Prabhakar B, Shah D: Randomized gossip algorithms. IEEE Trans. Inf. Theory 2006, 52(6):2508-2530.
Article MATH MathSciNet Google Scholar
Dempster AP, Laird NM, Rubin DB: Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc 1977, 39(1):1-38.
MATH MathSciNet Google Scholar
Dias SS, Bruno MGS: Cooperative particle filtering for emitter tracking with unknown noise variance. In Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Kyoto: IEEE; 2012:2629-2632.
Chapter Google Scholar
Bruno MGS, Araújo RV, Pavlov AG: Sequential monte carlo methods for joint detection and tracking of multiaspect targets in infrared radar images. EURASIP J. Adv. Signal Process 2008., 2008: doi:10.1155/2008/217373
Google Scholar

Download references

Acknowledgements

The authors would like to thank Professor José M. F. Moura for fruitful discussions at ICASSP 2012 that motivated this work. The authors would also like to acknowledge Dr. Claudio Bordin Jr. for helpful discussions on the topic of inter-node communication cost in network particle filtering.

Author information

Authors and Affiliations

Instituto Tecnológico de Aeronáutica, Praça Marechal Eduardo Gomes 50, São José dos Campos, Sao Paulo, 12228-900, Brazil
Marcelo G S Bruno & Stiven S Dias
Embraer Defense & Security, Av. Brigadeiro Faria Lima 2.170, São José dos Campos, Sao Paulo, 12227-901, Brazil
Stiven S Dias

Authors

Marcelo G S Bruno
View author publications
You can also search for this author in PubMed Google Scholar
Stiven S Dias
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marcelo G S Bruno.

Additional information

Competing interests

The authors declare that they have no competing interests.

Marcelo G S Bruno and Stiven S Dias contributed equally to this work.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Bruno, M.G.S., Dias, S.S. Collaborative emitter tracking using Rao-Blackwellized random exchange diffusion particle filtering. EURASIP J. Adv. Signal Process. 2014, 19 (2014). https://doi.org/10.1186/1687-6180-2014-19

Download citation

Received: 01 October 2013
Accepted: 24 January 2014
Published: 13 February 2014
DOI: https://doi.org/10.1186/1687-6180-2014-19

Collaborative emitter tracking using Rao-Blackwellized random exchange diffusion particle filtering

Abstract

1 Introduction

1.1 Distributed particle filtering

1.2 Diffusion particle filtering

1.3 Paper outline

2 Problem setup

2.1 Observation model

2.2 Problem statement and goals

3 Centralized particle filter

3.1 Equivalent distributed implementation of the centralized particle filter

4 Random exchange diffusion particle filter

4.1 ReDif-PF with known sensor variances

4.2 Rao-Blackwellized ReDif-PF with unknown sensor variances

4.3 Approximate RB ReDif-PF

4.4 Differences between ReDif-PF and the Markov chain distributed particle filter

5 Simulation results

5.1 Scenario I: ReDif-PF vs. CbPF

5.2 Scenario II: ReDif-PF vs. ReDif-EKF

5.3 Scenario III: ReDif-PF vs. MCDPF/selective gossip

6 Conclusions

Appendix 1

Appendix 2

Appendix 3

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords