- Research
- Open Access

# Source localization and tracking in a dispersive medium using wireless sensor network

- Kamrul Hakim^{1} (Email author)
- Sudharman K Jayaweera^{1}

**2013**:147

https://doi.org/10.1186/1687-6180-2013-147

© Hakim and Jayaweera; licensee Springer. 2013

**Received:** 12 July 2012 **Accepted:** 18 May 2013 **Published:** 18 September 2013

## Abstract

In this paper, we address the issue of collaborative information processing for diffusive source localization and tracking using wireless sensor networks capable of sensing in a dispersive medium. We first determine the space-time concentration distribution of the dispersion from physical modeling and mathematical formulations of an underwater oil spill scenario, considering the effect of laminar water velocity as an external force. For static diffusive source localization, we propose two parametric estimation techniques, based on the maximum-likelihood (ML) estimator and the best linear unbiased estimator, for a special case of our physical dispersion model. We prove the consistency and asymptotic normality of the obtained ML solution when the number of sensor nodes and samples approach infinity, and derive the Cramér-Rao lower bound on its performance. We also propose a particle filter-based target tracking scheme for a moving diffusive source and derive the posterior Cramér-Rao lower bound for the moving source state estimates as a theoretical performance bound. The performance of the proposed schemes is shown through numerical simulations and compared with the derived theoretical bounds.

## Keywords

- Wireless sensor network
- Diffusion
- Source localization
- Tracking
- Parameter estimation
- Maximum-likelihood
- Best linear unbiased estimator
- Particle filter

## 1 Introduction

The release of liquid petroleum hydrocarbon into the ocean or coastal water due to human activity has attracted tremendous attention because of its environmental, biological, and economic impact. The recent *BP oil disaster* in the Gulf of Mexico is a clear example of how a spill from a sea-floor oil gusher can severely damage marine and wildlife habitats as well as the Gulf’s fishing and tourism industries. Research in modeling and predicting the extent of such oil spills can assist in planning and emergency decision-making, thereby reducing the threats and hazardous effects on the environment as well as the economic cost. Since this is a diffusive source estimation and tracking problem, such research is also applicable in many similar contexts such as homeland security, environmental and industrial monitoring, pollution control, and server and data center temperature monitoring [1–8]. For example, the spread of chemical and biological agents as a homeland security problem is discussed in [5, 9–11].

Recent advances in sensor technology, such as smart/intelligent nodes with cognitive abilities, on-board sensors, and wireless networking capabilities have triggered the use of wireless sensor networks (WSNs) in monitoring various physical phenomena [12–14]. Though sensor nodes are capable of a limited amount of local processing and wireless communication, when a large number of sensors communicate and share information among themselves, they can measure a desired phenomenon-of-interest in great detail. Also with the developments of unmanned autonomous vehicles, WSNs are gaining popularity due to their potential to be useful for a wide range of applications including environmental monitoring, intrusion detection, and various military and civilian applications [12, 15, 16]. Due to advanced microelectromechanical systems, many of the state-of-the-art sensors are now more accurate, robust against noise, and energy efficient [17, 18]. These new cutting-edge sensors can withstand severe unfavorable conditions in hazardous areas where human deployment is impossible. All these useful and exciting features in recently developed sensors make them suitable candidates for the set of applications involving monitoring of diffusion phenomena that we are interested in.

Source or target localization using distributed sensor arrays has been an area of active research interest for a considerable period of time [19, 20]. In the past, detection and localization of diffusive sources in WSNs have been a topic of interest, especially in the case of chemical/biological threat detection. Interesting research in this context can be found in [3, 4, 9, 10], where biochemical concentration distribution in space and time for different types of diffusive sources, diffusion models, and/or sensor networks is estimated. For instance, remotely localizing a gas or odor source using a mobile robot was proposed in [3] by fitting the gas distribution model to the gas sensor response at the sensor locations. However, the mobile sensor dynamics model therein was obtained empirically, which does not allow for a dynamic environment or a moving diffusive source. In [4], a maximum-likelihood (ML) estimator was developed for localizing vapor-emitting sources, and the asymptotic normality of the obtained ML estimator was proved when the signal-to-noise ratio (SNR) approaches infinity. Many other estimation techniques have also been used in the diffusive source parameter estimation literature [9, 10, 21–23]. In particular, Bayesian estimation has been applied in [9, 21] in a sequential manner, which is not suitable in many practical scenarios where faster estimation and immediate actions based on the estimation are top priorities. A real-time maximum-likelihood estimation method was proposed in [23] for estimating diffusive source parameters, where consistency and asymptotic efficiency of the obtained estimator were proved when the density of sensors becomes infinite. In [24], the problem of impulsive diffusive source localization was solved assuming the spatial sensor measurements at any sensor location to be a scaled and shifted version of a common prototype function, leading to a set of linear equations. However, the physical diffusion models used in [23, 24] are oversimplified, with the diffusive sources assumed to be impulsive or instantaneous in nature.

Although research has been done on tracking and/or estimating time-varying parameters in general [25–28], to the best of our knowledge, very few attempts have been made at time-varying diffusive source parameter estimation. Some of these methods cannot be applied directly to our time-varying parameter estimation model since, e.g., for a moving source, the concentration at the current time is affected by all past values of the source position. Therefore, time-cumulation effects on the concentrations (i.e., observations) must be taken into account to estimate time-varying parameters. Among previous works, a parametric moving path model for a diffusive moving source was discussed in [10], where the moving source path was approximated using a finite number of basis functions. Tracking performance in this case depends on the smoothness of the source trajectory, prior information about the moving source trajectory, and the choice of a suitable finite set of basis functions. In [29], a novel recursive algorithm was proposed to track the intensity of a diffusive point source, but the source location was considered as an unknown static value.

The aforementioned limitations may be overcome by developing or exploiting state-of-the-art Bayesian location tracking methods suitable for handling highly nonlinear diffusion processes. In the Bayesian approach, the key is to construct the posterior probability density function (PDF) of the underlying state vector based on all available information. For linear and Gaussian state dynamics and observation models, the optimal minimum mean squared error (MMSE) solution is tractable and is given by the well-known Kalman filter [30]. However, for most real-world scenarios, dynamic state estimation problems are nonlinear and non-Gaussian, and an optimal closed-form solution is not tractable under the Bayesian approach. In these cases, suboptimal approaches such as the extended Kalman filter and the Gaussian-sum filter [31] are used with certain approximations. These suboptimal algorithms become inefficient for highly nonlinear and non-Gaussian systems, for which numerical techniques based on sequential Monte Carlo methods are used to achieve better performance. To that end, the idea of particle filtering was introduced in [32] as an effective method of representing a PDF in terms of a set of random samples.

In this paper, our main objectives are to efficiently estimate and track the diffusive source location using a wireless network of chemical sensors capable of sensing in a diffusive environment. To meet these objectives, we formulate and derive a physical model for the space-time substance dispersion mechanism of an underwater diffusive source. The modeling and the proposed solution methods can also be extended to other important diffusion phenomena involving biochemical contaminant materials. We propose and implement ML and best linear unbiased estimator (BLUE)-based parameter estimation techniques for a static diffusive source which is continuously emitting substance [33]. In the previous literature, such as in [4], the asymptotic normality of the obtained ML estimator was proved when the SNR approaches infinity. We prove both the consistency and asymptotic normality of our ML-based solution when the number of sensor nodes and time samples go to infinity, thus allowing for the option of tweaking these two parameters. We derive the Cramér-Rao lower bound (CRLB) as a theoretical performance bound for a special case of our obtained physical dispersion model. We also propose a particle filter (PF)-based target tracking method for a moving diffusive source. To the best of our knowledge, moving diffusive source tracking using a particle filtering approach has not been attempted before. The posterior Cramér-Rao lower bound (PCRLB) for the moving source state estimates is also derived as a theoretical performance bound [34].

The remainder of this paper is organized as follows: Sections 2 and 3 discuss, respectively, modeling of an underwater oil spill scenario and measurement model for static diffusive source localization using sensor network. The proposed statistical methods for static diffusive source localization and corresponding theoretical performance bound are discussed in Section 4. Section 5 presents the proposed particle filter-based method for moving diffusive source tracking with theoretical performance bound analysis in detail. Section 6 shows the validity and effectiveness of our proposed methods for diffusive source localization and tracking through numerical simulations. Finally, Section 7 concludes the paper by summarizing our results.

## 2 Physical model for dispersion

$\mathbf{r} = [x, y, z]^T$ and at time $t$ as $c(\mathbf{r}, t)$. Ignoring the effects of external forces for a source-free volume and for space-invariant diffusivity constant $\kappa$, the concentration of a dispersed substance obeys the following diffusion equation [35]:

$$\frac{\partial c(\mathbf{r}, t)}{\partial t} = \kappa \nabla^2 c(\mathbf{r}, t).$$

$c_{\mathrm{G}}(\mathbf{r}, t)$, the concentration of a continuous point source with mass release rate $\mu(t)$ and initial release time $t_{\mathrm{I}}$ can then be given by the following integral:

$$c(\mathbf{r}, t) = \int_{t_{\mathrm{I}}}^{t} \mu(\tau)\, c_{\mathrm{G}}(\mathbf{r}, t - \tau)\, d\tau. \qquad (1)$$

For parametric estimation case, it is to be noted that from the concentration measurements taken by the sensors, we can first estimate the source parameters of interest and then predict its cloud evolution in space and time by inserting the estimated parameters into (1).
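As a rough numerical illustration of the superposition integral in (1), the following Python sketch assumes an unbounded medium with no advection, for which the Green's function is the standard 3-D Gaussian diffusion kernel. This is a simplifying assumption and not the bounded, advective solution derived in this section; the function names and the midpoint rule are illustrative choices. For a constant release rate, the integral has a known closed form in terms of the complementary error function, which serves as a check.

```python
import math

def green_free_space(d, elapsed, kappa):
    """Assumed kernel: 3-D free-space diffusion at distance d after `elapsed` > 0."""
    return math.exp(-d * d / (4.0 * kappa * elapsed)) / (4.0 * math.pi * kappa * elapsed) ** 1.5

def concentration_numeric(d, t, mu, kappa, t_init=0.0, steps=20000):
    """Midpoint-rule evaluation of the superposition integral for constant mu."""
    span = t - t_init
    if span <= 0.0:
        return 0.0  # nothing has been released yet
    h = span / steps
    total = 0.0
    for i in range(steps):
        elapsed = (i + 0.5) * h  # elapsed time t - tau at the midpoint
        total += green_free_space(d, elapsed, kappa)
    return mu * total * h

def concentration_closed_form(d, t, mu, kappa, t_init=0.0):
    """Closed form of the same integral for an unbounded medium and constant mu."""
    return mu / (4.0 * math.pi * kappa * d) * math.erfc(d / (2.0 * math.sqrt(kappa * (t - t_init))))

# Illustrative values: 10 m from the source, 30 s after release starts.
numeric = concentration_numeric(10.0, 30.0, mu=1000.0, kappa=25.0)
closed = concentration_closed_form(10.0, 30.0, mu=1000.0, kappa=25.0)
print(numeric, closed)
```

Once source parameters have been estimated, the same routine can be evaluated on a grid of positions and times to predict how the cloud evolves under these assumptions.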

$\mathbf{r}_0 = [x_0, y_0, z_0]^T$. The depth of water level is $0 \le z \le L$ with diffusivity $\kappa_w$ and concentration $c_w$. The same quantities for air ($z > L$) are denoted as $\kappa_a$ and $c_a$, respectively. Along the $z$-axis, we need to solve the following differential equations:

$z = z_0$, where $0 \le z_0 \le L$ and impermeable boundary at $z = 0$, we have the following initial condition:

$z = L$. Condition (3) represents the fact that there is no accumulation of diffusing substance at $z = L$. Finally, the third boundary condition in (4) reflects the assumption that the medium at $z = 0$ is impermeable. By applying the concept of Laplace transform on the above system of partial differential equations, we can obtain the solution to the spatio-temporal concentration distribution (omitting the details in [10, 35, 36]):

$\kappa_w \gg \kappa_a$, then $\rho \to 1$. Therefore we have,

$X$-$Y$-plane as an external force, we have $\mathbf{v} = [v_x, v_y, 0]^T$. The diffusion equations along the $x$ and $y$ axes will include additional *advection* term [35]:

$X$-$Y$-plane, there is no boundary condition and the initial condition is given as

$x$ and $y$ axes [37]:

$c_{\mathrm{G}}(\mathbf{r}, t)$ for the space-time concentration distribution can be obtained as the product of the solutions in (5), (6), and (7):

$\mathbf{r}_0 = [x_0, y_0, z_0]^T$ and $\mathbf{r}' = [x_0, y_0, -z_0]^T$. Considering the source mass release rate to be constant $\mu(t) = \mu$, the final solution for concentration of oil diffusion in water for stationary continuous source with mass rate of $\mu(t)$ can be obtained from (1):

The derivation of (9) is given in Appendix 1. For simplicity, from here on we denote the diffusivity constant $\kappa_w = \kappa$.

### 2.1 Moving diffusive source

$\mu(t)$, the space-time concentration distribution in a semi-infinite medium can be obtained for a given Green’s function $c_{\mathrm{G}}(\mathbf{r}, t)$ using the following integral:

$$c(\mathbf{r}, t) = \int_{t_{\mathrm{I}}}^{t} \mu(\tau)\, c_{\mathrm{G}}\left(\mathbf{r} - \mathbf{r}_0(\tau), t - \tau\right) d\tau, \qquad (10)$$

where $\mathbf{r}_0(t) = [x_0(t), y_0(t), z_0(t)]^T$ represents the source moving path. The advantage of solving the physical diffusion model corresponding to a moving diffusive source using (10) is that the initial, boundary, and other necessary conditions can be taken into account to solve for the stationary case in the first step before extending it to the moving source case.
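To make the time-cumulation effect of a moving source concrete, the sketch below evaluates the moving-source integral numerically. It assumes a constant release rate and, for simplicity, the 3-D free-space Gaussian kernel rather than the semi-infinite-medium Green's function of this section; `path` is a hypothetical callable returning the source position at a given time.

```python
import math

def green_free_space(d, elapsed, kappa):
    """Assumed kernel: 3-D free-space diffusion at distance d after `elapsed` > 0."""
    return math.exp(-d * d / (4.0 * kappa * elapsed)) / (4.0 * math.pi * kappa * elapsed) ** 1.5

def moving_source_concentration(r, t, path, mu, kappa, t_init=0.0, steps=20000):
    """Midpoint-rule approximation of the moving-source integral (constant mu).

    `path(tau)` returns the source position (x, y, z) at time tau, so every
    past source position contributes to the concentration observed at time t.
    """
    span = t - t_init
    if span <= 0.0:
        return 0.0
    h = span / steps
    total = 0.0
    for i in range(steps):
        tau = t_init + (i + 0.5) * h
        d = math.dist(r, path(tau))
        total += green_free_space(d, t - tau, kappa)
    return mu * total * h

sensor = (10.0, 0.0, 0.0)
# Static source at the origin versus a source drifting away from the sensor.
static = moving_source_concentration(sensor, 30.0, lambda tau: (0.0, 0.0, 0.0), 1000.0, 25.0)
moving = moving_source_concentration(sensor, 30.0, lambda tau: (-2.0 * tau, 0.0, 0.0), 1000.0, 25.0)
print(static, moving)
```

With a static path the routine reduces to the stationary case, which is one way to check an implementation before trusting it on a moving trajectory.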

## 3 Measurement and system models for static diffusive source localization

$N$ spatially distributed biochemical static sensor nodes capable of sensing in dispersive environment. For practical consideration, we assume that the $N$ distributed sensors are located in a rectangular volume in space such that $\mathbf{r}_j = [x_j, y_j, z_j]^T \in \Lambda$, $\forall j \in \{1, 2, \dots, N\}$, where $\Lambda = [a_1, a_2] \times [b_1, b_2] \times [c_1, c_2] \subseteq \mathbb{R}^3$. It is also assumed that the source-to-sensor distances are much larger than the source and sensor dimensions. Each sensor node takes measurements at times $t_k$, $\forall k \in \{1, 2, \dots, T\}$, where $T$ is the total number of time samples. Assuming that the physical model discussed before is the underlying dispersion mechanism, we may obtain a measurement model for a sensor at a position $\mathbf{r}_j$ and at time $t_k$ as $y(\mathbf{r}_j, t_k) = c(\mathbf{r}_j, t_k) + e(\mathbf{r}_j, t_k) + b$, where $c(\mathbf{r}_j, t_k)$ is the concentration of interest, $b$ is a bias term, and $e(\mathbf{r}_j, t_k) \sim \mathcal{N}(0, \sigma^2)$ is the sensor noise assumed to be independent in both time and space. For the sake of brevity, it can be rewritten in the simplified form as

$$y_{j,k} = c_{j,k}(\boldsymbol{\theta}) + b + e_{j,k}, \qquad (11)$$

where $y_{j,k} = y(\mathbf{r}_j, t_k)$, $e_{j,k} = e(\mathbf{r}_j, t_k)$, $c_{j,k}(\boldsymbol{\theta}) = c(\mathbf{r}_j, t_k)$, $\boldsymbol{\theta} \in \mathbb{R}^{n \times 1}$ is the unknown source and medium parameter vector that we are interested in estimating, and $b$ is the bias or clutter term representing the sensor’s response to foreign substances that may be present in a diffusive field of interest. The bias term is assumed to be space- and time-invariant, i.e., the foreign substances interfering with the actual measurements are in steady state. If we want to localize a static diffusive source, then only $[x_0, y_0, z_0]$ are the parameters of interest. It is to be noted that some of the parameters, such as the diffusivity constant $\kappa$, bias term $b$, and noise variance $\sigma^2$, can be measured at the calibration stage, thereby reducing the cost of computation during the detection/estimation phase.

We assume that the sensor nodes are in sleep mode until they are activated by some central control (i.e., the fusion center, FC) due to a possible release of a substance from a diffusive source. The activated sensor nodes take measurements of the substance’s concentration at the time instants $t_k$ and then return to sleep mode. For $N$ nodes in a WSN, with each node taking $T$ time samples of the substance concentration at its location, let $\boldsymbol{y} \in \mathbb{R}^{NT \times 1}$ be the measurement vector received at the FC.
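The measurement model can be simulated with a few lines of Python. The sketch below stacks the $NT$ readings (concentration plus bias plus Gaussian noise) sensor by sensor. The `concentration` function is a stand-in that assumes a continuous point source in an unbounded medium; it is not the model derived in Section 2, and all names are illustrative.

```python
import math
import random

def concentration(sensor, source, t, mu, kappa, t_init=0.0):
    """Stand-in model (an assumption): continuous point source, unbounded medium."""
    if t <= t_init:
        return 0.0
    d = max(math.dist(sensor, source), 1e-9)  # guard against a zero distance
    return mu / (4.0 * math.pi * kappa * d) * math.erfc(d / (2.0 * math.sqrt(kappa * (t - t_init))))

def simulate_measurements(sensors, times, source, mu, kappa, bias, sigma, rng):
    """Return the stacked NT-vector y, ordered sensor by sensor, then by time."""
    y = []
    for r_j in sensors:
        for t_k in times:
            c_jk = concentration(r_j, source, t_k, mu, kappa)
            y.append(c_jk + bias + rng.gauss(0.0, sigma))  # concentration + bias + noise
    return y

sensors = [(10.0, 0.0, 0.0), (0.0, 20.0, 0.0), (-15.0, -15.0, 0.0)]  # N = 3
times = [0.5, 1.0, 1.5, 2.0]                                          # T = 4
y = simulate_measurements(sensors, times, (0.0, 0.0, 0.0), mu=1000.0, kappa=25.0,
                          bias=1e-4, sigma=0.01, rng=random.Random(0))
print(len(y))  # N * T readings
```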

## 4 Static diffusive source localization

$L \to \infty$) underwater medium. In this case, the concentration at any position $\mathbf{r}_j$ at time $t_k$ is reduced to the following expression [4, 35]:

where $\operatorname{erfc}(\cdot)$ is the complementary error function.

### 4.1 Maximum-likelihood-based source localization

$j$th node at time $t_k$ is $p(y_{j,k} | \boldsymbol{\theta}) \sim \mathcal{N}(c_{j,k}(\boldsymbol{\theta}) + b, \sigma^2)$. Hence, the log-likelihood function formed at the FC can be written as

$$\ln p(\boldsymbol{y} | \boldsymbol{\theta}) = -\frac{NT}{2} \ln\left(2\pi\sigma^2\right) - \frac{1}{2\sigma^2} \sum_{j=1}^{N} \sum_{k=1}^{T} \left(y_{j,k} - c_{j,k}(\boldsymbol{\theta}) - b\right)^2.$$

$u = 1, 2, 3$, where $\theta_u$ is the $u$th element of $\boldsymbol{\theta}$, and

Since the system of equations in (14) is nonlinear, it has no closed-form solution. We can obtain an ML estimate of the source location using any suitable nonlinear optimization technique. In this case, (14) is solved using the simplex search algorithm [38].
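Because the noise is Gaussian with known variance and bias, maximizing the likelihood is equivalent to minimizing the sum of squared residuals between the measurements and the model. The sketch below does this with a compact, simplified Nelder-Mead simplex search written for illustration; it is not claimed to be the exact algorithm of [38]. The `concentration` stand-in (continuous point source, unbounded medium), the sensor layout, and all parameter values are assumptions.

```python
import math

def concentration(sensor, source, t, mu, kappa, t_init=0.0):
    """Stand-in model (an assumption): continuous point source, unbounded medium."""
    if t <= t_init:
        return 0.0
    d = max(math.dist(sensor, source), 1e-9)  # guard against a zero distance
    return mu / (4.0 * math.pi * kappa * d) * math.erfc(d / (2.0 * math.sqrt(kappa * (t - t_init))))

def make_cost(y, sensors, times, mu, kappa, bias):
    """Sum of squared residuals; its minimizer maximizes the Gaussian likelihood."""
    def cost(theta):
        total, idx = 0.0, 0
        for r_j in sensors:
            for t_k in times:
                resid = y[idx] - concentration(r_j, theta, t_k, mu, kappa) - bias
                total += resid * resid
                idx += 1
        return total
    return cost

def nelder_mead(f, x0, step=5.0, iters=300):
    """Simplified simplex search: reflect, expand, contract, or shrink."""
    n = len(x0)
    simplex = [list(x0)] + [[x0[j] + (step if j == i else 0.0) for j in range(n)] for i in range(n)]
    for _ in range(iters):
        simplex.sort(key=f)
        best, worst = simplex[0], simplex[-1]
        centroid = [sum(p[j] for p in simplex[:-1]) / n for j in range(n)]
        reflect = [2.0 * c - w for c, w in zip(centroid, worst)]
        if f(reflect) < f(best):
            expand = [3.0 * c - 2.0 * w for c, w in zip(centroid, worst)]
            simplex[-1] = expand if f(expand) < f(reflect) else reflect
        elif f(reflect) < f(simplex[-2]):
            simplex[-1] = reflect
        else:
            contract = [0.5 * (c + w) for c, w in zip(centroid, worst)]
            if f(contract) < f(worst):
                simplex[-1] = contract
            else:  # shrink every vertex toward the best one
                simplex = [best] + [[0.5 * (b + p) for b, p in zip(best, pt)] for pt in simplex[1:]]
    return min(simplex, key=f)

# Noise-free toy problem: 16 sensors on a plane, 10 time samples each.
sensors = [(x, y) for x in (-30.0, -10.0, 10.0, 30.0) for y in (-30.0, -10.0, 10.0, 30.0)]
times = [3.0 * (k + 1) for k in range(10)]
true_source = (4.0, -6.0)
mu, kappa, bias = 1000.0, 25.0, 1e-4
y = [concentration(r, true_source, t, mu, kappa) + bias for r in sensors for t in times]
cost = make_cost(y, sensors, times, mu, kappa, bias)
start = [15.0, 15.0]
estimate = nelder_mead(cost, start)
print(estimate, cost(estimate))
```

A simplex search is derivative-free, which suits this cost function, but it can stall in a local minimum, so the starting point matters in practice.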

$(u, v)$th element of the FIM can be found as

where (16) was obtained using the independence assumption of observations in space and in time.
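For independent Gaussian noise with a mean that depends on the parameters, each Fisher information entry is a sum over sensors and time samples of products of partial derivatives of the concentration, divided by $\sigma^2$. The sketch below evaluates that sum with central finite differences for a 2-D source location and inverts the resulting $2 \times 2$ matrix to get the CRLB. The `concentration` stand-in (continuous point source, unbounded medium), the step size `eps`, and the test geometry are assumptions.

```python
import math

def concentration(sensor, source, t, mu, kappa, t_init=0.0):
    """Stand-in model (an assumption): continuous point source, unbounded medium."""
    if t <= t_init:
        return 0.0
    d = max(math.dist(sensor, source), 1e-9)
    return mu / (4.0 * math.pi * kappa * d) * math.erfc(d / (2.0 * math.sqrt(kappa * (t - t_init))))

def fisher_information(sensors, times, source, mu, kappa, sigma, eps=1e-4):
    """FIM for the source coordinates, using central finite differences."""
    n = len(source)
    fim = [[0.0] * n for _ in range(n)]
    for r_j in sensors:
        for t_k in times:
            grad = []
            for u in range(n):
                plus, minus = list(source), list(source)
                plus[u] += eps
                minus[u] -= eps
                grad.append((concentration(r_j, plus, t_k, mu, kappa)
                             - concentration(r_j, minus, t_k, mu, kappa)) / (2.0 * eps))
            for u in range(n):
                for v in range(n):
                    fim[u][v] += grad[u] * grad[v] / sigma ** 2
    return fim

def crlb_2d(fim):
    """Diagonal of the inverse of a 2x2 FIM: variance bounds for x0 and y0."""
    (a, b), (c, d) = fim
    det = a * d - b * c
    return d / det, a / det

sensors = [(-20.0, -20.0), (-20.0, 20.0), (20.0, -20.0), (20.0, 20.0)]
times = [5.0, 10.0, 20.0]
fim = fisher_information(sensors, times, (5.0, -3.0), mu=1000.0, kappa=25.0, sigma=0.01)
var_x, var_y = crlb_2d(fim)
print(var_x, var_y)
```

Since the information adds up over sensors and time samples, adding nodes or samples can only tighten the bound.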

A sequence of estimators $\hat{\boldsymbol{\theta}}_n$ of an unknown parameter vector $\boldsymbol{\theta}$ is said to be consistent if the sequence converges *in probability* to $\boldsymbol{\theta}$, i.e., $\operatorname{plim}_{n \to \infty} \hat{\boldsymbol{\theta}}_n = \boldsymbol{\theta}$, where $n$ is the sample size [30]. It is desirable to have a consistent ML estimator (MLE), as consistency ensures that for large data sets the MLE will converge to the true parameter. The MLE obtained for our source localization problem is consistent when the number of sensor nodes in any non-negligible open subset of $\Lambda = [a_1, a_2] \times [b_1, b_2] \times [c_1, c_2] \subseteq \mathbb{R}^3$ and the number of time samples go to infinity.

#### Theorem 1.

*If the number of sensors* $N$ *increases in such a way that for any open subset of* $\Lambda = [a_1, a_2] \times [b_1, b_2] \times [c_1, c_2] \subseteq \mathbb{R}^3$ *having positive area, the number of sensors* $N$ *and/or the number of time samples* $T$ *tend to infinity, the obtained ML estimator is consistent.*

*Proof.* See Appendix 2.

Once consistency of the obtained MLE is established, the next step is to check its asymptotic normality. An asymptotically normal estimator is a consistent estimator whose distribution around the true parameter $\boldsymbol{\theta}$ approaches a normal distribution with standard deviation shrinking in proportion to $1/\sqrt{n}$ as the sample size $n$ grows, i.e., $\sqrt{n \mathbf{I}_{\boldsymbol{\theta}}} \left(\hat{\boldsymbol{\theta}}_n - \boldsymbol{\theta}\right) \to \mathcal{N}\left(\mathbf{0}, \mathbf{I}^{-1}\right)$, where $\mathbf{I}_{\boldsymbol{\theta}}$ and $\mathbf{I}$ are the Fisher information and identity matrices, respectively [30]. This ensures that the estimator not only converges to the unknown parameter, but does so at a rate of $1/\sqrt{n}$. We address this issue with the following theorem on asymptotic normality.

#### Theorem 2.

*If the number of sensors* $N$ *and time samples* $T$ *increase as in Theorem 1, then for a true parameter vector* $\boldsymbol{\theta}_0 \in \mathring{\Lambda}$, *where* $\mathring{\Lambda} \subset \Lambda$ *is an open subset of* $\Lambda$, *the following is true:*

*in distribution, where the* $(u, v)$*th element of the matrix* $\bar{\mathbf{I}}_{\boldsymbol{\theta}}$ *is given by*

*Proof.* See Appendix 3. □

### 4.2 Best linear unbiased estimator-based source localization

The advantages of using the BLUE for static diffusive source localization are that there are no constraints on the PDF and also knowing only the mean and covariance of the measurements is enough. However, observations have to be linear for the BLUE algorithm. In this subsection, we assume that the distributed sensing nodes are capable of estimating their respective distances from the source using BLUE.

$j$th node at the $k$th time instant can be linearized in terms of the inverse of the source-to-node distances from (11) and (12):

$j$th node can be written as

$h$s and $\mathbf{e}_j = [e_{j,1}, e_{j,2}, \dots, e_{j,T}]^T$. Since $e_{j,k} \sim \mathcal{N}(0, \sigma^2)$ for all $j, k$ and measurement noise is assumed to be independent and identically distributed across space and time, the covariance matrix of $\tilde{\boldsymbol{y}}_j$ is $\tilde{\Sigma}_j = \operatorname{diag}\left(\sigma^2, \sigma^2, \dots, \sigma^2\right) \in \mathbb{R}^{T \times T}$. The optimal BLUE estimator formed at the $j$th node is given by

with estimator variance $V_j = \left(\boldsymbol{h}^T \tilde{\Sigma}_j^{-1} \boldsymbol{h}\right)^{-1}$.
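For a scalar parameter observed through a linear model with known noise covariance, the BLUE is a weighted least-squares ratio, and its variance is the inverse of the weighted energy of the regressor. The sketch below implements that generic textbook form for a diagonal covariance. The names `y`, `h`, and `variances` are illustrative, and the sketch does not reproduce the specific linearized quantities used at each node in the text.

```python
def blue_scalar(y, h, variances):
    """BLUE of a scalar theta in y = h * theta + e with diagonal noise covariance.

    Returns (estimate, variance), where the variance equals (h^T Sigma^-1 h)^-1.
    """
    num = sum(h_k * y_k / v_k for h_k, y_k, v_k in zip(h, y, variances))
    den = sum(h_k * h_k / v_k for h_k, v_k in zip(h, variances))
    return num / den, 1.0 / den

# Toy data: theta = 0.5 observed without noise through h = [1, 2, 3].
h = [1.0, 2.0, 3.0]
y = [0.5, 1.0, 1.5]
estimate, variance = blue_scalar(y, h, [0.01, 0.01, 0.01])
print(estimate, variance)
```

With equal variances the estimate reduces to ordinary least squares, and the variance shrinks as more time samples are added to `h`.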

$j$th node can be expressed as $f_j = \hat{d}_j + w_j$, where $w_j$ is normally distributed with mean 0 and variance $\sigma_m^2$. For $N$ nodes, the data vector available at the FC can be written as

$w_1, w_2, \dots, w_N]^T$. The data vector $\mathbf{F}$ formed at the FC can be used to estimate the diffusive source location using the nonlinear least-squares approach:

To solve for the source location from (20), the simplex search algorithm [38] has been used.

## 5 Moving diffusive source tracking

### 5.1 State dynamics model

$X$-$Y$-plane. The assumption can be easily extended to the 3D case without any loss of generality. Let us denote by $\mathbf{s}_k = \left[x_{s,k}\;\; y_{s,k}\;\; \dot{x}_{s,k}\;\; \dot{y}_{s,k}\right]^T$ the state vector associated with the moving source at time $t_k$, where the first two elements represent the source position in 2D and the last two elements represent the corresponding velocity components of the moving source. We assume a linear dynamic model for the source state vector:

$k = 1, 2, \dots$, with the initial known distribution $p(\mathbf{s}_0)$ for $\mathbf{s}_k$, where $\mathbf{F}$ is a $4 \times 4$ matrix that models the state kinematics [39]:

$T_{\mathrm{s}}$ is the time difference between two consecutive measurements. The noise vector $\mathbf{u}_k$ is assumed to be zero-mean Gaussian with covariance matrix $\mathbf{Q}$ [39]:

which models the acceleration terms in the spatial directions, and $\sigma_u^2$ is the variance of the process noise.
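As an illustration of such a state model, the sketch below builds the transition and process-noise matrices of a commonly used constant-velocity model with white-noise acceleration, for the state ordering $[x, y, \dot{x}, \dot{y}]$. The specific entries are an assumption made for illustration and may differ from the matrices in [39]; the function names are hypothetical.

```python
def cv_matrices(ts, sigma_u2):
    """Constant-velocity F and Q for state [x, y, vx, vy] (assumed standard form)."""
    F = [[1.0, 0.0, ts, 0.0],
         [0.0, 1.0, 0.0, ts],
         [0.0, 0.0, 1.0, 0.0],
         [0.0, 0.0, 0.0, 1.0]]
    q_pp = sigma_u2 * ts ** 3 / 3.0  # position-position term
    q_pv = sigma_u2 * ts ** 2 / 2.0  # position-velocity cross term
    q_vv = sigma_u2 * ts             # velocity-velocity term
    Q = [[q_pp, 0.0, q_pv, 0.0],
         [0.0, q_pp, 0.0, q_pv],
         [q_pv, 0.0, q_vv, 0.0],
         [0.0, q_pv, 0.0, q_vv]]
    return F, Q

def propagate(F, state):
    """Noise-free one-step prediction F * s."""
    return [sum(F[i][j] * state[j] for j in range(4)) for i in range(4)]

F, Q = cv_matrices(ts=0.5, sigma_u2=0.1)
next_state = propagate(F, [0.0, 0.0, 2.0, -1.0])  # moves by velocity * ts
print(next_state)
```

In this form every entry of `Q` grows with the sampling interval, so longer gaps between measurements inject more uncertainty into the predicted state.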

### 5.2 Observation model

$\mathbf{r}_{j,k}$ and at time $t_k$ as

$$z_{j,k} = c_{j,k} + \nu_{j,k},$$

where $z_{j,k}$ is the $j$th node’s observation at time $t_k$; $c_{j,k} \triangleq c(\mathbf{r}_{j,k}, t_k) = \int_{t_{\mathrm{I}}}^{t_k} \mu(\tau)\, c_{\mathrm{G}}\left(\mathbf{r}_{j,k} - \mathbf{r}_s(\tau), t_k - \tau\right) d\tau$ is the substance concentration at the $j$th node location at time $t_k$; the moving diffusive source location at time $t_k$ is $\mathbf{r}_{s,k} = \tilde{\mathbf{s}}_k = \left[x_{s,k}, y_{s,k}\right]^T$; the location of the $j$th node at time $t_k$ is $\mathbf{r}_{j,k} = [x_{j,k}, y_{j,k}]^T$; and $\nu_{j,k} \sim \mathcal{N}\left(0, \sigma_\nu^2\right)$ is the sensor measurement noise assumed to be independent in both time and space. Note that for static sensor node locations, we use $\mathbf{r}_{j,k} = \mathbf{r}_j = [x_j, y_j]^T$ by dropping the time index, since node locations do not change over time. By assuming the additive white Gaussian noise channel for the sake of simplicity, the received signal at the FC from the $j$th node at time $t_k$ can be written as

$$y_{j,k} = z_{j,k} + \varepsilon_{j,k} = c_{j,k} + e_{j,k},$$

where $\varepsilon_{j,k}$ is the received noise, assumed to be Gaussian with mean 0 and variance $\sigma_\varepsilon^2$, $e_{j,k} = \varepsilon_{j,k} + \nu_{j,k}$, and $\sigma^2 = \sigma_\nu^2 + \sigma_\varepsilon^2$. We denote $\boldsymbol{y}_{j,1:k}$ as the measurement vector from the $j$th node up to the time $t_k$, and $\mathbf{y}_{c,1:k} \triangleq \{\boldsymbol{y}_{1,1:k}, \boldsymbol{y}_{2,1:k}, \dots, \boldsymbol{y}_{N,1:k}\}^T$ as the collection of all measurements at the FC from the $N$ distributed sensor nodes.

$T_{\mathrm{s}}$, we can assume that the moving diffusive source moves in a linear fashion between two observations with an average velocity determined by the source locations $\mathbf{r}_{s,k}$ and $\mathbf{r}_{s,k+1}$. For 2D moving diffusive source tracking with no external force in action, the Green’s function can be obtained from (6) and (7) as

$\mu(t) = \mu$, observations taken by the $j$th node at the $k$th time instant can be written as

### 5.3 Target tracking using particle filters

In Bayesian belief update, to estimate state vector $\mathbf{s}_k$ at time instant $k$, we need to construct posterior distribution $p(\mathbf{s}_k | \mathbf{y}_{c,1:k})$ with initial PDF $p(\mathbf{s}_0)$. The Bayesian belief update is done in two stages: *prediction* and *update*.

*Prediction.* Considering that $p(\mathbf{s}_{k-1} | \mathbf{y}_{c,1:k-1})$ is available at time $k$, the PDF $p(\mathbf{s}_k | \mathbf{y}_{c,1:k-1})$ can be obtained as [40]

*Update.* If observations $\mathbf{y}_{c,1:k}$ are available at time instant $k$, the posterior distribution to estimate the state vector $\mathbf{s}_k$ is given by [40]

Since the observation model is highly nonlinear, the analytical solution for the optimal estimator is not tractable in our case. Hence, we use sequential Monte Carlo method to approximate the posterior PDF (27) with particle filters [32].

Let us denote $\mathcal{X}_k = \left\{\mathbf{s}_k^i, w_k^i\right\}_{i=1}^{P}$ to be the random measure that characterizes the posterior PDF $p(\mathbf{s}_k | \mathbf{y}_{c,1:k})$, where $P$ is the number of particles. Then $p\left(\mathbf{s}_k | \mathbf{y}_{c,1:k}\right) \approx \sum_{i=1}^{P} w_k^i\, \delta\left(\mathbf{s}_k - \mathbf{s}_k^i\right)$, where $\delta(\cdot)$ is the Dirac delta function. The state vector estimate at time $t_k$ can be obtained as $\hat{\mathbf{s}}_{k|k} \approx \sum_{i=1}^{P} w_k^i \mathbf{s}_k^i$, and the covariance matrix $U_{k|k}$ of the estimate is $U_{k|k} \approx \sum_{i=1}^{P} w_k^i \left(\mathbf{s}_k^i - \hat{\mathbf{s}}_{k|k}\right) \left(\mathbf{s}_k^i - \hat{\mathbf{s}}_{k|k}\right)^T$. The predicted state $\hat{\mathbf{s}}_{k+1|k}$ and the corresponding covariance matrix $U_{k+1|k}$ can be obtained from the state dynamics in (21) as $\hat{\mathbf{s}}_{k+1|k} = \mathbf{F} \hat{\mathbf{s}}_{k|k}$ and $U_{k+1|k} = \mathbf{F} U_{k|k} \mathbf{F}^T + \mathbf{Q}$.
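The weighted-particle quantities above translate directly into code. The sketch below shows a Gaussian-likelihood weight update, the weighted state estimate and covariance, and a systematic resampling step. The resampling scheme and the function names are assumptions added for illustration, since the text does not specify them; `predicted` stands for the model concentrations that each particle implies at the sensors.

```python
import math
import random

def normalize(weights):
    total = sum(weights)
    return [w / total for w in weights]

def update_weights(weights, predicted, observed, sigma):
    """Scale each (positive) weight by the Gaussian likelihood of the observations."""
    logw = []
    for w, pred in zip(weights, predicted):
        sq = sum((y - c) ** 2 for y, c in zip(observed, pred))
        logw.append(math.log(w) - sq / (2.0 * sigma ** 2))
    peak = max(logw)  # subtract the peak for numerical stability
    return normalize([math.exp(v - peak) for v in logw])

def weighted_mean(particles, weights):
    dim = len(particles[0])
    return [sum(w * p[d] for p, w in zip(particles, weights)) for d in range(dim)]

def weighted_covariance(particles, weights, mean):
    dim = len(mean)
    cov = [[0.0] * dim for _ in range(dim)]
    for p, w in zip(particles, weights):
        diff = [p[d] - mean[d] for d in range(dim)]
        for a in range(dim):
            for b in range(dim):
                cov[a][b] += w * diff[a] * diff[b]
    return cov

def systematic_resample(particles, weights, rng):
    """Draw P particles with evenly spaced pointers; returns uniform weights."""
    count = len(particles)
    start = rng.random() / count
    out, i, cumulative = [], 0, weights[0]
    for m in range(count):
        pointer = start + m / count
        while pointer > cumulative and i < count - 1:
            i += 1
            cumulative += weights[i]
        out.append(list(particles[i]))
    return out, [1.0 / count] * count

particles = [[0.0, 0.0], [2.0, 4.0]]
weights = normalize([1.0, 3.0])
mean = weighted_mean(particles, weights)
cov = weighted_covariance(particles, weights, mean)
print(mean, cov)
```

Resampling counters weight degeneracy, where a few particles end up carrying almost all of the probability mass after several updates.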

### 5.4 Posterior Cramér-Rao lower bound analysis

$\mathbf{S}_k$ and $\mathbf{y}_{c,1:k}$ for an arbitrary $k$ is $p(\mathbf{S}_k, \mathbf{y}_{c,1:k}) = p_k$, where $\mathbf{y}_{c,1:k}$ is the observation vector formed at the FC at $k$th time instant and $\mathbf{S}_k = (\mathbf{s}_0, \mathbf{s}_1, \dots, \mathbf{s}_k)$. Following (26), the concentration at any time $k+1$ for any node $j$ can be written as

$(k+1)$th time instant formed at the FC is given by

Let $I(\mathbf{S}_k) \in \mathbb{R}^{4k \times 4k}$ be the information matrix derived from the joint distribution $p_k$. We wish to solve for the information submatrix for estimating $\mathbf{s}_k$, denoted by $I_k$. The following theorem gives a two-step recipe for computing $I_k$.

#### Theorem 3.

*The sequence* $\{I_{k+1}\}$ *of the posterior information submatrices for estimating state vectors* $\mathbf{s}_{k+1}$ *can be computed as follows:*

*where* $M_{k+1} = -\mathbb{E}\left\{\Delta_{\mathbf{s}_{k+1}}^{\mathbf{s}_k} \mathcal{L}_{k+1}\right\}$, $D_{k+1} = -\mathbb{E}\left\{\Delta_{\mathbf{s}_{k+1}}^{\mathbf{s}_{k+1}} \log p_{k+1}\right\}$, $L_{k+1} = \left[-\mathbb{E}\left\{\Delta_{\mathbf{s}_{k+1}}^{\mathbf{s}_0} \mathcal{L}_{k+1}\right\} \;\; -\mathbb{E}\left\{\Delta_{\mathbf{s}_{k+1}}^{\mathbf{s}_1} \mathcal{L}_{k+1}\right\} \;\; \dots \;\; -\mathbb{E}\left\{\Delta_{\mathbf{s}_{k+1}}^{\mathbf{s}_{k-1}} \mathcal{L}_{k+1}\right\}\right]$, *and* $\Delta_{\Phi}^{\Theta} = \nabla_{\Phi} \nabla_{\Theta}^T$ *with* $\nabla$ *being the gradient operator.*

*Proof.* See Appendix 4.

Note that the information submatrix computation in (28) involves computing the inverse of a matrix of size $4k \times 4k$. This is because the output $y_{j,k+1}$ at the $j$th node at the $(k+1)$th time instant is a function of all the previous states $\mathbf{S}_{k+1}$.

## 6 Simulation results

In the following, we show the performances of our proposed models and schemes through numerical simulations.

### 6.1 Simulations for the physical model in Section 2

$\mathbf{r}_0$ based on the physical diffusion model formulated in Section 2. The parameters used for this simulation are oil release rate $\mu = 10^3$ kg/s, diffusivity constant of oil in saline water $\kappa = 25$ m$^2$/s, initial release time $t_{\mathrm{I}} = 0$ s, and laminar water velocity $\mathbf{v} = [50, 50, 0]$ m/s. The oil spill source is assumed to be located at $\mathbf{r}_0 = [0, 0, 0]^T$ and the depth of water is taken to be $L = 100$ m from the sea bed. Figure 2 shows the spatial concentration distribution for two different time instants, $t = 1$ s and $t = 100$ s. It can be seen from Figure 2 that, as the oil source is located at the origin, the concentration is high near the origin at $t = 1$ s. By $t = 100$ s, the oil has diffused over a larger distance from the source. Since the laminar water flow is assumed to be active only in the positive $x$ and $y$ directions, the concentration increases more along the positive $X$-$Y$-plane as time increases.

### 6.2 Static diffusive source localization

Here, we show the simulation results in estimating the location of a static diffusive source using the proposed MLE and BLUE-based methods from the concentration observations taken by the sensing nodes. For the sake of simplicity, we consider a 2D diffusive field volume of $\Lambda = [-50, 50] \times [-50, 50]$ m$^2$. We assume that the sensors are placed in a uniform 2D grid such that the distance between adjacent sensors along the same ordinate is approximately 14.3 m. Parameters used for simulations are number of nodes $N = 64$, $\mathbf{r}_0 = [0, 0]^T$, $\mu = 1{,}000$ kg/s, $b = 10^{-4}$ kg/m$^2$, $t_{\mathrm{I}} = 0$ s, and $\kappa = 25$ m$^2$/s. The observation noise is assumed to have Gaussian distribution with mean 0 and variance $\sigma^2 = 1 \times 10^{-4}$ kg/m$^2$. The total number of random realizations used for simulations is 100. The measurements are taken at every 0.5 s time-step starting from 0.5 s ending at 30 s. In the case of BLUE estimator, the received noise variance at the fusion center is assumed to be $\sigma_m^2 = 0.01, 10$ m$^2$.
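The stated geometry can be reproduced in a few lines. The sketch below assumes an $8 \times 8$ arrangement with nodes on the boundary of the field, which is an inference from $N = 64$ and the quoted spacing of about 14.3 m (100 m divided into 7 intervals) rather than something stated explicitly; the function name is illustrative.

```python
def uniform_grid(lower, upper, per_side):
    """Square grid of per_side x per_side nodes spanning [lower, upper] on both axes."""
    spacing = (upper - lower) / (per_side - 1)
    coords = [lower + i * spacing for i in range(per_side)]
    return [(x, y) for x in coords for y in coords], spacing

nodes, spacing = uniform_grid(-50.0, 50.0, per_side=8)
times = [0.5 * (k + 1) for k in range(60)]  # 0.5 s, 1.0 s, ..., 30 s

print(len(nodes), round(spacing, 1), len(times))
```

Under this layout no node sits exactly at the origin, so the source-to-sensor distance stays strictly positive for a source at $[0, 0]^T$.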

$N, T \to \infty$, the complementary error function in (12) tends to be equal to 1, causing the linearization to have almost no effect on the approximation.

### 6.3 Moving diffusive source tracking

In this subsection, we analyze the performance of our proposed moving diffusive source tracking scheme. We use the same sensor network setup as described in Section 6.2. The initial source state vector is assumed to be Gaussian with mean **μ** = [0,0,0,0]^{T} and covariance matrix Σ_{0} = diag([0.01,0.01,0.01,0.01]^{T}). The intensity of the state process noise is ${\sigma}_{u}^{2}=0.1$. The sampling time is assumed to be *T*_{s} = 0.5 s, and the total number of random realizations used for simulations is 50. The tracking is performed for 30 s, and the number of particles in the PF is *N*_{p} = 1,000. The rest of the parameters are the same as in Section 6.2. The performance measure is taken as the root mean squared error (RMSE) of the moving source position estimate given by ${\text{RMSE}}_{k}=\sqrt{{\left({x}_{s,k}-{\widehat{x}}_{s,k}\right)}^{2}+{\left({y}_{s,k}-{\widehat{y}}_{s,k}\right)}^{2}}$. The RMSE is compared with the square root of the PCRLB components of the position error, ${\text{PCRLB}}_{k}\approx \sqrt{{\left[{I}_{k}^{-1}\right]}_{11}+{\left[{I}_{k}^{-1}\right]}_{22}}$.
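
The two performance measures above can be evaluated as in the following sketch. The function names `position_rmse` and `position_pcrlb` are hypothetical, and the input values are made up purely to exercise the formulas. The sketch assumes that the information matrices *I*_{k} are already available and that the two position components are the first two entries of the state, as the indices 11 and 22 in the expression suggest. Averaging over Monte Carlo realizations is left to the caller.

```python
import numpy as np

def position_rmse(true_xy, est_xy):
    """Per-time-step position error, following the RMSE_k expression in the text.

    true_xy, est_xy: arrays of shape (K, 2) holding (x_{s,k}, y_{s,k}).
    """
    diff = np.asarray(true_xy, dtype=float) - np.asarray(est_xy, dtype=float)
    return np.sqrt(np.sum(diff**2, axis=1))

def position_pcrlb(info_matrices):
    """sqrt([I_k^{-1}]_11 + [I_k^{-1}]_22) for a stack of information matrices.

    info_matrices: array of shape (K, n, n) with n >= 2.
    ASSUMPTION: the position components are the first two state entries.
    """
    inv = np.linalg.inv(np.asarray(info_matrices, dtype=float))
    return np.sqrt(inv[:, 0, 0] + inv[:, 1, 1])

# Made-up inputs, used only to exercise the two formulas.
true_xy = np.array([[0.0, 0.0], [1.0, 1.0]])
est_xy = np.array([[3.0, 4.0], [1.0, 1.0]])
rmse = position_rmse(true_xy, est_xy)

info = np.stack([np.diag([4.0, 4.0, 1.0, 1.0])] * 2)
bound = position_pcrlb(info)
print(rmse, bound)
```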

*T*_{s} is varying. Figure 7 shows the effect of the sampling time *T*_{s} on the tracking performance of the proposed moving diffusive source tracking scheme using the grid-based node deployment strategy, keeping all the other parameters the same as mentioned before. As one would expect, the tracking performance degrades as the sampling time *T*_{s} increases. This is because, for higher values of *T*_{s}, the process noise increases according to (23). Since we are also assuming that the movement of the diffusive source is almost linear between two successive time instants, a lower *T*_{s} results in better accuracy of the proposed tracking scheme.
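
The dependence of the process noise on *T*_{s} can be illustrated with a short sketch. It does not reproduce (23). It assumes a textbook nearly-constant-velocity model with state ordering [*x*, *y*, *v*_{x}, *v*_{y}] driven by white acceleration noise of intensity ${\sigma}_{u}^{2}$, and the function name `cv_process_noise` is hypothetical. Under this assumed model, every diagonal entry of the process noise covariance grows with *T*_{s}, which matches the qualitative argument above.

```python
import numpy as np

def cv_process_noise(ts, sigma_u2=0.1):
    """Process noise covariance of a 2D nearly-constant-velocity model.

    ASSUMPTION: state [x, y, vx, vy] driven by white acceleration noise of
    intensity sigma_u2. This textbook model is used for illustration only
    and may differ from the paper's equation (23).
    """
    q = np.array([[ts**3 / 3.0, ts**2 / 2.0],
                  [ts**2 / 2.0, ts]])
    # The Kronecker product applies the same 2x2 block to the x and y axes
    # for the [x, y, vx, vy] ordering.
    return sigma_u2 * np.kron(q, np.eye(2))

sampling_times = (0.25, 0.5, 1.0)
traces = [np.trace(cv_process_noise(ts)) for ts in sampling_times]
for ts, tr in zip(sampling_times, traces):
    print(ts, tr)
```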

## 7 Conclusion

In this paper, we obtained the spatio-temporal distribution of the substance concentration by solving a physical diffusion model for an underwater oil spill scenario that considers laminar water velocity as an external force. The obtained mathematical model was found to model the underlying physical diffusion phenomenon satisfactorily. We proposed two parametric estimation methods based on MLE and BLUE for determining the static diffusive source location using a wireless sensor network. We also obtained the CRLB as a theoretical performance bound for source localization. It was observed that although the MLE performs better than the BLUE-based diffusive source localization method, the latter shows a satisfactory performance trend for a large number of sensing nodes and time samples. We also proposed a particle filter-based target tracking method for a moving diffusive source that continuously emits substance into the dispersive medium. The PCRLB corresponding to the moving diffusive source tracking was obtained as a theoretical performance measure and was compared with the simulation results. The effect of sampling time on the moving source tracking was also investigated. The performance of the proposed estimation and tracking methods is shown to be excellent through numerical simulations. In future research, we plan to combine our analytical results with non-model-based numerical techniques to make them applicable to more realistic and complex scenarios.

## Appendices

### Appendix 1

#### Derivation of spatio-temporal concentration in (9)

*c*_{G}(**r**,*t*) in (8) can be written as ${c}_{\mathrm{G}}\left(\mathbf{r},t\right)={c}_{1}^{\prime}(\mathbf{r},t)+{c}_{2}^{\prime}(\mathbf{r},t)+{c}_{3}^{\prime}(\mathbf{r},t)+{c}_{4}^{\prime}(\mathbf{r},t)$, where

*c*_{1}(**r**,*t*) in (9) as

*γ* = *t* - *t*_{I} and assuming *F*(**r**,*t*) = *c*_{1}(**r**,*t*) in (9), we have

*c*_{1}(**r**,*t*) in (9) is valid. The expression for *c*_{3}(**r**,*t*) can be obtained as follows:

Similarly, we can also verify the expressions for *c*_{2}(**r**,*t*) and *c*_{4}(**r**,*t*). Therefore, the spatio-temporal concentration distribution *c*(**r**,*t*) given in (9) is valid. ■

### Appendix 2

#### Proof of Theorem 1

*x* coordinate θ_{0}(1) = *x*_{0} and it can be easily followed to prove the consistency for the *y* and *z* coordinates without any loss of generality. Based on the technique in [30], we have to prove that

^{′}. Since both *c*_{j,k}(θ) and $\frac{\partial {c}_{j,k}(\mathit{\theta})}{\partial {x}_{0}}$ are continuous functions of *x*_{0}, by using Cauchy-Schwarz inequality, we can obtain

*t*_{k} < *t*_{I}, therefore ${\sum}_{k=1}^{T}\frac{1}{\sqrt{{t}_{k}-{t}_{\mathrm{I}}}}\le \mathit{\text{TS}}$ with *S* being some positive real value. For practical consideration, assuming $0\le \frac{|{x}_{j}-{x}_{0}^{\prime}|}{|{\mathbf{r}}_{j}-{\mathbf{r}}_{0}^{\prime}{|}^{2}}\le P$, $0<\frac{1}{|{\mathbf{r}}_{j}-{\mathbf{r}}_{0}|}\le {M}_{1}$, and $0<\frac{1}{|{\mathbf{r}}_{j}-{\mathbf{r}}_{0}^{\prime}|}\le {M}_{2}$, ${K}_{N,T}\left({x}_{0};{x}_{0}^{\prime}\right)$ can be written as

If *d*_{N,T} = *N*^{3}*T*^{3} > 0 for *N* ≥ 1, *T* ≥ 1, then we can claim that ${lim}_{N,T\to \infty}\frac{1}{{d}_{N,T}}{\sum}_{j=1}^{N}{\sum}_{k=1}^{T}{K}_{N,T}({x}_{0};{x}_{0}^{\prime})$ exists.

*Q*_{1} and *Q*_{2} being some positive real numbers, we obtain the following from (15):

Choosing *d*_{N,T} = *NT* > 0 for *N* ≥ 1, *T* ≥ 1, we have ${lim}_{N,T\to \infty}\frac{1}{{d}_{N,T}^{2}}{\sum}_{j=1}^{N}{\sum}_{k=1}^{T}{\left[\frac{\partial {c}_{j,k}(\mathit{\theta})}{\partial {x}_{0}}\right]}^{2}=0$. Similarly, for *y*_{0} and *z*_{0}, we can also claim that the MLE to the diffusive source localization problem is consistent when the number of sensor nodes and time samples go to infinity. ■

### Appendix 3

#### Proof of Theorem 2

To prove the asymptotic normality of the MLE, we define Φ_{j,k}(*y*_{j,k}|θ) = log *p*(*y*_{j,k}|θ), ${\dot{\mathbf{\Phi}}}_{j,k,u}\left({y}_{j,k}|\mathit{\theta}\right)=\frac{\partial}{\partial {\theta}_{u}}\left\{{\mathrm{\Phi}}_{j,k}\left({y}_{j,k}|\mathit{\theta}\right)\right\}$, and ${\ddot{\mathbf{\Phi}}}_{j,k,u,v}\left({y}_{j,k}|\mathit{\theta}\right)=\frac{{\partial}^{2}}{\partial {\theta}_{u}\partial {\theta}_{v}}\left\{{\mathrm{\Phi}}_{j,k}\left({y}_{j,k}|\mathit{\theta}\right)\right\}$. Below, we verify the necessary conditions mentioned in [42] for our obtained MLE to be asymptotically normal.

From a practical point of view, there is no loss of generality in assuming that ${\mathit{\theta}}_{0}\in \stackrel{\circ}{\Lambda}$, where $\stackrel{\circ}{\Lambda}\subset \mathrm{\Lambda}$ is an open subset of Λ. Also, because the obtained MLE for source localization is consistent, it remains consistent when ${\mathit{\theta}}_{0}\in \stackrel{\circ}{\Lambda}\subset \mathrm{\Lambda}$. Thus, conditions *N*1 and *N*2 are satisfied.

From the notations defined above, since $\frac{\partial {c}_{j,k}(\mathit{\theta})}{\partial {\theta}_{u}}$ and $\frac{{\partial}^{2}{c}_{j,k}(\mathit{\theta})}{\partial {\theta}_{u}\partial {\theta}_{v}}$ exist for *u*,*v* = 1,2,3, it can be easily verified that both ${\dot{\mathbf{\Phi}}}_{j,k,u}\left({y}_{j,k}|\mathit{\theta}\right)$ and ${\ddot{\mathbf{\Phi}}}_{j,k,u,v}\left({y}_{j,k}|\mathit{\theta}\right)$ exist almost surely. Therefore, *N*3 is satisfied.

Since θ ∈ Λ and ${\ddot{\mathbf{\Phi}}}_{j,k}\left({y}_{j,k}|\mathit{\theta}\right)$ is a continuous mapping of θ, we can claim that ${\ddot{\mathbf{\Phi}}}_{j,k}\left({y}_{j,k}|\mathit{\theta}\right)$ is indeed uniformly continuous on θ in *j* and *k* [41]. Also, because ${\ddot{\mathbf{\Phi}}}_{j,k}\left({y}_{j,k}|\mathit{\theta}\right):{y}_{j,k}\to \mathbb{R}$ is a continuous function of *y*_{j,k}, with *y*_{j,k} being a Lebesgue measurable function, ${\ddot{\mathbf{\Phi}}}_{j,k}\left({y}_{j,k}|\mathit{\theta}\right)$ is also a measurable function of *y*_{j,k} and condition *N*4 is satisfied. To satisfy *N*5, it is easy to verify that $\mathbb{E}\left[{\dot{\mathbf{\Phi}}}_{j,k,u}\left({y}_{j,k}|\mathit{\theta}\right)\right]=0$ for all *j*, *k*, and *u*. Since $p({y}_{j,k}|\mathit{\theta})\sim \mathcal{N}\left({c}_{j,k}(\mathit{\theta})+b,{\sigma}^{2}\right)$ and *p*(*y*_{j,k}|θ) is continuous and Lebesgue measurable in *y*_{j,k}, $\frac{{\partial}^{2}}{\partial {\theta}_{u}\partial {\theta}_{v}}\int p({y}_{j,k}|\mathit{\theta})d{y}_{j,k}=\int \frac{{\partial}^{2}}{\partial {\theta}_{u}\partial {\theta}_{v}}p({y}_{j,k}|\mathit{\theta})d{y}_{j,k}$ is valid for all *j*, *k*, *u*, and *v*, and thus *N*6 is satisfied. From Appendix 2, it can be claimed that ${\sum}_{j=1}^{N}{\sum}_{k=1}^{T}{\left\{\frac{\partial {c}_{j,k}(\mathit{\theta})}{\partial {\theta}_{u}}\right\}}^{2}$ and ${\sum}_{j=1}^{N}{\sum}_{k=1}^{T}\left\{\frac{\partial {c}_{j,k}(\mathit{\theta})}{\partial {\theta}_{u}}\right\}\phantom{\rule{.3em}{0ex}}\left\{\frac{\partial {c}_{j,k}(\mathit{\theta})}{\partial {\theta}_{v}}\right\}$ exist and are bounded for all *u*,*v*. Hence, using the Cauchy-Schwarz inequality, all the leading principal minors of ${\bar{\mathbf{I}}}_{\mathit{\theta}}$ (in *Theorem 2*) can be shown to be positive. Thus, we can claim that ${\bar{\mathbf{I}}}_{\mathit{\theta}}$ is also positive-definite and therefore *N*7 is satisfied. Because $\mathbb{E}\left\{{\left|{e}_{j,k}\right|}^{3}\right\}=2{\sigma}^{3}\sqrt{\frac{2}{\pi}}$ for zero-mean Gaussian noise, we have $\mathbb{E}\left[{\left|{\dot{\mathbf{\Phi}}}_{j,k,u}\left({y}_{j,k}|\mathit{\theta}\right)\right|}^{3}\right]\le \frac{\mathbb{E}\left[|{y}_{j,k}-{c}_{j,k}(\mathit{\theta})-b{|}^{3}\right]}{{\sigma}^{6}}{\left|\frac{\partial {c}_{j,k}(\mathit{\theta})}{\partial {\theta}_{u}}\right|}^{3}\le \frac{2}{{\sigma}^{3}}\sqrt{\frac{2}{\pi}}{\left|\frac{\partial {c}_{j,k}(\mathit{\theta})}{\partial {\theta}_{u}}\right|}^{3}={K}_{1}$, ∀*j*,*k*,*u*, where *K*_{1} is some real positive finite number, and *N*8 is satisfied.

To prove condition *N*9, since ${\ddot{\mathbf{\Phi}}}_{j,k,u,v}\left({y}_{j,k}|\mathit{\theta}\right)$ is a uniformly continuous function of θ (shown in condition *N*4), for any *ε* > 0, there exists one *δ* > 0 such that $\left|{\ddot{\mathbf{\Phi}}}_{j,k,u,v}\left({y}_{j,k}|\mathit{\theta}\right)-{\ddot{\mathbf{\Phi}}}_{j,k,u,v}\left({y}_{j,k}|{\mathit{\theta}}_{0}\right)\right|<\delta ,\forall \left|\left|\mathit{\theta}-{\mathit{\theta}}_{0}\right|\right|<\epsilon $. Therefore, for all ||θ - θ_{0}|| ≤ *ε*, since $\frac{\partial {c}_{j,k}({\mathit{\theta}}_{0})}{\partial {\theta}_{u}}$ and $\frac{{\partial}^{2}{c}_{j,k}({\mathit{\theta}}_{0})}{\partial {\theta}_{u}\partial {\theta}_{v}}$ are continuous functions of ${\mathit{\theta}}_{0}\in \stackrel{\circ}{\Lambda}\subset \mathrm{\Lambda},\forall j,k,u$ and *v*, we have $\text{sup}\left\{\left|{\ddot{\mathbf{\Phi}}}_{j,k,u,v}\left({y}_{j,k}|\mathit{\theta}\right)\right|\right\}\le \delta +\left[\left|{e}_{j,k}\right|{K}_{j,k,u,v}^{\text{sup}}+{Q}_{j,k,u,v}^{\text{sup}}\right]={B}_{j,k,u,v}({e}_{j,k})$, where ${K}_{j,k,u,v}^{\text{sup}}$ and ${Q}_{j,k,u,v}^{\text{sup}}$ are some finite real numbers and *B*_{j,k,u,v}(*e*_{j,k}) is a random variable. Since $\mathbb{E}\left\{\left|{e}_{j,k}\right|\right\}=\sigma \sqrt{\frac{2}{\pi}}$ and $\mathbb{E}\left\{{\left|{e}_{j,k}\right|}^{2}\right\}={\sigma}^{2}$, we have $\mathbb{E}\left[{\left|{B}_{j,k,u,v}({e}_{j,k})\right|}^{2}\right]\le {K}_{2}$, where *K*_{2} is a finite real number.
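
The absolute moments of the Gaussian noise used in conditions *N*8 and *N*9 can be checked numerically with the sketch below. It relies on the standard closed form $\mathbb{E}\{|e|^{p}\}={\sigma}^{p}{2}^{p/2}\Gamma \left(\frac{p+1}{2}\right)/\sqrt{\pi}$ for a zero-mean Gaussian variable. The function name `gaussian_abs_moment` is hypothetical, and the value of σ is taken from the noise variance of Section 6.2 purely as an example.

```python
import math

def gaussian_abs_moment(p, sigma):
    """E|e|^p for e ~ N(0, sigma^2), using the closed form
    sigma^p * 2^(p/2) * Gamma((p+1)/2) / sqrt(pi)."""
    return sigma**p * 2.0**(p / 2.0) * math.gamma((p + 1) / 2.0) / math.sqrt(math.pi)

sigma = 0.01  # example value: square root of the variance 1e-4 from Section 6.2

m1 = gaussian_abs_moment(1, sigma)  # equals sigma * sqrt(2/pi)
m2 = gaussian_abs_moment(2, sigma)  # equals sigma^2
m3 = gaussian_abs_moment(3, sigma)  # equals 2 * sigma^3 * sqrt(2/pi)
print(m1, m2, m3)
```

All three moments are finite for any finite σ, which is what the boundedness arguments for *K*_{1} and *K*_{2} require.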

Therefore, the obtained MLE of the diffusive source location is asymptotically normal when the number of sensor nodes and time samples go to infinity. ■

### Appendix 4

#### Proof of Theorem 3

**S**_{1} as ${\mathbf{S}}_{1}={[{\mathbf{s}}_{0}^{T},{\mathbf{s}}_{1}^{T}]}^{T}$, *I*(**S**_{1}) can be written as

**s**_{1} is given by

**S**_{2} as ${\mathbf{S}}_{2}={[{\mathbf{s}}_{0}^{T},{\mathbf{s}}_{1}^{T},{\mathbf{s}}_{2}^{T}]}^{T}$, the FIM *I*(**S**_{2}) can be written as follows: