3.1 Temporal smoothing
The authors of [4] found that at most two arbitrary electromagnetic sources can be uniquely identified by a single vector sensor. That is, the data matrix in (11) is rank deficient whenever the number of incoming signals exceeds two. In this subsection, we apply the temporal smoothing technique [26] to address this rank deficiency problem and show that, under certain conditions, temporal smoothing restores the rank of the data matrix.
Define a 6×N data matrix \(\mathbf{Z} = [\mathbf{z}(\Delta_{T}), \cdots, \mathbf{z}(N\Delta_{T})]\), where \(\mathbf{z}(\Delta_{T}), \mathbf{z}(2\Delta_{T}), \cdots, \mathbf{z}(N\Delta_{T})\) are the N snapshots sampled at time instants \(\Delta_{T}, 2\Delta_{T}, \cdots, N\Delta_{T}\), respectively. For simplicity of analysis, we neglect the noise terms. Then, we define P temporally shifted data subsets of Z, each containing N−P+1 data samples. The first and the pth temporally shifted data subsets can be expressed as
$$\begin{array}{@{}rcl@{}} \mathbf{Z}_{1} & = & [\mathbf{z}(\Delta_{T}), \cdots, \mathbf{z}((N-P+1)\Delta_{T})] \\ & = & \mathbf{M}\mathbf{A} \underbrace{\left[ \begin{array}{cccc} \beta_{1,1}e^{j (\omega_{1,1} \Delta_{T} + \psi_{1,1})} & \beta_{1,1}e^{j (\omega_{1,1} 2 \Delta_{T} + \psi_{1,1})} & \cdots & \beta_{1,1}e^{j (\omega_{1,1} (N-P+1) \Delta_{T} + \psi_{1,1})} \\ \beta_{1,2}e^{j (\omega_{1,2} \Delta_{T} + \psi_{1,2})} & \beta_{1,2}e^{j (\omega_{1,2} 2 \Delta_{T} + \psi_{1,2})} & \cdots & \beta_{1,2}e^{j (\omega_{1,2} (N-P+1) \Delta_{T} + \psi_{1,2})} \\ \vdots & \vdots & \ddots & \vdots \\ \beta_{K,1}e^{j (\omega_{K,1} \Delta_{T} + \psi_{K,1})} & \beta_{K,1}e^{j (\omega_{K,1} 2 \Delta_{T} + \psi_{K,1})} & \cdots & \beta_{K,1}e^{j (\omega_{K,1} (N-P+1) \Delta_{T} + \psi_{K,1})} \\ \beta_{K,2}e^{j (\omega_{K,2} \Delta_{T} + \psi_{K,2})} & \beta_{K,2}e^{j (\omega_{K,2} 2 \Delta_{T} + \psi_{K,2})} & \cdots & \beta_{K,2}e^{j (\omega_{K,2} (N-P+1) \Delta_{T} + \psi_{K,2})} \\ \end{array} \right]}_{\mathbf{S}^{T}} \end{array} $$
(13)
$$\begin{array}{@{}rcl@{}} \mathbf{Z}_{p} & = & [\mathbf{z}(p\Delta_{T}), \cdots, \mathbf{z}((N-P+p)\Delta_{T})] \\ & = & \mathbf{M}\mathbf{A} \left[ \begin{array}{cccc} \beta_{1,1}e^{j (\omega_{1,1} p\Delta_{T} + \psi_{1,1})} & \beta_{1,1}e^{j (\omega_{1,1} (p+1) \Delta_{T} + \psi_{1,1})} & \cdots & \beta_{1,1}e^{j (\omega_{1,1} (N-P+p) \Delta_{T} + \psi_{1,1})} \\ \beta_{1,2}e^{j (\omega_{1,2} p\Delta_{T} + \psi_{1,2})} & \beta_{1,2}e^{j (\omega_{1,2} (p+1) \Delta_{T} + \psi_{1,2})} & \cdots & \beta_{1,2}e^{j (\omega_{1,2} (N-P+p) \Delta_{T} + \psi_{1,2})} \\ \vdots & \vdots & \ddots & \vdots \\ \beta_{K,1}e^{j (\omega_{K,1} p\Delta_{T} + \psi_{K,1})} & \beta_{K,1}e^{j (\omega_{K,1} (p+1) \Delta_{T} + \psi_{K,1})} & \cdots & \beta_{K,1}e^{j (\omega_{K,1} (N-P+p) \Delta_{T} + \psi_{K,1})} \\ \beta_{K,2}e^{j (\omega_{K,2} p\Delta_{T} + \psi_{K,2})} & \beta_{K,2}e^{j (\omega_{K,2} (p+1) \Delta_{T} + \psi_{K,2})} & \cdots & \beta_{K,2}e^{j (\omega_{K,2} (N-P+p) \Delta_{T} + \psi_{K,2})} \\ \end{array} \right] \\ & = & \mathbf{M}\mathbf{A} \underbrace{\left[ \begin{array}{ccccc} e^{j \omega_{1,1} (p - 1)\Delta_{T}} & & & & \\ & e^{j \omega_{1,2} (p - 1)\Delta_{T}} & & & \\ & & \ddots & & \\ & & & e^{j \omega_{K,1} (p - 1)\Delta_{T}} & \\ & & & & e^{j \omega_{K,2} (p - 1)\Delta_{T}} \\ \end{array} \right]}_{\boldsymbol{\Phi}_{p}} \mathbf{S}^{T} \end{array} $$
(14)
where
$$ \begin{aligned} {}\boldsymbol{\Phi}_{p} = \text{diag}&\left[e^{j\omega_{1,1} (p-1)\Delta_{T}}, e^{j\omega_{1,2} (p-1)\Delta_{T}}, \cdots,\right.\\ &\quad\left. e^{j\omega_{K,1} (p-1)\Delta_{T}}, e^{j\omega_{K,2} (p-1)\Delta_{T}} \right] \end{aligned} $$
(15)
is a diagonal matrix that depends only on the temporal shift (p − 1)Δ_T and the frequencies of the signals, and
$$\begin{array}{@{}rcl@{}} \mathbf{S} = [\!\mathbf{s}(\Delta_{T}), \cdots, \mathbf{s}((N - P + 1)\Delta_{T})]^{T} \end{array} $$
(16)
is an (N−P+1)×2K signal matrix. Then, for p = 1,⋯,P, we have P different data sets \(\{\mathbf{Z}_{1}, \cdots, \mathbf{Z}_{P}\}\). Note that these P data sets differ from one another because the matrices \(\boldsymbol{\Phi}_{p}\) differ from one set to another. Next, the 6P×(N−P+1) temporally smoothed data matrix is defined by stacking \(\mathbf{Z}_{p}\) for p = 1,⋯,P as
$$\begin{array}{@{}rcl@{}} \mathbf{Z}_{\text{TS}} = \left[\mathbf{Z}_{1}^{T}, \cdots, \mathbf{Z}_{P}^{T}\right]^{T} \end{array} $$
(17)
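For concreteness, the following is a minimal NumPy sketch (not part of the original derivation) of how the temporally smoothed matrix in (17) can be assembled from a 6×N snapshot matrix Z by stacking the P shifted subsets defined in (13) and (14); the function name and arguments are illustrative.

```python
# Minimal sketch; assumes Z is a 6 x N complex snapshot matrix and P is the number of shifts.
import numpy as np

def temporal_smoothing(Z: np.ndarray, P: int) -> np.ndarray:
    """Stack the P temporally shifted subsets of Z into the 6P x (N-P+1) matrix Z_TS of (17)."""
    N = Z.shape[1]
    L = N - P + 1                              # columns per shifted subset
    # Z_p (p = 1, ..., P) collects the snapshots at p*dT, ..., (N-P+p)*dT
    blocks = [Z[:, p:p + L] for p in range(P)]
    return np.vstack(blocks)                   # Z_TS, cf. (17)
```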
Theorem 1
If P≥2K and N≥4K−1, then the temporally smoothed data matrix ZTS is of full rank 2K.
Proof
The matrix ZTS can be expressed in a column-wise Kronecker matrix product form as
$$\begin{array}{@{}rcl@{}} \mathbf{Z}_{\text{TS}}~=~(\boldsymbol{\Psi} \odot \mathbf{MA}) \mathbf{S}^{T} \end{array} $$
(18)
where
$$ {{\boldsymbol{\Psi} = \left[ \begin{array}{ccccc} 1 & 1 & \cdots & 1 & 1 \\ e^{j\omega_{1,1} \Delta_{T}} & e^{j\omega_{1,2} \Delta_{T}} & \cdots & e^{j\omega_{K,1} \Delta_{T}} & e^{j\omega_{K,2} \Delta_{T}} \\ \vdots & \vdots & \cdots & \vdots & \vdots \\ e^{j\omega_{1,1} (P-1) \Delta_{T}} & e^{j\omega_{1,2} (P-1) \Delta_{T}} & \cdots & e^{j\omega_{K,1} (P-1) \Delta_{T}} & e^{j\omega_{K,2} (P-1) \Delta_{T}} \\ \end{array} \right]}} $$
(19)
Since all the signals are assumed to be IP and have distinct frequencies, the Vandermonde matrix S is of full column rank 2K if and only if (N − P + 1) ≥ 2K. Next, by results in [27], we have
$$\begin{array}{@{}rcl@{}} \text{rank}(\boldsymbol{\Psi} \odot \mathbf{MA})~\leq~\min\{2K, \text{rank}(\boldsymbol{\Psi})\cdot\text{rank}(\mathbf{MA})\} \end{array} $$
and a sufficient condition for equality is to have Ψ and/or MA tall and full rank. Then, if P ≥ 2K, the Vandermonde matrix Ψ is tall and is of rank 2K. In this case,
$$\begin{array}{@{}rcl@{}} \text{rank}(\boldsymbol{\Psi} \odot \mathbf{MA})~=~\min\{2K, 2K\cdot\mbox{rank}(\mathbf{MA})\}~=~2K \end{array} $$
Finally, combining P ≥ 2K with (N − P + 1) ≥ 2K gives N ≥ 4K − 1. Hence, if P ≥ 2K and N ≥ 4K − 1, ZTS is of full rank 2K, since (Ψ⊙MA) is of full column rank and ST is of full row rank. This concludes the proof. □
Theorem 1 establishes sufficient but not necessary conditions for constructing temporally smoothed matrices to resolve K IP monochromatic signals with a single vector sensor. Specifically, on the basis of Theorem 1, an arbitrarily large number of uncorrelated signals with distinct frequencies may potentially be resolved as N approaches infinity.
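As a purely numerical illustration of Theorem 1 (under assumed parameters, not taken from the paper), the snippet below draws a random 6×2K matrix in place of MA, generates 2K complex exponentials with distinct frequencies, and compares the rank of the plain data matrix with that of the temporally smoothed one for K = 4.

```python
# Numerical illustration of Theorem 1 with assumed parameters (K, P, N, dT).
import numpy as np

rng = np.random.default_rng(0)
K, P, N, dT = 4, 8, 64, 1e-3                          # P >= 2K and N >= 4K - 1
omega = 2 * np.pi * rng.uniform(50.0, 450.0, 2 * K)   # distinct frequencies (rad/s)
t = dT * np.arange(1, N + 1)
S = np.exp(1j * np.outer(t, omega))                   # N x 2K signal matrix, cf. (16)
Q = rng.standard_normal((6, 2 * K)) + 1j * rng.standard_normal((6, 2 * K))  # stands in for MA
Z = Q @ S.T                                           # 6 x N, rank at most 6 < 2K
Z_TS = np.vstack([Z[:, p:p + N - P + 1] for p in range(P)])
print(np.linalg.matrix_rank(Z), np.linalg.matrix_rank(Z_TS))  # expected: 6 and 2K = 8
```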
3.2 Angle and mutual coupling matrix estimation
In this subsection, we propose an ESPRIT-based algorithm to estimate the angles and the coupling matrix from the data matrix ZTS. For analytical purposes, we consider the ideal noiseless case. Let \(\mathbf{E}_{s}\) be the 6P × 2K signal-subspace eigenvector matrix, whose columns are the 6P × 1 signal-subspace eigenvectors associated with the 2K largest eigenvalues of \(\mathbf {Z}_{\text {TS}}\mathbf {Z}_{\text {TS}}^{H}\). Using the basic idea of ESPRIT [28], we have
$$\begin{array}{@{}rcl@{}} \mathbf{E}_{s}~=~(\boldsymbol{\Psi} \odot \mathbf{M}\mathbf{A}) \mathbf{T} = \mathbf{B} \mathbf{T} \end{array} $$
(20)
where B = Ψ⊙MA, and T is a unique 2K × 2K non-singular matrix. Next, define the following two selection matrices
$$\begin{array}{@{}rcl@{}}{} \mathbf{J}_{1}=\ [\!\mathbf{I}_{6P~-~6}, \mathbf{0}_{(6P~-~6) \times 6}], \mathbf{J}_{2} = [\!\mathbf{0}_{(6P~-~6)~\times~6}, \mathbf{I}_{6P~-~6}] \end{array} $$
(21)
and let B1 = J1B and B2 = J2B. The shift invariance structure in B indicates that
$$\begin{array}{@{}rcl@{}} \mathbf{B}_{2}~=~\mathbf{B}_{1} \boldsymbol{\Phi} \end{array} $$
(22)
where
$$\begin{array}{@{}rcl@{}} \boldsymbol{\Phi}~=~\text{diag}\left[\!e^{j\omega_{1,1}\Delta_{T}}, e^{j\omega_{1,2} \Delta_{T}}, \cdots, e^{j\omega_{K,1}\Delta_{T}}, e^{j\omega_{K,2} \Delta_{T}}\right] \end{array} $$
(23)
From (20) and (22), we obtain
$$\begin{array}{@{}rcl@{}} \mathbf{T}^{-1} \boldsymbol{\Phi} \mathbf{T}~=~\mathbf{E}_{1}^{\dag} \mathbf{E}_{2} \end{array} $$
(24)
where \(\mathbf{E}_{1} = \mathbf{J}_{1}\mathbf{E}_{s}\) and \(\mathbf{E}_{2} = \mathbf{J}_{2}\mathbf{E}_{s}\). Consequently, the ESPRIT eigenvalues, i.e., the eigenvalues of \(\mathbf {E}_{1}^{\dag } \mathbf {E}_{2}\), equal the diagonal elements of Φ, and the corresponding right eigenvectors constitute the columns of \(\mathbf{T}^{-1}\). Thus, the matrix B1 may be estimated as
$$\begin{array}{@{}rcl@{}} \hat{\mathbf{B}}_{1} = \frac{1}{2}\left\{\mathbf{E}_{1} \mathbf{T}^{-1} + \mathbf{E}_{2} \mathbf{T}^{-1} \boldsymbol{\Phi}^{-1}\right\} \end{array} $$
(25)
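A hedged NumPy sketch of steps (20)–(25) follows; it is an illustrative implementation rather than the authors' code. The signal subspace is taken from the sample matrix Z_TS Z_TS^H, and the eigen-pairs of E_1^† E_2 yield the diagonal of Φ and T^{-1}, both only up to the column scaling and ordering ambiguities discussed later.

```python
# Sketch of (20)-(25); assumes Z_TS is the 6P x (N-P+1) smoothed data matrix and K is known.
import numpy as np

def esprit_step(Z_TS: np.ndarray, K: int):
    """Return (phi, Tinv, B1_hat): ESPRIT eigenvalues, eigenvector matrix T^{-1}, and B1 estimate."""
    w, V = np.linalg.eigh(Z_TS @ Z_TS.conj().T)
    Es = V[:, np.argsort(w)[::-1][:2 * K]]               # signal subspace E_s (6P x 2K), cf. (20)
    E1, E2 = Es[:-6, :], Es[6:, :]                       # rows kept by J_1 and J_2 in (21)
    phi, Tinv = np.linalg.eig(np.linalg.pinv(E1) @ E2)   # eigen-pairs of E_1^dagger E_2, cf. (24)
    # phi[k] estimates e^{j omega Delta_T}; columns of Tinv estimate T^{-1} up to scale/order
    B1_hat = 0.5 * (E1 @ Tinv + (E2 @ Tinv) / phi)       # cf. (25); the division applies Phi^{-1}
    return phi, Tinv, B1_hat
```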
Note that the matrix B1 has the form
$$\begin{array}{@{}rcl@{}} \mathbf{B}_{1} & = & \left[\mathbf{Q}_{1}^{T}, \mathbf{Q}_{2}^{T}, \cdots, \mathbf{Q}_{P-1}^{T} \right]^{T} \\ & = & \left[\mathbf{Q}^{T}, (\mathbf{Q}\boldsymbol{\Phi})^{T}, \cdots, \left(\mathbf{Q}\boldsymbol{\Phi}^{(P-2)}\right)^{T} \right]^{T} \end{array} $$
(26)
where Q = MA. Therefore, the matrix Q can be estimated from \(\hat {\mathbf {B}}_{1}\) as
$$\begin{array}{@{}rcl@{}} \hat{\mathbf{Q}}~=~\frac{1}{P-1} \sum_{p=1}^{P-1} \mathbf{Q}_{p} \boldsymbol{\Phi}^{-(p-1)} \end{array} $$
(27)
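In code, (26)–(27) amount to back-rotating each 6×2K block of the estimated B1 and averaging; a sketch under the same assumptions as above (phi denotes the ESPRIT eigenvalues from the previous step) is:

```python
# Sketch of (26)-(27); B1_hat is 6(P-1) x 2K and phi holds the diagonal of Phi.
import numpy as np

def estimate_Q(B1_hat: np.ndarray, phi: np.ndarray) -> np.ndarray:
    """Average the back-rotated 6 x 2K blocks of B1_hat to estimate Q = MA, cf. (27)."""
    n_blocks = B1_hat.shape[0] // 6                        # equals P - 1
    Q_hat = np.zeros((6, phi.size), dtype=complex)
    for p in range(n_blocks):                              # p = 0 corresponds to Q_1
        Q_hat += B1_hat[6 * p:6 * (p + 1), :] * phi ** (-p)   # Q_p Phi^{-(p-1)}
    return Q_hat / n_blocks
```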
It should be pointed out that the estimated \(\hat {\mathbf {Q}}\) suffers from unknown scaling ambiguities in its columns. That is, the columns of the estimated \(\hat {\mathbf {Q}}\) in fact satisfy
$$\begin{array}{@{}rcl@{}} \mathbf{q}_{2k~-~1}~=~\alpha_{1,k} \mathbf{C} \mathbf{v}_{1,k}, \ \ \mathbf{q}_{2k}~=~\alpha_{2,k} \mathbf{C} \mathbf{v}_{2,k} \end{array} $$
(28)
$$\begin{array}{@{}rcl@{}} \bar{\mathbf{q}}_{2k~-~1}~=~\bar \alpha_{1,k} \mathbf{C} \mathbf{v}_{2,k}, \ \ \bar{\mathbf{q}}_{2k}~=~\bar \alpha_{2,k} \mathbf{C} \mathbf{v}_{1,k} \end{array} $$
(29)
where \(\mathbf{q}_{2k-1}\), \(\mathbf{q}_{2k}\) and \(\bar{\mathbf{q}}_{2k-1}\), \(\bar{\mathbf{q}}_{2k}\), k = 1,⋯,K, respectively, denote the top three and bottom three rows of the (2k − 1)th and (2k)th columns of \(\hat {\mathbf {Q}}\), and \(\alpha_{i,j}\) and \(\bar \alpha _{i,j}\), i = 1,2, j = 1,⋯,K, represent the unknown scalars. Note that since \(q_{k} \neq 1\), \(\alpha_{i,j}\) is in general unequal to \(\bar \alpha _{i,j}\).
The scaling ambiguities can be easily eliminated in the proposed method. Using q2k = α2,kCv2,k, we can form the following three equations:
$$\begin{array}{*{20}l} \alpha_{2,k} (-~c_{1}\sin \phi_{k}~+~c_{2} \cos \phi_{k}) &~=~& q_{2k,1} \end{array} $$
(30)
$$\begin{array}{*{20}l} \alpha_{2,k} (-~c_{2}\sin \phi_{k}~+~c_{1} \cos \phi_{k}) &~=~& q_{2k,2} \end{array} $$
(31)
$$\begin{array}{*{20}l} \alpha_{2,k} (-~c_{2}\sin \phi_{k}~+~c_{2} \cos \phi_{k}) &~=~& q_{2k,3} \end{array} $$
(32)
where q2k,1, q2k,2, and q2k,3 are, respectively, the first, second, and third entries of q2k. Solving these three equations yields the azimuth angle and coupling coefficient estimates:
$$\begin{array}{@{}rcl@{}} \hat \phi_{k}~=~\arctan\left(\frac{q_{2k,1}~-~q_{2k,3}}{q_{2k,3}~-~q_{2k,2}}\right) \end{array} $$
(33)
$$ {{\begin{aligned} {}\hat c~=~c_{2}/c_{1}~&=~\frac{1}{2}\left(\frac{q_{2k,1} \cos \hat \phi_{k}~+~q_{2k,2} \sin \hat \phi_{k}}{q_{2k,1} \sin \hat \phi_{k}~+~q_{2k,2} \cos \hat \phi_{k}}\right.\\ &\left.\quad+\frac{q_{2k,3} \cos \hat \phi_{k}} {q_{2k,2} \cos \hat \phi_{k}~+~q_{2k,3} \sin \hat \phi_{k}~-~q_{2k,2} \sin \hat \phi_{k}}\right) \end{aligned}}} $$
(34)
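The closed forms (33) and (34) can be evaluated directly from the first three entries of the (2k)th column of the estimated Q; a hedged sketch follows, where the common complex scale α_{2,k} cancels in each ratio so only the real part is retained (variable names are illustrative).

```python
# Sketch of (33)-(34); q2k holds the top three entries of the (2k)-th column of Q_hat.
import numpy as np

def azimuth_and_coupling(q2k: np.ndarray):
    """Return (phi_hat, c_hat) from q_{2k}, cf. (33) and (34)."""
    q1, q2, q3 = q2k[0], q2k[1], q2k[2]
    phi_hat = np.arctan(np.real((q1 - q3) / (q3 - q2)))            # (33); alpha_{2,k} cancels
    s, c = np.sin(phi_hat), np.cos(phi_hat)
    c_hat = 0.5 * (np.real((q1 * c + q2 * s) / (q1 * s + q2 * c))
                   + np.real(q3 * c / (q2 * c + q3 * s - q2 * s)))  # (34)
    return phi_hat, c_hat
```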
With the estimation of the coupling coefficient \(\hat c\), we can construct an estimate of the mutual coupling matrix \(\hat {\mathbf {C}}\) as
$$\begin{array}{@{}rcl@{}} \hat{\mathbf{C}}~=~\left[ \begin{array}{ccc} 1 & \hat c & \hat c \\ \hat c & 1 & \hat c \\ \hat c & \hat c & 1 \\ \end{array} \right] \end{array} $$
(35)
It is easy to see that the matrix product \(\hat {\mathbf {C}}^{-1} \mathbf {C}\) becomes a scaled identity matrix. This means that the mutual coupling coefficients, which constitute the non-diagonal elements of C, are completely eliminated. With the estimates \(\hat \phi _{k}\) and \(\hat c\), using \(\bar {\mathbf {q}}_{2k}~=~\bar \alpha _{2,k} \mathbf {C} \mathbf {v}_{1,k}\), we can form the following three equations:
$$ \begin{aligned} {}\bar \alpha_{2,k} \left(\cos \hat \phi_{k} \cos \theta~+~\hat c \sin \hat \phi_{k} \cos \theta~-~\hat c \sin \theta\right) = \bar q_{2k,1} \\ \end{aligned} $$
(36)
$$ \begin{aligned} {}\bar \alpha_{2,k} \left(\hat c \cos \hat \phi_{k} \cos \theta~+~\sin \hat \phi_{k} \cos \theta~-~\hat c \sin \theta\right)=\bar q_{2k,2} \\ \end{aligned} $$
(37)
$$ \begin{aligned} {}\bar \alpha_{2,k} \left(\hat c \cos \hat \phi_{k} \cos \theta~+~\hat c \sin \hat \phi_{k} \cos \theta~-~\sin \theta\right)=\bar q_{2k,3} \end{aligned} $$
(38)
Solving these three equations leads to the elevation angle estimates
$$ \begin{aligned} {}\hat \theta_{k}&=\arctan\left(\hat c \left(\cos \hat \phi_{k}~+~\sin \hat \phi_{k}\right) - \frac{\bar q_{2k,3}}{\bar q_{2k,1} - \bar q_{2k,2}} \right.\\ &\quad\times\left.(1~-~\hat c) \left(\cos \hat \phi_{k}~-~\sin \hat \phi_{k}\right)\right) \end{aligned} $$
(39)
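Given the azimuth and coupling estimates, (39) uses the bottom three entries of the (2k)th column of the estimated Q; a corresponding sketch (illustrative names, same assumptions as above) is:

```python
# Sketch of (39); qbar2k holds the bottom three entries of the (2k)-th column of Q_hat.
import numpy as np

def elevation(qbar2k: np.ndarray, phi_hat: float, c_hat: float) -> float:
    """Return theta_hat for the k-th signal, cf. (39)."""
    r1, r2, r3 = qbar2k[0], qbar2k[1], qbar2k[2]
    ratio = np.real(r3 / (r1 - r2))                 # common scale alpha-bar_{2,k} cancels here
    s, c = np.sin(phi_hat), np.cos(phi_hat)
    return np.arctan(c_hat * (c + s) - ratio * (1 - c_hat) * (c - s))
```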
Note that the estimates \(\hat \theta _{k}\) and \(\hat \phi _{k}\) are automatically paired without any additional processing.
In practice, apart from the scaling ambiguities, the estimated \(\hat {\mathbf {Q}}\) may also suffer from permutation ambiguities. In this case, \(\mathbf{q}_{2k}\) may not be the estimate of \(\alpha_{2,k} \mathbf{C} \mathbf{v}_{2,k}\). Thus, the estimates \(\hat \phi _{k}\) and \(\hat c\) obtained from \(\mathbf{q}_{2k}\) by using (33) and (34) may be erroneous, which may in turn lead to an erroneous estimate \(\hat \theta _{k}\). Unlike the scaling ambiguities, the permutation ambiguities cannot be eliminated in the same manner. Here, we provide a solution to this permutation ambiguity problem as follows: first, for all k = 1,⋯,2K, obtain a set of 2K different azimuth angle estimates from \(\mathbf{q}_{k}\). Each of these 2K azimuth angle estimates is then used to produce its own coupling coefficient and elevation angle estimates. Thus, the kth azimuth angle, elevation angle, and coupling coefficient estimates are automatically matched. Only K of these estimates are true estimates. Theoretically, the K true coupling coefficient estimates are identical, while the K erroneous coupling coefficient estimates are, in general, distinct from one another and from the K true estimates. Therefore, we can take homogeneity of the coupling coefficient estimates as a criterion for determining the true estimates of the angles and coupling coefficients, i.e., we take the set of K angle estimates associated with K identical coupling coefficient estimates as the true estimates. Without loss of generality, assume that the first K estimates are true and the last K estimates are erroneous; then, we have \(\hat c_{1} = \cdots = \hat c_{K}~=~\hat c~\neq ~\hat c_{K~+~1} \neq \cdots ~\neq ~\hat c_{2K}\). Finally, we obtain the estimates \((\hat \theta _{k}, \hat \phi _{k}), k~=~1, \cdots, K\) as the angle estimates of the K signals.
3.3 Remarks
In the presence of noise, the estimation procedures in Section 3.2 become approximate. Specifically, with noise, the K true coupling coefficient estimates are in general no longer identical. Nevertheless, we can search for the set of K coupling coefficient estimates with the “most similar values” and treat them as the “identical” estimates.
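One possible implementation of this selection rule (not prescribed by the paper) is to evaluate all 2K candidate triples and keep the K whose coupling coefficient estimates exhibit the smallest spread:

```python
# Possible selection rule (assumption): keep the K candidates whose c estimates are most similar.
from itertools import combinations
import numpy as np

def select_true_estimates(phi, theta, c, K: int):
    """Return the K (phi, theta, c) triples whose coupling estimates have the smallest spread."""
    c = np.asarray(c, dtype=float)
    best = min(combinations(range(c.size), K),
               key=lambda idx: np.ptp(c[list(idx)]))   # smallest (max - min) over the subset
    return [(phi[i], theta[i], c[i]) for i in best]
```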
Also note that the vector cross product estimator has been widely used for direction finding with a single vector sensor [2, 3, 7]. However, this estimator cannot be applied directly in the presence of mutual coupling among the vector sensor components. Obviously, with the estimate \(\hat c\), the vector sensor can be calibrated by using the calibration matrix defined as \(\hat {\mathbf {M}}~=~\mathbf {I}_{2} \otimes \hat {\mathbf {C}}^{-1}\). Therefore, the vector cross product estimator can be applied to the calibrated data matrix \(\hat {\mathbf {M}} \mathbf {Z}\) to extract the angle estimates of the incoming signals. Although the proposed method is designed for vector sensors with mutual coupling, it can also be applied to ideal vector sensors, where the measurement of each component is independent of the others.
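Under the assumptions above, this calibration remark can be sketched as follows: the coupling matrix is rebuilt from the estimated ratio as in (35), and its inverse is applied to each triad of components (an illustrative sketch, not the authors' implementation).

```python
# Sketch of the calibration remark; Z is a 6 x N vector-sensor data matrix, c_hat comes from (34).
import numpy as np

def calibrate(Z: np.ndarray, c_hat: float) -> np.ndarray:
    """Apply M_hat = I_2 kron inv(C_hat) to remove the estimated mutual coupling."""
    C_hat = (1.0 - c_hat) * np.eye(3) + c_hat * np.ones((3, 3))   # cf. (35)
    M_hat = np.kron(np.eye(2), np.linalg.inv(C_hat))
    return M_hat @ Z
```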
The proposed method shares all the advantages indicated in [3]. For example, it offers automatically paired azimuth and elevation angle estimates, does not require \(\Delta_{T}\) to satisfy the Nyquist sampling rate, does not need the signal frequencies to be known a priori, and suffers no frequency-DOA ambiguity. It should be noted that the method in [3] assumes CP signals, whereas the proposed method assumes IP ones.
Lastly, it should be pointed out that the application of the ESPRIT technique to vector-sensor mutual coupling calibration has been studied in [22] and [23]. However, the present work differs from these two in that (1) the former require a coupling-free auxiliary vector sensor and the design of a reference signal, whereas the proposed method does not; (2) the former do not apply the temporal smoothing technique to improve the identifiability limit of a vector sensor; and (3) the former assume completely polarized incoming signals, whereas the proposed method considers incompletely polarized ones.