Skip to main content

Target scattering estimation in clutter with polarization optimization

Abstract

In this paper, we propose an adaptive waveform polarization method for the estimation of target scattering matrix in the presence of clutter. The proposed sequential algorithm, based on the concept of sequential minimum mean square error (MSE) estimation, to determine the coefficients of the scattering matrix, guarantees the convergence and the resulting computational complexity is linear with the number of iterations. The effectiveness of the proposed method is validated through numerical results, underlining the performance improvement given by joint transmission and reception (Tx/Rx) polarization optimization for the scalar system. Also, the results show that the vector system with transmission polarization optimization provides a comparative performance as the scalar measurement system employing joint Tx/Rx polarization optimization. Less computation burden highlights the advantage of the former mode.

1 Introduction

Polarization, together with the amplitude, time, frequency, phase, and bearing descriptor of radar signals, completes the information description of target returns from radars. The exploitation of information on the echo polarization can provide a significant improvement on radar performance [1]. To obtain the target polarimetric scattering information, conventional polarimetric radar systems alternately switch (or simultaneously transmit) the horizontal (H) and vertical (V) polarizations at the transmission side, and simultaneously receive both polarizations at the reception side, consequently resulting in four polarization combinations: HH, HV, VH, and VV polarizations.

Motivated in part by recent advances in the hardware and sensor information processing, such as solid state transmission and digital arbitrary waveform generators, in modern radar systems, any polarization on either transmission or reception can be synthesized by using the linear combination of the H and V components. Thus, besides the four types of transmitter/receiver combinations above, polarimetric radar can achieve any pair of transmitter/receiver polarizations. Such flexibility greatly enhances the polarimetric sensing capability of the radar system. For the mentioned reason, several papers concerning polarization optimization have appeared in the open literature during the last two decades aiming at the performance of polarimetric radars on target estimation [2, 3], detection [48], tracking [9], and identification [4, 10].

This paper will tackle with the problem of adaptively selecting waveform polarization to optimally estimate the target scattering matrix. As to this topic, the technique of optimizing transmission polarization for “vector measurement systems" [10] (with the term “vector measurement system”, the authors refer to the reception side of the radar that receives separately the horizontal and vertical polarization components to form a vector) has been addressed in [2]. The numerical results there show that, by optimally selecting waveform polarization, the performance of the target estimation can be significantly improved compared to the fixed polarization design. In most cases, polarimetric radar systems combine two received signals linearly and coherently at the receiver to give a scalar measurement [1]. Following [6], we call the radar employing such measurement mode as the “scalar measurement system”. As to this kind of system, in [3], the authors devise an optimization algorithm to jointly design Tx/Rx polarizations that minimize the MSE of estimating the target scattering vector. The original problem therein is formulated into a convex form which is solvable employing semi-definite programming (SDP). As a subfield of convex optimization, SDP concerns with the optimization of a linear objective function to be maximized or minimized over the intersection of the cone of positive semi-definite matrices with an affine space, i.e., a spectrahedron [11]. Such optimization problems can be solved via the well-known CVX toolbox for Matlab software [12]. But unfortunately, we find the problem formulation in [3] cannot be effectively solved by exploiting the devised algorithm (see Appendix 1 for details). In fact, the SDP formulation there is a relaxation version of the original problem to be handled.

Hence, in this paper, we reconsider the problem of adaptive polarization design of polarimetric radars to optimally estimate the target scattering vector in clutter. Starting from scalar measurement systems, we design an optimization procedure for the Tx/Rx polarizations to sequentially minimize the MSE of estimating the target scattering vector. The proposed algorithm converges to a certain value, and the computational complexity is linear with the number of iterations. Furthermore, we generalize the algorithm to the vector measurement system to optimally design the transmission polarization. At the analysis stage, we validate the effectiveness of the proposed method through numerical results, highlighting the performance improvement of the joint Tx/Rx polarization optimization for the scalar system. Our results show that the vector system with transmission polarization optimization offers a comparative performance as the scalar system employing joint Tx/Rx polarization optimization.

The remainder of the paper is organized as follows. The measurement model of the scalar measurement system with jointly adaptive Tx/Rx polarization design is introduced in Section 2. In Section 3, the problem formulation is described. The sequential method to optimally select waveform parameter with affordable computation burden is proposed in Section 4. Then, in Section 5, the devised algorithm is exploited to the vector measurement system. Numerical results are presented in Section 6, demonstrating the improvement in performance achieved by using the joint Tx/Rx polarization in comparison with the two other scalar measurement systems, as well as highlighting the advantage of vector measurement systems with respect to scalar ones. Finally, the conclusions are provided in Section 7.

2 Measurement model

In this section, we formulate the parametric measurement model of target scattering vector in additive Gaussian distributed clutter for the scalar measurement system with adaptive Tx/Rx polarization designs.

Let \( \boldsymbol{\xi} ={\left[{\xi}_{\mathrm{h}},{\xi}_v\right]}^{\mathrm{T}} \) be the transmitted polarized electric field impinging on the target and \( \boldsymbol{\eta} ={\left[{\eta}_{\mathrm{h}},{\eta}_{\mathrm{v}}\right]}^{\mathrm{T}} \) be the reception antenna polarization, where the subscripts “h” and “v” denote the horizontal and vertical polarization components of a fully polarized signal, respectively, and the superscript “T” denotes the transpose operation. We consider that the transmission pulse has unitary power, i.e., ξ = 1. As to the reception polarization, we set η = 1. After ignoring the target Doppler shift, the complex envelope of the received signal is given by [3, 5]

$$ y(t)=\frac{g}{r^2}{\boldsymbol{\eta}}^{\mathrm{T}}\left({\boldsymbol{S}}_{\mathrm{t}}+{\boldsymbol{S}}_{\mathrm{c}}\right)\boldsymbol{\xi} s\left(t-\tau \right)+w(t), $$
(1)

where w(t) is the white noise, r is the distance from the target to radar, s(t) is the transmission waveform, τ is the delay resulted from the waveform forward and backward propagation, and g is a constant depending on the radar system characteristics including operating frequency, permittivity, permeability of free space, antenna gain at the target illumination angle, radar reception power, etc. S t and S c are the target and clutter scattering matrices, respectively, with the following matrix representation:

$$ {\boldsymbol{S}}_{\mathrm{t}}=\left[\begin{array}{c}\hfill {s}_{\mathrm{h}\mathrm{h}}^{\mathrm{t}}\ {s}_{\mathrm{h}\mathrm{v}}^{\mathrm{t}}\hfill \\ {}\hfill {s}_{\mathrm{v}\mathrm{h}}^{\mathrm{t}}\ {s}_{\mathrm{v}\mathrm{v}}^{\mathrm{t}}\hfill \end{array}\right],\kern1em {\boldsymbol{S}}_{\mathrm{c}}=\left[\begin{array}{c}\hfill {s}_{\mathrm{h}\mathrm{h}}^{\mathrm{c}}\ {s}_{\mathrm{h}\mathrm{v}}^{\mathrm{c}}\hfill \\ {}\hfill {s}_{\mathrm{v}\mathrm{h}}^{\mathrm{c}}\ {s}_{\mathrm{v}\mathrm{v}}^{\mathrm{c}}\hfill \end{array}\right]. $$
(2)

Here, we remark that the linear assumption has been employed to linearize the electromagnetic scattering problem. From a more general point of view, the electromagnetic scattering problem is actually highly non-linear. But the linearity assumption is usually employed to simplify the problem at hand. In this paper, we make the linearity assumption following [3, 5, 9], to address the similar signal model.

After performing the sampling process and matched filtering [13], (1) could be expressed in the following observation model:

$$ y={\boldsymbol{\eta}}^{\mathrm{T}}\left({\boldsymbol{S}}_{\mathrm{t}}+{\boldsymbol{S}}_{\mathrm{c}}\right)\boldsymbol{\xi} +\nu, $$
(3)

where \( \nu \) is the noise with variance \( {\sigma}_{\nu}^2 \). By vectorizing S t and S c, we could convert (3) into the observation model

$$ y=\boldsymbol{a}{\left(\boldsymbol{\xi}, \boldsymbol{\eta} \right)}^T{\boldsymbol{X}}_{\mathrm{t}}+\boldsymbol{a}{\left(\boldsymbol{\xi}, \boldsymbol{\eta} \right)}^T{\boldsymbol{X}}_{\mathrm{c}}+\nu, $$
(4)

where

$$ \begin{array}{l}{\boldsymbol{X}}_{\mathrm{t}}={\left[{s}_{\mathrm{h}\mathrm{h}}^{\mathrm{t}}\ {s}_{\mathrm{h}\mathrm{v}}^{\mathrm{t}}\ {s}_{\mathrm{v}\mathrm{h}}^{\mathrm{t}}\ {s}_{\mathrm{v}\mathrm{v}}^{\mathrm{t}}\ \right]}^{\mathrm{T}},\\ {}{\boldsymbol{X}}_{\mathrm{c}}={\left[{s}_{\mathrm{h}\mathrm{h}}^{\mathrm{c}}\ {s}_{\mathrm{h}\mathrm{v}}^{\mathrm{c}}\ {s}_{\mathrm{v}\mathrm{h}}^{\mathrm{c}}\ {s}_{\mathrm{v}\mathrm{v}}^{\mathrm{c}}\ \right]}^{\mathrm{T}},\end{array} $$
(5)

and

$$ \boldsymbol{a}\left(\boldsymbol{\xi}, \boldsymbol{\eta} \right)={\left[{\xi}_{\mathrm{h}}{\eta}_{\mathrm{h}}\ {\xi}_{\mathrm{h}}{\eta}_{\mathrm{v}}\ {\xi}_{\mathrm{v}}{\eta}_{\mathrm{h}}\ {\xi}_{\mathrm{v}}{\eta}_{\mathrm{v}}\right]}^{\mathrm{T}}. $$
(6)

Assuming that there are N pulses with different polarizations used to estimate the fully polarimetric target information X t, the observation from these N pulses can be written as

$$ y(i)=\boldsymbol{a}{\left(\boldsymbol{\xi} (i),\boldsymbol{\eta} (i)\right)}^{\mathrm{T}}{\boldsymbol{X}}_{\mathrm{t}}+\boldsymbol{a}{\left(\boldsymbol{\xi} (i),\boldsymbol{\eta} (i)\right)}^{\mathrm{T}}{\boldsymbol{X}}_{\mathrm{c}}+\nu (i),\kern0.75em i=1,\dots, N, $$
(7)

where ξ(i) and η(i) denote, respectively, the transmission and reception polarizations of the ith pulse, i.e., \( \boldsymbol{a}\left(\boldsymbol{\xi} (i),\boldsymbol{\eta} (i)\right)={\left[{\xi}_{\mathrm{h}}^{(i)}{\eta}_{\mathrm{h}}^{(i)}\ {\xi}_{\mathrm{h}}^{(i)}{\eta}_{\mathrm{v}}^{(i)}\ {\xi}_{\mathrm{v}}^{(i)}{\eta}_{\mathrm{h}}^{(i)}\ {\xi}_{\mathrm{v}}^{(i)}{\eta}_{\mathrm{v}}^{(i)}\right]}^{\mathrm{T}} \). For notational simplicity, we define a(i) a(ξ(i), η(i)). Introducing further the following vector notations

$$ \begin{array}{ll}\mathbf{y}=\hfill & {\left[y(1),\dots, y(N)\right]}^{\mathrm{T}},\hfill \\ {}\mathbf{A}=\hfill & {\left[\boldsymbol{a}(1),\dots, \boldsymbol{a}(N)\right]}^{\mathrm{T}},\hfill \\ {}\mathbf{v}=\hfill & {\left[\nu (1),\dots, \nu (N)\right]}^{\mathrm{T}},\hfill \end{array} $$
(8)

we obtain the matrix representation of the linear measurement model as

$$ \mathbf{y}=\mathbf{A}{\boldsymbol{X}}_{\mathrm{t}}+\mathbf{A}{\boldsymbol{X}}_{\mathrm{c}}+\mathbf{v}. $$
(9)

Notice that the system response A here is an N × 4 complex matrix; so we assume that N > 4 and ensure rank(A) = 4 to estimate four-dimensional complex vector X t [3, 5].

As to the target and clutter statistical characterizations, we do not make restrictive model assumption on the multivariate statistical characterization of X t and X c. However, we assume that we have access to the first two moments of their probability density function (PDF). This assumption is reasonable, especially in a knowledge-aided (possibly cognitive) radar scenario. In such case, clutter statistical parameters can be obtained jointly using geographical information, meteorological measurements, and statistical (possibly empirical) models for the clutter [14]. The statistical parameters of the illuminated target can be obtained or roughly estimated by pre-scan procedures and employing cognitive methods [1416]. Precisely, we assume X t is a four-dimensional random vector of parameters whose realization is to be estimated and has mean E(X t) and covariance matrix C t, and X c is a four-dimensional random vector with zero mean and covariance matrix C c and is uncorrelated with X t, where E() denotes the expectation operation. As to the statistical characterization of the noise vector, we assume that it is Gaussian white with covariance matrix \( {\sigma}_{\nu}^2{\mathbf{I}}_4 \), where I 4 is the identity matrix with a 4 × 4 size. Finally, we assume that the target, clutter, and noise are uncorrelated, so that the joint PDF \( p\left({\boldsymbol{X}}_{\mathrm{t}},{\boldsymbol{X}}_{\mathrm{c}},\mathbf{v}\right) \) is arbitrary.

3 Problem formulation

Our assignment is to estimate the target scattering vector X t from the observations. By inspection of the measurement model in the previous section, it matches the Bayesian linear model form. By exploiting the Bayesian Gauss-Markov Theorem [[17], Theorem 12.1], we obtain the minimum MSE estimator of X t as

$$ {\hat{\boldsymbol{X}}}_{\mathrm{t}}=E\left({\boldsymbol{X}}_{\mathrm{t}}\right)+{\boldsymbol{C}}_{\mathrm{t}}{\mathbf{A}}^{\mathrm{H}}{\left(\mathbf{A}\left({\boldsymbol{C}}_{\mathrm{t}}+{\boldsymbol{C}}_{\mathrm{c}}\right){\mathbf{A}}^{\mathrm{H}}+{\sigma}_{\nu}^2{\mathbf{I}}_N\right)}^{-1}\left(\mathbf{y}-\mathbf{A}E\left({\boldsymbol{X}}_{\mathrm{t}}\right)\right), $$
(10)

where I N is the identity matrix with a N × N size and (·)H denotes the conjugate transpose. The performance of the estimator is measured by the error \( \boldsymbol{e}={\boldsymbol{X}}_{\mathrm{t}}-{\widehat{\boldsymbol{X}}}_{\mathrm{t}} \) whose mean value is zero and covariance matrix is

$$ \boldsymbol{D}={\left({\boldsymbol{C}}_{\mathrm{t}}^{-1}+{\mathbf{A}}^{\mathrm{H}}{\left(\mathbf{A}{\boldsymbol{C}}_{\mathrm{c}}{\mathbf{A}}^{\mathrm{H}}+{\sigma}_{\nu}^2{\mathbf{I}}_N\right)}^{-1}\mathbf{A}\right)}^{-1}, $$
(11)

with the ith diagonal element of D as the minimum Bayesian MSE of X t’s ith element [Kay [17], Theorem 12.1]. Hence, the trace of D, i.e., Tr(D), represents the sum of the minimum Bayesian MSEs of all X t’s four elements. As a consequence, we define the MSE of X t as Tr(D). As done in [3], we consider Tr(D) as the relevant figure of merit and minimize it to optimize transmission and reception polarizations.

Thus, the problem of jointly optimizing transmission and reception polarizations, to minimize the MSE of estimating X t with N diversely polarized pulses, can be formulated as

$$ \mathcal{P}\left\{\begin{array}{c}\underset{\boldsymbol{D},{\left\{\boldsymbol{\xi} (i)\right\}}_{i=1}^N,{\left\{\boldsymbol{\eta} (i)\right\}}_{i=1}^N}{min}\mathrm{T}\mathrm{r}\left(\boldsymbol{D}\right)\\ {}\mathrm{s}.\mathrm{t}.\kern0.5em \boldsymbol{D}={\left({\boldsymbol{C}}_{\mathrm{t}}^{-1}+{\mathbf{A}}^{\mathrm{H}}{\left(\mathbf{A}{\boldsymbol{C}}_{\mathrm{c}}{\mathbf{A}}^{\mathrm{H}}+{\sigma}_{\nu}^2{\mathbf{I}}_N\right)}^{-1}\mathbf{A}\right)}^{-1}\kern0.5em \mathrm{with}\\ {}\kern1.25em \mathbf{A}={\left[\boldsymbol{a}(1),\dots, \boldsymbol{a}(N)\right]}^{\mathrm{T}}\kern0.5em \mathrm{with}\\ {}\kern1.75em \boldsymbol{a}(i)={\left[{\xi}_{\mathrm{h}}^{(i)}{\eta}_{\mathrm{h}}^{(i)},{\xi}_{\mathrm{h}}^{(i)}{\eta}_v^{(i)}, {\xi}_v^{(i)}{\eta}_{\mathrm{h}}^{(i)},{\xi}_v^{(i)}{\eta}_v^{(i)}\right]}^{\mathrm{T}}\ \mathrm{with}\\ {}\kern1.75em \boldsymbol{\xi} (i)={\left[{\xi}_{\mathrm{h}}^{(i)},{\xi}_v^{(i)}\right]}^{\mathrm{T}}\kern0.5em \mathrm{with}\ \left\Vert \boldsymbol{\xi} (i)\right\Vert =1,\kern0.5em \\ {}\kern2em \boldsymbol{\eta} (i)={\left[{\eta}_{\mathrm{h}}^{(i)},{\eta}_v^{(i)}\right]}^{\mathrm{T}}\ \mathrm{with}\ \left\Vert \boldsymbol{\eta} (i)\right\Vert =1,\ i=1,\dots, N.\end{array}\right. $$
(12)

Remarks Actually, there exist several potential optimality criteria for the problem at hand, e.g., minimizing the determinant of D, minimizing the maximum eigenvalue of D and the aforementioned Tr(D), etc. [19]. In this paper, we employ Tr(D) because of the following: firstly, it is reasonable, as stated before and secondly, we exploit the same problem formulation as in [3] and then highlight our design.

4 Optimal waveform selection

In the sequel, we will deal with the non-convex (the objective is non-convex and ξ(i) = η(i) = 1 defines a non-convex set) optimization problem \( \mathcal{P} \). By reformulating the problem using trigonometric parameters and the inspection of its computational complexity, we exploit the sequential minimum MSE estimation scheme to compute the optimal Tx/Rx polarizations instead of directly utilizing the minimum MSE estimation.

4.1 Problem reformulation

A first step toward the goal to tackle the optimization problem is represented by the following notations of transmission and reception polarizations. As to the transmission polarization, it can be parameterized into the following trigonometric function [2]:

$$ \boldsymbol{\xi} =\left\Vert \boldsymbol{\xi} \right\Vert {\mathrm{e}}^{j\varphi }Q\boldsymbol{w} $$
(13)

where \( Q=\left[\begin{array}{l} \cos \alpha \kern1.25em \sin \alpha \\ {}- \sin \alpha \kern0.5em \cos \alpha \end{array}\right] \), \( \boldsymbol{w}=\left[\begin{array}{l} \cos \beta \\ {}j \sin \beta \end{array}\right] \), ξe is the complex envelope of the source signal, α denotes the rotation angle between the system coordinates and the electric ellipse axes, and β determines the ellipse’s eccentricity, respectively, with the definition spaces of these trigonometric parameters being φ (−π, π], α [−π/2, π/2], and β [−π/4, π/4]. Since the transmitted signal is power limited, i.e., ξ = 1, by replacing Q and w with their explicit forms, we can rewrite (13) as

$$ \boldsymbol{\xi} \triangleq {\mathrm{e}}^{j\varphi}\left[\begin{array}{l}{\zeta}_{\mathrm{h}}\\ {}{\zeta}_v\end{array}\right]={\mathrm{e}}^{j\varphi}\left[\begin{array}{l} \cos \alpha \cos \beta +j \sin \alpha \sin \beta \\ - \sin \alpha \cos \beta +j \cos \alpha \sin \beta \end{array}\right] $$
(14)

Similarly, the trigonometric form of received polarization can be written as

$$ \boldsymbol{\eta} \triangleq {\mathrm{e}}^{j\phi}\left[\begin{array}{l}{\iota}_{\mathrm{h}}\\ {}{\iota}_v\end{array}\right]={\mathrm{e}}^{j\phi}\left[\begin{array}{l} \cos \theta \cos \vartheta +j \sin \theta \sin \vartheta \\ {}- \sin \theta \cos \vartheta +j \cos \theta \sin \vartheta \end{array}\right] $$
(15)

where ϕ (−π, π], θ [−π/2, π/2], and ϑ [−π/4, π/4].

We can observe that, once ξ and η are written into (14) and (15), respectively, Tr(D) can be uniquely determined by trigonometric parameters {φ, α, β, ϕ, θ, ϑ}. Before proceeding further, herein we introduce an interesting observation of Tr(D) with respect to such parameters, as summarized in following property.

Property 1 The initial signal phases φ and ϕ do not affect the value of D.

Proof See Appendix 2.

It can be seen that with Property 1, Tr(D) can be uniquely determined by four trigonometric parameters {α, β, θ, ϑ} instead of six trigonometric parameters {φ, α, β, ϕ, θ, ϑ}. Thus, the problem \( \mathcal{P} \) can be equivalently recast as

$$ {\mathcal{P}}_1\left\{\begin{array}{c}\underset{\boldsymbol{D},{\left\{{\alpha}_i\right\}}_{i=1}^N,{\left\{{\beta}_i\right\}}_{i=1}^N,{\left\{{\theta}_i\right\}}_{i=1}^N,{\left\{{\vartheta}_i\right\}}_{i=1}^N}{min}\mathrm{T}\mathrm{r}\left(\boldsymbol{D}\right)\\ {}\mathrm{s}.\mathrm{t}.\kern0.5em \boldsymbol{D}={\left({\boldsymbol{C}}_{\mathrm{t}}^{-1}+{\mathbf{A}}^{\mathrm{H}}{\left(\mathbf{A}{\boldsymbol{C}}_{\mathrm{c}}{\mathbf{A}}^{\mathrm{H}}+{\sigma}_{\nu}^2{\mathbf{I}}_N\right)}^{-1}\mathbf{A}\right)}^{-1}\kern0.5em \mathrm{with}\\ {}\kern1.25em \mathbf{A}={\left[\boldsymbol{a}(1),\dots, \boldsymbol{a}(N)\right]}^{\mathrm{T}}\kern0.5em \mathrm{with}\\ {}\kern1.75em \boldsymbol{a}(i)={\left[{\xi}_{\mathrm{h}}^{(i)}{\eta}_{\mathrm{h}}^{(i)},{\xi}_{\mathrm{h}}^{(i)}{\eta}_v^{(i)}, {\xi}_v^{(i)}{\eta}_{\mathrm{h}}^{(i)},{\xi}_v^{(i)}{\eta}_v^{(i)}\right]}^{\mathrm{T}}\ \mathrm{with}\\ {}\kern1.75em \left[\begin{array}{l}{\xi}_{\mathrm{h}}^{(i)}\\ {}{\xi}_v^{(i)}\end{array}\right]=\left[\begin{array}{l} \cos {\alpha}_i \cos {\beta}_i+j \sin {\alpha}_i \sin {\beta}_i\\ {}- \sin {\alpha}_i \cos {\beta}_i+j \cos {\alpha}_i \sin {\beta}_i\end{array}\right]\kern0.5em \mathrm{and}\ \\ {}\kern1.75em \left[\begin{array}{l}{\eta}_{\mathrm{h}}^{(i)}\\ {}{\eta}_v^{(i)}\end{array}\right]=\left[\begin{array}{l} \cos {\theta}_i \cos {\vartheta}_i+j \sin {\theta}_i \sin {\vartheta}_i\\ {}- \sin {\theta}_i \cos {\vartheta}_i+j \cos {\theta}_i \sin {\vartheta}_i\end{array}\right]\kern0.5em \mathrm{with}\\ {}\kern1.75em {\alpha}_i\in \left[-\pi /2,\pi /2\right],\ {\beta}_i\in \left[-\pi /4,\pi /4\right],\ \\ {}\kern1.75em {\theta}_i\in \left[-\pi /2,\pi /2\right],\ {\vartheta}_i\in \left[-\pi /4,\pi /4\right],\ i=1,\dots, N.\ \end{array}\right. $$
(16)

We can see that \( {\mathcal{P}}_1 \) is still a non-convex optimization problem because the objective function is the same non-convex function as in \( \mathcal{P} \). Nevertheless, as opposed to \( \mathcal{P} \), the equivalent formulation provided by \( {\mathcal{P}}_1 \) shows that lattice search along the trigonometric parameters {α, β, θ, ϑ} can be employed to calculate the optimal polarization, as done in [2] and [5]. But since lattice search does not explore any optimization property, it requires a high computational burden. Precisely, the computational complexity is firstly exponential to the observation number N. Also, it highly depends on the maximum point search algorithm used for {α, β, θ, ϑ}. If we use lattice search with l α , l β , l θ and l ϑ points in each dimension of the domain space {α, β, θ, ϑ}, respectively, the relevant complexity burden is O((l α l β l θ l ϑ )N).

4.2 Sequential minimum MSE estimation

Since to solve \( {\mathcal{P}}_1 \) straightforward requires a high computational burden, we exploit the sequential minimum MSE estimation to tackle it. By insight of \( {\mathcal{P}}_1 \), we can observe that it works at the case of block signal polarization design, namely designing the polarizations of N consecutive temporal signal samples at a time. However, in real radar signal processing applications, the returns are ongoing as time progresses. So it is reasonable to process the data sequentially in time. As to the problem at hand, the resulting solution belongs to the sequential minimum MSE estimation. Thus, by following the sequential minimum MSE estimation procedure in [Kay [17], Eq. (12.47)–(12.49)], we develop the sequential minimum MSE estimation to handle \( {\mathcal{P}}_1 \) as follows.

Let \( {\widehat{\boldsymbol{X}}}_{\mathrm{t},n} \) denote the minimum MSE estimator based on the observations [y(1), …, y(n)]T and D n be the corresponding minimum MSE matrix (just the sequential version of (10) and (11)). Then, when the new sample y(n + 1) is available, the estimator is updated as

$$ {\widehat{\boldsymbol{X}}}_{\mathrm{t},n+1}={\widehat{\boldsymbol{X}}}_{\mathrm{t},n}+{\boldsymbol{K}}_{n+1}\left(y\left(n+1\right)-{\boldsymbol{a}}^{\mathrm{T}}\left(n+1\right){\widehat{\boldsymbol{X}}}_{\mathrm{t},n}\right), $$
(17)

where

$$ {\boldsymbol{K}}_{n+1}=\frac{{\boldsymbol{D}}_n\boldsymbol{a}\left(n+1\right)}{{\left[{\mathbf{A}}_{n+1}{\boldsymbol{C}}_{\mathrm{c}}{\mathbf{A}}_{n+1}^{\mathrm{H}}+{\sigma}_{\nu}^2{\mathbf{I}}_{n+1}\right]}_{n+1,n+1}+{\boldsymbol{a}}^{\mathrm{H}}\left(n+1\right){\boldsymbol{D}}_n\boldsymbol{a}\left(n+1\right)}, $$
(18)

is the gain factor weighting confidence in the new data with A n + 1 = [a(1), …, a(n + 1)]T and a(i) having the same definition as before and [·] n + 1,n + 1 denoting the n + 1th diagonal element of matrix [·]. In the meantime, the minimum MSE matrix is updated as

$$ {\boldsymbol{D}}_{n+1}=\left({\mathbf{I}}_{n+1}-{\boldsymbol{K}}_{n+1}{\boldsymbol{a}}^{\mathrm{T}}\left(n+1\right)\right){\boldsymbol{D}}_n. $$
(19)

As to D n + 1, based on [Kay [17], p. 393], with n increasing, [D n + 1] ii , i = 1, 2, 3, 4, decreases and converges to a certain value. Hence, tr(D n + 1) is also a monotonic decreasing sequence and converges to a certain value.

4.3 Optimal waveform parameter selection

By employing the sequential minimum MSE estimation procedure described earlier, our goal is to search for the optimal Tx/Rx polarizations for every iteration, under the criterion of minimizing the Bayesian MSE-based cost function tr(D n ). As to each iteration, we still employ lattice search to compute the optimal Tx/Rx polarizations. Precisely, the sequential procedure to update the optimal polarizations is summarized as Algorithm 1. Also, Fig. 1 exhibits a pictorial representation of the sequential estimation processing.

Fig. 1
figure 1

Block diagram of the sequential estimation processing

Now let us examine the computational complexity of Algorithm 1. Let us still consider that there are N observation samples. Different from solving \( {\mathcal{P}}_1 \), Algorithm 1 is linear to the observation number N, with requiring the lattice search for the optimal polarizations in the domain space {α, β, θ, ϑ} in each iteration. Thus, the overall complexity is O(l α l β l θ l ϑ N). In comparison with the direct lattice search to solve \( {\mathcal{P}}_1 \), the computation burden is greatly reduced. It is not surprising since the proposed algorithm is sequential, namely it does not optimize for all observations and thus, it is clearly less expensive than the one proposed in (16).

5 Vector measurement system

Vector sensors, which employ two-dimensional sensors to measure both horizontal and vertical components of the electric field at each of the receivers, provide a significant improvement in performance over scalar sensors for a variety of applications [10, 18]. For such mentioned reason, in this section, we develop the approach to optimize transmission polarization of vector measurement systems by following the similar way of designing the optimal Tx/Rx polarizations for the aforementioned scalar system. In general, the derivation of such system model and the corresponding adaptive design process follow the similar way as the model in Section 2, so it is simplified.

5.1 Vector measurement model

As to the vector measurement system with adaptive transmission polarization, the transmission vector \( {\left[{\xi}_{\mathrm{h}},{\xi}_{\mathrm{v}}\right]}^{\mathrm{T}} \) is allowed to be chosen freely while the reception side has both horizontal and vertical channels to separately receive horizontal and vertical polarizations of the return, respectively, to form a two-dimensionalvector. The scattered field is related to the incident by

$$ \boldsymbol{y}(t)=\frac{g}{r^2}\left({\boldsymbol{S}}_{\mathrm{t}}+{\boldsymbol{S}}_{\mathrm{c}}\right)\boldsymbol{\xi} s\left(t-\tau \right)+\boldsymbol{w}(t), $$
(20)

where \( \boldsymbol{y}(t)={\left[{y}_{\mathrm{h}}(t)\kern0.5em {y}_{\mathrm{v}}(t)\right]}^{\mathrm{T}} \) is the complex envelope vector of the received electric field, \( \boldsymbol{w}(t)={\left[{w}_{\mathrm{h}}(t)\kern0.5em {w}_v(t)\right]}^{\mathrm{T}} \) is the complex envelope vector of noise for both polarizations, and the other parameters have the same definitions as in Eq. (1). This signal is sampled and passed through the matched filter whose output is appropriately normalized to move the effect of \( \frac{g}{r^2} \) into the noise term. Finally, the normalized output of the matched filter at the receiver is given as

$$ \boldsymbol{y}=\left({\boldsymbol{S}}_{\mathrm{t}}+{\boldsymbol{S}}_{\mathrm{c}}\right)\boldsymbol{\xi} +\boldsymbol{v}, $$
(21)

where \( \boldsymbol{y}={\left[{y}_{\mathrm{h}}{y}_\mathrm{v}\right]}^{\mathrm{T}} \) and \( \boldsymbol{v}={\left[{\nu}_{\mathrm{h}}{\nu}_\mathrm{v}\right]}^{\mathrm{T}} \). Stacking all the observations and the noise components into column vectors, in a similar fashion to the approach used in Section 2, we obtain 2N-dimensional vectors y′ and \( {\mathbf{v}}^{\mathbf{\prime}} \), i.e., \( {\mathbf{y}}^{\mathbf{\prime}}={\left[{y}_{\mathrm{h}}(1),{y}_{\mathrm{v}}(1),\dots, {y}_{\mathrm{h}}(N),{y}_{\mathrm{v}}(N)\right]}^{\mathrm{T}} \) and \( {\mathbf{v}}^{\mathbf{\prime}}={\left[{\nu}_{\mathrm{h}}(1),{\nu}_{\mathrm{v}}(1),\dots, {\nu}_{\mathrm{h}}(N),{\nu}_{\mathrm{v}}(N)\right]}^{\mathrm{T}} \), respectively. Vectors X t and X c remain the same as defined earlier.

We also define the transformation matrix

$$ \mathbf{T} = \left[\begin{array}{l}{\xi}_{\mathrm{h}}^{(1)}\kern0.75em {\xi}_v^{(1)}\kern1.25em 0\kern1.5em 0\\ {}\kern0.5em 0\kern1.25em 0\kern1.5em {\xi}_{\mathrm{h}}^{(1)}\kern0.75em {\xi}_v^{(1)}\\ {}\kern3.5em \vdots \\ {}{\xi}_{\mathrm{h}}^{(N)}\kern0.5em {\xi}_v^{(N)}\kern0.75em 0\kern1.75em 0\\ {}\ 0\kern1.5em 0\kern1.25em {\xi}_{\mathrm{h}}^{(N)}\kern0.5em {\xi}_v^{(N)}\end{array}\right], $$
(22)

So the observation vector is expressed as

$$ {\mathbf{y}}^{\mathbf{\prime}}=\mathbf{T}{\boldsymbol{X}}_{\mathrm{t}}+\mathbf{T}{\boldsymbol{X}}_{\mathrm{c}}+{\mathbf{v}}^{\mathbf{\prime}}. $$
(23)

Thus, we obtain a similar linear model for the vector system with the scalar one. In order to make a fair comparison, assume that \( {\nu}_{\mathrm{h}}(i) \), \( {\nu}_{\mathrm{v}}(i) \), and i = 1, …, N have the same power with \( \frac{\sigma^2}{2} \) and are uncorrelated with each other. Then, the covariance matrix D′ of X t’s minimum MSE estimation from y′ is

$$ {\boldsymbol{D}}^{\boldsymbol{\prime}}={\left({\boldsymbol{C}}_{\mathrm{t}}^{-1}+{\mathbf{T}}^{\mathrm{H}}{\left(\mathbf{T}{\boldsymbol{C}}_{\mathrm{c}}{\mathbf{T}}^{\mathrm{H}}+\frac{\sigma_{\nu}^2}{2}{\mathbf{I}}_{2N}\right)}^{-1}\mathbf{T}\right)}^{-1}. $$
(24)

Substituting the trigonometric representation of (14) into (24) and following the same way in Section 4.1, we can prove that the initial signal phase φ does not affect the value of D′. Since the relevant proof can be obtained similarly as that given for Property 1, it is omitted for the sake of brevity. Hence, the optimization problem of seeking the optimal transmission polarization to minimize the MSE of estimating X t for vector systems can be formulated as

$$ {\mathcal{P}}_2\left\{\begin{array}{c}\underset{{\boldsymbol{D}}^{\boldsymbol{\prime}},{\left\{{\alpha}_i\right\}}_{i=1}^N,{\left\{{\beta}_i\right\}}_{i=1}^N}{min}\mathrm{T}\mathrm{r}\left({\boldsymbol{D}}^{\boldsymbol{\prime}}\right)\\ {}\mathrm{s}.\mathrm{t}.\kern0.5em {\boldsymbol{D}}^{\boldsymbol{\prime}}={\left({\boldsymbol{C}}_{\mathrm{t}}^{-1}+{\mathbf{T}}^{\mathrm{H}}{\left(\mathbf{T}{\boldsymbol{C}}_{\mathrm{c}}{\mathbf{T}}^{\mathrm{H}}+\frac{\sigma_{\nu}^2}{2}{\mathbf{I}}_{2N}\right)}^{-1}\mathbf{T}\right)}^{-1}\mathrm{with}\\ {}\kern1.5em \mathbf{T} = \left[\begin{array}{l}{\xi}_{\mathrm{h}}^{(1)}\kern0.75em {\xi}_v^{(1)}\kern1.25em 0\kern1.5em 0\\ {}\kern0.5em 0\kern1.25em 0\kern1.5em {\xi}_{\mathrm{h}}^{(1)}\kern0.75em {\xi}_v^{(1)}\\ {}\kern3.5em \vdots \\ {}{\xi}_{\mathrm{h}}^{(N)}\kern0.5em {\xi}_v^{(N)}\kern0.75em 0\kern1.75em 0\\ {}\ 0\kern1.5em 0\kern1.25em {\xi}_{\mathrm{h}}^{(N)}\kern0.5em {\xi}_v^{(N)}\end{array}\right]\kern0.5em \mathrm{with}\\ {}\kern1.75em \boldsymbol{\xi} (i)=\left[\begin{array}{l}{\xi}_{\mathrm{h}}^{(i)}\\ {}{\xi}_v^{(i)}\end{array}\right]=\left[\begin{array}{l} \cos {\alpha}_i \cos {\beta}_i+j \sin {\alpha}_i \sin {\beta}_i\\ {}- \sin {\alpha}_i \cos {\beta}_i+j \cos {\alpha}_i \sin {\beta}_i\end{array}\right]\kern0.5em \mathrm{with}\\ {}\kern1.75em {\alpha}_i\in \left[-\pi /2,\pi /2\right],\ {\beta}_i\in \left[-\pi /4,\pi /4\right],i=1,\dots, N.\end{array}\right. $$
(25)

Similar in solving \( {\mathcal{P}}_1 \), the computational complexity of solving \( {\mathcal{P}}_2 \) depends exponentially on the observation number N, as well as the complexity burden required at each observation, i.e., the maximum point search algorithm used for parameters {α, β}. Considering employing lattice search with l α and l β points in each dimension of the domain space {α, β}, the required complexity is O((l α l β )N).

5.2 Sequential estimation algorithm for vector systems

Because of the similar reason, namely directly solving \( {\mathcal{P}}_2 \) by searching for the optimal trigonometric parameters with latter search requires a high complexity burden, now in this subsection, we develop the relevant sequential optimization procedure for such vector system. Let \( {\widehat{\boldsymbol{X}}}_{\mathrm{t},n} \) denote the minimum MSE estimator based on the observations \( {\left[{y}_{\mathrm{h}}(1),{y}_{\mathrm{v}}(1),\dots, {y}_{\mathrm{h}}(n),{y}_{\mathrm{v}}(n)\right]}^{\mathrm{T}} \) and \( {\boldsymbol{D}}_n^{\boldsymbol{\prime}} \) be the corresponding covariance matrix of the estimator. Then, with the new sample y(n + 1), the estimator is updated as

$$ {\widehat{\boldsymbol{X}}}_{\mathrm{t},n+1}={\widehat{\boldsymbol{X}}}_{\mathrm{t},n}+{\boldsymbol{K}}_{n+1}\left(\boldsymbol{y}\left(n+1\right)-{\boldsymbol{t}}^{\mathrm{T}}\left(n+1\right){\widehat{\boldsymbol{X}}}_{\mathrm{t},n}\right), $$
(26)

where

$$ \boldsymbol{t}\left(n+1\right)={\left[\begin{array}{l}{\xi}_{\mathrm{h}}\left(n+1\right)\kern1em {\xi}_v\left(n+1\right)\kern1.25em 0\kern1.25em 0\\ {}\kern0.75em 0\kern1.25em 0\kern0.75em {\xi}_{\mathrm{h}}\left(n+1\right)\kern1em {\xi}_v\left(n+1\right)\end{array}\right]}^{\mathrm{T}}, $$
(27)

and

$$ \frac{{\boldsymbol{K}}_{n+1}={\boldsymbol{D}}_n^{\boldsymbol{\prime}}\boldsymbol{t}\left(n+1\right)}{{\left[{\mathbf{T}}_{n+1}{\boldsymbol{C}}_{\mathrm{c}}{\mathbf{T}}_{n+1}^{\mathrm{H}}+\frac{\sigma_{\nu}^2}{2}{\mathbf{I}}_{2\left(n+1\right)}\right]}_{2\left(n+1\right),2\left(n+1\right)}+{\boldsymbol{t}}^{\mathrm{H}}\left(n+1\right){\boldsymbol{D}}_n^{\boldsymbol{\prime}}} \boldsymbol{t}\left(n+1\right), $$
(28)

with T n + 1 = [t(1), …, t(n + 1)]T and [·]2(n + 1),2(n + 1) denoting the 2(n + 1)th diagonal element of matrix [·]. In the meantime, the covariance matrix is updated as

$$ {\boldsymbol{D}}_{n+1}^{\boldsymbol{\prime}}=\left({\mathbf{I}}_{2\left(n+1\right)}-{\boldsymbol{K}}_{n+1}{\boldsymbol{t}}^{\mathrm{T}}\left(n+1\right)\right){\boldsymbol{D}}_n^{\boldsymbol{\prime}}. $$
(29)

Again, as to \( {\boldsymbol{D}}_{n+1}^{\boldsymbol{\prime}} \), with n increasing, \( {\left[{\boldsymbol{D}}_{n+1}^{\boldsymbol{\prime}}\right]}_{ii} \) decreases and converges to a certain value. As a consequence, \( \mathrm{t}\mathrm{r}\left({\boldsymbol{D}}_{n+1}^{\boldsymbol{\prime}}\right) \) is also a monotonic decreasing sequence and converges to a certain value. Finally, the devised sequential optimization procedure is summarized in Algorithm 2.

It is worth noticing that the computational complexity of Algorithm 2 is linear to the observation number N, with requiring the lattice search to seek the optimal polarization in the domain space {α, β} for every observation. Thus, the overall complexity is O(l α l β N). Compared to handling \( {\mathcal{P}}_2 \) straightly with lattice search (the corresponding computational complexity is O((l α l β )N)), the computation burden is greatly reduced. The reason why the proposed algorithm is less expensive than solving \( {\mathcal{P}}_2 \) is similar as that in Algorithm 1, namely it is sequential and does not require optimization for all observations.

6 Numerical examples

In this section, the performance analysis of the proposed algorithms is presented. As to scalar measurement systems with the joint Tx/Rx polarization optimization, we provide numerical examples to demonstrate the effectiveness of the devised algorithms, in comparison with two other polarization approaches. Additionally, we also compare their performances with that of the vector measurement system with transmission polarization optimization and highlight the advantage of the latter design.

Throughout the simulations, we generate the target and clutter covariance matrices employing a similar way as in [3]. Though the solution in [3] is criticized incorrectly in Section 1, the approach there to generate target and clutter covariance matrices is fine. Precisely, the target covariance matrix is chosen as

$$ {\boldsymbol{C}}_{\mathrm{t}}=a{\boldsymbol{U}}_{\mathrm{t}}{\boldsymbol{\Lambda}}_{\mathrm{t}}{\boldsymbol{U}}_{\mathrm{t}}^{\mathrm{H}}, $$
(30)

where U t is an arbitrary unitary matrix constructed with the left singular vectors of a 4 × 4 matrix M with independent and identical distribution complex Gaussian entries, i.e., M = U t Λ M U r, Λ t = diag(rand(1, 4)) with rand(1, 4) denoting a random four-dimensional real vector, and diag(·) being the diagonalization of the vector (·). Note that we normalize rand(1, 4) to keep the target power to be constant. a is the factor that controls the signal-to-clutter-plus-noise ratio (SCNR) with the definition [5]:

$$ \text{SCNR}=\frac{\text{tr}(\boldsymbol{C}_{t})}{\text{tr}(\boldsymbol{C}_{c})+{\sigma}^{2}_{v}}, $$
(31)

where the clutter covariance matrix is chosen as

$$ {\boldsymbol{C}}_{\mathrm{c}}={\boldsymbol{U}}_{\mathrm{c}}{\boldsymbol{\Lambda}}_{\mathrm{c}}{\boldsymbol{U}}_{\mathrm{c}}^{\mathrm{H}}, $$
(32)

with Λ c = diag([0.25, 0.25, 0.25, 0.25]) and U c being chosen with the similar operation to generate U t. Throughout the simulations, we assume the noise power \( {\sigma}_{\nu}^2 \) = 0 dB.

Furthermore, by inspection of Eqs. (19) and (29), we can observe that D n + 1 and \( {\boldsymbol{D}}_{n+1}^{\boldsymbol{\prime}} \) do not depend on the observations \( {\left\{{y}_{i+1}\right\}}_{i=0}^n \) and \( {\left\{{\boldsymbol{y}}_{i+1}\right\}}_{i=0}^n \), respectively, even though in Algorithm 1 and Algorithm 2, both of them are the input parameters that update the estimation of target scattering vector \( {\widehat{\boldsymbol{X}}}_{\mathrm{t}} \). Since the MSE of \( {\widehat{\boldsymbol{X}}}_{\mathrm{t}} \) is the figure of merit for the optimization problem and Tr(D n + 1), \( \mathrm{T}\mathrm{r}\left({\boldsymbol{D}}_{n+1}^{\boldsymbol{\prime}}\right) \) are the objectives to be optimized for each algorithm, respectively, in this section, we will investigate the changes of Tr(D n + 1) and \( \mathrm{T}\mathrm{r}\left({\boldsymbol{D}}_{n+1}^{\boldsymbol{\prime}}\right) \) without considering updating of \( {\widehat{\boldsymbol{X}}}_{\mathrm{t}} \). As a consequence, we do not use \( {\left\{{y}_{i+1}\right\}}_{i=0}^n \) and \( {\left\{{\boldsymbol{y}}_{i+1}\right\}}_{i=0}^n \) in the simulation. Also notice that n ≥ 4 is required for both the proposed algorithms. So we randomly choose \( {\left\{{\alpha}_i\right\}}_{i=1}^4={\left\{{\beta}_i\right\}}_{i=1}^4={\left\{{\theta}_i\right\}}_{i=1}^4={\left\{{\vartheta}_i\right\}}_{i=1}^4=\pi /8 \) (some other values within the definition ranges can be also used) as the initial trigonometric parameters to generate A 4 and T 4 for Algorithm 1 and Algorithm 2, respectively.

Finally, regarding the lattice search used to seek the optimal polarizations for Algorithm 1 and Algorithm 2, set l α  = l θ  = 103 and l β  = l ϑ  = 500. By using the Monte Carlo method, the MSEs are calculated by averaging 105 independent realizations of C t and C c, respectively.

6.1 Scalar measurement systems

We choose the conventional polarimetric radar and the polarimetric radar with adaptive transmission polarization as the counterparts to compare their performances with the polarimetric radar with joint Tx/Rx optimization. As to conventional polarimetric systems, the transmission side alternatively transmits horizontal and vertical polarizations, i.e., ξ = [1, 0]T in the current pulse and ξ = [0, 1]T in next pulse. In the reception side, the horizontal and vertical polarizations are simultaneously received, so the reception vector satisfies \( \boldsymbol{\eta} ={\left[\sqrt{2}/2,\sqrt{2}/2\right]}^{\mathrm{T}} \). Therefore, the relevant MSE can be calculated by substituting the waveform parameters {α = 0, β = 0, θ = ‐ π/4, ϑ = 0} and {α = − π/2, β = 0, θ = − π/4, ϑ = 0}, alternatively, into Algorithm 1 with the increase of observation samples and without optimal polarization search. Then, as to the scalar system with transmission polarization optimization, the transmission polarization \( {\left[{\xi}_{\mathrm{h}},{\xi}_{\mathrm{v}}\right]}^{\mathrm{T}} \) is allowed to be chosen freely while the reception polarization is fixed as \( {\left[\sqrt{2}/2,\sqrt{2}/2\right]}^{\mathrm{T}} \). As such, we can employ Algorithm 1 to optimally select transmission polarization by fixing θ = − π/4 and ϑ = 0.

By employing Algorithm 1, Fig. 2 plots the MSEs of the estimating target scattering matrix S t versus the number of observation samples for the aforementioned three scalar systems, with SCNR = 0 dB. As expected, the devised method monotonically reduces the MSE and converges to a certain value. On the other hand, the plots clearly show that the scalar system with joint Tx\Rx polarization optimization performs better than that with transmission polarization optimization as well as the scalar system with transmission polarization optimization showing a significant performance gain with respect to conventional polarization radars.

Fig. 2
figure 2

MSE performance comparison of estimating the target scattering matrix S t versus the number of observation samples, with SCNR = 0 dB

Additionally, Fig. 3 draws the MSE of the estimating target scattering matrix S t versus the SCNR, with a fixed 50 observation samples. It can be observed that, the scalar system with optimally designed Tx/Rx polarizations leads to a power gain of 4–6 dB with respect to that with only transmission polarization optimization, as well as 6–8 dB gain with respect to the conventional polarization radar.

Fig. 3
figure 3

MSE performance comparison of estimating the target scattering matrix S t versus the SCNR, with a fixed 50 observation samples

6.2 Scalar measurement systems versus vector measurement systems

In this subsection, we compare the performance of scalar measurement systems with that of vector systems. As in Fig. 4, the MSEs of the three aforementioned scalar systems and the vector system with transmission polarization optimization versus the number of observation samples with SCNR = 0 dB are illustrated. The results highlight that the vector measurement with transmission polarization optimization has almost the same estimation performance as the scalar measurement with joint Tx/Rx polarization optimization. This is because, in the former case, the receiver polarization optimization is implicitly performed. In other words, even though we perform joint optimization over both transmission and reception polarizations for the scalar system, we are still finding the best linear combination of the two received measurements at each receiver. However, combining them linearly need not be the overall optimal solution, even if we spend much more time on doing so. This can be avoided by retaining the vector measurements, thereby showing better performance as demonstrated in [6].

Fig. 4
figure 4

MSE performance comparison between scalar measurement systems and vector measurement ones versus the number of observation samples, with SCNR = 0 dB

Finally, Fig. 5 shows the performance of the four systems as a function of SCNR, with the fixed length of observation sample. Again, the results demonstrate that the vector measurement system has the same performance with the scalar measurement system with joint Tx/Rx optimization.

Fig. 5
figure 5

MSE performance comparison between scalar measurement systems and vector measurement ones versus the SCNR, with a fixed 50 observation samples

7 Conclusions

Adaptive polarization design of polarimetric radars, for the purpose of optimally estimating the target polarization scattering vector, is studied. Starting from the problem of designing the optimal polarizations for the scalar measurement system with joint Tx/Rx polarization optimization, under considering the minimum MSE of the estimator as the figure of merit, a sequential method of selecting the optimal Tx/Rx polarizations for such system is developed by owing to a suitable reformulation of the considered non-convex design problem. Furthermore, by employing a similar deviation procedure, a sequential method to select the optimal transmission polarization for the vector system with transmission polarization optimization is also devised. Since both the proposed algorithms make use of the sequential minimum MSE estimation, they monotonically decrease the MSE of the estimation and converge to a stationary point. The complexity of the proposed methods is linear with the number of outer iterations whereas at each iteration, it mainly requires the lattice search along four and two trigonometric parameters, respectively.

Several numerical examples have been provided to assess the effectiveness of the proposed methods. Precisely, the performance of the scalar measurement system with joint Tx\Rx polarization optimization, the one with transmission polarization optimization and the one with conventional polarimetric design, is compared. The system with joint Tx\Rx polarization optimization shows a significant performance improvement with respect to the one with transmission polarization optimization, as well as the conventional polarization approach. Moreover, the numerical results also show that the vector measurement system with transmission polarization optimization provides the comparative estimation performance with the scalar measurement system with joint Tx/Rx optimization. This is because, in the latter case, the receiver polarization optimization is implicitly performed. But the vector measurement system requires a lower computation burden, highlighting the advantage of such system design.

References

  1. D Giuli, Polarization diversity in radars. Proc. IEEE 74(2), 245–269 (1986)

    Article  MathSciNet  Google Scholar 

  2. J Wang, A Nehorai, Adaptive polarimetry design for a target in compound-Gaussian clutter. Signal Process. 89, 1061–1069 (2009)

    Article  MATH  Google Scholar 

  3. J-J Xiao, A Nehorai, Joint transmitter and receiver polarization optimization for scattering estimation in clutter. IEEE Trans. Signal Process. 57(10), 4142–4147 (2009)

    Article  MathSciNet  Google Scholar 

  4. DA Garren, AC Odom, SU Pillai, JR Guerci, Full-polarization matched-illumination for target detection and identification. IEEE Trans. Aerosp. Electr. Syst. 38(3), 824–837 (2002)

    Article  Google Scholar 

  5. M Hurtado, A Nehorai, Polarimetric detection of targets in heavy inhomogeneous clutter. IEEE Trans. Signal Process. 56(4), 1349–1361 (2008)

    Article  MathSciNet  Google Scholar 

  6. S Gogineni, A Nehorai, Polarimetric MIMO radar with distributed antennas for target detection. IEEE Trans. Signal Process. 58(3), 1689–1697 (2010)

    Article  MathSciNet  Google Scholar 

  7. J Liu, Z Zhang, Y Yang, Performance enhancement of subspace detection with a diversely polarized antenna. IEEE Signal Process. Lett. 19(1), 4–7 (2012)

    Article  Google Scholar 

  8. J Liu, Z Zhang, Y Yang, Optimal waveform design for generalized likelihood ratio and adaptive matched filter detectors using a diversely polarized antenna. Signal Process. 92(4), 1126–1131 (2012)

    Article  MathSciNet  Google Scholar 

  9. M Hurtado, T Zhao, A Nehorai, Adaptive polarized waveform design for target tracking based on sequential Bayesian inference. IEEE Trans. Signal Process. 56(3), 1120–1133 (2008)

    Article  MathSciNet  Google Scholar 

  10. A Nehorai, E Paldi, Vector-sensor array processing for electromagnetic source localization. IEEE Trans. Signal Process. 42(2), 376–398 (1994)

    Article  Google Scholar 

  11. S Boyd, L Vandenberghe, Semidefinite programming. SIAM Rev. 38, 49–95 (1996)

    Article  MATH  MathSciNet  Google Scholar 

  12. CVX Research, Inc. CVX: Matlab software for disciplined convex programming, version 2.0. http://cvxr.com/cvx, April 2011

  13. M A Richards, J A Scheer, W A Holm, Principles of Modern Radar: Basic principles (IET Digital Library, 2010)

  14. A Aubry, A De Maio, A Farina, M Wicks, Knowledge-aided (potentially cognitive) transmit signal and receive filter design in signal-dependent clutter. IEEE Trans. Aerosp. Electron. Syst. 49(1), 93–117 (2013)

    Article  Google Scholar 

  15. F Gini, A De Maio, L Patton, Waveform design and diversity for advanced radar systems (Institution of Engineering and Technology, London, 2012)

    Book  Google Scholar 

  16. A Aubry, A De Maio, M Piezzo, A Farina, M Wicks, Cognitive design of the receive filter and transmitted phase code in reverberating environment. IET Radar Sonar Navig. 6(9), 822–833 (2012)

    Article  Google Scholar 

  17. SM Kay, Fundamentals of statistical signal processing: estimation theory (Prentice-Hall, New Jersey, 1993)

    MATH  Google Scholar 

  18. A Nehorai, E Paldi, Acoustic vector-sensor array processing. IEEE Trans. Signal Process. 42(9), 2481–2491 (1994)

    Article  Google Scholar 

  19. GE Box, WG Hunter, JS Hunter, Statistics for experimenters (John Wiley & Sons, New York, 1978)

    MATH  Google Scholar 

Download references

Acknowledgements

The authors would like to thank Mr. John Lucynski for providing some comments on how to correct the language. This work was also supported by the National Natural Science Foundation of China (Nos. 61401488, 61501475, and 61490692).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xu Cheng.

Additional information

Competing interests

The authors declare that they have no competing interests.

Appendices

Appendix 1: Proof of the failure of the proposed method in [3] to solve Tx/Rx polarizations

Before the proof, in such context, we declare that the parameter definitions and notations hereinafter have the same meaning as in [3]. In [3], the authors propose an adaptive waveform polarization method for the estimation of target scatter in the presence of clutter. The proposed algorithm, to determine the coefficients of the antenna transmission and reception polarizations, has a convex form and can be solved by employing SDP. However, the authors do not investigate whether the polarization vectors can be obtained after the convex problem is solved to obtain A (at least such processing is absent in the paper). In this appendix, we provide a proof that the transmission and reception polarization vectors cannot be effectively obtained through the solution of A. Essentially, the convex form is only a relaxation version of the initial problem. The proof is as follows.

Let us assume that A is already solved by employing the proposed convex optimization in [3]. A is an m × 4 complex matrix and based on (4) and (6) of [3], its ith row satisfies

$$ \begin{array}{ll}{\mathbf{a}}_i & i=\mathbf{a}\left(P(i);\boldsymbol{\xi} (i),\boldsymbol{\eta} (i)\right)\hfill \\ {}\hfill & =\sqrt{P(i)}\mathbf{p}\left(\boldsymbol{\xi} (i),\boldsymbol{\eta} (i)\right)\hfill \\ {}\hfill & \triangleq \sqrt{P(i)}{\left[{\left({\xi}_{\mathrm{h}}^{(i)}\right)}^{*}{\eta}_{\mathrm{h}}^{(i)}\kern0.75em {\left({\xi}_v^{(i)}\right)}^{*}{\eta}_v^{(i)}\kern0.75em {\left({\xi}_{\mathrm{h}}^{(i)}\right)}^{*}{\eta}_v^{(i)}\kern0.75em {\left({\xi}_v^{(i)}\right)}^{*}{\eta}_{\mathrm{h}}^{(i)}\right]}^{\mathrm{T}}.\hfill \end{array} $$
(33)

Notice that the transmission polarization vector ξ can be generally parameterized as follow:

$$ \boldsymbol{\xi} = {\left[{\xi}_{\mathrm{h}}\ {\xi}_{\mathrm{v}}\right]}^{\mathrm{H}}\triangleq \left[ \cos \phi {\mathrm{e}}^{j{\theta}_{\mathrm{h}}}\kern0.5em \sin \phi {\mathrm{e}}^{j{\theta}_{\mathrm{v}}}\right], $$
(34)

where \( \left\{\phi, {\theta}_{\mathrm{h}},{\theta}_{\mathrm{v}}\right\}\in \left(-\pi, \pi \right] \). In a similar way, the reception polarization vector η can be taken into the following form:

$$ \boldsymbol{\eta} ={\left[{\eta}_{\mathrm{h}}\ {\eta}_{\mathrm{v}}\right]}^{\mathrm{H}}\triangleq \left[ \cos \varphi {\mathrm{e}}^{j{\vartheta}_{\mathrm{h}}}\kern0.5em \sin \varphi {\mathrm{e}}^{j{\vartheta}_{\mathrm{v}}}\right], $$
(35)

where \( \left\{\varphi, {\vartheta}_{\mathrm{h}},{\vartheta}_{\mathrm{v}}\right\}\in \left(-\pi, \pi \right] \).Moreover, we also introduce parameterized representation of the 4 × 1 complex vector p(ξ(i), η(i)) as follow:

$$ \mathbf{p}\left(\boldsymbol{\xi} (i),\boldsymbol{\eta} (i)\right)\triangleq \left[{p}_{i1}{\mathrm{e}}^{j{\beta}_{i1}},{p}_{i2}{\mathrm{e}}^{j{\beta}_{i2}},{p}_{i3}{\mathrm{e}}^{j{\beta}_{i3}},{p}_{i4}{\mathrm{e}}^{j{\beta}_{i4}}\right]. $$
(36)

where \( {p}_{i1}^2+{p}_{i2}^2+{p}_{i3}^2+{p}_{i4}^2=1 \) and {β i,1, β i,2, β i,3, β i,4}  (−π, π].

Hence, by introducing (34), (35), and (36) into (33), we obtain the following equations:

$$ \left\{\begin{array}{l}{\left({\xi}_{\mathrm{h}}^{(i)}\right)}^{*}{\eta}_{\mathrm{h}}^{(i)}= \cos \phi \cos \varphi {\mathrm{e}}^{j\left({\vartheta}_{\mathrm{h}}-{\theta}_{\mathrm{h}}\right)}={p}_{i1}{\mathrm{e}}^{j{\beta}_{i1}}\\ {}{\left({\xi}_v^{(i)}\right)}^{*}{\eta}_v^{(i)}= \sin \phi \sin \varphi {\mathrm{e}}^{j\left({\vartheta}_v-{\theta}_v\right)}={p}_{i2}{\mathrm{e}}^{j{\beta}_{i2}}\\ {}{\left({\xi}_{\mathrm{h}}^{(i)}\right)}^{*}{\eta}_v^{(i)}= \cos \phi \sin \varphi {\mathrm{e}}^{j\left({\vartheta}_v-{\theta}_{\mathrm{h}}\right)}={p}_{i3}{\mathrm{e}}^{j{\beta}_{i3}}\\ {}{\left({\xi}_v^{(i)}\right)}^{*}{\eta}_{\mathrm{h}}^{(i)}= \sin \phi \cos \varphi {\mathrm{e}}^{j\left({\vartheta}_{\mathrm{h}}-{\theta}_v\right)}={p}_{i4}{\mathrm{e}}^{j{\beta}_{i4}}\end{array}\right.. $$
(37)

Furthermore, (37) can be recast into two equation systems, namely

$$ \left\{\begin{array}{l} \cos \phi \cos \varphi ={p}_{i1}\kern1.25em \\ {} \sin \phi \sin \varphi ={p}_{i2}\kern1em \\ {} \cos \phi \sin \varphi ={p}_{i3}\kern0.75em \\ {} \sin \phi \cos \varphi ={p}_{i4}\end{array}\right., $$
(38)

and

$$ \left\{\begin{array}{l}{\vartheta}_{\mathrm{h}}-{\theta}_{\mathrm{h}}={\beta}_{i1}\\ {}{\vartheta}_{\mathrm{v}}-{\theta}_{\mathrm{v}}={\beta}_{i2}\\ {}{\vartheta}_{\mathrm{v}}-{\theta}_{\mathrm{h}}={\beta}_{i3}\\ {}{\vartheta}_{\mathrm{h}}-{\theta}_{\mathrm{v}}={\beta}_{i4}\end{array}\right.. $$
(39)

As to (38), after some algebraic manipulations and inverse trigonometrical transform, we can transform (38) into the following equations:

$$ \left\{\begin{array}{l}\phi -\varphi = \arccos \left({p}_{i1}+{p}_{i2}\right)\\ {}\phi +\varphi = \arccos \left({p}_{i1}-{p}_{i2}\right)\\ {}\phi +\varphi = \arcsin \left({p}_{i3}+{p}_{i4}\right)\\ {}\phi -\varphi = \arcsin \left({p}_{i4}-{p}_{i3}\right)\end{array}\right.. $$
(40)

We can see that (40) is an over-determined equation system. By employing the least-square method, we can search for the “approximately” solution of ϕ and φ as

$$ \left\{\begin{array}{l}\phi =0.25\times \left( \arccos \left({p}_{i1}+{p}_{i2}\right)+ \arccos \left({p}_{i1}-{p}_{i2}\right)+\left. \arcsin \left({p}_{i4}-{p}_{i3}\right)+ \arcsin \left({p}_{i3}+{p}_{i4}\right)\right)\right.\\ {}\varphi =0.25\times \left( \arccos \left({p}_{i1}-{p}_{i2}\right)- \arccos \left({p}_{i1}+{p}_{i2}\right)+\left. \arcsin \left({p}_{i3}+{p}_{i4}\right)- \arcsin \left({p}_{i4}-{p}_{i3}\right)\right)\right.\end{array}\right.. $$
(41)

As to (39), after some algebraic manipulations, we obtain the following equation system:

$$ \left\{\begin{array}{l}{\theta}_{\mathrm{h}}-{\theta}_{\mathrm{v}}={\beta}_{i4}-{\beta}_{i1}\\ {}{\theta}_{\mathrm{h}}-{\theta}_{\mathrm{v}}={\beta}_{i2}-{\beta}_{i3}\\ {}{\vartheta}_{\mathrm{h}}-{\vartheta}_{\mathrm{v}}={\beta}_{i1}-{\beta}_{i3}\\ {}{\vartheta}_{\mathrm{h}}-{\vartheta}_{\mathrm{v}}={\beta}_{i4}-{\beta}_{i2}\end{array}\right.. $$
(42)

Since β i1, β i2, β i3, and β i4 can be, respectively, an arbitrary value within (−π, π], the first equation may conflict with the second equation when β i4 − β i1 ≠ β i2 − β i3and the third one may contradict the fourth one when β i1 − β i3 ≠ β i4 − β i2.

As a consequence, the transmission and reception polarization vectors cannot be always effectively solved based on A. The proof is concluded.

RemarksLemma 2.1 of [10] provides a three-parameter representation of the signal polarization, which has a different form from our so-calledtwo-parameter “general” formulation (34). In the appendix, we do not implement such three-parameter representation since the proof will become more cumbersome and prolix by employing the new one. But the study shows (not shown here) that the question of conflict arises again with the use of the three-parameter representation.

Appendix 2: Proof of Property 1

Let us first introduce the following new variables for (11)

$$ \left\{\begin{array}{l}\ \boldsymbol{B}\kern0.5em = {\sigma}_{\nu}^{-1}\mathbf{A}{\boldsymbol{C}}_{\mathrm{c}}^{1/2}\\ {}{\boldsymbol{C}}_{\mathrm{t},1}^{-1}={\boldsymbol{C}}_{\mathrm{c}}^{1/2}{\boldsymbol{C}}_{\mathrm{t}}^{-1} {\boldsymbol{C}}_{\mathrm{c}}^{1/2}\\ {}{\boldsymbol{D}}_1^{-1}={\boldsymbol{C}}_{\mathrm{c}}^{1/2}{\boldsymbol{D}}^{-1}{\boldsymbol{C}}_{\mathrm{c}}^{1/2} \end{array}\right.. $$
(43)

Then (11) can be written into

$$ {\boldsymbol{D}}_1^{-1}={\boldsymbol{C}}_{\mathrm{t},1}^{-1}+{\boldsymbol{B}}^{\mathrm{H}}{\left(\boldsymbol{B}{\boldsymbol{B}}^{\mathrm{H}}+{\mathbf{I}}_N\right)}^{-1}\boldsymbol{B}. $$
(44)

Using the matrix inversion lemma, we obtain

$$ {\left({\mathbf{I}}_4+{\boldsymbol{B}}^{\mathrm{H}}\boldsymbol{B}\right)}^{-1}={\mathbf{I}}_4-{\boldsymbol{B}}^{\mathrm{H}}{\left({\mathbf{I}}_N+\boldsymbol{B}{\boldsymbol{B}}^{\mathrm{H}}\right)}^{-1}\boldsymbol{B}. $$
(45)

Therefore, we have

$$ {\boldsymbol{D}}_1^{-1}={\boldsymbol{C}}_{\mathrm{t},1}^{-1}+{\mathbf{I}}_4-{\left({\mathbf{I}}_4+{\boldsymbol{B}}^{\mathrm{H}}\boldsymbol{B}\right)}^{-1}, $$
(46)

Meanwhile, substituting (14) and (15) into the second term of (8), the following equation is obtained

$$ \mathbf{A}=\mathbf{U}\boldsymbol{M}, $$
(47)

where

$$ \mathbf{U}=\left[\begin{array}{l}{\mathrm{e}}^{j\left({\varphi}_1-{\phi}_1\right)}\ 0\kern0.75em 0 \dots 0\\ {}0\kern0.75em {\mathrm{e}}^{j\left({\varphi}_2-{\phi}_2\right)}\ 0 \dots 0\\ {}\vdots \kern2.5em \ddots \kern2.25em \vdots \\ {}0\kern2.25em \cdots \kern0.75em {\mathrm{e}}^{j\left({\varphi}_m-{\phi}_m\right)}\end{array}\right], $$
(48)

and

$$ \boldsymbol{M}=\left[\begin{array}{l}{\zeta}_{\mathrm{h}}^{(1)}{\iota}_{\mathrm{h}}^{(1)}, {\zeta}_{\mathrm{h}}^{(1)}{\iota}_{\mathrm{v}}^{(1)}, {\zeta}_{\mathrm{v}}^{(1)}{\iota}_{\mathrm{h}}^{(1)}, {\zeta}_{\mathrm{v}}^{(1)}{\iota}_{\mathrm{v}}^{(1)}\\ {}\kern5em \vdots \\ {}{\zeta}_{\mathrm{h}}^{(N)}{\iota}_{\mathrm{h}}^{(N)}, {\zeta}_{\mathrm{h}}^{(N)}{\iota}_{\mathrm{v}}^{(N)}, {\zeta}_{\mathrm{v}}^{(N)}{\iota}_{\mathrm{h}}^{(N)}, {\zeta}_{\mathrm{v}}^{(N)}{\iota}_{\mathrm{v}}^{(N)}\end{array}\right]. $$
(49)

U is a unitary matrix, and U H U = UU H = I N . Thus, the following equation holds

$$ \begin{array}{l}{\boldsymbol{B}}^{\mathrm{H}}\boldsymbol{B}={\sigma}_{\nu}^{-1}{\left({\boldsymbol{C}}_{\mathrm{c}}^{1/2}\right)}^{\mathrm{H}}{\mathbf{A}}^{\mathrm{H}}\cdotp {\sigma}_{\nu}^{-1}\mathbf{A}{\boldsymbol{C}}_{\mathrm{c}}^{1/2}\\ {}={\sigma}_{\nu}^{-2}{\left({\boldsymbol{C}}_{\mathrm{c}}^{1/2}\right)}^{\mathrm{H}}{\boldsymbol{M}}^{\mathrm{H}}{\mathbf{U}}^{\mathrm{H}}\mathbf{U}\boldsymbol{M}{\boldsymbol{C}}_{\mathrm{c}}^{1/2}\\ {}={\sigma}_{\nu}^{-2}{\left({\boldsymbol{C}}_{\mathrm{c}}^{1/2}\right)}^{\mathrm{H}}{\boldsymbol{M}}^{\mathrm{H}}\boldsymbol{M}{\boldsymbol{C}}_{\mathrm{c}}^{1/2}\end{array} $$
(50)

Noticing that

$$ \mathrm{T}\mathrm{r}\left(\boldsymbol{D}\right)=\mathrm{T}\mathrm{r}\left({\boldsymbol{C}}_{\mathrm{c}}^{1/2}{\boldsymbol{D}}_1{\boldsymbol{C}}_{\mathrm{c}}^{1/2}\right)=\mathrm{T}\mathrm{r}\left({\boldsymbol{C}}_{\mathrm{c}}{\boldsymbol{D}}_1\right), $$
(51)

substituting (44) and (50) into (51), the trace of D can be written as

$$ \begin{array}{l}\mathrm{T}\mathrm{r}\left(\boldsymbol{D}\right)=\mathrm{T}\mathrm{r} \left({\boldsymbol{C}}_{\mathrm{c}}{\boldsymbol{D}}_1\right)\\ {}=\mathrm{T}\mathrm{r}\left({\boldsymbol{C}}_{\mathrm{c}}{\left({\boldsymbol{C}}_{\mathrm{t},1}^{-1}+ {\mathbf{I}}_4-{\left({\mathbf{I}}_4+{\sigma}_{\nu}^{-2}{\left({\boldsymbol{C}}_{\mathrm{c}}^{1/2}\right)}^{\mathrm{H}} {\boldsymbol{M}}^{\mathrm{H}}\boldsymbol{M}{\boldsymbol{C}}_{\mathrm{c}}^{1/2}\right)}^{-1}\right)}^{-1}\right).\end{array} $$
(52)

Therefore, the initial signal phases φ and ϕ do not affect the value of D. This concludes the proof.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cheng, X., Shi, L., Chang, Y. et al. Target scattering estimation in clutter with polarization optimization. EURASIP J. Adv. Signal Process. 2015, 105 (2015). https://doi.org/10.1186/s13634-015-0286-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13634-015-0286-y

Keywords