- Research
- Open access
- Published:
To the theory of adaptive signal processing in systems with centrally symmetric receive channels
EURASIP Journal on Advances in Signal Processing volume 2016, Article number: 33 (2016)
Abstract
This paper presents the analytical derivation of joint probability density functions (pdfs) of the maximum likelihood (ML) estimates of a real and complex persymmetric correlation matrices (PCM) of multivariate Gaussian processes. It is oriented at the modifications of the classical Wishart’s–Goodman’s pdfs adapted to the ML estimates of the data CMs in a wide class of signal processing (SP) problems in systems with centrally symmetric (CS) receive channels. The importance of the derived modified pdfs for such CS systems could be as great as that of the classical Wishart’s–Goodman’s pdfs for systems with arbitrary receive channels. Some properties of the new obtained joint pdfs are featured.
1 Introduction
The multivariate statistical analysis of random processes widely uses the Wishart distribution, which describes statistical properties of a maximum likelihood (ML) estimate of the real-valued positively definite correlation matrix (CM) of multivariate Gaussian processes/fields [1–5]. For the ML estimate of the complex-valued Hermitian positively definite general-type data CM, such distribution is derived by Goodman [6–10]. An importance of both these distributions is caused by the fact that the ML CM estimates are widely used in many signal processing contexts: for radar applications [9, 11–17], for “superresolving” direction of arrival (DOA) estimation [7, 12, 18, 19], for multichannel communication systems [10], feature enhanced radar imaging with synthetic aperture radar (SAR) sensors [20, 21], digital beamforming in adaptive array (AA) systems, and fractional SAR modalities [22–25].
The Wishart–Goodman distributions are true for a general type data CM (GCM) of Gaussian processes/fields. These distributions in their classical forms do not take into account CM structure practically possible specificity caused by the peculiarities of signal processing (SP) system. At the same time, prior knowledge about the CM structure specificity could considerably enhance the processing efficiency and analysis precision due to a considerable decrease in the dimensionality of the parameter vectors involved into the adaptation process [13, 26].
The well-known example of such specificity is a persymmetry, i.e., symmetry relative to the secondary diagonal, of symmetric (real-valued) and Hermitian (complex-valued) CMs. Persymmetric CM (PCM) coincides with a result of itself turn relative to the secondary diagonal and thus is completely determined by a set of parameters (CM elements) which quantity is approximately twice less than that for respective general-type CM.
In multichannel (in space or time) signal processing systems, the CM persymmetry could be caused, in particular, by central symmetry (CS) of pairwise-identical receive channels arrangement. Such CS is peculiar to numerous different-purpose space-time signal processing systems (see, for instance, references 1–7 in [27]).
The ML for Hermitian PCM was derived for the first time by Nitzberg in his paper [28]. From the moment of this paper publication, numerous works have been performed to investigate an efficiency of this estimate use in CS systems of space-time adaptive signal processing [29–38]. Extensive supplementary list of relevant references is given in [16, 27].
In spite of a diversity of all these works, one common feature is inherent to all of them. Each such work sets an objective to determine statistical characteristics of one or another of function of ML estimate of relevant persymmetric correlation matrix (being usually the Hermitian one) since this function serves as a criterion of respective SP system efficiency. Thus, in [38], such function is “the persymmetric multiband generalized ratio algorithm (PGLR)” and the probability of false alarm and the probability of detection are obtained for this function. Works [28–35] investigate the relative losses, introduced in [9], in signal-to-(interference + noise) ratio (SINR) at the output of CS adaptive detector. These losses are taken relatively to those at the output of optimal detector, and probability density functions (pdfs), mean values, and variances are derived for them. Other criteria and their statistical characteristics under CM persymmetry are considered in [37].
The objectives been set are usually attained by the methods, which have specific distinctions from those been used for an analysis of efficiency of adaptive processing based on ML estimates of GCM. These distinctions are caused by the fact that, under generally concerned conditions, the PCM’s initial ML estimate [28] as well as its variants [30, 36, 39] are the sum of two summands. Both these summands have the Wishart (or Wishart–Goodman) distribution with the same matrix of parameters which, however, are not independent [34]. This fact forbids to use directly the methodology, developed in [7, 9, 10], in order to find the statistical characteristics of functions of GCM ML estimates by using Wishart’s–Goodman’s probability density function (pdf).
For PCM, it is possible to proceed to this methodology after a number of preliminary mathematical transformations [35, 38], which overcome the abovementioned mutual dependence of summands in PCM ML estimate.
Having analyzed these transformations, the author noticed that they solve the problems formulated in [35, 38] as well as “suggest” a way to derive directly a pdf of РCM ML estimates. Such problem, which has not been set and solved in these works, seemed interesting from theoretical and practical considerations. This has stimulated such problem formulation and solution. It was expected that the importance of these distributions for CS systems with PCM of Gaussian inputs should be as great as that of the Wishart’s–Goodman’s distributions for systems with arbitrary characteristics of receive channels.
The goal of this paper is twofold: (i) to derive closed form analytical expressions for the pdfs of the ML estimates of persymmetric real and complex CMs of Gaussian processes/fields of various natures and (ii) to feature their usefulness in statistical data characterization and operational performance analysis in applications to SP systems that possess a space-time receive channel СS property.
The rest of the paper is organized as follows. Section 2 reviews the persymmetric CMs models. In Sections 3 and 4, we derive the distributions of ML estimates of real and complex persymmetric CMs, respectively, in a closed analytical form. Section 5 exemplifies the usage of the derived distributions in some characteristic applications related to multichannel adaptive SP problems. Conclusion in Section 6 resumes the study.
2 Overview of properties of persymmetric correlation matrices
А. The real M × M matrix \( \mathbf{R}={\left[{r}_{i\ell}\right]}_{i,\ell =1}^M \) is persymmetric if it coincides with the matrix obtained after rotation of R with respect to the secondary diagonal, i.e., when the property
for a real matrix (i.e., matrix composed of real-valued entries) holds.
If the correlation (symmetric) matrix plays a role of R, then the additional equations are true
Hereinafter, superscript “T” denotes a vector/matrix transposition;
is the v × v orthogonal symmetric permutation matrix with unit entries on its secondary diagonal, and e i represents the i − th (i ∈ 1, ν) column of the v × v unity matrix I v .
For even M (M = 2 ⋅ L), matrix (2) allows the following block representation
where R 11 and R 12 represent the corresponding L × L blocks in (4).
Let us introduce the 2 ⋅ L × 2 ⋅ L = M × M matrix
that possesses the following properties (easily verifiable via simple algebraic manipulations)
Using (6), matrix R (4) can be transformed into the following block-diagonal form
with the determinant
that coincides with the determinant of the initial matrix
due to orthogonality (6) of matrix S L defined in (5).
B. The complex М × М matrix \( \mathbf{C}={\left[{c}_{i\ell}\right]}_{i,\ell =1}^M \) = C′ + j ⋅ C″ is persymmetric if the following equalities
hold.
If matrix C is associated with a correlation (Hermitian) matrix, the following additional equalities are true
Here, superscripts (~) and (*) define complex conjugation and Hermitian conjugation (complex conjugation and transposition), respectively.
Let us introduce the unitary М × М matrix [31, 35, 38]
that obviously satisfy the properties
Using (13), matrix (11) can be next transformed into the real symmetric М × М matrix
with the determinant
which coincides with that of the initial matrix
3 pdf of ML estimate of real persymmetric CM
А. Let M − variate random real (i.e., composed of real-valued entries) Gaussian (normal) vectors \( {\mathbf{y}}_i={\left[{y}_{\ell}^{(i)}\right]}_{\ell =1}^M \) of the K − variate sample \( \mathbf{Y}={\left[{\mathbf{y}}_i\right]}_{i=1}^K \) be mutually independent and have zero means and identical non-negative definite М × М CMs R, i.e.,
where δ(x) is the Kroneker symbol, and overbar defines the statistical averaging operator.
The joint pdf p(Y) of elements of sample Y in this case is given by [3, 5, 12]
where tr (Φ) is the trace (sum of diagonal elements) of a matrix Φ, and
represents the М × М sample (random) CM.
Under conditions (17) and (18), matrix \( \widehat{\mathbf{R}}={K}^{-1}\cdot {\mathbf{A}}_r \) represents an ML estimate of the real-valued general form CM R [3, 11–13], and matrix A r has the Wishart distribution \( {W}_M^{\left(\mathbf{R}\right)}\left({\mathbf{A}}_r,K,\mathbf{R}\right) \) with K − M degrees of freedom and parameter matrix R [2–5, 40]. Then,
where
and Γ(x) is the gamma function, which for an integer x = n ≥ 1 is equal to (n − 1) !.
Here, the distribution of a random matrix is specified via a joint distribution of random elements that compose such a matrix [3, 9]. Thus, (20) presents an “economical” definition of the pdf p (A r ) = p ( a iℓ ), i ∈ 1, M, ℓ ∈ i, M, as a function of M ⋅ (M + 1)/2 scalar variables, which are completely specified by the real-valued diagonal and above-diagonal elements of the symmetric matrix A defined by (19).
If CM R is persymmetric, then under the condition (17), its ML estimate may be written asFootnote 1
The problem at hand is to derive the closed-form expression for the pdf of that matrix A rp .
B. The matrix defined by (21) is a sum of two symmetric matrices each being a result of a permutation of another one with respect to its secondary diagonal; thus, it is also symmetric and persymmetric at the same time that follows directly from definition (2). For even M = 2 ⋅ L (at this stage, we restrict ourselves by that assumption for the sake of simplicity), matrix (21) is defined by L ⋅ (L + 1) random parameters—its elements a iℓ , i ∈ 1, L; ℓ ∈ i, M + 1 − i.
A comparison of (21) with (19) reveals that the first term of matrix A rp has Wishart distribution (20) with the parameter matrix R/2, and the second term has the same Wishart distribution under the conditions (2). Vectors Π M ⋅ y i , i ∈ 1, K of the “inverted” sample Π M ⋅ Y in (21), possess the same properties (17) as the initial vectors y i . If these terms are mutually independent, then their sum has the Wishart distribution analogous to (20) with 2 ⋅ K − M degrees of freedom and the parameter matrix R/2 [3–5]. However, for the terms of matrix A rp defined by (21), this condition is not valid; therefore, its distribution should be different [34].
C. In order to find the desired pdf, we next partition the initial 2 ⋅ L × K matrix sample \( \mathbf{Y}={\left[{\mathbf{y}}_i\right]}_{i=1}^K, \) \( {\mathbf{y}}_i={\left[{y}_{\ell}^{(i)}\;\right]}_{\ell =1}^{2L} \) into the L × K “upper” Y U and “lower” Y L blocks, so that
Let us introduce a linear transform of (22) performed with matrix S M defined by (5), i.e.,
which allows rewrite (21), taking into account (6), as follows
It is easy to deduce that taking into account the properties (5) of matrix J M , the addends embraced in the above formula have identical L × L diagonal blocks and opposite in signs L × L off-diagonal blocks. Therefore, one can rewrite
where the L × L diagonal blocks B ∑ and B Δ are expressed as follows:
Taking into account the interrelations (24), the problem at hand is transformed now into the problem of derivation of the pdf of the auxiliary matrix B V (24).
D. First, note that due to orthogonality of matrix S M , the Jacobian of the transform, \( \mathbf{Y}={\mathbf{S}}_M^T\cdot \mathbf{V} \), is equal to unity; hence, the pdf p(V) of the transformed sample V (23) under conditions (18) becomes
Using the property of the matrix product trace, tr(A ⋅ B) = tr(B ⋅ A), and taking into account (7)–(9), the latter formula can be rewritten as follows:
Next, taking into account the explained above properties (7) and (23)–(25), we obtain
that yield
where
The properties (17) and (4) of the CM blocks admit the following representations
and using the definitions (23), (22), and (7), it is easy to deduce that matrices R ∑ and Π L ⋅ R Δ ⋅ Π L , which specify the entries in the corresponding expressions (27), can be expressed as follows,
In doing so, it is easy to observe that K − variate “summ”, V ∑ and “difference”, V Δ; samples (23) of the random L = M/2 − variate vectors v ∑i and v Δi (i ∈ 1, K) have normal distributions (27), and matrices B ∑ and B Δ formed via (25) have Wishart distributions with K − M degrees of freedom and the parameter matrices R ∑ and Π L ⋅ R Δ ⋅ Π L , respectively, i.e.,
On the other hand, due to the mutual independence of samples V ∑ and V Δ that follow from (26), matrices B ∑ and B Δ defined by (25) are also mutually independent, and their joint density is, therefore, p(B ∑, B Δ) = p(B ∑) ⋅ p(B Δ). Multiplying these densities (29a) and (29b) and taking into account (26), (24), and (8), we obtain the density p (B V ) of matrix B V defined by (24),
Each of two symmetric L × L matrices B ∑ and B Δ in the arguments of p(B ∑, B Δ) are defined by L ⋅ (L + 1)/2 parameters, so the number of such parameters in matrix B V (24) is equal to L ⋅ (L + 1) that exactly coincides with the number of parameters that determine matrix A rp (21). Therefore, to obtain the desired pdf p (A rp ) using (30), it is enough to define the Jacobian of the transform (24) that relates B V and A rp .
Using (5) and (24), it is easy to deduce that
Then, the Jacobian matrix of the transform (24) can be written as \( {\mathbf{I}}_{L\cdot \left(L+1\right)/2}\otimes \left[\begin{array}{cc}\hfill 1\hfill & \hfill -1\hfill \\ {}\hfill 1\hfill & \hfill \kern0.75em 1\hfill \end{array}\right] \) where ⊗ defines the Kroneker product, and hence, that Jacobian is equal to 2L ⋅ (L + 1)/2.
Replacing in (30) matrix B V by its representation (24a) and taking into account (24b), (9), and (8), we obtain
The latter formula describes the desired pdf of the real symmetric and persymmetric random matrix A rp of the ML estimate \( {\widehat{\mathbf{R}}}_p \) (21) of the real and also persymmetric CM R of an even order M = 2 ⋅ L defined above in (2) and (17). This formula has the same form as the Wishart distribution (20) of the matrix A r (19) from the ML estimate \( \widehat{\mathbf{R}}={K}^{-1}\cdot {\mathbf{A}}_r \) of GCM. However, for formula (31), the reduced number of parameters that determine PCM has resulted in increased on L = M/2 number of degrees of freedom. That is why this formula could be considered as modified Wishart distribution of the ML estimate \( {\widehat{\mathbf{R}}}_p \) (21) of the real PCM R of an even order M = 2 ⋅ L.
4 Distribution density of ML estimate of complex persymmetric CM
A. Let the random complex normal M − variate vectors \( {\mathbf{y}}_i={\left[\;{y}_{\ell}^{(i)}\right]}_{\ell =1}^{\;M}={\mathbf{y}}_i^{\mathbf{\hbox{'}}}+j\cdot {\mathbf{y}}_i^{\mathbf{\hbox{'}\hbox{'}}} \) of the K − variate sample \( \mathbf{Y}={\left[{\mathbf{y}}_i\right]}_{i=1}^K \) be mutually independent and have zero means and identical non-negative definite complex Hermitian М × М CMs, \( \mathbf{C}={\left[{c}_{i\ell}\;\right]}_{i,\ell =1}^M= \) C ' + j ⋅ C '', i.e.,
The latter means [8, 12] that real \( {\mathbf{y}}_i^{\mathbf{\prime}} \) and imaginary \( {\mathbf{y}}_i^{\mathbf{{\prime\prime}}} \) parts of vectors y i (i ∈ 1, K) are zero means jointly normal real-valued vectors. Then, the 2 ⋅ M − variate vectors
are also mutually independent with zero means and identical 2 ⋅ М × 2 ⋅ М CMs
The joint distribution of the sample Y in that case is given by [8, 12]
where
is the М × М sample complex CM. Under conditions (32), the matrix Ĉ = K − 1 ⋅ A c in (34) specifies an ML estimate of the general type complex CM С [6–12], whereas matrix A c is characterized by the complex Wishart distribution, \( {W}_M^{\left(\mathbf{C}\right)}\;\left({\mathbf{A}}_c,K,\mathbf{C}\right) \), with K − M + 1 degrees of freedom and the parameter matrix C [6, 8, 9], i.e.,
Here, the pdf of the complex matrix C is treated as the joint distribution of its random real and imaginary parts [6–10]. Thus, (35a) specifies a non-negative function of M 2 parameters
Such parameters are completely defied by the real diagonal elements a ii (i ∈ 1, M) of the random Hermitian complex matrix A c (34) and M ⋅ (M − 1) real \( \left({a}_{i\ell}^{\hbox{'}}\right) \) and imaginary \( \left({a}_{i\ell}^{\hbox{'}\hbox{'}}\right) \) parts of its above-diagonal elements \( {a}_{i\ell }={a}_{i\ell}^{\hbox{'}}+j\cdot {a}_{i\ell}^{\hbox{'}\hbox{'}},\kern0.61em \left(i\in 1,\kern0.36em M-1;\kern0.36em \ell \in i+1,\kern0.24em M\right) \).
If CM C is persymmetric, then under the conditions (32), its ML estimate admits the following representation [28–31, 27–37]:
Thus, the problem at hand now is to find the distribution density of the matrix A cp in (36).
B. By construction, this matrix A cp is Hermitian and persymmetric as a sum of two Hermitian matrices each being a result of permutation of another one with respect to the secondary diagonal. Therefore, such A cp is completely specified by M ⋅ (M + 1)/2 real-valued scalar parameters, among which there are
the matrix imaginary parts \( \left({a}_{i\ell}^{\hbox{'}\hbox{'}}\right) \), and the rest M ⋅ (M + 1)/2 − z real parts \( \left({a}_{i\ell}^{\hbox{'}}\right) \) of the elements a iℓ , i ∈ 1, L; ℓ ∈ i, M + 1 − i that explicitly specify the whole matrix A cp . In (37), ε[x] represents the integer part of the embraced variable x.
From a comparison of (36) with (34), it follows that the first addend in matrix A cp is characterized by the distribution, \( {W}_M^{\left(\mathbf{C}\right)}\ \left({\mathbf{A}}_c,K,\mathbf{C}/2\right) \). The second addend has the same distribution as well, since under the condition (11), vectors \( {\boldsymbol{\Pi}}_M\cdot {\mathbf{y}}_i^{\sim}\kern0.5em \left(i\in 1,K\right) \) of the “reverse” and complex conjugate sample Π M ⋅ Y ~ possess the same properties as the initial vectors, y i . The samples Y and Π M ⋅ Y ~ are mutually uncorrelated, i.e., \( \overline{\mathbf{Y}\cdot {\left({\boldsymbol{\Pi}}_M\cdot {\mathbf{Y}}^{\sim}\right)}^{*}}=0 \) [8, 12]; however, they are not jointly normal [34]. Absence of mutual correlation does not mean mutual independence that does not allow to represent the joint pdf p (Y, Π M ⋅ Y ~) via the product p (Y) ⋅ p (Π M ⋅ Y ~). In addition, the distribution \( {W}_M^{\left(\mathbf{C}\right)}\ \left({\mathbf{A}}_c,2K,\mathbf{C}/2\right) \) specifies the density of the sum in (36) only under the conditions of mutual independence of the addends.
C. Let us consider the transform of the sample matrix Y performed by the unitary matrix T defined in (12), i.e.,
Using (13) and (38), matrix A cp defined by (36) admits the following representation
Obviously, the addends in braces in this equality are complex conjugate, thus
where
The representation (39) yields the following equalities
that reduce the problem at hand to deriving the pdf of the real symmetric matrix B V .
D. Note that due to (32a) and (32c), CMs of the “summary” v ∑i and the “difference” v Δi (i ∈ 1, K) vectors in samples V ∑ and V Δ (38b) are identical and equal to
whereas (as a consequence of unitary T, and the properties specified by (9) and (33)) the density p(V) of the transformed sample V (38a) becomes
where matrix C r has been defined in (14).
Taking into account a symmetry of matrices C r and C ∑, and the expressions (40), it is easy to verify the following equalities
Those (43) can be also re-expressed as
where
From (18), (20), and (40), it follows now that the pdf(s) of the matrices B ∑ and B Δ in (40) can be expressed as
and because of (45), these matrices B ∑ and B Δ are mutually independent.
The pdf of the sum (40) is therefore given by
This formula has been already derived in somewhat different way in [35, 38]. In these works, it has been used in accordance with methodology [7, 9, 10] in order to attain the objectives been set, which, however, did not include a derivation of the pdf of the complex Hermitian persymmetric М × М matrix A cp (36). At the same time, it is quite simple to proceed to this pdf form the pdf (47b) by determination of a Jacobian of the transform (41) which connects the matrices B V and A cp .
The symmetric М × М matrix B V is defined by M ⋅ (M + 1)/2 parameters, whose number exactly coincides with the number of parameters that specify the Hermitian persymmetric matrix A cp . For elements \( {a}_{i\ell }={a}_{i\ell}^{\hbox{'}}+j\cdot {a}_{i\ell}^{"} \) of that matrix, the following equalities
hold.
Next, taking into account (12), we can express the elements of matrix B V (41) as follows:
that allow to compute the Jacobian of the transform (41)

Replacing in (47b) matrix B V by its representation (41) and taking into account (48), (42), (16), and (15) yields
The latter formula completely defines the desired pdf of the complex Hermitian persymmetric М × М matrix A cp present in the ML estimate Ĉ p (36) of the Hermitian PCM C specified in (11) and (32). This formula has the same form as the Wishart’s–Goodman’s pdf (35) of the matrix A c (34) from the ML estimate Ĉ = K − 1 ⋅ A c of GCM. However, for formula (49), the reduced number of parameters that determine PCM has resulted in increased on (M − 1)/2 number of degrees of freedom. That is why this formula could be considered as modified Wishart’s–Goodman’s distribution of the ML estimate Ĉ p (36) of complex PCM C of an order M.
Note that in a particular case of M = 1, when \( \begin{array}{cc}\hfill L=1, \kern0.3em z=0,\hfill & \hfill \left|\;\mathbf{C}\right|={c}_{11}=\overline{{\left|{\mathbf{y}}_1^{\left(\;i\;\right)}\;\right|}^2}={\sigma}^2\hfill \end{array} \) and \( {\mathbf{A}}_{cp}={a}_{11}={\displaystyle \sum_{i=1}^K\;{\left|\;{\mathbf{y}}_1^{(i)}\;\right|}^2={\mathbf{A}}_c} \), formula (49) is transformed into
i.e., the pdf (49) turns into the Erlang distribution (50) with the shape and scale parameters, K and σ 2, respectively. Such pdf (50) characterizes, in an explicit statistical sense, the pdf of a sum of K squared magnitudes of independent complex normal random variables with zero means and equal variances σ 2 [39].
5 Exemplifying practical usage of the derived distributions
Distributions (31) and (49) of a persymmetric estimation CM (21) and (36) resemble the pdf(s) (20) and (35) of a general type CM estimates (19) and (34) but with an increased number of the degrees of freedom. In connection with this, the well-known properties of real [1–5] and complex [6–10] Wishart distributions (with relevant modifications) are transferred into the derived here distributions. Here beneath, we feature an importance of those distributions referring to some characteristic SP examples.
A. A non-degenerate transformation
of a 2L × 2L real persymmetric matrix A rp (21) distributed via (31) with the non-random symmetric and persymmetric 2 ⋅ L × 2 ⋅ L matrix U = U T = Π M ⋅ U ⋅ Π M gives rise to the random symmetric and persymmetric matrix B rp (51) with the same distribution but the transformed parametric matrixFootnote 2
Indeed, under the conditions (21), such matrix B rp (51) can be expressed as
where \( \mathbf{V}=\mathbf{U}\cdot \mathbf{Y}={\left\{{\mathbf{v}}_i\right\}}_{\ell =1}^K \) is the K − variate sample composed of 2 ⋅ L − variate random vectors
The pdf of matrix B p (51) (for a fixed given K ≥ L) can therefore be expressed as
This formula permits ease computing of the Jacobian \( \left|\;\frac{\partial \left({\mathbf{B}}_{rp}\right)}{\partial \left({\mathbf{A}}_{rp}\right)}\right|=\frac{p\;\left({\mathbf{A}}_{rp}\right)}{p\;\left({\mathbf{B}}_{rp}\right)} \) of the transform (51). Using (31) and taking into account (52), we obtain \( \left|\;\frac{\partial \left({\mathbf{B}}_{rp}\right)}{\partial \left({\mathbf{A}}_{rp}\right)}\right|={\left|\mathbf{U}\;\right|}^{L+1} \).
Similarly, the use of the complex matrix (36) (characterized by distribution (49)) in the transform B cp = U ⋅ A cp ⋅ U (for a Hermitian and persymmetric U) yields the same distribution p (B cp ) (49) for B cp but with the properly transformed parametric matrix G c = U ⋅ C ⋅ U. The Jacobian of the transform is given by \( \left|\;\frac{\partial \left({\mathbf{B}}_{cp}\right)}{\partial \left({\mathbf{A}}_{cp}\right)}\right|={\left|\mathbf{U}\;\right|}^{M+1} \).
Note that the computations of the corresponding Jacobians become intractable without knowledge of the analytically closed expressions (31) and (49) for the corresponding pdf(s).
B. In the example, let us compare mean values of relative bias
between random (estimate) spectral function (SF)
of Capon method [7, 18, 19] and true value of this SF
The latter is inversely proportional to a quadratic form of complex steering vector x(α) = x′(α) + j ⋅ x″(α) with matrix \( {\mathbf{C}}_p^{-1} \) being inverse to Hermitian PCM C p .
As an estimate of this matrix, we will use in (57a) following matrices
with defining matrices A c (34) and A cp (36) which obey pdfs (35) and (49), respectively. In the latter case, we will also assume that the steering vector x(α) obeys a condition
which is completely natural for CS receive channels.
As is shown in [7, 10], in the case (a)
where d is a random variable with the independent on α Erlang’s pdf
with the shape parameter δ g and the scale parameter equal to unity [39]. Its mean is \( \overline{d}={\delta}_g \). Therefore, by virtue of (60), \( \overline{\widehat{\nu}\left(\alpha \right)}={K}^{-1}\cdot \overline{d}=1-\left(M-1\right)/K \), so that
In the case (b), let us use the representation (39) for the matrix A cp . Then, taking into account the properties (13) of the matrix T (12), we will obtain for the SF ŝ(α) (57a) the following
where B V is the matrix (40) with the pdf (47). Under conditions (59), the vector
and the latter SF is transformed to the formula
which has a quadratic form of real-valued vector x Δ(α) with real-valued symmetric matrix B V , with pdf (47), in denominator.
Next, using the methodology [7, 9, 10], it is possible to demonstrate that, under considered conditions, quantity \( \widehat{\nu}\left(\alpha \right) \) (56) obeys following equality similar to (60)
where d 1 is a random variable with the independent on α pdf
For odd M, this distribution become the Erlang’s one [39] with the shape parameter δ p and the scale parameter equal to unity. That is why mean value \( {\overline{d}}_1={\delta}_p \), and, by virtue of (60) and (56),
Thereby, under considered conditions, for the same training sample size K, the estimate (58b) reduces as great as twice the bias (56) of SF estimate (57) in contrast to the estimate (58a). This is equivalent to the statement that equal values of bias (56) are provided by the estimate (58b) at the twice smaller training sample size K.
C. Note, that, under conditions (59), the quadratic form in (57a) could be computed in more simple way by using the estimate (58b) instead of (58a). This is easy to follow by rewriting it as
Matrix \( {\mathbf{A}}_{cp}^{-1} \), being inverse to Hermitian persymmetric matrix, is also Hermitian persymmetric one, i.e., \( {\mathbf{A}}_{cp}^{-1}={\boldsymbol{\Pi}}_M\cdot {\left({\mathbf{A}}_{cp}^{\sim}\right)}^{-1}\cdot {\boldsymbol{\Pi}}_M \). That is why, under conditions (59), vector \( {\mathbf{w}}_p\left(\alpha \right)=c\cdot {\boldsymbol{\Pi}}_M\cdot {\mathbf{w}}_p^{\sim}\left(\alpha \right) \) and thus it is completely determined by the half (M/2) of its components which could be computed at the expense of ≈ M 2/2 multiplications. Approximately M/2 multiplications are enough in order to compute the scalar product q p (α). Thus, an amount of computations necessary to calculate the quadratic form in (57a) based on (58b) could be twice less than based on (58a).
D. Similar gains in the amount of computations and efficiency (including an efficiency in terms of criteria different from (56)) could be achieved in the systems with CS receive channels also by using other kinds of PCM ML estimates (considered, particularly, in [30–32, 35, 36, 37, 38–19]).
6 Conclusions
The main result of this work is the derivation of the pdfs (37) and (49) of random M × M real-valued and complex-valued ML estimates (21) and (36) of persymmetric CMs (2) and (11) of multivariate Gaussian processes and fields. Such CMs arise in numerous practical applications, particularly, in the tasks of space-time adaptive signal processing in systems with central symmetry of receive channels [20–32, 27–37, 38, 16, 17]. The derived pdfs have the same form as the classical Wishart’s pdf and Wishart’s–Goodman’s pdf (20) and (35) for GCM ML estimates, but with the number of degrees of freedom being increased approximately on M/2. This is due to the approximately half less number of determining parameters (elements) of the PCM. That is why, in the systems with the CS receive channels, the derived distributions appeal to such importance as that of the classical Wishart and Wishart’s–Goodman’s pdfs in systems with arbitrary receive channels.
Reduced dimensionality of parameters vector for PCM in CS systems increases an efficiency of adaptive signal processing based on ML estimates (21) and (36) as compared with ML estimates (19) and (34). The gain depends on the efficiency criterion and additional conditions which take into account the task specificity. Nevertheless, in the above-considered example as well as in numerous other practically important cases, the adaptive processing based on estimates (21) and (36) under central symmetry of space-time receive channels could provide for some conditions at the expense of K p training samples, the efficiency (in terms of different criteria) being close to that provided by the estimates (19) and (34) at the expense of K ≈ 2 ⋅ K p training samples [34, 35, 36, 37].
It is essential that, taking into account the structure specificity of utilized matrices and vectors, it is possible to make signal processing in CS systems based on ML estimates (21) and (36) as well as their variants [30–32, 36, 37] less computationally consuming.
General considerations allow to assume a possibility to obtain similar gain not only in case of Gaussian stochastic processes but also in case of other stochastic processes. The latter, in particular, could be subject to exponential distribution or Weibull one. However, a rigorous proof of this hypothesis is unavailable to the author, since for such non-Gaussian processes, there are currently unknown neither their correlation matrices’ ML estimates analogous to (19) and (34) nor these estimates’ pdfs analogous to Wishart’s (20) pdf and Wishart’s–Goodman’s (35) one.
Finally, note that statistical characteristics of different functions of ML estimates of Gaussian processes’ PCM could be obtained without explicit use of their pdfs (31) and (49) what was illustrated, in particular, in [35, 36, 37, 38, 16, 17]. Nevertheless, the pdfs (31) and (49) are also useful in such case because they allow to perform only one step of transformations similar to (9) and (39) and then to use the methodology [7, 9, 10] in order to find statistical characteristics of functions of GCM ML estimates. Such approach seems to be the easiest one and methodologically reasonable. Namely, this approach is used by the author for comparative analysis of a number of “superresolving” spectral estimation methods in systems with CS receive channels. The results of this analysis are planned to be discussed in a special paper.
In closing, the author would like to thank all five reviewers for the time and effort with this paper.
Abbreviations
- AA:
-
antenna array
- CM:
-
correlation matrix
- CS:
-
centrally symmetric
- GCM:
-
general type correlation matrix
- ML:
-
maximum likelihood
- PCM:
-
persymmetric correlation matrices
- pdf:
-
probability density function
- SAR:
-
synthetic aperture radar
- SF:
-
spectral function
- SINR:
-
signal-to-(interference + noise) ratio
- SP:
-
signal processing
References
J Wishart, The generalised product moment distribution in samples from a normal multivariate population. Biometrika 20A(1–2), 32–52 (1928). doi:10.1093/biomet/20A.1-2.32
M Hazewinkel (ed.), Encyclopaedia of Mathematics, Vol. 9 (Sto–Zyg) (Kluwer, Dordrecht, 1993)
TW Anderson, An introduction to multivariate statistical analysis, 3rd edn. (John Wiley & Sons, Hoboken NJ, 2003)
H Cramér, Mathematical methods of statistics (Princeton University Press, Princeton, N.J., 1946)
MH DeGroot, Optimal statistical decisions (McGraw-Hill, New York, 1969)
NR Goodman, Statistical analysis based on a certain multivariate complex Gaussian distribution (an introduction). The Annals of Mathematical Statistics 34(1), 152–177 (1963). doi:10.1214/aoms/1177704250
J Capon, NR Goodman, Probability distributions for estimators of the frequency-wavenumber spectrum. IEEE Proceedings 58(10), 1785–1786 (1970). doi:10.1109/PROC.1970.8014
DR Brillinger, Time series: data analysis and theory (Holden Day, Inc., San Francisco, 1981)
IS Reed, JD Mallett, LE Brennan, Rapid convergence rate in adaptive arrays. IEEE Trans. Aerosp. Electron. Syst. AES-10(6), 853–863 (1974). doi:10.1109/TAES.1974.307893
LE Brennan, IS Reed, An adaptive array signal processing algorithm for communications. IEEE Trans. Aerosp. Electron. Syst. AES-18(1), 124–130 (1982). doi:10.1109/TAES.1982.309212
YD Shirman, VN Manzhos, Theory and techniques of radar information processing in a background of interference (Radio i Sviaz, Moscow, 1981) (in Russian)
RA Monzingo, TW Miller, Introduction to adaptive arrays (John Wiley & Sons, New York, 1980)
VG Repin, GP Tartakovskiy, Statistical synthesis under prior uncertainty and adaptation of information systems (Soviet Radio, Moscow, 1977) (in Russian)
H Wang, L Cai, On adaptive multiband signal detection with the SMI algorithm. IEEE Trans. Aerosp. Electron. Syst. 26(5), 768–773 (1990). doi:10.1109/7.102712
H Wang, L Cai, On adaptive multiband signal detection with the GLR algorithm. IEEE Trans. Aerosp. Electron. Syst. 27(2), 225–233 (1991). doi:10.1109/7.78296
C Hao, S Gazor, G Foglia, B Liu, C Hou, Persymmetric adaptive detection and range estimation of a small target. IEEE Trans. Aerosp. Electron. Syst. 51(4), 2590–2604 (2015). doi:10.1109/TAES.2015.140517
J Liu, G Cui, H Li, B Himed, On the performance of a persymmetric adaptive matched filter. IEEE Trans. Aerosp. Electron. Syst. 51(4), 2605–2614 (2015). doi:10.1109/TAES.2015.140633
DI Lekhovytskiy, PM Flekser, Statistical analysis of quasi-harmonic spectral estimation resolution with Capon’s method, Proceedings of International Scientific and Technical Conference “Modern Radiolocation”, Kyiv, Ukraine, vol. 1, 1994, pp. 66–71. in Russian
DI Lekhovytskiy, YS Shifrin, Statistical analysis of “superresolving” methods for direction-of-arrival estimation of noise radiation sources under finite size of training sample. Signal Process. 93(12), 3382–3399 (2013). doi:10.1016/j.sigpro.2013.03.008
YV Shkvarko, Unifying regularization and Bayesian estimation methods for enhanced imaging with remotely sensed data—part I: theory. IEEE Trans. Geosci. Remote Sens. 42(5), 923–931 (2004). doi:10.1109/TGRS.2003.823281
YV Shkvarko, Unifying regularization and Bayesian estimation methods for enhanced imaging with remotely sensed data—part II: implementation and performance issues. IEEE Trans. Geosci. Remote Sens. 42(5), 932–940 (2004). doi:10.1109/TGRS.2003.823279
YV Shkvarko, Unifying experiment design and convex regularization techniques for enhanced imaging with uncertain remote sensing data—part I: theory. IEEE Trans. Geosci. Remote Sens. 48(1), 82–95 (2010). doi:10.1109/TGRS.2009.2027695
YV Shkvarko, Unifying experiment design and convex regularization techniques for enhanced imaging with uncertain remote sensing data—part II: adaptive implementation and performance issues. IEEE Trans. Geosci. Remote Sens. 48(1), 96–111 (2010). doi:10.1109/TGRS.2009.2027696
YV Shkvarko, J Tuxpan, SR Santos, ℓ2–ℓ1 structured descriptive experiment design regularization based enhancement of fractional SAR imagery. Signal Process. 93(12), 3553–3566 (2013). doi:10.1016/j.sigpro.2013.03.024
YV Shkvarko, J Tuxpan, SR Santos, Dynamic experiment design regularization approach to adaptive imaging with array radar/SAR systems. Sensors 11(5), 4483–4511 (2011). doi:10.3390/s110504483
RL Stratonovich, Principles of adaptive reception (Soviet Radio, Moscow, 1973) (in Russian)
A De Maio, D Orlando, An invariant approach to adaptive radar detection under covariance persymmetry. IEEE Trans. Signal Process. 63(5), 1297–1309 (2015). doi:10.1109/TSP.2014.2388441
R Nitzberg, Application of maximum likelihood estimation of persymmetric covariance matrices to adaptive processing. IEEE Trans. Aerosp. Electron. Syst. AES-16(1), 124–127 (1980). doi:10.1109/TAES.1980.308887
EI Lifanov, VA Likharev, Estimate of the covariation matrix of stationary interference. Telecommunications and Radio Engineering 37–38(5), 103–105 (1983)
VI Zarytskyi, VN Kokin, DI Lekhovytskiy, VV Salamatin, Recurrent adaptive processing algorithms under condition of central symmetry of space-time reception channels. Radiophysics and Quantum Electronics 28(7), 592–598 (1985). doi:10.1007/BF01034102
VA Averochkin, PE Baranov, VS Tokolov, Effectiveness of adaptive filters with real weight coefficients. Radioelectron. Commun. Syst. 30(4), 80–82 (1987)
VM Koshevyy, VV Radionov, Evaluation of limiting rate of convergence of algorithms for signal adaptive processing for a class of structures of noise correlation matrices. Telecommunications and Radio Engineering 45(6), 36–41 (1991)
MB Sverdlik, VE Shpatakovskiy, Estimation of the potential convergence rate of the adaptive signal processing in equispace receive channels. Radiotekhnika i Elektronika 34(4), 760–766 (1989) (in Russian)
YI Abramovich, AY Gorokhov, On an estimation of the convergence rate of adaptive interference compensation filters with a persymmetrical correlation matrix. J. Commun. Technol. Electron. 38(7), 9–21 (1993)
VM Koshevyy, VV Radionov, Effectiveness of adaptive filters with central symmetry of receiving channels. Radiotekhnika i Elektronika 39(11), 1779–1788 (1994) (in Russian)
SA Vorobyov, Reduced complexity blind unitary prewhitening with application to blind source separation. Paper presented at the 2005 1st IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing, Puerto Vallarta, Mexico, 13–15 December, 2005. doi:10.1109/CAMAP.2005.1574214
DI Lekhovytskiy, DV Atamanskiy, IG Kirillov, VI Zarytskyi, Comparison of adaptive processing efficiencies for arbitrary and centrosymmetrical arrays. Antennas 44(1), 99–103 (2000) (in Russian)
L Cai, H Wang, A persymmetric multiband GLR algorithm. IEEE Trans. Aerosp. Electron. Syst. 28(3), 806–816 (1992). doi:10.1109/7.256301
NAJ Hastings, JB Peacock, Statistical distributions: a handbook for students and practitioners (Butterworths, London, 1975)
VL Girko, Multivariate statistical analysis (Vyshcha Shkola, Kyiv, 1988) (in Russian)
Acknowledgements
The author is very grateful to Yu. I. Abramovich and V. M. Koshevyy for the helpful discussions and insightful comments that helped to improve the paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The author declares that he has no competing interests.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Lekhovytskiy, D.I. To the theory of adaptive signal processing in systems with centrally symmetric receive channels. EURASIP J. Adv. Signal Process. 2016, 33 (2016). https://doi.org/10.1186/s13634-016-0329-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13634-016-0329-z