Limiting spectral distribution of the sample covariance matrix of the windowed array data

Yazdian, Ehsan; Gazor, Saeed; Bastani, Mohammad Hasan

doi:10.1186/1687-6180-2013-42

Research
Open access
Published: 06 March 2013

Limiting spectral distribution of the sample covariance matrix of the windowed array data

Ehsan Yazdian¹,
Saeed Gazor² &
Mohammad Hasan Bastani³

EURASIP Journal on Advances in Signal Processing volume 2013, Article number: 42 (2013) Cite this article

3815 Accesses
2 Citations
Metrics details

Abstract

In this article, we investigate the limiting spectral distribution of the sample covariance matrix (SCM) of weighted/windowed complex data. We use recent advances in random matrix theory and describe the distribution of eigenvalues of the doubly correlated Wishart matrices. We obtain an approximation for the spectral distribution of the SCM obtained from windowed data. We also determine a condition on the coefficients of the window, under which the fragmentation of the support of noise eigenvalues can be avoided, in the noise-only data case. For the commonly used exponential window, we derive an explicit expression for the l.s.d of the noise-only data. In addition, we present a method to identify the support of eigenvalues in the general case of signal-plus-noise. Simulations are performed to support our theoretical claims. The results of this article can be directly employed in many applications working with windowed array data such as source enumeration and subspace tracking algorithms.

Introduction

The distribution of the eigenvalues of the sample covariance matrix (SCM) of data has important impact on the performance of signal processing algorithms. Over the last decade, the properties of complex Wishart matrices are used in the analysis and design of many signal processing algorithms such as in array processing. Our knowledge about the distribution of eigenvalues, eigenvectors and determinants of complex Wishart matrices and their limiting behavior is emerging as a key tool in a number of applications, e.g., in data compression and analysis of wireless MIMO channels [1, 2], array processing, source enumeration and identification [3–5], adaptive algorithms [6, 7]. The densities of the singular values of random matrices and their asymptotic behavior (as the matrix size tends to infinity) has been employed in some applications [8–10]. The eigenvalues of the SCM are often used to describe many signal processing problems. For example in [8], they are used as sufficient statistics for array source enumeration.

Let X ₁, …, X _N be N independent zero mean Gaussian random vectors with covariance matrix of A, i.e., $N_{M} (0, A)$ , where A is a nonnegative M × M Hermitian matrix. The SCM R _N is defined as $R_{N} = \frac{1}{N} \sum_{i = 1}^{N} X_{i} X_{i}^{H} = \frac{1}{N} {XX}^{H}$ , where X = [X ₁,…,X _N] contains N snapshots of the received data. In this article, we refer to this SCM as the SCM with rectangular window (SCM-R) as all data samples have equal weights, i.e., a rectangular window is used. In this case R _N has a Wishart distribution [11] and for more than four decades, it has been known that the joint probability density function (PDF) of its eigenvalues, can be expressed in terms of hyper-geometric functions [12]. More recently, a simpler form of this joint PDF was derived in terms of the product of two determinants [13]. However, this form is applicable if the array is small and the eigenvalues of the covariance matrix of the observed data are distinct. Several articles have investigated the behavior of the eigenvalues of R _N when M, N → ∞ assuming $\frac{M}{N} \to c > 0$ [14, 15]. This is a more realistic assumption than assuming M is finite and N is infinite, because in most practical applications the covariance matrix A slowly varies, hence, the effective window length could not be arbitrary long. For instance, the eigenvalue estimators that are consistent in this asymptotic regime are more robust to finite sample size than other estimators which are only guaranteed to converge for fixed M and N→∞[9]. There are many works on the distribution of eigenvalues in this asymptotic regime, such as information-plus-noise [16] and spiked models where all eigenvalues are equal excluding a small number of fixed eigenvalues (spikes) [17]. Specifically, the distribution of the largest noise eigenvalue is widely studied [18, 19].

Some signal processing algorithms process a batch of data together and deal with the SCM-R. In addition, the existing results in literature about the behavior of the eigenvalues mainly consider the rectangular window. However in a number of practical signal processing algorithms, the SCM is estimated by applying a window as follows

R_{N} = \frac{1}{N} \sum_{i = 1}^{N} w_{i} X_{i} X_{i}^{H},

(1)

where {w _i ≥ 0,i = 1,…,N} is a non-negative sequence. Hereafter, we refer to R _N as the SCM. The SCM-R is obtained using a rectangular window, i.e., where w _i is non-zero and constant for i = 1,…,N. These weights allow to flexibly emphasize or deemphasize some of the observations. For example smaller weights for old data samples allows to improve the agility of the algorithms. For instance in cognitive radio, it is important to detect the activities of users and the idle channels as fast as possible, thereby reducing the detection time and improving the agility of the system [20, 21]. Among all windows, the exponential window, w _i = w ₀ p ⁱ, is commonly used. Two reasons for this popularity are (1) this window allows to develop fast recursive algorithms which are considerably less expensive in terms of computational complexity, thereby facilitate the real-time implementation of these algorithms (e.g. see [22, 23]) and (2) allows to forget the old data, thereby improving the tracking ability in non-stationary environments. For instance exponentially windowed data is used in most of the existing subspace tracking algorithms[24, 25]. That is because only a rank-one update is required for each new data vector to update the underlying SCM, which leads to simple low cost subspace tracking algorithms.

In this article, we study the effects of windowing on the distribution of the eigenvalues of the SCM. In this case, the SCM in (1) has a doubly correlated Wishart distribution [26–30]. We must note that, there are numerous research results for the case of Wishart matrices, however, the spectral properties in the doubly correlated case has not been sufficiently studied.

Manipulating the joint PDF of the eigenvalues which is a very complex function is not practical, particularly for large matrices. An alternative approach used in the literature, is to employ the following empirical spectral distribution (e.s.d.) of a square matrix $A \in C^{M \times M}$

F^{A} (x) \overset{Δ}{=} \frac{1}{M} # {λ_{i} \leq x | i = 1, \dots, M},

(2)

where λ ₁, λ ₂, … ,λ _M are eigenvalues of A and #{.} denotes the cardinality of a set. Note that, in this definition all eigenvalues of A are assumed to be real. Although this formulation is less explicit than the joint PDF of eigenvalues, it describes the statistical behavior of the eigenvalues. In many practical cases A is a random matrix and the e.s.d. F ^A(x) is a random function which converges almost surly to a deterministic cumulative distribution function as the dimension of the system grows. In such cases, lim M → ∞ F ^A(x) is referred to as the limiting spectral distribution (l.s.d.) of A.

In recent years, some results have been obtained on the limiting behavior of the e.s.d. of correlated Wishart matrices. In this article, for the white noise case, we study the behavior of eigenvalues of the SCM. In particular for the exponential window, we extend the results previously demonstrated in [31] and give more details along with the proofs of the required theorems. We then consider the case of signal plus noise and present a method to determine the support of eigenvalues. The main contributions in this article are

A method is proposed to approximate the spectral distribution of the SCM using arbitrary windows with that of an equivalent Wishart Distribution. For the especial case of white noise (noise only), this approximation is the Marchenko–Pastur (M–P) distribution, which is the known distribution for the case of a rectangular window.
In Theorem 2, we derive an accurate and explicit equation for the l.s.d. of the SCM of noise-only data for the exponential window. Many simulations are performed to show the accuracy of this l.s.d.
In Theorem 3 we present a systematic method to compute the support of eigenvalues in the signal plus noise data case using an exponentially weighted window. In addition to the results, we follow up a different and novel approach in proving this theorem compared with the existing proof for the rectangular window case where the Stieltjes transform m(z) has the explicit inverse [15]. This approach can be easily utilized for other window types where the Stieltjes transform is expressed explicitly or implicitly as a function of z.

The demonstrated results provide a key step toward characterization of the distribution of eigenvalues in the general Covariance matrix of windowed data. The results of this work are useful in the design and implementation of robust algorithms using windowed snapshots. Our derivations in Theorems 2 and 3 can be directly used to design unbiased eigenvalue and eigenvector estimators. These estimators are important especially because the exponential window is used in numerous applications. They can be used as a basis to improve the performance and accuracy of many existing algorithms which are based on exponentially windowed data, in many fields such as subspace tracking, DOA estimation and source enumeration.

The remainder of this article is organized as follows: Section 2 introduces the system model and some important mathematical tools. We derive an approximation for the Stieltjes transform of l.s.d. of eigenvalues of weighted windowed array data in Section 3. Asymptotic spectrum of the eigenvalues in noise-only data case is analyzed in Section 4. The signal plus noise case is studied in Section 5. Section 6 provides simulation results. Finally, we conclude this work and suggest future works in Section 7.

2 System model for windowed SCM

We assume that $X_{i} \in C^{M}$ in (1) is a circularly symmetrical independent Gaussian random vector process with zero mean and covariance matrix of $A \in C^{M \times M}$ , i.e., $X_{i} \sim N_{M} (0, A)$ . In this case, we can rewrite (1), supposing that SCM is estimated using a window of size N with positive coefficients w ₁,…,w _N,

R_{N} = \frac{1}{N} \sum_{i = 1}^{N} w_{i} A^{\frac{1}{2}} U_{i} U_{i}^{H} A^{\frac{1}{2}} = \frac{1}{N} A^{\frac{1}{2}} U W_{N} U^{H} A^{\frac{1}{2}},

(3)

where U = [U ₁,…,U _N] is an M × N matrix contains i.i.d. zero-mean unit-variance complex Gaussian entries and $W_{N} \overset{Δ}{=} diag (w_{1}, \dots, w_{N})$ . The matrix R _N has a doubly correlated Wishart distribution. In practice, it is very complex to directly characterize the e.s.d. of R _N thus, we use the Stieltjes transform of this distribution and indirectly characterize the behavior of the eigenvalues. Then, in the asymptotic regime as M,N→∞ given $\frac{M}{N} \to c > 0$ , the inverse transform of the limit gives the l.s.d of SCM.

Definition 1

[15]Stieltjes transform m(z), $z \in C^{+} \equiv {z \in C : Im (z) > 0}$ of a distribution function F ^R(x) is defined as

m (z) = \int \frac{1}{λ - z} d F^{R} (λ) .

(4)

The inverse Stieltjes transform formula is as follows:

F^{R} (x) = \frac{1}{Π} \lim_{y \to 0^{+}} \int_{- \infty}^{x} Im {m (t + iy)} dt, \forall x \in R .

(5)

Hence, in order to characterize the asymptotic distribution of the sample eigenvalues, we alternatively characterize the asymptotic behavior of the corresponding Stieltjes transform, and then use the Stieltjes inversion formula in (5) to obtain l.s.d. of SCM f ^R(x). We use the following theorem which gives the Stieltjes transform of the correlated Wishart matrix [29] and is the basis for derivations in this article.

Theorem 1

For a finite length window with length of N , consider the matrix defined by $R_{N} = \frac{1}{N} {A_{N}}^{\frac{1}{2}} U W_{N} U^{H} {A_{N}}^{\frac{1}{2}}$ . Assume that all elements of $U \in C^{M \times M}$ are i.i.d. random variables with zero-mean, unit variance and finite E {|U _{i
j}|⁴}. In addition, suppose that $A_{N} \in C^{M \times M}$ is a Hermitian nonnegative definite matrix, W _N = diag( w ₁, …, w _N), $F^{A_{N}} \overset{D}{\to} F^{A}$ , $F^{W_{N}} \overset{D}{\to} F^{W}$ when M,N → ∞ with $\frac{M}{N} \to c > 0$ . In this case, the empirical distribution $F^{R_{N}}$ , with probability 1, converges weakly to a probability distribution function F ^R whose Stieltjes transform m(z), for $z \in C^{+}$ , is given by

m (z) = \int \frac{1}{a (\int \frac{w}{1 + cwe (z)} d F^{W} (w)) - z} d F^{A} (a),

(6)

where e(z) is the unique solution of the following equation in $C^{+}$

e (z) = \int \frac{a}{a (\int \frac{w}{1 + cwe (z)} d F^{W} (w)) - z} d F^{A} (a)

(7)

Proof 1

See[29]for proof. Similar results are also demonstrated in[26], [28]with some differences in the assumptions on correlation matrices. □

We emphasize that (6) and (7) give the exact distribution in the asymptotic regime as M,N → ∞ with $\frac{M}{N} \to c > 0$ . Since in practice, the array dimension and/or sample size are usually finite numbers, this method gives a deterministic approximation for the actual sample eigenvalue distribution.

To show how this method works, we now consider the simplest case (where the distribution is well known) using a rectangular window and white Gaussian noise, i.e., W = I _N × N and A = σ ² I _M × M. In this case, we have d F ^W(w) = δ(w − 1)d w and d F ^A(x) = δ(x − σ ²)d x, where δ(x) is the Dirac delta function. Thus with straightforward manipulations of (6) and (7), the Stieltjes transform is found to be the solution of

z = z (m) = \frac{σ^{2}}{1 + c σ^{2} m} - \frac{1}{m} .

(8)

In this case, as expected the e.s.d. of the SCM-R, $F^{R_{N}} (x)$ , converges to the M–P distribution [14] as follows,

\begin{array}{l} f^{MP} (x) = & \frac{d}{dx} F^{MP} (x), = \frac{max (c, 1) - 1}{c} δ (x) \\ + \frac{\sqrt{(x - a_{-}) (a_{+} - x)}}{2 Π σ^{2} xc} π_{a_{-}, a_{+}} (x), \end{array}

(9)

where $a_{\pm} = σ^{2} {(1 \pm \sqrt{c})}^{2}$ and $π_{a, b} (x) = \{\begin{matrix} 1 & a \leq x \leq b \\ 0 & otherwise \end{matrix}$ .

Now, let us consider an arbitrary window and white noise A = σ ² I _M×M. In this case from (6), (7) and d F ^A(x) = δ(x − σ ²)d x, we obtain $m (z) = \frac{1}{σ^{2}} e (z)$ . Thus, from (6), we obtain

m (z) = \frac{1}{σ^{2} \int \frac{w}{1 + cw σ^{2} m} d F^{W} (w) - z} .

(10)

3 Effective length of a window

In this section, we define the effective length of a window which allows to approximate the distribution of the eigenvalues of windowed SCM with that of a rectangular window with an equivalent length, assuming that the covariance matrix of data A satisfies the assumptions of Theorem 1. In several existing articles some intuitive equivalent length are defined simply to extend the previously existing results for the rectangular case in order to analyze the behavior of the eigenvalues in the weighted window cases [22], [23].

Consider a window w _i>0 of length N and denote $W_{N} = diag (w_{1}, \dots, w_{N}) \in R^{N \times N}$ with a converging distribution, i.e., $\lim_{N \to \infty} F^{W_{N}} = F^{W}$ . We assume that the sample size N is much larger than the array dimension M, i.e., c is small. It is known that m(z) is bounded for z ∈ C ⁺[15]. Thus for 0 < c ≪ 1 we have 0 < c σ ²|m| sup |w| < β < 1 where β is some constant number. It is easy to show that^a, we have $|\frac{σ^{2} w}{1 + cw σ^{2} m} - \frac{1}{cm} \sum_{i = 1}^{I} {(- cw σ^{2} m)}^{i}| \leq \frac{| cm |^{I} | w σ^{2} |^{I + 1}}{1 - β}$ . This yields

\int \frac{σ^{2} wd F^{W} (w)}{1 + cw σ^{2} m} = c^{I} O + \frac{1}{cm} \sum_{i = 1}^{I} {(- c σ^{2} m)}^{i} E {w^{i}},

(11)

where $E {.} = \int (.) d F^{W} (w)$ and $| O | \leq \frac{| m |^{I} | σ^{2} |^{I + 1}}{1 - β} E {w^{I + 1}}$ .

Since 0 < c ≪ 1, for I = 2 and defining $c_{e} = c \frac{E {w^{2}}}{E^{2} {w}}$ and w _e=E{w} as the effective parameters, we can rewrite (11) as

z \approx \frac{E {w} σ^{2}}{1 + c \frac{E {w^{2}}}{E {w}} σ^{2} m} - \frac{1}{m} = \frac{w_{e} σ^{2}}{1 + c_{e} w_{e} σ^{2} m} - \frac{1}{m},

(12)

where using E{w ³} < sup{w ²}E{w} and E{w ²} < sup{w}E{w} it is easy to show that the approximation error is bounded by $σ^{2} E {w} \frac{2 β^{2}}{1 - β}$ .

Definition 2

The expression (12) represents the M–P distribution as in (8) for a rectangular window of length

N_{e} = \frac{M}{c_{e}} = N \frac{E^{2} {w}}{E {w^{2}}},

(13)

with all coefficients equal to w _e. The average weight w _e is a scale parameter for the eigenvalues of covariance matrix of the received data. Although we have derived the effective length for the noise only data, our results reveal that this effective window length gives accurate results for the signal plus noise case.

For the white noise data, the l.s.d. of SCM can be approximated by the M–P distribution defined in (9) by substituting c and σ ², with c _e and w _e σ ², respectively. Note that the effective window length is always smaller than the number of samples N. This approximation can be intuitively interpreted as a Wishart approximation where the effect of “windowing” is approximated with a rectangular window with an effective number of samples of N _e and the covariance matrix of the received data is scaled to A _e=w _e A”.

Now, we compute the effective length of the triangular and the exponential windows. A triangular window is defined by $w_{i} = 2 (1 - \frac{i - 1}{N - 1})$ for i = 1,…,N ≫ 1 and has the average weight of $\frac{1}{N} \sum_{i = 1}^{N} w_{i} = 1$ . Using (13), the effective length of the triangular window is

N_{e} = \frac{3}{2} \frac{N - 1}{2 N - 1} N \approx \frac{3}{4} N.

(14)

The exponential window is very popular in signal processing applications due to its simple implementation and is defined by w _i=w ₀ p ⁱ for i=1,2,…, where p∈(0,1) and w ₀ is a normalization constant. We note that the exponential window is inherently an infinite length window. Interestingly, in Theorem 1 the window length and the array dimension jointly tend to infinity where $\lim_{M, N \to \infty} \frac{M}{N} = c > 0$ . Here for a finite array dimension M, we first approximate the exponential window with N coefficients, which is only accurate if N is large enough such that the omitted coefficients are negligible. Asymptotically as M,N jointly tend to ∞, the results from this truncated window become accurate for describing the underlying distributions using the exponential window. In this case, with some calculations we obtain $N_{e} = \frac{1 - p^{2}}{{(1 - p)}^{2}} \frac{{(1 - p^{N - 1})}^{2}}{1 - p^{2 N - 2}}$ . Thus the effective length of the exponential window (for N≫1) becomes

N_{e} \approx \frac{1 + p}{1 - p},

(15)

which is not a function of N. As expected the effective length of the window increases as the forgetting factor p approaches one.

4 Spectral analysis of noise-only data

In this section, for the windowed data case, the l.s.d. of the SCM is characterized more accurately. In practice, the array dimension and the effective window length are both finite. However, we are interested in the impact of the weights of the window f ^W(w), on the limiting distribution of the eigenvalues as M,N→∞ employing Theorem 1. We use two approaches to model f ^W(w), Discrete and Continuous. The former considers f ^W(w) as a finite sum of discrete masses at the coefficients of the window. The discontinuous distribution function modeling is useful to analyze the support of eigenvalues and its connectivity. The latter approach, approximates f ^W(w) as a continuous function allowing to derive some explicit equations for the Stieltjes transform.

Let S _F denote the support of the function F ^R(x) and $S_{F}^{c}$ shows its complement. From (5), we see that S _F consists of points on the real axis where the imaginary part of m(z) is positive, i.e., the support is the union of some subintervals. Thus to find the support of the distribution of eigenvalues, we must determine such intervals. In [15], it is shown that limy→0⁺ m _F(x+i y) exists for all x≠0, and therefore we can define

m_{F} (x) = \lim_{y \to 0^{+}} m_{F} (x + iy), x \in R ∖ {0} .

(16)

The following lemma is the key to determine these intervals on real axis [15].

Lemma 1 ([32], Lemma 6.1)

For any c.d.f. F, let S _F denote its support and $S_{F}^{c}$ be the complement of S _F. For $x \in S_{F}^{c}$ , m=m _F(x) is the only real solution of x = z (m) which satisfies

\frac{dz (m)}{dm} > 0,

(17)

where z(m) is the inverse function of m(z). Also conversely, for any real m in the domain of z(m) if $\frac{dz (m)}{dm} > 0$ then x = z (m) is outside the support of F.

This simply means that the support S _F, is the union of intervals on the vertical axis where z(m) is increasing for real values of m. According to (10), for noise only data z(m) can be written as follows

z (m) = \int \frac{σ^{2} w}{1 + cw σ^{2} m} d F^{W} (w) - \frac{1}{m} .

(18)

4.1 Discrete distribution function approach

Suppose the window consists of N _d distinct weights w _i,i=1,…,N _d, each with multiplicity $n = \frac{N}{N_{d}}$ . Therefore as (M,N→∞ and $\frac{M}{N} \to c > 0$ ), we can evaluate (18) in terms of the weights

z (m) = \frac{1}{N_{d}} \sum_{i = 1}^{N_{d}} \frac{σ^{2} w_{i}}{1 + c w_{i} σ^{2} m} - \frac{1}{m} .

(19)

Figure 1, represent a typical case of the function on the right-hand side of (19). Lemma 1 states that the support of the distribution of eigenvalues is the complement of the set of all values x ∈ R ⁺ for which x = z(m) is increasing for real values of m, i.e., $(\frac{dz (m)}{dm} > 0)$ . The function z(m) has poles at m=0, $- \frac{1}{c w_{1} σ^{2}}, \dots, - \frac{1}{c w_{N_{d}} σ^{2}}$ . In addition, z(m) is an analytic function and we have

\lim_{m \to 0^{\pm}} z (m) = \mp \infty, \lim_{m \to {(- \frac{1}{c w_{i} σ^{2}})}^{\pm}} z (m) = \pm ∞.

(20)

For c < 1, i.e., where the length of the window is more than the array dimension, $\lim_{m \to \pm \infty} z (m) = 0^{\pm}$ , thus as Figure 1 shows, $\frac{dz (m)}{dm} = 0$ has at least two solutions which we denote them $m_{u} \in (\frac{- 1}{c σ^{2} max (w_{i})}, 0)$ and m _l∈(0,∞). For c > 1, as Figure 2 shows typically, from lim_m→±∞ z(m) = 0^∓ we conclude that m _l must be in $(- \infty, \frac{- 1}{c σ^{2} min (w_{i})})$ . We must note that, for c > 1, the SCM has M − N zero eigenvalues expressed with a probability mass of $(1 - \frac{1}{c})$ in the l.s.d. of SCM, that is not counted as a cluster in these derivations, i.e. for c >, the PDF of the distribution includes a term of $(1 - \frac{1}{c}) δ (x)$ . If the weights are widely separated, the support of eigenvalues may become fragmented into union of a number of disjoint intervals.

In many signal processing applications the white noise subspace is separated from the signal subspace based on the eigenvalues of the SCM. Such a fragmentation of the support of noise eigenvalues misleads the subspace based algorithms and leads to noise eigenvalues to be mistaken as signal ones.

In fact, it is desirable that the support of eigenvalues be as compact as possible. To avoid such an undesirable fragmentation, the equation $\frac{dz (m)}{dm} = 0$ should not have real solution for $m \in (\frac{- 1}{c σ^{2} min (w_{i})}, \frac{- 1}{c σ^{2} max (w_{i})})$ , i.e.,

\begin{align} \frac{1}{N_{d}} \sum_{i = 1}^{N_{d}} {(1 - \frac{1}{1 + c w_{i} σ^{2} m})}^{2} > c, \\ \forall & m \in (\frac{- 1}{c σ^{2} min (w_{i})}, \frac{- 1}{c σ^{2} max (w_{i})}) . \end{align}

(21)

Under this connectivity condition, the support of eigenvalues is the interval [x _l = z(m _l),x _u=z(m _u)], which can be calculated, numerically. Our simulations show that this condition is satisfied for popular window types especially for N _d ≫ 1 used in practice. Figure 2 shows a typical case for c > 1 where $\frac{dz (m)}{dm} = 0$ has an even number of real-valued solutions (counting multiplicities) which we denote them by $m_{1}^{-} \leq m_{1}^{+} < \dots < m_{q}^{-} \leq m_{q}^{+}$ (in addition to m _l,m _u). Each pair of these solutions determines a sub-interval for the support of eigenvalues, i.e., we have $S_{F} = [x_{l}, x_{u}] - {[x_{1}^{-}, x_{1}^{+}] \cup \dots \cup [x_{q}^{-}, x_{q}^{+}]}$ , where $x_{i}^{-} = z (m_{i}^{-}), x_{i}^{+} = z (m_{i}^{+})$ . Reducing c or reducing the gap between weight values {w _i} makes the support more compact at the expense of using more temporal samples.

4.2 Continuous function approach

The goal of this approach is to find closed form expressions of Stieltjes integrals of the l.s.d. This approach could be used for any window shapes. However, we start with the triangular window and then consider the exponential window which are more popular. Here, we model the function f ^W(w) with a continuous distribution and evaluate (18) to found the Stieltjes transform.

For a triangular window $w_{i} = 2 (1 - \frac{i - 1}{N - 1}), i = 1, \dots, N$ , we have $F^{W_{N}} (w) = \frac{1}{N} \sum_{i = 1}^{N} U (w - w_{i})$ where U(w) is the unit step function. In this case it is easy to show that $F^{W_{N}} (w)$ converges to a uniform distribution as N increases, i.e.,

\lim_{N \to \infty} F^{W_{N}} (w) = F^{W} (w) = \{\begin{matrix} \frac{1}{2} w, 0 < w < 2, \\ 0, otherwise. \end{matrix}

(22)

Substituting f ^W(w) in (18), we get

z (m) = \frac{1}{cm} (1 - \frac{1}{2 c σ^{2} m} ln (1 + 2 c σ^{2} m)) - \frac{1}{m},

(23)

for $m \in (- \frac{1}{2 c σ^{2}}, \infty)$ and m≠0.

Again, we first use Lemma 1 and determine the support of eigenvalues (by plotting z(m) for real m and finding the intervals on the vertical axis where z(m) is not increasing). Figure 3 plots the lower and upper boundaries of support of eigenvalues for a triangular window for different values of c. It can be seen that the discrete distribution $F^{W_{N}} (w)$ (assuming N _d=50) and the continuous approach result in almost the same boundaries. We also observe that these boundaries are close to those obtained by the Wishart approximation assuming the effective window length in (14). In this figure using the rectangular window with same length as the triangular window, the distribution is referred to as the M–P distribution. Also we observe that the eigenvalues tend to more concentrate around their real value σ ²=1 as the window length increases. In addition from this figure, we conclude that the support using the triangular window is looser than than that of the rectangular window for a given value of c, because the effective length of the triangular window is less than that of a rectangular window.

For the exponential window, first we introduce the new parameter γ as the ratio of smallest to largest weights of the truncated exponential window. The coefficients of the window can be redefined as as a function of γ as $w_{i} = w_{0} γ^{\frac{i}{N}}, i = 1, \dots, N$ . Therefore from $f^{W_{N}} (w) = \sum_{i = 1}^{N} \frac{1}{N} δ (w - w_{i})$ , from $i = \frac{N}{ln γ} ln (\frac{w_{i}}{w_{0}})$ , it is easy to show that

\begin{array}{l} F^{W_{N}} (w) = \{\begin{array}{l} 0 w < γ w_{0}, \\ 1 - \frac{1}{N} ⌊\frac{N}{ln γ} ln (\frac{w}{w_{0}})⌋ γ w_{0} \leq w \leq w_{0} γ^{\frac{1}{N}}, \\ 1 w_{0} γ^{\frac{1}{N}} \leq w, \end{array} \end{array}

(24)

where ⌊.⌋ is the floor function. This increasing staircase function takes values on $\{0, \frac{1}{N}, \frac{2}{N}, \dots, 1\}$ . To satisfy the constraints of Theorem 1 for the exponential window, we assume that the ratio of smallest to largest weights of the window, γ=p ^N > 0, is an arbitrary small real constant. In other words, the forgetting factor of the window $p = γ^{\frac{1}{N}} \in (0, 1)$ approaches to 1, as M,N→∞. The smaller γ, the better this truncated exponential model fits the exponential window with the forgetting factor p. From, $\lim_{N \to \infty} w_{0} = \frac{ln γ}{γ - 1}$ , we conclude that $\lim_{N \to \infty} F^{W_{N}} = F^{W} (w)$ where

\begin{matrix} F^{W} (w) = \{\begin{array}{l} 0 w < \frac{γ ln γ}{γ - 1}, \\ 1 - \frac{1}{ln γ} ln (\frac{w (γ - 1)}{ln γ}) \frac{γ ln γ}{γ - 1} < w < \frac{ln γ}{γ - 1}, \\ 1 w > \frac{ln γ}{γ - 1} . \end{array} \end{matrix}

(25)

is a continuous function, independent of window size N and satisfies the assumptions of Theorem 1. Thus, this theorem is applicable to the exponential window truncated at some large integer N.

Substituting f ^W(w) in (18), in the asymptotic regime of Theorem 1 as γ→0, such that $\frac{M}{n_{0}} \to c_{0}$ , z(m) satisfies

z (m) = \frac{1}{c_{0} m} ln (1 + c_{0} σ^{2} m) - \frac{1}{m},

(26)

for all $m \in (- \frac{1}{c_{0} σ^{2}}, \infty) ∖ {0}$ where $n_{0} = - \frac{1}{ln (p)}$ .

One can use the same method as in the discrete distribution function approach and identify the support of the distribution S _F. However, the function z(m) in (26) is simple and the following theorem gives the explicit distribution.

Theorem 2

For the exponentially weighted window, the l.s.d. of SCM, f ^R(x), is given by

f^{R} (x) = \frac{e^{c_{0} - \frac{x}{σ^{2}}}}{Π c_{0} σ^{2}} Im (e^{- ω_{- 1} (- \frac{x}{σ^{2}} exp \{c_{0} - \frac{x}{σ^{2}}\})}) π_{x_{-}, x_{+}} (x),

(27)

and upper and lower boundaries of the support are

x_{-} = σ^{2} \frac{ω_{0} (- e^{- c_{0} - 1}) + 1}{exp {ω_{0} (- exp (- c_{0} - 1)) + c_{0} + 1} - 1},

(28a)

x_{+} = σ^{2} \frac{ω_{- 1} (- e^{- c_{0} - 1}) + 1}{exp {ω_{- 1} (- exp (- c_{0} - 1)) + c_{0} + 1} - 1}

(28b)

respectively, where ω _k(x) is the branch of Lambert W function ^b[33]with k=−1 and k=0.

Proof 2

According to the Lemma 1, boundaries of the support of eigenvalues are the real solutions of z ^′(m) = 0, i.e., with some simple calculations, are the solutions of

ln (1 + c_{0} σ^{2} m) = c_{0} + 1 - \frac{1}{1 + c_{0} σ^{2} m} .

(29)

Denoting y= ln(1+c ₀ σ ² m)−c ₀−1, we obtain

y e^{y} = - e^{- c_{0} - 1} \in [- e^{- 1}, 0), \forall c_{0} > 0 .

(30)

This equation has two real solutions m ₋ and m ₊ expressed using Lambert W function as

\begin{align} m_{-} = \frac{1}{c_{0} σ^{2}} (exp {ω_{0} (- e^{- c_{0} - 1}) + c_{0} + 1} - 1), \end{align}

(31)

\begin{align} m_{+} = \frac{1}{c_{0} σ^{2}} (exp {ω_{- 1} (- e^{- c_{0} - 1}) + c_{0} + 1} - 1) . \end{align}

(32)

Using (26), the boundaries z(m ₋) and z(m ₊) are obtained as in (28a) and (28b) which determine the support of eigenvalues as the interval $[z (m_{-}), z (m_{+})] \subset R$ .

To obtain the l.s.d. of SCM, we should find m(z) with positive imaginary part for all z∈[z(m ₋),z(m ₊)]. In (26), we denote

v = - ln (1 + c_{0} σ^{2} m) - \frac{z}{σ^{2}} + c_{0},

(33)

and obtain

v e^{v} = (- \frac{z}{σ^{2}} e^{c_{0} - \frac{z}{σ^{2}}}) .

(34)

Therefore, the solutions are

v_{k} = ω_{k} (- \frac{z}{σ^{2}} e^{c_{0} - \frac{z}{σ^{2}}}), \forall k \in Z .

(35)

According to (16), (33), for the values of z in the interval of the obtained support on the real axis, due to the properties of the complex logarithm function, the imaginary part of v is in [−Π,Π], thus only the branches with k = 0 and k = − 1 are acceptable solutions. It is easy to see that for z ∈ [z(m ₋),z(m ₊)], the expression on the right-hand side of (34) belongs to $[- e^{c_{0} - 1}, - e^{- 1}]$ . From (33) and properties of Lambert W function, we also deduce that Im{m} and sin(−Im{v}) have the same signs, and for $x \in [- e^{c_{0} - 1}, - e^{- 1}]$ the function sin(−Im{ω _k(x)}) is positive for k = − 1 and is negative for k = 0. Therefore, the Stieltjes transform of the l.s.d. of SCM is obtained from (35) and (33) as

m = \frac{1}{c_{0} σ^{2}} (e^{c_{0} - \frac{z}{σ^{2}} - ω_{- 1} (- \frac{z}{σ^{2}} e^{c_{0} - \frac{z}{σ^{2}}})} - 1) .

(36)

Using the inverse formula in (18), the l.s.d of SCM is

\begin{array}{l} f^{R} (x) = & \frac{1}{Π} Im [\frac{1}{c_{0} σ^{2}} (e_{0}^{c} - \frac{x}{σ^{2}} - ω_{- 1} \\ (- \frac{x}{σ^{2}} exp \{c_{0} - \frac{x}{σ^{2}}\}) - 1)] . \end{array}

(37)

Dropping the real terms inside the brackets and applying some simplifications, we obtain (27). □

We can define a second effective window length by employing and comparing the boundaries of the support of eigenvalues in (28a) and in (9) for a rectangular window which is only in terms of σ ² and c. Equating the length of the supports in (9) (28a), i.e., a ₊−a ₋ = x ₊−x ₋, we can find a rectangular window to match the support as same as that of the exponential window and define the length of this rectangular window as another effective length for the exponential window. In some array signal processing applications, the effective length of the exponential window has been considered to be $N_{e} = \frac{1}{1 - p}$ [22], [23]. Figure 4 compares these effective lengthes in terms of the forgetting factor p and reveals that the effective window length defined in (13) gives an accurate approximation for the exponential window. We also see a large gap between the traditional approximation for the effective length in [22] and what is obtained in this article using random matrix theory.

Remark 1

In the economic literature, other methods have been proposed to approximate the spectral density function of exponentially weighted financial covariance matrices for Portfolio Optimization[34], [35]. These methods that are used in other articles (e.g., in[36], [37]) are based on numerical calculations rather than developing some closed form expressions. Pafka et al.[34]supposed that the density of the eigenvalues is aproximated by $ρ (u) = \frac{Qν}{Π}$ where $Q = \frac{1}{M (1 - p)}$ and ν is the root of

\frac{u}{σ^{2}} - \frac{uν}{tan (uν)} + ln (ν σ^{2}) - ln (sin (uν)) - \frac{1}{Q}

(38)

In contrast to these methods for the exponential window, we derive an accurate explicit closed form expression which can be easily employed in many applications such as in signal processing and economy.

5 Spectral analysis of signal plus noise data

In this section, we consider the case of white noise plus some signal sources, i.e., where the eigenvalues of A are not equal. In the general case, let λ _q > ⋯ > λ ₁ >0 denote the set of q distinct eigenvalues of the covariance matrix and the multiplicity of λ _ℓ is denoted by k _ℓ (we must have $M = \sum_{ℓ = 1}^{q} k_{ℓ}$ ). For example suppose a real phased array communication system with q − 1 independent signals impinging on it simultaneously on the same frequency band from different directions where q < M. The smallest eigenvalue λ ₁ can be interpreted as the noise eigenvalue and other q − 1 larger eigenvalues are referred to as signal eigenvalues. In the asymptotic regime, when N,M are growing large, we assume that $\frac{k_{ℓ}}{M} \to α_{ℓ} > 0$ , where α _ℓ,ℓ= 1, …,q are multiplicity ratios of eigenvalues. In this case the spectral distribution of the matrix A in Theorem 1 can be expressed as sum of Dirac delta functions, i.e. $d F^{A} (a) = \sum_{i = 1}^{q} α_{i} δ (a - λ_{i}) da$ .

In what follows, we present an approach to determine the support of eigenvalues and also the l.s.d. of exponentially weighted SCM of signal plus noise data in the asymptotic regime. The first in determining the distribution of the eigenvalues is to determine its support on the real positive axis.

The definition of the Stieltjes transform in (4) implies that for any distribution F and real x outside the support of F, m(x) is well defined and its derivative, $m^{'} (x) = \int \frac{dF (y)}{{(y - x)}^{2}}$ , is obviously real and positive. Thus, m(z) is increasing on intervals on real line outside the support of its distribution function F[15]. Therefore, the inverse function theorem proves that its inverse exists on these intervals and shall also be increasing. For the one sided correlated Wishart matrices, where the inverse of m(z) has an explicit expression, Lemma 1 shows that the converse of the above statements are also true [15], i.e. for any real m in the domain of z(m), if $\frac{dz (m)}{dm} > 0$ then x = z (m) is outside the support of the distribution. Therefore, the support of eigenvalues is a Borel subset of $R^{+}$ for which z(m) is increasing which can be determined by simply plotting the inverse function z(m) for real m. Paul and Silverstein ([29], page 2) suggested the same method for doubly correlated Wishart matrices if there exists an explicit inverse z = z(m) for the limiting Stieltjes transform m(z). Unfortunately, for non-rectangular windows, the inverse of m(z) in general has no explicit expression [29]. Fortunately, by introducing two auxiliary variables u and h in what follows for the exponential window, we found z(h) which implicitly expresses z as a function of m. Then, we prove that the same method can be extended for the exponential window case, while the main difference here is that we are able to use the implicit expressions to determine this Borel set. Although the exponential window case is studied in this article, the same approach may be used for some other window types, to determine the support of eigenvalues.

From (6) and (7) we obtain

1 + zm = \int \frac{we}{1 + cwe} d F^{W} (w) .

(39)

Substituting (25) in (39), we get

1 + zm = \frac{1}{c_{0}} ln (\frac{1 - γ + c_{0} e}{1 - γ + γ c_{0} e}) .

(40)

Substituting d F ^A(a) in (7) changes the integral to a summation and we obtain e(z) as

e (z) = \sum_{i = 1}^{q} \frac{α_{i} λ_{i}}{\frac{λ_{i}}{c_{0} e (z)} ln (\frac{1 - γ + c_{0} e (z)}{1 - γ + γ c_{0} e (z)}) - z} .

(41)

According to Theorem 1, for any $z \in C^{+}$ , there is a unique solution e=e(z) for (41) in $C^{+}$ . In this case the Stieltjes transform m(z) is calculated from (40) as

m (z) = \frac{1}{z} (\frac{1}{c_{0}} ln (\frac{1 - γ + c_{0} e (z)}{1 - γ + γ c_{0} e (z)}) - 1)

(42)

This expression gives the implicit relation between m and z, which cannot be sorted to express m as an explicit function of z or conversely, z as a function of m. Defining the auxiliary variable/function u = c ₀ (1+z m(z)) which provides a bijective relation between e and m for all z≠0, we have

u = ln (\frac{1 - γ + c_{0} e (z)}{1 - γ + γ c_{0} e (z)}) .

(43)

This equation reveals that the imaginary parts of u and e have the same signs. In addition since γ <1, c ₀ is real and using the properties of the complex logarithm function in (43), we deduce that u always lies in a strip of the positive complex plane where its imaginary part is less than Π, i.e., the domain of u is defined as D _u = {u|0 < Im{u}<Π}. Equation (43) also provides a bijective relation between e and u, therefore according to Theorem 1 for any $z \in C^{+}$ , there is a unique u∈D _u, satisfying

u = c_{0} \sum_{i = 1}^{q} \frac{α_{i}}{\frac{λ_{i}}{z} \frac{u}{e^{u} - 1} \frac{1}{1 - γ} (1 - γ e^{u}) - 1} + c_{0} .

(44)

Defining the second auxiliary variable/function as

h = \frac{u}{(1 - e^{u}) z} \frac{1 - γ e^{u}}{1 - γ}, \forall z \neq 0 .

(45)

and define D _h as its range for all $z \in C^{+}$ . Resorting (44), we have

u = - c_{0} \sum_{i = 1}^{q} \frac{α_{i}}{λ_{i} h + 1} + c_{0} .

(46)

Proposition 1

The auxiliary variable h, as a function of u and z, has some interesting properties as:

(1) h always lies in the subset $D_{h} \subset C^{+}$ for all $z \in C^{+}$ .

(2) for h∈D _h, z can be explicitly expressed as a function of h

z (h) = \frac{c_{0}}{h} \frac{\sum_{i = 1}^{q} \frac{α_{i}}{λ_{i} h + 1} - 1}{e^{c_{0} \sum_{i = 1}^{q} \frac{α_{i} λ_{i} h}{λ_{i} h + 1}} - 1} \frac{1 - γ e^{c_{0} \sum_{i = 1}^{q} \frac{α_{i} λ_{i} h}{λ_{i} h + 1}}}{1 - γ} .

(47)

(1) For any $z \in C^{+}$ , a unique h satisfying (47) exists in

\{h \in C | Im \{h\} \sum_{i = 1}^{q} \frac{c_{0} α_{i} λ_{i}}{| λ_{i} h + 1 |^{2}} \in (0, Π)\} .

(48)

Proof 3

The first property can is simply implied from (46) as the imaginary part of h and u have the same sign. Using (45) and (46), we can easily find (47). The third property is proved as follows. The constraint in (48) is obtained from Im{u} ∈ (0,Π) and (46). According to Theorem 1, for any $z \in C^{+}$ , there is a unique u∈D _u, satisfying (44). The unique pair (z,u) gives an h in $C^{+}$ according to (45). In order to prove the uniqueness of h, suppose that h ₁ and h ₂ in $C^{+}$ satisfy (47) and (48). Thus, (46) yields u ₁,u ₂∈D _u satisfying (44). In addition, we must have u ₁ = u ₂ since for any $z \in C^{+}$ , there exists a unique u ₁∈D _u. Thus for z and u ₁ = u ₂, (45) yields that h ₁ = h ₂. □

Although z(h) in (47) is defined only for h∈D _h, it is an analytic function for all $h \in C ∖ \{0, \frac{- 1}{λ_{1}}, \dots, \frac{- 1}{λ_{q}}\}$ . In addition note that $z (h) = \frac{- 1}{h}$ at the roots of $\sum_{i = 1}^{q} \frac{α_{i}}{λ_{i} h + 1} - 1 = 0$ .

Also using (46) and (47) we express m as a function of h as follows

\begin{matrix} m_{h} (h) = \frac{1 - γ}{c_{0}} \frac{\sum_{i = 1}^{q} \frac{α_{i}}{λ_{i} h + 1}}{\sum_{i = 1}^{q} \frac{α_{i}}{λ_{i} h + 1} - 1} \frac{(1 - e^{- c_{0} \sum_{i = 1}^{q} \frac{α_{i}}{λ_{i} h + 1} + c_{0}}) h}{1 - γ e^{- c_{0} \sum_{i = 1}^{q} \frac{α_{i}}{λ_{i} h + 1} + c_{0}}}, \end{matrix}

(49)

for h∈D _h. Similar to z(h), the complex function m _h(h) is an analytic function for all $h \in C$ except at the set of real values $\{0, \frac{- 1}{λ_{1}}, \dots, \frac{- 1}{λ_{q}}\}$ and the points where z(h) = 0.

The inverse Stieltjes transform in (5) reveals that the l.s.d. depends on the behavior of m(z) in the vicinity of the real axis, i.e. for $z \to x_{0} \in R$ . Proposition 1 shows that the z(h) in (47) is injective over $h \in D_{h} \subset C^{+}$ and allows us to treat h(z) as its inverse for $z \in C^{+}$ . To determine the range of h(z) denoted by D _h we can evaluate z(h) for all h in (48) and take only those values of h for which z is in $C^{+}$ . As an example Figure 5 shows this region for c ₀ = 0.2 and a covariance matrix with two distinct eigenvalues 2 and 1 with the multiplicity ratios $α_{1} = \frac{1}{3}$ and $α_{2} = \frac{2}{3}$ . The white regions in Figure 5 shows the values of h for which z(h) has negative imaginary part, and the blue parts are the values where Im {z(h)} > 0. We observe that some parts of the positive complex plane are not in the domain of z(h) as we restrict the range to $z (h) \in C^{+}$ .

5.1 Support of eigenvalues

Theorem 3

For the exponentially weighted window defined in Theorem 2, under the assumptions of Theorem 1, the complement of support of eigenvalues, is the set of values of x = z(h) on the vertical axis where $\frac{dz (h)}{dh} > 0$ for some $h \in R$ , where z(h) is defined in (47).

Proof 4

Let S _F denotes the support of the function F ^R(x) and $S_{F}^{c}$ shows its complement. To prove Theorem 3, first we show that for any $x \in S_{F}^{c}$ , there exist a $h \in R$ where $\frac{dz (h)}{dh} > 0$ . Then, we prove the converse, i.e. if $\frac{dz (h)}{dh} > 0$ for some $h \in R$ , then x=z(h) is a real number outside the support of eigenvalues.

From (5), we see that S _F consists of points on the real axis where Im {m(x+i y)} tends to a positive number when y → 0⁺. Thus to find S _F, we must determine such subintervals on the real axis, or equivalently we can determine $S_{F}^{c}$ by finding the intervals on the real axis where lim y → 0⁺{m(x+i y)} is real. Consider any x ₁ and x ₂ such that $(x_{1}, x_{2}) \subset S_{F}^{c} \subset R^{+}$ . According to the definition of Stieltjes transform in (4), m(z) and $u (z) = c_{0} (1 + zm (z)) = c_{0} \int \frac{λdF (λ)}{λ - z}$ are both real and well defined for any z ∈ (x ₁, x ₂). In addition $\frac{du}{dz} = c_{0} \int \frac{λdF (λ)}{{(λ - z)}^{2}}$ is nonnegative on this interval. Thus u(z) is a real invertible function on (x ₁,x ₂), and its inverse z(u) is also real and increasing on the interval $(u (x_{1}), u (x_{2})) \in R$ , i.e. $\frac{dz}{du} > 0$ .

Lemma 2

For any given $z \in R^{+}$ , the function h(u,z) in (45) is monotonically increasing versus $u \in R$ .

Proof

Defining $h (0, z) = \frac{γ - 1}{z} = \lim_{u \to 0} h (u, z)$ , the function h(u,z) is continuous for all $u \in R$ and all $z \in R^{+}$ , and for all $z \in R^{+}$ we have □

\frac{∂h}{∂u} = \frac{(e^{u} (u - 1) + 1) + γ e^{u} (e^{u} - u - 1)}{{(e^{u} - 1)}^{2} z (1 - γ)} > 0 .

(50)

Since $\frac{dz}{dh} = \frac{dz}{du} \frac{du}{dh}$ and $\frac{du}{dh} = \frac{c_{0}}{M} \sum_{i = 1}^{q} \frac{k_{i} λ_{i}}{{(λ_{i} h + 1)}^{2}}$ are positive for all $h \in R$ , Lemma 2 implies that the signs of $\frac{dz}{du}$ and $\frac{dz}{dh}$ are identical. Thus if z is an increasing function of $u \in R$ , it is also an increasing function of $h \in R$ as well, and vice versa, i.e., the intervals for which z is increasing versus u is equal to the intervals for which z is increasing versus h. This proves the direct part of the theorem.

To prove the converse part, consider that Theorem 3 implies that $\frac{dz (h_{0})}{d h_{0}}$ is real and non-negative for some $h_{0} \in R$ . Since z(h) and m _h(h) are both real at point h = h ₀, it is sufficient to show that the point h ₀ belongs to the boundary of D _h. In this case, as the function m(h) is continuous in the complex plane (excluding few points as stated after (49)), we conclude that lim_y → 0 ⁺ Im {m(h ₀+i y)} = Im {m(h ₀)} = 0. To show that h ₀ is on the boundary of D _h, we prove that the points in the vicinity of h ₀ in the positive complex plane, belong to D _h. Let {h _n} be any complex sequence with positive imaginary part converging to h ₀ as N → ∞. Since z(h) is continuous, the sequence {z _n} = {z(h _n)} exists and converges to z(h ₀).

Lemma 3

Let z(h) be an analytic function of h over an open set G, and h(t) ∈ G be a differentiable curve at t. Then if $\frac{dz (h)}{dh}$ is a positive real number, we have $arg \{\frac{d}{dt} z (h)\} = arg {h^{'} (t)}$ .

Proof 5

This lemma is obtained from the Chain rule; since z(h(t)) is differentiable at t and $\frac{d}{dt} z (h (t)) = h^{'} (t) z^{'} (h)$ . Thus for positive real $\frac{dz (h)}{dh}$ , the argument of $\frac{d}{dt} z (h)$ and h ^′(t) are the same. □

We use Lemma 3 which implies that if $\frac{dz (h)}{dh}$ is positive and real at the point h = h ₀ then $arg \{\frac{d}{dt} z (h)\} = arg {h^{'} (t)}$ for any differentiable curve h(t) at t = t ₀ where h(t ₀) = h ₀, i.e. the slope of the curve h(t) in the complex plane is the same as the slope of z(h(t)) at h(t) = h ₀. Now for sufficiently large n, consider the line L _n = h(t) = (1 − t)h ₀ + t h _n, 0 ≤ t ≤ 1 in $C^{+}$ which originates from h ₀ and ends at h _n. The transformation of L _n, z(L _n), is also a line in the positive imaginary part of complex plane with the same slope as L _n, as we have supposed that $\frac{dz (h)}{dh} \approx z^{'} (h_{0})$ for the points on L _n. Thus the point z _n also lies in the positive complex plane. In the other words, for sufficiently large n, the sequence {z _n} lies in $C^{+}$ ; hence the sequence {h _n} is in D _h. Finally, we conclude that the Stieltjes transform is defined on any such sequences and the sequence of Stieltjes transform {m(z _n)} = {m _h(h _n)} is also in $C^{+}$ for those values of n and converges to m _h(h ₀) which is a real number. Thus z(h ₀) is outside the support of eigenvalues. □

Remark 2

We must note that we use a different approach in proving Theorem 3 comparing with proof exists for the rectangular window case where the Stieltjes transform m(z) has the explicit inverse[15]. This approach is very simple and can be used in other cases where the Stieltjes transform is expressed explicitly or implicitly as a function of z.

Theorem 3 states that in order to find the support of eigenvalues, we could first find the intervals on the real line where z(h) is increasing. In a sufficiently small vicinity of these intervals on the positive imaginary part of the complex plane, it is discussed in the proof that the imaginary part of z(h) is also positive for all h in this vicinity, therefore this vicinity lies in D _h. Having a closer look at Figure 5, we find that $D_{h} \subset C^{+}$ approaches real axis only for some values of h which can be easily studied that these are the intervals for which z ^′(h)>0. Thus according to this theorem the support of eigenvalues consists of three disjoint intervals for the setting of Figure 5.

Employing Theorem 3 and plotting z(h) for h < 0 one can determine the support of eigenvalues of the SCM in the asymptotic regime. The function z(h) has asymptotes at $- \frac{1}{λ_{1}}, \dots, - \frac{1}{λ_{q}}$ with the following one-sided limits

\lim_{h↓ - \frac{1}{λ_{i}}} z (h) = + \infty, \lim_{h↑ - \frac{1}{λ_{i}}} z (h) = - \infty, \forall i = 1, \dots, q.

(51)

Figure 6 shows a typical representation of the support of eigenvalues in the signal plus noise case when c ₀ = 0.1 and the covariance matrix has four distinct eigenvalues 5, 3, 2, 1 with multiplicities α ₁ = α ₂ = α ₃ = 0.1 and α ₄ = 0.7. It can be studied that in general, z(h)→ + ∞ as h→0⁻ and z(h)→0⁺ as h→−∞ and also analogous with the rectangular window case [38] the number of extrema of z(h) (counting the multiplicities) is even and are the solutions of $\frac{dz}{dh} = 0$ . Generally, in order to determine the support of eigenvalues, we identify all intervals on the vertical axis where z(h) is increasing and in general case denote them by $S_{F, b}^{c}, b \in {1, \dots, s}$ . Removing these intervals from $R$ , what is left is S _F and according to the proof of Theorem 3. these intervals will not overlap each other. To see this, we note for each $x \in S_{F}^{c}$ , there is a unique h∈D _h, such that x=z(h). Assume that $I_{H, b}^{c}$ , b ∈ {1,…,s} are the subintervals in the h domain where z(h) is increasing. Therefore, $I_{H, b}^{c}$ uniquely determines $S_{F, b}^{c}$ , which is an interval in $S_{F}^{c}$ . The complement of these intervals are the points determine the support of eigenvalues. It can be seen in Figure 6 that the support of the distribution is the union of four clusters where each of them represents the support of the distribution of only one of the eigenvalues. This is analogous with the results proven in the literature for rectangular windows [38], i.e., in this case all eigenvalues are separable on the vertical axis.

Figure 7 illustrates the same curves for c ₀ = 0.4, i.e., the forgetting factor p is reduced compared with Figure 6. We observe that the smaller the forgetting factor of the exponential window the larger the width of the subintervals associated to distinct eigenvalues. In some cases, some of adjacent subintervals may overlap, e.g. in Figure 7, the support associated to λ ₄ = 5 and λ ₃ = 3 have overlap whereas the two smaller ones are separable. Figure 8 shows $D_{h} \subset C^{+}$ , domain of h in the complex plane, using the same setting as in Figure 7. It has been shown that D _h approaches real axis only for the values of h for which z ^′(h)>0 in Figure 7 which identifies the regions on the real axis outside of the support of eigenvalues. We observe that D _h has no intersection with the real axis between $h = - 1, \frac{- 1}{2}, \frac{- 1}{3}$ which reveals that the subintervals of support associated with three smallest eigenvalues λ=1,2,3, are not disjoint.

Figure 9, demonstrates the support of l.s.d. of SCM identified using Theorem 3 for c ₀ ∈ {0.1,0.3} and λ ₂ ∈ [1,4] with multiplicity of α ₂ = 0.1 and λ ₁ = 1 with multiplicity of α ₁ = 0.9. We observe that for large values of λ ₂, the support associated with two eigenvalues are disjoint intervals. However, these two disjoint intervals become connected as the distance between λ ₂ and λ ₁ reduces. In practice, the value of c ₀ determines the window shape and has an important impact on the width of these intervals and on the location of the breakpoint. The location of breakpoint determines the capability of the window to identify two distinct eigenvalues. Figure 9 illustrates that the larger the value of c ₀, the smaller the breakpoint of the support, i.e., by increasing p, we may be able to separate closer eigenvalues.

5.2 Limiting spectral distribution

In the noise only case, we find an explicit equation for the l.s.d. of the exponentially weighted SCM employing Lambert W function. However in the signal plus noise case, the l.s.d. can not be obtained explicitly and should be calculated numerically using (5) and (47). It is the same as the rectangular window case where the l.s.d of noise only data has M–P distribution, however there is no explicit equation for the signal plus noise case.

To find the imaginary part of the Stieltjes transform, one could alternatively find the complex roots with positive imaginary part of the inverse function z(m) for all z in the support of the eigenvalues, i.e., z ∈ S _F. Since the imaginary parts of m(z) and h(z) have the same sign and there is no explicit expression for z(m), we find the complex roots of z(h) using (47) and (48) for any real x _h = z(h)∈S _F, where Re {h}∈(h _b−,h _b+), b∈{1,…,s}. This can be done by finding ν = Im{h} for which Im {z(h)} = 0. By inserting the calculated h in (49), we obtain the Stieltjes transform for x _h∈S _F. Finally F ^R(x) is obtained using (5). According to Proposition 1, for any $z \in C^{+}$ there exists a unique h satisfying (47) and (48), thus the above procedure results in the desired value of h and m.

6 Simulation results

In Figure 10, we plot the density functions and a histogram to show the accuracy of the derived l.s.d.’s in this article for an array with a finite dimension M = 20 and an exponential window with p = 0.975. In this case we have c ₀ = −M ln(p) = 0.5. In addition, in all our simulations, we used γ = 10⁻⁸; thus according to the definition of γ in the truncated exponential window, we have $N = \frac{ln (γ)}{ln (p)} = - ln (γ) \frac{M}{c_{0}} = 737$ and the truncated exponential window accurately describes the exponential window. In this case, the histogram of the eigenvalues is generated by 2,000 samples of SCMs, each computed from 2,000 independent data sets, where each data set consists of N independent random vectors of length M. Using the forgetting factor of the exponential window, p, the SCM is generated using $\frac{1}{N} \sum_{i = 1}^{N} p^{i} X_{i} X_{i}^{H}$ . Then using the eigenvalues of all of these SCMs the histogram of the eigenvalues of SCMs is generated. It can be observed that the histogram of the eigenvalues accurately fits the derived l.s.d. of the exponentially weighted windowed data in (27). This figure also shows results of the method in [34]. We observe that these results approximately fits the simulated data. As mentioned before, this method uses numerical calculations rather than a closed form expression. The Wishart approximation (for the effective length of window (15)) is also plotted in this figure which has a similar shape with small deviation from the histogram. As mentioned before, in some array signal processing applications, the effective length of the exponential window has been considered to be $N_{e} = \frac{1}{1 - p}$ [22, 23]. To evaluate the accuracy of this approximation, the M–P density function using this effective length is also plotted which shows a larger deviation from the simulated data.

In Figure 11, the l.s.d of exponentially windowed data is plotted for different values of p ∈ {0.95,0.97,0.98,0.99,0.995} and M = 20. We observe that as p tends to one, the eigenvalues become more concentrated around their true values. This is because the effective length of the window increases as p approaches 1.

Figure 12 shows the spectral distribution for an exponentially windowed SCM in a case where the eigenvalues are 12, 7, 3,1 with the same multiplicity ratios $α_{1} = α_{2} = α_{3} = α_{4} = \frac{1}{4}$ for two values of c ₀ = 0.1 and c ₀ = 0.4. It can be seen that as c ₀ decreases (i.e., as the forgetting factor p increases for a fixed value of array dimension M) the spectral distribution tends to concentrate around the true eigenvalues. Figure 12 shows that the supports corresponding to eigenvalues λ ₄ = 12, λ ₃ = 7 are not disjoint for c ₀ = 0.4 where as they are separate for c ₀ = 0.1. In this figure, the empirical distributions are obtained using simulation data with M = 20, $N = - ln (γ) \frac{M}{c_{0}} \in {920, 3684}$ and $p = e^{\frac{- c_{0}}{M}} \in {0.98, 0.995}$ and the l.s.d. are numerically calculated as introduced in the previous section. In this case the multiplicities of all of the eigenvalues of the covariance matrix is 5. We see that the l.s.d. fit the empirical results even for moderate and small array dimensions.

7 Conclusion

In this article the l.s.d. of SCM in the case of weighted windowed data has been studied. Defining the effective length of a window, we have approximated the distribution of the eigenvalues in the weighted window case with that of a Wishart matrix, when the number of samples are much more than array dimension. Also the connectivity condition for coefficients of the window has been developed to avoid fragmentation of the support of eigenvalues in the noise only data. For the exponential window, we have derived an exact expression for the l.s.d. of SCM which has excellent agreement with the simulation results. We have also introduced a way to analyze the support and distribution of eigenvalues in the signal plus noise data cases. The results of this work could be used in design and improvement of detectors and estimators based on weighted windowed data especially when an exponential window is employed.

Endnotes

^aFrom $\frac{w σ^{2}}{1 + cwm σ^{2}} - \frac{1}{cm} \sum_{i = 1}^{I} {(- cwm σ^{2})}^{i} = \frac{1}{cm} \frac{{(- cwm σ^{2})}^{I + 1}}{1 + cwm σ^{2}}$ , we get $| \frac{w σ^{2}}{1 + cwm σ^{2}} - \frac{1}{cm} \sum_{i = 1}^{I} {(- cwm σ^{2})}^{i} | = \frac{1}{| cm |} \frac{| cwm σ^{2} |^{I + 1}}{| 1 + cwm σ^{2} |} \leq \frac{| cm |^{I} | w σ^{2} |^{I + 1}}{1 - β}$ .

^bThe Lambert W function [33], ω (x) is also called the Omega function and is the solution of ω e ^ω = z for any complex number z. This equation is not injective, thus the function ω(z) is multivalued and has a set of different branches named ω _k(z) for any integer k. For real values of z, there exist two real valued branches of Lambert W function ω ₀ (z) and ω ₋₁ (z) which take on real values for $z \in [- \frac{1}{e}, \infty) \cup [- \frac{1}{e}, 0)$ and complex values, otherwise. The function ω ₀(z) is referred to as the principal branch of the Lambert W function and shown by ω(z) for simplicity.

References

Silverstein J, Tulino A: Theory of large dimensional random matrices for engineers. In IEEE Ninth International Symposium on Spread Spectrum Techniques and Applications. Manaus-Amazon, Brazil; 2006:458-464.
Google Scholar
Yazdian E, Gazor S, Bastani M: Source enumeration in large arrays using moments of eigenvalues and relatively few samples. IET Signal Process 2012, 6(7):689-696. 10.1049/iet-spr.2011.0260
Article MathSciNet Google Scholar
Tadaion A, Derakhtian M, Gazor S, Aref M: A fast multiple-source detection and localization array signal processing algorithm using the spatial filtering and ML approach. IEEE Trans. Signal Process 2007, 55(5):1815-1827.
Article MathSciNet Google Scholar
Affes S, Gazor S, Grenier Y: Robust adaptive beamforming via LMS-like target tracking. In IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP-94. Adelaide, Australia; 1994:269-269.
Google Scholar
Gazor S, Affes S, Grenier Y: Robust adaptive beamforming via target tracking. IEEE Trans. Signal Process 1996, 44(6):1589-1593. 10.1109/78.506628
Article Google Scholar
Gazor S, Far R: Adaptive maximum windowed likelihood multicomponent AM-FM signal decomposition. IEEE Trans. Audio Speech Lang. Process 2006, 14(2):479-491.
Article Google Scholar
Gazor S, Shahtalebi K: A new NLMS algorithm for slow noise magnitude variation. IEEE Signal Process. Lett 2002, 9(11):348-351.
Article Google Scholar
Kritchman S, Nadler B: Non-parametric detection of the number of signals: hypothesis testing and random matrix theory. IEEE Trans. Signal Process 2009, 57(10):3930-3941.
Article MathSciNet Google Scholar
Mestre X: On the asymptotic behavior of the sample estimates of eigenvalues and eigenvectors of covariance matrices. IEEE Trans. Signal Process 2008, 56(11):5353-5368.
Article MathSciNet Google Scholar
Couillet R, Silverstein J, Bai Z, Debbah M: Eigen-inference for energy estimation of multiple sources. IEEE Trans. Inf. Theory 2011, 57(4):2420-2439.
Article MathSciNet Google Scholar
Wishart J: The generalised product moment distribution in samples from a normal multivariate population. Biometrika 1928, 20(1/2):32-52.
Article Google Scholar
James A: Distributions of matrix variates and latent roots derived from normal samples. Ann. Math. Stat 1964, 35(2):475-501. 10.1214/aoms/1177703550
Article Google Scholar
Chiani M, Win M, Zanella A: On the capacity of spatially correlated MIMO Rayleigh-fading channels. IEEE Trans. Inf. Theory 2003, 49(10):2363-2371. 10.1109/TIT.2003.817437
Article MathSciNet Google Scholar
Marčenko V, Pastur L: Distribution of eigenvalues for some sets of random matrices. Math. USSR-Sbornik 1967, 1: 457. 10.1070/SM1967v001n04ABEH001994
Article Google Scholar
Silverstein J, Choi S: Analysis of the limiting spectral distribution of large dimensional random matrices. J. Multivar. Anal 1995, 54(2):295-309. 10.1006/jmva.1995.1058
Article MathSciNet Google Scholar
Dozier R, Silverstein J: Analysis of the limiting spectral distribution of large dimensional information-plus-noise type matrices. J. Multivar. Anal 2007, 98(6):1099-1122. 10.1016/j.jmva.2006.12.005
Article MathSciNet Google Scholar
Baik J, Silverstein J: Eigenvalues of large sample covariance matrices of spiked population models. J. Multivar. Anal 2006, 97(6):1382-1408. 10.1016/j.jmva.2005.08.003
Article MathSciNet Google Scholar
El Karoui N: Tracy–Widom limit for the largest eigenvalue of a large class of complex sample covariance matrices. Ann. Prob 2007, 35(2):663-714. 10.1214/009117906000000917
Article MathSciNet Google Scholar
Baik J, Ben Arous G, Péché S: Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices. Ann. Prob 2005, 33(5):1643-1697. 10.1214/009117905000000233
Article Google Scholar
Ganesan G, Li Y: Cooperative spectrum sensing in cognitive radio, part I: two user networks. IEEE Trans. Wirel. Commun 2007, 6(6):2204-2213.
Article Google Scholar
Ganesan G, Li Y: Agility improvement through cooperative diversity in cognitive radio. In Global Telecommunications Conference, GLOBECOM’05. IEEE. St. Louis, MO; 2005:5-5.
Google Scholar
Champagne B: Adaptive eigendecomposition of data covariance matrices based on first-order perturbations. IEEE Trans. Signal Process 1994, 42(10):2758-2770. 10.1109/78.324741
Article Google Scholar
Valaee S, Kabal P: An information theoretic approach to source enumeration in array signal processing. IEEE Trans. Signal Process 2004, 52(5):1171-1178. 10.1109/TSP.2004.826168
Article MathSciNet Google Scholar
Ouyang S, Hua Y: Bi-iterative least-square method for subspace tracking. IEEE Trans. Signal Process 2005, 53(8):2984-2996.
Article MathSciNet Google Scholar
Doukopoulos X, Moustakides G: Fast and stable subspace tracking. IEEE Trans. Signal Process 2008, 56(4):1452-1465.
Article MathSciNet Google Scholar
Burda Z, Jurkiewicz J, Wacław B: Spectral moments of correlated Wishart matrices. Phys. Rev. E Stat. Nonlinear Soft Mat. Phys 2005, 71(2 Pt 2):026111.
Article Google Scholar
Forrester P: Eigenvalue distributions for some correlated complex sample covariance matrices. J. Phys. A Math. Theor 2007., 40:
Google Scholar
Zhang L: Spectral analysis of large dimensional random matrices, Ph.D. Thesis. 2006.
Google Scholar
Paul D, Silverstein J: No eigenvalues outside the support of the limiting empirical spectral distribution of a separable covariance matrix. J. Multivar. Anal 2009, 100: 37-57. 10.1016/j.jmva.2008.03.010
Article MathSciNet Google Scholar
Zhang H, Jin S, Zhang X, Yang D: On marginal distributions of the ordered eigenvalues of certain random matrices. EURASIP J. Adv. Signal Process 2010, 2010: 67.
Google Scholar
Yazdian E, Bastani MH, Gazor S: Spectral distribution of the exponentially windowed sample covariance matrix. In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Kyoto, Japan; 2012:3529-3532.
Google Scholar
Bai Z, Silverstein J: Spectral Analysis of Large Dimensional Random Matrices. : Springer; 2010.
Book Google Scholar
Corless R, Gonnet G, Hare D, Jeffrey D, Knuth D: On the Lambert W function. Adv. Comput. Math 1996, 5: 329-359. 10.1007/BF02124750
Article MathSciNet Google Scholar
Pafka S, Potters M, Kondor I: Exponential weighting and random-matrix-theory-based filtering of financial covariance matrices for portfolio optimization. 2004.http://arxiv.org/abs/cond-mat/0402573 (available at )
Google Scholar
DE LACHAPELLE D: Modern portfolio theory revisited: from real traders to new methods. 2012.
Google Scholar
Daly J, Crane M, Ruskin H: Random matrix theory filters in portfolio optimisation: a stability and risk assessment. Phys. A Stat. Mech. Appl 2008, 387(16):4248-4260. 10.1016/j.physa.2008.02.045
Article Google Scholar
Potters M, Bouchaud J, Laloux L: Financial applications of random matrix theory: old laces and new pieces. Acta Physica Polonica B 2005, 36: 2767.
MathSciNet Google Scholar
Bai Z, Silverstein J: Exact separation of eigenvalues of large dimensional sample covariance matrices. Ann. Prob 1999, 27(3):1536-1555. 10.1214/aop/1022677458
Article MathSciNet Google Scholar

Download references

Acknowledgments

This work is supported in part by Iran Telecommunication Research Center (ITRC).

Author information

Authors and Affiliations

Department of Electrical and Computer Engineering, Isfahan University of Technology, Isfahan, Iran
Ehsan Yazdian
Department of Electrical and Computer Engineering, Queens University, Kingston, ON, Canada
Saeed Gazor
Electrical Engineering Department, Sharif University of Technology, Tehran, Iran
Mohammad Hasan Bastani

Authors

Ehsan Yazdian
View author publications
You can also search for this author in PubMed Google Scholar
Saeed Gazor
View author publications
You can also search for this author in PubMed Google Scholar
Mohammad Hasan Bastani
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ehsan Yazdian.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Authors’ original file for figure 12

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Yazdian, E., Gazor, S. & Bastani, M.H. Limiting spectral distribution of the sample covariance matrix of the windowed array data. EURASIP J. Adv. Signal Process. 2013, 42 (2013). https://doi.org/10.1186/1687-6180-2013-42

Download citation

Received: 13 August 2012
Accepted: 05 February 2013
Published: 06 March 2013
DOI: https://doi.org/10.1186/1687-6180-2013-42

Limiting spectral distribution of the sample covariance matrix of the windowed array data

Abstract

Introduction

2 System model for windowed SCM

Definition 1

Theorem 1

Proof 1

3 Effective length of a window

Definition 2

4 Spectral analysis of noise-only data

Lemma 1 ([32], Lemma 6.1)

4.1 Discrete distribution function approach

4.2 Continuous function approach

Theorem 2

Proof 2

Remark 1

5 Spectral analysis of signal plus noise data

Proposition 1

Proof 3

5.1 Support of eigenvalues

Theorem 3

Proof 4

Lemma 2

Proof

Lemma 3

Proof 5

Remark 2

5.2 Limiting spectral distribution

6 Simulation results

7 Conclusion

Endnotes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords