Prediction on widely factorizable signals

Fernández-Alcalá, Rosa M.; Navarro-Moreno, Jesús; Ruiz-Molina, Juan C.

doi:10.1186/1687-6180-2012-96

Research
Open access
Published: 01 May 2012

Prediction on widely factorizable signals

Rosa M. Fernández-Alcalá¹,
Jesús Navarro-Moreno¹ &
Juan C. Ruiz-Molina¹

EURASIP Journal on Advances in Signal Processing volume 2012, Article number: 96 (2012) Cite this article

2018 Accesses
1 Citations
Metrics details

Abstract

This article extends the solutions to the prediction problem for factorizable real random signals to the class of improper complex-valued random signals. For that, the concept of widely factorizable signals is introduced and several real examples of signals having a widely factorizable correlation function are presented. A widely linear processing is considered in the design of both linear and nonlinear prediction algorithms which are computationally feasible from the practical standpoint. These algorithms are valid for stationary as well as nonstationary signals and they can be applied from only the knowledge of the second-order properties of the augmented vectors involved, it not being necessary to know if the signal satisfies a state-space model.

1 Introduction

In recent research, estimation theory becomes very relevant within the complex random signals field. In fact, although traditionally the treatment of this type of problem has consisted in mere extensions of the vectorial real-valued estimation algorithms to the complex plane, a complex formalism has a special value in the description of some physical systems in such diverse fields as communications, oceanography, meteorology and optics among others (see, for example, [1] and the references therein).

The classic processing with such signals, called strictly linear (SL), assumes that the signals involved are proper.^a However, this assertion is not always justified. The improper nature of some signals requires that the pseudocorrelation function must be taken into account in describing and characterizing their second order properties completely [2–4].

With this motivation, [2] introduces a new methodology called widely linear (WL) whose more notable characteristic is that it utilizes not only the observed signal but also its conjugate to obtain estimators with better behavior than the conventional ones in the sense of reducing the mean square estimation error. Moreover, this kind of processing has become very usual in the last decade for designing linear and nonlinear estimation algorithms from a discrete-time [1, 5–9] as well as a continuous-time perspective of the problem [10]. Specifically, focussing our attention on the discrete case, the recent books of Mandic and Goh [1] and Adali and Haykin [11] about WL adaptive systems can be considered as two reference texts in this area which provide a unified treatment of linear and nonlinear complex-valued adaptive filters.

On the other hand, knowledge of second-order statistics is a key assumption in solving estimation problems (see, for example, [12]). In practice this knowledge is available because second-order statistics of the problem have been measured experimentally or they are understandable enough from the physical mechanism [13]. In this framework, WL estimation algorithms for computing all types (filtering, prediction, and smoothing) of estimates have been devised in [8] for second-order stationary (SOS) signals, i.e., those signals with constant mean function and both correlation and pseudocorrelation functions only dependent on the difference of time instants. Although the WL estimation problem considered is very general, its applicability is limited to SOS signals.

In the real field, an alternative estimation methodology based on correlation information has been developed to solve very general linear and nonlinear estimation problems for the class of factorizable signals (see, for example, [14, 15]). This type of signal is characterized by having a factorizable kernel which is a very general condition valid for stationary and nonstationary signals. This formulation does not require postulating a dynamical model for the signal and thus, it is useful whenever the physical mechanism generating the signal of interest is not known or if it impossible to determine (for instance, if it does not satisfy a state-space model).

In this article, our objective is to extend this last methodology to the complex plane. For that, we introduce a new class of signals, called widely factorizable, whose correlation function of the augmented vector formed by the signal and its conjugate is a factorizable kernel. Then, a WL processing is employed in the development of both linear and nonlinear prediction algorithms for widely factorizable signals observed in the presence of noise. With this aim, in Section 2 widely factorizable signals are defined and also the basic notation and concepts about complex-valued signals are summarized. Next, the linear augmented complex prediction problem is addressed in Section 3 where a recursive algorithm is provided for computing the optimal WL prediction estimate as well as its associated error. This section also includes three numerical examples which show the enhancement of the proposed WL predictor in relation to the SL solution. Finally, by following a similar reasoning to the extended Kalman filter (EKF), the nonlinear augmented complex prediction problem is solved in Section 4 where a numerical example is also developed in order to compare the proposed WL algorithm with two conventional nonlinear augmented complex techniques, the WL EKF and the WL unscented Kalman filter (UKF) suggested in [1]. Furthermore, these solutions are also compared with the SL EKF and SL UKF.

2 Preliminaries

This section covers some of the more basic notions associated with the complex-valued random signals field and introduces a new class of signals, called widely factorizable. Moreover, the notation and hypotheses held throughout the article are also established in this section.

First of all, note that all vectors will be denoted by bold small letters and bold capital letters will be used for matrices. Also, row k of any matrix A(·) will be denoted by a_[k](·). Furthermore, 0_pand 0_{p × q}represent the p-vector and the p × q-matrix, respectively, whose elements are all zeros.

Unless indicated to the contrary, throughout this article we consider an improper complex-valued signal {s(t_i ), t_i ∈ T}, T = {t₁, t₂, ...}, with zero-mean and correlation function r_s (t_i, t_j ) = E[s(t_i )s* (t_j )], where the superscript '*' represents the complex conjugate.

The signal {s(t_i ), t_i ∈ T} is said to be factorizable if there exist two l-vectors α(t_i ) and β(t_i ) such that its correlation function r_s (t_i, t_j ) can be expressed in the form

r_{s} (t_{i}, t_{j}) = \{\begin{gathered} α^{T} (t_{i}) β^{*} (t_{j}), t_{i} \geq t_{j} \\ β^{T} (t_{i}) α^{*} (t_{j}), t_{i} \leq t_{j} \end{gathered}

(1)

where the superscript 'T' denotes the transpose. Note that this type of signal is very general and includes both stationary and nonstationary signals. However, a new class of signals is possible by imposing the condition of factorizable kernel on the correlation function R_s(t_i, t_j ) = E[s(t_i )s^H(t_j )], with the superscript 'H' denoting the conjugate transpose, of the augmented vector s(t_i ) = [s(t_i ), s* (t_i )] ^T. This type of signal, called widely factorizable, is introduced next.

Definition 2.1 A signal {s(t_i ), t_i ∈ T} is said to be widely factorizable if and only if there exist two 2 × m-matrices A(t_i ) and B(t_i ) such that the correlation function R_s(t_i, t_j ) of the augmented vector s(t_i ) can be expressed in the form

R_{s} (t_{i}, t_{j}) = \{\begin{gathered} A (t_{i}) B^{H} (t_{j}), t_{i} \geq t_{j} \\ B (t_{i}) A^{H} (t_{j}), t_{i} \leq t_{j} \end{gathered}

(2)

where the superscript 'H' denotes the conjugate transpose.

Note that condition (2) implies (1), however condition (1) does not assure that the correlation function of the augmented vector satisfies (2). As a consequence, we have that all widely factorizable signals are also factorizable but the converse does not hold.

Some illustrative examples of widely factorizable signals are the following:

i) The rotation of a real factorizable zero-mean signal x(t_i ) by an independent random phase θ, s(t_i ) = e^{θ j}x(t_i ), with $j = \sqrt{- 1}$ . Assume

r_{x} (t_{i}, t_{j}) = \{\begin{gathered} a^{T} (t_{i}) b (t_{j}), t_{i} \geq t_{j} \\ b^{T} (t_{i}) a (t_{j}), t_{i} \leq t_{j} \end{gathered}

(3)

with a(t_i ) and b(t_i ) two real l-vectors. Thus,

A (t_{i}) = (\begin{matrix} 0_{l}^{T} & a^{T} (t_{i}) \\ a^{T} (t_{i}) & 0_{l}^{T} \end{matrix}), B^{H} (t_{i}) = (\begin{matrix} φ_{θ} (- 2) b (t_{i}) & b (t_{i}) \\ b (t_{i}) & φ_{θ} (2) b (t_{i}) \end{matrix})

(4)

where φ_θ (·) is the characteristic function of θ.

ii) The seismic ground acceleration can be represented by a uniformly modulated nonstationary process given by s(t_i ) = d(t_i )x(t_i ), where x(t_i ) is a stationary process with zero mean and known second-order statistics, and d(t_i ) is the time modulating function. Both x(t_i ) and d(t_i ) can be either real or complex-valued [16]. Thus, if d(t_i ) is complex and x(t_i ) is real and factorizable with correlation function given by (3) then, s(t_i ) is widely factorizable being

A (t_{i}) = (\begin{gathered} d (t_{i}) a^{T} (t_{i}) \\ d^{*} (t_{i}) a^{T} (t_{i}) \end{gathered}), B^{H} (t_{i}) = (b (t_{i}) d^{*} (t_{i}) b (t_{i}) d (t_{i}))

Common choices are $d (t_{i}) = e^{j 2 π ω_{0} t_{i}}$ with ω₀ a constant frequency and x(t_i ) the Ornstein-Uhlenbeck process.

On the other hand, if both d(t_i ) and x(t_i ) are complex and x(t_i ) is also widely factorizable with

R_{x} (t_{i}, t_{j}) \{\begin{gathered} E (t_{i}) F^{H} (t_{j}), t_{j} \geq t_{j} \\ F (t_{i}) E^{H} (t_{j}), t_{i} \leq t_{j} \end{gathered}

then, s(t_i ) is widely factorizable with

A (t_{i}) = (\begin{matrix} d (t_{i}) & 0 \\ 0 & d^{*} (t_{i}) \end{matrix}) E (t_{i}), B^{H} (t_{i}) = F^{H} (t_{i}) (\begin{matrix} d^{*} (t_{i}) & 0 \\ 0 & d (t_{i}) \end{matrix})

iii) In electromagnetic theory, the time-varying position of the electric field vector can be represented as the following signal $s (t_{i}) = A e^{j w_{0} t_{i}} + B e^{- j w_{0} t_{i}}$ where the expressions for the random coefficients A and B can be found in [[17], p. 7]. Hence, s(t_i ) is widely factorizable with

A (t_{i}) = {(\begin{gathered} r_{A} e (t_{i}) + r_{A B}^{*} e^{*} (t_{i}) γ_{A}^{*} e^{*} + γ_{A B}^{*} e (t_{i}) \\ r_{B} e^{*} (t_{i}) + r_{A B} e (t_{i}) γ_{B}^{*} e (t_{i}) + γ_{A B}^{*} e^{*} (t_{i}) \\ γ_{B} e^{*} (t_{i}) + γ_{A B} e (t_{i}) r_{B} e (t_{i}) + r_{A B}^{*} e^{*} (t_{i}) \\ γ_{A B} e^{*} (t_{i}) + γ_{A} e (t_{i}) r_{A} e^{*} (t_{i}) + r_{A B} e (t_{i}) \end{gathered})}^{T}

(5)

B^{H} (t_{i}) = {(\begin{matrix} e (t_{i}) & e^{*} (t_{i}) \\ 0 & 0 \end{matrix} \begin{matrix} 0 & 0 \\ e (t_{i}) & e^{*} (t_{i}) \end{matrix})}^{H}

where $e (t_{i}) = e^{j ω_{0} t_{i}}, r_{A} = E [A A^{*}], r_{A B} = E [A B^{*}], γ_{A} = E [A A], γ_{B} = E [B B]$ , and γ_AB= E[AB].

iv) A signal widely used in many areas of signal processing is a linear frequency modulation or chirp with random phase. The chirp process can be expressed as $s (t_{i}) = e^{j π (2 α t_{i} + β t_{i}^{2} + 2 θ)}$ where θ is the random phase, α determines the starting instantaneous frequency of the chirp and β is the chirp rate. Then, the chirp process is widely factorizable with

A (t_{i}) = (\begin{matrix} c (t_{i}) & 0 \\ 0 & c^{*} (t_{i}) \end{matrix})

B^{H} (t_{i}) = (\begin{matrix} (1 - {|φ_{θ} (2 π)|}^{2}) c^{*} (t_{i}) & (φ_{θ} (4 π) - φ_{θ}^{2} (2 π)) c (t_{i}) \\ {(φ_{θ} (4 π) - φ_{θ}^{2} (2 π))}^{*} c^{*} (t_{i}) & (1 - {|φ_{θ} (2 π)|}^{2} c (t_{i})) \end{matrix})

with $c (t_{i}) = e^{j π (2 α t_{i} + β t_{i}^{2})}$ and φ_θ (·) the characteristic function of θ.

v) An application of the complex Ornstein-Uhlenbeck process is the description of the motion of the instantaneous axis of the Earth's rotation [18]. This motion has an 1 year period and if it is removed, there remains the so-called Chandler Wobble, which has a period of about 435 days (14 months). Kolmogorov proposed the complex stochastic process $s (t_{i}) = x (t_{i}) + j y (t_{i}) = σ e^{j 2 π t_{i}} + ξ (t_{i})$ to describe the Chandler Wobble, i.e., the motion of the pole, where x(t_i ) and y(t_i ) are the coordinates of the deviation of the instantaneous pole from the North Pole. In that model the first term is a periodical component, and the second term ξ(t_i ) is a complex Ornstein-Uhlenbeck process. It is not difficult to check that the signal s(t_i ) is proper and then,

A (t_{i}) = (\begin{matrix} \frac{1}{λ} e^{(ω j{λ) t_{i}} & 0 \\ 0 & \frac{1}{λ} e^{- (ω j - λ) t_{i}} \end{matrix}), B^{H} (t_{i}) = (\begin{matrix} e^{(- ω j + λ) t_{i}} & 0 \\ 0 & e^{(ω j + λ) t_{i}} \end{matrix})

where λ > 0 is the drift parameter and ω ∈ ℝ is the period.

On the other hand, we provide a simple example of a factorizable signal which is not widely factorizable: a zero-mean complex-valued signal {s(t_i ), t_i = i/100, i = 1, ..., 100} with $r_{s} (t_{i}, t_{j}) = 2 e^{- |t_{i} - t_{j}|}$ and $ρ_{s} (t_{i}, t_{j}) = j 0.5 e^{t_{i} t_{j}} .$

In the following, we also assume that the signal of interest s(t_i ) is widely factorizable in the sense of Definition 2.1.

Finally, R_sy(t_i, t_j ) = E[s(t_i )y^H(t_j )] denotes the cross-correlation function between any two augmented signals s(t_i ) and y(t_i ), and r_{s
y}(t_i, t_j ) = E[s(t_i )y^H(t_j )] represents the cross-correlation function between s(t_i ) and the augmented vector y(t_i ).

3 Linear augmented complex prediction

Assume that the signal s(t_i ) established in Section 2 is observed through the following linear equation:

y (t_{i}) = g (t_{i}) s (t_{i}) + v (t_{i}), t_{1} \leq t_{i} \leq t_{n}

(6)

where g(t_i ) is a deterministic complex-valued function and v(t_i ) is a doubly white noise^b correlated with s(t_i ). Moreover, the cross-correlation function and the augmented signal s(t_i ) and the augmented noise v(t_i ) is of the form

R_{s v} (t_{i}, t_{j}) = \{\begin{gathered} C (t_{i}) D^{H} (t_{j}), t_{i} \geq t_{j} \\ E (t_{i}) F^{H} (t_{j}), t_{i} \leq t_{j} \end{gathered}

(7)

where C(t_i ), D(t_i ), E(t_i ), and F(t_i ) are matrices of dimensions 2 × l, 2 × l, 2 × l^', and 2 × l^', respectively.

We consider the problem of obtaining the optimal (in the sense of minimizing the WL mean square error) estimator^c of the signal s(t_k ) as a function of the information given by the observations {y(t₁), ..., y(t_n ), y*(t₁), ..., y*(t_n )}, for t_k ≥ t_n . It is known that such an estimator can be expressed as a linear function of the set of augmented observations {y(t₁), ..., y(t_n )} as follows [2]

ŝ (t_{k} | t_{n}) = \sum_{j = 1}^{n} h^{T} (t_{k}, t_{j}, t_{n}) y (t_{j}), t_{k} \geq t_{n}

(8)

where the 2D vector h(t_k, t_j, t_n ), called the impulse response function, satisfies the equation

r_{s y} (t_{k}, t_{j}) = \sum_{i = 1}^{n} h^{T} (t_{k}, t_{i}, t_{n}) R (t_{i}, t_{j}) + h^{T} (t_{k}, t_{j}, t_{n}) \sum, t_{1} \leq t_{j} \leq t_{n}, t_{k} \geq t_{n}

(9)

where $R (t_{i}, t_{j}) = G (t_{i}) R_{s} (t_{i}, t_{j}) G^{H} (t_{j}) + G (t_{i}) R_{s v} (t_{i}, t_{j}) + R_{s v}^{H} (t_{j}, t_{i}) G^{H} (t_{j})$ , with G(t_i ) denoting the 2 × 2-diagonal matrix G(t_i ) = diag(g(t_i ), g*(t_i )), and E[v(t_i )v^H(t_i )] = Σ.

From (2) and (7), it is easy to check that r_{s
y}(t_k, t_j ) and R(t_i, t_j ) can be written as follows:

r_{s y} (t_{k}, t_{j}) = \{\begin{gathered} ψ_{[1]} (t_{k}) Γ^{H} (t_{j}), t_{k} \geq t_{j} \\ π_{[1]} (t_{k}) Φ^{H} (t_{j}), t_{k} \leq t_{j} \end{gathered}

(10)

R (t_{i}, t_{j}) = \{\begin{gathered} Φ (t_{i}) Γ^{H} (t_{j}), t_{i} \geq t_{j} \\ Γ (t_{i}) Φ^{H} (t_{j}), t_{i} \leq t_{j} \end{gathered}

(11)

where ψ_[1](t_k ) is the first row of the 2 × q-matrix $Ψ (t_{i}) = [A (t_{i}), C (t_{i}), 0_{2 \times l^{'}}]$ , π_[1](t_k ) is the first row of the 2×q-matrix Π(t_i ) = [B(t_i ), 0_2×l, E(t_i )], Φ(t_i ) = [G(t_i )A(t_i ), G(t_i ) C(t_i ), F(t_i )], and Γ(t_i ) = [G(t_i ) B(t_i ), D(t_i ), G(t_i ) E(t_i )] are also matrices of dimensions 2 × q, with q = m + l + l^' .

Although the problem is completely determined from the computation of the impulse response function by solving Equation (9), our aim here is to provide a recursive algorithm for its computation. Next, the recursive formulas for computing the estimator (8) and its associated error p(t_k |t_n ) = E[|s(t_k ) - ŝ(t_k |t_n )|²] are devised.

Theorem 3.1 The optimal WL estimate ŝ(t_k |t_n ) defined in (8) can be recursively computed as follows:

ŝ (t_{k} | t_{n}) = ψ_{[1]} (t_{k}) ϵ (t_{n}), t_{k} \geq t_{n}

(12)

where the q-vector ϵ (t_n ) is recursively computed from the expression

\begin{gathered} ϵ (t_{n}) = ϵ (t_{n - 1}) + J (t_{n}, t_{n}) [y (t_{n}) - Φ (t_{n}) ϵ (t_{n - 1})] \\ ϵ (t_{0}) = 0_{q} \end{gathered}

with the q × 2-matrix J(t_n, t_n ) given by the equation

J (t_{n}, t_{n}) = [Γ^{H} (t_{n}) - Q (t_{n - 1}) Φ^{H} (t_{n})] Ω^{- 1} (t_{n})

(14)

with the 2 × 2-matrix Ω(t_n ) = Σ + [Γ(t_n ) - Φ(t_n ) Q (t_n-1)] Φ ^H (t_n ) and the q × q-matrix Q(t_n ) satisfying the recursive equation

\begin{gathered} Q (t_{n}) = Q (t_{n - 1}) + J (t_{n}, t_{n}) [Γ (t_{n}) - Φ (t_{n}) Q (t_{n - 1})] \\ Q (t_{0}) = 0_{q \times q} \end{gathered}

(15)

Moreover, the associated error is given by the expression

p (t_{k} | t_{n}) = r_{s} (t_{k}, t_{k}) - ψ_{[1]} (t_{k}) Q (t_{n}) ψ_{[1]}^{H} (t_{k}), t_{k} \geq t_{n}

(16)

Proof. From (10) and (11), Equation (9) can be rewritten as

h^{T} (t_{k}, t_{j}, t_{n}) \sum = ψ_{[1]} (t_{k}) Γ^{H} (t_{j}) - \sum_{i = 1}^{n} h^{T} (t_{k}, t_{i}, t_{n}) R (t_{i}, t_{j})

Then, if we introduce a function J(t_j, t_n ) satisfying the equation

J (t_{j}, t_{n}) \sum = Γ^{H} (t_{j}) - \sum_{i = 1}^{n} J (t_{i}, t_{n}) R (t_{i}, t_{j})

(17)

we obtain that

h^{T} (t_{k}, t_{j}, t_{n}) = ψ_{[1]} (t_{k}) J (t_{j}, t_{n})

(18)

and then, substituting (18) in (8), and defining the function

\begin{gathered} ϵ (t_{n}) = \sum_{i = 1}^{n} J (t_{i}, t_{n}) y (t_{i}) \\ ϵ (t_{0}) = 0_{q} \end{gathered}

(19)

the Equation (12) for the optimal estimator is devised.

Now, subtracting the Equation (17) for t_n and t_{n- 1}and taking (11) into account, we can write

[J (t_{j}, t_{n}) - J (t_{j}, t_{n - 1})] \sum = - J (t_{n}, t_{n}) Φ (t_{n}) Γ^{H} (t_{j}) - \sum_{i = 1}^{n - 1} [J (t_{i}, t_{n}) - J (t_{i}, t_{n - 1})] R (t_{i}, t_{j})

Thus, from (17), we have the relation

J (t_{j}, t_{n}) - J (t_{j}, t_{n - 1}) = - J (t_{n}, t_{n}) Φ (t_{n}) J (t_{j}, t_{n - 1})

(20)

As a consequence, subtracting the Equation (19) for t_n and t_{n- 1}and using (20) in the resulting equation, the recursive expression (13) is obtained.

Next, we proceed to derive expression (14) for J(t_n, t_n ). By taking t_j = t_n in (17) and using (11), we have

J (t_{n}, t_{n}) \sum = Γ^{H} (t_{n}) - \sum_{i = 1}^{n} J (t_{i}, t_{n}) Γ (t_{i}) Φ^{H} (t_{n}) = [Γ^{H} (t_{n}) - Q (t_{n}) Φ^{H} (t_{n})]

(21)

where we have introduced the q × q-matrix

\begin{gathered} Q (t_{n}) = \sum_{i = 1}^{n} J (t_{i}, t_{n}) Γ (t_{i}) \\ Q (t_{0}) = 0_{q \times q} \end{gathered}

(22)

Moreover, if we subtract Q(t_{n- 1}) from Q(t_n ), and use (20) and (22) in the resulting expression, the recursive Equation (15) for Q(t_n ) is derived. Finally, using (15) in (21), it is easy to check that J(t_n, t_n ) satisfies the expression (14).

Finally, in order to derive expression (16) for the error p(t_k|t_n ) associated with the above estimate, we remark that, from the orthogonal projection lemma, this function can be expressed as

p (t_{k} | t_{n}) = r_{s} (t_{k}, t_{k}) - E [ŝ (t_{k} | t_{n}) ŝ^{*} (t_{k} | t_{n})]

Then, substituting (12) in the above equation and using (17) and (19), we check that

p (t_{k} | t_{n}) = r_{s} (t_{k}, t_{k}) - ψ_{[1]} (t_{k}) E [ϵ (t_{n}) ϵ^{H} (t_{n})] ψ_{[1]}^{H} (t_{k}) = r_{s} (t_{k}, t_{k}) - ψ_{[1]} (t_{k}) \sum_{i = 1}^{n} J (t_{i}, t_{n}) Γ (t_{i}) ψ_{[1]}^{H} (t_{k})

As a consequence, from (22), (16) is obtained.

Remark 1 When {s(t_i ), t_i ∈ T} is a factorizable real-valued signal with correlation function of the form (1), and the observations of the signal verify the complex-valued linear Equation (6) with

r_{s v} (t_{i}, t_{j}) = \{\begin{gathered} c^{T} (t_{i}) D^{H} (t_{j}), t_{i} \geq t_{j} \\ e^{T} (t_{i}) F^{H} (t_{j}), t_{i} \leq t_{j} \end{gathered}

where c(t_i ) and e(t_i ) are vectors of dimensions l and l', respectively, and D(t_i ) and F(t_i ) are matrices of respective dimensions 2 ×l and 2 ×l^', we obtain that Algorithm 3.1 holds, replacing the involved 2 × q-matrices Ψ(t_i ), Π(t_i ) by the vectors $ψ^{T} (t_{i}) = [α^{T} (t_{i}), c^{T} (t_{i}), 0_{l^{'}}^{T}]$ and $π^{T} (t_{i}) = [β^{T} (t_{i}), 0_{l}^{T}, e^{T} (t_{i})]$ , and taking the matrices Φ(t_i ) = [g(t_i ) α ^T (t_i ), g(t_i )c ^T (t_i ), F(t_i )] and Γ(t_i ) = [g(t_i )β^T(t_i ), D(t_i ), g(t_i )e ^T (t_i )], with g(t_i ) = [g(t_i ), g*(t_i )]^T .

Remark 2 The efficiency of Algorithm 3.1 is closely related to the dimensions l, l^', and m of the matrices involved in the factorizations (2) and (7). Indeed, the computational complexity of this algorithm is of order q, with q = m + l + l^', and thus, it involves a further complication in implementation and an increased computational burden as q grows. Since the factorization of the covariance is not unique then, the key question is in choosing the one which minimizes the dimension q. There exist simple cases, as illustrated previously, where the factorization is easily obtained. Nevertheless, in those more complex cases where this factorization is not trivial, one can use several methods available in the literature to get a factorization with minimum dimension (see, e.g., [19]).

3.1 Numerical examples

The advantages of the proposed Algorithm with respect to the SL solution are illustrated here through three numerical examples. The first one involves real correlation matrices and analyzes the effectiveness of the WL processing with respect to the SL one in terms of the impropriety degree of the observations. In the second example, complex correlation matrices are considered and the resulting WL and SL estimation errors are graphically compared. Finally, the third example shows a real application to seismic signal processing.

3.1.1 Example

Let {x(t_i ), t₁≤ t_i ≤ t₁₀₀}, with t_i = i/ 100, i = 1, ..., 100, be an Ornstein-Uhlenbeck process with correlation function

r_{x} (t_{i}, t_{j}) = exp (- | t_{i} - t_{j} |), t_{1} \leq t_{i}, t_{j} \leq t_{100}

which is transmitted over a channel that rotates it by a standard normal phase θ and adds a doubly white Gaussian noise v(t_i ) correlated with the signal with

R_{x v} (t_{i}, t_{j}) = (\begin{matrix} \frac{1}{25} t_{i}^{2} t_{j} & \frac{1}{25} t_{i}^{2} t_{j} \\ \frac{1}{25} t_{i}^{2} t_{j} & \frac{1}{25} t_{i}^{2} t_{j} \end{matrix})

Thus, the signal of interest is s(t_i ) = e^{θ j}x(t_i ) and the observations y(t_i ) are of the form (6) with g(t_i ) = 1. Moreover, we assume that θ is independent of x(t_i ) and v(t_i ).

Note that the correlation matrix of the augmented signal s(t_i ) can be expressed in the form (2), where A and B are as in (4) with l = 1, $a (t_{i}) = e^{- t_{i}}$ , $b (t_{i}) = e^{t_{i}}$ , and φ_θ (-2) = φ_θ (2) = e^-2. Moreover, the cross-correlation matrix between s(t_i ) and v(t_i ) is of the form (7) with D(t_i ) = F(t_i ) = [t_i /25, t_i /25]^T and $C (t_{i}) = E (t_{i}) = {[e^{- 1 / 2} t_{i}^{2}, e^{- 1 / 2} t_{i}^{2}]}^{T}$ .

In this example, we consider the problem of computing the fixed-lead predictor ŝ(t_k+10|t_k ). As a measure for comparing the performance of the WL and SL fixed-lead predictors we use the one defined in [6], the mean square of the difference between both errors, for

\sum = (\begin{matrix} 2 τ \\ τ 2 \end{matrix})

with τ varying within the interval [1, 2):

\frac{1}{100} \sum_{k = 1}^{100} {({\tilde{p}}_{τ} (t_{k + 10} | t_{k}) - p_{τ} (t_{k + 10} | t_{k}))}^{2}

(24)

with p_τ (t_k+10|t_k ) and ${\tilde{p}}_{τ} (t_{k + 10} | t_{k})$ denoting the WL and SL fixed-lead prediction errors, respectively, for every value τ. The results obtained are displayed in Figure 1 which shows that, not only the WL fixed-lead predictor presents a better behavior than the SL fixed-lead predictor but also the difference between both errors (in the mean square sense) increases with τ , and hence, the WL technique becomes more effective.

3.1.2 Example

Let {s(t_i ), t₁≤ t_i ≤ t₁₀₀}, with t_i = i/100, i = 1, ..., 100, be a signal of the form

s (t_{i}) = A e^{j t_{i}} + B e^{- j t_{i}}

where A and B are complex random variables. In this example, the signal is assumed to be widely stationary, that is E[AA] = E[BB] = E[AB*] = 0 (see [[17], p. 24]), and also we consider that E[AA*] = E[BB*] = 1 and E[AB] = -0.8. Thus, by using (5), the correlation matrix of the signal s(t_i ) can be expressed in the form (2). Moreover, we consider that the observations y(t_i ) are of the form (6) with g(t_i ) = 1 and where the noise ν(t_i ) is uncorrelated with the signal and its augmented variance matrix is

\sum = (\begin{gathered} 2 1 \\ 1 2 \end{gathered})

On the basis of the set of observations {y(t₁), y(t₂), ..., y(t₁₀₀)}, we consider the problem of computing the fixed-lead predictor ŝ(t_k+10|t_k ). Then, Algorithm 3.1 is used to obtain the WL fixed-lead prediction error p_τ (t_k+10|t_k ) which is compared with the SL fixed-lead prediction error ${\bar{p}}_{τ} (t_{k + 10} | t_{k})$ in Figure 2. As could be expected, this figure shows that the WL fixed-lead predictor presents a better behavior than the SL fixed-lead predictor. Finally, Figure 3 depicts the performance measure given by (24) with τ ∈ [1, 2) and Σ as in (23). Again, the improved precision attained with the WL fixed-lead predictor with respect to the SL fixed-lead predictor is observed as τ increases.

3.1.3 Example

As indicated in Section 2, uniformly modulated nonstationary processes are often used to model seismic records, especially acceleration records. The modulated nonstationary process is given by s(t_i ) = d(t_i )x(t_i ), where x(t_i ) is a stationary process with zero mean and known second-order statistics, and d(t_i ) is the time modulating function. A stochastic earthquake model commonly used for x(t_i ) is the Kanai-Tajimi process (see, e.g., [20]). It is well-known that the Kanai-Tajimi earthquake model is covariance equivalent with the subset of the ARMA(2,1) model corresponding to a unit value of the spring-dashpot input ratio [21]. For firm ground conditions, at moderate epicentral distance, Kanai and Tajimi have suggested specific values for the parameters in the equation of motion in continuous time whose corresponding discrete ARMA(2,1) model is [21]

x (t_{i}) - 1.604 x (t_{i - 1}) + 0.686 x (t_{i - 2}) = e (t_{i}) - 0.767 e (t_{i - 1})

with $σ_{e}^{2} = 39.08$ and Δt = t_i -t_{i- 1}= 0.02 s. This ARMA model has a factorizable correlation function given by (3). Moreover, we have chosen the modulating function $d (t_{i}) = e^{j t_{i}}$ and the augmented error covariance matrix of the form

\sum = (\begin{matrix} 1 τ \\ τ 1 \end{matrix})

with τ ∈ [0, 1). Here we have studied the filtering problem. Figure 4 illustrates the enhancement of the WL filter in relation to the SL one by using the measure

\frac{1}{100} \sum_{k = 1}^{100} {({\tilde{p}}_{τ} (t_{k} | t_{k}) - p_{τ} (t_{k} | t_{k}))}^{2}

Similar to the previous examples, we observe the superiority of the WL estimate as τ increases.

4 Nonlinear augmented complex prediction

Given the same conditions on the signal established in Section 2, we suppose that the ob-servation process can be given by a nonlinear relation of the form

y (t_{i}) = z (s (t_{i}), t_{i}) + ν (t_{i}), t_{1} \leq t_{i} \leq t_{n}

(25)

where z(·) is a complex-valued nonlinear function and the signal s(t_i ) is uncorrelated with the noise ν(t_i ). As in the linear case, the aim here is to estimate the signal s(t_k ) on the basis of the observation set {y(t₁), ..., y(t_n ), y*(t₁), ..., y*(t_n )}, with t_k ≥ t_n .

The solution to this problem is addressed by following a similar philosophy to the EKF [22]. Specifically, following the basic idea of the EKF the nonlinear function z(s(t_n ), t_n ) is linearized at each time instant by a first-order Taylor series expanded about the estimated signal ŝ(t_n |t_n-1)

z (s (t_{n}), t_{n}) \approx z (ŝ (t_{n} | t_{n - 1}), t_{n}) + \frac{\partial z (s, t_{n})}{\partial s} |_{s = ŝ (t_{n} | t_{n - 1})} (s (t_{n}) - ŝ (t_{n} | t_{n - 1}))

Consequently, we can proceed to approximate the nonlinear observation Equation (25) as shown by

ȳ (t_{n}) \approx g (t_{n}) s (t_{n}) + v (t_{n})

(26)

where

ȳ (t_{n}) = y (t_{n}) - z (ŝ (t_{n} | t_{n - 1}), t_{n}) + g (t_{n}) ŝ (t_{n} | t_{n - 1})

and

g (t_{n}) = {\frac{\partial z (s, t_{n})}{\partial s}|}_{s = ŝ (t_{n} | t_{n - 1})}

(27)

and thus, s(t_k ) can be estimated in terms of the set of observations ${ȳ (t_{1}), \dots, ȳ (t_{n}), ȳ^{*} (t_{1}), \dots, ȳ^{*} (t_{n})}$ from the relation (26). For that, the formulas given in Algorithm 3.1 for the WL predictor can be used. Note that in the case of the signal and observation noise being uncorrelated, Algorithm 3.1 holds with Ψ(t_i ) = A(t_i ), Π(t_i ) = B(t_i ), Φ(t_i ) = G(t_i )A(t_i ) and Γ(t_i ) = G(t_i )B(t_i ). Then, as in the EKF, in the resulting formulas we also use the linearized observation function (27) in place of the previous function g(t_n ) and the term G(t_n )A(t_n )ϵ(t_{n- 1}) is replaced by the vector

z (ŝ (t_{n} | {t_{n -}}_{1}), t_{n}) = {[z (ŝ (t_{n} | {t_{n -}}_{1}), t_{n}), z^{*} (ŝ (t_{n} | {t_{n -}}_{1}), t_{n})]}^{T}

Next the formulas of the proposed Algorithm are summarized.

Theorem 4.1 A nonlinear augmented complex predictor of the signal s(t_k ) based on the set of nonlinear observations {y(t₁), ..., y(t_n ), y* (t₁), ..., y*(t_n )} of the form (25) can be determined through the equation

ŝ (t_{k} | t_{n}) = a_{[1]} (t_{k}) ϵ (t_{n})

where the m-vector ϵ (t_n ) is recursively computed from the expression

\begin{gathered} ϵ (t_{n}) = ϵ (t_{n - 1}) + J (t_{n}, t_{n}) [y (t_{n}) - z (ŝ (t_{n} | t_{n - 1}), t_{n})] \\ ϵ (t_{0}) = 0_{m} \end{gathered}

with

J (t_{n}, t_{n}) = [B^{H} (t_{n}) - Q (t_{n - 1}) A^{H} (t_{n})] G^{H} (t_{n}) Ω^{- 1} (t_{n})

where Ω(t_n ) = Σ + G(t_n ) [B(t_n ) - A(t_n )Q(t_{n- 1})] A ^H (t_n )G ^H (t_n ), G(t_n ) = diag(g(t_n ), g*(t_n )), with $g (t_{n}) = \partial z (s, t_{n}) / \partial s |_{s = ŝ (t_{n} | t_{n - 1})},$ and Q(t_n ) satisfies the recursive equation

\begin{gathered} Q (t_{n}) = Q (t_{n - 1}) + J (t_{n}, t_{n}) G (t_{n}) [B (t_{n}) - A (t_{n}) Q (t_{n - 1})] \\ Q (t_{0}) = 0_{m \times m} \end{gathered}

Remark 3 Similarly to Remark 1, if the signal {s(t_i ), t_i ∈ T} is a real-valued signal with factorizable kernel of the form (1) which is observed through a complex-valued nonlinear equation of the form (25), Algorithm 4.1 holds with a_[1](t_i ) = α^T(t_i ), g(t_i ) = [g(t_i ), g*(t_i )] ^T and replacing the 2 × q-matrices A(t_i ) and B(t_i ) by the vectors α (t_i ) and β (t_i ), respectively.

4.1 Numerical example

In this Example, the estimation of a real signal on the basis of a set of complex-valued observations is considered. Specifically, let {s(t_i ), t₁≤ t_i ≤ t₁₀₀} be a real Wiener process with unit variance parameter and the observation equation

y (t_{i}) = e^{s (t_{i}) j} + v (t_{i}), t_{i} = i / 100, i = 1, \dots, 100

where v(t_i ) = e^{θ j}u(t_i ), with θ a standard normal phase and u(t_i ) a white Gaussian noise with unit spectral height and uncorrelated with the signal s(t_i ).

In this case, the correlation function of the signal s(t_i ) can be expressed in the form (1) with α(t_i ) = 1 and β(t_i ) = t_i .

With the aim of examining the good behavior of the WL solution proposed in Algorithm 4.1, the estimation error of the WL filter is compared with the errors associated with SL and WL conventional Algorithms. Specifically, the standard EKF and UKF and the WL EKF and WL UKF proposed in [1] have been implemented. For that, we use the fact that the signal s(t_i ) obeys the state equation

s (t_{i}) = s (t_{i - 1}) + w (t_{i}), t_{i} = i / 100, i = 1, \dots, 100

with w(t_i ) a centered Gaussian signal with variance parameter 10^-2.

On the other hand, for computing the estimation errors, Monte Carlo simulations have been performed. Figure 5 shows the results obtained with 5000 sample paths, confirming the better behavior of a WL processing in the nonlinear estimation problem. In fact, the dashed line represents the errors associated with the SL EKF and SL UKF (the differences between them are negligible) and the solid line depicts the errors associated with the WL UKF, WL EKF and the filter given in Algorithm 4.1 (again the differences between them are negligible). Obviously, the similar behavior shown here by the three WL filters has not to be repeated in other examples. As occurs in the standard estimation techniques, there is not a best nonlinear estimator either. In each application one has to pick the appropriate nonlinear estimation method. Really, in every particular case one has to choose the estimator which is found to best trade off various properties such as estimation accuracy, ease of implementation, numerical robustness, and computational burden [22]. Note that unlike UKF and EKF, Algorithm 4.1 does not require a state space model but only the knowledge of the second-order statistics of the processes involved.

Endnotes

^aA zero-mean complex-valued signal {s(t_i ), t_i ∈ T}, T = {t₁, t₂, ..., } is said to be proper if the pseudocorrelation function, ρ_s (t_i, t_j ) = E [s(t_i )s(t_j )], is null for all t_i, t_j ∈ T. Otherwise, it is called improper. ^bv(t_i ) is said to be a doubly white noise if E[v(t_i )v* (t_j )] = σ₁δ_ij and E[v(t_i )v(t_j )] = σ₂δ_ij , with |σ₂| < σ₁ and δ_ij stands for the Kronecker delta function [3]. ^cA simple application of the Hilbert space projection theorem shows that WL estimation outperforms SL estimation for general complex-valued signals. Specifically, denote $\bar{sp} {y (t_{1}), \dots, y (t_{n})}$ and $\bar{sp} {y (t_{1}), \dots, y (t_{n}), y^{*} (t_{1}), \dots, y^{*} (t_{n})}$ the closed spans of the following sets {y(t₁), ..., y(t_n )} and {y(t₁), ..., y(t_n ), y*(t₁), ..., y (t_n )}, respectively. Let $\tilde{s} (t_{k} | t_{n})$ and ŝ(t_k |t_n ) be the projections of s(t_k ), t_k ≥ t_n , onto the spaces $\bar{sp} {y (t_{1}), \dots, y (t_{n})}$ and $\bar{sp} {y (t_{1}), \dots, y (t_{n}), y^{*} (t_{1}), \dots, y^{*} (t_{n})}$ respectively. Thus, $\tilde{s} (t_{k} | t_{n})$ is the SL estimate of s(t_k ) and ŝ(t_k |t_n ) is its WL estimate. Since $\bar{sp} \{y (t_{1}), \dots, y (t_{n})\} \subseteq \bar{sp} \{y (t_{1}), \dots, y (t_{n}), y^{*} (t_{1}), \dots, y^{*} (t_{n})\}$ , then, by the projection theorem, it follows that the mean square error of ŝ(t_k |t_n ) is smaller than that of ŝ(t_k |t_n ).

Abbreviations

EKF:: extended Kalman filter
SL:: strictly linear
SOS:: second-order stationary
UKF:: un-scented Kalman filter
WL:: widely linear.

References

Mandic DP, Goh VSL: Complex Valued Nonlinear Adaptative Filters. Noncircularity, Widely Linear and Neural Models. Wiley, New York; 2009.
Chapter Google Scholar
Picinbono B, Chevalier P: Widely linear estimation with complex data. IEEE Trans Signal Process 1995, 43(8):2030-2033. 10.1109/78.403373
Article Google Scholar
Picinbono B, Bondon P: Second-order statistics of complex signals. IEEE Trans Signal Process 1997, 45(2):411-420. 10.1109/78.554305
Article Google Scholar
CheongTook C, Mandic DP: Augmented second-order statistics of quaternion random signals. Signal Process 2011, 91(2):214-224. 10.1016/j.sigpro.2010.06.024
Article Google Scholar
Goh VSL, Mandic DP: An augmented extended kalman filter algorithm for complex-valued recurrent neural networks. Neural Comput 2007, 19: 1039-1055. 10.1162/neco.2007.19.4.1039
Article Google Scholar
Navarro-Moreno J: ARMA prediction of widely linear systems by using the innovations algorithm. IEEE Trans Signal Process 2008, 56(7):3061-3068.
Article MathSciNet Google Scholar
Rubin-Delanchy P, Walden AT: Kinematics of complex-valued time series. IEEE Trans Signal Process 2008, 56(9):4189-4198.
Article MathSciNet Google Scholar
Navarro-Moreno J, Moreno-Kayser J, Fernández-Alcalá RM, Ruiz-Molina JC: Widely linear estimation algorithms for second-order stationary signals. IEEE Trans Signal Process 2009, 57(12):4930-4935.
Article MathSciNet Google Scholar
Xia Y, CheongTook C, Mandic DP: An augmented affine projection algorithm for the filtering of noncircular complex signals. Signal Process 2010, 90(6):1788-1799. 10.1016/j.sigpro.2009.11.026
Article Google Scholar
Martíınez-Rodríguez AM, Navarro-Moreno J, Fernández-Alcalá RM, Ruiz-Molina JC: A general solution to the continuous-time estimation problem under widely linear processing. EURASIP J Adv Signal Process (accepted, 2011)
Adali T, Haykin S: Adaptive Signal Processing: Next Generation Solutions. Wiley, New York; 2010.
Book Google Scholar
Cambanis S: A general approach to linear mean-square estimation problems. IEEE Trans Inf Theory 1973, IT-19: 110-114.
Article MathSciNet Google Scholar
Poor HV: An Introduction to Signal Detection and Estimation. 2nd edition. Springer-Verlag, New York; 1998.
Google Scholar
Sugisaka M: The design of on-line least-squares estimators given covariance specifications via an imbedding method. Appl Math Comput 1983, 13: 55-85. 10.1016/0096-3003(83)90030-9
Article MathSciNet Google Scholar
Fernández-Alcalá RM, Navarro-Moreno J, Ruiz-Molina JC: On the smoothing estimation problem for the intensity of a DSMPP. Methodol Comput Appl Probab 2010, 1-12. Online. Available: http://dx.doi.org/10.1007/s11009-010-9165-z
Google Scholar
Moustafa A, Takewaki I: Use of probabilistic and deterministic measures to identify unfavourable earthquake records. J Zhejiang Univ Sci A 2009, 10: 619-634. Online. Available: http://dx.doi.org/10.1631/jzus.A0930001
Article Google Scholar
Schreier PJ, Scharf LL: Statistical Signal Processing of Complex-Valued Data. Cambridge University Press, New York; 2010.
Book Google Scholar
Arató M, Baran S, Ispány M: Functionals of complex Ornstein-Uhlenbeck processes. Comput Math Appl 1999, 37: 1-13.
Google Scholar
Baggeroer A: State variable analysis procedures. In Detection, Estimation, and Modulation Theory. Wiley, New York; 1971:286-327.
Google Scholar
Wang S, Hong H: Quantiles of critical separation distance for nonstationary seismic excitations. Eng Struct 2006, 28: 985-991. 10.1016/j.engstruct.2005.11.003
Article Google Scholar
Conte JP, Pister KS, Mahin SA: Nonstationary ARMA modeling of seismic motions. Soil Dyn Earthquake Eng 1992, 11: 411-426. 10.1016/0267-7261(92)90005-X
Article Google Scholar
Haykin S: Kalman Filtering and Neural Networks. John Wiley & Sons, Inc., New York; 2001.
Book Google Scholar

Download references

Acknowledgements

This study was supported in part by Project MTM2007-66791 of the Plan Nacional de I+D+I, Ministerio de Educación y Ciencia, Spain, which was financed jointly by the FEDER.

Author information

Authors and Affiliations

Department of Statistics and Operations Research, University of Jaén, Campus Las Lagunillas, Jaén, 23071, Spain
Rosa M. Fernández-Alcalá, Jesús Navarro-Moreno & Juan C. Ruiz-Molina

Authors

Rosa M. Fernández-Alcalá
View author publications
You can also search for this author in PubMed Google Scholar
Jesús Navarro-Moreno
View author publications
You can also search for this author in PubMed Google Scholar
Juan C. Ruiz-Molina
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rosa M. Fernández-Alcalá.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Fernández-Alcalá, R.M., Navarro-Moreno, J. & Ruiz-Molina, J.C. Prediction on widely factorizable signals. EURASIP J. Adv. Signal Process. 2012, 96 (2012). https://doi.org/10.1186/1687-6180-2012-96

Download citation

Received: 29 June 2011
Accepted: 01 May 2012
Published: 01 May 2012
DOI: https://doi.org/10.1186/1687-6180-2012-96

Prediction on widely factorizable signals

Abstract

1 Introduction

2 Preliminaries

3 Linear augmented complex prediction

3.1 Numerical examples

3.1.1 Example

3.1.2 Example

3.1.3 Example

4 Nonlinear augmented complex prediction

4.1 Numerical example

Endnotes

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

About this article

Cite this article

Share this article

Keywords