
Widely linear Markov signals

Abstract

It is shown that the classical wide-sense Markov condition is insufficient to guarantee the existence of a state-space representation for improper complex-valued signals, and a generalization is suggested. New characterizations of wide-sense Markov signals, based either on second-order properties or on state-space representations, are studied in a widely linear setting. Moreover, the correlation structure of such signals is revealed and results on modeling in both the forwards and backwards time directions are proved. As an application, we give some recursive estimation algorithms obtained from the Kalman filter. The performance of the proposed results is illustrated with a numerical example in the areas of estimation and simulation.

1 Introduction

Markov signals are characterized by the condition that their future development depends only on their current state and not on their history up to that time. In general, Markov processes are easier to model and analyze, and they admit interesting applications. Among others, estimation and detection are areas of signal processing where this kind of process has provided efficient solutions (see, e.g., [1, 2]). Non-Markov processes, in which the future state of a process depends on its whole history, are generally harder to analyze mathematically [3]. In linear minimum mean-square error (MMSE) estimation theory, when the processes under consideration are not Gaussian, the classes of stochastic processes which are of practical importance are wide-sense Markov (WSM) processes. The WSM condition is easier to check than the (strictly) Markov condition since it involves only second-order characteristics [4]. In general, WSM processes (with the exception of Gaussian processes) are not Markov in the strict sense. The equivalence between the WSM condition and the existence of a state-space representation for the signal is really what makes WSM signals especially attractive in signal processing [1].

Widely linear (WL) processing is an emerging research area in complex-valued signal analysis which gives significant performance gains with respect to strictly linear (SL) processing (an excellent account of the topic and the literature can be found in [5, 6]). It has proved to be a more useful approach than SL processing since complex-valued random signals are in general improper (i.e., they are correlated with their complex conjugates). Thus, the improper nature of most signals forces us to consider the so-called augmented statistics to describe their second-order properties entirely. Using augmented statistics means incorporating in the analysis the information supplied by the complex conjugate of the signal and examining the properties of both the correlation and complementary correlation functions; SL processing operates ignoring this last function. Some areas of signal processing in which the treatment of improper signals by WL processing has proved to be beneficial are estimation [5–11], detection [12], modeling [8], and simulation [13].

A general characteristic of the articles devoted to studying WSM complex-valued signals is that they use an SL processing approach (see, e.g., [1, 14–16]). We will show by means of simple examples that the classical definition and the associated characterizations of WSM signals are inadequate for improper signals. These examples then motivate the extension of the concept of WSM signal to a WL setting and the study of new characterizations. Specifically, we introduce the concept of widely linear Markov (WLM) signals and we give different characterizations based either on second-order properties or on state-space representations from a WL processing point of view. The analysis is performed in both the forwards and backwards directions of time. We also provide a way to check the WLM condition, similar to the well-known triangular property, based on augmented statistics, and determine the correlation structure of WLM signals. Modeling is the main focus of this article. In this sense, WL forwards and backwards Markovian representations are suggested, the interrelation between them is studied, and the connection with the WL autoregressive representations defined in [8] is established. These Markovian representations also become a starting point for the application of different recursive estimation algorithms. Thus, the application of the Kalman filter to the forwards and backwards representations yields different WL prediction, filtering, and smoothing algorithms. The point, which is illustrated in an example, is that besides the well-known performance gain of the WL approach we also obtain more realistic results in simulation and modeling.

The article is organized as follows. In Section 2, we present some background material on complex-valued Markov signals, illustrate the inability of the usual WSM condition to characterize the state-space representation for improper signals, and suggest the concept of WLM signal. Some preliminary characterizations are also given. Section 3 studies the correlation structure of WLM signals. In Section 4, we discuss the modeling problem for WLM signals and analyze the stationary case. The estimation problem is treated in Section 5. We apply our results in the fields of signal simulation and estimation by considering a numerical example in Section 6. A section of conclusions ends the article. To preserve continuity in our presentation, all proofs are deferred to Appendix 1.

2 Preliminaries

In this section, we give the main definitions, notation, and auxiliary results. We also present two examples which motivate the need for the new concept introduced below.

Bold capital letters will be used to refer to matrices and bold lower-case letters to vectors. The $j$th row of any matrix $\mathbf{A}(\cdot)$ will be denoted by $\mathbf{A}_{[j]}(\cdot)$, the $n$-vector of zeros by $\mathbf{0}_n$ and the $n \times m$ matrix of zeros by $\mathbf{0}_{n \times m}$. Furthermore, the superscripts $*$, $T$, and $H$ represent the complex conjugate, transpose, and conjugate transpose, respectively.

Let $\{x_t, t \in \mathbb{Z}\}$ be a zero-mean complex random signal with correlation function $r(t,s) = E[x_t x_s^*]$ and complementary correlation function $c(t,s) = E[x_t x_s]$. Most of the results in this article are valid for nonstationary signals; however, for some of them stationarity is necessary. The signal $x_t$ is said to be second-order wide-sense stationary (SOS) if the functions $r(t,s)$ and $c(t,s)$ depend only on $t-s$. A zero-mean stochastic process $w_t$ is called a doubly white noise if $E[w_t w_s^*] = e_1 \delta(t-s)$ and $E[w_t w_s] = e_2 \delta(t-s)$ with $|e_2| \le e_1$ (see [8] for a complete study of its characteristics). The linear MMSE estimator of $x_t$ based on the set of observations $\{x_{t_1}, x_{t_2}, \ldots, x_{t_m}\}$ is denoted by $\hat{x}(t \mid t_1, t_2, \ldots, t_m)$ and we refer to it as the SL estimator.
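To make the doubly white noise concrete, the following minimal Python sketch (the helper name and the restriction to a real $e_2$ are ours, not the paper's) generates such a sequence by giving the real and imaginary parts unequal variances, which is one standard way to obtain an improper white noise:

```python
import numpy as np

def doubly_white_noise(m, e1, e2, seed=None):
    """Draw m samples of a zero-mean doubly white noise w_t with
    E[w_t w_t^*] = e1 and E[w_t w_t] = e2 (e2 assumed real here)."""
    if abs(e2) > e1:
        raise ValueError("must have |e2| <= e1")
    rng = np.random.default_rng(seed)
    # Independent real and imaginary parts with unequal variances give an
    # improper white sequence: E[w w^*] = v_re + v_im, E[w w] = v_re - v_im.
    v_re, v_im = (e1 + e2) / 2, (e1 - e2) / 2
    return (rng.normal(scale=np.sqrt(v_re), size=m)
            + 1j * rng.normal(scale=np.sqrt(v_im), size=m))

w = doubly_white_noise(100_000, e1=1.0, e2=0.6, seed=0)
print(np.mean(w * np.conj(w)).real, np.mean(w * w).real)  # ~1.0, ~0.6
```

A complex $e_2$ would additionally require correlating the two parts; the diagonal construction above covers the real case arising in the examples below.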

The Markov condition on a signal $\{x_t, t \in \mathbb{Z}\}$ establishes the following identity for the conditional probability:

$$P(x_t \le x \mid x_{t_1}, x_{t_2}, \ldots, x_{t_m}) = P(x_t \le x \mid x_{t_1})$$

for all $x$ and $t > t_1 > \cdots > t_m$. Doob [4] introduced a weaker concept based on the SL estimator which has received great attention in the literature (e.g., [1, 14–16]). A signal $x_t$ is called WSM if $\hat{x}(t \mid \tau \le s) = \hat{x}(t \mid s)$ for any $s < t$. Such signals have remarkable properties. For example, Beutler [14] showed that a signal $x_t$ is WSM if, and only if, the function $\bar{k}(t,s) = r(t,s)\, r^{-1}(s,s)$ has the triangular property, i.e.,

$$\bar{k}(t,s) = \bar{k}(t,\tau)\, \bar{k}(\tau,s), \quad t \ge \tau \ge s$$
(1)

Another characterization, in terms of so-called Markovian state-space models, can be found in [1], where it is shown that a signal $\{x_t, t \ge 0\}$ is WSM if, and only if, it has a state-space model of the form

$$x_{t+1} = \bar{k}(t+1,t)\, x_t + u_t$$
(2)

where $u_t$ is a white noise uncorrelated with $x_0$. Doob's definition was later generalized in [16] in the following sense: $x_t$ is a WSM signal of order $n \ge 1$ if $\hat{x}(t \mid \tau \le s) = \hat{x}(t \mid s, s-1, \ldots, s-n+1)$ for any $s < t$. The authors also studied the second-order properties of such signals.

All these studies share a common characteristic: the information supplied by the complementary correlation function is ignored, i.e., the results are derived assuming implicitly that the signal is proper ($c(t,s) = 0$). As noted above, research activity in the field of complex-valued signals is nowadays increasingly focused on the better-performing and less familiar WL processing. In this setting the SL MMSE estimator is replaced by the WL MMSE estimator, denoted by $\hat{x}_{\mathrm{WL}}(t \mid t_1, t_2, \ldots, t_m)$, which uses the information of the augmented vector of observations $[x_{t_1}, x_{t_1}^*, x_{t_2}, x_{t_2}^*, \ldots, x_{t_m}, x_{t_m}^*]^T$. The immediate question that arises is whether the classical concept of WSM signals remains valid in the WL processing approach. The following two examples give us the answer.

Example 1. Consider a signal $\{x_t, t \ge 0\}$ with correlation function $r(t,s) = \frac{1}{2}\left( e^{-3|t-s|} + e^{-|t-s|} \right)$ and complementary correlation function $c(t,s) = \frac{1}{2}\left( e^{-3|t-s|} - e^{-|t-s|} \right)$. It is easy to check that $r(t,s)$ does not satisfy the triangular property (1) and hence the signal cannot be modeled by a representation of the form (2). However, as we will show later, it is possible to find a state-space representation for such a signal, given by (26). Thus, the classical WSM condition is clearly insufficient in the improper case to find a state-space representation for the signal involved.
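As a quick numerical illustration of Example 1 (a sketch under our own naming; the grid points are arbitrary), the following Python snippet shows that $\bar{k}(t,s) = r(t,s)\, r^{-1}(s,s)$ violates (1), while the normalized augmented correlation defined later in this section (Eq. (4)) does factor as in the multiplicative property (6):

```python
import numpy as np

def r(t, s):   # correlation function of Example 1
    return 0.5 * (np.exp(-3 * abs(t - s)) + np.exp(-abs(t - s)))

def c(t, s):   # complementary correlation function of Example 1
    return 0.5 * (np.exp(-3 * abs(t - s)) - np.exp(-abs(t - s)))

def K(t, s):   # normalized augmented correlation, Eq. (4) below, n = 1
    R = lambda a, b: np.array([[r(a, b), c(a, b)],
                               [np.conj(c(a, b)), np.conj(r(a, b))]])
    return R(t, s) @ np.linalg.inv(R(s, s))

t, tau, s = 3.0, 2.0, 1.0
kbar = lambda a, b: r(a, b) / r(b, b)
print(kbar(t, s) - kbar(t, tau) * kbar(tau, s))        # nonzero: (1) fails
print(np.abs(K(t, s) - K(t, tau) @ K(tau, s)).max())   # ~0: (6) holds
```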

Example 2. Assume that $\{x_t, 1 \le t \le 100\}$ is a signal with correlation and complementary correlation functions given by $r(t,s) = (t/100 + 1)^{1/6}(s/100)^4$ and $c(t,s) = j(s/100)^4$ for $s \le t$, respectively, with $j = \sqrt{-1}$. Here, the triangular property (1) holds and then $x_t$ has the representation

$$x_{t+1} = \left( \frac{t+101}{t+100} \right)^{1/6} x_t + u_t$$
(3)

with $x_t$ uncorrelated with $u_t$. However, this model presents two important shortcomings in the WL processing framework: the noise $u_t$ is correlated with $x_t^*$ and the information supplied by $c(t,s)$ is ignored. Both problems can be avoided by considering a more competitive model for $x_t$ obtained with the additional information of $x_t^*$. In fact, we can write an alternative state-space representation for $x_t$ given by (27). An exhaustive study of the superiority of (27) over (3) is presented in Section 6.

From these two simple examples we extract the following consequences: the classical definition of a WSM signal must be extended to deal with improper signals; this new concept must be characterized adequately to avoid the drawback shown in Example 1; and new results on modeling are necessary to exploit the information available in both $x_t$ and $x_t^*$, thus attaining better models for the signal, as illustrated in Example 2. Next, we introduce such a definition in a WL processing setting.

Definition 1. A complex-valued signal $\{x_t, t \in \mathbb{Z}\}$ is said to be WLM of order $n \ge 1$, briefly a WLM($n$) signal, if the following condition holds

$$\hat{x}_{\mathrm{WL}}(t \mid \tau \le s) = \hat{x}_{\mathrm{WL}}(t \mid s, s-1, \ldots, s-n+1)$$

for any s < t.

Notice that this concept extends both the classical notion of WSM introduced by Doob in [4] and the later generalization given in [16].

In the rest of the section, we provide different characterizations of WLM($n$) signals. For that, we need some additional notation. Denote the augmented forwards vector of order $n \ge 1$ of $x_t$ as the $2n$-vector

$$\mathbf{x}_t = [x_t, x_t^*, x_{t-1}, x_{t-1}^*, \ldots, x_{t-n+1}, x_{t-n+1}^*]^T$$

and its correlation function by $\mathbf{R}(t,s) = E[\mathbf{x}_t \mathbf{x}_s^H]$. From now on, we assume that $\det\{\mathbf{R}_t\} \ne 0$ with $\mathbf{R}_t := \mathbf{R}(t,t)$. Moreover, we define the normalized correlation function as

$$\mathbf{K}(t,s) = \mathbf{R}(t,s)\, \mathbf{R}_s^{-1}$$
(4)

Similarly, we define the augmented backwards vector of order $n \ge 1$ of $x_t$ as the $2n$-vector

$$\mathbf{x}_t^b = [x_{t+n-1}, x_{t+n-1}^*, x_{t+n-2}, x_{t+n-2}^*, \ldots, x_t, x_t^*]^T$$

The following results establish the relation between a signal $x_t$ and its augmented forwards and backwards versions. We start with the augmented forwards vector and give a test, similar to (1), for a signal being WLM($n$).

Theorem 1. The following statements are equivalent:

1. $\{x_t, t \in \mathbb{Z}\}$ is a WLM($n$) signal.

2. For $s < t$, the WL MMSE estimator of $\mathbf{x}_t$ on the basis of the set $\{x_\tau, x_\tau^*, \tau \le s\}$ is of the form

$$\hat{\mathbf{x}}_{\mathrm{WL}}(t \mid \tau \le s) = \mathbf{K}(t,s)\, \mathbf{x}_s$$
(5)

3. For $t \ge \tau \ge s$,

$$\mathbf{K}(t,s) = \mathbf{K}(t,\tau)\, \mathbf{K}(\tau,s)$$
(6)

Now, we suggest a characterization based on the augmented backwards vector. This result also shows that the Markov condition is independent of the direction of time.

Theorem 2. The following statements are equivalent:

1. $\{x_t, t \in \mathbb{Z}\}$ is a WLM($n$) signal.

2. $\hat{x}_{\mathrm{WL}}(t \mid \tau \ge s) = \hat{x}_{\mathrm{WL}}(t \mid s, s+1, \ldots, s+n-1)$ for any $s > t$.

3. For $s > t$, the WL MMSE estimator of $\mathbf{x}_t^b$ on the basis of the set $\{\mathbf{x}_\tau^b, \mathbf{x}_\tau^{b*}, \tau \ge s\}$ is of the form

$$\hat{\mathbf{x}}_{\mathrm{WL}}^b(t \mid \tau \ge s) = \mathbf{K}(t+n-1,\, s+n-1)\, \mathbf{x}_s^b$$
(7)

3 Correlation structure of WLM(n) signals

In this section, the second-order properties of a WLM($n$) signal $\{x_t, t \in \mathbb{Z}\}$ are analyzed. Specifically, we study the structure of the matrices $\mathbf{R}(t,s)$, $\mathbf{K}(t,s)$, $\mathbf{R}_t$, and $\mathbf{K}_t := \mathbf{K}(t+1,t)$.

Proposition 1. 1. The following relations hold:

$$\mathbf{K}_{[2(j+i)-1]}(t+j, t) = \big[ \underbrace{0, \ldots, 0}_{2i-2},\, 1,\, \underbrace{0, \ldots, 0}_{2(n-i)+1} \big], \quad j < n,\ i = 1, \ldots, n-j$$
(8)
$$\mathbf{K}_{[2(j+i)]}(t+j, t) = \big[ \underbrace{0, \ldots, 0}_{2i-1},\, 1,\, \underbrace{0, \ldots, 0}_{2(n-i)} \big], \quad j < n,\ i = 1, \ldots, n-j$$
(9)
$$\mathbf{K}_{[2+i]}(t+j+1, t) = \mathbf{K}_{[i]}(t+j, t), \quad j \ge 0,\ i = 1, \ldots, 2n-2$$
(10)
$$\mathbf{K}_{[1]}(t+j+1, t) = \mathbf{K}_{[1]}(t+j+1, t+j)\, \mathbf{K}(t+j, t), \quad j \ge 0$$
(11)
$$\mathbf{K}_{[2]}(t+j+1, t) = \mathbf{K}_{[2]}(t+j+1, t+j)\, \mathbf{K}(t+j, t), \quad j \ge 0$$
(12)

2. The matrix $\mathbf{K}_t$ is of the form

$$\mathbf{K}_t = \begin{bmatrix} k_{1,t} & k_{2,t} & k_{3,t} & k_{4,t} & \cdots & k_{2n-1,t} & k_{2n,t} \\ k_{2,t}^* & k_{1,t}^* & k_{4,t}^* & k_{3,t}^* & \cdots & k_{2n,t}^* & k_{2n-1,t}^* \\ & & \mathbf{I}_{2n-2} & & & \mathbf{0}_{(2n-2) \times 2} & \end{bmatrix}$$
(13)

where $k_{i,t} = k_i(t+1,t)$ for $i = 1, \ldots, 2n$, $k_i(t+1,t)$ is defined in (28), and the lower block $[\mathbf{I}_{2n-2}\ \ \mathbf{0}_{(2n-2) \times 2}]$ simply shifts the components of $\mathbf{x}_t$.

3. The matrices $\mathbf{R}(t,s)$ and $\mathbf{K}_t$ satisfy the recursive equation

$$\mathbf{R}(t+1,s) = \mathbf{K}_t\, \mathbf{R}(t,s), \quad s \le t$$
(14)

which has the solution

$$\mathbf{R}(t,s) = \mathbf{K}_{t-1} \cdots \mathbf{K}_s\, \mathbf{R}_s, \quad s < t$$
(15)

Moreover,

$$\mathbf{R}_{t+1} = \mathbf{K}_t\, \mathbf{R}_t\, \mathbf{K}_t^H + \mathbf{Q}_t$$

where $\mathbf{Q}_t$ is a $2n \times 2n$ matrix of the form

$$\mathbf{Q}_t = \begin{bmatrix} \mathbf{A}_t & \mathbf{0}_{2 \times (2n-2)} \\ \mathbf{0}_{(2n-2) \times 2} & \mathbf{0}_{(2n-2) \times (2n-2)} \end{bmatrix}$$
(16)

with

$$\mathbf{A}_t = \begin{bmatrix} a_{1,t} & a_{2,t} \\ a_{2,t}^* & a_{1,t} \end{bmatrix}$$

where $a_{1,t}$ is a real positive number and $\mathbf{A}_t$ is nonnegative definite.

4 Modeling of WLM(n) signals

We aim to provide different ways of modeling WLM($n$) signals. The connection between stationary WLM($n$) signals and the autoregressive representations defined in [8] is also established. First, we present a new characterization in which the equivalence between a WLM signal of order $n$ and its forwards and backwards representations is given. Such representations show that a WLM($n$) signal depends only on the $n$ preceding or subsequent states and their conjugates.

Theorem 3. A signal $\{x_t, 0 \le t \le m\}$ is WLM($n$) if, and only if, it has the forwards and backwards representations

$$x_{t+1} = \mathbf{k}_t^T\, \mathbf{x}_t + w_t, \quad t \ge n-1$$
(17)
$$x_t = \mathbf{k}_{t+1}^{bT}\, \mathbf{x}_{t+1}^b + w_{t+1}^b, \quad t \le m-n+1$$
(18)

where $\mathbf{k}_t$, $\mathbf{k}_t^b$ are $2n$-vectors, and $w_t$, $w_t^b$ are doubly white noises such that

$$E[w_t\, \mathbf{x}_{n-1}^*] = \mathbf{0}_{2n}, \quad t \ge n-1; \qquad E[w_t^b\, \mathbf{x}_{m-n+1}^{b*}] = \mathbf{0}_{2n}, \quad t \le m-n+1$$
(19)

Now we state a result parallel to the classical one established for stationary WSM processes and autoregressive representations [16].

Corollary 1. If $\{x_t, 0 \le t \le m\}$ is a SOS WLM($n$) signal, then $x_t$ is the solution of the WL system defined in [8]

$$x_{t+1} = \sum_{i=0}^{n-1} g_{1,i}\, x_{t-i} + \sum_{i=0}^{n-1} g_{2,i}\, x_{t-i}^* + w_t$$
(20)

where $g_{1,i}, g_{2,i} \in \mathbb{C}$, $i = 0, \ldots, n-1$, and $w_t$ is a doubly white noise such that $E[w_t w_t^*] = a_1$ and $E[w_t w_t] = a_2$.

We summarize the previous results in the following steps, which provide forwards and backwards models for a WLM($n$) signal (a Python sketch implementing these steps is given after the list):

  • Step 1: Define the $2n$-vector $\mathbf{k}_t$ such that $\mathbf{k}_t^T$ coincides with the first row of the matrix

    $$\mathbf{K}_t := \mathbf{R}(t+1,t)\, \mathbf{R}_t^{-1}$$
    (21)

    Similarly, define the $2n$-vector $\mathbf{k}_{t+1}^b$ such that $\mathbf{k}_{t+1}^{bT}$ is equal to the $(2n-1)$th row of the matrix

    $$\mathbf{K}_{t+1}^b := \mathbf{K}(t+n-1,\, t+n) = \mathbf{R}(t+n-1,\, t+n)\, \mathbf{R}_{t+n}^{-1}$$
    (22)

  • Step 2: Consider the matrices

    $$\mathbf{Q}_t = \mathbf{R}_{t+1} - \mathbf{K}_t\, \mathbf{R}_t\, \mathbf{K}_t^H$$
    (23)
    $$\mathbf{Q}_{t+1}^b = \mathbf{R}_{t+n-1} - \mathbf{K}_{t+1}^b\, \mathbf{R}_{t+n}\, \mathbf{K}_{t+1}^{bH}$$
    (24)

  • Step 3: The signal $x_t$ can be represented by the following forwards and backwards models:

    $$x_{t+1} = \mathbf{k}_t^T\, \mathbf{x}_t + w_t, \quad t \ge n-1$$
    $$x_t = \mathbf{k}_{t+1}^{bT}\, \mathbf{x}_{t+1}^b + w_{t+1}^b, \quad t \le m-n+1$$

    where $w_t$ is a doubly white noise uncorrelated with $\mathbf{x}_{n-1}$ for all $t \ge n-1$ and $w_t^b$ is a doubly white noise uncorrelated with $\mathbf{x}_{m-n+1}^b$ for all $t \le m-n+1$. Moreover, $E[w_t w_t^*]$ and $E[w_t w_t]$ are the $(1,1)$-element and $(1,2)$-element of the matrix $\mathbf{Q}_t$, respectively. Similarly, $E[w_t^b w_t^{b*}]$ and $E[w_t^b w_t^b]$ are the $(2n-1, 2n-1)$-element and $(2n-1, 2n)$-element of the matrix $\mathbf{Q}_t^b$, respectively.
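The following Python sketch implements Steps 1–3 under the assumption that the augmented correlation $\mathbf{R}(t,s)$ is available as a callable returning a $2n \times 2n$ matrix (the function names are ours):

```python
import numpy as np

def forward_model(R, t):
    """Steps 1-2, forwards: K_t = R(t+1,t) R_t^{-1} (Eq. (21)), its first
    row k_t^T, and Q_t = R_{t+1} - K_t R_t K_t^H (Eq. (23))."""
    Rt = R(t, t)
    Kt = R(t + 1, t) @ np.linalg.inv(Rt)
    Qt = R(t + 1, t + 1) - Kt @ Rt @ Kt.conj().T
    kt = Kt[0]                                   # coefficient vector of Eq. (17)
    return Kt, kt, Qt, Qt[0, 0].real, Qt[0, 1]   # last two: E[w w^*], E[w w]

def backward_model(R, t, n):
    """Steps 1-2, backwards: K^b_{t+1} (Eq. (22)), its (2n-1)th row, and
    Q^b_{t+1} (Eq. (24))."""
    Rtn = R(t + n, t + n)
    Kb = R(t + n - 1, t + n) @ np.linalg.inv(Rtn)
    Qb = R(t + n - 1, t + n - 1) - Kb @ Rtn @ Kb.conj().T
    return Kb, Kb[2 * n - 2], Qb   # 0-based row 2n-2 is the (2n-1)th row
```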

In certain situations we have a forwards model of the form (17) for the signal $x_t$, and it is of interest to obtain a backwards model directly from it. The next result shows a useful way to do so.

Proposition 2. Given a forwards model of the form

$$x_{t+1} = \mathbf{k}_t^T\, \mathbf{x}_t + w_t, \quad n-1 \le t \le m$$
(25)

with $w_t$ a doubly white noise uncorrelated with $\mathbf{x}_{n-1}$, then $\{x_t, 0 \le t \le m\}$ has the backwards representation

$$x_t = \mathbf{k}_{t+1}^{bT}\, \mathbf{x}_{t+1}^b + w_{t+1}^b, \quad 0 \le t \le m-n+1$$

where the $2n$-vector $\mathbf{k}_{t+1}^b$ is such that $\mathbf{k}_{t+1}^{bT}$ equals the $(2n-1)$th row of the matrix $\mathbf{K}_{t+1}^b = \mathbf{R}_{t+n-1}\, \mathbf{K}_{t+n-1}^H\, \mathbf{R}_{t+n}^{-1}$ and $w_t^b$ is a doubly white noise with the properties given in Step 3 above.
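A direct transcription of Proposition 2 (a sketch, with the augmented correlation R and the forwards gain K passed as callables of our choosing):

```python
import numpy as np

def backward_gain_from_forward(R, K, t, n):
    """Proposition 2: K^b_{t+1} = R_{t+n-1} K_{t+n-1}^H R_{t+n}^{-1}."""
    return (R(t + n - 1, t + n - 1) @ K(t + n - 1).conj().T
            @ np.linalg.inv(R(t + n, t + n)))
```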

Example 1 (continued). It is not difficult to check that $x_t$ is a WLM(1) signal by using property (6). Hence, applying Steps 1–3 above, it has the state-space representation

$$x_{t+1} = \frac{1}{2}\left( e^{-3} + e^{-1} \right) x_t + \frac{1}{2}\left( e^{-3} - e^{-1} \right) x_t^* + w_t$$
(26)

with $w_t$ a doubly white noise uncorrelated with $x_0$ and $x_0^*$. Moreover, as $x_t$ is also a SOS signal, this model is trivially its WL autoregressive representation.
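To see the model in action, the sketch below simulates (26) and checks the lag-one moments against the functions of Example 1. The innovation moments follow from $\mathbf{Q}_t = \mathbf{R}_{t+1} - \mathbf{K}_t \mathbf{R}_t \mathbf{K}_t^H$ with $\mathbf{R}_t = \mathbf{I}_2$ in this example; the Monte Carlo sizes are arbitrary choices of ours:

```python
import numpy as np

rng = np.random.default_rng(1)
a = 0.5 * (np.exp(-3) + np.exp(-1))   # coefficient of x_t in Eq. (26)
b = 0.5 * (np.exp(-3) - np.exp(-1))   # coefficient of x_t^* in Eq. (26)
# Innovation moments: E[w w^*] = 1 - a^2 - b^2, E[w w] = -2ab (both real).
e1, e2 = 1 - a**2 - b**2, -2 * a * b
v_re, v_im = (e1 + e2) / 2, (e1 - e2) / 2

paths = 20_000                        # independent trajectories
x = (rng.normal(size=paths) + 1j * rng.normal(size=paths)) / np.sqrt(2)
for _ in range(200):
    w = (rng.normal(scale=np.sqrt(v_re), size=paths)
         + 1j * rng.normal(scale=np.sqrt(v_im), size=paths))
    x_new = a * x + b * np.conj(x) + w
    # lag-one moments estimated across the Monte Carlo ensemble
    r1, c1 = np.mean(x_new * np.conj(x)), np.mean(x_new * x)
    x = x_new
# r(t+1,t) = 0.5(e^-3 + e^-1) ~ 0.209; c(t+1,t) = 0.5(e^-3 - e^-1) ~ -0.159
print(r1.real, c1.real)
```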

Example 2 (continued). From Theorem 1 and Steps 1–3, it follows that $x_t$ is a WLM(1) signal and has the state-space representation

$$x_{t+1} = \frac{10^{1/3}(t+101)^{1/6}(t+100)^{1/6} - 10}{10^{1/3}(t+100)^{1/3} - 10}\, x_t + j\, \frac{10^{2/3}\left( (t+100)^{1/6} - (t+101)^{1/6} \right)}{10^{1/3}(t+100)^{1/3} - 10}\, x_t^* + w_t$$
(27)

with $w_t$ a doubly white noise uncorrelated with $x_1$ and $x_1^*$.

5 Estimation problem of WLM(n) signals

Once the modeling problem has been solved for WLM($n$) signals, we address the MMSE estimation problem for such signals under a WL processing approach. The forwards and backwards representations given in Theorem 3 notably simplify the design of different recursive estimation algorithms. To this end, we apply the Kalman recursions to the forwards representation to solve the prediction and filtering problems, and to the backwards representation to solve the smoothing problem (see, e.g., [17, 18]).

Suppose that we observe a WLM($n$) signal $\{x_t, 0 \le t \le m\}$ via the process

$$y_t = h_t\, x_t + v_t, \quad 0 \le t \le m$$

with $v_t$ a doubly white noise such that $E[v_t v_t^*] = n_{1,t}$ and $E[v_t v_t] = n_{2,t}$ with $n_{1,t} > |n_{2,t}|$. Moreover, we assume that $v_t$ is uncorrelated with $x_s$ and $x_s^*$ for all $t, s$.

Consider the 2-vector $\mathbf{y}_t = [y_t, y_t^*]^T$, the $2 \times 2n$ matrix

$$\mathbf{H}_t = \begin{bmatrix} h_t & 0 & 0 & \cdots & 0 \\ 0 & h_t^* & 0 & \cdots & 0 \end{bmatrix}$$

and the $2 \times 2$ matrix

$$\mathbf{N}_t = \begin{bmatrix} n_{1,t} & n_{2,t} \\ n_{2,t}^* & n_{1,t} \end{bmatrix}$$

5.1 Prediction and filtering cases

Denote the WL filtered estimator of $x_t$ by $\hat{x}_t^{\mathrm{WL}}$ and the one-step-ahead predictor of $x_{t+1}$ by $\hat{x}_{t+1|t}^{\mathrm{WL}}$, both obtained on the basis of the information provided by the set $\{y_0, y_0^*, \ldots, y_t, y_t^*\}$, and consider their associated errors $p_t = E[|x_t - \hat{x}_t^{\mathrm{WL}}|^2]$ and $p_{t+1|t} = E[|x_{t+1} - \hat{x}_{t+1|t}^{\mathrm{WL}}|^2]$. Also denote by $\hat{\mathbf{x}}_{n-1}$ the estimate of $\mathbf{x}_{n-1}$ obtained from the information provided by $[y_{n-1}, y_{n-1}^*, \ldots, y_0, y_0^*]^T$, and by $\mathbf{P}_{n-1}$ its associated error matrix. By combining the forwards representation (17) and the classical Kalman filter we obtain Algorithm 1, which provides these estimators in an efficient way.

Algorithm 1. WL filter and prediction

Require: $\mathbf{y}_t$, $\mathbf{H}_t$, $\mathbf{N}_t$, $\mathbf{K}_t$, $\mathbf{Q}_t$, $\mathbf{g} = [1, 0, \ldots, 0]^T$, $\hat{\mathbf{x}}_{n-1}$, and $\mathbf{P}_{n-1}$
Ensure: $\hat{x}_{t+1|t}^{\mathrm{WL}}$, $\hat{x}_{t+1}^{\mathrm{WL}}$, $p_{t+1|t}$, and $p_{t+1}$

1: for $t = n-1$ to $m-1$ do
2: $\quad \hat{\mathbf{x}}_{t+1|t} \leftarrow \mathbf{K}_t\, \hat{\mathbf{x}}_t$
3: $\quad \mathbf{P}_{t+1|t} \leftarrow \mathbf{K}_t\, \mathbf{P}_t\, \mathbf{K}_t^H + \mathbf{Q}_t$
4: $\quad \mathbf{F}_{t+1} \leftarrow \mathbf{P}_{t+1|t}\, \mathbf{H}_{t+1}^H \left[ \mathbf{H}_{t+1}\, \mathbf{P}_{t+1|t}\, \mathbf{H}_{t+1}^H + \mathbf{N}_{t+1} \right]^{-1}$
5: $\quad \hat{\mathbf{x}}_{t+1} \leftarrow \hat{\mathbf{x}}_{t+1|t} + \mathbf{F}_{t+1} \left[ \mathbf{y}_{t+1} - \mathbf{H}_{t+1}\, \hat{\mathbf{x}}_{t+1|t} \right]$
6: $\quad \mathbf{P}_{t+1} \leftarrow \mathbf{P}_{t+1|t} - \mathbf{F}_{t+1}\, \mathbf{H}_{t+1}\, \mathbf{P}_{t+1|t}$
7: $\quad \hat{x}_{t+1|t}^{\mathrm{WL}} \leftarrow \mathbf{g}^T\, \hat{\mathbf{x}}_{t+1|t}$
8: $\quad \hat{x}_{t+1}^{\mathrm{WL}} \leftarrow \mathbf{g}^T\, \hat{\mathbf{x}}_{t+1}$
9: $\quad p_{t+1|t} \leftarrow \mathbf{g}^T\, \mathbf{P}_{t+1|t}\, \mathbf{g}$
10: $\quad p_{t+1} \leftarrow \mathbf{g}^T\, \mathbf{P}_{t+1}\, \mathbf{g}$
11: end for
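For completeness, here is a minimal Python rendering of Algorithm 1 (the function signature is ours; $\mathbf{H}_t$, $\mathbf{N}_t$, $\mathbf{K}_t$, $\mathbf{Q}_t$ are passed as callables and y holds the augmented observations $\mathbf{y}_t$):

```python
import numpy as np

def wl_filter_predict(y, H, N, K, Q, x0, P0, n):
    """Sketch of Algorithm 1. y[t] is the augmented observation [y_t, y_t^*];
    H, N, K, Q are callables returning H_t, N_t, K_t, Q_t; x0 and P0 are the
    initial estimate of x_{n-1} and its error matrix."""
    g = np.zeros(2 * n); g[0] = 1.0     # picks x_t out of the state vector
    xh, P = x0, P0
    results = []
    for t in range(n - 1, len(y) - 1):
        xp = K(t) @ xh                                      # line 2: predict
        Pp = K(t) @ P @ K(t).conj().T + Q(t)                # line 3
        S = H(t + 1) @ Pp @ H(t + 1).conj().T + N(t + 1)
        F = Pp @ H(t + 1).conj().T @ np.linalg.inv(S)       # line 4: gain
        xh = xp + F @ (y[t + 1] - H(t + 1) @ xp)            # line 5: update
        P = Pp - F @ H(t + 1) @ Pp                          # line 6
        results.append((g @ xp, g @ xh,                     # WL predictor/filter
                        (g @ Pp @ g).real, (g @ P @ g).real))  # their errors
    return results
```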

5.2 Smoothing case

Next, we compute two WL smoothing estimators of $x_t$ based on future data. The first smoother is obtained from the set of observations $\{y_t, y_t^*, y_{t+1}, y_{t+1}^*, \ldots, y_m, y_m^*\}$ and is denoted by $\hat{x}_t^{b\,\mathrm{WL}}$. The second one is derived from the information supplied by the set $\{y_{t+1}, y_{t+1}^*, \ldots, y_m, y_m^*\}$ and we refer to it as $\hat{x}_{t|t+1}^{b\,\mathrm{WL}}$. The errors of these estimators are $p_t^b = E[|x_t - \hat{x}_t^{b\,\mathrm{WL}}|^2]$ and $p_{t|t+1}^b = E[|x_t - \hat{x}_{t|t+1}^{b\,\mathrm{WL}}|^2]$, respectively. The initial condition $\hat{\mathbf{x}}_{m-n+1}^b$ is the estimate of $\mathbf{x}_{m-n+1}^b$ obtained from the $2n$-vector $[y_{m-n+1}, y_{m-n+1}^*, \ldots, y_m, y_m^*]^T$ and $\mathbf{P}_{m-n+1}^b$ is its associated error matrix. By applying the backwards Kalman recursions to the backwards model (18) we get Algorithm 2.

Algorithm 2. WL smoothing

Require: $\mathbf{y}_t$, $\mathbf{H}_t$, $\mathbf{N}_t$, $\mathbf{K}_{t+1}^b$, $\mathbf{Q}_{t+1}^b$, $\mathbf{l} = [0, \ldots, 0, 1, 0]^T$, $\hat{\mathbf{x}}_{m-n+1}^b$, and $\mathbf{P}_{m-n+1}^b$
Ensure: $\hat{x}_{t|t+1}^{b\,\mathrm{WL}}$, $\hat{x}_t^{b\,\mathrm{WL}}$, $p_{t|t+1}^b$, and $p_t^b$

1: for $t = m-n$ down to $0$ do
2: $\quad \hat{\mathbf{x}}_{t|t+1}^b \leftarrow \mathbf{K}_{t+1}^b\, \hat{\mathbf{x}}_{t+1}^b$
3: $\quad \mathbf{P}_{t|t+1}^b \leftarrow \mathbf{K}_{t+1}^b\, \mathbf{P}_{t+1}^b\, \mathbf{K}_{t+1}^{bH} + \mathbf{Q}_{t+1}^b$
4: $\quad \mathbf{F}_t^b \leftarrow \mathbf{P}_{t|t+1}^b\, \mathbf{H}_t^H \left[ \mathbf{H}_t\, \mathbf{P}_{t|t+1}^b\, \mathbf{H}_t^H + \mathbf{N}_t \right]^{-1}$
5: $\quad \hat{\mathbf{x}}_t^b \leftarrow \hat{\mathbf{x}}_{t|t+1}^b + \mathbf{F}_t^b \left[ \mathbf{y}_t - \mathbf{H}_t\, \hat{\mathbf{x}}_{t|t+1}^b \right]$
6: $\quad \mathbf{P}_t^b \leftarrow \mathbf{P}_{t|t+1}^b - \mathbf{F}_t^b\, \mathbf{H}_t\, \mathbf{P}_{t|t+1}^b$
7: $\quad \hat{x}_{t|t+1}^{b\,\mathrm{WL}} \leftarrow \mathbf{l}^T\, \hat{\mathbf{x}}_{t|t+1}^b$
8: $\quad \hat{x}_t^{b\,\mathrm{WL}} \leftarrow \mathbf{l}^T\, \hat{\mathbf{x}}_t^b$
9: $\quad p_{t|t+1}^b \leftarrow \mathbf{l}^T\, \mathbf{P}_{t|t+1}^b\, \mathbf{l}$
10: $\quad p_t^b \leftarrow \mathbf{l}^T\, \mathbf{P}_t^b\, \mathbf{l}$
11: end for
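And a matching Python sketch of Algorithm 2, run backwards from the initial condition at $t = m-n+1$ (again our naming; Kb(t) and Qb(t) are assumed to return $\mathbf{K}_{t+1}^b$ and $\mathbf{Q}_{t+1}^b$):

```python
import numpy as np

def wl_smooth(y, H, N, Kb, Qb, xb0, Pb0, n, m):
    """Sketch of Algorithm 2. xb0 and Pb0 initialize the estimate of
    x^b_{m-n+1} and its error matrix; y[t] is the augmented observation."""
    l = np.zeros(2 * n); l[2 * n - 2] = 1.0   # picks x_t (the (2n-1)th entry)
    xh, P = xb0, Pb0
    results = []
    for t in range(m - n, -1, -1):            # backwards in time
        xp = Kb(t) @ xh                                     # line 2
        Pp = Kb(t) @ P @ Kb(t).conj().T + Qb(t)             # line 3
        S = H(t) @ Pp @ H(t).conj().T + N(t)
        F = Pp @ H(t).conj().T @ np.linalg.inv(S)           # line 4
        xh = xp + F @ (y[t] - H(t) @ xp)                    # line 5
        P = Pp - F @ H(t) @ Pp                              # line 6
        results.append((l @ xp, l @ xh,                     # WL smoothers
                        (l @ Pp @ l).real, (l @ P @ l).real))
    return results
```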

6 Numerical example

This section is devoted to showing the advantages of representation (27) (model 2) over (3) (model 1) in two fields of signal processing: simulation and estimation. First, we use such models to simulate trajectories of the signal $x_t$ defined in Example 2. Specifically, 50,000 trajectories of both models have been generated via Monte Carlo simulation. To assess the performance of the simulations we compare the true correlation and complementary correlation functions with the simulated ones. Figure 1a,b depicts the true correlation and complementary correlation functions of $x_t$, Figure 1c,d the simulated ones corresponding to model 1, and Figure 1e,f the simulated ones for model 2. We can see that the simulated trajectories of model 1 adequately pick up the behavior of the correlation function. However, these trajectories are unable to reproduce the basic characteristics of the complementary correlation function. This shortcoming does not appear with model 2, whose simulated trajectories yield accurate representations of the second-order moments of $x_t$. For more detail, the 2D sections of the true complementary correlation function and the simulated ones with models 1 and 2 for $t = 60$ and $t = 90$, respectively, are shown in Figure 2a,b.

Figure 1. (a) True correlation function; (b) true complementary correlation function; (c) simulated correlation function for model 1; (d) simulated complementary correlation function for model 1; (e) simulated correlation function for model 2; (f) simulated complementary correlation function for model 2.

Figure 2. (a) 2D section of the true and simulated complementary correlation functions for $t = 60$; (b) 2D section of the true and simulated complementary correlation functions for $t = 90$.

Finally, we compare the SL smoother obtained with model 1 and the WL smoother derived in Algorithm 2 for model 2. For the particular case in which $h_t = 1$ and $n_{1,t} = 1$, Figure 3a compares the error $p_t^b$ obtained for $n_{2,t} = 0.25$ (dotted line) and $n_{2,t} = 0.8$ (solid line) with the counterpart SL error (dashed line). On the other hand, setting $n_{2,t} = n_2$ and denoting the errors of the improper and proper smoothers for every value of $n_2$ by $p_t^b(n_2)$ and $\bar{p}_t^b(n_2)$, respectively, Figure 3b displays the mean of the difference between the SL and WL estimation errors, that is, $\mathrm{DE}(n_2) = \frac{1}{100} \sum_{t=1}^{100} \left( \bar{p}_t^b(n_2) - p_t^b(n_2) \right)$, with $n_2$ varying within the interval $[0,1)$. As expected, both figures show that WL estimation outperforms SL estimation, i.e., they illustrate the better performance of the improper smoother in relation to the proper one. From Figure 3b, we also conclude that this gain in performance decreases as $n_2$ decreases.

Figure 3. (a) Smoothing errors: WL smoothing errors $p_t^b$ for $n_{2,t} = 0.25$ (dotted line) and $n_{2,t} = 0.8$ (solid line), and the SL smoothing error (dashed line); (b) performance of the smoother estimate: mean of the difference between the SL and WL estimation errors, $\mathrm{DE}(n_2)$.

7 Conclusions

The limited utility of the classical WSM definition to characterize the existence of a state-space representation for improper random signals has been revealed. By means of two simple examples, we have shown that in some cases the triangular condition fails to hold for signals with a state-space representation, or that there exist signals with autocorrelations satisfying the triangular property for which the associated state-space representations present drawbacks in relation to their WL counterparts. Thus, the definition of a WSM signal has been extended to deal with improper signals, providing new characterizations for WLM signals based either on second-order properties or on state-space representations. Moreover, a way to check the WLM condition has been given and the correlation structure of WLM signals has been derived. Finally, WL forwards and backwards Markovian representations have been presented, and their application has been illustrated in the signal estimation and simulation fields.

Appendix 1

Proof of Theorem 1

To prove the implication 1) ⇒ 2), observe that if $x_t$ is a WLM($n$) signal then, for any $s < t$,

$$\hat{x}_{\mathrm{WL}}(t \mid \tau \le s) = k_1(t,s)\, x_s + k_2(t,s)\, x_s^* + \cdots + k_{2n-1}(t,s)\, x_{s-n+1} + k_{2n}(t,s)\, x_{s-n+1}^*$$
(28)

which implies that $\hat{\mathbf{x}}_{\mathrm{WL}}(t \mid \tau \le s)$ is of the form (5) with $\mathbf{K}(t,s)$ defined in (4). Moreover, its rows are of the form

$$\mathbf{K}_{[2i-1]}(t,s) = \left[ k_1(t-i+1,s),\, k_2(t-i+1,s),\, \ldots,\, k_{2n-1}(t-i+1,s),\, k_{2n}(t-i+1,s) \right]$$
$$\mathbf{K}_{[2i]}(t,s) = \left[ k_2^*(t-i+1,s),\, k_1^*(t-i+1,s),\, \ldots,\, k_{2n}^*(t-i+1,s),\, k_{2n-1}^*(t-i+1,s) \right]$$
(29)

for $i = 1, \ldots, n$. The inverse implication, 2) ⇒ 1), is checked similarly.

Finally, the proof of 2) ⇔ 3) is similar to the one given in Theorem 1 of [16].

Proof of Theorem 2

The proof of 2) ⇔ 3) is similar to that of Theorem 1, taking into account that $E[\mathbf{x}_t^b\, \mathbf{x}_s^{bH}] = \mathbf{R}(t+n-1,\, s+n-1)$. Now, we prove 3) ⇒ 1). Following a reasoning similar to that used in the proof of Theorem 1 in [16], we have that (7) is equivalent to the condition

$$\mathbf{K}(t,s) = \mathbf{K}(t,\tau)\, \mathbf{K}(\tau,s), \quad t \le \tau \le s$$

and thus,

$$\mathbf{K}^H(s,t) = \mathbf{K}^H(\tau,t)\, \mathbf{K}^H(s,\tau) = \left( \mathbf{K}(s,\tau)\, \mathbf{K}(\tau,t) \right)^H, \quad t \le \tau \le s$$

from which, applying Theorem 1, it follows that $x_t$ is a WLM($n$) signal. In a similar way the implication 1) ⇒ 3) is proven.

Proof of Proposition 1

Taking into account that $\hat{x}_{\mathrm{WL}}(t+j-i \mid \tau \le t) = x_{t+j-i}$ for $j \le i \le n-1$, we obtain (8) and (9). Likewise, (13) follows from (29), (8), and (9).

Now, from (6) we get

$$\mathbf{K}(t+j+1, t) = \mathbf{K}(t+j+1, t+j)\, \mathbf{K}(t+j, t), \quad j \ge 0$$

which, together with (13), proves (10), (11), and (12).

On the other hand, (14) and (15) can be proven following a similar reasoning to that of Theorem 2 in [16].

Finally, by using the Hilbert projection theorem and (5) we have

$$\mathbf{x}_{t+1} = \mathbf{K}_t\, \mathbf{x}_t + \mathbf{w}_t$$
(30)

where $\mathbf{w}_t = [w_t, w_t^*, 0, \ldots, 0]^T$ is the innovations process which, by construction, is uncorrelated with $\mathbf{x}_s$ for $t \ge s$. Thus,

$$\mathbf{R}_{t+1} = E\left[ \mathbf{x}_{t+1} \mathbf{x}_{t+1}^H \right] = E\left[ \left( \mathbf{K}_t \mathbf{x}_t + \mathbf{w}_t \right) \left( \mathbf{K}_t \mathbf{x}_t + \mathbf{w}_t \right)^H \right] = \mathbf{K}_t\, \mathbf{R}_t\, \mathbf{K}_t^H + \mathbf{Q}_t$$

with $E[\mathbf{w}_t \mathbf{w}_t^H] = \mathbf{Q}_t$ given in (16).

Proof of Theorem 3

If $x_t$ is a WLM($n$) signal then, from (13) and (30), we have

$$x_{t+1} = k_{1,t}\, x_t + k_{2,t}\, x_t^* + \cdots + k_{2n-1,t}\, x_{t-n+1} + k_{2n,t}\, x_{t-n+1}^* + w_t$$
(31)

where $w_t$ is the first component of $\mathbf{w}_t$. Hence, denoting $\mathbf{k}_t = \mathbf{K}_{[1]}^T(t+1,t) = [k_{1,t}, \ldots, k_{2n,t}]^T$, we obtain (17). On the other hand, from the Hilbert projection theorem and (7) we get

$$\mathbf{x}_t^b = \mathbf{K}(t+n-1,\, t+n)\, \mathbf{x}_{t+1}^b + \mathbf{w}_{t+1}^b$$
(32)

where $\mathbf{w}_t^b = [0, \ldots, 0, w_t^b, w_t^{b*}]^T$ is the backwards innovations process which, by construction, is uncorrelated with $\mathbf{x}_s^b$ for $t \le s$. Hence, $x_t = \mathbf{K}_{[2n-1]}(t+n-1,\, t+n)\, \mathbf{x}_{t+1}^b + w_{t+1}^b$ with $w_{t+1}^b$ the $(2n-1)$th component of $\mathbf{w}_{t+1}^b$. Thus, denoting $\mathbf{k}_{t+1}^{bT} = \mathbf{K}_{[2n-1]}(t+n-1,\, t+n)$, (18) is obtained.

Conversely, suppose that $x_t$ has the representation (17). Denote by $H$ the closed span generated by the set $\{x_\tau, x_\tau^*, \tau \le t\}$. By using Proposition 2.3.2 of [19], proving that $\hat{x}_{\mathrm{WL}}(t \mid \tau \le s) = \hat{x}_{\mathrm{WL}}(t \mid s, s-1, \ldots, s-n+1)$ for any $s < t$ is equivalent to proving that $\hat{x}_{\mathrm{WL}}(t+1 \mid \tau \le t) = \hat{x}_{\mathrm{WL}}(t+1 \mid t, t-1, \ldots, t-n+1)$ for all $t$. Thus, projecting (17) onto $H$ and taking Proposition 2.3.2 of [19] into account, we have

$$\hat{x}_{\mathrm{WL}}(t+1 \mid \tau \le t) = \mathbf{k}_t^T\, \mathbf{x}_t + \hat{w}_{\mathrm{WL}}(t \mid \tau \le t)$$

where $\hat{w}_{\mathrm{WL}}(t \mid \tau \le t)$ is the projection of $w_t$ onto $H$. Hypothesis (19) guarantees that $w_t$ is uncorrelated with $x_s$ and $x_s^*$ for $t \ge s$. Hence, $\hat{w}_{\mathrm{WL}}(t \mid \tau \le t) = 0$ and $x_t$ is a WLM($n$) signal.

The proof for the backwards representation (18) is similar.

Proof of Corollary 1

Since $x_t$ is a SOS signal, the matrices $\mathbf{R}(t+h, t)$, $h = 1, 2, \ldots$, are independent of $t$. Thus, from (4) we obtain $k_{i,t} = k_i$ for all $i$ and $t$. Finally, taking (31) into account we have

$$x_{t+1} = \sum_{i=0}^{n-1} k_{2i+1}\, x_{t-i} + \sum_{i=0}^{n-1} k_{2i+2}\, x_{t-i}^* + w_t$$

which gives (20) by defining $g_{1,i} = k_{2i+1}$ and $g_{2,i} = k_{2i+2}$.

Proof of Proposition 2

From (25) and Theorem 3 it follows that $\mathbf{x}_t^b$ has the representation (32). Then, by using (22), we obtain

$$\mathbf{K}_{t+1}^b = \mathbf{K}(t+n-1,\, t+n) = \mathbf{R}(t+n-1,\, t+n)\, \mathbf{R}_{t+n}^{-1} = \mathbf{R}^H(t+n,\, t+n-1)\, \mathbf{R}_{t+n}^{-1} = \mathbf{R}_{t+n-1}\, \mathbf{K}_{t+n-1}^H\, \mathbf{R}_{t+n}^{-1}$$

and thus the result follows.

Abbreviations

MMSE:

Minimum-mean square error

SL:

Strictly linear

WL:

Widely linear

WLM:

Widely linear Markov

WSM:

Wide-sense Markov.

References

1. Kailath T, Sayed AH, Hassibi B: Linear Estimation. Prentice Hall, New Jersey; 2000.

2. Poor HV, Chang Ch: A reduced-complexity quadratic structure for the detection of stochastic signals. J. Acoust. Soc. Am. 1985, 78(5):1652-1657. 10.1121/1.392803

3. Jones PW, Smith P: Stochastic Processes. An Introduction. Chapman & Hall/CRC, Boca Raton; 2010.

4. Doob JL: Stochastic Processes. John Wiley, New York; 1953.

5. Mandic DP, Goh VSL: Complex Valued Nonlinear Adaptive Filters. Noncircularity, Widely Linear and Neural Models. John Wiley, New York; 2009.

6. Adali T, Haykin S: Adaptive Signal Processing: Next Generation Solutions. Wiley-IEEE Press, New York; 2010.

7. Navarro-Moreno J, Estudillo MD, Fernández-Alcalá RM, Ruiz-Molina JC: Estimation of improper complex-valued random signals in colored noise by using the Hilbert space theory. IEEE Trans. Inf. Theory 2009, 55(6):2859-2867.

8. Picinbono B, Bondon P: Second-order statistics of complex signals. IEEE Trans. Signal Process. 1997, 45(2):411-420. 10.1109/78.554305

9. Goh VSL, Mandic DP: An augmented extended Kalman filter algorithm for complex-valued recurrent neural networks. Neural Comput. 2007, 19:1039-1055. 10.1162/neco.2007.19.4.1039

10. Navarro-Moreno J: ARMA prediction of widely linear systems by using the innovations algorithm. IEEE Trans. Signal Process. 2008, 56(7):3061-3068.

11. Cheong Took C, Mandic DP: A quaternion widely linear adaptive filter. IEEE Trans. Signal Process. 2010, 58(8):4427-4431.

12. Buzzi S, Lops M, Sardellitti S: Widely linear reception strategies for layered space-time wireless communications. IEEE Trans. Signal Process. 2006, 54(6):2252-2262.

13. Rubin-Delanchy P, Walden AT: Simulation of improper complex-valued sequences. IEEE Trans. Signal Process. 2007, 55(11):5517-5521.

14. Beutler F: Multivariate wide-sense Markov processes and prediction theory. Ann. Math. Stat. 1963, 34(2):424-438. 10.1214/aoms/1177704154

15. Mandrekar V: On multivariate wide-sense Markov processes. Nagoya Math. J. 1968, 33:7-19.

16. Kasprzyk A, Szczotka W: Covariance structure of wide-sense Markov processes of order k ≥ 1. Appl. Math. 2006, 33(2):129-143.

17. Dini DH, Mandic DP, Julier SJ: A widely linear complex unscented Kalman filter. IEEE Signal Process. Lett. 2011, 18(11):623-626.

18. Dini DH, Mandic DP: Class of widely linear complex Kalman filters. IEEE Trans. Neural Netw. Learn. Syst. 2012, 23(5):775-786.

19. Brockwell PJ, Davis RA: Time Series: Theory and Methods. Springer-Verlag, New York; 1991.


Author information


Correspondence to Rosa M Fernández-Alcalá.


Competing interests

The authors declare that they have no competing interests.


Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.



Cite this article

Espinosa-Pulido, J.A., Navarro-Moreno, J., Fernández-Alcalá, R.M. et al. Widely linear Markov signals. EURASIP J. Adv. Signal Process. 2012, 256 (2012). https://doi.org/10.1186/1687-6180-2012-256
