PARALIND-based identifiability results for parameter estimation via uniform linear array

Liu, Xu; Jiang, Ting; Yang, Longxiang; Zhu, Hongbo

doi:10.1186/1687-6180-2012-154

Research
Open access
Published: 20 July 2012

PARALIND-based identifiability results for parameter estimation via uniform linear array

Xu Liu¹,
Ting Jiang²,
Longxiang Yang¹ &
…
Hongbo Zhu¹

EURASIP Journal on Advances in Signal Processing volume 2012, Article number: 154 (2012) Cite this article

2379 Accesses
6 Citations
Metrics details

Abstract

This article applies PARAllel profiles with LINear Dependencies (PARALIND) model to analyze identifiability of parameter estimation in the presence of incoherent multipath via uniform linear array (ULA). New identifiability results are derived based on the uniqueness property of PARALIND model and structure property of ULA. With the strong properties of trilinear model, the proposed identifiability conditions for propagation parameter identification are superior to early studies. We give a new tradeoff between the number of receiving antennae and sampling diversity to ensure parameter identification. Furthermore, a new lower bound of the number of receiving antennae for identifiability is derived. It also shows that the identifiability results is not only determined by traditional factors, such as the number of receiving antennae, oversampling factor or the total number of transmitting paths, but also related to the structure of multipath of sources.

Introduction

Deterministic parameter estimation is one major problem in multi-sensor array system to effectively locate and track various types of signals to minimize interference and maximize intended signal reception, capitalizing various structure property of source signals or (and) received signals [1–4]. The identifiability issue of parameter determination signifies the existence of a unique desired solution under ideal operating conditions and lays the foundation of the capability of estimation techniques. Identifiability results are usually related to the analysis method of the data model and a given algorithm. ESPRIT algorithm, which takes advantage of the rotational invariance property of the uniform linear array (ULA), can be valid to estimate direction of arrivals (DOAs) uniquely only if the number of calibrated receiving antennae is more than the number of sources and all single path signals follow distinct direction to the receiving end [5]. Fonvard/backward spatial smoothing techniques pointed out that 3 K /2 sensor elements should be enough to identify K DOAs of coherent signals [6, 7]. In multipath scenario, wireless channel is characterized not only by its DOA but also time delay of the different propagation paths. Van der Veen proposed a joint angle and delay estimation algorithm based on the smoothing method and joint diagonalization technique. A lower bound of number of receiving antennae and oversampling diversity for parameters identification has been presented for the given algorithm [1, 8, 9]. Recently, Sidiropoulos and Liu [10] linked trilinear decomposition to array signal processing and guaranteed several improved identifiability results of parameter estimation based on PARAFAC analysis, which introduces a new perspective to parameters estimation.

Trilinear data analysis models, such as Tucker3, PARAFAC and PARAllel profiles with LINear Dependencies (PARALIND), were applied into signal processing area in recent years [11–17]. PARALIND model is a kind of trilinear model that was first proposed by Bro et al. [18–20]. It can further be viewed as a new family of PARAFAC models and was developed to extend its usage to problems with linearly dependent factors. Then De Lathauwer and A.L.F. de Almeida introduced the ’Block term decomposition’ and ’Constrained block PARAFAC’, respectively, which have similar formulations but natural extensions to PARALIND [21, 22]. This article links PARALIND analysis to model identifiability of parameter estimation via ULA. Received signals of the ULA, transmitted through incoherent multipath rays of sources with distinct angles and delays, are constructed into PARALIND model. New identifiability results are presented based on the uniqueness issue of PARALIND. The main contributions of this article are listed in the following:

(i)
A new ‘space-time’ tradeoff between the number of receiving antennae and sampling diversity for parameter identification is derived based on the strong uniqueness properties of trilinear model.
(ii)
We give a new lower bound of the number of receiving antennae to identify parameters in multipath propagation scenario, which is more superior to early studies.
(iii)
Our work shows that the identifiability results for parameters identification are not only determined by some traditional factors, such as the number of receiving antennae, sampling diversity or the number of paths, but also related to the structure of multipath of sources, which was not considered in previous work.

The rest of this article is organized as follows: Section “Data model” lays the data model of array signals in multipath propagation channel. Section “Uniqueness of paralind” gives the basic uniqueness property of PARALIND model. Section “Paralind-based identifiability results for parameter estimation” proposes the main results of parameters identifiability. Some lemmas and theorems will be guaranteed and analyzed. In the last section, we draw the conclusion.

Some notations will be used in this article. diag([ a , b , $\dots])$ denotes the diagonal matrix with scalar entries a,b,…while $blockdiag ([A, B, \dots])$ denotes the block diagonal matrix with matrix entries A, B, …. (·)^T and (·)^‡ stand for transpose and pseudo-inverse, respectively; vec(·) stacks the columns of its matrix argument in a vector; unvec(·) is the inverse operation of vec(·), $unvec (c, I, J) = [c (1 : J), c (J + 1 : 2 J), \dots, c ((I - 1) J : IJ)]$ . ⊗ is Kronecker product; ⊙ denotes the Khatri-Rao product, which is a column-wise Kronecker product. Define $A = [a_{1}, \dots, a_{R}] \in ℂ^{I \times R}$ , $B = [b_{1}, \dots, b_{R}] \in ℂ^{J \times R}$ , The Khatri-Rao product of A and B is:

\begin{align} A ⊙ B = a_{1} \otimes b_{1}, \dots, a_{R} \otimes b_{R} \end{align}

Data model

Figure 1 gives a schematic communication scenario with multipath channel. F sources are transmitting to an array with K antennae through multipath scattering propagation channel. g ( t ) is the impulse response which collects all temporal aspects, such as pulse shaping, transmitting filter and receiving filter. Signals of f th user followr_fdistinct paths on its way from source to receiver, referred as multipath rays with distinct DOA, transmitting delay and attenuation. The j th path of source f is parameterized by a triple (θ_{f , j},β_{f , j},τ_{f , j}), whereθ_{f , j}: DOAβ_{f , j}: complex path attenuationτ_{f , j}: transmitting delay

Assume that the ULA is used in the receiving end and the distance d between adjacent elements is equal to (or less than) half of the wavelength of signals. Define r to be the total number of paths of all sources, as $r = \sum_{f = 1}^{F} r_{f}$ . Let us conveniently index the multiple rays of sources from 1 to r , starting with all rays associated with the first source and then rays associated with the second source, and so on. Index r parameter triples as $\{(θ_{1}, β_{1}, τ_{1}), \dots, (θ_{r_{1}}, β_{r_{1}}, τ_{r_{1}}), \dots, (θ_{r}, β_{r}, τ_{r})\}$ . The array manifold matrixA_θ, time manifold matrixG_τ and path attenuation matrix Γ are defined as:

\begin{array}{l} A_{θ} = [a (θ_{1}), \dots, a (θ_{r_{1}}), \dots, a (θ_{r})] \in ℂ^{K \times r} \\ G_{τ} = [g (τ_{1}), \dots, g (τ_{r_{1}}), \dots, g (τ_{r})] \in ℂ^{P \times r} \\ Γ = diag (β_{1}, \dots β_{r_{1}}, \dots, β_{r}) \in ℂ^{r \times r} \end{array}

(1)

where

\begin{array}{l} a (θ) = {[1, e^{j \frac{2 Πd}{λ} sin (θ)}, \dots, e^{j \frac{2 Π (K - 1) d}{λ} sin (θ)}]}^{T} \\ g (τ) = {[g (\frac{1}{P} - τ), g (\frac{2}{P} - τ), \dots, g (1 - \frac{1}{P} - τ)]}^{T} \end{array}

(2)

Received signals can be formulated as follow [1]:

X = (G_{r} ⊙ A_{θ}) Γ {(SJ)}^{T}

(3)

where

X = [\begin{array}{l} x (0) & \cdot\cdot\cdot & x (N - 1) \\ ⋮ & ⋮ & ⋮ \\ x (\frac{P - 1}{P}) & \cdot\cdot\cdot & x (N - 1 + \frac{P - 1}{P}) \end{array}]

(4)

is a KP × N space-time data matrix collecting samples during N symbol periods with oversampling factor P in the receiving end. x is a K ×1 array receiving signal. S is a data matrix of size N × F , collecting N symbols of all users. J is a selection matrix that joins multipath associated with a given source.

J = [\begin{array}{l} 1_{r_{1}}^{T} & 0, \dots, & 0 \\ \cdot\cdot\cdot\pm \\ 0 & \dots, & 1_{r_{F}}^{T} \end{array}]

(5)

where1_m denotes an m ×1 vector with elements 1. Equation (3) is a classical parameterized data model named “incoherent multipath with small delay spread” [1, 10]. The propagation parameters,θ_iandτ_i, $i = 1, \dots, r$ , are involved in array manifold matrixA_θ and time manifold matrixG_τ. The multipath structure is indicated by J.

The time delay τ is usually difficult to estimate from g ( t − τ ) directly. An alternative approach is to map τ into phase shift ϕ in the frequency domain by discrete Fourier transform (DFT) method [8]. Assume that g ( t ) is band limited and the sample rate is at or above the Nyquist rate. Take P points DFT of each antenna output over single symbol period. Then the following model is obtained [1]:

\bar{X} = (F_{ϕ} ⊙ A_{θ}) Γ {(SJ)}^{T}

(6)

where

F_{ϕ} = [\begin{array}{l} 1 & \dots & 1 \\ ϕ_{1} & \dots & ϕ_{r} \\ ⋮ & ⋮ & ⋮ \\ ϕ_{1}^{P - 1} & \dots & ϕ_{r}^{P - 1} \end{array}], ϕ_{i} = e^{- j 2 Π τ_{i} / P}

(7)

The advantage of (7) versus (3) is that, by using DFT method, delays are transformed into certain phase progressions andG_τ is converted into a Vandermonde matrixF_ϕ, which can provide facility for parameters estimation. Although DFT method may cause some extra error during parameter estimation, van der Veen et al. [8] has informed that this kind of error is very small comparing to the estimation errors that will occurred in the presence of noise.

According to [18], Equation (6) can be viewed as one slice formulation of PARALIND model. The link to PARALIND implies that generic PARALIND model fitting algorithms are directly applicable to deterministic parameter estimation [20]. However, the identifiability of the model pertains to the capability of recovering parameters in the absence of noise. The main work of this article is to investigate new identifiability results for parameter estimation in PARALIND decomposition perspective. Some novel results, such as the tradeoff between the number of receiving antennae and sampling diversity and the lower bound of receiving antennae for parameter identification, are also derived. Firstly, we give the basic uniqueness of PARALIND model.

Uniqueness of PARALIND

The uniqueness of the PARALIND model lays the foundation of its applications. Because of the linear dependence of the loading factors, PARALIND model does not follow directly from the uniqueness property of PARAFAC, but only has partial uniqueness (or essential uniqueness, defined in [23]), which depends on the specifics of the imposed dependency structure along with the adequacy of the factor variation information provided by a given set of data [19]. The uniqueness property of PARALIND was first proposed in [18] and improved by Stegeman and de Almeida [24]. De Lathauwer [23] has given an essential uniqueness theorem more quantitatively. Two new concepts are needed in this theorem.

Definition 1

( k -rank) [25]: Consider a matrix B of size I × J . If every l columns of B are linearly independent, but this does not hold for every l + 1 columns, then the k-rank of B is l , denoted ask_B= l .

Definition 2

( k’ -rank) [23]: Assume a partitioned matrix $A = [A_{1}, \dots, A_{M}]$ . The k’ -rank of A, denoted as ${rank}_{k^{″}} (A)$ or $k_{A}^{″}$ , is the maximal number r such that any set of r sub-matrices of A yields a set of linearly independent columns.

Theorem 1

[23]: Rewrite one slice matrix of PARALIND model

X = (A ⊙ B) {(CH)}^{T}

where $A \in ℂ^{I \times r}, B \in ℂ^{J \times r}, C \in ℂ^{K \times F}$ . H is dependence matrix

\begin{align} H = [\begin{array}{l} 1, \dots, 1, 0, \dots, 0, \dots, 0, \dots, 0 \\ 0, \dots, 0, 1, \dots, 1, \dots, 0, \dots, 0 \\ \cdot\cdot\cdot \\ \underset{r_{1}}{\underset{ï¸¸}{0, \dots, 0}}, \underset{r_{2}}{\underset{ï¸¸}{0, \dots, 0}}, \dots, \underset{r_{F}}{\underset{ï¸¸}{1, \dots, 1}} \end{array}] = [H_{1}, H_{2}, \dots, H_{F}] \end{align}

(8)

where $H_{f} \in ℂ^{F \times r_{f}}, f = 1, ., F$ are sub-matrices of H and $r = \sum_{f = 1}^{F} r_{f}$ . A B are partitioned as: $A = [A_{1}, \dots, A_{F}]$ , $B = [B_{1}, \dots, B_{F}]$ with the sub-matrices $A_{f} \in ℂ^{I \times r_{f}}, B_{f} \in ℂ^{J \times r_{f}}, f = 1, \dots, F$ , compatible with the block structure of H. Suppose that the condition:

k_{A}^{″} + k_{B}^{″} + k_{C} \geq 2 F + 2

(9)

holds and we have an alternative decomposition of X, represented by $(\hat{A}, \hat{B}, \hat{C})$ with $k_{\hat{A}}^{″}$ and $k_{\hat{B}}^{″}$ maximal under the given dimensionality constraints. Then there holds $\hat{A} = A π_{a} Δ_{a}, \hat{B} = B π_{b} Δ_{b}$ , whereπ_a,π_b are block permutation matrices andΔ_a,Δ_bare square nonsingular block-diagonal matrices, compatible with the block structure of A and B.

Theorem 1 presents the uniqueness properties of A and B. The uniqueness of matrix C is also studied in [18, 21]. Bro et al. [18] gives a demonstration of the uniqueness property of C, provided that A, B and C are full rank. Furthermore, de Lathauwer [21] gives the identifiability result of C more quantitatively in terms of high-order block tensor decomposition.

Consider Theorem 1 in sub-matrix formulation. Partition $\hat{A}$ and $\hat{B}$ to be compatible with the block structure of A and B, as: $\hat{A} = [{\hat{A}}_{1}, \dots, {\hat{A}}_{F}]$ , $\hat{B} = [{\hat{B}}_{1}, \dots, {\hat{B}}_{F}]$ . According to Theorem 1, it directly follows:

\begin{matrix} [{\hat{A}}_{1}, \dots, {\hat{A}}_{F}] & = [A_{1}, \dots, A_{F}] π_{a} Δ_{a} \\ = [A_{1}, \dots, A_{F}] blockdiag (U_{1}, \dots, U_{F}) \end{matrix}

(10)

\begin{matrix} [{\hat{B}}_{1}, \dots, {\hat{B}}_{F}] & = [B_{1}, \dots, B_{F}] π_{b} Δ_{b} \\ = [B_{1}, \dots, B_{F}] blockdiag (V_{1}, \dots, V_{F}) \end{matrix}

where $U_{f} \in ℂ^{r_{f} \times r_{f}}, V_{f} \in ℂ^{r_{f} \times r_{f}}, f = 1, \dots, F$ are 2 F nonsingular square matrices. It follows

\begin{array}{l} {\hat{A}}_{m} = A_{m} U_{m} \\ {\hat{B}}_{m} = B_{m} V_{m}, m = 1, \dots F \end{array}

(11)

Equation (11) gives another representation for uniqueness property of PARALIND model. It shows that the column space ofA_mandB_mare unique. However, it also implies when condition (9) is satisfied, mode matrices A and B are suffered from rotation ambiguity, characterized byU_fandV_f. Bro et al. [20] has pointed out that PARALIND model can also give uniqueness results if some of its mode matrices have theoretically motivated structural constraints. Due to the structure property of multi-sensor array, we give the PARALIND-based identifiability results for parameter estimation in the next section.

PARALIND-based identifiability results for parameter estimation

As we mentioned, data model (6) can be linked to PARALIND analysis. Array manifoldA_θ, time manifoldF_ϕ, data matrix S and selection matrix J play the roles of A, B, C and H in Theorem 1, respectively. Since the attenuation matrix Γ only leads to column scaling ofA_θandF_ϕ, which will not affect the identifiability results. Therefore, we simplify Γ to be an identity matrix during the following discussion. Then the data model is simplified as:

\bar{X} = (F_{ϕ} ⊙ A_{θ}) {(SJ)}^{T}

(12)

With the structure ofA_θ andF_ϕ, if these two matrices are uniquely determined, parameters θ and τ are determined naturally. According to Theorem 1, k -rank and k ’-rank play important roles in the uniqueness issue of PARALIND. Firstly, we present two lemmas to determine the k -rank and k ’-rank of a Vandermonde matrix.

Lemma 1

( k -rank of Vandermonde matrix) [26]

Consider an I × r Vandermonde matrix A with distinct nonzero generators $α_{1}, α_{2}, \dots, α_{r} \in ℂ$

A = [\begin{array}{l} 1, & \dots, & 1 \\ α_{1}, & \dots, & α_{r} \\ ⋮ \\ α_{1}^{I - 1}, & \dots, & α_{r}^{I - 1} \end{array}]

(13)

A is not only full rank but also full k-rank, and $k_{A} = r_{A} = min (I, r)$ .

Lemma 2

( k ’-rank of Vandermonde matrix)

Consider the Vandermonde matrix A in (13). Partition A to F sub-matrices, as $A = [A_{1}, A_{2}, \dots, A_{F}]$ , whereA_f is of size I ×r_f and $r = r_{1} + r_{2} + \cdot\cdot\cdot + r_{F}$ . Resort $r_{1}, \dots, r_{F}$ in descent order and assume that $r_{1} > r_{2} > \cdot\cdot\cdot > r_{F}$ . The k ’-rank of A can be determined as:

k_{A}^{″} = K, where \sum_{f = 1}^{K} r_{f} \leq I < \sum_{f = 1}^{K + 1} r_{f}

(14)

Note that the k ’-rank of A is determined not only by its dimension but also the partition structure. Ifr₁=r₂=r_F=1,k_A=k_A^″.

Proof

See Appendix. □

PARALIND-based identifiability result

According to (6), bothA_θandF_ϕ are Vandermonde matrices. Capitalizing on the property of PARALIND model and Vandermonde structure, we have the following theorem.

Theorem 2

:

Consider data model (12)

\bar{X} = (F_{ϕ} ⊙ A_{θ}) {(SJ)}^{T}

PartitionF_ϕandA_θ to F sub-matrices compatible with the structure of J, as: $F_{ϕ} = [F_{ϕ}^{1}, \dots, F_{ϕ}^{F}]$ and $A_{θ} = [A_{θ}^{1}, \dots, A_{θ}^{F}]$ , where $F_{ϕ}^{f}$ is P ×r_f and $A_{θ}^{f}$ is K ×r_f. Suppose that the condition

k_{A_{θ}}^{″} + k_{F_{ϕ}}^{″} + k_{S} \geq 2 F + 2

(15)

holds. ThenF_ϕ andA_θ can be uniquely determined from $\bar{X}$ . The related parameters, DOA θ and delay spread τ , are identifiable.

Proof

See Appendix. □

Although condition (15) in Theorem 2 and condition (9) in Theorem 1 are identical, the identifiability results of these two theorems are different. Theorem 1 shows thatA_θ andF_ϕonly have “column-space” uniqueness due to the rotation ambiguity in their sub-matrices when condition (9) is satisfied. However, Theorem 2 indicates that Vandermonde matricesA_θ andF_ϕ can be uniquely determined from $\bar{X}$ (no rotation ambiguity) under condition (15). According to the structure of array-manifold matrixA_θ and time-manifoldF_ϕ, the elements of the first row in these two matrices are equal to 1. The scaling ambiguous of the estimated matrices can be removed by normalizing the elements ofA_θandF_ϕwith respect to elements of the first row during parameter estimation.

Remark 1

: As a special case of (15), we assume that data matrix S is full k -rank, ask_S= F . It is achievable when the receiving antennae collect enough symbols for parameter estimation. Then condition (15) becomes $k_{A_{θ}}^{″} + k_{F_{ϕ}}^{″} \geq F + 2$ . According to the definition of k ’-rank, the maximum k ’-rank ofA_θ orF_ϕ is F . Then it requires that $min (k_{A_{θ}}^{″}, k_{F_{ϕ}}^{″}) \geq 2$ . This lower bound is similar to the identifiability requirement of PARAFAC model, which uses k -rank instead of k ’-rank (see Ref. [11]). Furthermore, according to Lemma 2, the k ’-rank ofA_θ andF_ϕ can be represented by $r_{1}, \dots, r_{F}$ . Then the minimum value of P and K can be determined as:

min (P, K) \geq \sum_{i = 1}^{2} r_{i}

(16)

where $r_{1}, \dots, r_{F}$ are descent sorted. Condition (16) shows an interesting result that, to ensure identifiability of parameters estimation, the minimum number of receiving antennae K and oversampling factor P is related to not only the number of sources, but also the number of paths of “some” sources.

Remark 2

Condition (15) shows that the identifiability of data model is determined by $k_{A_{θ}}^{″}$ and $k_{F_{ϕ}}^{″}$ . Lemma 2 also indicates that the k ’-rank ofA_θandF_ϕare related to the structure of their sub-matrices, which is compatible with the multipath structure of sources. Therefore, the identifiability result based on PARALIND analysis are not only determined by traditional factors, such as P , K and r , but also related to the structure of multipath, denoted as J. The following example can show this phenomenon:

Assume that the number of receiving antennae K =4 and the oversampling factor P =6. Six paths from three sources are arriving at the receiving end. Consider the following two cases:

(1)
The number of paths of each source is: r ₁=2, r ₂=2, r ₃=2. In this case, $k_{A_{θ}}^{″} = 2$ and $k_{F_{ϕ}}^{″} = 3$ . It has
$k_{A_{θ}}^{″} + k_{F_{ϕ}}^{″} = 5 \geq F + 2 = 5$

According Theorem 2, parameter identification is achievable.
(2)
The number of paths of each source is: r ₁=3, r ₂=2, r ₃=1. In this case, $k_{A_{θ}}^{″} = 1$ and $k_{F_{ϕ}}^{″} = 3$ . It has
$k_{A_{θ}}^{″} + k_{F_{ϕ}}^{″} = 4 < F + 2 = 5$

Theorem 2 is violated.

It shows that, although the receiving antennae number, oversampling factor and the multipath remain the same, the identifiability results may be different due to the multipath structure. Furthermore, [20] has shown that the dependence matrix J can be uniquely obtained in the PARALIND model by trilinear decomposition method. Therefore, the receive array can get the information of multipath directly from the data model.

PARALIND-based identifiability result with smoothing technique

The identifiability result of (15) can be alleviated by introducing spatial and temporal smoothing techniques from taking advantage of Vandermonde structure in array manifold matrixA_θ and time manifold matrixF_ϕ. Take ’temporal smoothing’ for example. Rewrite (15)

\bar{X} = (F_{ϕ} ⊙ A_{θ}) {(SJ)}^{T}

Construct L matrices of size MK × N

\begin{array}{l} {\bar{X}}^{(l)} = \bar{X} ((l - 1) K + 1 : (M + l - 1) K, :) \\ = [\begin{array}{l} \bar{X} ((l - 1) K + 1 : lK, :) \\ \bar{X} (lK + 1 : (l + 1) K, :) \\ \bar{X} ((l + 1) K + 1 : (l + 2) K, :) \\ ⋮ \\ \bar{X} ((l + M - 2) K + 1 : (l + M - 1) K, :) \end{array}] \\ = [\begin{array}{l} F_{ϕ} (l, :) ⊙ A_{θ} \\ F_{ϕ} (l + 1, :) ⊙ A_{θ} \\ F_{ϕ} (l + 2, :) ⊙ A_{θ} \\ ⋮ \\ F_{ϕ} (l + M - 1, :) ⊙ A_{θ} \end{array}] {(SJ)}^{T} \\ = (F_{ϕ} (l : l + M - 1, :) ⊙ A_{θ}) {(SJ)}^{T}, l = 1, \dots, L \end{array}

(17)

where A( a : b ,:) stands for rows a to b (inclusive) of A. L is defined as smoothing factor and M = P − L + 1. Due to the Vandermonde structure, it holds that

\begin{array}{l} F_{ϕ} (l : l + M - 1, :) = [\begin{array}{l} ϕ_{1}^{l - 1} & \dots & ϕ_{r}^{l - 1} \\ ⋮ & ⋮ & ⋮ \\ ϕ_{1}^{l + M - 2} & \dots & ϕ_{r}^{l + M - 2} \end{array}] \\ = [\begin{array}{l} 1 & \dots & 1 \\ ϕ_{1} & \dots & ϕ_{r} \\ ⋮ & ⋮ & ⋮ \\ ϕ_{1}^{M - 1} & \dots & ϕ_{r}^{M - 1} \end{array}] diag ([ϕ_{1}^{l - 1}, ϕ_{2}^{l - 1}, \dots, ϕ_{r}^{l - 1}]) \\ = F_{ϕ}^{M} diag (ϕ^{l - 1}) \end{array}

(18)

whereϕ l⁻¹denotes $[ϕ_{1}^{l - 1}, ϕ_{2}^{l - 1}, \dots, ϕ_{r}^{l - 1}]$ and $F_{ϕ}^{M} = F_{ϕ} (1 : M, :)$ , Substitute (18) into (17)

\begin{matrix} {\bar{X}}^{(l)} & = ((F_{ϕ}^{M} diag (ϕ^{l - 1})) ⊙ A_{θ}) {(SJ)}^{T} \\ = ((F_{ϕ}^{M} ⊙ A_{θ}) diag (ϕ^{l - 1})) {(SJ)}^{T} \end{matrix}

(19)

Lay out L matrices ${\bar{X}}^{(l)}, l = 1, \dots, L$ vertically and construct a new matrix $\tilde{X}$ of size LMK × N

\begin{align} \tilde{X} & = [\begin{array}{l} {\bar{X}}^{(1)} \\ ⋮ \\ {\bar{X}}^{(L)} \end{array}] = [\begin{array}{l} ((F_{ϕ}^{M} ⊙ A_{θ}) diag (ϕ^{0})) \\ ⋮ \\ ((F_{ϕ}^{M} ⊙ A_{θ}) diag (ϕ^{L - 1})) \end{array}] {(SJ)}^{T} \\ = (F_{ϕ}^{L} ⊙ (F_{ϕ}^{M} ⊙ A_{θ})) {(SJ)}^{T} \end{align}

(20)

where $F_{ϕ}^{L} = F_{ϕ} (1 : L, :)$ . Define $A_{θ}^{M} = F_{ϕ}^{M} ⊙ A_{θ}$ . It follows that

\tilde{X} = (F_{ϕ}^{L} ⊙ A_{θ}^{M}) {(SJ)}^{T}

(21)

The smoothing data $\tilde{X}$ can also be modeled as PARALIND. The main difference between (21) and (15) is that model matricesF_ϕ andA_θ in (15) is replaced by $F_{ϕ}^{L}$ and $A_{θ}^{M}$ . Parameters can also be determined if $F_{ϕ}^{L}$ and $A_{θ}^{M}$ are uniquely decomposed from $\tilde{X}$ . Before discussing the identifiability result of this smoothed model, we need the following lemma:

Lemma 3

( k ’-rank of Khatri-Rao product of Vandermonde matrix)

Consider two Vandermonde matrices $A \in ℂ^{I \times r}$ and $B \in ℂ^{J \times r}$ with distinct nonzero generators. A and B are partitioned as $A = [A_{1}, A_{2}, \dots, A_{F}]$ and $B = [B_{1}, B_{2}, \dots, B_{F}]$ , whereA_f is of size I ×r_f andB_f is of size J ×r_f, respectively, and $r = r_{1} + r_{2} + \cdot\cdot\cdot + r_{F}$ . Resort $r_{1}, \dots, r_{F}$ in descent order, as $r_{1} > r_{2} > \cdot\cdot\cdot > r_{F}$ . If

I + J \geq \sum_{f = 1}^{K} r_{f} + 1

(22)

then $k_{A ⊙ B}^{″} \geq K$ .

Proof

See Appendix □

Theorem 3

: Consider the smoothed data model (21)

\tilde{X} = (F_{ϕ}^{L} ⊙ A_{θ}^{M}) {(SJ)}^{T}

where $A_{θ}^{M} = F_{ϕ}^{M} ⊙ A_{θ}$ . $F_{ϕ}^{L}$ and $F_{ϕ}^{M}$ , presented in (31) and (28), are of size $L \times \sum_{f = 1}^{F} r_{f}$ and $M \times \sum_{f = 1}^{F} r_{f}$ , respectively. Assume that L is selected as $L = \sum_{f = 1}^{R} r_{f}, R \in [2, F]$ and S is full k -rank. Suppose that the conditions

\{\begin{array}{l} P + K \geq \sum_{i = 1}^{R} r_{i} + \sum_{i = 1}^{F + 2 - R} r_{i}, R \in [2, F] \\ min (K, P) \geq 2 \end{array}

(23)

hold. Then $F_{ϕ}^{L}$ , $F_{ϕ}^{M}$ andA_θ can be uniquely determined from $\tilde{X}$ . Parameters are identifiable.

Proof

See Appendix □

Remark 3

:: Theorem 3 is guaranteed by smoothing matrixF_ϕbased on its Vandermonde structure. Note that array manifoldA_θ is also a Vandermonde matrix. Duality simplifies to symmetry, similar formulation can be obtained by smoothingA_θ, known as ‘spatial smoothing’:

\hat{X} = (A_{θ}^{L} ⊙ F_{ϕ}^{M}) {(SJ)}^{T}

(24)

where $F_{ϕ}^{M} = F_{ϕ} ⊙ A_{θ}^{M}$ , $A_{θ}^{M} = A_{θ} (1 : M, :)$ , and $A_{θ}^{L} = A_{θ} (1 : L, :)$ . Note that data model (24) has the same formulation as (21). It implies that the identifiability results of Theorem 3 is also available whenA_θ is smoothed instead, while R is the k ’-rank of $A_{θ}^{L}$ .

Remark 4

: Condition (23) gives a new tradeoff between the number of sensors K and oversampling factor P , referred as “space-time” tradeoff, to achieve parameter identifiability. As a special case of (23), two antennae are sufficient for r path when the oversampling factor P is more than $\sum_{i = 1}^{R} r_{i} + \sum_{i = 1}^{F + 2 - R} r_{i} - 2$ . In Theorem 3, The lower bound of choice of receiving antennae K and sampling diversity P is much superior to that in Theorem 2, discussed in Remark 1. It implies that smoothing technique can further improve the identifiability results of data model. It also shows that the system is capable of supporting many more paths than sensors, provided enough sampling diversity. As the complete symmetry in the roles of P and K , limited samples are also available for r path when enough antennae are used in the receiving end.

Remark 5

: Rewrite condition (23)

P + K \geq \sum_{i = 1}^{R} r_{i} + \sum_{i = 1}^{F + 2 - R} r_{i}, R \in [2, F]

(25)

Similar to Remark 2, the value of P plus K is also related to the structure of multipath, denoted as $r_{1}, \dots, r_{F}$ . Moreover, it is of interest that the lower bound of P plus K is varied along R , the k ’-rank of $F_{ϕ}^{L}$ (or $A_{θ}^{L}$ in (24)). Now, we prove that the minimum lower bound of (25) can be achieved in the condition of R =2 or R = F . Define a function of variable R : $f (R) = \sum_{i = 1}^{R} r_{i} + \sum_{i = 1}^{F + 2 - R} r_{i}$ . Let Δf ( R )= f ( R )− f (2), R ∈(2, F ]. Then we wish to prove that

\{\begin{array}{l} Δf (R) \geq 0, R \in (2, F) \\ f (F) = f (2) \end{array}

It follows

\begin{array}{l} Δf (R) = \sum_{i = 1}^{R} r_{i} + \sum_{i = 1}^{F + 2 - R} r_{i} - \sum_{i = 1}^{2} r_{i} - \sum_{i = 1}^{F} r_{i} \\ = (\sum_{i = 1}^{R} r_{i} - \sum_{i = 1}^{2} r_{i}) - (\sum_{i = 1}^{F} r_{i} - \sum_{i = 1}^{F + 2 - R} r_{i}) \\ = \sum_{i = 3}^{R} r_{i} - \sum_{i = F - R + 3}^{F} r_{i} \\ = \sum_{i = 1}^{R - 2} (r_{R - i + 1} - r_{F - i + 1}) \end{array}

Note thatr₁≥r₂≥,···,≥r_Fand R ≤ F . It is clear thatr_{R − i + 1}≥r_{F − i + 1}, Therefore, Δf ( R )≥0. Since $Δf (F) = \sum_{i = 3}^{F} r_{i} - \sum_{i = 3}^{F} r_{i} = 0$ , we have f ( F )= f (2). This result gives a relationship between the smoothing factor R and the minimum value of P plus K in parameter estimation when r path is considered. Since the cost of parameter estimation is usually related to the number of receiving antenna and the oversampling factor, the result also implies that the cost of parameter estimation can be decreased when the smoothing factor is properly selected.

Remark 6

:: If the multiplication SJ is considered as a whole matrix, the data model (12) can be modeled as PARAFAC. However, since matrix multiplication SJ has collinear columns due to the structure of J. According to the uniqueness property of PARAFAC [25], uniqueness of the given model cannot be guaranteed so that meaningful results of parameter identifiability may not be derived directly based on PARAFAC model. Sidiropoulos and Liu [10] utilizes smoothing technique to improve the k-rank of SJ and gives the identifiability results of (12) based on PARAFAC model. Here we will show that condition (23) is superior to that in [10]. Define matrix C=SJ. Sidiropoulos and Liu [10] presented that the model (12) is identifiable, provided that

K + P + k_{C} \geq 2 r + 2

(26)

Note that C has collinear columns. According to the definition of k -rank, the k -rank of C is equal to 1. Condition (26) becomes K + P ≥2 r + 1. Because $r = \sum_{i = 1}^{F} r_{i}$ . We have

2 r + 1 > \sum_{i = 1}^{R} r_{i} + \sum_{i = 1}^{F + 2 - R} r_{i}

It can be concluded that condition (23) is more relaxed than condition (26). The following example can guarantee the above improvement. Assume that four users are in consideration. The multipath of users arer₁=5,r₂=3,r₃=2,r₄=1, respectively. Figure 2 depicts the minimum requirement of receiving antennae K in these two conditions along with oversampling factor P varied.

Conclusion

This article has discussed identifiability issue of deterministic parameters estimation via multi-sensor array based on trilinear decomposition theory. With the uniqueness property of PARALIND model, new identifiability results are guaranteed, which are more superior to early studies. According to the proposed identifiability conditions, a new “space-time” tradeoff between the number of receiving antennae and sampling diversity for parameters identification is presented, and it shows that even two receiving antennae are sufficient for identifying parameters of r path, provided sufficient sampling diversity available. Besides, we find that the identifiability conditions is not only determined by some traditional factors, such as the number of receiving antennae, oversampling factors or number of paths, but also related to the multipath structure of each source, which was not considered in previous work.

Appendix

Proof of Lemma 2

According to the definition of k ’-rank, if $k_{A}^{″} = K$ , it means that any K sub-matrices of A yield a set of linearly independent columns but it cannot support K + 1 sub-matrices. Let $\tilde{A}$ be an $I \times \sum_{i = 1}^{K} {\tilde{r}}_{i}$ matrix including any K sub-matrices of A, as $\tilde{A} = [\tilde{A_{1}}, \dots, \tilde{A_{K}}]$ , where $\tilde{A_{1}}, \dots, \tilde{A_{K}}$ are randomly selected from $A_{1}, \dots, A_{F}$ and $\tilde{A_{i}} \neq \tilde{A_{j}}, i, j \in [1, F], i \neq j$ . Note that $\tilde{A_{k}}$ is of size $I \times {\tilde{r}}_{k}$ with Vandermonde structure and in most cases we have $\tilde{A_{k}} \neq A_{k}$ and ${\tilde{r}}_{k} \neq r_{k}$ . With the assumption of $r_{1} > r_{2} > \cdot\cdot\cdot > r_{F}$ , it follows that $\sum_{i = 1}^{K} {\tilde{r}}_{i} \leq \sum_{i = 1}^{K} r_{i}$ . Since $I \geq \sum_{f = 1}^{K} r_{f}$ guarantees $I \geq \sum_{f = 1}^{K} {\tilde{r}}_{f}$ . According to Lemma 1, $\tilde{A}$ is full column rank. It implies that any K sub-matrices of A are guaranteed to yield a set of linearly independent columns so that $k_{A}^{″} \geq K$ . On the other hand, define $\hat{A}$ as $\hat{A} = [A_{1}, \dots, A_{K}, A_{K + 1}]$ . $\hat{A}$ is a $I \times \sum_{i = 1}^{K + 1} r_{i}$ Vandermonde matrix with K + 1 sub-matrices of A and, $k_{\hat{A}} = min (I, \sum_{i = 1}^{K + 1} r_{i}) = I$ . It means that we can find K + 1 sub-matrices of A which yields a set of dependent columns, so that $k_{A}^{″} < K + 1$ . Therefore, $k_{A}^{″} = K$ . The proof is complete. __

Proof of Theorem 2

Before proving Theorem 2, we need the following Lemma:

Lemma 4

[27] Consider a matrix decomposition X= A B^T, where $A \in ℂ^{I \times F}$ is a Vandermonde matrix with distinct nonzero generator and $B \in ℂ^{J \times F}$ is a ‘tall’ or ‘square’ matrix with full column rank. Suppose that the condition $I \geq F + 1$ holds and then A and B can be uniquely decomposed from X under permutation and scaling ambiguous. It means that any other alternative decomposition of X, denoted as $X = \bar{A} \bar{B^{T}}$ in which $\bar{A} \in ℂ^{I \times F}$ has Vandermonde strucure and $\bar{B} \in ℂ^{J \times F}$ is full column rank, is related to A and B via $\bar{A} = A π_{A} Δ_{A}, \bar{B} = B π_{B} Δ_{B}$ , whereπ_Aπ_Bare permutation matrices andΔ_AΔ_Bare diagonal scaling matrices with nonzero elements.

According to Theorem 1, when condition (15) is followed, we have ${\hat{F}}_{ϕ}^{f} = F_{ϕ}^{f} U_{f}, {\hat{A}}_{θ}^{f} = A_{θ}^{f} V_{f}, f = 1, \dots F$ , where $U_{1}, V_{1}, \dots, U_{F}, V_{F}$ are 2 F nonsingular square matrices. Note that any subset of columns of a Vandermonde matrix forms a Vandermonde matrix. Therefore, $F_{ϕ}^{f}, A_{θ}^{f}, f = 1, \dots, F$ are all with Vandermonde structure. Lemma 4 provides that $F_{ϕ}^{1}, A_{θ}^{1}, \dots, F_{ϕ}^{F}, A_{θ}^{F}$ can be uniquely determined from ${\hat{F}}_{ϕ}^{1}, {\hat{A}}_{θ}^{1}, \dots, {\hat{F}}_{ϕ}^{F}, {\hat{A}}_{θ}^{F}$ only if the following conditions are satisfied:

\{\begin{array}{l} P \geq r_{f} + 1 \\ K \geq r_{f} + 1 \end{array} f = 1, \dots, F

(27)

Assume that $r_{1} > r_{2} > \cdot\cdot\cdot > r_{F}$ . Then conditions (27) becomes

\{\begin{array}{l} P \geq r_{1} + 1 \\ K \geq r_{1} + 1 \end{array}

(28)

Remark 1 derives that the minimum of P and K should be larger thanr₁ +r₂. Sincer₂ is no less than 1(multiple source assumption), it means that $min (P, K) \geq r_{1} + 1$ so that (15) is a sufficient condition for (28). Therefore, $F_{ϕ}^{f}, A_{θ}^{f}$ can be uniquely determined from ${\hat{F}}_{ϕ}^{f}, {\hat{A}}_{θ}^{f}$ when condition (15) is satisfied. ThenF_ϕ,A_θ can be uniquely obtained from $F_{ϕ}^{f}, A_{θ}^{f}$ . The proof is complete. $∎$

Proof of Lemma 3

We need the following Lemma:

Lemma 5

(full rank of Khatri-Rao product) [28]

Consider $A ⊙ B : = [a_{1} \otimes b_{1}, \dots, a_{F} \otimes b_{F}]$ , where A is of size I × F , B is of size J × F and $a_{f}, b_{f}, f = 1, \dots, F$ are columns of A, B . Ifr_A +k_B≥ F + 1 orr_B +k_A≥ F + 1 holds, then A⊙B is full column rank, asr_A⊙B= F .

Assume A, B are Vandermonde matrices. According to Lemma 1, $r_{A} = k_{A} = min (I, F), r_{B} = k_{B} = min (J, F)$ . As a special case of Lemma 5, the full rank condition of A⊙B with Vandermonde assumption is

min (I, F) + min (J, F) \geq F + 1

(29)

Randomly select K sub-matrices $\tilde{C_{1}}, \dots, \tilde{C_{K}}$ from C and construct a new matrix

\begin{align} \tilde{C} & = [{\tilde{C}}_{1}, \dots, {\tilde{C}}_{K}] = [{\tilde{A}}_{1} ⊙ {\tilde{B}}_{1}, \dots, {\tilde{A}}_{K} ⊙ {\tilde{B}}_{K}] \\ = [{\tilde{A}}_{1}, \dots, {\tilde{A}}_{K}] ⊙ [{\tilde{B}}_{1}, \dots, {\tilde{B}}_{K}] = \tilde{A} ⊙ \tilde{B} \end{align}

where $\tilde{A_{f}}$ is $I \times {\tilde{r}}_{f}$ , $\tilde{B_{f}}$ is $J \times {\tilde{r}}_{f}$ , $\tilde{A}$ is $I \times \sum_{f = 1}^{K} {\tilde{r}}_{f}$ and $\tilde{B}$ is $J \times \sum_{f = 1}^{K} {\tilde{r}}_{f}$ . Note that here $\tilde{A_{f}} \in \{A_{1}, \dots, A_{F}\}, {\tilde{B}}_{f} \in \{B_{1}, \dots, B_{F}\}$ , and ${\tilde{A}}_{i} \neq {\tilde{A}}_{j}, {\tilde{B}}_{i} \neq {\tilde{B}}_{j}, i \neq j$ . Similar to the proof procedure of Lemma 2, we only need to show that $\tilde{C}$ is full column rank under condition (22). It is equivalent to prove:

min (I, \sum_{f = 1}^{K} {\tilde{r}}_{f}) + min (J, \sum_{f = 1}^{K} {\tilde{r}}_{f}) \geq \sum_{f = 1}^{K} {\tilde{r}}_{f} + 1

In the light of (30), consider the following cases

(1)
$I \geq \sum_{f = 1}^{K} {\tilde{r}}_{f}, J \geq \sum_{f = 1}^{K} {\tilde{r}}_{f}$ . Then
$\begin{align} min (I, \sum_{f = 1}^{K} {\tilde{r}}_{f}) & + min (J, \sum_{f = 1}^{K} {\tilde{r}}_{f}) \\ = \sum_{f = 1}^{K} {\tilde{r}}_{f} + \sum_{f = 1}^{K} {\tilde{r}}_{f} \geq \sum_{f = 1}^{K} {\tilde{r}}_{f} + 1 \end{align}$

Condition (30) is satisfied.
(2)
$I < \sum_{f = 1}^{K} {\tilde{r}}_{f}, J \geq \sum_{f = 1}^{K} {\tilde{r}}_{f}$ or $I \geq \sum_{f = 1}^{K} {\tilde{r}}_{f}, J <$ $\sum_{f = 1}^{K} {\tilde{r}}_{f}$ . Then
$\begin{align} min (I, \sum_{f = 1}^{K} {\tilde{r}}_{f}) & + min (J, \sum_{f = 1}^{K} {\tilde{r}}_{f}) \\ = min (I, J) + \sum_{f = 1}^{K} {\tilde{r}}_{f} \geq \sum_{f = 1}^{K} {\tilde{r}}_{f} + 1 \end{align}$

Condition (30) is satisfied.
(3)
$I < \sum_{f = 1}^{K} {\tilde{r}}_{f}, J < \sum_{f = 1}^{K} {\tilde{r}}_{f}$ . Then depending on condition (22),
$min (I, \sum_{f = 1}^{K} {\tilde{r}}_{f}) + min (J, \sum_{f = 1}^{K} {\tilde{r}}_{f}) = I + J \geq \sum_{f = 1}^{K} r_{f} + 1$

Sincer₁≥r₂≥,··· ,≥r_F, and ${{\tilde{r}}_{1}, \dots, {\tilde{r}}_{K}} \subset {r_{1}, \dots, r_{F}}$ , it holds that $\sum_{f = 1}^{K} r_{f} + 1 \geq \sum_{f = 1}^{K} {\tilde{r}}_{f} + 1$ . Condition (30) is satisfied.

Therefore, $\tilde{C}$ is full column rank, so that $k_{C}^{″} \geq K$ . The proof is complete. $∎$

Proof of Theorem 3

Note that P = L + M −1 and $L = \sum_{i = 1}^{R} r_{i} \leq P$ . According to Lemma 2, the k’-rank of $F_{ϕ}^{L}$ is R . Condition (23) becomes

\begin{array}{l} P + K \geq \sum_{i = 1}^{R} r_{i} + \sum_{i = 1}^{F + 2 - R} r_{i} \\ \Rightarrow L + M - 1 + K \geq \sum_{i = 1}^{R} r_{i} + \sum_{i = 1}^{F + 2 - R} r_{i} \\ \Rightarrow \sum_{i = 1}^{R} r_{i} + M - 1 + K \geq \sum_{i = 1}^{R} r_{i} + \sum_{i = 1}^{F + 2 - R} r_{i} \\ \Rightarrow K + M \geq \sum_{i = 1}^{F + 2 - R} r_{i} + 1 \end{array}

(31)

According to Lemma 3, condition (31) provides that the k’-rank of $A_{θ}^{M}$ is larger than F + 2− R . Then we have

\begin{array}{l} k_{F_{ϕ}^{L}}^{″} = R \\ k_{A_{θ}^{M}}^{″} \geq F + 2 - R \end{array}\} \Rightarrow k_{F_{ϕ}^{L}}^{″} + k_{A_{θ}^{M}}^{″} \geq F + 2

(32)

With the assumption ofk_S= F and Theorem 1, condition (32) shows that the partial uniqueness of model (21) is achieved. Then we have

{\hat{F}}_{ϕ, f}^{L} = F_{ϕ, f}^{L} V_{f}^{T}

(33)

{\hat{A}}_{θ, f}^{M} = A_{θ, f}^{M} U_{f}^{T} = (F_{ϕ, f}^{M} ⊙ A_{θ, f}) U_{f}^{T}

(34)

whereV_f,U_fare nonsingular square matrices of sizer_f×r_f. Because $L = \sum_{i = 1}^{R} r_{i} \geq r_{f} + 1$ . According to Lemma 4, $F_{ϕ, f}^{L}$ can be uniquely determined from ${\hat{F}}_{ϕ, f}^{L}$ . Then we wish to prove that $F_{ϕ, f}^{M}$ andA_{θ , f}can be uniquely determined from ${\hat{A}}_{θ, f}^{M}$ . Consider the following cases with different value ofr_f

(1)
r _f=1If r _f=1, $F_{ϕ, f}^{M}$ and A _{θ , f} degenerate to vectors $F_{ϕ, f}^{M}$ , a _{θ , f}, and U _f degenerates to a scalar u _f. Equation (34) becomes
${\hat{a}}_{θ, f}^{M} = u_{f} (f_{ϕ, f}^{M} ⊙ a_{θ, f}) = u_{f} (f_{ϕ, f}^{M} \otimes a_{θ, f})$

where ${\hat{a}}_{θ, f}^{M} \in ℂ^{MK \times 1}$ , $f_{ϕ, f}^{M} \in ℂ^{M \times 1}$ and $a_{θ, f} \in ℂ^{K \times 1}$ . Becauseu_f only leads to column scaling of $F_{ϕ, f}^{M}$ anda_{θ , f}, which will not affect the identifiability result. We simplifyu_fto be 1. Rearrange ${\hat{A}}_{θ, f}^{M}$ to be a M × K matrix Ω
$\begin{align} Ω & = unvec ({\hat{a}}_{θ, f}^{M}, M, K) = unvec (f_{ϕ, f}^{M} \otimes a_{θ, f}, M, K) \\ = a_{θ, f} {(f_{ϕ, f}^{M})}^{T} \end{align}$

Then $F_{ϕ, f}^{M}$ anda_{θ , f} can be easily determined from Ω by using singular value decomposition method (SVD) up to scaling ambiguity.
(2)
r _f≥2It is of interest that Equation (34) is a standard slice matrix formulation of PARAFAC model when r _f≥2, of which three mode matrices are $F_{ϕ, f}^{M}$ , A _{θ , f} and U _f. Recall that the uniqueness condition of PARAFAC model is [11, 25, 29–31]
$k_{F_{ϕ, f}^{M}} + k_{A_{θ, f}} + k_{U_{f}} \geq 2 r_{f} + 2$
(35)

U_f is ar_f×r_f square nonsingular matrix with full k-rank, ask_U=r_f. $F_{ϕ, f}^{M} \in ℂ^{M \times r_{f}}$ and $A_{θ, f} \in ℂ^{K \times r_{f}}$ are Vandermonde matrices, and their k-ranks can be determined as $k_{F_{ϕ, f}^{M}} = min (M, r_{f})$ , $k_{A_{ϕ, f}} = min (K, r_{f})$ . Then condition (35) becomes:
$min (M, r_{f}) + min (K, r_{f}) \geq r_{f} + 2$
(36)

We now prove that condition (31) is sufficient to (36). Four cases need to be discussed:
1. (2.1)
  M ≥r_f, K ≥r_f, then $min (M, r_{f}) + min (K, r_{f}) = r_{f} + r_{f} \geq r_{f} + 2$ . Condition (36) is satisfied.
2. (2.2)
  M <r_f, K ≥r_f,then $min (M, r_{f}) + min (K, r_{f}) = M + r_{f}$ . Condition (36) is satisfied when M ≥2. However, if M =1, L = P , the structure of $F_{ϕ, f}^{M}$ shows that model (21) degenerate to $\tilde{X} = (F_{ϕ} ⊙ A_{θ}) {(SJ)}^{T}$ . According to (31), $K \geq \sum_{i = 1}^{F + 2 - R} r_{i}$ . The k’-rank ofA_θ is larger than F + 2− R . With the assumption of $L = \sum_{i = 1}^{R} r_{i}$ , the k’-rank ofF_ϕis R . It holds
  $k_{F_{ϕ}}^{″} + k_{A_{θ}}^{″} + k_{S} \geq R + F - R + 2 + F = 2 F + 2$
  (37)
  
  Theorem 2 shows thatA_θandF_ϕ can be uniquely determined from $\tilde{X}$ under condition (37).
3. (2.3)
  M ≥r_f, K <r_f, then $min (M, r_{f}) + min (K, r_{f}) = r_{f} + K$ . Since $min (K, P) \geq 2$ . $min (M, r_{f}) + min (K, r_{f})$ is larger thanr_f + 2. Condition (36) is satisfied.
4. (2.4)
  M <r_f, K <r_f, then $min (M, r_{f}) + min (K, r_{f}) = M + K \geq \sum_{i = 1}^{F + 2 - R} r_{i} + 1$ . Because 2≤ R ≤ F , $\sum_{i = 1}^{F + 2 - R} r_{i} + 1 \geq r_{1} + r_{2} + 1 \geq r_{f} + 2$ . Condition (36) is satisfied.

Therefore, $F_{ϕ, f}^{L}$ , $F_{ϕ, f}^{M}$ andA_{θ , f} can be uniquely determined from ${\hat{F}}_{ϕ, f}^{L}$ and ${\hat{A}}_{θ, f}^{M}$ under condition (23). This completes the proof. $∎$

References

van der Veen AJ: Algebraic methods for deterministic blind beamforming. Proc. IEEE 1998, 86: 1987-2008. 10.1109/5.720249
Article Google Scholar
Krim H, Viberg M: Two decades of array signal processing research, the parametric approach. IEEE Signal Process. Mag 1996, 13: 67-94. 10.1109/79.526899
Article Google Scholar
Liu ZT, He J, Liu Z: Computationally efficient DOA and polarization estimation of coherent sources with linear electromagnetic vector-sensor array. EURASIP J. Adv. Signal Process 2010. doi:10.1155/2011/490289
Google Scholar
Sohl A, Klein A: Semiblind channel estimation for IFDMA in case of channels with large delay spreads. Eurasip J. Adv. Signal Process 2010. doi:10.1155/2011/857859
Google Scholar
Roy R, Kailath T: ESPRIT-Estimation of signal parameters via rotational invariance techniques. IEEE Trans. Acoust. Speech Signal Process 1989, 37: 984-995. 10.1109/29.32276
Article Google Scholar
Pillai S, Kwon B: Forward–backward spatial smoothing techniques for the coherent signal identification. IEEE Trans. Acoust. Speech Signal Process 1989, 37: 8-15. 10.1109/29.17496
Article Google Scholar
Linebarger DA, DeGroat RD, Dowling EM: Efficient direction-finding methods employing forword/backward averaging. IEEE Trans. Signal Process 1994, 42(8):2136-2145. 10.1109/78.301848
Article Google Scholar
van der Veen AJ, van der Veen MC, Paulraj A: Joint angle and delay estimation using shift-invariance techniques. IEEE Trans. Signal Process 1998, 46(2):405-418. 10.1109/78.655425
Article Google Scholar
van der veen MC, van der Veen AJ: Estimation of multipath parameters in wireless communications. IEEE Trans. Signal Process 1998, 46(3):682-690. 10.1109/78.661335
Article Google Scholar
Sidiropoulos ND, Liu X: Identifiability results for blind beamforming in incoherent multipath with small delay spread. IEEE Trans. Signal Process 2001, 49(1):228-238. 10.1109/78.890366
Article Google Scholar
Sidiropoulos ND, Giannakis GB, Bro R: Blind PARAFAC receivers for DS-CDMA systems. IEEE Trans. Signal Process 2000, 48(3):810-823. 10.1109/78.824675
Article Google Scholar
de Almeida ALF, Favier G, Mota JCM: Constrained tucker-3 model for blind beamforming. Signal Process 2009, 89: 1240-1244. 10.1016/j.sigpro.2008.11.016
Article Google Scholar
Sidiropoulos ND, Dimie GZ: Blind multiuser detection in W-CDMA systems with large delay spread. IEEE Signal Process. Lett 2001, 8(3):87-89.
Article Google Scholar
Zhang XF, Xu DZ: Blind PARAFAC signal detection for polarization sensitive array. Eurasip J. Adv. Signal Process 2007. doi:10.1155/2007/12025
Google Scholar
Liu X, Xu ZZ: A PARALIND-based blind multiuser detection algorithm in MIMO-CDMA system. J. Syst. Eng. Electron. (China) 2011, 33(2):404-410.
Article Google Scholar
Liang JL, Yang SY, Zhang JY: 4D near-field source localization using cumulant. Eurasip J. Adv. Signal Process 2007. doi:10.1155/2007/17820
Google Scholar
Carvalho LC, Roemer F, Haardt M: Multi-dimensional model order selection. Eurasip J. Adv. Signal Process 2011. doi:10.1186/1687-6180-2011-26
Google Scholar
Bro R, Harshman RA, Sidiropoulos ND, Lundy ME: Modeling multi-way data with linearly dependent loadings, KVL Technical Report. (2005).
Google Scholar
Bahram M, Bro R: A novel strategy for solving matrix effect in three-way data using parallel profiles with linear dependencies. Anal. Chim. Acta 2007, 584: 397-402. 10.1016/j.aca.2006.11.070
Article Google Scholar
Bro R, Harshman RA, Sidiropoulos ND, Lundy ME: Modeling multi-way data with linearly dependent loadings. J. Chemometr 2009, 23(7-8):324-340. 10.1002/cem.1206
Article Google Scholar
de Lathauwer L: Decomposition of a higher-order tensor in block terms-part I: lemmas for partitioned matrices. SIAM J. Matrix Anal. Appl 2008, 30(3):1022-1032. 10.1137/060661685
Article MathSciNet Google Scholar
de Almeida ALF, Favier G, Mota JCM: Constrained tensor modeling approach to blind multiple-antenna CDMA schemes. IEEE Trans. Signal Process 2008, 56(6):2417-2428.
Article MathSciNet Google Scholar
de Lathauwer L: Decomposition of a higher-order tensor in block terms-part II: definitions and uniqueness. SIAM J. Matrix Anal. Appl 2008, 30(3):1033-1066. 10.1137/070690729
Article MathSciNet Google Scholar
Stegeman A, de Almeida ALF: Uniqueness conditions for constrained three-way factor decompositions with linearly dependent loadings. SIAM J. Matrix Anal. Appl 2009, 31: 1469-1490.
Article MathSciNet Google Scholar
Kruskal JB: Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics. Linear Algeb. Appl 1977, 18: 95-138. 10.1016/0024-3795(77)90069-6
Article MathSciNet Google Scholar
Sidiropoulos ND, Giannakis GB, Bro R: Parallel factor analysis in sensor array processing. IEEE Trans. Signal Process 2000, 48(8):2377-2388. 10.1109/78.852018
Article Google Scholar
Liu X, Xu ZZ, Lei L: Identification of array signal parameters based on matrix decomposition. J. Appl. Sci. (China) 2010, 28(1):49-55.
MathSciNet Google Scholar
Guo X, Brie D, Zhu S, Liao X: A CANDECOMP/PARAFAC perspective on uniqueness of DOA estimation using a vector sensor array. IEEE Trans. Signal Process 2011, 59: 3475-3481.
Article MathSciNet Google Scholar
De Lathauwer L: A link between the canonical decomposition in multilinear algebra and simultaneous matrix diagonalization. SIAM J. Matrix Anal. Appl 2006, 28: 642-666. 10.1137/040608830
Article MathSciNet Google Scholar
Stegeman A: On uniqueness conditions for Candecomp/Parafac and Indscal with full column rank in one mode. Linear Algeb. Appl 2009, 431: 211-227. 10.1016/j.laa.2009.02.025
Article MathSciNet Google Scholar
Jiang T, Sidiropoulos ND: Kruskal’s permutation lemma and the identification of Candecomp/Parafac and bilinear models with constant modulus constraints. IEEE Trans. Signal Process 2004, 52: 2625-2636. 10.1109/TSP.2004.832022
Article MathSciNet Google Scholar

Download references

Acknowledgements

This work is supported by the China NSF Grants (61101104, 61071090,61100195), the National Science and Technology Major Project (2011ZX03005-004-03), the Jiangsu ”973” Project (BK2011027), A Project Funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions-Information and Communication Engineering; Nanjing University of Posts & Telecommunications Project (NY211010). The authors wish to thank the anonymous reviewers for their valuable suggestions on improving this article.

Author information

Authors and Affiliations

Jiangsu Key Laboratory of Wireless Communication, College of Telecommunications and Information Engineering, Nanjing University of Posts & Telecommunications, Nanjing, China
Xu Liu, Longxiang Yang & Hongbo Zhu
Nanjing Panda Technology CO. LTD, Nanjing, China
Ting Jiang

Authors

Xu Liu
View author publications
You can also search for this author in PubMed Google Scholar
Ting Jiang
View author publications
You can also search for this author in PubMed Google Scholar
Longxiang Yang
View author publications
You can also search for this author in PubMed Google Scholar
Hongbo Zhu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xu Liu.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Liu, X., Jiang, T., Yang, L. et al. PARALIND-based identifiability results for parameter estimation via uniform linear array. EURASIP J. Adv. Signal Process. 2012, 154 (2012). https://doi.org/10.1186/1687-6180-2012-154

Download citation

Received: 31 January 2012
Accepted: 21 June 2012
Published: 20 July 2012
DOI: https://doi.org/10.1186/1687-6180-2012-154

PARALIND-based identifiability results for parameter estimation via uniform linear array

Abstract

Introduction

Data model

Uniqueness of PARALIND

Definition 1

Definition 2

Theorem 1

PARALIND-based identifiability results for parameter estimation

Lemma 1

Lemma 2

Proof

PARALIND-based identifiability result

Theorem 2

Proof

Remark 1

Remark 2

PARALIND-based identifiability result with smoothing technique

Lemma 3

Proof

Theorem 3

Proof

Remark 3

Remark 4

Remark 5

Remark 6

Conclusion

Appendix

Proof of Lemma 2

Proof of Theorem 2

Lemma 4

Proof of Lemma 3

Lemma 5

Proof of Theorem 3

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords