Canonical polyadic decomposition of third-order semi-nonnegative semi-symmetric tensors using LU and QR matrix factorizations

Wang, Lu; Albera, Laurent; Kachenoura, Amar; Shu, Huazhong; Senhadji, Lotfi

doi:10.1186/1687-6180-2014-150

Research
Open access
Published: 08 October 2014

Canonical polyadic decomposition of third-order semi-nonnegative semi-symmetric tensors using LU and QR matrix factorizations

Lu Wang^1,2,4,
Laurent Albera^1,2,4,5,
Amar Kachenoura^1,2,4,
Huazhong Shu^3,4 &
…
Lotfi Senhadji^1,2,4

EURASIP Journal on Advances in Signal Processing volume 2014, Article number: 150 (2014) Cite this article

1815 Accesses
2 Citations
Metrics details

Abstract

Semi-symmetric three-way arrays are essential tools in blind source separation (BSS) particularly in independent component analysis (ICA). These arrays can be built by resorting to higher order statistics of the data. The canonical polyadic (CP) decomposition of such semi-symmetric three-way arrays allows us to identify the so-called mixing matrix, which contains the information about the intensities of some latent source signals present in the observation channels. In addition, in many applications, such as the magnetic resonance spectroscopy (MRS), the columns of the mixing matrix are viewed as relative concentrations of the spectra of the chemical components. Therefore, the two loading matrices of the three-way array, which are equal to the mixing matrix, are nonnegative. Most existing CP algorithms handle the symmetry and the nonnegativity separately. Up to now, very few of them consider both the semi-nonnegativity and the semi-symmetry structure of the three-way array. Nevertheless, like all the methods based on line search, trust region strategies, and alternating optimization, they appear to be dependent on initialization, requiring in practice a multi-initialization procedure. In order to overcome this drawback, we propose two new methods, called ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ , to solve the problem of CP decomposition of semi-nonnegative semi-symmetric three-way arrays. Firstly, we rewrite the constrained optimization problem as an unconstrained one. In fact, the nonnegativity constraint of the two symmetric modes is ensured by means of a square change of variable. Secondly, a Jacobi-like optimization procedure is adopted because of its good convergence property. More precisely, the two new methods use LU and QR matrix factorizations, respectively, which consist in formulating high-dimensional optimization problems into several sequential polynomial and rational subproblems. By using both LU and QR matrix factorizations, we aim at studying the influence of the used matrix factorization. Numerical experiments on simulated arrays emphasize the advantages of the proposed methods especially the one based on LU factorization, in the presence of high-variance model error and of degeneracies such as bottlenecks. A BSS application on MRS data confirms the validity and improvement of the proposed methods.

Introduction

Higher order (HO) arrays, commonly called tensors, play an important role in numerous applications, such as chemometrics [1], telecommunications [2], and biomedical signal processing [3]. They can be seen as HO extensions of vectors (one-way arrays) and matrices (two-way arrays). In many practical situations, the available data measurements cannot be arranged into a tensor form directly, that is to say, the observation diversity is insufficient either in time or frequency. However, if the latent data satisfies the statistical independence assumption, which is reasonable in many applications, meaningful HO arrays can be built by resorting to HO statistics (HOS) of the data [4]. In this instance, the HO arrays are partially symmetric or Hermitian due to the special algebraic structure of the basic HOS, such as moments and cumulants. In independent component analysis (ICA), the latent physical phenomena which are assumed to be statistically independent can be revealed by decomposing the HO array into factors. There exists several ways to decompose a given HO array, such as the Tucker model [5, 6]. Among the existing reliable HO array decomposition models, the canonical polyadic (CP) decomposition model has attracted much attention. Indeed, its uniqueness can be ensured under the sufficient conditions established by Kruskal [7]. In addition, unlike the HO singular value decomposition (HOSVD) [6], the CP model does not impose any orthogonality constraint on its factors.

Theoretically, a polyadic decomposition exactly fits an array by a sum of rank-one terms [8]. A CP decomposition is defined as a polyadic decomposition with a minimal number of rank-one terms which are needed to exactly fit a given HO array. Currently, the CP decomposition is gaining importance in several applications, for example, in exploratory data analysis [9], sensor array processing [10], telecommunications [11, 12], ICA [13], and in multiple-input multiple-output radar systems [14]. A multitude of methods were developed to compute the CP decomposition. They include the iterative alternating least squares (ALS) procedure [15], which gains popularity due to its simplicity of implementation and low numerical complexity. Uschmajew proved the local convergence property of ALS under some conditions [16]. However, this convergence can be slow. Therefore, an enhanced line search (ELS) procedure was proposed by Rajih et al. [17] to cope with the slow convergence problem of ALS. Other approaches were also proposed, such as the conjugate gradient algorithm [18] and joint eigenvalue decomposition-based algorithms [19, 20], to cite a few. Some HO arrays enjoy certain properties, such as i) symmetry and ii) nonnegativity, which cannot be simply handled by the aforementioned general CP decomposition methods. Therefore, special CP models become more and more important.

The first special form of the CP model for three-way arrays that are symmetric in two modes brings the concept of individual differences in scaling (INDSCAL) analysis [21]. On one hand, INDSCAL analysis has been studied as a way of multiple factor analysis [22] with applications to chemometrics, psychology, and marketing research. On the other hand, in the domain of signal processing, and more particularly in blind source separation (BSS), the INDSCAL analysis is widely known as the joint diagonalization of a set of matrices by congruence (JDC). During the past two decades, many successful JDC methods have been proposed, such as Yeredor’s alternating columns and diagonal center (ACDC) algorithm [23], the joint approximate diagonalization (JAD) algorithm proposed by Cardoso and Souloumiac [24], the fast Frobenius diagonalization (FFDIAG) algorithm proposed by Ziehe et al. [25], Afsari’s LUJ1D algorithm [26], and many others [27–33]. A recent survey of JDC can be found in [34]. The second special form of CP model is defined when all the factors in the CP decomposition are constrained to be nonnegative, commonly known as nonnegative tensor factorization (NTF). NTF can be regarded as the extension of nonnegative matrix factorization (NMF) [35] to higher orders. In many applications, the physical properties are inherently nonnegative, such as chemistry [1] and fluorescence spectroscopy [36, 37]. In those applications, the results are only meaningful if the nonnegativity constraint is satisfied. Various methods for computing NTF and also NMF can be found in [38, 39].

So far, the CP model with both the symmetry and nonnegativity constraints has not received much attention. Coloigner et al. proposed a family of algorithms based on line search and trust region strategies [40]. Wang et al. developed an alternating minimization scheme [41]. Those methods appear to depend on initialization, and therefore in practice require a multi-initialization procedure, leading to an increase of numerical complexity. In this paper, we propose to fit the CP model of a three-way array by imposing both the semi-nonnegativity and the semi-symmetry constraints. More precisely, we impose a nonnegativity constraint on the two symmetric modes of the INDSCAL model, which leads to the semi-nonnegative INDSCAL model or equivalently the CP decomposition of semi-nonnegative semi-symmetric three-way arrays. Such a model is often encountered in ICA problems where a nonnegative mixing matrix is frequently considered. For example, in magnetic resonance spectroscopy (MRS), the columns of the mixing matrix represent the positive concentrations of the source metabolites. Then, the three-way array built by stacking the matrix slices of a cumulant array is both nonnegative and symmetric in two modes. In such a case, the semi-nonnegative INDSCAL problem is equivalent to the JDC problem subject to a nonnegativity constraint on the joint transformation matrix. We propose two new algorithms to solve the semi-nonnegative INDSCAL problem, called ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ . Firstly, we rewrite the constrained optimization problem as an unconstrained one. Actually, the nonnegativity constraint is ensured by means of a square change of variable. Secondly, we propose two Jacobi-like approaches using LU and QR matrix factorizations, respectively, which consist in formulating high-dimensional optimization problems into several sequential polynomial and rational subproblems. By using both LU and QR matrix factorizations, we aim at studying the influence of the used matrix factorization. Numerical experiments highlight the advantages of the proposed methods especially ${JD}_{LU}^{+}$ , in the case of dealing with high-variance model error and with degeneracies such as bottlenecks. A BSS application on MRS signals confirms the validity and improvement of the proposed methods. A part of this work has been recently presented at the 8th IEEE Sensor Array and Multichannel Signal Processing Workshop [42].

The rest of the paper is organized as follows. After the presentation of some notations, the ‘Multilinear algebra prerequisites and problem statement’ section introduces some basic definitions of the multilinear algebra then gives the semi-nonnegative INDSCAL problem formulation. In the ‘Methods’ section, we describe the proposed algorithms in detail and also provide an analysis of the numerical complexities. The ‘Simulation results’ section shows the computer simulation results. Finally, we conclude the paper.

Multilinear algebra prerequisites and problem statement

Notations

The following notations are used throughout this paper. and denote the set of real-valued (N₁ × N₂ × ⋯ × N_i) arrays and the set of nonnegative real-valued (N₁ × N₂ × ⋯ × N_i) arrays, respectively. Vectors, matrices, and HO arrays are denoted by bold lowercase letters (a, b, ⋯), bold uppercase letters (A, B, ⋯) and bold calligraphic letters (, , ⋯), respectively. The (i,j)-th entry of a matrix A is symbolized by A_i,j. Sometimes, the MATLAB^®; column/row notation is adopted to indicate submatrices of a given matrix or subarrays of a HO array. Also, a_i denotes the i-th column vector of matrix A. ⊡ denotes the Hadamard product (element-wise product), and $A^{⊡ 2} = A ⊡ A$ . ⊙ denotes the Khatri-Rao product. A^♯ denotes the pseudo inverse of A. The superscripts ^-1, ^T, and ^-T stand for the inverse, the transpose, and the inverse after transpose operators, respectively. The (N × N) identity matrix is denoted by I_N. 0_N stands for N-dimensional vectors of zeros. |a| denotes the absolute value of a. ∥A∥_F and $det (A)$ stand for the Frobenius norm and determinant of matrix A, respectively. diag(A) returns a matrix comprising only the diagonal elements of A. Diag(b) is the diagonal matrix whose diagonal elements are given by the vector b. off(A) vanishes the diagonal components of the input matrix A. vec(A) reshapes a matrix A into a column vector by stacking its columns vertically.

Definitions and problem formulation

Now we introduce some basic definitions in multilinear algebra which are necessary for the problem formulation.

Definition 1.

The outer product $C = u^{(1)} \circ u^{(2)} \circ u^{(3)}$ of three vectors is a three-way array of whose elements are defined by $C_{i_{1}, i_{2}, i_{3}} = u_{i_{1}}^{(1)} u_{i_{2}}^{(2)} u_{i_{3}}^{(3)}$ .

Definition 2.

Each three-way array expressed as the outer product of three vectors is a rank-1 three-way array.

More generally, the rank of a three-way array is defined as follows:

Definition 3.

The rank of an array , denoted by $rk (C)$ , is the minimal number of rank-1 arrays belonging to that yield in a linear combination.

Despite the similarity between the definition of the tensor rank and its matrix counterpart, the rank of a three-way array may exceed its dimensions [4].

Definition 4.

A three-way array slice is a two-dimensional section (fragment) of a three-way array, obtained by fixing one of the three indices [38].

For example, the k-th frontal slice of a three-way array can be denoted by $C_{:, :, k}$ using MATLAB notation, and sometimes it is also denoted by C^(k).

The low-rank INDSCAL model of a three-way array is defined as follows:

Definition 5.

For a given P, corresponding to the number of rank-1 terms, the INDSCAL model of a three-way array can be expressed as:

C = \sum_{p = 1}^{P} a_{p} \circ a_{p} \circ d_{p} + V

(1)

where the three-way array represents the model residual.

The notation $C = [[A, A, D]] + V$ refers to the INDSCAL decomposition (1) of with the associated loading matrices and $D = [d_{1}, \dots, d_{P}] \in R^{K \times P}$ . If and only if the residual is a null tensor, we have an exact INDSCAL decomposition.

An exact INDSCAL decomposition is considered to be essentially unique when it is only subject to scale and permutation indeterminacies. It means that an INDSCAL decomposition is insensitive to a scaling of the three vectors a_p, a_p, and d_p provided that the product of the three scale numbers is equal to 1, and an arbitrary permutation of the rank-1 terms. A necessary and sufficient uniqueness condition for the INDSCAL model was established by Afsari [43].

The INDSCAL model can also be described by using the frontal slices of :

\forall k \in {1, 2, \dots, K}, C^{(k)} = C_{:, :, k} = A D^{(k)} A^{T} + V^{(k)}

(2)

where D^(k) is a diagonal matrix whose diagonal contains the elements of the k-th row of D, and $V^{(k)} = V_{:, :, k}$ .

In this paper, we propose to fit the INDSCAL model of three-way arrays while imposing nonnegativity constraints on both equal loading matrices A. It will be referred to as the semi-nonnegative INDSCAL model, as follows:

Problem 1.

Given and an integer P, find a semi-nonnegative INDSCAL model of $C = [[A, A, D]]$ , subject to the (N × P) matrix A having nonnegative components.

The semi-nonnegative INDSCAL problem is equivalent to the JDC problem subject to the nonnegativity constraint on the joint transformation matrix. In this paper, we mainly focus on the case of square nonnegative joint transformation matrix, for which N = P. The case of N > P will be discussed briefly in the next section. Therefore, the problem that we tackle in this paper is defined as follows:

Problem 2.

Given a three-way array with K symmetric frontal slices , find a (N × N) joint transformation matrix A and K diagonal matrices D ^(k) of dimension (N × N) such that:

\forall k \in {1, 2, \dots, K}, C^{(k)} = A D^{(k)} A^{T} + V^{(k)}

(3)

by minimizing the residual term V^(k) in a least-squares sense, subject to A having nonnegative components.

JDC cost functions

If the residual array is a realization of a Gaussian random array, it is logical to fit the INDSCAL model by the following direct least square (DLS) criterion [23, 44]:

J_{DLS} (A, D) = \sum_{k = 1}^{K} {∥C^{(k)} - A D^{(k)} A^{T}∥}_{F}^{2}

(4)

and to minimize (4) with respect to A and D. Note that, in the field of ICA, only the loading matrix A is of interest since it corresponds to the mixing matrix of several latent source signals. The minimization of (4) with respect to D, when A is fixed, was given by Yeredor in [23]:

D^{(k)} = Diag \{{[(A^{T} A) ⊡ (A^{T} A)]}^{- 1} {(A ⊙ A)}^{T} vec (C^{(k)})\}

(5)

When A is orthogonal, we can replace D^(k) by $Diag \{{(A ⊙ A)}^{T} vec (C^{(k)})\}$ in (4). Then, the extra parameter D can be eliminated and the minimization of (4) is equivalent to minimizing the following indirect least square (IDLS) criterion [45, 46]:

J_{IDLS-O} (A) = \sum_{k = 1}^{K} {∥off (A^{T} C^{(k)} A)∥}_{F}^{2}

(6)

In some cases such as in ICA, the orthogonality assumption of A can be satisfied by using a spatial whitening procedure [47]. However, it is known that the whitening procedure may introduce additional errors. Therefore, many algorithms propose to relax the orthogonality constraint by introducing the following cost function [25, 31]:

J_{IDLS} (A) = \sum_{k = 1}^{K} {∥off (A^{- 1} C^{(k)} A^{- T})∥}_{F}^{2}

(7)

Frequently, the minimization of criterion (7) is performed on a matrix $Z \overset{def}{=} A^{- 1}$ instead of A for simplicity, and Z is called the joint diagonalizer. To use this criterion, the matrix A (or Z) should be properly constrained in order to avoid the trivial zero solution and/or degenerate solutions [34].

Besides the criterions (4) and (7), Afsari [26] presented a new cost function, which is invariant to column scaling of A. Pham proposed an information theoretic criterion [48], which requires each matrix C^(k) to be positive definite. Tichavský and Yeredor gave a special weighted least square criterion [49].

Methods

Problem reformulation

Existing semi-nonnegative INDSCAL algorithms are based on the minimization of the cost function (4) [40, 41]. They are able to achieve a better estimation of A than ACDC when the data satisfies the semi-nonnegative INDSCAL model at the cost of a higher computational complexity. We propose to use criterion (7) based on elementary factorizations of A due to the fast convergence property of this kind of procedures. Generally, it is quite difficult to impose the nonnegativity constraint on A while computing its inverse A^-1 by minimizing (7). Let us consider the structure of $C = [[A, A, D]]$ with the following assumptions:

① is nonsingular;

② does not contain zero entries.

Then, each frontal slice of is nonsingular and its inverse can be expressed as follows:

{(C^{(k)})}^{- 1} = A^{- T} {(D^{(k)})}^{- 1} A^{- 1}

(8)

We use C^(k,-1) to denote ${(C^{(k)})}^{- 1}$ for simplicity. Eq. 8 shows that C^(k,-1) also preserves the jointly diagonalizable structure. Furthermore, instead of A^-1, A serves as the joint diagonalizer. Then, A can be estimated by minimizing the following modified criterion based on (7):

J (A) = \sum_{k = 1}^{K} {∥off (A^{T} C^{(k, - 1)} A)∥}_{F}^{2}

(9)

By such a manipulation, most algorithms based on criterion (7) can now estimate A directly. However, none of them can guarantee the nonnegativity of A. In order to impose the nonnegativity constraint on A, we resort to use a square change of variable which was introduced by Chu et al. [50] for NMF, next adopted by Royer et al. for NTF [37] and by Coloigner et al. for semi-nonnegative INDSCAL [40]:

A = B ⊡ B = B^{⊡ 2}

(10)

where . Then, problem 2 can be reformulated as follows:

Problem 3.

Given , find the square nonnegative loading matrix $A = B^{⊡ 2}$ such that B minimizes the following cost function:

J (B) = \sum_{k = 1}^{K} {∥off ({(B^{⊡ 2})}^{T} C^{(k, - 1)} B^{⊡ 2})∥}_{F}^{2}

(11)

LU and QR parameterizations of B

In order to minimize (11), one may consider a gradient-like approach. However, the performance of this kind of method is sensitive to the initial guess and to the search step size. In addition, the calculation of gradient of (11) with respect to B is computationally expensive due to the existence of the Hadamard product. Other algorithms, using Jacobi-like procedures [25, 26, 31], parameterize A as a product of several special elementary matrices and estimate each elementary matrix successively. We propose to follow such a minimization scheme.

Now let us recall the following definitions and lemmas:

Definition 6.

A unit upper (or lower) triangular matrix is an upper (or lower, respectively) triangular matrix whose main diagonal elements are equal to 1.

Definition 7.

An elementary upper (or lower) triangular matrix with parameters {i,j,u_i,j} and i<j is a unit upper (or lower, respectively) triangular matrix whose non-diagonal elements are zeros except the (i,j)-th entry, which is equal to u_i,j.

U^(i,j)(u_i,j) with 1 ≤ i < j ≤ N denotes an elementary upper triangular matrix:

U^{(i, j)} (u_{i, j}) = (\begin{matrix} I_{i - 1} & ⋮ & 0 & ⋮ & 0 \\ \dots & 1 & \dots & u_{i, j} & \dots \\ 0 & ⋮ & I_{j - i - 1} & ⋮ & 0 \\ \dots & 0 & \dots & 1 & \dots \\ 0 & ⋮ & 0 & ⋮ & I_{N - j} \end{matrix})

(12)

Similarly, L^(i,j)(ℓ_i,j) with 1 ≤ j < i ≤ N corresponds to an elementary lower triangular matrix.

Definition 8.

A Givens rotation matrix with parameters {i,j,θ_i,j} and i < j is equal to an identity matrix except for the (i,i)-th, (j,j)-th, (i,j)-th, and (j,i)-th entries, which are equal to $cos (θ_{i, j})$ , $cos (θ_{i, j})$ , $- sin (θ_{i, j})$ , and $sin (θ_{i, j})$ , respectively.

Q^(i,j)(θ_i,j) with 1 ≤ i < j ≤ N indicates the corresponding Givens rotation matrix:

Q^{(i, j)} (θ_{i, j}) = (\begin{matrix} I_{i - 1} & ⋮ & 0 & ⋮ & 0 \\ \dots & cos (θ_{i, j}) & \dots & - sin (θ_{i, j}) & \dots \\ 0 & ⋮ & I_{j - i - 1} & ⋮ & 0 \\ \dots & sin (θ_{i, j}) & \dots & cos (θ_{i, j}) & \dots \\ 0 & ⋮ & 0 & ⋮ & I_{N - j} \end{matrix})

(13)

Lemma 1.

Any (N × N) unit lower triangular matrix L whose (i,j)-th component is ℓ_i,j(i > j) can be factorized as the following product of N (N-1)/2 elementary lower triangular matrices [51, Chapter 3]:

L = \prod_{j \in J_{1}} \prod_{i \in I_{1} (j)} L^{(i, j)} (ℓ_{i, j})

(14)

where the two sets of indices $J_{1}$ and $I_{1} (j)$ are defined by $J_{1} = {1, 2, \dots, N}$ and $I_{1} (j) = {j + 1, j + 2, \dots, N}$ for the sake of convenience. Similarly, any (N × N) unit upper triangular matrix U whose (i,j)-th component is equal to u _i,j (i < j) can be factorized as a product of elementary upper triangular matrices as follows:

U = \prod_{i \in I_{2}} \prod_{j \in J_{2} (i)} U^{(i, j)} (u_{i, j})

(15)

where $I_{2}$ and $J_{2} (i)$ are two sets of indices, defined by $I_{2} = {N - 1, N - 2, \dots, 1}$ and $J_{2} (i) = {N, N - 1, \dots, i + 1}$ .

Lemma 2.

Any (N × N) orthonormal matrix Q can be factorized as the following product of N (N-1)/2 Givens rotation matrices [52, Chapter 14]:

Q = \prod_{i \in I_{2}} \prod_{j \in J_{2} (i)} Q^{(i, j)} (θ_{i, j})

(16)

where $I_{2}$ and $J_{2} (i)$ are defined in Lemma 1.

For any nonsingular matrix , the LU matrix factorization decomposes it as B= L U Λ Π, where is a unit lower triangular matrix, is a unit upper triangular matrix, is a diagonal matrix, and is a permutation matrix. B also admits the QR matrix factorization as B= Q R Λ, where is an orthonormal matrix, is a unit upper triangular matrix, and is a diagonal matrix. Due to the indeterminacies of the JDC problem, the global minimum of (11), say B, can be expressed as B= L U and B= Q R without loss of generality. Moreover, by incorporating Lemma 1 and Lemma 2, we obtain the two following elementary factorizations of B:

B = \prod_{j \in J_{1}} \prod_{i \in I_{1} (j)} L^{(i, j)} (ℓ_{i, j}) \prod_{i \in I_{2}} \prod_{j \in J_{2} (i)} U^{(i, j)} (u_{i, j})

(17)

B = \prod_{i \in I_{2}} \prod_{j \in J_{2} (i)} Q^{(i, j)} (θ_{i, j}) \prod_{i \in I_{2}} \prod_{j \in J_{2} (i)} U^{(i, j)} (u_{i, j})

(18)

As a consequence, the minimization of (11) with respect to B is converted to the estimate of N (N-1) parameters: ℓ_i,j and u_i,j for the LU decomposition (17), or θ_i,j and u_i,j for the QR decomposition (18). Instead of simultaneously computing the N(N-1) parameters, we propose two Jacobi-like procedures which perform N(N-1) sequential optimizations. This yields two new algorithms: i) the first algorithm based on (17), named ${JD}_{LU}^{+}$ , estimates each ℓ_i,j and u_i,j successively, and ii) the second one based on (18), called ${JD}_{QR}^{+}$ , estimates each θ_i,j and u_i,j sequentially.

Now, the difficulty is how to estimate four kinds of parameter, namely L^(i,j)(ℓ_i,j) and U^(i,j)(u_i,j) for ${JD}_{LU}^{+}$ , and Q^(i,j)(θ_i,j) and U^(i,j)(u_i,j) for ${JD}_{QR}^{+}$ . Two points should be noted here: i) L^(i,j)(ℓ_i,j) and U^(i,j)(u_i,j) belong to the same category of matrices; therefore, they can be estimated by the same algorithmic procedure just with an emphasis on the relation between the i and j indices (i < j for U^(i,j)(u_i,j) and j < i for L^(i,j)(ℓ_i,j)); ii) for both ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms, the procedure of estimating U^(i,j)(u_i,j) is identical. Consequently, the principal problem is reduced to estimating two kinds of parameters, namely U^(i,j)(u_i,j) and Q^(i,j)(θ_i,j).

Minimization with respect to the elementary upper triangular matrix U^(i,j)(u_i,j)

In this section, we minimize (11) with respect to U^(i,j)(u_i,j) with 1 ≤ i < j ≤ N. Let $\tilde{A}$ and $\tilde{B}$ denote the current estimate of A and B before estimating the parameter u_i,j, respectively. Let ${\tilde{A}}^{(new)}$ and ${\tilde{B}}^{(new)}$ stand for $\tilde{A}$ and $\tilde{B}$ updated by U^(i,j)(u_i,j), respectively. Furthermore, the update of $\tilde{B}$ is defined as follows:

{\tilde{B}}^{(new)} = \tilde{B} U^{(i, j)} (u_{i, j})

(19)

In order to compute the parameter u_i,j, a typical way is to minimize the criterion (11) with respect to u_i,j by replacing matrix $\tilde{B}$ by ${\tilde{B}}^{(new)}$ . For the sake of convenience, we denote J (u_i,j) instead of $J ({\tilde{B}}^{(new)})$ . Then, J(u_i,j) can be expressed as follows:

\begin{matrix} J (u_{i, j}) = \sum_{k = 1}^{K} {∥off \{{[{({\tilde{B}}^{(new)})}^{⊡ 2}]}^{T} C^{(k, - 1)} [{({\tilde{B}}^{(new)})}^{⊡ 2}]\}∥}_{F}^{2} \end{matrix}

(20)

The expression of the Hadamard square of the update ${\tilde{B}}^{(new)}$ is shown in the following proposition:

Proposition 1.

${\tilde{A}}^{(new)} = {({\tilde{B}}^{(new)})}^{⊡ 2} = {(\tilde{B} U^{(i, j)} (u_{i, j}))}^{⊡ 2}$

can be expressed as a function of u _i,j as follows:

\begin{matrix} {\tilde{A}}^{(new)} = {({\tilde{B}}^{(new)})}^{⊡ 2} = {\tilde{B}}^{⊡ 2} U^{(i, j)} (u_{i, j}^{2}) + 2 u_{i, j} ({\tilde{b}}_{i} ⊡ {\tilde{b}}_{j}) e_{j}^{T} \end{matrix}

(21)

where ${\tilde{b}}_{i}$ and ${\tilde{b}}_{j}$ denote the i-th and j-th columns of $\tilde{B}$ , respectively, and e_j is the j-th column of the identity matrix I_N.

Inserting (21) into the cost function (20), we have:

\begin{array}{l} J (u_{i, j}) = & \sum_{k = 1}^{K} {∥off ({\tilde{C}}^{(k, new)})∥}_{F}^{2} = \sum_{k = 1}^{K} ∥off (\underset{①}{\underset{⏟}{U^{(i, j)} {(u_{i, j}^{2})}^{T} {\tilde{C}}^{(k)} U^{(i, j)} (u_{i, j}^{2})}} \\ + \underset{②}{\underset{⏟}{u_{i, j} U^{(i, j)} {(u_{i, j}^{2})}^{T} {\tilde{c}}^{(k, 1)} e_{j}^{T}}} + \underset{③}{\underset{⏟}{u_{i, j} e_{j} {\tilde{c}}^{(k, 2)} U^{(i, j)} (u_{i, j}^{2})}} + {\underset{④}{\underset{⏟}{u_{i, j}^{2} {\tilde{c}}^{(k, 3)} e_{j} e_{j}^{T}}})∥}_{F}^{2} \end{array}

(22)

where ${\tilde{C}}^{(k)} = {\tilde{A}}^{T} C^{(k, - 1)} \tilde{A}$ , ${\tilde{c}}^{(k, 1)} = 2 {\tilde{A}}^{T} C^{(k, - 1)} ({\tilde{b}}_{i} ⊡ {\tilde{b}}_{j})$ , ${\tilde{c}}^{(k, 2)} = 2 {({\tilde{b}}_{i} ⊡ {\tilde{b}}_{j})}^{T} \times C^{(k, - 1)} \tilde{A}$ , and ${\tilde{c}}^{(k, 3)} = 4 {({\tilde{b}}_{i} ⊡ {\tilde{b}}_{j})}^{T} C^{(k, - 1)} ({\tilde{b}}_{i} ⊡ {\tilde{b}}_{j})$ are a (N × N) constant matrix, a (N × 1) constant column vector, a (1 × N) constant row vector, and a constant scalar, respectively. The term ① in (22) transforms the j-th column and the j-th row of ${\tilde{C}}^{(k)}$ . The term ② in (22) is a zero matrix except its j-th column containing non-zero elements, while the term ③ contains non-zero entries only on its j-th row. The term ④ is a zero matrix except its (j,j)-th component being non-zero. In addition, ${\tilde{C}}^{(k, new)} = ① + ② + ③ + ④$ is a (N × N) symmetric matrix. Hence, (22) shows that only the j-th column and j-th row of ${\tilde{C}}^{(k, new)}$ involve the parameter u_i,j, while the other elements remain constant. Therefore, the minimization of the cost function (20) is equivalent to minimizing the sum of the squares of the j-th columns of ${\tilde{C}}^{(k, new)}$ except their (j,j)-th elements with k ∈ {1,⋯,K}. The required elements of ${\tilde{C}}^{(k, new)}$ can be expressed by the following proposition.

Proposition 2.

The elements of the j-th column except the (j,j)-th entry of ${\tilde{C}}^{(k, new)}$ is a second-degree polynomial function in u _i,j as follows, for every value n different of j:

{\tilde{C}}_{n, j}^{(k, new)} = {\tilde{C}}_{n, i}^{(k)} u_{i, j}^{2} + {\tilde{c}}_{n}^{(k, 1)} u_{i, j} + {\tilde{C}}_{n, j}^{(k)}

(23)

where ${\tilde{C}}_{n, i}^{(k)}$ and ${\tilde{C}}_{n, j}^{(k)}$ are the (n,i)-th and (n,j)-th components of matrix ${\tilde{C}}^{(k)}$ , respectively, and ${\tilde{c}}_{n}^{(k, 1)}$ is the n-th element of vector ${\tilde{c}}^{(k, 1)}$ .

The proof of this proposition is straightforward. Indeed, we can show that the elements of the j-th column except the (j,j)-th entry of the term ① in (22) can be expressed by ${\tilde{C}}_{n, i}^{(k)} u_{i, j}^{2} + {\tilde{C}}_{n, j}^{(k)}$ with 1 ≤ n ≤ N and n ≠ j, and those elements of the term ② in (22) are equal to ${\tilde{c}}_{n}^{(k, 1)} u_{i, j}$ with 1 ≤ n ≤ N and n ≠ j. The sum of these elements directly leads to (23). The terms ③ and ④ do not need to be considered, since they do not affect the off-diagonal elements in the j-th column. Proposition 2 shows that the minimization of the cost function (20) can be expressed in the following compact matrix form:

J (u_{i, j}) = \sum_{k = 1}^{K} {∥E^{(k)} u_{i, j}∥}_{F}^{2} = u_{i, j}^{T} Q_{E} u_{i, j}

(24)

where $Q_{E} = \sum_{k = 1}^{K} {(E^{(k)})}^{T} E^{(k)}$ is a (3 × 3) symmetric coefficient matrix. E^(k) is a ((N - 1) × 3) matrix defined as follows: the first column contains the i-th column of ${\tilde{C}}^{(k)}$ without the j-th element, the second column contains vector ${\tilde{c}}^{(k, 1)}$ without the j-th entry, and the third column contains the j-th column of ${\tilde{C}}^{(k)}$ without the j-th component. $u_{i, j} = {[u_{i, j}^{2}, u_{i, j}, 1]}^{T}$ is a three-dimensional parameter vector.

Equation (24) shows that J(u_i,j) is a fourth-degree polynomial function. The global minimum u_i,j can be obtained by computing the roots of its derivative and selecting the one yielding the smallest value of (24). Once the optimal u_i,j is computed, ${\tilde{B}}^{(new)}$ is updated by (19) and the joint diagonalizer ${\tilde{A}}^{(new)}$ is updated by computing ${({\tilde{B}}^{(new)})}^{⊡ 2}$ . Then, the same procedure is repeated to compute the next u_i,j with another (i,j) index.

The minimization of (11) with respect to the elementary lower triangular matrix L^(i,j)(ℓ_i,j) with 1 ≤ j < i ≤ N can be computed in the same way. Proposition 2 is also valid for the parameter ℓ_i,j when 1 ≤ j < i ≤ N. The detailed derivation is omitted here. The processing of all the N (N - 1) parameters u_i,j and ℓ_i,j is called a LU sweep. In addition, for estimating L^(i,j)(ℓ_i,j), the (i,j) index obeys the following order:

\begin{matrix} (2, 1), (3, 1), \dots, & (N, 1), (3, 2), (4, 2), \dots, (N, 2), \dots, \\ (N - 1, N - 2), (N, N - 2), (N, N - 1) \end{matrix}

(25)

Regarding U^(i,j)(u_i,j), the (i,j) index varies according to the following sequence:

\begin{array}{l} (N & - 1, N), (N - 2, N), (N - 2, N - 1), \dots, \\ (2, N), (2, N - 1), \dots, (2, 3), (1, N), (1, N - 1), \dots, (1, 2) \end{array}

(26)

The proposed ${JD}_{LU}^{+}$ algorithm is comprised of several LU sweeps.

Minimization with respect to the Givens rotation matrix Q^(i,j)(θ_i,j)

Now we minimize (11) with respect to Q^(i,j) (θ_i,j) with 1 ≤ i < j ≤ N. By abuse of notation, in this section, we continue to use $\tilde{A}$ and $\tilde{B}$ to denote the current estimate of A and B, respectively, before estimating the parameter θ_i,j. Also, let ${\tilde{A}}^{(new)}$ and ${\tilde{B}}^{(new)}$ stand for $\tilde{A}$ and $\tilde{B}$ updated by Q^(i,j)(θ_i,j), respectively. The update of $\tilde{B}$ is defined as follows:

{\tilde{B}}^{(new)} = \tilde{B} Q^{(i, j)} (θ_{i, j})

(27)

Similarly, for computing the parameter θ_i,j, we can minimize the criterion (11) with respect to θ_i,j by replacing matrix $\tilde{B}$ by ${\tilde{B}}^{(new)}$ . We denote J(θ_i,j) instead of $J ({\tilde{B}}^{(new)})$ for convenience purpose. Then, J(θ_i,j) can be expressed as follows:

\begin{matrix} J (θ_{i, j}) = \sum_{k = 1}^{K} {∥off \{{[{({\tilde{B}}^{(new)})}^{⊡ 2}]}^{T} C^{(k, - 1)} [{({\tilde{B}}^{(new)})}^{⊡ 2}]\}∥}_{F}^{2} \end{matrix}

(28)

The Hadamard square of the update ${\tilde{B}}^{(new)}$ now can be rewritten as shown in the following proposition.

Proposition 3.

${\tilde{A}}^{(new)} = {({\tilde{B}}^{(new)})}^{⊡ 2} = {(\tilde{B} Q^{(i, j)} (θ_{i, j}))}^{⊡ 2}$

can be written as a function of θ _i,j as follows:

\begin{array}{l} {\tilde{A}}^{(new)} = & {({\tilde{B}}^{(new)})}^{⊡ 2} = {\tilde{B}}^{⊡ 2} {(Q^{(i, j)} (θ_{i, j}))}^{⊡ 2} \\ + sin (2 θ_{i, j}) ({\tilde{b}}_{i} ⊡ {\tilde{b}}_{j}) (e_{i}^{T} - e_{j}^{T}) \end{array}

(29)

where ${\tilde{b}}_{i}$ and ${\tilde{b}}_{j}$ denote the i-th and j-th columns of $\tilde{B}$ , respectively, and e_iand e_jare the i-th and j-th columns of the identity matrix I_N, respectively.

Inserting (29) into the cost function (28), we obtain:

\begin{array}{l} J (θ_{i, j}) & = \sum_{k = 1}^{K} {∥off ({\tilde{C}}^{(k, new)})∥}_{F}^{2} = \sum_{k = 1}^{K} ∥off (\underset{①}{\underset{⏟}{{[{(Q^{(i, j)} (θ_{i, j}))}^{⊡ 2}]}^{T} {\tilde{C}}^{(k)} {(Q^{(i, j)} (θ_{i, j}))}^{⊡ 2}}} \\ + \underset{②}{\underset{⏟}{sin (2 θ_{i, j}) {[{(Q^{(i, j)} (θ_{i, j}))}^{⊡ 2}]}^{T} {\tilde{c}}^{(k, 1)} (e_{i}^{T} - e_{j}^{T})}} \\ + \underset{③}{\underset{⏟}{sin (2 θ_{i, j}) (e_{i} - e_{j}) {\tilde{c}}^{(k, 2)} {(Q^{(i, j)} (θ_{i, j}))}^{⊡ 2}}} + {\underset{④}{\underset{⏟}{\overset{2}{sin} (2 θ_{i, j}) {\tilde{c}}^{(k, 3)} (e_{i} - e_{j}) (e_{i}^{T} - e_{j}^{T})}})∥}_{F}^{2} \end{array}

(30)

where ${\tilde{C}}^{(k)} = {\tilde{A}}^{T} C^{(k, - 1)} \tilde{A}$ , ${\tilde{c}}^{(k, 1)} = {\tilde{A}}^{T} C^{(k, - 1)} ({\tilde{b}}_{i} ⊡ {\tilde{b}}_{j})$ , ${\tilde{c}}^{(k, 2)} = {({\tilde{b}}_{i} ⊡ {\tilde{b}}_{j})}^{T} C^{(k, - 1)} \tilde{A}$ , and ${\tilde{c}}^{(k, 3)} = {({\tilde{b}}_{i} ⊡ {\tilde{b}}_{j})}^{T} C^{(k, - 1)} ({\tilde{b}}_{i} ⊡ {\tilde{b}}_{j})$ are a (N × N) constant matrix, a (N × 1) constant column vector, a (1 × N) constant row vector, and a constant scalar, respectively. The term ① in (30) transforms the i-th and j-th columns and the i-th and j-th rows of ${\tilde{C}}^{(k)}$ . The term ② in (30) is a zero matrix except its i-th and j-th columns containing non-zero elements, while the term ③ contains non-zero entries only on its i-th and j-th rows. The term ④ is a zero matrix except its (i,i)-th, (j,j)-th, (i,j)-th, and (j,i)-th components being non-zero. ${\tilde{C}}^{(k, new)} = ① + ② + ③ + ④$ is a (N × N) symmetric matrix. Hence, (30) shows that only the i-th and j-th columns and the i-th and j-th rows of ${\tilde{C}}^{(k, new)}$ involve the parameter θ_i,j, while the other components remain constant. It is noteworthy that the (i,j)-th and (j,i)-th components are twice affected by the transformation. Considering the symmetry of ${\tilde{C}}^{(k, new)}$ , we propose to minimize the sum of the squares of the (i,j)-th entries of the K matrices ${\tilde{C}}^{(k, new)}$ , instead of minimizing all the off-diagonal entries. Although minimizing this quantity is not equivalent to minimizing the global cost function (28), such a simplified minimization scheme is commonly adopted in many algorithms, such as [20, 31]. We denote this local minimization by $\tilde{J} (θ_{i, j})$ . The (i,j)-th component of ${\tilde{C}}^{(k, new)}$ is expressed in the following proposition.

Proposition 4.

The (i,j)-th entry of ${\tilde{C}}^{(k, new)}$ can be expressed as a function of θ _i,j as follows:

\begin{array}{l} {\tilde{C}}_{i, j}^{(k, new)} & = - \overset{2}{sin} (2 θ_{i, j}) {\tilde{c}}^{(k, 3)} \\ + \overset{2}{sin} (θ_{i, j}) ({\tilde{C}}_{i, i}^{(k)} \overset{2}{cos} (θ_{i, j}) + {\tilde{C}}_{j, i}^{(k)} \overset{2}{sin} (θ_{i, j})) \\ + \overset{2}{cos} (θ_{i, j}) ({\tilde{C}}_{i, j}^{(k)} \overset{2}{cos} (θ_{i, j}) + {\tilde{C}}_{j, j}^{(k)} \overset{2}{sin} (θ_{i, j})) \\ + sin (2 θ_{i, j}) ({\tilde{c}}_{i}^{(k, 1)} \overset{2}{cos} (θ_{i, j}) + {\tilde{c}}_{j}^{(k, 1)} \overset{2}{sin} (θ_{i, j})) \\ - sin (2 θ_{i, j}) ({\tilde{c}}_{j}^{(k, 2)} \overset{2}{cos} (θ_{i, j}) + {\tilde{c}}_{i}^{(k, 2)} \overset{2}{sin} (θ_{i, j})) \end{array}

(31)

where ${\tilde{C}}_{i, i}^{(k)}$ , ${\tilde{C}}_{j, j}^{(k)}$ , ${\tilde{C}}_{i, j}^{(k)}$ , and ${\tilde{C}}_{j, i}^{(k)}$ are the (i,i)-th, (j,j)-th, (i,j)-th, and (j,i)-th components of matrix ${\tilde{C}}^{(k)}$ , respectively. ${\tilde{c}}_{i}^{(k, q)}$ and ${\tilde{c}}_{j}^{(k, q)}$ are the i-th and j-th elements of vector ${\tilde{c}}^{(k, q)}$ with q ∈ {1,2}, respectively.

It is straightforward to show that the (i,j)-th entry of the term ① in (30) can be expressed by $\overset{2}{sin} (θ_{i, j}) \overset{2}{cos} (θ_{i, j}) ({\tilde{C}}_{i, i}^{(k)} + {\tilde{C}}_{j, j}^{(k)}) + \overset{4}{sin} (θ_{i, j}) C_{j, i}^{(k)} + \overset{4}{cos} (θ_{i, j}) {\tilde{C}}_{i, j}^{(k)}$ , the (i,j)-th element of the term ② is $sin (2 θ_{i, j}) (\overset{2}{cos} (θ_{i, j}) {\tilde{c}}_{i}^{(k, 1)} + \overset{2}{sin} (θ_{i, j}) {\tilde{c}}_{j}^{(k, 1)})$ , the (i,j)-th component of the term ③ is equal to $- sin (2 θ_{i, j}) (\overset{2}{sin} (θ_{i, j}) {\tilde{c}}_{i}^{(k, 2)} + \overset{2}{cos} (θ_{i, j}) {\tilde{c}}_{j}^{(k, 2)})$ , and that of the term ④ is $- \overset{2}{sin} (2 θ_{i, j}) {\tilde{c}}^{(k, 3)}$ . Then, Proposition 4 can be proved.

In order to simplify the notation of (31), we resort to the Weierstrass change of variable: $t_{i, j} = tan (θ_{i, j})$ . Then, we obtain:

\begin{array}{l} sin (2 θ_{i, j}) & = \frac{2 t_{i, j}}{1 + t_{i, j}^{2}}, cos (2 θ_{i, j}) = \frac{1 - t_{i, j}^{2}}{1 + t_{i, j}^{2}}, \overset{2}{sin} (θ_{i, j}) \\ = \frac{t_{i, j}^{2}}{1 + t_{i, j}^{2}}, \overset{2}{cos} (θ_{i, j}) = \frac{1}{1 + t_{i, j}^{2}} \end{array}

(32)

By substituting (32) into (31), we obtain an alternative expression of the (i,j)-th entry of ${\tilde{C}}^{(k, new)}$ which is described in the following proposition. Then, the minimization of $\tilde{J} (θ_{i, j})$ transforms to $\tilde{J} (t_{i, j})$ .

Proposition 5.

The (i,j)-th entry of ${\tilde{C}}^{(k, new)}$ can be expressed by a rational function of t _i,j as follows:

{\tilde{C}}_{i, j}^{(k, new)} = \frac{f_{4}^{(k)} t_{i, j}^{4} + f_{3}^{(k)} t_{i, j}^{3} + f_{2}^{(k)} t_{i, j}^{2} + f_{1}^{(k)} t_{i, j} + f_{0}^{(k)}}{{(1 + t_{i, j}^{2})}^{2}}

(33)

where $f_{4}^{(k)} = {\tilde{C}}_{j, i}^{(k)}$ , $f_{3}^{(k)} = - 2 {\tilde{c}}_{i}^{(k, 1)}$ , $f_{2}^{(k)} = {\tilde{C}}_{i, i}^{(k)} + {\tilde{C}}_{j, j}^{(k)} + 2 {\tilde{c}}_{j}^{(k, 2)} - 4 {\tilde{c}}^{(k, 3)}$ , $f_{1}^{(k)} = 2 {\tilde{c}}_{i}^{(k, 2)} - {\tilde{c}}_{j}^{(k, 1)}$ , and $f_{0}^{(k)} = {\tilde{C}}_{j, j}^{(k)}$ .

Eq. 33 easily shows that the sum of the squares of the (i,j)-th entries of the K matrices ${\tilde{C}}^{(k, new)}$ , is a rational function in t_i,j, namely $\tilde{J} (t_{i, j})$ , where the degrees of the numerator and the denominator are 8 and 8, respectively. $\tilde{J} (t_{i, j})$ can be expressed in the following compact matrix form:

\tilde{J} (t_{i, j}) = \sum_{k = 1}^{K} {∥{(f^{(k)})}^{T} τ_{i, j}∥}_{F}^{2} = τ_{i, j}^{T} Q_{F} τ_{i, j}

(34)

where $Q_{F} = \sum_{k = 1}^{K} f^{(k)} {(f^{(k)})}^{T}$ is a (5 × 5) symmetric coefficient matrix, $f^{(k)} = {[f_{4}^{(k)}, f_{3}^{(k)}, f_{2}^{(k)}, f_{1}^{(k)}, f_{0}^{(k)}]}^{T}$ is a five-dimensional vector, and τ_i,j is a five-dimensional parameter vector defined as follows:

τ_{i, j} = \frac{1}{{(1 + t_{i, j}^{2})}^{2}} {[t_{i, j}^{4}, t_{i, j}^{3}, t_{i, j}^{2}, t_{i, j}, 1]}^{T}

(35)

The global minimum t_i,j can be obtained by computing the roots of its derivative and selecting the one yielding the smallest value of $\tilde{J} (t_{i, j})$ . Once t_i,j is obtained, θ_i,j can be computed from the inverse tangent function $θ_{i, j} = arctan (t_{i, j})$ . It is noteworthy that the found θ_i,j cannot guarantee to decrease the actual cost function (28). If θ_i,j leads to an increase of (28), we reset θ_i,j= 0. Otherwise, ${\tilde{B}}^{(new)}$ is updated as described in (27) and the joint diagonalizer ${\tilde{A}}^{(new)}$ is updated by computing ${({\tilde{B}}^{(new)})}^{⊡ 2}$ . The same procedure will be repeated to compute θ_i,j with the next (i,j) index. The order of the (i,j) indices is defined in Eq. 26. The processing of all the N(N - 1)/2 parameters θ_i,j and also the other N(N - 1)/2 parameters u_i,j is called a QR sweep. Several QR sweeps yield the proposed ${JD}_{QR}^{+}$ algorithm.

Both of the ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms can be stopped when the value of cost function (11) or its relative change between two successive sweeps fall below a fixed small positive threshold. Such a stopping criterion is guaranteed to be met since the cost function is non-increasing in each Jacobi-like sweep.

Practical issues

In practice, we observe that if each frontal slice of the three-way array is almost exactly jointly diagonalizable due to a high signal-to-noise ratio (SNR), the classical non-constrained JDC methods can also give a nonnegative A with high probability. In this situation, the explicit nonnegativity constraint could be unnecessary and could increase the computational burden. Therefore, we propose to relax the nonnegativity constraint by directly decomposing A into elementary LU and QR forms, respectively, instead of using the decompositions of B as follows:

A = \prod_{j \in J_{1}} \prod_{i \in I_{1} (j)} L^{(i, j)} (ℓ_{i, j}) \prod_{i \in I_{2}} \prod_{j \in J_{2} (i)} U^{(i, j)} (u_{i, j})

(36)

A = \prod_{i \in I_{2}} \prod_{j \in J_{2} (i)} Q^{(i, j)} (θ_{i, j}) \prod_{i \in I_{2}} \prod_{j \in J_{2} (i)} U^{(i, j)} (u_{i, j})

(37)

where the index sets $I_{1} (j)$ , $J_{1}$ , $I_{2}$ , and $J_{2} (i)$ are defined in Lemma 1. By inserting (36) and (37) into the cost function (9), the ways of estimating the two sets of parameters {ℓ_i,j,u_i,j} and {θ_i,j,u_i,j} are identical to those of Afsari’s LUJ1D and QRJ1D methods [26], respectively. Therefore, in practice, in order to give an automatically SNR-adaptive method, for ${JD}_{LU}^{+}$ , in each Jacobi-like iteration, we suggest to compute u_i,j by LUJ1D first. If all the elements in the j-th column of $\tilde{A} U^{(i, j)} (u_{i, j})$ have the same sign ε, the update ${\tilde{A}}^{(new)} = ε \tilde{A} U^{(i, j)} (u_{i, j})$ is adopted. Otherwise, u_i,j is computed by minimizing (20) and ${\tilde{A}}^{(new)}$ is updated by computing (21). Each ℓ_i,j is computed similarly. Furthermore, the proposed ${JD}_{QR}^{+}$ and QRJ1D are combined in the same manner.

Afsari reported in [26] that if the rows of matrices ${\tilde{C}}^{(k)}$ (k ∈ {1,⋯,K}) are not balanced in their norms, the computation of the parameter could be inaccurate. In order to cope with this effect, we apply Afsari’s row balancing scheme every few sweeps. Such a scheme updates each ${\tilde{C}}^{(k, new))}$ by ${\tilde{C}}^{(k, new)} = Λ {\tilde{C}}^{(k)} Λ$ and ${\tilde{A}}^{(new)}$ by ${\tilde{A}}^{(new)} = \tilde{A} Λ$ using a diagonal matrix , whose diagonal elements are defined as follows:

Λ_{n, n} = \frac{1}{\sqrt{\sum_{k = 1}^{K} {∥{\tilde{C}}_{n, :}^{(k)}∥}^{2}}}, n \in {1, 2, \dots, N}

(38)

where ${\tilde{C}}_{n, :}^{(k)}$ denotes the n-th row of ${\tilde{C}}^{(k)}$ .

In ICA, when a non-square matrix with N>P is encountered, the invertibility assumption of the frontal slices C^(k) does not hold. In this situation, we can compress A by means of a nonnegative matrix such that the resulting matrix $\bar{A} = W_{+} A$ is a nonnegative square matrix. Then, the ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms can be used to compute the compressed loading matrix $\bar{A}$ . W₊ can be computed by using the nonnegative compression algorithm (NN-COMP) that we proposed in [53]. More precisely, given a realization of an observation vector, we obtain the square root of the covariance matrix, denoted by . The classical prewhitening matrix is computed by where ♯ denotes the pseudo inverse operator [47]. Then, the NN-COMP algorithm computes a linear transformation matrix such that W₊ = Ψ W has nonnegative components. Once $\bar{A}$ is estimated, the original matrix A is obtained as follows:

A = W^{♯} Ψ^{- 1} \bar{A} = Υ Ψ^{- 1} \bar{A}

(39)

It should be noted that generally A does not need to be computed in such an ICA problem, since the sources can be estimated directly by means of $\bar{A}$ .

Numerical complexity

The numerical complexities of ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ are analyzed in terms of the number of floating point operations (flops). A flop is defined as a multiplication followed by an addition. In practice, only the number of multiplications, required to identify the loading matrix from a three-way array , is considered, which does not affect the order of magnitude of the numerical complexity.

For both algorithms, the inverses C^(k,-1) (k ∈ {1,⋯,K}) of the frontal slices of cost N³K flops, the initialization of ${\tilde{C}}_{ini}^{(k)} = {\tilde{A}}_{ini}^{T} C^{(k, - 1)} {\tilde{A}}_{ini}$ requires 2N³K flops, and at each sweep, the calculation of parameters u_i,j needs N(N-1)(5N²+12N-8)K/2 flops. In addition, in the case of the ${JD}_{LU}^{+}$ algorithm, the calculation cost of ${\tilde{A}}^{(new)}$ , ${\tilde{B}}^{(new)}$ , and ${\tilde{C}}^{(k, new)}$ , with k ∈ {1,⋯,K}, is N (N-1)(4N + (4 N + 1) K) flops, and the numerical complexity of computing the parameters ℓ_i,j is equal to that of u_i,j. Regarding the ${JD}_{QR}^{+}$ algorithm, for each sweep, the complexity of calculating the parameters θ_i,j is equal to N(N-1)(5N²+ 3 N + 29)K/2 flops, and the estimation of ${\tilde{A}}^{(new)}$ , ${\tilde{B}}^{(new)}$ , and ${\tilde{C}}^{(k, new)}$ , with k ∈ {1,⋯,K}, costs N (N - 1)(5 N + (12 N + 20) K/2) flops. In practice, the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ techniques are combined with LUJ1D and QRJ1D [26], respectively, leading to the magnitude of global numerical complexities of ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ being between $O (N^{3} K)$ and $O (N^{4} K)$ . A recent nonnegative JDC method called ${ACDC}_{LU}^{+}$ [41] is also based on a square change of variable and LU matrix factorization. It minimizes the cost function (4) with respect to A and D alternately, leading to a higher numerical complexity. By means of the reformulation of the cost function, the proposed methods avoid the estimation of D, therefore achieving a lower complexity compared to ${ACDC}_{LU}^{+}$ . The explicit expressions of the overall complexity of ${JD}_{LU}^{+}$ , ${JD}_{QR}^{+}$ , and ${ACDC}_{LU}^{+}$ [41], as well as those of four classical JDC algorithms, namely ACDC [23], FFDIAG [25], LUJ1D [26], and QRJ1D [26], are listed in Table 1. One can notice that numerical complexities of the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ methods are at most one order of magnitude higher than those of the four JDC algorithms and still lower than that of ${ACDC}_{LU}^{+}$ . Moreover, ${JD}_{LU}^{+}$ is less computationally expensive than ${JD}_{QR}^{+}$ .

Table 1 Numerical complexities of seven JDC algorithms in terms of flops

Full size table

Simulation results

This section is twofold. In the first part, the performance of the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms is evaluated with simulated semi-nonnegative semi-symmetric three-way arrays . Several experiments are designed to study the convergence property, the influence of SNR, the impact of the third dimension K of , the effect of the coherence of the loading matrix D, and the influence of the condition number of the diagonal matrices D^(k). We also evaluate the proposed methods for estimating a non-square matrix A. The proposed algorithms are compared with four classical nonorthogonal JDC methods, namely ACDC [23], FFDIAG [25], LUJ1D [26], QRJ1D [26], and the nonnegative JDC method ${ACDC}_{LU}^{+}$ [41]. In the second part, the source separation ability of the proposed algorithms is studied through a BSS application. In this context, the ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ are used to jointly diagonalize several matrix slices of the fourth-order cumulant array [40] of the observations and compared with several classical ICA [47, 54, 55] and NMF [56] methods.

Simulated semi-nonnegative INDSCAL model

The synthetic semi-nonnegative semi-symmetric three-way array of rank N is generated randomly according to the semi-nonnegative INDSCAL model (3). When used without further specification, all the algorithms are manipulated under the following conditions:

i)
Model generation: The loading matrix is randomly drawn from a uniform distribution on the interval [ 0,1]. The loading matrix is drawn from a Gaussian distribution with a mean of 1 and a deviation of 0.5. The pure array is perturbed by a residual INDSCAL noise array . The loading matrices of are drawn from a zero-mean unit-variance Gaussian distribution. The resulting noisy three-way array can be written as follows:
$C_{N} = \frac{C}{∥ C ∥_{F}} + σ_{N} \frac{V}{∥ V ∥_{F}}$
(40)

where σ_N is a scalar controlling the noise level. Then, the SNR is defined by $SNR = - 20 \underset{10}{log} (σ_{N})$ .
ii)
Initialization: In each Monte Carlo trial, all the algorithms are initialized by a same random matrix whose components obey the uniform distribution over [ 0,1].
iii)
Afsari’s row balancing scheme: The LUJ1D, QRJ1D, ${JD}_{LU}^{+}$ , and ${JD}_{QR}^{+}$ algorithms perform the row balancing scheme once per run of five sweeps.
iv)
Stopping criterion: All the algorithms stop either when the relative error of the corresponding criterion between two successive sweeps is less than 10^-5 or when the number of sweeps exceeds 200. A sweep of ACDC includes a full AC phase and a DCphase.
v)
Performance measurement: The performance is measured by means of the error between the true loading matrix A and the estimate $\tilde{A}$ , the numerical complexity, and the CPU time. We define the following scale-invariant and permutation-invariant distance [40]:
$α (A, \tilde{A}) = \frac{1}{N} \sum_{n = 1}^{N} min_{(n, n^{'}) \in I_{n}^{2}} d (a_{n}, {\tilde{a}}_{n^{'}})$
(41)

where a_n and ${\tilde{a}}_{n^{'}}$ are the n-th column of A and the n^′-th column of $\tilde{A}$ , respectively. $I_{n}^{2}$ is defined recursively by $I_{1}^{2} = {1, \dots, N} \times {1, \dots, N}$ , and $I_{n + 1}^{2} = I_{n}^{2} - J_{n}^{2}$ where $J_{n}^{2} = {argmin}_{(n, n^{'}) \in I_{n}^{2}} d (a_{n}, {\tilde{a}}_{n^{'}})$ . In addition, $d (a_{n}, {\tilde{a}}_{n^{'}})$ is defined as the pseudo-distance between two vectors [13]:
$d (a_{n}, {\tilde{a}}_{n^{'}}) = 1 - \frac{{∥a_{n}^{T} {\tilde{a}}_{n^{'}}∥}^{2}}{∥ a_{n} ∥^{2} ∥ {\tilde{a}}_{n^{'}} ∥^{2}}$
(42)

The criterion (41) is an upper bound of the optimal permutation-invariant criterion. It avoids the burdensome computation of all the permutations. A small value of (41) means a good performance in the sense that $\tilde{A}$ is close to A.
vi)
Test environment: The simulations are carried out in Matlab v7.14 on Mac OS X and run on Intel Quad-Core CPU 2.8 GHz with 32 GB memory. Moreover, we repeat all the experiments with 500 Monte Carlo trials.

Convergence

In this experiment, the convergences of the ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms are compared to those of ACDC, FFDIAG, LUJ1D, QRJ1D, and ${ACDC}_{LU}^{+}$ . The dimensions of the three-way array are set to N = 5 and K = 15. The performance is assessed under three SNR conditions: SNR = - 5, 10, and 25 dB, respectively. Figure 1 shows the convergence curves measured in terms of the cost function as a function of sweeps. It shows that FFDIAG, LUJ1D, and QRJ1D exhibit fast convergence behavior. They converge in less than 20 sweeps. ${ACDC}_{LU}^{+}$ decreases the cost function (4) quasi-linearly. ACDC and ${ACDC}_{LU}^{+}$ do not converge in a maximum of 200 sweeps. The proposed ${JD}_{LU}^{+}$ algorithm converges in about 100 sweeps when SNR =25 dB and SNR =10 dB, and it converges in about 40 sweeps when SNR =-5 dB. Regarding ${JD}_{QR}^{+}$ , it reduces the cost function (11) to the values relatively higher than those achieved by ${JD}_{LU}^{+}$ and converges in about 50 sweeps whatever the SNR is. It seems that FFDIAG, LUJ1D, and QRJ1D achieve the fastest convergence rate. It should be noted that while an algorithm may converge to a point in which the value of the cost function is close to zero, such a point could be a local minimum far from the desired matrix A as shown in Figure 2. The top picture in Figure 2 shows the convergence curves measured in terms of the estimating error $α (A, \tilde{A})$ as a function of sweeps when SNR=25 dB. It shows that the solutions of FFDIAG, LUJ1D, and QRJ1D are still far from optimum. ACDC and ${ACDC}_{LU}^{+}$ give better estimations of A than the previous three methods. The best results are achieved by the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ methods. The middle picture in Figure 2 displays the convergence curves when SNR=10 dB. It can be observed that ACDC converges to a local minimum which is not the global one and that the performance of the proposed methods is still better than that of the five other algorithms. For a low SNR =-5 dB, as shown in the bottom picture in Figure 2, both the methods based on alternating optimization, namely ACDC and ${ACDC}_{LU}^{+}$ , converge to local minima which are less desirable. The proposed algorithms are always able to converge to better results than the classical methods. The average numerical complexities and CPU time of all the algorithms over Monte Carlo trials are shown in Table 2. It is observed that FFDIAG, LUJ1D, and QRJ1D require a small amount of calculations, whereas ${ACDC}_{LU}^{+}$ requires a large amount of calculations. The proposed ${JD}_{LU}^{+}$ just costs a bit more flops and CPU time than ACDC, but it is still much more efficient. Concerning the ${JD}_{QR}^{+}$ algorithm, it is more costly than ${JD}_{LU}^{+}$ , with a comparable performance. We can then conclude that ${JD}_{LU}^{+}$ offers the best performance/complexity compromise in these experiments.

Table 2 Average numerical complexities (in flops) and computation time (in seconds) of the convergence experiment

Full size table

Effect of SNR

In this section, we study the behaviors of the seven algorithms as a function of SNR. The dimensions of the three-way array $C_{N}$ are set to N = 5 and K = 15. We repeat the experiments with SNR ranging from -30 to 50 dB with a step of 2 dB. The top picture in Figure 3 depicts the average curves of $α (A, \tilde{A})$ of the seven algorithms as a function of SNR. The obtained results show that the performance of all the methods increases as SNR grows. For the unconstrained methods, generally, ACDC performs better than FFDIAG, LUJ1D, and QRJ1D. The nonnegativity constraint obviously helps ${ACDC}_{LU}^{+}$ , ${JD}_{LU}^{+}$ , and ${JD}_{QR}^{+}$ to improve the results for lower SNR values. The performance of ACDC and ${ACDC}_{LU}^{+}$ remains stable for higher SNR values due to the small number of available sweeps and the lack of good initializations. Generally, the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms outperform the others when SNR is between -20 and 30 dB and perform similar to FFDIAG, LUJ1D, and QRJ1D when SNR is above 45 dB. The average numerical complexity and CPU time at each SNR level of all the methods in this experiment are shown in the bottom of Figure 3. It shows that the proposed methods achieve better estimations of A and cost less flops and CPU time than ${ACDC}_{LU}^{+}$ . The ${JD}_{LU}^{+}$ gives the best performance/complexity trade-off for all the considered SNR values.

Effect of dimension K

In ICA, the third dimension K of the three-way array corresponds to the number of covariance matrices at different lags, or the number of matrix slices derived from a cumulant array. In this section, we study the influence of K on the performance of the seven algorithms. The first and second dimensions of $C_{N}$ are set to N = 5. The SNR value is fixed to 10 dB. We repeat the experiment with K ranging from 3 to 55. The top picture in Figure 4 shows the average curves of $α (A, \tilde{A})$ of all the algorithms as a function of K. For the five existing methods, ACDC, ${ACDC}_{LU}^{+}$ , FFDIAG, LUJ1D, and QRJ1D, their performance is quite stable with respect to K. The performance of the proposed methods progresses as K increases and then practically stabilizes for high values of K. It indicates that after some point (e.g., K ≥ 20), the additional information brought by an increase of K does not further improve the results. The proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms maintain competitive advantages through all the K values. The two images in the bottom of Figure 4 present the average numerical complexity and CPU time of all the algorithms in this experiment, respectively. It shows that the numerical complexity of ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ is between that of ACDC and ${ACDC}_{LU}^{+}$ . The ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ methods seem to be the most effective algorithms compared to the other methods.

Effect of coherence of D

In this experiment, the effect of the coherence of the third loading matrix D of the three-way array $C = [[A, A, D]]$ is evaluated. Let d_n and d_m denote the n-th and m-th columns of D, respectively. The angle ψ_n,m between d_n and d_m can be derived by using the following Euclidean dot product formula $d_{n}^{T} d_{m} = ∥ d_{n} ∥ ∥ d_{m} ∥ cos (ψ_{n, m})$ . Then, the coherence ρ of D is defined as the maximum absolute cosine of angle ψ_n,m between the columns of D as follows:

ρ = max_{\binom{n, m}{n \neq m}} | cos (ψ_{n, m}) | with cos (ψ_{n, m}) = \frac{d_{n}^{T} d_{m}}{∥ d_{n} ∥ ∥ d_{m} ∥}

(43)

The quantity ρ is also known as the modulus of uniqueness of JDC [43]. By its definition (43), ρ falls in the range of [ 0,1]. The JDC problem is considered to be ill-conditioned when ρ is close to 1. Such an ill-conditioned problem can be met in ICA when A has nearly collinear column vectors. For example, in order to perform ICA, provided that all the sources are non-Gaussian, which is often the case in practice, we can build a three-way array by stacking the matrix slices of the fourth-order cumulant array of the observation data. Then, the loading matrix D can be expressed as follows:

D = (A ⊙ A) C_{4, {s}}

(44)

where $C_{4, {s}} = diag [C_{1, 1, 1, 1, {s}}, \dots, C_{N, N, N, N, {s}}]$ is a (N × N) diagonal matrix with $C_{n, n, n, n, {s}}$ being the fourth-order cumulant of the n-th source, n ∈ {1,⋯,N}, and where ⊙ denotes the Khatri-Rao product. It can be observed that the coherence of the columns of A will influence the coherence of the matrix D. In the following test, the dimensions of the three-way array $C_{N}$ are set to N = 5 and K = 15. The SNR value is fixed to 10 dB. In order to control ρ, firstly, we randomly generate an orthogonal matrix so that ρ = 0 by orthogonalizing a (15×5) random matrix. Secondly, we rotate its five columns such that all the internal angles between any columns are equal to a predefined value ψ. Therefore, ρ is only controlled by the angle ψ and equals to $| cos (ψ) |$ . We repeat the experiment with the angle ψ ranging from 0 to π/2 with a step of π/60. A small ψ value means a large ρ value. The top picture in Figure 5 displays the average curves of $α (A, \tilde{A})$ of all the algorithms as a function of ψ. It shows that the nonnegativity constrained methods ${ACDC}_{LU}^{+}$ , ${JD}_{LU}^{+}$ , and ${JD}_{QR}^{+}$ , outperform the unconstrained ones ACDC, FFDIAG, LUJ1D, and QRJ1D. The proposed algorithms are more efficient, particularly when the coherence level is high. The average numerical complexity and CPU time displayed in the bottom of Figure 5 indicate that the ${JD}_{LU}^{+}$ algorithm provides the best performance/complexity compromise, while the ${JD}_{QR}^{+}$ algorithm is also competitive with regard to ${ACDC}_{LU}^{+}$ .

Effect of condition number of D ^(k)

When the JDC problem is considered, a diagonal matrix D^(k) could contain some diagonal elements which, despite being non-zero, are many orders of magnitude lower than some other elements, leading to an ill-conditioned matrix C^(k). For the proposed methods, the inverse of such a matrix C^(k) would contain numerical errors. In this experiment, we study the performance of the seven algorithms as a function of the condition number of one of the diagonal matrices D^(k). The dimensions of the three-way array $C_{N}$ are set to N = 5 and K = 15. The SNR value is set to 10 dB. We vary the condition number of the first diagonal matrix D⁽¹⁾ from 1 to 1,000 by fixing the ratio of its largest diagonal element to its smallest diagonal element. The top picture in Figure 6 displays the average curves of the estimating error $α (A, \tilde{A})$ of the seven algorithms as a function of the condition number of D⁽¹⁾. The results reveal that a highly ill-conditioned diagonal matrix D⁽¹⁾ has a clear negative effect on the estimation accuracy of all the algorithms. The nonnegativity constrained methods ${ACDC}_{LU}^{+}$ , ${JD}_{LU}^{+}$ , and ${JD}_{QR}^{+}$ outperform the classical algorithms ACDC, FFDIAG, LUJ1D, and QRJ1D whatever the condition number is. The proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms maintain advantages when the condition number is less than 100. Regarding the cases of larger condition numbers, ${ACDC}_{LU}^{+}$ is more superior since it does not need to invert the highly ill-conditioned matrix. It is worthy pointing out that in practice, we can choose these sufficiently well-conditioned matrices C^(k) for the proposed methods, whose condition numbers are below a predefined threshold. In addition, a weighted cost function for which weights would depend on the condition number of each matrix can be considered. On the other hand, the performance of the classical methods can also be improved by choosing a particular subset of available matrices [57] and by properly weighting the cost functions [49]. In order to give a fair comparison, all the algorithms operate on the same set of matrices in all the experiments of this paper. In addition, the average numerical complexity and CPU time at each condition number of all the methods in this experiment are shown in the bottom of Figure 6. It shows that the proposed methods give the best performance/complexity trade-off compared to ${ACDC}_{LU}^{+}$ whatever the condition number is.

Test with a non-square matrix A

As described in the section of practical issues, when a non-square matrix with N>P is met in ICA, we propose to compress it by a nonnegative compression matrix [53], such that the resulting matrix $\bar{A} = W_{+} A$ is a (P × P) nonnegative square matrix. Then, the proposed methods can be applied to estimate $\bar{A}$ . Similar to the classical prewhitening, the nonnegative compression step could introduce numerical errors. In this experiment, we compare our methods to ACDC and ${ACDC}_{LU}^{+}$ through a simulated ICA model. The latter algorithms can directly estimate a non-square matrix A from the fourth-order cumulant matrix slices. The ICA model is established as follows:

x [f] = A s [f] + ν [f]

(45)

where $x [f] = {[x_{1} [f], \dots, x_{N} [f]]}^{T}$ is the (N × 1) observation vector, $s [f] = {[s_{1} [f], s_{2} [f], s_{3} [f]]}^{T}$ is the (3 × 1) zero-mean unit-variance source vector whose elements are independently drawn from a uniform distribution over $[- \sqrt{3}, \sqrt{3}]$ , $ν = {[ν_{1} [f], \dots, ν_{N} [f]]}^{T}$ is the (N × 1) zero-mean unit-variance Gaussian noise vector, and A is the (N × 3) nonnegative mixing matrix whose components are independently drawn from a uniform distribution over [ 0,1]. In this context, the SNR is defined by:

SNR = 20 \underset{10}{log} (∥ {A s [f]} ∥_{F} / ∥ {ν [f]} ∥_{F})

(46)

For the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms, the given realization of {x[f] } is compressed by means of a matrix computed using the method proposed in [53], leading to a three-dimensional vector ${\bar{x} [f]}$ . We compute the fourth-order cumulant array of ${\bar{x} [f]}$ and choose the first three matrix slices in order to build a three-way array. Hence, ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ decompose a (3×3×3) array. Once the compressed mixing matrix $\bar{A}$ is estimated, the original mixing matrix is obtained by Eq. 39. Regarding ACDC and ${ACDC}_{LU}^{+}$ , the fourth-order cumulant array of {x[f] } is directly computed without compression. We apply ACDC and ${ACDC}_{LU}^{+}$ on two three-way arrays with different third dimensions. The first array of dimension (N × N × 3) is built by choosing the first three matrix slices from the fourth-order cumulant array, while the second array of dimension (N × N× N) is built using the first N matrix slices. We study the impact of the number of observations N on the performance of the JDC algorithms, by varying N from 4 to 24. The SNR value is fixed to 5 dB. The number of samples used to estimate the cumulants is set to 10³. Figure 7 shows the average curves of the estimating error $α (A, \tilde{A})$ of all the algorithms as a function of N. As it can be seen, when N ≤ 15, the larger the value of N, the more accurate estimation of A is achieved. When N > 15, the further increase of N does not bring significant improvement in terms of the estimation accuracy. ACDC and ${ACDC}_{LU}^{+}$ give better results when the array with a larger third dimension is considered. Their results on (N × N × N) arrays outperform the proposed methods when N = 4. ${ACDC}_{LU}^{+}$ also gives the best estimation on (N × N × N) arrays with N = 5. It suggests that the numerical errors introduced by the compression step limit the performance of the proposed methods when only a small number of observation is available. Such a negative effect can be partially compensated by using a large number of observations, since the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ methods maintain the highest performance in terms of estimation accuracy when N≥6. The performance ACDC and ${ACDC}_{LU}^{+}$ can be further improved by using a (N × N ×N²) array, which contains all the N² fourth-order cumulant matrix slices. However, it leads to a higher numerical complexity especially for a large value of N. Regarding the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ methods, their performance can also be improved by using all the nine matrix slices of the fourth-order cumulant array of the compressed observation vector. Nevertheless, the experimental result has already shown that by using only a small number of matrix slices, ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ can maintain lower numerical complexities than ACDC and ${ACDC}_{LU}^{+}$ , while achieving better estimation results, when a large value of N is considered. Therefore, despite the negative influence of the nonnegative compression, the proposed methods still offer a good performance/complexity compromise to estimate a non-square matrix A.

BSS application on MRS data

In this section, we aim to illustrate the potential capability of the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms for solving a real-life BSS problem by an application carried on simulated MRS data.

MRS is a powerful non-invasive analytical technique for analyzing the chemical content of MR-visible nuclei and therefore enjoys particular advantages for assessing metabolism. The chemical property of each nucleus determines the frequency at which it appears in the MR spectrum, giving rise to peaks corresponding to specific metabolites [58]. Therefore, the MRS observation spectra can be modeled as the mixture of the spectrum of each constitutional source metabolite. More specifically, it can be written as the noisy linear instantaneous mixing model described in Eq. 45, where is the MRS observation vector, is the source vector representing the statistically quasi-independent source metabolites, is the instrumental noise vector, and is the nonnegative mixing matrix containing the positive concentrations of the source metabolites. SNR is defined as in Eq. 46. In this experiment, two simulated MRS source metabolites {s₁[f] } and {s₂[f] }, namely the Choline (Cho) and Myo-inositol (Ins) (see Figure 8b), are generated by Lorentzian and Gaussian functions [59]. Each of the sources contains 10³ samples. The observation vector x[f] is generated according to (45). The components of the (N×2) mixing matrix A are randomly drawn from a uniform distribution. The additive noise ν[f] is modeled as a zero-mean unit-variance Gaussian vector. The ICA methods based on the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms, namely ${JD}_{LU}^{+}$ -ICA and ${JD}_{QR}^{+}$ -ICA, consist of four steps: i) compressing {x[f] } by means of a matrix [53], ii) estimating the fourth-order cumulant array of the compressed observations and stacking all the cumulant matrix slices in a three-way array, iii) decomposing the resulting three-way array by means of ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ , respectively, and iv) reconstructing the sources. The ${JD}_{LU}^{+}$ -ICA and ${JD}_{QR}^{+}$ -ICA are compared to four state-of-the-art BSS algorithms, namely two efficient ICA methods CoM ₂[54] and SOBI [47], the nonnegative ICA (NNICA) method with a line search along the geodesic [55], and the NMF method [56] based on alternating nonnegativity least squares. The performance is assessed by means of the error $α ({s [f]}^{T}, {\tilde{s} [f]}^{T})$ between the true source s[f] and its estimate $\tilde{s} [f]$ , the numerical complexity, and the CPU time. To find out the detailed analysis of the numerical complexity of the classical ICA algorithms, the reader can refer to the book chapter [60]. Figure 8 shows an example of the separation results of all the methods with N = 32 observations and a SNR of 10 dB. Regarding CoM ₂, SOBI, NNICA, and NMF, there are some obvious disturbances presented in the estimated metabolites. As far as ${JD}_{LU}^{+}$ -ICA and ${JD}_{QR}^{+}$ -ICA are concerned, the estimated source metabolites are quasi-perfect. Furthermore, the comprehensive performance of all the methods will be studied by the following experiments with 200 independent Monte Carlo trials.

In the first experiment, the effect of the number of observations N is evaluated. The SNR is fixed to 10 dB. The six methods are compared with N ranging from 4 to 116 with a step of 4. The average curves of error $α ({s [f]}^{T}, {\tilde{s} [f]}^{T})$ as a function of N are shown in the left image of Figure 9. It can be seen that the estimating errors of all the methods improve as N increases. It suggests that in noisy BSS contexts, using more sensors often yields better results. The proposed ${JD}_{LU}^{+}$ -ICA and ${JD}_{QR}^{+}$ -ICA methods maintain the competitive advantages. The average curves of the numerical complexities of this experiment are shown in the bottom left picture of Figure 9. We can notice that the numerical complexities of all the methods increase with N. The complexities of ${JD}_{LU}^{+}$ -ICA and ${JD}_{QR}^{+}$ -ICA seem identical in the logarithmic scaled plot, which is because theoretically their complexities are mainly dominated by the computation of the nonnegative compression step and of the cumulants. Indeed, ${JD}_{LU}^{+}$ -ICA is more computationally efficient than ${JD}_{QR}^{+}$ -ICA in the step of CP decomposition of the cumulant array. This can be verified by the average CPU time of those methods, shown in the bottom right image of Figure 9. We can observe that ${JD}_{LU}^{+}$ -ICA is slower than CoM ₂, but it is faster than NNICA, SOBI, and NMF.

In the second experiment, we study the influence of SNR on the performance of the six methods. The number of observations N is set to 32. SNR is varied from 0 to 50 dB with a step of 2 dB. The average curves of the estimating error $α ({s [f]}^{T}, {\tilde{s} [f]}^{T})$ as well as those of the numerical complexities and CPU time as a function of SNR of all the six methods are shown in Figure 10. The proposed ${JD}_{LU}^{+}$ -ICA and ${JD}_{QR}^{+}$ -ICA methods provide the best estimation results with moderate computational complexities and CPU time. Generally speaking, the ${JD}_{LU}^{+}$ -ICA algorithm offers the best performance/complexity trade-off in this BSS experimental context.

Conclusions

We have proposed two methods, called ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ , in order to achieve the CP decomposition of semi-nonnegative semi-symmetric three-way arrays. The nonnegativity constraint is imposed on the two symmetric modes of three-way arrays by means of a square change of variable, giving rise to an unconstrained joint diagonalization by congruence problem. Therefore, the nonnegative loading matrix can be estimated by computing the joint diagonalizer. We consider the elementary LU and QR parameterizations of the Hadamard square root of the nonnegative joint diagonalizer, leading to two Jacobilike optimization procedures. In each Jacobi-like iteration, the optimization is formulated into a minimization of a polynomial or rational function with respect to only one parameter. In addition, the numerical complexity for each algorithm has been analyzed.

The performance of the proposed ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ algorithms is evaluated with simulated semi-nonnegative semi-symmetric three-way arrays. Four classical nonorthogonal JDC methods without nonnegativity constraints including ACDC [23], FFDIAG [25], LUJ1D [26], and QRJ1D [26] and one nonnegative JDC method ${ACDC}_{LU}^{+}$ [41] are tested as reference methods. The performance is assessed in terms of the matrix estimation accuracy, the numerical complexity, and the CPU time. The convergence property, the influence of SNR, the impact of dimension, the effect of coherence, and the influence of condition number are extensively studied by Monte Carlo experiments. The obtained results show that the proposed algorithms offer better estimation accuracy by means of exploiting the nonnegativity a priori. The ${JD}_{LU}^{+}$ algorithm provides the best performance/complexity compromise.

The proposed algorithms are suitable tools for solving the ICA problems where a nonnegative mixing matrix is considered, such as in MRS. In this case, the three-way array built by stacking the matrix slices of a HO cumulant array fulfills the semi-nonnegative semi-symmetric structure. We proposed two ICA methods, namely ${JD}_{LU}^{+}$ -ICA and ${JD}_{QR}^{+}$ -ICA, based on CP decomposition of the fourth-order cumulant array using ${JD}_{LU}^{+}$ and ${JD}_{QR}^{+}$ , respectively. The source separation ability of the proposed algorithms is verified through a BSS application carried out on simulated MRS data. The ${JD}_{LU}^{+}$ -ICA and ${JD}_{QR}^{+}$ -ICA are compared to one NMF method [56], one nonnegative ICA method [55], and two classical ICA methods, namely CoM ₂[54] and SOBI [47]. The performance is comprehensively studied as a function of the number of observations and of SNR. The experimental results demonstrate the improvement of the proposed methods in terms of the source estimation accuracy and also show that exploiting the two a priori of the data, namely the nonnegativity of the mixing matrix and the statistical independence of the sources, allows us to achieve better estimation results. The ${JD}_{LU}^{+}$ -ICA algorithm provides the best performance/complexity trade-off.

References

Smilde A, Bro R, Geladi P: Multi-way Analysis: Applications in the Chemical Sciences. Wiley, West Sussex; 2004.
Book Google Scholar
de Almeida ALF, Favier G, Ximenes LR: Space-time-frequency (STF) MIMO communication systems with blind receiver based on a generalized PARATUCK2 model. IEEE Trans. Signal Process 2013, 61(8):1895-1909.
Article MathSciNet Google Scholar
De Vos M, Vergult A, De Lathauwer L, De Clercq W, Van Huffel S, Dupont P, Palmini A, Van Paesschen W: Canonical decomposition of ictal scalp EEG reliably detects the seizure onset zone. Neuroimage 2007, 37(3):844-854. 10.1016/j.neuroimage.2007.04.041
Article Google Scholar
Comon P, Luciani X, de Almeida ALF: Tensor decompositions, alternating least squares and other tales. J. Chemometr 2009, 23: 393-405. 10.1002/cem.1236
Article Google Scholar
Tucker LR: Some mathematical notes on three-mode factor analysis. Psychometrika 1966, 31(3):279-311. 10.1007/BF02289464
Article MathSciNet Google Scholar
De Lathauwer L, De Moor B, Vandewalle J: A multilinear singular value decomposition. SIAM J. Matrix Anal. Appl 2000, 21(4):1253-1278. 10.1137/S0895479896305696
Article MathSciNet MATH Google Scholar
Kruskal JB: Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics. Lin. Algebra Appl 1977, 18(2):98-138.
Article MathSciNet MATH Google Scholar
Hitchcock FL: The expression of a tensor or a polyadic as a sum of products. J. Math. Phys 1927, 6(1):164-189.
Article MathSciNet MATH Google Scholar
Kroonenberg PM: Applied Multiway Data Analysis. Wiley, Hoboken; 2008.
Book MATH Google Scholar
Sidiropoulos ND, Bro R, Giannakis GB: Parallel factor analysis in sensor array processing. IEEE Trans. Signal Process 2000, 48(8):2377-2388. 10.1109/78.852018
Article Google Scholar
de Almeida ALF, Favier G, Motab JCM: PARAFAC-based unified tensor modeling for wireless communication systems with application to blind multiuser equalization. Signal Process 2007, 87(2):337-351. 10.1016/j.sigpro.2005.12.014
Article MATH Google Scholar
de Almeida ALF, Favier G: Double Khatri-Rao space-time-frequency coding using semi-blind PARAFAC based receiver. IEEE Signal Process. Lett 2013, 20(5):471-474.
Article Google Scholar
Albera L, Ferréol A, Comon P, Chevalier P: Blind identification of overcomplete mixtures of sources (BIOME). Lin. Algebra Appl 2004, 391: 3-30.
Article MATH MathSciNet Google Scholar
Röemer F, Haardt M: Tensor-based channel estimation and iterative refinements for two-way relaying with multiple antennas and spatial reuse. IEEE Trans. Signal Process 2010, 58(11):5720-5735.
Article MathSciNet Google Scholar
Harshman RA, Lundy ME: PARAFAC: parallel factor analysis. Comput. Stat. Data Anal 1994, 18(1):39-72. 10.1016/0167-9473(94)90132-5
Article MATH Google Scholar
Uschmajew A: Local convergence of the alternating least squares algorithm for canonical tensor approximation. SIAM. J. Matrix Anal. Appl 2012, 33(2):639-652. 10.1137/110843587
Article MathSciNet MATH Google Scholar
Rajih M, Comon P, Harshman RA: Enhanced line search: a novel method to accelerate PARAFAC. SIAM J. Matrix Anal. Appl 2008, 30(3):1128-1147. 10.1137/06065577
Article MathSciNet MATH Google Scholar
Acar E, Dunlavy DM, Kolda TG: A scalable optimization approach for fitting canonical tensor decompositions. J. Chemometr 2011, 25(2):67-86. 10.1002/cem.1335
Article Google Scholar
Röemer F, Haardt M: A semi-algebraic framework for approximate CP decompositions via simultaneous matrix diagonalizations (SECSI). Signal Process 2013, 93(9):2722-2738. 10.1016/j.sigpro.2013.02.016
Article Google Scholar
Luciani X, Albera L: Canonical polyadic decomposition based on joint eigenvalue decomposition. Chemometr. Intell. Lab 2014, 132: 152-167.
Article Google Scholar
Carroll JD, Chang J-J: Analysis of individual differences in multidimensional scaling via an n-way generalization of Eckart-Young decomposition. Psychometrika 1970, 35(3):283-319. 10.1007/BF02310791
Article MATH Google Scholar
Husson F, Pagés J: INDSCAL model: geometrical interpretation and methodology. Comput. Stat. Data Anal 2006, 50(2):358-378. 10.1016/j.csda.2004.08.005
Article MATH MathSciNet Google Scholar
Yeredor A: Non-orthogonal joint diagonalization in the least-squares sense with application in blind source separation. IEEE Trans. Signal Process 2002, 50(7):1545-1553. 10.1109/TSP.2002.1011195
Article MathSciNet Google Scholar
Cardoso JF, Souloumiac A: Jacobi angles for simultaneous diagonalization. SIAM J. Matrix Anal. Appl 1996, 17: 161-164. 10.1137/S0895479893259546
Article MathSciNet MATH Google Scholar
Ziehe A, Laskov P, Nolte G, Muller K-R: A fast algorithm for joint diagonalization with non-orthogonal transformations and its application to blind source separation. J. Mach. Learn. Res 2004, 5: 777-800.
MathSciNet MATH Google Scholar
Afsari B: Simple LU and QR based non-orthogonal matrix joint diagonalization. In ICA 2006, Springer LNCS 3889. Charleston, SC, USA; 5–8 March 2006.
Google Scholar
Van der Veen AJ: Joint diagonalization via subspace fitting techniques. In Proc. ICASSP ‘01. Salt Lake, City, UT; 7–11 May 2001:2773-2776.
Google Scholar
Yeredor A: On using exact joint diagonalization for noniterative approximate joint diagonalization. IEEE Signal Process. Lett 2005, 12(9):645-648.
Article Google Scholar
Vollgraf R, Obermayer K: Quadratic optimization for simultaneous matrix diagonalization. IEEE Trans. Signal Process 2006, 54(9):3270-3278.
Article Google Scholar
Li XL, Zhang XD: Nonorthogonal joint diagonalization free of degenerate solution. IEEE Trans. Signal Process 2007, 55(5):1803-1814.
Article MathSciNet Google Scholar
Souloumiac A: Nonorthogonal joint diagonalization by combining Givens and hyperbolic rotations. IEEE Trans. Signal Process 2009, 57(6):2222-2231.
Article MathSciNet Google Scholar
Xu XF, Feng DZ, Zheng WX: A fast algorithm for nonunitary joint diagonalization and its application to blind source separation. IEEE Trans. Signal Process 2011, 59(7):3457-3463.
Article MathSciNet Google Scholar
Chabriel G, Barrère J: A direct algorithm for nonorthogonal approximate joint diagonalization. IEEE Trans. Signal Process 2012, 60(1):39-47.
Article MathSciNet Google Scholar
Chabriel G, Kleinsteuber M, Moreau E, Shen H, Tichavský P, Yeredor A: Joint matrices decompositions and blind source separation: A survey of methods, identification, and applications. IEEE Signal Process. Mag 2014, 31(3):34-43.
Article Google Scholar
Lee DD, Seung HS: Learning the parts of objects by non-negative matrix factorization. Nature 1999, 401(6755):788-791. 10.1038/44565
Article Google Scholar
Zhang Q, Wang H, Plemmons RJ, Pauca VP: Tensor methods for hyperspectral data analysis: a space object material identification study. J. Opt. Soc. Am. A. Opt. Image Sci. Vis 2008, 25(12):3001-3012. 10.1364/JOSAA.25.003001
Article Google Scholar
Royer J-P, Thirion-Moreau N, Comon P: Computing the polyadic decomposition of nonnegative third order tensors. Signal Process 2011, 91(9):2159-2171. 10.1016/j.sigpro.2011.03.006
Article MATH Google Scholar
Cichocki A, Zdunek R, Phan AH, Amari S: Nonnegative Matrix and Tensor Factorizations: Applications to Exploratory Multi-way Data Analysis and Blind Source Separation. Wiley, West Sussex; 2009.
Book Google Scholar
Zhou GX, Cichocki A, Zhao Q, Xie SL: Nonnegative matrix and tensor factorizations : an algorithmic perspective. IEEE Signal Process. Mag 2014, 31(3):54-65.
Article Google Scholar
Coloigner J, Karfoul A, Albera L, Comon P: Line search and trust region strategies for canonical decomposition of semi-nonnegative semi-symmetric 3rd order tensors. Lin. Algebra Appl 2014, 450(1):334-374.
Article MathSciNet MATH Google Scholar
Wang L, Albera L, Kachenoura A, Shu HZ, Senhadji L: Nonnegative joint diagonalization by congruence based on LU matrix factorization. IEEE Signal Process. Lett 2013, 20(8):807-810.
Article Google Scholar
Wang L, Albera L, Kachenoura A, Shu HZ, Senhadji L: CP decomposition of semi-nonnegative semi-symmetric tensors based on QR matrix factorization. In SAM’14, Proceedings of the Eighth IEEE Sensor Array and Multichannel Signal Processing Workshop. A Coruna, Spain; 22–25 June 2014:449-452.
Google Scholar
Afsari B: Sensitivity analysis for the problem of matrix joint diagonalization. SIAM J. Matrix Anal. Appl 2008, 30(3):1148-1171. 10.1137/060655997
Article MathSciNet MATH Google Scholar
Wax M, Sheinvald J: A least-squares approach to joint diagonalization. IEEE Signal Process. Lett 1997, 4(2):52-53.
Article Google Scholar
Dégerine S, Kane E: A comparative study of approximate joint diagonalization algorithms for blind source separation in presence of additive noise. IEEE Trans. Signal Process 2007, 55(6):3022-3031.
Article MathSciNet Google Scholar
Fadaili EM, Thirion-Moreau N, Moreau E: Nonorthogonal joint diagonalization/zero diagonalization for source separation based on time-frequency distributions. IEEE Trans. Signal Process 2007, 55(5):1673-1687.
Article MathSciNet Google Scholar
Belouchrani A, Abed-Meraim K, Cardoso JF, Moulines E: A blind source separation technique using second-order statistics. IEEE Trans. Signal Process 1997, 45(2):434-444. 10.1109/78.554307
Article Google Scholar
Pham DT: Joint approximate diagonalization of positive definite Hermitian matrices. SIAM J. Matrix Anal. Appl 2001, 22: 1837-1848.
Article MATH MathSciNet Google Scholar
Tichavský P, Yeredor A: Fast approximate joint diagonalization incorporating weight matrices. IEEE Trans. Signal Process 2009, 57(3):878-891.
Article MathSciNet Google Scholar
Chu M, Diele F, Plemmons R, Ragni S: Optimality computation and interpretation of nonnegative matrix factorizations. Technical report, Wake Forest University 2004
Google Scholar
Meyer CD: Matrix Analysis and Applied Linear Algebra. SIAM, Philadelphia; 2000.
Book Google Scholar
Vaidyanathan PP: Multirate Systems and Filter Banks. PTR Prentice Hall, United States; 1993.
MATH Google Scholar
Wang L, Kachenoura A, Albera L, Karfoul A, Shu HZ, Senhadji L: Nonnegative compression for semi-nonnegative independent component analysis. In SAM’14, Proceedings of the Eighth IEEE Sensor Array and Multichannel Signal Processing Workshop. A Coruna, Spain; 22–25 June 2014:81-84.
Google Scholar
Comon P: Independent component analysis, a new concept? Signal Process 1994, 36(3):287-314. 10.1016/0165-1684(94)90029-9
Article MATH Google Scholar
Plumbley MD: Algorithms for nonnegative independent component analysis. IEEE Trans. Neural Netw 2003, 14(3):534-543. 10.1109/TNN.2003.810616
Article Google Scholar
Kim H, Park H: Nonnegative matrix factorization based on alternating nonnegativity constrained least squares and active set method. SIAM J. Matrix Anal. Appl 2008, 30(2):713-730. 10.1137/07069239X
Article MathSciNet MATH Google Scholar
De Lathauwer: Algebraic methods after prewhitening. In Handbook of Blind Source Separation, ed. by P Comon, C Jutten,. Elsevier, Oxford; 2010:155-177. Chap. 5
Chapter Google Scholar
Befroy DE, Shulman GI: Magnetic resonance spectroscopy studies of human metabolism. Diabetes 2011, 60(5):1361-1369. 10.2337/db09-0916
Article Google Scholar
Moussaoui S: Séparation de sources non-négatives: application au traitement des signaux de spectroscopie. PhD thesis, Université Henri Poincaré, (2005)
Google Scholar
Albera L, Comon P, Parra LC, Karfoul A, Kachenoura A, Senhadji L: Biomedical applications. In Handbook of Blind Source Separation ed. by P Comon, C Jutten,. Elsevier, Oxford; 2010:737-777. Chap. 18
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

1INSERM, UMR 1099, F-35000, Rennes, France
Lu Wang, Laurent Albera, Amar Kachenoura & Lotfi Senhadji
LTSI, Université de Rennes 1, Bât. 22, Campus de Beaulieu, F-35000, Rennes, France
Lu Wang, Laurent Albera, Amar Kachenoura & Lotfi Senhadji
Laboratory of Image Science and Technology, School of Computer Science and Engineering, Southeast University, Qun Xian Building, No. 2 Sipailou, 210096, Nanjing, China
Huazhong Shu
Centre de Recherche en Information Biomédicale Sino-Français (CRIBs), F-35000, Rennes, France
Lu Wang, Laurent Albera, Amar Kachenoura, Huazhong Shu & Lotfi Senhadji
Centre INRIA Rennes-Bretagne Atlantique, Bât. 12G, Campus de Beaulieu, F-35042, Rennes, France
Laurent Albera

Authors

Lu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Laurent Albera
View author publications
You can also search for this author in PubMed Google Scholar
Amar Kachenoura
View author publications
You can also search for this author in PubMed Google Scholar
Huazhong Shu
View author publications
You can also search for this author in PubMed Google Scholar
Lotfi Senhadji
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Laurent Albera.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, L., Albera, L., Kachenoura, A. et al. Canonical polyadic decomposition of third-order semi-nonnegative semi-symmetric tensors using LU and QR matrix factorizations. EURASIP J. Adv. Signal Process. 2014, 150 (2014). https://doi.org/10.1186/1687-6180-2014-150

Download citation

Received: 31 March 2014
Accepted: 10 September 2014
Published: 08 October 2014
DOI: https://doi.org/10.1186/1687-6180-2014-150

Canonical polyadic decomposition of third-order semi-nonnegative semi-symmetric tensors using LU and QR matrix factorizations

Abstract

Introduction

Multilinear algebra prerequisites and problem statement

Notations

Definitions and problem formulation

Definition 1.

Definition 2.

Definition 3.

Definition 4.

Definition 5.

Problem 1.

Problem 2.

JDC cost functions

Methods

Problem reformulation

Problem 3.

LU and QR parameterizations of B

Definition 6.

Definition 7.

Definition 8.

Lemma 1.

Lemma 2.

Minimization with respect to the elementary upper triangular matrix U(i,j)(ui,j)

Proposition 1.

Proposition 2.

Minimization with respect to the Givens rotation matrix Q(i,j)(θi,j)

Proposition 3.

Proposition 4.

Proposition 5.

Practical issues

Numerical complexity

Simulation results

Simulated semi-nonnegative INDSCAL model

Convergence

Effect of SNR

Effect of dimension K

Effect of coherence of D

Effect of condition number of D (k)

Test with a non-square matrix A

BSS application on MRS data

Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Minimization with respect to the elementary upper triangular matrix U^(i,j)(u_i,j)

Minimization with respect to the Givens rotation matrix Q^(i,j)(θ_i,j)

Effect of condition number of D ^(k)