Citation: Ming Yang, Dunren Che, Wen Liu, Zhao Kang, Chong Peng, Mingqing Xiao, Qiang Cheng. On identifiability of 3-tensors of multilinear rank $(1, L_r, L_r)$. Big Data and Information Analytics, 2016, 1(4): 391-401. doi: 10.3934/bdia.2016017
The importance and usefulness of tensors, characterized as multiway arrays, for big data sets has been increasingly recognized in recent decades, as witnessed by a number of surveys [15,20,14,5,17], among others. The identifiability property (see [3,10,2,9]), covering both exact and generic identifiability, is critical for tensor models and is widely used in areas such as signal processing, statistics, and computer science. For instance, in signal processing the tensor encodes data from received signals, and one needs to decompose the tensor to obtain the transmitted signals; if uniqueness does not hold, the transmitted signals may not be recoverable. Establishing the uniqueness of an appropriate tensor decomposition is therefore not only of mathematical interest but also necessary in various real applications. Extensive studies within the framework of algebraic geometry have provided various characterizations, involving tensor rank and dimensions, that ensure generic identifiability.
In this paper, we consider the model of low multilinear rank tensor decomposition into terms of multilinear rank $(1, L_r, L_r)$.
Definition 1.1. (see Chapter III in [19]) Let $V_1,\dots,V_n$ be vector spaces over a field $\mathbb{K}$. The tensor product $V_1\otimes\cdots\otimes V_n$ is the quotient of the free vector space generated by $V_1\times\cdots\times V_n$ by the subspace spanned by all elements of the form
$$ (a_1,\dots,\alpha a_k+\beta a'_k,\dots,a_n)-\alpha(a_1,\dots,a_k,\dots,a_n)-\beta(a_1,\dots,a'_k,\dots,a_n) $$
for all $a_i\in V_i$, $a'_k\in V_k$, $\alpha,\beta\in\mathbb{K}$, and $1\le k\le n$.
An element of $V_1\otimes\cdots\otimes V_n$ is called a tensor. The elements of the form $a_1\otimes\cdots\otimes a_n$ with $a_i\in V_i$ are called decomposable (rank-one) tensors, and they span $V_1\otimes\cdots\otimes V_n$.
If $V_1=\mathbb{K}^l$, $V_2=\mathbb{K}^m$, and $V_3=\mathbb{K}^n$, then we may identify
$$ \mathbb{K}^l\otimes\mathbb{K}^m\otimes\mathbb{K}^n=\mathbb{K}^{l\times m\times n} $$
through the interpretation of the tensor product of vectors as a tensor via the Segre outer product,
$$ [u_1,\dots,u_l]^T\otimes[v_1,\dots,v_m]^T\otimes[w_1,\dots,w_n]^T=\big[u_i v_j w_k\big]_{i,j,k=1}^{l,m,n}. $$
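To make this identification concrete, here is a minimal NumPy sketch (an illustration added to this edition; the vectors are arbitrary choices) that forms the Segre outer product of three vectors and checks one entry against the formula above.

```python
import numpy as np

# Segre outer product of three vectors: (u ⊗ v ⊗ w)_{ijk} = u_i * v_j * w_k.
u = np.array([1.0, 2.0])             # in K^l with l = 2
v = np.array([3.0, 0.0, -1.0])       # in K^m with m = 3
w = np.array([2.0, 5.0, 1.0, 4.0])   # in K^n with n = 4

X = np.einsum('i,j,k->ijk', u, v, w)  # an element of K^{l x m x n}
assert X.shape == (2, 3, 4)
assert np.isclose(X[1, 0, 3], u[1] * v[0] * w[3])
```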
Definition 1.2. The Khatri-Rao product is the "matching columnwise" Segre outer product. Given matrices $A=[a_1\ a_2\ \cdots\ a_K]$ and $B=[b_1\ b_2\ \cdots\ b_K]$ with the same number of columns,
$$ A\odot B=\big[\,a_1\otimes b_1\ \ a_2\otimes b_2\ \ \cdots\ \ a_K\otimes b_K\,\big]. $$
If
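For illustration, the following brief NumPy sketch implements the Khatri-Rao product in its common matrix form, where each columnwise outer product is vectorized so the result has $IJ$ rows; treating the columns this way (rather than as matrices) is an assumption of the sketch, not a statement of the paper's convention.

```python
import numpy as np

def khatri_rao(A: np.ndarray, B: np.ndarray) -> np.ndarray:
    """Columnwise outer products of A (I x K) and B (J x K),
    each vectorized and stacked as the columns of an (I*J) x K matrix."""
    I, K = A.shape
    J, K2 = B.shape
    assert K == K2, "A and B must have the same number of columns"
    # outer[i, j, k] = A[i, k] * B[j, k]; reshaping merges the first two axes.
    return np.einsum('ik,jk->ijk', A, B).reshape(I * J, K)

A = np.arange(6.0).reshape(2, 3)    # arbitrary 2 x 3 example
B = np.arange(12.0).reshape(4, 3)   # arbitrary 4 x 3 example
print(khatri_rao(A, B).shape)       # (8, 3)
```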
Given standard orthonormal bases $\{e^{(n)}_{i_n}\}_{i_n=1}^{I_n}$ of $\mathbb{K}^{I_n}$, $1\le n\le N$, a tensor $\mathcal{X}\in\mathbb{K}^{I_1\times\cdots\times I_N}$ can be written in coordinates as
$$ \mathcal{X}=\sum_{i_1,\dots,i_N=1}^{I_1,\dots,I_N} t_{i_1\cdots i_N}\, e^{(1)}_{i_1}\otimes\cdots\otimes e^{(N)}_{i_N}. $$
In older literature, the
The rank of a tensor $\mathcal{X}$ is the minimal number of decomposable tensors needed to express it as a sum,
$$ \operatorname{rank}(\mathcal{X})=\min\Big\{r:\ \mathcal{X}=\sum_{p=1}^{r} a^{(1)}_p\otimes\cdots\otimes a^{(N)}_p\Big\}. $$
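As a toy illustration of this definition (the rank bound and dimensions below are arbitrary assumptions), a tensor of rank at most $r$ can be assembled directly from $r$ decomposable terms:

```python
import numpy as np

rng = np.random.default_rng(0)
r, I1, I2, I3 = 3, 4, 5, 6   # arbitrary rank bound and dimensions

# X = sum_{p=1}^{r} a_p ⊗ b_p ⊗ c_p has rank at most r by construction.
A = rng.standard_normal((r, I1))
B = rng.standard_normal((r, I2))
C = rng.standard_normal((r, I3))
X = np.einsum('pi,pj,pk->ijk', A, B, C)
print(X.shape)   # (4, 5, 6); computing the exact rank is NP-hard in general
```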
Definition 1.3. The mode-$n$ flattening of a tensor $\mathcal{X}\in\mathbb{K}^{I_1\times\cdots\times I_N}$ is the map
$$ \flat_n:\mathbb{K}^{I_1\times\cdots\times I_N}\to\mathbb{K}^{I_n\times(I_1\cdots\hat{I}_n\cdots I_N)} $$
defined by
$$ (\flat_n(\mathcal{X}))_{ij}=(\mathcal{X})_{s_n(i,j)}, $$
where $s_n(i,j)$ is the multi-index whose $n$-th entry is $i$ and whose remaining entries are enumerated by the column index $j$; the hat in $\hat{I}_n$ indicates that this factor is omitted from the product.
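As an added illustration of the mode-$n$ flattening, here is a minimal NumPy sketch under one common ordering convention for the column index $j$ (an assumption, since the enumeration $s_n$ can be chosen in different ways):

```python
import numpy as np

def flatten_mode(X: np.ndarray, n: int) -> np.ndarray:
    """Mode-n flattening: rows are indexed by the n-th mode (0-based here),
    columns by the remaining modes in a fixed row-major order."""
    return np.moveaxis(X, n, 0).reshape(X.shape[n], -1)

X = np.random.default_rng(0).standard_normal((3, 4, 5))   # arbitrary example
print(flatten_mode(X, 0).shape, flatten_mode(X, 1).shape, flatten_mode(X, 2).shape)
# (3, 20) (4, 15) (5, 12)
```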
For a tensor $\mathcal{X}=[t_{ijk}]\in\mathbb{K}^{l\times m\times n}$, define
$$ r_1=\dim\operatorname{span}_{\mathbb{K}}\{X_{1\bullet\bullet},\dots,X_{l\bullet\bullet}\},\quad r_2=\dim\operatorname{span}_{\mathbb{K}}\{X_{\bullet1\bullet},\dots,X_{\bullet m\bullet}\},\quad r_3=\dim\operatorname{span}_{\mathbb{K}}\{X_{\bullet\bullet1},\dots,X_{\bullet\bullet n}\}. $$
Here the slices are
$$ X_{i\bullet\bullet}=[t_{ijk}]_{j,k=1}^{m,n}\in\mathbb{K}^{m\times n},\qquad X_{\bullet j\bullet}=[t_{ijk}]_{i,k=1}^{l,n}\in\mathbb{K}^{l\times n},\qquad X_{\bullet\bullet k}=[t_{ijk}]_{i,j=1}^{l,m}\in\mathbb{K}^{l\times m}. $$
The multilinear rank of $\mathcal{X}$ is the triple $(r_1,r_2,r_3)$; equivalently, $r_n$ is the rank of the mode-$n$ flattening $\flat_n(\mathcal{X})$.
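For illustration (the example tensor is an arbitrary choice built to have multilinear rank $(1,2,2)$), the multilinear rank can be computed numerically as the matrix ranks of the three flattenings:

```python
import numpy as np

def multilinear_rank(X: np.ndarray) -> tuple:
    """Multilinear rank = matrix ranks of the mode flattenings."""
    return tuple(
        np.linalg.matrix_rank(np.moveaxis(X, n, 0).reshape(X.shape[n], -1))
        for n in range(X.ndim)
    )

rng = np.random.default_rng(1)
a = rng.standard_normal(4)                                       # a in K^I
M = rng.standard_normal((5, 2)) @ rng.standard_normal((2, 6))    # rank-2 matrix in K^{J x K}
X = np.einsum('i,jk->ijk', a, M)                                 # X = a ⊗ M
print(multilinear_rank(X))                                       # (1, 2, 2)
```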
Definition 1.4. (see Definition 11 in [4]) A decomposition of a tensor $\mathcal{X}\in\mathbb{K}^{I\times J\times K}$ into a sum of terms of multilinear rank $(1,L_r,L_r)$ is an expression
$$ \mathcal{X}=\sum_{r=1}^{R} a_r\otimes X_r, $$
in which the $a_r\in\mathbb{K}^{I}$ are nonzero vectors and the $X_r\in\mathbb{K}^{J\times K}$ are matrices with $\operatorname{rank}(X_r)=L_r$.
It is clear that in such a decomposition each term $a_r\otimes X_r$ has multilinear rank $(1,L_r,L_r)$.
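A brief numerical sketch (the dimensions, $R=2$, and $L_1=L_2=2$ below are arbitrary assumptions) of assembling such a decomposition and checking the multilinear rank of the resulting tensor:

```python
import numpy as np

rng = np.random.default_rng(2)
I, J, K, L = 4, 5, 6, 2

# Two terms of multilinear rank (1, L, L): a_r ⊗ X_r with rank(X_r) = L.
a1, a2 = rng.standard_normal(I), rng.standard_normal(I)
X1 = rng.standard_normal((J, L)) @ rng.standard_normal((L, K))
X2 = rng.standard_normal((J, L)) @ rng.standard_normal((L, K))

X = np.einsum('i,jk->ijk', a1, X1) + np.einsum('i,jk->ijk', a2, X2)

# Generically the sum has multilinear rank (R, L1 + L2, L1 + L2).
ranks = tuple(np.linalg.matrix_rank(np.moveaxis(X, n, 0).reshape(X.shape[n], -1))
              for n in range(3))
print(ranks)   # expected: (2, 4, 4)
```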
Definition 1.5. Let $\mu_{\mathbb{K}}$ be a measure on the parameter space $\mathbb{K}^{I\times R}\times\mathbb{K}^{J\times K\times R}$ that is absolutely continuous with respect to the Lebesgue measure. The decomposition in Definition 1.4 is called generically unique if the following measure vanishes:
$$ \mu_{\mathbb{K}}\Big\{(\mathbb{K}^{I\times R}\times\mathbb{K}^{J\times K\times R}):\ \mathcal{X}=\sum_{r=1}^{R}a_r\otimes X_r\ \text{is not unique for}\ a_r\in\mathbb{K}^{I},\ X_r\in\mathbb{K}^{J\times K}\Big\}. $$
Note that in Definition 1.4, we could require
Definition 1.6. A decomposition of a tensor $\mathcal{X}\in\mathbb{K}^{I\times J\times K}$ in an orthogonal frame is an expression
$$ \mathcal{X}=\sum_{r=1}^{R}a_r\otimes X_r. $$
As in Definition 1.4, $X_r\in\mathbb{K}^{J\times K}$ with $\operatorname{rank}(X_r)=L_r$; in addition, the frame vectors $a_r\in\mathbb{K}^{I}$ are required to be orthonormal.
The main results of the paper are the following; their proofs are given in the subsequent sections.
Theorem 1.7. Assume $\mathcal{X}=\sum_{r=1}^{R}a_r\otimes X_r$ is a decomposition as in Definition 1.4 with $a_1,\dots,a_R$ linearly independent. The decomposition is essentially unique if, for every subset $\{j_1,\dots,j_s\}\subset\{1,\dots,R\}$,
$$ \operatorname{span}_{\mathbb{K}}\{X_{j_1},\dots,X_{j_s}\}\cap\Sigma_{\le L_{j_t}}(\mathbb{K}^{J\times K})\subset\{X_{j_1},\dots,X_{j_s}\},\qquad 1\le t\le s, $$
where $\Sigma_{\le L}(\mathbb{K}^{J\times K})$ denotes the set of $J\times K$ matrices of rank at most $L$.
Remark 1. In reasonably small cases, one can use tools from numerical algebraic geometry such as those described in [18,12,13].
Remark 2. A generic point $p$ of $\Sigma_{\le L}(\mathbb{K}^{J\times K})$ can be written as
$$ p=b'_1\otimes c'_1+\cdots+b'_L\otimes c'_L, $$
where the $b'_i\in\mathbb{K}^{J}$ and $c'_i\in\mathbb{K}^{K}$ are generic vectors.
We now establish simpler conditions related to the uniqueness of such decompositions.
Theorem 1.8. For $R=2$, the decomposition in Definition 1.4 is generically unique if
$$ I\ge2,\qquad J=K\neq\text{one of }\Big\{\tfrac{2L_1+L_2}{2},\ \tfrac{2L_2+L_1}{2},\ L_1,\ L_2\Big\}. $$
Theorem 1.9. The decomposition in Definition 1.4 is generically unique if
$$ I\ge R,\qquad K\ge\sum_{r=1}^{R}L_r,\qquad J\ge2\max_i\{L_i\},\qquad \binom{J}{\max_i\{L_i\}}\ge R,\qquad L_i+L_j>L_k $$
for all $1\le i,j,k\le R$.
For low multilinear rank decompositions in an orthogonal frame, we have the following theorem.
Theorem 1.10. A tensor decomposition of $\mathcal{X}\in\mathbb{K}^{I\times J\times K}$,
$$ \mathcal{X}=\sum_{r=1}^{R}a_r\otimes X_r, $$
as in Definition 1.6 is essentially unique if and only if for any non-identity special orthogonal matrix $[\varepsilon_{kr}]\in SO(R)$ acting on the frame,
$$ \operatorname{rank}(\varepsilon_{k1}X_1+\cdots+\varepsilon_{kR}X_R)\neq L_1,\dots,L_R. $$
In this paper, we first provide some known and preliminary results related to tensor decompositions of multilinear rank $(1,L_r,L_r)$, and then give the proofs of the main results.
Definition 2.1. For a vector space
Proof of Theorem 1.7. Suppose $\mathcal{X}=\sum_{r=1}^{R}a'_r\otimes X'_r$ is another decomposition as in Definition 1.4. We first claim that each
$$ a'_r\in\operatorname{span}_{\mathbb{K}}\{a_1,\dots,a_R\}. $$
Indeed, if not, then the component $a'^{*}_r$ of $a'_r$ orthogonal to $\operatorname{span}_{\mathbb{K}}\{a_1,\dots,a_R\}$ is nonzero, and
$$ a'^{*}_r\in\operatorname{span}^{\perp}_{\mathbb{K}}\{a_1,\dots,a_R\}. $$
This implies
$$ \langle\mathcal{X},\,a'^{*}_r\rangle=0=X'_r, $$
which is a contradiction. Therefore, we have
$$ \mathcal{X}=\sum_{r=1}^{R}a'_r\otimes X'_r=\sum_{r=1}^{R}\Big(\sum_{j=1}^{R}\alpha_{rj}\,a_r\otimes X'_j\Big), $$
we know that
$$ X'_r\in\operatorname{span}_{\mathbb{K}}\{X_{j_1},\dots,X_{j_s}\}\cap\Sigma_{\le L_r}(\mathbb{K}^{J\times K}). $$
But
$$\begin{aligned}
a_1\otimes X_1+\cdots+a_R\otimes X_R
&=a_1\otimes X'_{j_t}-\chi_2 a_1\otimes X_2-\cdots-\chi_R a_1\otimes X_R+a_2\otimes X_2+\cdots+a_R\otimes X_R\\
&=a_1\otimes X'_{j_t}+(a_2-\chi_2 a_1)\otimes X_2+\cdots+(a_R-\chi_R a_1)\otimes X_R\\
&=a_1\otimes X'_{j_t}+a'_2\otimes X_2+\cdots+a'_R\otimes X_R.
\end{aligned}$$
So
Example 1. A tensor decomposition of $\mathcal{X}\in\mathbb{K}^{I\times J\times K}$,
$$ \mathcal{X}=\sum_{r=1}^{R}a_r\otimes X_r, $$
as in Definition 1.4 is essentially unique if the singular vectors of $X_1,\dots,X_R$, taken all together, are linearly independent (on both the left and the right).
Proof. Assume on the contrary that the decomposition is not essentially unique; then, as in the proof of Theorem 1.7, some term of another decomposition can be written as
$$ X'_r=\chi_1X_1+\cdots+\chi_RX_R. $$
Let
$$ U_r=\begin{pmatrix} | & | & & |\\ u_{r1} & u_{r2} & \cdots & u_{rJ}\\ | & | & & | \end{pmatrix},\qquad V_r=\begin{pmatrix} | & | & & |\\ v_{r1} & v_{r2} & \cdots & v_{rK}\\ | & | & & | \end{pmatrix} $$
and
$$ X_r=\sigma_{r1}u_{r1}\otimes v_{r1}+\cdots+\sigma_{rL_r}u_{rL_r}\otimes v_{rL_r}, $$
then we can see that the rank of $X'_r=\chi_1X_1+\cdots+\chi_RX_R$ equals $\sum_{t:\chi_t\neq0}L_t$, which cannot equal any single $L_r$ when more than one $\chi_t$ is nonzero, a contradiction.
Proof of Theorem 1.8. It is sufficient to prove the case
Consider
$$\begin{aligned}
X_1&=b_{1,1}\otimes c_{1,1}+\cdots+b_{1,L_1-l_b}\otimes c_{1,L_1-l_b}+b_{0,1}\otimes c_{1,L_1-l_b+1}+\cdots+b_{0,l_b}\otimes c_{0,l_c}\in(B_1\oplus B_0)\otimes(C_1\oplus C_0)\cong\mathbb{K}^{L_1}\otimes\mathbb{K}^{L_1},\\
X_2&=b_{2,1}\otimes c_{2,1}+\cdots+b_{2,L_2-l_b}\otimes c_{2,L_2-l_b}+b_{0,1}\otimes c_{2,L_2-l_b+1}+\cdots+b_{0,l_b}\otimes c_{0,l_c}\in(B_2\oplus B_0)\otimes(C_2\oplus C_0)\cong\mathbb{K}^{L_2}\otimes\mathbb{K}^{L_2},
\end{aligned}$$
where
$$ \{b_{0,1},\dots,b_{0,l_b}\},\ \{b_{1,1},\dots,b_{1,L_1-l_b}\},\ \{b_{2,1},\dots,b_{2,L_2-l_b}\},\ \{c_{0,1},\dots,c_{0,l_c}\},\ \{c_{1,1},\dots,c_{1,L_1-l_c}\},\ \{c_{2,1},\dots,c_{2,L_2-l_c}\}, $$
are bases for $B_0$, $B_1$, $B_2$, $C_0$, $C_1$, $C_2$, respectively. In these bases, the matrix of $\chi_1X_1+\chi_2X_2$,
$$\begin{pmatrix}
\chi_1 & & & & & & & & \\
 & \ddots & & & & & & & \\
 & & \chi_1 & & & & & & \\
 & & & \chi_1+\chi_2 & & & & & \\
 & & & & \ddots & & & & \\
 & & & & & \chi_1+\chi_2 & & & \\
 & & & & & & \chi_2 & & \\
 & & & & & & & \ddots & \\
 & & & & & & & & \chi_2
\end{pmatrix},$$
has rank equal to the number of nonzero diagonal entries. A direct count then shows that
$$ \operatorname{rank}(\chi_1X_1+\chi_2X_2)\neq L_1\ \text{or}\ L_2 $$
if and only if
$$ J,K\neq\text{one of }\Big\{\tfrac{2L_1+L_2}{2},\ \tfrac{2L_2+L_1}{2},\ L_1,\ L_2\Big\}. $$
Then Theorem 1.8 follows from Theorem 1.7.
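To see one of the excluded dimensions concretely, here is a generic-position numerical experiment (an illustration, not a proof; the sizes are assumptions): take $L_1=L_2=2$ and $J=K=3$. For two random rank-$2$ matrices, $\det(X_1+tX_2)$ is a cubic in $t$ with $\det(X_1)=\det(X_2)=0$, so it has a nontrivial root; at that root the pencil has rank $2=L_1$, exhibiting a rank-$L_1$ matrix in $\operatorname{span}\{X_1,X_2\}$ of the kind that the condition of Theorem 1.7 rules out.

```python
import numpy as np

rng = np.random.default_rng(4)
L = 2
X1 = rng.standard_normal((3, L)) @ rng.standard_normal((L, 3))   # rank 2
X2 = rng.standard_normal((3, L)) @ rng.standard_normal((L, 3))   # rank 2

# det(X1 + t X2) = t * (c2 * t + c1) here; recover c1, c2 by interpolation.
ts = np.array([1.0, 2.0, 3.0, 4.0])
dets = np.array([np.linalg.det(X1 + t * X2) for t in ts])
c3, c2, c1, c0 = np.polyfit(ts, dets, 3)   # c3 and c0 are numerically ~0
t_star = -c1 / c2                          # the nontrivial root of the pencil

print(np.linalg.matrix_rank(X1 + t_star * X2))   # 2 = L1
```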
Example 2. For $R=2$ and $L_1=L_2=2$, consider
$$ \mathcal{X}=a_1\otimes(b_1\otimes c_1+b_2\otimes c_2)+a_2\otimes(b_2\otimes c_2+b_3\otimes c_3)=a_1\otimes(b_1\otimes c_1-b_3\otimes c_3)+(a_1+a_2)\otimes(b_2\otimes c_2+b_3\otimes c_3), $$
where the $b_i$ and the $c_i$ are linearly independent. Both expressions are decompositions as in Definition 1.4, so the decomposition is not essentially unique.
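A quick numerical confirmation of Example 2 (with the vectors taken to be standard basis vectors, one admissible choice): the two expressions assemble to the same tensor, and $X_1-X_2$ has rank $2=L_1$, which is exactly the kind of rank-$L_1$ matrix in $\operatorname{span}\{X_1,X_2\}$ that violates the sufficient condition of Theorem 1.7.

```python
import numpy as np

def outer3(a, b, c):
    """Segre outer product a ⊗ b ⊗ c."""
    return np.einsum('i,j,k->ijk', a, b, c)

a1, a2 = np.eye(2)          # frame vectors in K^2
b1, b2, b3 = np.eye(3)      # b's in K^3
c1, c2, c3 = np.eye(3)      # c's in K^3

# First and second decompositions of Example 2.
X_first = (outer3(a1, b1, c1) + outer3(a1, b2, c2)
           + outer3(a2, b2, c2) + outer3(a2, b3, c3))
X_second = (outer3(a1, b1, c1) - outer3(a1, b3, c3)
            + outer3(a1 + a2, b2, c2) + outer3(a1 + a2, b3, c3))
print(np.allclose(X_first, X_second))        # True: the same tensor

X1 = np.outer(b1, c1) + np.outer(b2, c2)
X2 = np.outer(b2, c2) + np.outer(b3, c3)
print(np.linalg.matrix_rank(X1 - X2))        # 2 = L1
```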
Example 3. For $R=2$ and $L_1=L_2=2$, consider
$$ \mathcal{X}=a_1\otimes(b_1\otimes c_1+b_2\otimes c_2)+a_2\otimes(b_3\otimes c_1+b_4\otimes c_2)=a_1\otimes\big((b_1+b_3)\otimes c_1+(b_2+b_4)\otimes c_2\big)+(a_2-a_1)\otimes(b_3\otimes c_1+b_4\otimes c_2), $$
where the $b_i$ and the $c_i$ are linearly independent; again the decomposition is not essentially unique.
Example 4. There are explicit Weierstrass canonical forms (see Chapter 10 in [16]) for tensors in $\mathbb{K}^{2\times J\times K}$, for instance
$$ a_1\otimes(b_1\otimes c_1+\cdots+b_L\otimes c_L)+a_2\otimes(\lambda_1 b_1\otimes c_1+\cdots+\lambda_L b_L\otimes c_L), $$
but it is obviously not unique.
Proof of Theorem 1.9. It is sufficient to prove the case
Without loss of generality, for $1\le p\le s$ write
$$ E_{j_p}=b_{j_p,1}\otimes c_{j_p,1}+b_{j_p,2}\otimes c_{j_p,2}+\cdots+b_{j_p,L_{j_p}}\otimes c_{j_p,L_{j_p}}\in B_{j_p}\otimes C_{j_p}, $$
where $\{b_{j_p,i}\}$ and $\{c_{j_p,i}\}$ are bases of the subspaces $B_{j_p}\subset\mathbb{K}^{J}$ and $C_{j_p}\subset\mathbb{K}^{K}$, respectively. Let
$$ E'_{j_t}=b'_1\otimes c'_1+\cdots+b'_{L_{j_t}}\otimes c'_{L_{j_t}} $$
be a general point of $\Sigma_{\le L_{j_t}}(\mathbb{K}^{J\times K})\cap\operatorname{span}_{\mathbb{K}}\{E_{j_1},\dots,E_{j_s}\}$, so that
$$ E'_{j_t}=\sum_{1\le p\le s}\chi_p E_{j_p}=\sum_{1\le p\le s}\chi_p\big(b_{j_p,1}\otimes c_{j_p,1}+b_{j_p,2}\otimes c_{j_p,2}+\cdots+b_{j_p,L_{j_p}}\otimes c_{j_p,L_{j_p}}\big). $$
If there exist two indices $\mu\neq\nu$ with $\chi_\mu\neq0$ and $\chi_\nu\neq0$, then the matrix
$$\begin{pmatrix}
\chi_\mu & & & & & & & & \\
 & \ddots & & & & & & & \\
 & & \chi_\mu & & & & & & \\
 & & & \chi_\mu+\chi_\nu & & & & & \\
 & & & & \ddots & & & & \\
 & & & & & \chi_\mu+\chi_\nu & & & \\
 & & & & & & \chi_\nu & & \\
 & & & & & & & \ddots & \\
 & & & & & & & & \chi_\nu
\end{pmatrix}$$
has rank at least
The following Remark can be easily obtained using elementary combinatorics.
Remark 3. When
$$ I\ge R,\qquad J,K\ge\sum_{r=1}^{R}L_r,\qquad L_i+L_j>L_k\ \ \forall\,1\le i,j,k\le R, $$
a low multilinear rank tensor decomposition of a generic tensor $\mathcal{X}\in\mathbb{K}^{I\times J\times K}$ takes the form
$$ \mathcal{X}=\sum_{r=1}^{R}a_r\otimes\Bigg[\sum_{r'=1+\sum_{u=1}^{r-1}L_u}^{\sum_{u=1}^{r}L_u} b_{r'}\otimes c_{r'}\Bigg]. $$
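A small sketch ($R$, the $L_r$, and the dimensions below are arbitrary choices consistent with the conditions above) that builds a decomposition with disjoint blocks of $b$'s and $c$'s exactly as in the displayed index ranges, and checks the multilinear rank of the result:

```python
import numpy as np

rng = np.random.default_rng(5)
Ls = [2, 3, 2]                      # L_1, L_2, L_3; satisfies L_i + L_j > L_k
R, I = len(Ls), 4                   # I >= R
JK = sum(Ls)                        # take J = K = L_1 + ... + L_R

a = rng.standard_normal((R, I))
b = rng.standard_normal((JK, JK))   # rows are b_1, ..., b_{sum L}; generically independent
c = rng.standard_normal((JK, JK))   # rows are c_1, ..., c_{sum L}

X = np.zeros((I, JK, JK))
offset = 0
for r in range(R):
    # r' runs over the block offset+1, ..., offset+L_r (0-based slice below).
    block = sum(np.outer(b[rp], c[rp]) for rp in range(offset, offset + Ls[r]))
    X += np.einsum('i,jk->ijk', a[r], block)
    offset += Ls[r]

ranks = tuple(np.linalg.matrix_rank(np.moveaxis(X, n, 0).reshape(X.shape[n], -1))
              for n in range(3))
print(ranks)   # expected: (3, 7, 7) = (R, sum L_r, sum L_r)
```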
Proof of Theorem 1.10.
$$ \big[X_1\ \cdots\ X_R\big]\odot\begin{bmatrix}e_1\\ \vdots\\ e_R\end{bmatrix}=\big[X'_1\ \cdots\ X'_R\big]\odot\begin{bmatrix}e'_1\\ \vdots\\ e'_R\end{bmatrix}=\big[X'_1\ \cdots\ X'_R\big]\odot Q\begin{bmatrix}e_1\\ \vdots\\ e_R\end{bmatrix}. $$
Since
$$ X'_r=\varepsilon_{r1}X_1+\cdots+\varepsilon_{rR}X_R,\qquad 1\le r\le R. $$
However
$$ \operatorname{rank}X'_r=\operatorname{rank}(\varepsilon_{r1}X_1+\cdots+\varepsilon_{rR}X_R)\neq L_1,\dots,L_R, $$
which is a contradiction; therefore the decomposition is essentially unique. Conversely, suppose there is a non-identity special orthogonal matrix $[\varepsilon_{ij}]$ such that each of the matrices
$$ X'_i=\varepsilon_{i1}X_1+\cdots+\varepsilon_{iR}X_R,\qquad 1\le i\le R, $$
has rank equal to one of $L_1,\dots,L_R$. We then have a second decomposition $\mathcal{X}=\sum_{i=1}^{R}e'_i\otimes X'_i$ in an orthogonal frame, so the decomposition is not essentially unique.
Remark 4. Since the rotation matrix in the plane is
$$ \begin{pmatrix}\cos\theta & -\sin\theta\\ \sin\theta & \cos\theta\end{pmatrix}, $$
a tensor decomposition of $\mathcal{X}\in\mathbb{K}^{I\times J\times K}$ with $R=2$ as in Definition 1.6 is essentially unique if and only if
$$ \operatorname{rank}(\cos\theta\,X_1+\sin\theta\,X_2)\neq L_1\ \text{or}\ L_2, $$
and the same for the combination $-\sin\theta\,X_1+\cos\theta\,X_2$ given by the other row.
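For a concrete check, reusing the matrices of Example 2 in coordinates, take $X_1=\operatorname{diag}(1,1,0)$ and $X_2=\operatorname{diag}(0,1,1)$ with $L_1=L_2=2$. At $\theta=3\pi/4$ the combination $\cos\theta\,X_1+\sin\theta\,X_2=\tfrac{\sqrt2}{2}\operatorname{diag}(-1,0,1)$ has rank $2=L_1$, so the rank test above fails for this pair:

```python
import numpy as np

X1 = np.diag([1.0, 1.0, 0.0])   # rank L1 = 2
X2 = np.diag([0.0, 1.0, 1.0])   # rank L2 = 2

theta = 3 * np.pi / 4
M = np.cos(theta) * X1 + np.sin(theta) * X2   # = (sqrt(2)/2) * diag(-1, 0, 1)
print(np.linalg.matrix_rank(M))               # 2, i.e. equal to L1
```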
In contrast with most current approaches to the analysis of big data sets, this paper establishes some uniqueness characteristics of low multilinear rank tensor decompositions of the form $\mathcal{X}=\sum_{r=1}^{R}a_r\otimes X_r$.
The first author, Ming Yang, is grateful to Mingqing Xiao for his insight and for improving the clarity of the proofs. The first author is also grateful to Qiang Cheng's machine learning lab, where this paper was mainly written.
[1] L. Chiantini and C. Ciliberto, Weakly defective varieties, Transactions of the American Mathematical Society, 354 (2002), 151-178.
[2] L. Chiantini and G. Ottaviani, On generic identifiability of 3-tensors of small rank, SIAM Journal on Matrix Analysis and Applications, 33 (2012), 1018-1037.
[3] L. Chiantini, G. Ottaviani and N. Vannieuwenhoven, An algorithm for generic and low-rank specific identifiability of complex tensors, SIAM Journal on Matrix Analysis and Applications, 35 (2014), 1265-1287.
[4] A. Cichocki, D. Mandic, L. De Lathauwer, G. Zhou, Q. Zhao, C. Caiafa and H. Phan, Tensor decompositions for signal processing applications: From two-way to multiway component analysis, IEEE Signal Processing Magazine, 32 (2015), 145-163.
[5] P. Comon, Tensor decompositions: State of the art and applications, in J. G. McWhirter and I. K. Proudler (eds.), Mathematics in Signal Processing V, Clarendon Press, Oxford, 71 (2002), 1-24.
[6] L. De Lathauwer, Decompositions of a higher-order tensor in block terms. Part I: Lemmas for partitioned matrices, SIAM Journal on Matrix Analysis and Applications, 30 (2008), 1022-1032.
[7] L. De Lathauwer, Decompositions of a higher-order tensor in block terms. Part II: Definitions and uniqueness, SIAM Journal on Matrix Analysis and Applications, 30 (2008), 1033-1066.
[8] L. De Lathauwer and D. Nion, Decompositions of a higher-order tensor in block terms. Part III: Alternating least squares algorithms, SIAM Journal on Matrix Analysis and Applications, 30 (2008), 1067-1083.
[9] I. Domanov and L. De Lathauwer, Generic uniqueness of a structured matrix factorization and applications in blind source separation, IEEE Journal of Selected Topics in Signal Processing, 10 (2016), 701-711.
[10] I. Domanov and L. De Lathauwer, Generic uniqueness conditions for the canonical polyadic decomposition and INDSCAL, SIAM Journal on Matrix Analysis and Applications, 36 (2015), 1567-1589.
[11] J. Draisma and J. Kuttler, Bounded-rank tensors are defined in bounded degree, Duke Mathematical Journal, 163 (2014), 35-63.
[12] J. D. Hauenstein and A. J. Sommese, Witness sets of projections, Applied Mathematics and Computation, 217 (2010), 3349-3354.
[13] J. D. Hauenstein and A. J. Sommese, Membership tests for images of algebraic sets by linear projections, Applied Mathematics and Computation, 219 (2013), 6809-6818.
[14] R. Ke, W. Li and M. Xiao, Characterization of extreme points of multi-stochastic tensors, Computational Methods in Applied Mathematics, 16 (2016), 459-474.
[15] T. G. Kolda and B. W. Bader, Tensor decompositions and applications, SIAM Review, 51 (2009), 455-500.
[16] J. M. Landsberg, Tensors: Geometry and Applications, AMS, Providence, Rhode Island, USA, 2012.
[17] M. W. Mahoney, L.-H. Lim and G. E. Carlsson, Algorithmic and statistical challenges in modern large-scale data analysis are the focus of MMDS 2008, ACM SIGKDD Explorations Newsletter, 10 (2008), 57-60.
[18] F. Malgouyres and J. Landsberg, Stable recovery of the factors from a deep matrix product and application to convolutional network, preprint, arXiv:1703.08044.
[19] Y. Matsushima (E. Kobayashi, translator), Differentiable Manifolds, Marcel Dekker, Inc., 1972.
[20] E. E. Papalexakis, C. Faloutsos and N. D. Sidiropoulos, Tensors for data mining and data fusion: Models, applications, and scalable algorithms, ACM Transactions on Intelligent Systems and Technology (TIST), 8 (2017), 1-44.