Download The Role of Tensor Rank in the Complexity Analysis of Bilinear Forms

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
Preliminaries
Properties of tensor rank
Open problems
The role of tensor rank in the complexity analysis
of bilinear forms
Dario A. Bini
Dipartimento di Matematica, Università di Pisa
www.dm.unipi.it/ebini
ICIAM07, Zürich, 16-20 July 2007
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
1
Preliminaries
Tensors
Tensor rank
Bilinear forms
2
Properties of tensor rank
Border rank
Lower bounds
3
Open problems
The direct sum conjecture
Some tensors of unknown rank
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
Tensors
Let F be a number field, say, R, C; tensors of the kind
A = (ai1 ,i2 ,...,ih ) ∈ Fn1 ×n2 ×···×nh ,
that is, h-way arrays, are encountered in many problems of very
different nature [Comon 2001], [Comon, Golub, Lim, Mourrain,
2006], [De Silva, Lim 2006]
Blind source separation
High order factor analysis
Independent component analysis
Candecomp/Parafac model
Complexity analysis
Psycometric, Chemometric, Economy,...
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
Tensors
It is surprising that little interplay occurred among these
different research areas
Some properties have been rediscovered in different contexts
Apparently, results obtained in one field have not migrated to
the other fields
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
Tensors
It is surprising that little interplay occurred among these
different research areas
Some properties have been rediscovered in different contexts
Apparently, results obtained in one field have not migrated to
the other fields
In this talk I wish to provide an overview of the main results
concerning tensors obtained in the research field of computational
complexity (starting from 1969) with the aim of
creating a synergic exchange of information between these
research areas
presenting problems which might be solved with the more
recent tools
presenting old results that might be adapted and extended to
the new needs
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
Tensor rank
Definition (Hitchcock 1927)
A tensor T = (ti1 ,...,ih ) has rank 1 if there exist vectors
(k)
(1) (2)
(h)
u(k) = (ui ) ∈ Fnk , k = 1 : h such that ti1 ,...,ih = ui1 ui2 · · · uih ,
T = u(1) ◦ u(2) ◦ · · · ◦ u(h)
Definition (Hitchcock 1927)
The tensor rank rk(A) of A = (ai1 ,...,ih ) is the minimum number r
of rank-1 tensors Ti ∈ Fn1 ×···×nh such that
A = T1 + T2 + · · · + Tr
Dario A. Bini
canonical decomposition
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
Some remarks
For A ∈ Fm×n×p a canonical decomposition
A = u(1) ◦ v(1) ◦ w(1) + u(2) ◦ v(2) ◦ w(2) + · · · + u(r ) ◦ v(r ) ◦ w(r )
is defined by three matrices
U = (ui,j ) ∈ Fm×r , V = (vi,j ) ∈ Fn×r , W = (wi,j ) ∈ Fp×r ,
whose columns are the vectors u(i) , v(i) , w(i) , i = 1 : r ,
respectively.
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
Some remarks
A tensor A = (ai,j,k ) ∈ Fm×n×p , can be represented by means of
the set of the 3-slabs Ak = (ai,j,k )i,j ∈ Fm×n of A or by a single
matrix of variables
p
X
A=
sk Ak
k=1
A=
1 0
0 −1
0 1
;
1 0
Dario A. Bini
↔
s1
s2
The role of tensor rank
s2
−s1
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
Some remarks
A tensor A = (ai,j,k ) ∈ Fm×n×p , can be represented by means of
the set of the 3-slabs Ak = (ai,j,k )i,j ∈ Fm×n of A or by a single
matrix of variables
p
X
A=
sk Ak
k=1
A=
1 0
0 −1
0 1
;
1 0
↔
s1
s2
s2
−s1
This suggests a different point of view for tensor rank:
rk(A) is the minimum set of rank one matrices which span the linear
space generated by the 3-slabs A1 , . . . , Ap [Gastinel 71, Fiduccia 72].
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
Bilinear forms
Problem: Given matrices Ak = (ai,j,k )i=1:m,j=1:n , k = 1 : p
compute the set of bilinear forms
fk (x, y) = xT Ak y, k = 1 : p
with the minimum number of nonscalar multiplications with no use
of commutativity (noncommutative bilinear complexity)
A nonscalar multiplication is a multiplication of the kind
m
n
X
X
s=(
αi xi )(
βj yj ), αi , βj ∈ F
i=1
j=1
Remark: The set of bilinear forms is uniquely determined by the
tensor A = (ai,j,k ).
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
Bilinear forms
A canonical decomposition of the tensor A associated with the set
of bilinear forms provides an algorithm of complexity r . In fact
A=
r
X
u
(`)
◦v
(`)
◦w
(`)
→
Ak =
r
X
wk,` u(`) ◦ v(`)
`=1
`=1
→
xT Ak y =
r
X
wk,` (xT u(`) )(v(`)T y)
`=1
Theorem (Strassen 1975)
The noncommutative bilinear complexity of a set of bilinear forms
fk (x, y) = xT Ak y, k = 1 : p, Ak = (ai,j,k ), is given by the tensor
rank of the associated tensor A = (ai,j,k ).
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
An example
Multiplication of complex numbers
(x1 + ix2 )(y1 + iy2 ) = (x1 y1 − x2 y2 ) + i(x1 y2 + x2 y1 ) = f1 + if2
Apparently, 4 multiplications are needed
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
An example
Multiplication of complex numbers
(x1 + ix2 )(y1 + iy2 ) = (x1 y1 − x2 y2 ) + i(x1 y2 + x2 y1 ) = f1 + if2
Apparently, 4 multiplications are needed
1 0
0 1
Tensor:
A=
;
0 −1
1 0
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Tensors
Tensor rank
Bilinear forms
An example
Multiplication of complex numbers
(x1 + ix2 )(y1 + iy2 ) = (x1 y1 − x2 y2 ) + i(x1 y2 + x2 y1 ) = f1 + if2
Apparently, 4 multiplications are needed
1 0
0 1
Tensor:
A=
;
0 −1
1 0
The tensor rank
1
0
0
1
is at most 3:
0
=
−1
1
1 1
=
−
0
1 1
1
0
1
0
0
−
0
0
−
0
0
0
0
0
0
1
0
1
Algorithm:
s1 = (x1 + x2 )(y1 + y2 ),
s 2 = x1 y1 ,
s 3 = x2 y2
Dario A. Bini
f1 = s2 − s3 ,
f2 = s1 − s2 − s3
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Tensor rank
Main problem: For a given tensor, compute its rank and a canonical
decomposition.
Two ways of attacking the problem
looking for lower bounds to the tensor rank
looking for upper bounds to the tensor rank
Things get more complicate: unlike the case of m × n matrices
The rank depends on the ground field F
High rank tensors can be approximated by low rank tensors
Rational algorithms for tensor rank, like Gaussian elimination, cannot exist
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Topological properties of tensors: Border Rank
For m × n matrices, any sequence {Ak } of m × n matrices of rank
r with limit A = limk Ak is such that rank(A) ≤ r .
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Topological properties of tensors: Border Rank
For m × n matrices, any sequence {Ak } of m × n matrices of rank
r with limit A = limk Ak is such that rank(A) ≤ r .
For higher order tensor this property does not hold anymore
Example (Bini, Capovani, Lotti, Romani 1980)
The tensor
1 0
0 1
0 1
;
0 0
1 0
1
0 1
;
0 0
A=
has rank 3. The tensor
A =
has rank 2 for any 6= 0
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Canonical decomposition
1 0
1
0 1
0 0
=
1 −1
1
=
Dario A. Bini
−
−1
0 1
0 0
0 1
0 0
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Border rank
Definition (Bini, Capovani, Lotti, Romani 1980)
The border rank of A ∈ Fm×n×p (F = R, C) is
brk(A) = min{r : ∀ > 0 ∃E ∈ Fm×n×p : ||E|| < , rk(A+E) = r }
where || · || is any norm
Some properties:
brk(A) ≤ rk(A)
brk(A) is the minimum number of nonscalar multiplications
sufficient to approximate the set of bilinear forms associated
with A with arbitrarily small nonzero error
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
More on rank and border rank
Let A = A + E be such that
rk(A ) = brk(A)
the entries of E are polynomials of degree d
Then
rk(A) ≤ (d + 1)brk(A)
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
More on rank and border rank
Let A = A + E be such that
rk(A ) = brk(A)
the entries of E are polynomials of degree d
Then
rk(A) ≤ (d + 1)brk(A)
Proof:
Write d + 1 copies of the canonical decomposition of length
rk(A ) obtained with (d + 1) pairwise different values of Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
More on rank and border rank
Let A = A + E be such that
rk(A ) = brk(A)
the entries of E are polynomials of degree d
Then
rk(A) ≤ (d + 1)brk(A)
Proof:
Write d + 1 copies of the canonical decomposition of length
rk(A ) obtained with (d + 1) pairwise different values of Take linear combinations of these decompositions with
coefficients γj , j = 1 : P
d + 1 in order to kill the terms in i ,
i = 1 : d and to have
γj = 1
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
More on rank and border rank
Let A = A + E be such that
rk(A ) = brk(A)
the entries of E are polynomials of degree d
Then
rk(A) ≤ (d + 1)brk(A)
Proof:
Write d + 1 copies of the canonical decomposition of length
rk(A ) obtained with (d + 1) pairwise different values of Take linear combinations of these decompositions with
coefficients γj , j = 1 : P
d + 1 in order to kill the terms in i ,
i = 1 : d and to have
γj = 1
Obtain a decomposition of length (d + 1)brk(A) with no error
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Lower bounds and linear algebra
Simple criteria for providing lower bounds on tensor rank and
border rank can be given
Trivial bounds
brk(A) ≥ dim(span(A1 , . . . , Ap ))
Similar inequalities are valid w.r.t. the other coordinates
Assume for simplicity p = dim(span(A1 , . . . , Ap ))
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Lower bounds and linear algebra
Let U, V , W be the matrices defining a canonical factorization of
A of length rk(A)
A=
rk
(A)
X
u(`) ◦ v(`) ◦ w(`)
`=1
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Lower bounds and linear algebra
Let U, V , W be the matrices defining a canonical factorization of
A of length rk(A)
A=
rk
(A)
X
u(`) ◦ v(`) ◦ w(`)
`=1
Assume w.l.o.g. that the first p columns of W are linearly
independent, that is
W = W1 W2 , W1 ∈ Fp×p , det W1 6= 0
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Lower bounds and linear algebra
Let U, V , W be the matrices defining a canonical factorization of
A of length rk(A)
A=
rk
(A)
X
u(`) ◦ v(`) ◦ w(`)
`=1
Assume w.l.o.g. that the first p columns of W are linearly
independent, that is
W = W1 W2 , W1 ∈ Fp×p , det W1 6= 0
P
(−1)
Let A0k = pj=1 wk,j Aj , define
A0 = [A01 , A02 , . . . , A0p ] = A •3 W1−1 ,
i.e., choose a different basis to represent the space spanned by the
slabs A1 , . . . , Ap
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Evidently, A0 has the same rank of A
decomposition is given by U 0 = U, V 0

1

0
W 0 = I W1−1 W2 = 
 0
0
and a canonical
= V , W 0 = W1−1 W , where

0 0 0 ∗ ∗ ∗
1 0 0 ∗ ∗ ∗ 

0 1 0 ∗ ∗ ∗ 
0 0 1 ∗ ∗ ∗
Remark
Any subtensor Ab formed by k 3-slabs [A0σ1 , . . . , A0σk ] is such that
b ≤ rk(A) − (p − k)
rk(A)
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
More generally,
Theorem
rk(A) ≥ max min(p − k + rk(A •3 TS))
T
T : k × p submatrix of Ip ,
S
S: p × p nonsingular
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
More generally,
Theorem
rk(A) ≥ max min(p − k + rk(A •3 TS))
T
T : k × p submatrix of Ip ,
S
S: p × p nonsingular
Corollary
If for any basis of span(A1 , . . . , Ap ) there exists a matrix of rank q
then
rk(A) ≥ p + q − 1
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Higher order generalization:
Corollary
Let A = [A1 , . . . , Anh ] ∈ Fn1 ×···×nh , Ak ∈ Fn1 ×···×nh−1 be such
that nh = dim(span(A1 , . . . , Anh )). If for any basis of
span(A1 , . . . , Anh ) there exists a tensor in the basis of rank q then
rk(A) ≥ nh + q − 1
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Higher order generalization:
Corollary
Let A = [A1 , . . . , Anh ] ∈ Fn1 ×···×nh , Ak ∈ Fn1 ×···×nh−1 be such
that nh = dim(span(A1 , . . . , Anh )). If for any basis of
span(A1 , . . . , Anh ) there exists a tensor in the basis of rank q then
rk(A) ≥ nh + q − 1
Example: For
A=
1 0
0 1
0 1
;
0 0
any basis must contain a nonsingular matrix, therefore
rk(A) ≥ 2 + 2 − 1 = 3
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Lower bounds to border rank
Unfortunately the same technique cannot be applied to border rank
W1 may be singular in the limit as → 0 so that E •3 W1−1 may not be
infinitesimal anymore
Remedy: compute the QR factorization of W and
which has Euclidean norm 1 for any . One has

∗ ∗ ∗ ∗ ∗

0 ∗ ∗ ∗ ∗
W 0 = QT W = 
 0 0 ∗ ∗ ∗
0 0 0 ∗ ∗
multiply W by Q T
∗
∗
∗
∗
Remark
There exists an m × n × k subtensor Ab such that
b ≤ brk(A) − (p − k)
brk(A)
Dario A. Bini
The role of tensor rank

∗
∗ 

∗ 
∗
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Theorem
brk(A) ≥ min min(p − k + brk(A •3 TS))
T
T : k × p submatrix of Ip ,
S
S: p × p nonsingular
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Theorem
brk(A) ≥ min min(p − k + brk(A •3 TS))
T
T : k × p submatrix of Ip ,
S
S: p × p nonsingular
Corollary
If any basis of the linear space spanned by A1 , . . . , Ap is made up
by matrices of rank at most q then brk(A) ≥ p + q − 1
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Theorem
brk(A) ≥ min min(p − k + brk(A •3 TS))
T
T : k × p submatrix of Ip ,
S
S: p × p nonsingular
Corollary
If any basis of the linear space spanned by A1 , . . . , Ap is made up
by matrices of rank at most q then brk(A) ≥ p + q − 1
Corollary (Generalization to higher dimension)
Let A = [A1 , . . . , Anh ]. If any basis of the linear space spanned by
A1 , . . . , Anh is made up by tensors of rank at most q then
brk(A) ≥ nh + q − 1
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Example
2
2
2
The tensor A ∈ Fn ×n ×n whose 3-slabs span the linear space




s1,1 . . . s1,n
S

 .

.. 
..
..
I ⊗S =
 , S =  ..
.
.
. 
S
sn,1 . . . sn,n
is associated with n × n matrix multiplication.
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Example
2
2
2
The tensor A ∈ Fn ×n ×n whose 3-slabs span the linear space




s1,1 . . . s1,n
S

 .

.. 
..
..
I ⊗S =
 , S =  ..
.
.
. 
S
sn,1 . . . sn,n
is associated with n × n matrix multiplication.
All the matrices in any basis of the linear space have rank at least
n. Therefore
brk(A) ≥ n2 + n − 1
The bound can be improved to
brk(A) ≥ n2 + 2n − 2
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Example
The tensor A ∈ F4×4×3 whose 3-slabs

s1 s2 0
 s3 s4 0
0 0 s1
span the linear space

0
0 
s2
is associated with the multiplication of a 2 × 2 triangular matrix
and a full 2 × 2 matrix
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
Border rank
Lower bounds
Example
The tensor A ∈ F4×4×3 whose 3-slabs

s1 s2 0
 s3 s4 0
0 0 s1
span the linear space

0
0 
s2
is associated with the multiplication of a 2 × 2 triangular matrix
and a full 2 × 2 matrix
For any basis of the linear space there exist two matrices which
form a tensor of rank at least 4.
rk(A) ≥ 4 − 2 + 4 = 6
Since there exists a canonical approximate decomposition of length
5 of A then brk(A) ≤ 5 < 6 = rk(A)
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
The direct sum conjecture
Some tensors of unknown rank
The direct sum conjecture
0
0
0
Let A ∈ Fm×n×p , B ∈ Fm ×n ×p . Consider the direct sum of A
and B
0
0
0
C = A ⊕ B ∈ F(m+m )×(n+n )×(p+p )
Direct sum conjecture, Strassen 1973
rk(C) = rk(A) + rk(B)
The complexity of two disjoint sets of bilinear forms is the sum of
the complexities of each set.
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
The direct sum conjecture
Some tensors of unknown rank
The direct sum conjecture
0
0
0
Let A ∈ Fm×n×p , B ∈ Fm ×n ×p . Consider the direct sum of A
and B
0
0
0
C = A ⊕ B ∈ F(m+m )×(n+n )×(p+p )
Direct sum conjecture, Strassen 1973
rk(C) = rk(A) + rk(B)
The complexity of two disjoint sets of bilinear forms is the sum of
the complexities of each set.
Fact
There are cases where brk(C) < brk(A) + brk(B)
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
The direct sum conjecture
Some tensors of unknown rank
Example (A. Schoenhage 81)
brk(A) = 10 < 9 + 4





A=





s11 s12 s13
s21 s22 s23
s21 s22 s23









t
t
t
t
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems




















1
1
1
The direct sum conjecture
Some tensors of unknown rank
. . .
. 1 1 . 1 1
2
2
2
2
1
.
.
−
1
.
.











− −
−
−


− − 






−
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
The direct sum conjecture
Some tensors of unknown rank
Matrix multiplication and open problems
2 × 2 matrix product:

A = I2 ⊗
s11 s12
s21 s22

s
s
11
12
 s21 s22


=

s11 s12 
s21 s22
rk(A) = brk(A) = 7 [Strassen 69, Landsberg 05]


s11 s12 s13
3 × 3 matrix product A = I3 ⊗  s21 s22 s23 
s31 s32 s33
19 ≤ rk(A) ≤ 23 [Laderman 76], [Bläser 03]
13 ≤ brk(A) ≤ 22 [Schönhage 81]
Dario A. Bini
The role of tensor rank
Preliminaries
Properties of tensor rank
Open problems
The direct sum conjecture
Some tensors of unknown rank
Matrix multiplication and open problems

s11
 s21
4 × 4 matrix product A = I4 ⊗ 
 s31
s41
s12
s22
s32
s42
s13
s23
s33
s43
34 ≤ rk(A) ≤ 49
22 ≤ brk(A) ≤ 49
Dario A. Bini
The role of tensor rank

s14
s24 

s34 
s44
Related documents