Pe. Buis et Wr. Dyksen, EFFICIENT VECTOR AND PARALLEL MANIPULATION OF TENSOR-PRODUCTS, ACM transactions on mathematical software, 22(1), 1996, pp. 18-23
We present efficient vector and parallel methods for manipulating tens
or products of matrices. We consider both computing the matrix-vector
product (A(1) x ... x A(K))x and solving the system of linear equation
s (A(1) x ... x A(K))x = b. The methods described are independent of K
. We accompany this article with a companion algorithm which describes
an implementation of a complete set of tensor product routines based
on LAPACK and the Level 2 and 3 Basic Linear Algebra Subprograms (BLAS
) which provide vectorization and parallelization.