`array A = randu(70000, 8);`

`array AtA = matmul(A, A, B_transpose);`

I received an error stating that the inner dimensions must match. Since I am transposing one of the matrices, the inner dimensions do match. The routine seems to be checking dimensions before applying the matopts.

This was indeed a bug that has been fixed. Thanks for bringing this to our attention.

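However, note that `B_transpose` transposes the second operand, so the call above computes A times A-transpose (a 70000 x 70000 result for this `A`). To compute the product of A-transpose with A, the correct syntax is: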

`array AtA = matmul(A, A, A_transpose);`

Another (not-so-fast) alternative is to use the transpose function:

`array AtA = matmul(A.T(), A);`
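
To make the dimension reasoning concrete, here is a minimal sketch in plain C++ of a shape check that applies the transpose option before comparing inner dimensions. It is not the library's implementation: `Shape`, `MatOpt`, and `matmul_shape` are hypothetical names invented for illustration, and the function only computes the result shape rather than the product.

```cpp
#include <cstddef>
#include <stdexcept>
#include <utility>

// Hypothetical stand-ins for the library's transpose options.
enum class MatOpt { None, TransposeA, TransposeB };

// Rows and columns of a 2-D matrix.
struct Shape {
    std::size_t rows;
    std::size_t cols;
};

// Return the shape of op(A) * op(B). The transpose option is applied
// first, and only then are the inner dimensions validated.
Shape matmul_shape(Shape a, Shape b, MatOpt opt = MatOpt::None) {
    if (opt == MatOpt::TransposeA) std::swap(a.rows, a.cols);
    if (opt == MatOpt::TransposeB) std::swap(b.rows, b.cols);
    if (a.cols != b.rows) {
        throw std::invalid_argument("inner dimensions must match");
    }
    return Shape{a.rows, b.cols};
}
```

For a 70000 x 8 matrix, `matmul_shape(A, A, MatOpt::TransposeA)` gives an 8 x 8 shape, `MatOpt::TransposeB` gives 70000 x 70000, and calling it with no option throws, because 8 does not equal 70000. Checking before the swap would reject both transposed calls, which matches the error reported above.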