Modified matrix/vector product in CSR format: for each row we
now precompute minimum index. In the loop over the non-zeros
of the row we then start at the precomputed min. This avoids
one of the if blocks and speeds up the floating point arith
quite a bit. May be able to get rid of the other if block
and speed it up further.
|