Use BLAS acceleration in .dot() when possible #92

bluss · 2016-02-28T16:58:08Z

Use BLAS to compute the dot product when possible (for f32 and f64).

Even though our manual scalar product implementation is pretty good, the BLAS implementation still manages to beat it and finish the 1024 element dot product twice as fast. So this is an improvement even in the simplest kinds of cases. (But we still use the generic dot for vectors shorter than 32 elements, since that is faster).

Introduce an internal module imp_prelude (implementation's prelude) that simplifies importing the main types that we use everywhere.
Add type Priv that can be used to attach private methods that can still be used everywhere in the crate.
Deprecate ArrayBase::zeros (stray extra change).

Simplify based on the new restriction (Copy).

For some measure of long, seems like the smallest vectors benefit from using the plain generic dot product (32 elements or smaller).

Use BLAS acceleration in .dot() when possible

bluss added 7 commits February 28, 2016 17:49

Add .dot() benchmark

33104e6

BLAS acceleration for .dot()

45b2d07

Deprecate free function zeros in favour of ArrayBase::zeros

0e61653

Simplify unrolled_dot

c9f2937

Simplify based on the new restriction (Copy).

Use BLAS in dot only for "long" vectors.

74b2bca

For some measure of long, seems like the smallest vectors benefit from using the plain generic dot product (32 elements or smaller).

Put rblas impl in a private method

aa1e458

rm duplicate extern crate

86b1673

bluss added a commit that referenced this pull request Feb 28, 2016

Merge pull request #92 from bluss/specialize-dot

9bae4eb

Use BLAS acceleration in .dot() when possible

bluss merged commit 9bae4eb into master Feb 28, 2016

bluss deleted the specialize-dot branch February 28, 2016 20:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use BLAS acceleration in .dot() when possible #92

Use BLAS acceleration in .dot() when possible #92

bluss commented Feb 28, 2016

Uh oh!

Uh oh!

Use BLAS acceleration in .dot() when possible #92

Use BLAS acceleration in .dot() when possible #92

Conversation

bluss commented Feb 28, 2016

Uh oh!

Uh oh!