Research paper in R: Here is why GotoBLAS2 may be the fastest C multi threading library over ATLAS, MKL, GPU, FPGA, and others.
I was just referred to this which shows GotoBLAS2 could be the fastest C multi threading library out there.
http://www.tacc.utexas.edu/tacc-projects/gotoblas2
These claims come from a vignette where there is an R package around these:
http://cran.r-project.org/web/packages/gcbd/index.html
I just extracted out of this from this research paper:
http://cran.r-project.org/web/packages/gcbd/vignettes/gcbd.pdf
.
Between the multithreaded
BLAS implementations, Goto is seen to have a slight advantage over MKL
and Atlas. GPU computing is showing promise but requires relatively large matrices to
outperform multi-threaded BLAS.
—
A second key aspect is the di_erence between static and shared linking. In static linking,
object code is taken from the underlying library and copied into the resulting executable.
This has several key implications. First, the executable becomes larger due to the copy of
the binary code. Second, it makes it marginally faster as the library code is present and no
additional look-up and subsequent redirection has to be performed…. Shared library builds, on the other
hand, result in smaller binaries that may run marginally slower|but which can make use of
di_erent libraries without a rebuild.
—
Shared library builds, on the other
hand, result in smaller binaries that may run marginally slower|but which can make use of
di_erent libraries without a rebuild.