Skip to content

CLBlast 1.5.3

Compare
Choose a tag to compare
@CNugteren CNugteren released this 29 Sep 18:46
· 62 commits to master since this release
d55840e

CLBlast version 1.5.3. Changes since previous release (version 1.5.2):

  • Fix a correctness issue with DGEMM on SM 7.5 Turing GPUs
  • Update cl.hpp to the new opencl.hpp header in the samples
  • Changed the complex sum routine to return the complex sum instead of the absolute complex sum.
  • Various minor fixes and enhancements
  • Added tuned parameters for various devices (see doc/tuning.md)