CLBlast 1.5.3
CLBlast version 1.5.3. Changes since previous release (version 1.5.2):
- Fix a correctness issue with DGEMM on SM 7.5 Turing GPUs
- Update cl.hpp to the new opencl.hpp header in the samples
- Changed the complex sum routine to return the complex sum instead of the absolute complex sum.
- Various minor fixes and enhancements
- Added tuned parameters for various devices (see doc/tuning.md)