Release for ROCm 1.6.4
Pre-release
NOTE: This release introduces breaking API changes affecting rocblas_iXamax, rocblas_iXamin, complex functions, and half functions.
Changelist:
- correct API names: rocblas_samax -> rocblas_isamax, rocblas_damax -> rocblas_idamax
- remove unimplemented complex and half functions from the API
- update to Tensile v3.2.0, which uses sgemm assembly kernels for gfx803 and gfx900
- add rocblas_sgeam and rocblas_dgeam functions
- improve repeatability of rocblas_Xgemm performance tests
- update perf script
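
For readers unfamiliar with the geam operation added in this release, the following is a minimal CPU sketch of what it is conventionally defined to compute: C = alpha * op(A) + beta * op(B) on column-major matrices, where op(X) is X or its transpose. It assumes rocblas_sgeam follows that conventional definition. The function name `geam_reference` and its parameter list are hypothetical, chosen for illustration; they are not the rocBLAS API, which additionally takes a handle and operation enums.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical CPU reference, not part of rocBLAS.
// Computes C = alpha * op(A) + beta * op(B), where C is m x n,
// all matrices are stored column-major, and op(X) is X or X^T.
void geam_reference(bool trans_a, bool trans_b, int m, int n,
                    float alpha, const std::vector<float>& a, int lda,
                    float beta, const std::vector<float>& b, int ldb,
                    std::vector<float>& c, int ldc)
{
    assert(ldc >= m); // leading dimension must cover a full column of C

    for (int j = 0; j < n; ++j) {
        for (int i = 0; i < m; ++i) {
            // Element (i, j) of op(X) is X(i, j), or X(j, i) when transposed.
            const std::size_t ia = trans_a ? j + static_cast<std::size_t>(i) * lda
                                           : i + static_cast<std::size_t>(j) * lda;
            const std::size_t ib = trans_b ? j + static_cast<std::size_t>(i) * ldb
                                           : i + static_cast<std::size_t>(j) * ldb;
            c[i + static_cast<std::size_t>(j) * ldc] = alpha * a[ia] + beta * b[ib];
        }
    }
}
```

A reference like this can be useful for checking GPU results on small inputs, for example when validating a transpose combined with a scaled add.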