Skip to content

Release for ROCM 1.6.4

Pre-release
Pre-release
Compare
Choose a tag to compare
@amcamd amcamd released this 17 Oct 15:04
· 4223 commits to master since this release

NOTE: API breaking changes introduced in this release related to: rocblas_iXamax, rocblas_iXamin, complex functions, and half functions.

Changelist:

  • correct API: rocblas_samax -> rocblas_isamax, rocblas_damax -> rocblas_idamax
  • remove from the API functions for complex and half that have not been implemented
  • update to Tensile v3.2.0. This uses sgemm assembly kernels for gfx803 and gfx900
  • add rocblas_sgeam and rocblas_dgeam functions
  • improve repeatability of rocblas_Xgemm performance tests
  • update perf script