Skip to content

Releases: ROCm/rocSPARSE

rocSPARSE 3.2.0 for ROCm 6.2.2

27 Sep 16:01
b293299
Compare
Choose a tag to compare

rocSPARSE code for ROCm 6.2.2 did not change. The library was rebuilt for the updated ROCm 6.2.2 stack.

rocSPARSE 3.2.0 for ROCm 6.2.1

20 Sep 19:58
b293299
Compare
Choose a tag to compare

rocSPARSE code for ROCm 6.2.1 did not change. The library was rebuilt for the updated ROCm 6.2.1 stack.

rocSPARSE 3.2.0 for ROCm 6.2.0

02 Aug 16:15
b293299
Compare
Choose a tag to compare

Additions

  • New Merge-Path algorithm to SpMM, supporting CSR format
  • SpSM now supports row order
  • rocsparseio I/O functionality has been added to the library
  • rocsparse_set_identity_permutation has been added

Changes

  • Adjusted rocSPARSE dependencies to related HIP packages
  • Binary size has been reduced
  • A namespace has been wrapped around internal rocSPARSE functions and kernels
  • rocsparse_csr_set_pointers, rocsparse_csc_set_pointers, and rocsparse_bsr_set_pointers do now allow the column indices and values arrays to be nullptr if nnz is 0
  • gfx803 target has been removed from address sanitizer builds

Optimizations

  • Improved user manual
  • Improved contribution guidelines
  • SpMV adaptive and LRB algorithms have been further optimized on CSR format
  • Improved performance of SpMV adaptive with symmetrically stored matrices on CSR format

Fixes

  • Compilation errors with BUILD_ROCSPARSE_ILP64=ON have been resolved

rocSPARSE 3.1.2 for ROCm 6.1.2

04 Jun 16:53
edb2770
Compare
Choose a tag to compare

rocSPARSE code for ROCm 6.1.2 did not change. The library was rebuilt for the updated ROCm 6.1.2 stack.

rocSPARSE 3.1.2 for ROCm 6.1.1

08 May 18:00
edb2770
Compare
Choose a tag to compare

rocSPARSE code for ROCm 6.1.1 did not change. The library was rebuilt for the updated ROCm 6.1.1 stack.

rocSPARSE 3.1.2 for ROCm 6.1.0

16 Apr 19:11
edb2770
Compare
Choose a tag to compare

Additions

  • New LRB algorithm to SpMV, supporting CSR format
  • rocBLAS as now an optional dependency for SDDMM algorithms
  • Additional verbose output for csrgemm and bsrgemm

Optimizations

  • Triangular solve with multiple rhs (SpSM, csrsm, ...) now calls SpSV, csrsv, etcetera when nrhs equals 1
  • Improved user manual section Installation and Building for Linux and Windows
  • Improved SpMV in CSR format on MI300

rocSPARSE 3.0.2 for ROCm 6.0.2

31 Jan 20:13
1c5d839
Compare
Choose a tag to compare

rocSPARSE code for ROCm 6.0.2 did not change. The library was rebuilt for the updated ROCm 6.0.2 stack.

rocSPARSE 3.0.2 for ROCm 6.0.0

15 Dec 18:31
456a62e
Compare
Choose a tag to compare

Added

  • Added rocsparse_inverse_permutation
  • Added mixed precisions for SpVV
  • Added uniform int8 precision for Gather and Scatter

Optimized

  • Optimization to doti routine
  • Optimization to spin-looping algorithms

Changed

  • Changed rocsparse_spmv function arguments
  • Changed rocsparse_xbsrmv routines function arguments
  • doti, dotci, spvv, and csr2ell now require calling hipStreamSynchronize after when using host pointer mode
  • Improved documentation
  • Improved verbose output during argument checking on API function calls

Deprecated

  • Deprecated rocsparse_spmv_ex
  • Deprecated rocsparse_xbsrmv_ex routines

Removed

  • Removed auto stages from spmv, spmm, spgemm, spsv, spsm, and spitsv.
  • Removed rocsparse_spmm_ex routine

Fixed

  • Fixed a bug in rocsparse-bench, where SpMV algorithm was not taken into account in CSR format
  • Fixed the BSR/GEBSR routines bsrmv, bsrsv, bsrmm, bsrgeam, gebsrmv, gebsrmm so that block_dim==0 is considered an invalid size
  • Fixed bug where passing nnz = 0 to doti or dotci did not always return a dot product of 0

rocSPARSE 2.5.4 for ROCm 5.7.1

13 Oct 18:57
aadb6e3
Compare
Choose a tag to compare

rocSPARSE code for ROCm 5.7.1 did not change. The library was rebuilt for the updated ROCm 5.7.1 stack.

rocSPARSE 2.5.4 for ROCm 5.7.0

15 Sep 17:29
aadb6e3
Compare
Choose a tag to compare

Added

  • Added more mixed precisions for SpMV, (matrix: float, vectors: double, calculation: double) and (matrix: rocsparse_float_complex, vectors: rocsparse_double_complex, calculation: rocsparse_double_complex)
  • Added support for gfx940, gfx941 and gfx942

Improved

  • Fixed a bug in csrsm and bsrsm

Known Issues

In csritlu0, the algorithm rocsparse_itilu0_alg_sync_split_fusion has some accuracy issues to investigate with XNACK enabled. The fallback is rocsparse_itilu0_alg_sync_split.