Skip to content

Releases: JuliaGPU/CUDA.jl

v3.9.1

08 May 22:08
b7e60f5
Compare
Choose a tag to compare

CUDA v3.9.1

Diff since v3.9.0

Closed issues:

  • Issue with copy_cublasfloat (#1476)
  • Errors when broadcasting random number generators (#1480)
  • CPU version of linear algebra routine is dispatched when using Zygote.gradient (#1481)
  • scan! fails on vectors of structs (#1482)
  • InexactError when getting CUDA version info (#1489)

Merged pull requests:

  • Allow more integer argument types for byte_perm (#1420) (@eschnett)
  • support CuSparseMatrix(::Diagonal) (#1470) (@Roger-luo)
  • Don't emit debug info until the next CUDA version. (#1473) (@maleadt)
  • Update manifest (#1474) (@github-actions[bot])
  • Update manifest (#1479) (@github-actions[bot])
  • fix unsafe_wrap docstring and widen signature (#1483) (@piever)
  • Update manifest (#1484) (@github-actions[bot])
  • Check whether cudaRuntimeGetVersion succeeded. (#1490) (@maleadt)
  • Update manifest (#1494) (@github-actions[bot])
  • Fix #1476: Allow any container in copy_cublasfloat (#1498) (@danielwe)

v3.9.0

09 Apr 09:58
5c40438
Compare
Choose a tag to compare

CUDA v3.9.0

Diff since v3.8.5

Closed issues:

  • Tests for showing (#35)
  • Support LU factorizations (#1193)
  • Int8 WMMA not working in 3.8.4 and 3.8.5 despite merged PR. Add more unit tests? (#1442)
  • Optional CPU cpu kernel call with @cuda (#1443)
  • Add library/artifact management for NCCL (#1446)
  • permutedims returns a lowertriangular matrix (#1451)
  • New broadcast corrupts memory? (#1457)
  • norm does not dispatch on CuSparseMatrixCSC (#1460)
  • scalar * sparse multiplication (#1468)

Merged pull requests:

v3.8.5

14 Mar 20:11
Compare
Choose a tag to compare

CUDA v3.8.5

Diff since v3.8.4

Merged pull requests:

  • Update manifest (#1440) (@github-actions[bot])

v3.8.4

11 Mar 16:50
1526aad
Compare
Choose a tag to compare

CUDA v3.8.4

Diff since v3.8.3

Closed issues:

  • sparse-sparse and sparse-constant multiplication lose sparsity (output dense matrix) (#1264)
  • LLVMExtra fails to load on Julia 1.8 and PPC (#1387)
  • compute-sanitizer CUDA_ERROR_INVALID_VALUE on CUDA.jl 3.0+ (#1415)
  • @cudnnDescriptor is not threadsafe (#1421)
  • Precomplication of CUDA 3.8.3 broken on 1.7.1 due to changes in Random123.jl (#1422)
  • OOM error should include memory status (#1427)
  • WMMA kernel works with Julia 1.7.2 but fails with illegal memory access for Julia 1.8.0-beta1 (#1431)
  • Non Int64 local memory size leads to dynamic function invocation (#1434)
  • "initialization" test failing (#1435)
  • cuda with julia 1.8 not working on windows (working fine(?) on wsl2) (#1436)

Merged pull requests:

v3.8.3

25 Feb 17:59
2319b89
Compare
Choose a tag to compare

CUDA v3.8.3

Diff since v3.8.2

Closed issues:

  • Sparse matrix addition not working (#528)
  • Native implementation of sparse arrays (#829)
  • CUSPARSE: Adding a value to the diagonal (#1372)
  • Conversion by cu casts Float64 to Float32 but not Int64 to Int32 (#1388)
  • CUDA.math_mode!(...; precision) option not working (#1392)
  • cuIpcGetMemHandle failure resulting in CUDA-aware MPI to fail (#1398)
  • axpby! support for BFloat16 (#1399)
  • CUSPARSE does not support integer matrices, breaks printing (#1402)
  • sparse(I, J, V) doesn't support unsorted inputs (#1407)

Merged pull requests:

v3.8.2

18 Feb 17:16
46db50d
Compare
Choose a tag to compare

CUDA v3.8.2

Diff since v3.8.1

Closed issues:

  • CuSparseMatrixCSC missing lu and interactions with UniformScaling (#79)
  • CUSPARSE typo (#1231)
  • similar(A::CuSparse,eltype) returns an Array (#1316)
  • "errormonitor" undefined in julia1.6 (#1375)
  • Pool free can switch tasks (#1384)

Merged pull requests:

v3.8.1

15 Feb 17:25
Compare
Choose a tag to compare

CUDA v3.8.1

Diff since v3.8.0

Closed issues:

  • one(::CuMatrix) result on cpu (#142)
  • Broadcasted setindex! triggers scalar setindex! (#101)
  • OutOfGPUMemoryError With Available Memory (#1346)
  • Distributions.jl with CuArrays (#1347)
  • Views of Flux OneHotArrays (#1349)
  • synchronize(blocking = false) hangs in julia 1.7 eventually (#1350)
  • unsupported call through a literal pointer (call to log1pf) on Julia 1.6.5 (#1352)
  • SpecialFunctions ^1.8 compat entry? (#1354)
  • Performance deprecation using ^ on Float32 (#1358)
  • Method definition setindex!(LinearAlgebra.Diagonal{T, V} ... overwritten in module CUDA (#1364)
  • [PackageCompiler] Segmentation fault with CUDA.jl in multiversioning (#1365)
  • Vectors in customary structs make julia stuck (#1366)
  • sparseCSC-dense matrix multiplication yields unstable results (#1368)
  • UndefVarError: parameters not defined on Windows10 (#1371)

Merged pull requests:

v3.8.0

28 Jan 16:32
e1507d3
Compare
Choose a tag to compare

CUDA v3.8.0

Diff since v3.7.1

Closed issues:

  • Consider reserving memory (#1320)

Merged pull requests:

v3.7.1

27 Jan 11:00
fb01adb
Compare
Choose a tag to compare

CUDA v3.7.1

Diff since v3.7.0

Closed issues:

  • Moving data between devices (#1136)
  • Repeated has_cuda_gpu errors when CUDA_VISIBLE_DEVICES is empty (#1331)
  • Error when env var CUDA_VISIBLE_DEVICES is set but empty (#1336)

Merged pull requests:

v3.7.0

21 Jan 16:55
92f0dce
Compare
Choose a tag to compare

CUDA v3.7.0

Diff since v3.6.4

Closed issues:

  • mul! is missing for plan_fft! (#1311)
  • Segfault with CUDA in a sysimage (#1314)
  • CuSparse does not support broadcast (#1317)
  • CUDA.functional(true) errors instead of printing "why" and returning false (#1318)
  • Interesting timings (#1323)
  • Syncronization how to? (#1324)

Merged pull requests: