v3.1.0
github-actions released this on 28 Apr 16:34
CUDA v3.1.0
Closed issues:
- GPU Implementation of partialsort! (#93)
- Document associativity requirements of scan/reduce operators (#819)
- Problem in reduce_block? (#843)
- CUDNN convolution incorrect for small images (#848)
- Newly-spawned tasks should re-set the device (#851)
- sort!(CUDA.zeros(2^25)) throws invalid configuration argument (code 9, cudaErrorInvalidConfiguration) (#852)
- Type-preserving upload about cu in doc may be wrong (#855)
- Memory corruption / segfault with Threads.@async and planned FFTs (#859)
- Don't call nvmlErrorString (during init?) to prevent crashes on WSL (#860)
- unsafe_copy3d! does not work with stream-ordered allocations (#863)
- CUDA3 seems to have memory leak (#866)
Merged pull requests:
- Implement statistics functions: correlation and covariance (#509) (@berquist)
- @atomic support * and / (#842) (@yuehhua)
- CUDNN docstring revisions. (#844) (@GunnarFarneback)
- Sorting perf (again) (#845) (@xaellison)
- Update manifest (#846) (@github-actions[bot])
- Remove extraneous apostrophe (#847) (@kshyatt)
- reduce_block fixes. (#853) (@maleadt)
- Fix sorting large arrays. (#854) (@maleadt)
- Remove unsupported config launch keyword. (#856) (@maleadt)
- Identify the buffer during unsafe_wrap to support unified free. (#857) (@maleadt)
- Add support for CUDA 11.3. (#858) (@maleadt)
- Work around buggy NVML initialization on WSL (#861) (@maleadt)
- ae/partialsort (#864) (@xaellison)
- Update manifest (#865) (@github-actions[bot])
- Improve multitasking with CUFFT. (#867) (@maleadt)
- Introduce a HandleCache type. (#868) (@maleadt)
- Improve multitasking with CURAND (#869) (@maleadt)
- Document associativity requirement of accumulate (#870) (@HenriDeh)
- Half-Precision Intrinsics (#871) (@iyaja)
- Work around offset calculation bug in cuMemcpy3DAsync. (#872) (@maleadt)
- fix #848: CUDNN convolution incorrect for small images (#873) (@denizyuret)