Releases: sony/nnabla-ext-cuda
Releases · sony/nnabla-ext-cuda
Version 1.0.2 Release
Version 1.0.1 Release
Version 1.0.0 Release
Version 1.0.0rc2 Release
Version 1.0.0pre1 Release
- Feature/20180613 rename ext cuda wheel
- All-reduce during backward
- Misc build 20180618
- Feature/20180220 cpp training mnist training
- Feature/20180428 pad function
- fix document
- add ceil and floor func to cuda ext
- Fix/20180525 fix build
- Trigonometric Functions
- Big change around build system.
- Change NULL to nullptr
- Correspond to termination of python3 support of
futures
library. - Communicator with Half Precision
- [generator] Function templates uses write-only flag in cast
- Function: ClipByNom
- AMSGRAD solver
- Add NOTICE file.
- ClipByValue and ClipGradByValue functions
Version 0.9.9 Release
Version 0.9.8 Release
- Add ND Convolution and Deconvolution in CUDNN
- Move build doc to GitHub markdown
- Add implementation of depthwise deconvolution.
- Configurable CUDNN workspace memory size for Convolutions
- Format.
- Enable binary weight functions to run on CUDA.
- New collectives and MPI-like group concept
- Feature/20171225 round function
- Add all_reduce interface
- auto-format
Version 0.9.7
Version 0.9.6
Version 0.9.5
- Add the Swish activation function
- Fix driver shutting down issue on CUDA
- Add the SELU activation function
- Feature/20171011 out of place allreduce
- LeakyReLU CUDA
- Feature/20170908 quantize
- Fix INQ on Windows
- Feature/20171010 fix build and get coverage
- Add CUDA implementation of
INQAffine
andINQConvolution
- Remove warning from function generator
- [c++][function] Support broadcasting scalar
- Feature/20170831 separate cpp build
- remove obsolete copyright notice
- Function: matrix_diag and matrix_diag_part
- Function: Batch sample-wise matrix multiplication
- Improve backprop options of batch_normalization with CUDNN