This repository has been archived by the owner on Jul 17, 2024. It is now read-only.

v3.1.0

@kernhanda kernhanda released this 10 Jun 21:45

3.1.0

  • Move to Visual Studio 2019
  • Fix a codegen error that produced functionally incorrect results
  • Fix regressions in audio training tutorial (#232)
  • Add importing of Sum nodes to ONNX importer
  • Fix crash in LLVMContext::SetName
  • Improved performance of CNN models on Pi3 with new implementations of spatial, pointwise and regular convolutions
  • Improved performance of reorder node
  • New nodes: ReorderDataCodeNode, SpatialConvolutionNode, MatrixMatrixMultiplyCodeNode
  • Implement parallelization strategies for matrix multiplication nodes
  • Only enable the new MatrixMatrixMultiplyCodeNode path for select ARM targets like Pi, and not for Intel/AMD CPUs
  • Add the --skip_ellcode flag to the compile and wrap.py tools to use OpenBLAS for linear algebra computations
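
As a rough illustration of what a parallelization strategy for matrix multiplication can look like, the sketch below splits the rows of the left-hand matrix into contiguous chunks and computes each chunk on its own worker. This is not the code used by MatrixMatrixMultiplyCodeNode: the function names `matmul_rows` and `parallel_matmul` and the `num_workers` parameter are hypothetical, and row partitioning is only one of several possible strategies.

```python
from concurrent.futures import ThreadPoolExecutor


def matmul_rows(a, b, row_start, row_end):
    """Compute rows [row_start, row_end) of the product a @ b (hypothetical helper)."""
    inner, cols = len(b), len(b[0])
    return [
        [sum(a[i][k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for i in range(row_start, row_end)
    ]


def parallel_matmul(a, b, num_workers=4):
    """Row-partitioned matrix multiply: each worker handles one chunk of rows."""
    rows = len(a)
    # Ceiling division so every row is assigned to exactly one chunk.
    chunk = max(1, -(-rows // num_workers))
    ranges = [(start, min(start + chunk, rows)) for start in range(0, rows, chunk)]
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        # map() preserves the order of the input ranges, so the output rows
        # come back in their original order.
        blocks = list(pool.map(lambda r: matmul_rows(a, b, *r), ranges))
    return [row for block in blocks for row in block]
```

Each chunk writes to a disjoint set of output rows, so the workers need no synchronization. Pure-Python threads will not show a real speedup here; the sketch is only meant to show how the work is divided.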

Check out the latest model benchmarks here!