
Releases: quic/aimet

version 1.27.0

28 Jul 19:14

What's New

Keras

  • Updated support for TFOpLambda layers in Batch Norm Folding to handle extra call args/kwargs.

PyTorch

  • Added support for PyTorch version 1.13.0. Only ONNX opset 14 is supported for export (see the export sketch after this list).
  • [experimental] Added debugging APIs for dumping intermediate tensor data. This data can be used with current QNN/SNPE tools for debugging accuracy problems. Known issue: the Layer Output Generation API returns incorrect tensor data for the layer just before a ReLU when used on the original FP32 model.
  • [experimental] Added support for embedding AIMET encodings within the graph using ONNX quantize/dequantize operators. This option is currently supported only for 8-bit per-tensor quantization.
  • Fixed a bug in AIMET QuantSim for PyTorch models to handle non-contiguous tensors.
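
The opset-14 requirement above shows up at export time. A minimal sketch of exporting a QuantSim model to ONNX at opset 14 (the toy model, calibration callback, and the exact OnnxExportApiArgs/export keyword arguments are assumptions; check the aimet_torch API docs for your release):

```python
# Hedged sketch: QuantSim export with ONNX opset 14 (keyword names for
# OnnxExportApiArgs and QuantizationSimModel.export are assumptions).
import torch
from aimet_common.defs import QuantScheme
from aimet_torch.quantsim import QuantizationSimModel
from aimet_torch.onnx_utils import OnnxExportApiArgs

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU()).eval()
dummy_input = torch.randn(1, 3, 32, 32)

sim = QuantizationSimModel(model, dummy_input=dummy_input,
                           quant_scheme=QuantScheme.post_training_tf_enhanced,
                           default_param_bw=8, default_output_bw=8)

# Calibrate on representative data (here just the dummy input).
sim.compute_encodings(lambda m, _: m(dummy_input), None)

# PyTorch 1.13 support requires ONNX opset 14 at export time.
sim.export(path='/tmp', filename_prefix='model_int8', dummy_input=dummy_input,
           onnx_export_args=OnnxExportApiArgs(opset_version=14))
```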

ONNX

  • AIMET support for ONNX 1.11.0 has been added. However, op support in QNN/SNPE is currently limited; if the model fails to load, continue to use opset 11 for export (a minimal fallback sketch follows this list).
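
The opset-11 fallback only changes the export call; a minimal sketch using plain torch.onnx.export (the toy model and output path are placeholders):

```python
# Fallback: export the model at ONNX opset 11 so it loads in QNN/SNPE
# toolchains with limited opset-14 support. Toy model shown.
import torch

model = torch.nn.Sequential(torch.nn.Linear(16, 8), torch.nn.ReLU()).eval()
dummy_input = torch.randn(1, 16)

torch.onnx.export(model, dummy_input, 'model_opset11.onnx', opset_version=11)
```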

TensorFlow

  • [experimental] Debugging APIs have been added for dumping intermediate tensor outputs. This data can be used with current QNN/SNPE tools for debugging accuracy problems.

Documentation

version 1.26.1

12 Jul 00:30

What's New

TensorFlow

  • Upgraded AIMET to support TensorFlow version 2.10.1 (AIMET remains compatible with TensorFlow 2.4).
  • Several bug fixes

Common

  • Upgraded to Ubuntu 20 base image for all variants.

Documentation

version 1.26.0

12 May 22:34
57ed1b5

What's New

Keras

  • Added a feature called BN Re-estimation that can improve model accuracy after QAT for INT4 quantization.
  • Updated the AutoQuant feature to automatically choose the optimal calibration scheme and create an HTML report detailing which optimizations were applied.
  • Updated Model Preparer to replace separable convolutions with depthwise and pointwise conv layers.
  • Fixed the BN fold implementation to account for a subsequent multi-input layer.
  • Fixed a bug where min/max encoding values were not aligned with scale/offset during QAT (the sketch after this list shows the relationship being enforced).
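
For reference, the alignment the fix above enforces between min/max encodings and scale/offset, shown as a small sketch of generic asymmetric-quantization arithmetic (illustrative only, not AIMET's internal implementation):

```python
# Generic asymmetric quantization: scale/offset are derived from the calibrated
# min/max, and the aligned min/max are re-derived from scale/offset so that both
# representations describe exactly the same quantization grid.
def min_max_to_scale_offset(enc_min: float, enc_max: float, bitwidth: int = 8):
    num_steps = 2 ** bitwidth - 1
    scale = (enc_max - enc_min) / num_steps
    offset = round(enc_min / scale)              # integer zero-point shift
    aligned_min = offset * scale
    aligned_max = aligned_min + num_steps * scale
    return scale, offset, aligned_min, aligned_max

print(min_max_to_scale_offset(-1.05, 2.3))
```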

PyTorch

  • Several bug fixes

TensorFlow

  • Added a feature called BN Re-estimation that can improve model accuracy after QAT for INT4 quantization.
  • Updated the AutoQuant feature to automatically choose the optimal calibration scheme and create an HTML report detailing which optimizations were applied.
  • Fixed a bug where min/max encoding values were not aligned with scale/offset during QAT.

Common

  • Documentation updates for taking AIMET models to target.
  • Converted standalone BatchNorm layers' parameters so that each layer behaves as a linear/dense layer (see the sketch below for the equivalent arithmetic).
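
In inference mode a standalone BatchNorm with frozen statistics is a per-channel affine transform, so its parameters can be re-expressed as a plain scale and bias; a minimal PyTorch sketch of that arithmetic (illustrative only, not the AIMET conversion utility itself):

```python
# A frozen BatchNorm computes y = gamma * (x - mean) / sqrt(var + eps) + beta,
# which equals y = w * x + b with w and b as below (illustrative sketch).
import torch

def bn_to_scale_bias(bn: torch.nn.BatchNorm2d):
    std = torch.sqrt(bn.running_var + bn.eps)
    weight = bn.weight / std                             # per-channel scale
    bias = bn.bias - bn.running_mean * bn.weight / std   # per-channel shift
    return weight, bias

bn = torch.nn.BatchNorm2d(4).eval()
x = torch.randn(1, 4, 8, 8)
w, b = bn_to_scale_bias(bn)
assert torch.allclose(bn(x), x * w[None, :, None, None] + b[None, :, None, None], atol=1e-6)
```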

Experimental

  • Added new Architecture Checker feature to identify and report model architecture constructs that are not ideal for quantized runtimes. Users can utilize this information to change their model architectures accordingly.

Documentation

version 1.25.0

09 Mar 23:14

What's New

Keras

  • Added QuantAnalyzer feature
  • Added Batch Normalization folding for Functional Keras models, which allows the default config files to work for supergrouping.
  • Resolved an issue with quantizer placement in Sequential blocks in subclassed models

PyTorch

  • Added AutoQuant V2, which includes advanced features such as out-of-the-box inference, a model preparer, quant scheme search, and an improved summary report (a hedged usage sketch follows this list)
  • Fixes to resolve minor accuracy diffs in the learnedGrid quantizer for per-channel quantization
  • Fixes to improve EfficientNetB4 accuracy with respect to target
  • Fixed a rare case where the quantizer could calculate an incorrect offset when generating QAT 2.0 learned encodings
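
A hedged sketch of driving AutoQuant V2 (the module path aimet_torch.auto_quant_v2, the constructor arguments, the eval-callback signature, and the optimize() return values are all assumptions; the toy model and random data stand in for a real model and calibration set):

```python
# Hedged AutoQuant V2 sketch -- names and signatures are assumptions; consult
# the aimet_torch AutoQuant API docs for your release before relying on them.
import torch
from aimet_torch.auto_quant_v2 import AutoQuant

model = torch.nn.Sequential(torch.nn.Conv2d(3, 8, 3), torch.nn.ReLU(),
                            torch.nn.AdaptiveAvgPool2d(1), torch.nn.Flatten(),
                            torch.nn.Linear(8, 10)).eval()
dummy_input = torch.randn(1, 3, 32, 32)
unlabeled_loader = torch.utils.data.DataLoader(
    [torch.randn(3, 32, 32) for _ in range(32)], batch_size=8)

def eval_callback(model: torch.nn.Module, num_samples=None) -> float:
    # Placeholder: return validation accuracy of `model` here.
    return 0.0

auto_quant = AutoQuant(model, dummy_input=dummy_input,
                       data_loader=unlabeled_loader, eval_callback=eval_callback)
quantized_model, accuracy, encoding_path = auto_quant.optimize(allowed_accuracy_drop=0.01)
```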

TensorFlow

  • Added QuantAnalyzer feature
  • Fixed an accuracy issue due to rare cases where the incorrect BN epsilon was being used
  • Fixed an accuracy issue due to Quantsim export incorrectly recomputing QAT2.0 encodings

Common

  • Updated AIMET python package version format to support latest pip
  • Fixed an issue where not all inputs might be quantized properly

Documentation

version 1.24.0

20 Jan 00:18
eda99b2

What's New

  • Export QuantSim configuration for downstream target quantization

PyTorch

  • Fixes to resolve minor accuracy diffs in the learnedGrid quantizer for per-channel quantization
  • Added support for AMP 2.0 which enables faster automatic mixed precision
  • Added support for QAT for INT4 quantized models, including a feature for performing BN Re-estimation after QAT (a hedged sketch follows this list)
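
A hedged sketch of the BN Re-estimation step after INT4 QAT (the reestimate_bn_stats entry point and its arguments are assumptions based on the aimet_torch.bn_reestimation module; `sim` and `train_loader` are placeholders for an existing QuantizationSimModel and a representative training DataLoader):

```python
# Hedged sketch: re-estimate BatchNorm statistics on the QAT-finetuned QuantSim
# model before export (function name and arguments are assumptions; see the
# aimet_torch.bn_reestimation API docs for your release).
from aimet_torch.bn_reestimation import reestimate_bn_stats

# `sim` is a QuantizationSimModel after INT4 QAT fine-tuning; `train_loader`
# yields representative (unaugmented) training batches.
reestimate_bn_stats(sim.model, train_loader, num_batches=100)
```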

Keras

  • Added support for AMP 2.0 which enables faster automatic mixed precision
  • Support for basic transformer networks
  • Added support for subclassed models. The current subclassing feature includes support for only a single level of subclassing and does not support lambdas.
  • Added QAT per-channel gradient support
  • Minor updates to the quantization configuration
  • Fixed QuantSim bug where layers using dtypes other than float were incorrectly quantized

TensorFlow

  • Added an additional PReLU mapping pattern to ensure proper folding and QuantSim node placement
  • Fixed per-channel encoding representation to align with PyTorch and Keras

Documentation

version 1.23.0

14 Nov 19:00
a422782

What's New

  • TF-enhanced calibration scheme has been accelerated using a custom CUDA kernel and now runs significantly faster.
  • Installation instructions are now combined with the rest of the documentation (User Guide and API docs)

PyTorch

  • Fixed backward pass of the fake-quantize (QcQuantizeWrapper) nodes to handle symmetric mode correctly
  • Per-channel quantization can now be enabled on a per-op-type basis (see the config sketch after this list)
  • Support for recursively excluding modules from a root module in QuantSim
  • Support for excluding layers when running model validator and model preparer
  • Reduced memory usage in AdaRound
  • Fixed bugs in AdaRound for per-channel quantization
  • Made ConnectedGraph more robust when identifying custom layers
  • Added Jupyter notebook-based examples for the following features: AutoQuant
  • Added support for sparse conv layers in QuantSim (experimental)
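
Per-op-type per-channel quantization is controlled through the QuantSim configuration file; a hedged sketch that writes such a config from Python (the section and key names follow my reading of the packaged default_config.json layout and should be verified against your release):

```python
# Hedged sketch of a QuantSim config that enables per-channel quantization only
# for Conv ops (section/key names are assumptions based on the default config
# layout; verify against the config file shipped with your AIMET release).
import json

config = {
    "defaults": {
        "ops": {"is_output_quantized": "True"},
        "params": {"is_quantized": "True"},
        "per_channel_quantization": "False",
    },
    "params": {},
    "op_type": {
        "Conv": {"per_channel_quantization": "True"},
    },
    "supergroups": [],
    "model_input": {},
    "model_output": {},
}

with open("per_channel_conv_config.json", "w") as f:
    json.dump(config, f, indent=4)
# Then pass config_file="per_channel_conv_config.json" when constructing QuantizationSimModel.
```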

Keras

  • Added support for Keras per-channel quantization
  • Changed interface to CLE to accept a pre-compiled model
  • Added Jupyter notebook-based examples for the following features: Transformer quantization

TensorFlow

  • Fix to avoid unnecessary indexing in AdaRound

Documentation

version 1.22.2

15 Sep 19:07

What's New

TensorFlow

  • Added support for supergroups: MatMul + Add (see the config sketch after this list)
  • Added support for TF-Slim BN name with backslash
  • Added support for Depthwise + Conv in CLS
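
A supergroup tells QuantSim to treat a sequence of ops as one fused unit so no activation quantizer is inserted between them; a hedged sketch of the MatMul + Add entry in the config file's supergroups section (key names are assumptions based on the default config layout):

```python
# Hedged sketch: "supergroups" section of a QuantSim config file fusing
# MatMul followed by Add (key names are assumptions; verify against the
# default config shipped with your AIMET release).
matmul_add_supergroup = {
    "supergroups": [
        {"op_list": ["MatMul", "Add"]}
    ]
}
```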

Documentation

version 1.22.1

04 Aug 10:29
e09d587

What's Changed

  • Added support for QuantizableMultiHeadAttention for PyTorch nn.transformer layers by @quic-kyuykim
  • Support functional conv2d in model preparer by @quic-kyuykim (a hedged usage sketch follows this list)
  • Enable qat with multi gpu by @quic-mangal
  • Optimize forward pass logic of PyTorch QAT 2.0 by @quic-geunlee
  • Fix functional depthwise conv support on model preparer by @quic-kyuykim
  • Fix bug in model validator to correctly identify functional ops in leaf module by @quic-klhsieh
  • Support dynamic functional conv2d in model preparer by @quic-kyuykim
  • Added updated default runtime config, also a per-channel one. Fixed n… by @quic-akhobare
  • Include residing module info in model validator by @quic-klhsieh
  • Support for Keras MultiHeadAttention Layer by @quic-ashvkuma
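
A minimal sketch of the functional-conv2d case the model-preparer changes above address: a module that calls torch.nn.functional.conv2d in forward() is rewritten into proper nn modules that QuantSim can wrap (the prepare_model location and signature are assumptions; the toy module is a placeholder):

```python
# Hedged sketch: preparing a model that uses functional conv2d so QuantSim can
# insert quantizers (prepare_model location/signature is an assumption; see the
# aimet_torch.model_preparer docs for your release).
import torch
import torch.nn.functional as F
from aimet_torch.model_preparer import prepare_model

class TinyNet(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.weight = torch.nn.Parameter(torch.randn(8, 3, 3, 3))

    def forward(self, x):
        # Functional call that QuantSim cannot wrap directly.
        return F.relu(F.conv2d(x, self.weight, padding=1))

prepared = prepare_model(TinyNet().eval())
print(prepared)
```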

Documentation

version 1.22.0

04 Jul 22:10

This release has the following changes

  • Support for simulation and QAT for PyTorch transformer models (including support for torch.nn mha and encoder layers)

Documentation

version 1.21.0

03 Jun 04:28
a433425

This release has the following changes

  • New feature: PyTorch QuantAnalyzer - Visualize per-layer sensitivity and per-quantizer PDF histograms
  • New feature: TensorFlow AutoQuant - Automatically apply various AIMET post-training quantization techniques
  • PyTorch QAT with Range Learning: Added support for Per Channel Quantization
  • PyTorch: Enabled exporting of encodings for multi-output leaf module
  • TensorFlow AdaRound
    • Added ability to use configuration file in API to adapt to a specific runtime target
    • Added Per-Channel Quantization support
  • TensorFlow QuantSim: Added support for FP16 inference and QAT
  • TensorFlow Per Channel Quantization
    • Fixed speed and accuracy issues
    • Fixed zero accuracy for 16-bit per-channel quantization
    • Added support for DepthWise Conv2d Op
  • Multiple other bug fixes

User guide: https://quic.github.io/aimet-pages/releases/1.21.0/user_guide/index.html
API documentation: https://quic.github.io/aimet-pages/releases/1.21.0/api_docs/index.html
Documentation main page: https://quic.github.io/aimet-pages/index.html