1.22.1
What's Changed
- Added support for QuantizableMultiHeadAttention for PyTorch nn.transformer layers by @quic-kyuykim
- Support functional conv2d in model preparer by @quic-kyuykim
- Enable qat with multi gpu by @quic-mangal
- Optimize forward pass logic of PyTorch QAT 2.0 by @quic-geunlee
- Fix functional depthwise conv support on model preparer by @quic-kyuykim
- Fix bug in model validator to correctly identify functional ops in leaf module by @quic-klhsieh
- Support dynamic functional conv2d in model preparer by @quic-kyuykim
- Added updated default runtime config, also a per-channel one. Fixed n… by @quic-akhobare
- Include residing module info in model validator by @quic-klhsieh
- Support for Keras MultiHeadAttention Layer by @quic-ashvkuma
Documentation
- User guide: https://quic.github.io/aimet-pages/releases/1.22.1/user_guide/index.html
- API documentation: https://quic.github.io/aimet-pages/releases/1.22.1/api_docs/index.html
- Documentation main page: https://quic.github.io/aimet-pages/index.html