Skip to content

Release v3.2.0

Latest
Compare
Choose a tag to compare
@ScottTodd ScottTodd released this 10 Feb 19:39
· 23 commits to main since this release
v3.2.0
f3bef2d

Notable changes

Framework support

  • The TOSA dialect, commonly used when importing from TensorFlow Lite (LiteRT), is in the process of incrementing to v1.0: https://discourse.llvm.org/t/rfc-tosa-dialect-increment-to-v1-0/83708. During this transition, we anticipate that files imported from .tflite to .mlirbc may not be compatible with the IREE compiler. See #19683 and #19777 for more details.
  • Support for importing and compiling TensorFlow models is known to be unstable. We expect this to improve after migrating to new APIs. Follow #19917 for updates.

Compiler

  • #19714: The legacy_sync compilation mode has been removed since all in-tree compiler targets have been migrated off of it. The remaining asynchronous mode produces non-blocking operations that enable more multi-device parallelism.
  • #19720: The tuner now supports specializing for GPUs that share the same ISA but have different hardware capabilities.
  • Compilation time improvements, particularly for large, sharded models: #19791, #19794
  • #19881: The llvm-cpu target now supports parsing AArch64 cpu features.

Runtime

Development tools

Changelog

Full list of changes: v3.1.0...v3.2.0