Skip to content

Commit

Permalink
CHANGELOG.md update for ROCm 4.1 (#1263)
Browse files Browse the repository at this point in the history
  • Loading branch information
zaliu authored Jan 19, 2021
1 parent d175277 commit 47dd2c4
Showing 1 changed file with 17 additions and 2 deletions.
19 changes: 17 additions & 2 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,8 +1,18 @@
# Change Log for Tensile

## [Tensile 4.24.0 for ROCm 4.0.0]
## [Tensile 4.26.0 for ROCm 4.1.0]
### Added
- ROCm 4.0 TargetID support in Tensile; source kernels force xnack=OFF
- Make messagepack python dependency optional
- TensileCreateLibraryFiles: auto create target for build time lib generation
- Tensile cluster tuning tool
- Framework for filtering solutions
- Workflow for manually editing Kernels
- Tuning client design doc
- MatrixInstruction for general int8
- Tensile integration test for TensileCreateLibrary
- Trig float and random narrow init patterns for new client
- Summation dimension mirroring (contributed by timlathy & Slimakanzer)
- ROCm 4.1 TargetID support in Tensile; source kernels force xnack=OFF
- Tensile/Utilities/merge.py revamp for merging logic yaml files
- now merge.py requires python3
- add `-v` verbosity levels (up to 2)
Expand All @@ -13,6 +23,11 @@
- Support for High Precision Accumulate FP16/BF16 In FP32 Out
- CHANGELOG.md

### Optimizations
- Refine PersistentKernel: support PKn1, EPS, optimize LW-vmcnt and sMagicDiv2

### Fixed
- targets to clang-offload-bundler updated to use hipv4 prefix when appropriate
- Fix bugs of tail-loop branch label, and LR addr restore
- locateExe in Tensile/Common.py looks in defaultPath first
- Honor $ENV{ROCM_PATH} to support relocatable ROCm location

0 comments on commit 47dd2c4

Please sign in to comment.