[QNN EP] QNN SDK 2.28.2 #22844

adrianlizarraga · 2024-11-14T21:54:40Z

Description

Updates pipelines to use QNN SDK 2.28.2.241116.
Re-enable LayerNormalization unit tests that failed with accuracy errors with the previous QNN SDK (2.28.0).
Update QNN EP to no longer provide a dummy bias for LayerNorm if the QNN SDK version is >= 2.28.0.

Motivation and Context

Use the latest QNN SDK. This version improves inference latency for certain customer models.

…ckend

…rm implicit bias bug has been fixed.

onnxruntime/core/providers/qnn/builder/opbuilder/layer_norm_op_builder.cc

…peline to build in Release

tools/ci_build/github/azure-pipelines/qnn-ep-nuget-packaging-pipeline.yml

onnxruntime/core/providers/qnn/builder/qnn_backend_manager.cc

### Description - Updates pipelines to use QNN SDK 2.28.2.241116. - Re-enable LayerNormalization unit tests that failed with accuracy errors with the previous QNN SDK (2.28.0). - Update QNN EP to no longer provide a dummy bias for LayerNorm if the QNN SDK version is >= 2.28.0. ### Motivation and Context Use the latest QNN SDK. This version improves inference latency for certain customer models.

### Description  All three PRs are cherry-picked in this round: 1. [Refactor SkipLayerNorm and handle beta properly (#22862) ](#22862) 2. [[TensorRT EP] Exclude DDS ops from running on TRT (#22875)](#22875) 3. [[QNN EP] QNN SDK 2.28.2 (#22844) ](#22844) ### Motivation and Context  --------- Signed-off-by: Liqun Fu <liqfu@microsoft.com> Signed-off-by: Liqun Fu <liqun.fu@microsoft.com> Co-authored-by: Chi Lo <54722500+chilo-ms@users.noreply.github.com> Co-authored-by: liqun Fu <liqfu@microsoft.com> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Adrian Lizarraga <adlizarraga@microsoft.com>

### Description - Updates pipelines to use QNN SDK 2.28.2.241116. - Re-enable LayerNormalization unit tests that failed with accuracy errors with the previous QNN SDK (2.28.0). - Update QNN EP to no longer provide a dummy bias for LayerNorm if the QNN SDK version is >= 2.28.0. ### Motivation and Context Use the latest QNN SDK. This version improves inference latency for certain customer models.

adrianlizarraga added 5 commits November 14, 2024 13:53

Handle dummy ret val for call to profileGetEvents() with QNN Saver ba…

f965b1a

…ckend

Try to re-enable unit tests disabled for QNN 2.27 and 2.28.0. LayerNo…

4fa576d

…rm implicit bias bug has been fixed.

Update pipeline qnn sdk versions

f88a4ce

Merge branch 'main' into adrianl/qnn-sdk-2.28.2

f790028

Add unit test comments

6d112d4

adrianlizarraga commented Nov 15, 2024

View reviewed changes

onnxruntime/core/providers/qnn/builder/opbuilder/layer_norm_op_builder.cc Show resolved Hide resolved

sophies927 added release:1.20.1 triage:approved Approved for cherrypicks for release labels Nov 18, 2024

adrianlizarraga added 2 commits November 18, 2024 15:17

Merge branch 'main' into adrianl/qnn-sdk-2.28.2

6911db0

Update pipelines to use official QNN SDK 2.28.2; default QNN Nuget pi…

93555b3

…peline to build in Release

adrianlizarraga marked this pull request as ready for review November 19, 2024 00:18

adrianlizarraga requested a review from a team as a code owner November 19, 2024 00:18

adrianlizarraga commented Nov 19, 2024

View reviewed changes

tools/ci_build/github/azure-pipelines/qnn-ep-nuget-packaging-pipeline.yml Outdated Show resolved Hide resolved

adrianlizarraga changed the title ~~[QNN EP] [DRAFT] QNN SDK 2.28.2~~ [QNN EP] QNN SDK 2.28.2 Nov 19, 2024

Go back to RelWithDebInfo for QNN Nuget package

aa6c460

adrianlizarraga requested review from HectorSVC and jywu-msft November 19, 2024 00:45

adrianlizarraga commented Nov 19, 2024

View reviewed changes

onnxruntime/core/providers/qnn/builder/qnn_backend_manager.cc Show resolved Hide resolved

HectorSVC approved these changes Nov 19, 2024

View reviewed changes

jywu-msft approved these changes Nov 19, 2024

View reviewed changes

yf711 merged commit 497b06f into main Nov 19, 2024
93 checks passed

yf711 deleted the adrianl/qnn-sdk-2.28.2 branch November 19, 2024 04:10

yf711 mentioned this pull request Nov 19, 2024

[ORT 1.20.1 Release] Cherry pick 2nd round #22845

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QNN EP] QNN SDK 2.28.2 #22844

[QNN EP] QNN SDK 2.28.2 #22844

adrianlizarraga commented Nov 14, 2024 •

edited

Loading

[QNN EP] QNN SDK 2.28.2 #22844

[QNN EP] QNN SDK 2.28.2 #22844

Conversation

adrianlizarraga commented Nov 14, 2024 • edited Loading

Description

Motivation and Context

adrianlizarraga commented Nov 14, 2024 •

edited

Loading