Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

xe: sdpa: fix several errors related to out of bound accesses in sdpa #2497

Open
wants to merge 8 commits into
base: main
Choose a base branch
from

Conversation

umar456
Copy link
Contributor

@umar456 umar456 commented Jan 24, 2025

Description

This PR addresses several out of bound access errors related to second token shapes(Q = 1 K/V = SEQ_LEN+1).

Additionally this PR fixes issues related to compilation errors when using common zero points masks are used without scales when quantizing.

@umar456 umar456 added bug A confirmed library bug platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel labels Jan 24, 2025
@umar456 umar456 requested review from a team as code owners January 24, 2025 00:43
@github-actions github-actions bot added the component:tests Codeowner: @oneapi-src/onednn-arch label Jan 24, 2025
@umar456 umar456 force-pushed the uarshad/fix_sdpa_config_2nd_token branch from 3952a04 to cd354b5 Compare January 24, 2025 00:45
@umar456 umar456 force-pushed the uarshad/fix_sdpa_config_2nd_token branch from cd354b5 to 9a94ead Compare January 24, 2025 00:49
@umar456
Copy link
Contributor Author

umar456 commented Jan 24, 2025

make test
disable device_cpu
disable benchdnn_all
enable benchdnn_graph

--case=complex_fusion/mha/sdpa-compressed-kv-int4-gs32.json

# llama-2-7b-chat
--in-shapes=0:1x32x128x384*abdc+1:1x32x1x384+2:1x32x1x384+3:1x32x384x128+5:1x1x384x384+6:1x32x384x128+7:1x32x384x1+8:1x32x384x1,\
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: better to reset dt here, to align with other tests, and avoid misunderstanding.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug A confirmed library bug component:tests Codeowner: @oneapi-src/onednn-arch platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants