musa: workaround for Guilty Lockup in cleaning src0 in #10032 #10042

Merged (1 commit) on Oct 28, 2024

yeahdongcn
Contributor

We’re encountering an MTGPU Guilty Lockup issue during the model warm-up stage after merging #10032. This PR reverts this change for MUSA only.

I've raised an internal issue and will remove this workaround once it has been resolved.

@yeahdongcn yeahdongcn marked this pull request as ready for review October 25, 2024 08:16
Signed-off-by: Xiaodong Ye <xiaodong.ye@mthreads.com>
@yeahdongcn
Contributor Author

Hi @JohannesGaessler

Could you please review this PR? I know the code looks ugly, but it works for now.

Collaborator

@JohannesGaessler JohannesGaessler left a comment

I assume you're aware that this results in broken K cache quantization.

@yeahdongcn
Contributor Author

> I assume you're aware that this results in broken K cache quantization.

Yes. After reviewing all contexts, this approach appears to be the only viable solution to avoid a crash.

Thanks for approving this!

@JohannesGaessler JohannesGaessler merged commit 524afee into ggerganov:master Oct 28, 2024
53 checks passed
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024