Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[MPS] Fix MPS clamp issue with different dtypes between input and min/max tensors #105747

Closed
wants to merge 5 commits into from

Conversation

DenisVieriu97
Copy link
Collaborator

  • Fix the FP16 clamp issue (FP32 and FP16 are not broadcast compatible)
  • Fix clamp (cached graph nodes were previously replaced with the cast version)

@DenisVieriu97 DenisVieriu97 added the ciflow/mps Run MPS tests (subset of trunk) label Jul 21, 2023
@pytorch-bot
Copy link

pytorch-bot bot commented Jul 21, 2023

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/105747

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit 1eae4c3:

NEW FAILURE - The following job has failed:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@pytorch-bot pytorch-bot bot added the release notes: mps Release notes category label Jul 21, 2023
@DenisVieriu97 DenisVieriu97 changed the title Fix MPS clamp issue with FP16 max/min and FP32 input [MPS] Fix MPS clamp issue with FP16 max/min and FP32 input Jul 21, 2023
@DenisVieriu97 DenisVieriu97 changed the title [MPS] Fix MPS clamp issue with FP16 max/min and FP32 input [MPS] Fix MPS clamp issue with different dtypes between input and min/max tensors Jul 21, 2023
@janeyx99 janeyx99 added the triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module label Jul 24, 2023
@DenisVieriu97
Copy link
Collaborator Author

@pytorchbot rebase

@pytorchmergebot
Copy link
Collaborator

@pytorchbot started a rebase job onto refs/remotes/origin/viable/strict. Check the current status here

kulinseth and others added 5 commits August 1, 2023 23:53
* Fix the FP16 clamp issue.

* Fix clamp (cached graph nodes were previously replaced with the cast version)

---------

Co-authored-by: Denis Vieriu <dvieriu@apple.com>
@pytorchmergebot
Copy link
Collaborator

Successfully rebased clamp_fix onto refs/remotes/origin/viable/strict, please pull locally before adding more changes (for example, via git checkout clamp_fix && git pull --rebase)

@DenisVieriu97
Copy link
Collaborator Author

@pytorchbot merge -f "MPS tests & lint are green"

@pytorchmergebot
Copy link
Collaborator

Merge started

Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Please use -f as last resort and instead consider -i/--ignore-current to continue the merge ignoring current failures. This will allow currently pending tests to finish and report signal before the merge.

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging
Check the merge workflow status
here

@malfet
Copy link
Contributor

malfet commented Aug 16, 2023

@kulinseth , @DenisVieriu97 just curious, have you check if it matches cpu/CUDA behavior? I.e. I assume it should do something like that only if AMP is enable, isn't it?

@kulinseth
Copy link
Collaborator

@kulinseth , @DenisVieriu97 just curious, have you check if it matches cpu/CUDA behavior? I.e. I assume it should do something like that only if AMP is enable, isn't it?

We checked with cpu @malfet

@kulinseth
Copy link
Collaborator

@kulinseth , @DenisVieriu97 just curious, have you check if it matches cpu/CUDA behavior? I.e. I assume it should do something like that only if AMP is enable, isn't it?

We checked with cpu @malfet. Also it’s affecting correctness of few language models

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
ciflow/mps Run MPS tests (subset of trunk) Merged open source release notes: mps Release notes category triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants