Mistral testing #888
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/888
Note: Links to docs will display an error until the docs builds have been completed.
✅ No Failures as of commit 190cc8a with merge base bec7bab.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
… model. Added comparison scripts and verified correctness.
I've updated scripts for the rest of the mistral components. I still need to write the comparison that involves mapping state dicts between the two implementations, update the unit test, and (potentially) add LoRA comparisons.
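(For anyone following along: "mapping state dicts" here just means renaming parameter keys so that the reference implementation's weights load into the torchtune model. A minimal sketch of the idea; the key names below are made up for illustration and are not the actual mistral parameter names:)

```python
# Hypothetical key map from the reference implementation's parameter
# names to torchtune's; the real mistral names will differ.
_KEY_MAP = {
    "attention.wq.weight": "attn.q_proj.weight",
    "attention.wk.weight": "attn.k_proj.weight",
    "attention.wv.weight": "attn.v_proj.weight",
    "attention.wo.weight": "attn.output_proj.weight",
}


def map_state_dict(reference_sd: dict) -> dict:
    """Rename reference keys so the torchtune model can load the weights."""
    return {_KEY_MAP.get(key, key): value for key, value in reference_sd.items()}


# Usage (illustrative):
# torchtune_model.load_state_dict(map_state_dict(reference_model.state_dict()))
```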
…d unit test with expected value
Okay, all seems good. We now have a unit test for the base model with an expected value. For the unfortunate reviewer seeing my +1160 line PR (I hope you read this first!):
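For reviewers unfamiliar with the pattern: a unit test "with expected value" fixes all randomness, runs the model once, and pins a scalar summary of the output. A hedged sketch of the shape such a test takes; the builder arguments and the expected constant here are placeholders, not the values in this PR:

```python
import torch

from torchtune.models.mistral import mistral


def test_mistral_forward_expected_value():
    # Fix all randomness so weight init and inputs are reproducible.
    torch.manual_seed(16)
    # Tiny hypothetical config for speed; the real test picks its own sizes.
    model = mistral(
        vocab_size=100,
        num_layers=2,
        num_heads=4,
        num_kv_heads=2,
        embed_dim=64,
        intermediate_dim=128,
        max_seq_len=128,
    )
    tokens = torch.randint(0, 100, (2, 16))
    out = model(tokens)
    assert out.shape == (2, 16, 100)
    # 1.2345 is a placeholder; the real value is computed once against the
    # reference implementation and then hard-coded into the test.
    torch.testing.assert_close(
        out.mean(), torch.tensor(1.2345), atol=1e-4, rtol=1e-4
    )
```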
@@ -0,0 +1,186 @@
# Copyright (c) Meta Platforms, Inc. and affiliates.
This is the main file used for comparing implementations.
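For readers who haven't opened the file: the general shape of such a comparison script is seed everything, load the (mapped) reference weights into the torchtune model, run both on the same tokens, and assert the outputs match. A rough sketch under those assumptions; the helper names here are illustrative, not the PR's actual code:

```python
import torch


def compare_models(torchtune_model, reference_model, map_state_dict, vocab_size):
    """Run both models on identical inputs and check the outputs agree.

    `map_state_dict` is whatever key-renaming helper the script defines
    (see the mapping sketch above); its real signature may differ.
    """
    torch.manual_seed(0)
    # Identical weights on both sides: copy the reference weights across.
    torchtune_model.load_state_dict(map_state_dict(reference_model.state_dict()))
    tokens = torch.randint(0, vocab_size, (1, 8))
    with torch.no_grad():
        expected = reference_model(tokens)
        actual = torchtune_model(tokens)
    torch.testing.assert_close(actual, expected, atol=1e-5, rtol=1e-5)
    print(f"outputs match; mean logit = {actual.mean().item():.6f}")
```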
Thanks for all this extensive testing!
I think we wanna find the right balance of rigorous testing and maintenance here. So while I don't want your work to be in vain, I wonder if we should just add those comparison scripts that differ nontrivially from the Llama2 ones, and for the other components point to the Llama2 ones. In this case that would mean keeping `compare_mistral` and `compare_feedforward` (since you mentioned the feedforward isn't tested under Llama2). Then you can add a readme to …
Not in vain at all - I learnt lots! I've updated and added a README.
Looks good! Two small nits; with green CI this is good to merge.
Co-authored-by: ebsmothers <ebs@meta.com>
…into mistral-tests
Thanks again for your review @ebsmothers :)
Context
What is the purpose of this PR?
Please link to any issues this PR addresses.
#848
Changelog
I've started adding scripts to verify the implementation of `mistral`. I'm using the reference implementation from the official repo. There's another implementation in the repo which uses `xformers` for the attention mechanism, but it's not straightforward to replicate; I ran into lots of issues when I initially tried.

So far, I've added a script to compare the attention implementation. I've verified that the attention implementation produces consistent outputs using `python -m tests.torchtune.models.mistral.scripts.compare_attention`. I'll be keeping the reference implementation in `tests/torchtune/models/mistral/scripts/mistral_reference.py`.
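At a high level, that attention comparison follows the same pattern as the other scripts: feed identical seeded activations to both attention modules and compare the outputs. A simplified sketch of the pattern; the call signatures of the real modules differ (for example, the reference attention also takes rotary-embedding inputs):

```python
import torch


def compare_attention(torchtune_attn, reference_attn, embed_dim=64, seq_len=16):
    """Drive both attention modules with the same random activations."""
    torch.manual_seed(42)
    x = torch.randn(1, seq_len, embed_dim)
    with torch.no_grad():
        reference_out = reference_attn(x)
        torchtune_out = torchtune_attn(x)
    # A tight tolerance catches subtle mismatches, e.g. in how RoPE is applied.
    torch.testing.assert_close(torchtune_out, reference_out, atol=1e-6, rtol=1e-6)
    print(f"attention outputs match; max abs value = {torchtune_out.abs().max():.6f}")
```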
Next steps
I'm generally following this process: the plan is to continue copying and testing the components of the mistral implementation, then testing the models as a whole by implementing a mapping from `torchtune.models.mistral` into the reference implementation. Finally, I'll add unit tests to integrate into CI.

Good to make sure I'm not too far off the mark :)