Feat: Add example for Mistral #644

NanoCode012 · 2023-09-27T15:48:47Z

Todo:

Update the transformers commit

In a future PR, allow sample_packing.

Upstream PR: huggingface/transformers#26447

winglian · 2023-09-27T16:37:25Z

examples/mistral/config.yml

@@ -0,0 +1,69 @@
+base_model: mistralai/Mistral-7B-v0.1
+base_model_config: mistralai/Mistral-7B-v0.1
+model_type: MistralForCausalLM


Should be able to use AutoModelForCausalLM here.

NanoCode012 · Sep 28, 2023

Yes, but I think it would be better to specify the model type explicitly for later parsing (e.g. is_mistral_derived..)
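To illustrate the trade-off discussed in this thread, here is a minimal sketch of how an explicit `model_type` could make later "is this Mistral-derived?" checks straightforward. The helper `is_mistral_derived` and the second config are hypothetical and are not taken from axolotl's code; the sketch only assumes the config has been loaded into a plain dict.

```python
from typing import Any, Dict


def is_mistral_derived(cfg: Dict[str, Any]) -> bool:
    """Hypothetical helper (not axolotl's actual logic).

    An explicit flag wins; otherwise fall back to inspecting model_type.
    """
    explicit = cfg.get("is_mistral_derived_model")
    if explicit is not None:
        return bool(explicit)
    model_type = cfg.get("model_type") or ""
    return "mistral" in model_type.lower()


# Config as in the diff above: the concrete class name carries the information.
cfg_explicit = {
    "base_model": "mistralai/Mistral-7B-v0.1",
    "model_type": "MistralForCausalLM",
}

# Hypothetical config using the generic auto class: nothing to parse from model_type.
cfg_auto = {
    "base_model": "some-org/custom-finetune",
    "model_type": "AutoModelForCausalLM",
}

print(is_mistral_derived(cfg_explicit))
print(is_mistral_derived(cfg_auto))
```

With the generic auto class, a check like this would need either an explicit flag in the config or some other source of information about the architecture.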

winglian · 2023-09-27T17:23:04Z

let's use [78dd120](https://github.com/huggingface/transformers/commit/78dd1202823ca035b9609ddbcdaac2945a6530ff)

winglian · 2023-09-27T17:24:25Z

this PR should fix the issues we were having with transformers main, which is why we had to pin it in the first place (#636)

winglian · 2023-09-27T21:45:55Z

also, see #646

* Feat: Add example for Mistral
* chore: turn off flash
* chore: add is_mistral_derived_model
* chore: update following PR

winglian reviewed Sep 27, 2023

mhenrichsen reviewed Sep 27, 2023

mhenrichsen reviewed Sep 27, 2023

NanoCode012 added 4 commits September 28, 2023 10:15

Feat: Add example for Mistral

52b8f3b

chore: turn off flash

9efc37c

chore: add is_mistral_derived_model

b29f370

chore: update following PR

de10bd4

NanoCode012 force-pushed the feat/mistral branch from b6aa8ee to de10bd4 Compare September 28, 2023 01:24

NanoCode012 marked this pull request as ready for review September 28, 2023 01:31

NanoCode012 merged commit eb41f76 into axolotl-ai-cloud:main Sep 28, 2023
4 checks passed

NanoCode012 deleted the feat/mistral branch September 28, 2023 11:15

mkeoliya pushed a commit to mkeoliya/axolotl that referenced this pull request Dec 15, 2023

Feat: Add example for Mistral (axolotl-ai-cloud#644)

cdc33dd

* Feat: Add example for Mistral
* chore: turn off flash
* chore: add is_mistral_derived_model
* chore: update following PR
