
[New Feature] Is Mixtral supported? #879

Open · markusdr opened this issue Jul 8, 2024 · 1 comment

markusdr commented Jul 8, 2024

Can you confirm whether Mixtral is currently supported (e.g., mistralai/Mixtral-8x7B-Instruct-v0.1)? I saw in another issue that Mistral is supported, but I'm not sure about Mixtral-8x7B, since it is a different architecture.

research4pan (Contributor) commented Jul 9, 2024

Thanks for your interest in LMFlow! We have tested Mixtral-8x7B on servers with 8× A40 (48G) GPUs, so dense training of Mixtral-8x7B is currently supported in LMFlow. Sparse training is not implemented yet; we have added it to our roadmap and will schedule the implementation soon. Multi-node training (https://github.com/OptimalScale/LMFlow/blob/main/readme/multi_node.md) can be used for larger models such as Mixtral-8x22B, but we haven't tested models that large yet.
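
As a quick sanity check outside LMFlow itself, here is a minimal sketch that loads Mixtral-8x7B through the standard Hugging Face transformers API and generates a few tokens. The prompt and generation settings are placeholders, and device_map="auto" assumes enough total GPU memory (e.g. 8× A40 48G) to shard the model:

```python
# Minimal load-and-generate smoke test for Mixtral-8x7B using the
# Hugging Face transformers API (not LMFlow's own entry point).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mixtral-8x7B-Instruct-v0.1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
# device_map="auto" shards the weights across all visible GPUs;
# bfloat16 roughly halves the memory footprint versus float32.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Placeholder prompt in the Mixtral instruction format.
inputs = tokenizer(
    "[INST] Hello, who are you? [/INST]", return_tensors="pt"
).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

If this loads and generates, the model should also work with LMFlow's dense training path described above.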

Hope this information can be helpful 😄
