[Feature] Support finetune Deepseek v2 #663

Merged: 30 commits, Jun 13, 2024

@HIT-cwh (Collaborator) commented May 8, 2024

No description provided.

@pppppM pppppM merged commit f7d1aea into InternLM:main Jun 13, 2024
3 checks passed
llkn-2 pushed a commit to llkn-2/xtuner that referenced this pull request Jul 31, 2024
* support deepseek v2

* fix dispatch

* refactor deepseek v2

* fix lint

* fix bugs

* fix bugs

* delete useless codes

* refactor deepseek config

* rewrite DeepseekV2PreTrainedModel.from_pretrained

* revert sft.py to main

* delete useless codes

* add deepseek v2 config

* add deepseek readme

* add HFCheckpointHook

* optimize mixtral moe

* fix bugs

* delete useless codes

* delete evalchathook

* fix bugs

* fix bugs

* add moe SUPPORT_MODELS and fix HFCheckpointHook

* add moe SUPPORT_MODELS and fix HFCheckpointHook

* fix bugs

* refactor modeling_deepseek

* update deepseek readme

* support deepseek v2 lite

* fix bugs
2 participants