Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bump max new tokens for NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO #702

Merged
merged 2 commits into from
Jan 17, 2024

Conversation

nsarrazin
Copy link
Collaborator

No description provided.

@nsarrazin nsarrazin added the models This issue is related to model performance/reliability label Jan 17, 2024
@nsarrazin
Copy link
Collaborator Author

bumped it again to mixtral levels of context window, but with a lower max new tokens. cc @gary149

@nsarrazin nsarrazin merged commit 9b5d65a into main Jan 17, 2024
3 checks passed
@nsarrazin nsarrazin deleted the models/bump-nous-mixtral-max-tokens branch January 17, 2024 12:06
ice91 pushed a commit to ice91/chat-ui that referenced this pull request Oct 30, 2024
…uggingface#702)

* Bump max new tokens for NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO

* bumped values a bit higher seems to work ok
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
models This issue is related to model performance/reliability
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants