Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow load_best_model_at_end to be configured for early stopping on custom evaluation datasets #1291

Merged
merged 2 commits into from
Feb 21, 2024

Conversation

dameikle
Copy link
Contributor

@dameikle dameikle commented Feb 14, 2024

Description

This change adds an additional clause when configuring the trainer to allow load_best_model_at_end to be set when using custom evaluation datasets (i.e. test_datasets is populated and val_set_size is to zero).

Motivation and Context

When using a custom test dataset for evaluation was unable to set early_stopping_patience as load_best_model_at_end was never set on the training configuration. This change allows the early stopping callback to be configured in the trainer successful.

How has this been tested?

I have been using this successfully for various training runs for models.

@NanoCode012
Copy link
Collaborator

Hey! Please let us know if you need help running lint. You can find a short guide on how to set it up here: https://github.com/OpenAccess-AI-Collective/axolotl?tab=readme-ov-file#contributing-

@NanoCode012 NanoCode012 linked an issue Feb 17, 2024 that may be closed by this pull request
8 tasks
@dameikle
Copy link
Contributor Author

@winglian @NanoCode012 apologies for not installing the hooks and noticing the lint issue. Hopefully this is good now. Thanks for all you do with Axolotl, it is a really great project

@NanoCode012 NanoCode012 merged commit 3c00f40 into axolotl-ai-cloud:main Feb 21, 2024
7 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Can't have load_best_model_at_end with explicit test_datasets
4 participants