Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reinforce PR #1288

Merged
merged 11 commits into from
Dec 19, 2024
Merged

Reinforce PR #1288

merged 11 commits into from
Dec 19, 2024

Conversation

dmahan93
Copy link
Contributor

  • Add reinforce leave one out
  • Add model weight sharing via pointers
  • Add online dataset

Still need to test and get an example of how to orchestrate the online dataset + weight server

- Add reinforce leave one out
- Add model weight sharing via pointers
- Add online dataset
- Add example files
- TODO: Add the whole online loop instructions once vllm fork is pushed
@dmahan93 dmahan93 marked this pull request as ready for review September 24, 2024 21:57
@Quentin-Anthony Quentin-Anthony self-assigned this Sep 24, 2024
@dmahan93
Copy link
Contributor Author

@Quentin-Anthony should be ready for a review 🙌

Copy link
Member

@Quentin-Anthony Quentin-Anthony left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Finally was able to finish reviewing and testing this! Merging into main. Thanks a ton @dmahan93

@Quentin-Anthony Quentin-Anthony merged commit f7a5a6f into main Dec 19, 2024
1 of 4 checks passed
@Quentin-Anthony Quentin-Anthony deleted the add-reinforce branch December 19, 2024 23:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants