feat: Allow passing model and tokenizer to ArgillaTrainer directly #3751

tomaarsen · 2023-09-12T12:11:35Z

Hello!

Description

Closes #3631.

This gives users the freedom to set up their tokenizer exactly as they need it, which is required for e.g. SFT with TRL.
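As a rough illustration of the pattern described above (accepting either a model name or an already initialized model, plus an optional user-configured tokenizer), here is a minimal standard-library sketch. `FakeModel`, `FakeTokenizer` and `resolve` are hypothetical stand-ins invented for this example, not Argilla, Transformers or TRL APIs, and the pad token is only an illustrative tokenizer setting.

```python
from dataclasses import dataclass
from typing import Optional, Tuple, Union


@dataclass
class FakeTokenizer:
    """Hypothetical stand-in for a tokenizer the user configured by hand."""
    pad_token: Optional[str] = None


@dataclass
class FakeModel:
    """Hypothetical stand-in for an already initialized model."""
    name_or_path: str


def resolve(
    model: Union[str, FakeModel],
    tokenizer: Optional[FakeTokenizer] = None,
) -> Tuple[FakeModel, FakeTokenizer]:
    """Accept a model name or an initialized model, plus an optional tokenizer."""
    # A string is treated as a name to load; an object is used as-is.
    if isinstance(model, str):
        model = FakeModel(name_or_path=model)
    # Only build a default tokenizer when the caller did not supply one,
    # so a custom setup (here: an illustrative pad token) is preserved.
    if tokenizer is None:
        tokenizer = FakeTokenizer()
    return model, tokenizer


custom = FakeTokenizer(pad_token="<pad>")
model, tokenizer = resolve(FakeModel("my-model"), custom)
```

The point of the design is that a tokenizer passed by the caller is never replaced by a default one, while passing only a name keeps working as before.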

Type of change

New feature (non-breaking change which adds functionality)
Refactor (change restructuring the codebase without changing functionality)
Improvement (change adding some improvement to an existing functionality)

How Has This Been Tested

Updated the relevant tests (TRL, Transformers) to also train with the passed model & tokenizer.

Checklist

I added relevant documentation
I followed the style guidelines of this project
I did a self-review of my code
I made corresponding changes to the documentation
My changes generate no new warnings
I have added tests that prove my fix is effective or that my feature works
I filled out the contributor form (see text above)
I have added relevant notes to the CHANGELOG.md file (See https://keepachangelog.com/)

TODO:

CHANGELOG
Documentation
Double-check docstrings

Tom Aarsen


tomaarsen · 2023-09-12T13:29:06Z

The test failures seem unrelated: they are all `ConnectionTimeout` and `TransportError(503, '')` errors. @gabrielmbmb Did we find a solution for this yet?

davidberenstein1957 · 2023-09-13T11:10:12Z

@tomaarsen, we did not, but Gabri just suggested rerunning it.

davidberenstein1957

Looks good! Some tiny remarks.

docs/_source/guides/llms/practical_guides/fine_tune.md

src/argilla/client/feedback/training/base.py

src/argilla/client/feedback/training/frameworks/trl.py


codecov · 2023-09-14T13:14:45Z

Codecov Report

Patch coverage is 88.46% of modified lines.

| Files Changed | Coverage |
|---|---|
| src/argilla/training/transformers.py | `72.72%` |
| src/argilla/client/feedback/training/base.py | `100.00%` |
| ...lient/feedback/training/frameworks/transformers.py | `100.00%` |
| ...argilla/client/feedback/training/frameworks/trl.py | `100.00%` |


github-actions · 2023-09-14T14:01:09Z

The URL of the deployed environment for this PR is https://argilla-quickstart-pr-3751-ki24f765kq-no.a.run.app

Hello!

# Argilla Community Growers

Ever since #3751, `model` can also be an already initialized model. This edge case was being missed before. This should help with the test failures on #3911.

**Type of change**

(Please delete options that are not relevant. Remember to title the PR according to the type of change)

- [x] Bug fix (non-breaking change which fixes an issue)
- [ ] New feature (non-breaking change which adds functionality)
- [ ] Breaking change (fix or feature that would cause existing functionality to not work as expected)
- [ ] Refactor (change restructuring the codebase without changing functionality)
- [ ] Improvement (change adding some improvement to an existing functionality)
- [ ] Documentation update

**How Has This Been Tested**

`pytest .\tests\integration\client\feedback\training\test_trainer.py::test_argilla_trainer_text_classification_with_model_tokenizer`

**Checklist**

- [ ] I added relevant documentation
- [ ] follows the style guidelines of this project
- [x] I did a self-review of my code
- [ ] I made corresponding changes to the documentation
- [ ] My changes generate no new warnings
- [ ] I have added tests that prove my fix is effective or that my feature works
- [ ] I filled out [the contributor form](https://tally.so/r/n9XrxK) (see text above)
- [ ] I have added relevant notes to the CHANGELOG.md file (See https://keepachangelog.com/)

---

- Tom Aarsen
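The edge case described above (a `model` argument that may be a name or an initialized model, while something downstream expects a string) could be handled along these lines. This is only a sketch under stated assumptions, not the actual fix: `model_name_as_string` is a hypothetical helper, and it assumes initialized models expose a `name_or_path` attribute.

```python
from typing import Any


def model_name_as_string(model: Any) -> str:
    """Return a string identifier whether `model` is a name or an initialized object."""
    if isinstance(model, str):
        return model
    # Assumption: initialized models expose their origin as `name_or_path`;
    # fall back to the class name when they do not.
    return str(getattr(model, "name_or_path", type(model).__name__))
```

With this shape, code that stores or logs the model identifier never receives a model object by accident.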

tomaarsen added 4 commits September 12, 2023 10:17

Allow PreTrainedModel and PreTrainedTokenizer for TRL

8157ca2

+ TRL tests

Allow passing model and tokenizer for transformers

3575697

Speed up transformers test

a380137

Improve TRL tests

1d3ec7c

tomaarsen marked this pull request as ready for review September 12, 2023 12:29

tomaarsen added 2 commits September 13, 2023 12:06

Add note on passing a model and tokenizer to the Trainer

bd49653

Add CHANGELOG

cf9a093

tomaarsen requested a review from davidberenstein1957 September 13, 2023 10:10

davidberenstein1957 reviewed Sep 13, 2023


tomaarsen and others added 3 commits September 13, 2023 13:22

Add hint to doc note

fc3d009

Co-authored-by: David Berenstein <david.m.berenstein@gmail.com>

Add warning if tokenizer is passed with the wrong frameworks

f07904c

Also removed tokenizer from setfit (where it didn't do anything) and updated some docstrings

Merge branch 'feat/trainer_model_tokenizer' of https://github.com/argilla-io/argilla into feat/trainer_model_tokenizer

07f2df0

Merge branch 'develop' into feat/trainer_model_tokenizer

de872a4
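For the commit "Add warning if tokenizer is passed with the wrong frameworks" in the list above, the check could look roughly like the following. This is a sketch, not the code from the PR: `check_tokenizer` and `FRAMEWORKS_WITH_TOKENIZER` are hypothetical names, and the set of frameworks that use a tokenizer is an assumption based on the frameworks this PR touches.

```python
import warnings
from typing import Any, Optional

# Assumption: only these frameworks make use of a user-supplied tokenizer.
FRAMEWORKS_WITH_TOKENIZER = {"transformers", "trl"}


def check_tokenizer(framework: str, tokenizer: Optional[Any]) -> Optional[Any]:
    """Warn and drop the tokenizer when the framework would ignore it."""
    if tokenizer is not None and framework not in FRAMEWORKS_WITH_TOKENIZER:
        warnings.warn(
            f"Passing a tokenizer is not supported for the {framework!r} "
            "framework; it will be ignored.",
            UserWarning,
            stacklevel=2,
        )
        return None
    return tokenizer
```

Warning instead of raising keeps existing calls working while still telling the user that their tokenizer has no effect.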

tomaarsen merged commit 3aac61f into develop Sep 15, 2023

tomaarsen deleted the feat/trainer_model_tokenizer branch September 15, 2023 08:21

tomaarsen mentioned this pull request Oct 10, 2023

Hotfix: Always set pretrained_model_name_or_path as string #3914

Merged

14 tasks
