LITA integration #9578

Merged
merged 13 commits into from
Jul 11, 2024

@Slyne (Contributor) commented Jul 1, 2024

What does this PR do?

This PR adds LITA support end to end, from checkpoint conversion and dataset preprocessing for DVC datasets through to fine-tuning.
It also adds VILA to the LITA model.

Collection: [Note which collection this PR will affect]

  • multimodal
  • nlp

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

@Slyne (Contributor, Author) commented Jul 1, 2024

To-do list:

  1. Move convert_llava_hf_to_nemo.py
  2. Add a tutorial


event_prompts = [
"What is the action performed in this video?",
"Can you highlight the action performed in this video?" "What is the main event or action captured in this video?",

Check warning

Code scanning / CodeQL

Implicit string concatenation in a list Warning

Implicit string concatenation. Maybe missing a comma?
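
The warning above can be illustrated with a minimal sketch. Python joins adjacent string literals at compile time, so the second and third prompts in the excerpt become a single list element. The names `event_prompts_flagged` and `event_prompts_fixed` are hypothetical, and the lists contain only the prompts visible in the excerpt, not the full list from the PR.

```python
# Adjacent string literals are concatenated, so this list has 2 items, not 3.
event_prompts_flagged = [
    "What is the action performed in this video?",
    "Can you highlight the action performed in this video?" "What is the main event or action captured in this video?",
]

# Assumed intent: one prompt per element, separated by a comma.
event_prompts_fixed = [
    "What is the action performed in this video?",
    "Can you highlight the action performed in this video?",
    "What is the main event or action captured in this video?",
]

print(len(event_prompts_flagged), len(event_prompts_fixed))
```

If the prompts are sampled at random, the merged element would surface as one double-length prompt, which is presumably not what was intended.
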
return time_to_string(value) + f"<!|t{value}t|!>"

value = re.sub(r"<([\d.]{1,20})s>", repl, value)
value = re.sub(r"\s([\d.]{1,20})s[\s|\.|,|>]", repl, value)

Check warning

Code scanning / CodeQL

Duplication in regular expression character class Warning

This regular expression includes duplicate character '|' in a set of characters.
return time_to_string(value) + f"<!|t{value}t|!>"

text = re.sub(r"<([\d.]{1,20})s>", repl, text)
text = re.sub(r"\s([\d.]{1,20})s[\s|\.|,|>]", repl, text)

Check warning

Code scanning / CodeQL

Duplication in regular expression character class Warning

This regular expression includes duplicate character '|' in a set of characters.
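
Both regex warnings point at the same construct. Inside a character class, `|` is a literal pipe rather than alternation, so `[\s|\.|,|>]` lists `|` three times and also matches a pipe after the trailing `s`. The sketch below compares the flagged pattern with one possible cleanup. It assumes the intent was "whitespace, period, comma, or `>`"; the names `FLAGGED`, `CLEANED`, and `find_seconds` are hypothetical and not taken from the PR.

```python
import re

# Pattern as flagged by CodeQL: '|' inside [...] is a literal character.
FLAGGED = re.compile(r"\s([\d.]{1,20})s[\s|\.|,|>]")

# Assumed intent: the trailing "s" is followed by whitespace, '.', ',' or '>'.
# Note this no longer matches a literal '|', which is a small behavior change.
CLEANED = re.compile(r"\s([\d.]{1,20})s[\s.,>]")


def find_seconds(pattern, text):
    """Return the numeric strings captured by the pattern's single group."""
    return pattern.findall(text)


print(find_seconds(FLAGGED, "starts at 3.0s, ends"))
print(find_seconds(CLEANED, "starts at 3.0s, ends"))
print(find_seconds(FLAGGED, "at 12.5s|"))
print(find_seconds(CLEANED, "at 12.5s|"))
```

The two patterns agree on ordinary timestamps and differ only when the character after `s` is a pipe.
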
nemo/collections/multimodal/parts/utils.py: Fixed
@Slyne changed the title from "[DRAFT]LITA integration" to "LITA integration" on Jul 5, 2024
@github-actions bot added the CI label on Jul 5, 2024
@yaoyu-33 self-requested a review on July 8, 2024 17:26
Slyne Deng and others added 9 commits July 8, 2024 23:49
Signed-off-by: Slyne Deng <slyned@nvidia.com>
Signed-off-by: Slyne <Slyne@users.noreply.github.com>
Signed-off-by: slyne deng <slyned@nvidia.com>
Signed-off-by: slyne deng <slyned@nvidia.com>
Signed-off-by: slyne deng <slyned@nvidia.com>
Signed-off-by: Slyne <Slyne@users.noreply.github.com>
Signed-off-by: Slyne Deng <slyned@nvidia.com>
Signed-off-by: Slyne Deng <slyned@nvidia.com>
Signed-off-by: Slyne Deng <slyned@nvidia.com>
Signed-off-by: Slyne Deng <slyned@nvidia.com>
@yaoyu-33 previously approved these changes on Jul 9, 2024
@yaoyu-33 added and removed the "Run CICD" label on Jul 9, 2024
Signed-off-by: Slyne Deng <slyned@nvidia.com>
Signed-off-by: Slyne Deng <slyned@nvidia.com>
@yaoyu-33 merged commit 900ca0b into NVIDIA:main on Jul 11, 2024
394 of 396 checks passed
@Slyne mentioned this pull request on Jul 11, 2024 (8 tasks)
marcromeyn pushed a commit that referenced this pull request Jul 11, 2024
* add lita

Signed-off-by: Slyne Deng <slyned@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: Slyne <Slyne@users.noreply.github.com>

* add part of the tutorial and fix format

Signed-off-by: slyne deng <slyned@nvidia.com>

* add tutorial

Signed-off-by: slyne deng <slyned@nvidia.com>

* fix Tutorial ckpt conversion

Signed-off-by: slyne deng <slyned@nvidia.com>

* Apply isort and black reformatting

Signed-off-by: Slyne <Slyne@users.noreply.github.com>

* update cicd

Signed-off-by: Slyne Deng <slyned@nvidia.com>

* add to CIICD test

Signed-off-by: Slyne Deng <slyned@nvidia.com>

* changes based on review comments

Signed-off-by: Slyne Deng <slyned@nvidia.com>

* fix bot warning

Signed-off-by: Slyne Deng <slyned@nvidia.com>

* update cicd main

Signed-off-by: Slyne Deng <slyned@nvidia.com>

* fix cicd ckpt conversion

Signed-off-by: Slyne Deng <slyned@nvidia.com>

---------

Signed-off-by: Slyne Deng <slyned@nvidia.com>
Signed-off-by: Slyne <Slyne@users.noreply.github.com>
Signed-off-by: slyne deng <slyned@nvidia.com>
Co-authored-by: Slyne Deng <slyned@nvidia.com>
Co-authored-by: Slyne <Slyne@users.noreply.github.com>
Co-authored-by: Yu Yao <54727607+yaoyu-33@users.noreply.github.com>
maanug-nv pushed a commit that referenced this pull request Jul 14, 2024
ertkonuk pushed a commit that referenced this pull request Jul 19, 2024
Signed-off-by: Tugrul Konuk <ertkonuk@gmail.com>
malay-nagda pushed a commit to malay-nagda/NeMo that referenced this pull request Jul 26, 2024
Signed-off-by: Malay Nagda <malayn@malayn-mlt.client.nvidia.com>
tonyjie pushed a commit to tonyjie/NeMo that referenced this pull request Aug 6, 2024
Signed-off-by: tonyjie <jl4257@cornell.edu>
monica-sekoyan pushed a commit that referenced this pull request Oct 14, 2024
hainan-xv pushed a commit to hainan-xv/NeMo that referenced this pull request Nov 5, 2024
Signed-off-by: Hainan Xu <hainanx@nvidia.com>