Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

set mock in GPTDatasetConfig #10435

Merged
merged 8 commits into from
Sep 19, 2024
Merged

set mock in GPTDatasetConfig #10435

merged 8 commits into from
Sep 19, 2024

Conversation

akoumpa
Copy link
Member

@akoumpa akoumpa commented Sep 9, 2024

What does this PR do ?

Previously, the GPTDatasetConfig.mock attribute was not set correctly, as a result, training scripts were passing a MockGPTDataset class to MCore, but the .mock was still set to default value (False), thus instead of MockGPTDataset it was instantiating a GPTDataset.

The fix is to set the .mock to have the correct value.

Collection: [Note which collection this PR will affect]

Changelog

  • Add specific line by line info of high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

@akoumpa akoumpa self-assigned this Sep 9, 2024
@github-actions github-actions bot added the NLP label Sep 9, 2024
@akoumpa akoumpa changed the title Pass mock to GPTDatasetConfig set mock in GPTDatasetConfig Sep 9, 2024
@akoumpa akoumpa requested review from dimapihtar and removed request for dimapihtar September 9, 2024 22:11
@akoumpa akoumpa force-pushed the akoumparouli/fix_mock_dataset branch from 6320b85 to b9db63a Compare September 10, 2024 04:31
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Copy link
Collaborator

@cuichenx cuichenx left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@akoumpa akoumpa added Run CICD and removed Run CICD labels Sep 10, 2024
@akoumpa akoumpa added Run CICD and removed Run CICD labels Sep 11, 2024
@akoumpa akoumpa added Run CICD and removed Run CICD labels Sep 11, 2024
@akoumpa akoumpa added Run CICD and removed Run CICD labels Sep 16, 2024
@akoumpa akoumpa merged commit 3653bed into main Sep 19, 2024
145 of 156 checks passed
@akoumpa akoumpa deleted the akoumparouli/fix_mock_dataset branch September 19, 2024 16:25
rachitgarg91 pushed a commit to rachitgarg91/NeMo that referenced this pull request Sep 26, 2024
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Pablo Garay <palenq@gmail.com>
monica-sekoyan pushed a commit that referenced this pull request Oct 14, 2024
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Pablo Garay <palenq@gmail.com>
tomlifu pushed a commit to tomlifu/NeMo that referenced this pull request Oct 25, 2024
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Pablo Garay <palenq@gmail.com>
Signed-off-by: Lifu Zhang <tomzhanglf@gmail.com>
tomlifu pushed a commit to tomlifu/NeMo that referenced this pull request Oct 25, 2024
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Pablo Garay <palenq@gmail.com>
Signed-off-by: Lifu Zhang <tomzhanglf@gmail.com>
hainan-xv pushed a commit to hainan-xv/NeMo that referenced this pull request Nov 5, 2024
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Co-authored-by: Pablo Garay <palenq@gmail.com>
Signed-off-by: Hainan Xu <hainanx@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants