use torch scaled_dot_product_attention #1

Draft
wants to merge 697 commits into base: main
This pull request is big! We’re only showing the most recent 250 commits.

Commits on Aug 22, 2024

  1. force optimizer.param_groups to match mcore_optimizer.param_groups after restoring from a checkpoint (NVIDIA#10225)
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    ashors1 authored Aug 22, 2024
    78f57fe
  2. Update TRTLLM 0.12 (NVIDIA#10215)

    * Update TRTLLM 0.12
    
    * Add model config
    
    * Change config
    
    * Change deploy script
    
    * Apply isort and black reformatting
    
    Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>
    
    * Remove parameter
    
    ---------
    
    Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>
    Co-authored-by: meatybobby <meatybobby@users.noreply.github.com>
    Co-authored-by: Onur Yilmaz <35306097+oyilmaz-nvidia@users.noreply.github.com>
    3 people authored Aug 22, 2024
    5269caf
  3. Tutorial: audio codec inference (NVIDIA#10186)

    Signed-off-by: Ante Jukić <ajukic@nvidia.com>
    anteju authored Aug 22, 2024
    42c2910

Commits on Aug 23, 2024

  1. Move trt imports in nemo.collections.llm inside respective functions (NVIDIA#10234)
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    hemildesai authored Aug 23, 2024
    753c70e
  2. Add tests for LazyNeMoIterator and fix case with metadata_only=True and offsets in manifest (NVIDIA#10198)
    
    * Add tests for LazyNeMoIterator and fix case with manifest_only=True and offsets in manifest
    
    Signed-off-by: Piotr Żelasko <petezor@gmail.com>
    
    * Address code review
    
    Signed-off-by: Piotr Żelasko <petezor@gmail.com>
    
    * fix tests
    
    Signed-off-by: Piotr Żelasko <petezor@gmail.com>
    
    * fix tests
    
    Signed-off-by: Piotr Żelasko <petezor@gmail.com>
    
    ---------
    
    Signed-off-by: Piotr Żelasko <petezor@gmail.com>
    pzelasko authored Aug 23, 2024
    d4d6a5b
  3. [NeMo-UX] Fix a serialization bug that prevents users from moving checkpoints (NVIDIA#9939)
    
    * perform serialization using relative paths to allow users to move checkpoints after they're saved
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * remove unused import
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix artifact load
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix path artifact
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * remove unused import
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    ---------
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    Co-authored-by: ashors1 <ashors1@users.noreply.github.com>
    ashors1 and ashors1 authored Aug 23, 2024
    1c90b5e
  4. Add MemoryProfileCallback (NVIDIA#10166)

    * Add MemoryProfileCallback
    
    Signed-off-by: Shriya Palsamudram <spalsamudram@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ShriyaPalsamudram <ShriyaPalsamudram@users.noreply.github.com>
    
    * Remove reference cycles, save snapshot on specific ranks
    
    Signed-off-by: Shriya Palsamudram <spalsamudram@nvidia.com>
    
    * Remove unnecessary imports
    
    Signed-off-by: Shriya Palsamudram <spalsamudram@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ShriyaPalsamudram <ShriyaPalsamudram@users.noreply.github.com>
    
    * Update docstring
    
    Signed-off-by: Shriya Palsamudram <spalsamudram@nvidia.com>
    
    ---------
    
    Signed-off-by: Shriya Palsamudram <spalsamudram@nvidia.com>
    Signed-off-by: ShriyaPalsamudram <ShriyaPalsamudram@users.noreply.github.com>
    Signed-off-by: Shriya Rishab <69161273+ShriyaPalsamudram@users.noreply.github.com>
    Co-authored-by: ShriyaPalsamudram <ShriyaPalsamudram@users.noreply.github.com>
    ShriyaPalsamudram and ShriyaPalsamudram authored Aug 23, 2024
    6d1be93
  5. Lower bound transformers to support nemotron (NVIDIA#10240)

    Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>
    Co-authored-by: Dong Hyuk Chang <donghyukc@nvidia.com>
    thomasdhc and thomasdhc authored Aug 23, 2024
    d415621

Commits on Aug 24, 2024

  1. [Audio] SSL Pretraining framework for flow-matching model for audio processing (NVIDIA#10052)
    
    Flow matching generative model with SSL pretraining framework
    
    Signed-off-by: Pin-Jui Ku <pku@nvidia.com>
    Co-authored-by: Kuray107 <Kuray107@users.noreply.github.com>
    Kuray107 and Kuray107 authored Aug 24, 2024
    7cc99e9

Commits on Aug 26, 2024

  1. Revert torchrun fix for model import (NVIDIA#10251)

    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    akoumpa authored Aug 26, 2024
    8d9cfee
  2. [NeMo-UX] Move nemotron imports inline (NVIDIA#10255)

    * Move nemotron transformers + tokenizer imports inline to reduce number of required deps
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    Co-authored-by: marcromeyn <marcromeyn@users.noreply.github.com>
    marcromeyn and marcromeyn authored Aug 26, 2024
    642c97a
  3. Wrap CPU model init with megatron_lazy_init_context (NVIDIA#10219)

    * Wrap CPU model init with megatron_lazy_init_context
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Cleanup checkpoint-dir if saving fails
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Aug 26, 2024
    8210e9c
  4. sdpa work

    Signed-off-by: WoodieDudy <goshagks@gmail.com>
    WoodieDudy committed Aug 26, 2024
    fad3414
  5. 6f6fc27
  6. Apply isort and black reformatting

    Signed-off-by: titu1994 <titu1994@users.noreply.github.com>
    titu1994 committed Aug 26, 2024
    941c7f5
  7. Bump Dockerfile.ci (2024-08-22) (NVIDIA#10227)

    * [🤠]: Howdy folks, let's bump `Dockerfile.ci` to 124bcff !
    
    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    
    * fix bert flags
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Aug 26, 2024
    ea8f49b
  8. salm export trtllm (NVIDIA#10245)

    Signed-off-by: slyne deng <slyned@nvidia.com>
    Co-authored-by: slyne deng <slyned@nvidia.com>
    Slyne and slyne deng authored Aug 26, 2024
    69973f9

Commits on Aug 27, 2024

  1. [🤠]: Howdy folks, let's bump Dockerfile.ci to ef85bc9 ! (NVIDIA#10250)

    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Aug 27, 2024
    59a3e96
  2. [🤠]: Howdy folks, let's bump Dockerfile.ci to 01ca03f ! (NVIDIA#10266)

    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Signed-off-by: oliver könig <okoenig@nvidia.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Aug 27, 2024
    49f13fb
  3. Load model in the target export precision by default in PTQ (NVIDIA#10267)
    
    * Load model in the target export precision by default
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Enable megatron_amp_O2=true to actually use half-precision
    
    Signed-off-by: Jan Lasek <jlasek@nvidia.com>
    
    ---------
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    Signed-off-by: Jan Lasek <jlasek@nvidia.com>
    janekl authored Aug 27, 2024
    2f422dd
  4. Add WandbPlugin, NsysPlugin and PreemptionPlugin to nemo.lightning.run.plugins (NVIDIA#10223)
    
    * Add WandbPlugin, NsysPlugin and PreemptionPlugin to nemo.lightning.run.plugins
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    * Remove duplicate
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Add entity to wandb logger
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Add documentation
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    * Add warning
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    * PR feedback
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    * Add comments
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    Co-authored-by: hemildesai <hemildesai@users.noreply.github.com>
    hemildesai and hemildesai authored Aug 27, 2024
    fd75162
  5. [NeMo-UX] Handle absolute logger directories in nemo_logger (NVIDIA#10259)
    
    * handle absolute and relative logger directories
    
    Signed-off-by: Anna Shors <ashors@nvidia.com>
    
    * merge lines
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    ---------
    
    Signed-off-by: Anna Shors <ashors@nvidia.com>
    Signed-off-by: ashors1 <ashors@nvidia.com>
    ashors1 authored Aug 27, 2024
    38800cd
  6. Add sdxl notebook (NVIDIA#10139)

    * Add sdxl notebook
    
    Signed-off-by: mingyuanm <mingyuanm@nvidia.com>
    
    * Rename
    
    Signed-off-by: mingyuanm <mingyuanm@nvidia.com>
    
    * final Update SDXL notebook
    
    Signed-off-by: mingyuanm <mingyuanm@nvidia.com>
    
    ---------
    
    Signed-off-by: mingyuanm <mingyuanm@nvidia.com>
    Victor49152 authored Aug 27, 2024
    57aa305
  7. Add Llama31 Config (NVIDIA#10260)

    * add llama31 config
    
    * Apply isort and black reformatting
    
    Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>
    
    * fix init method
    
    * typo
    
    * revert llama3-70b init method std
    
    ---------
    
    Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>
    Co-authored-by: suiyoubi <suiyoubi@users.noreply.github.com>
    Co-authored-by: Chen Cui <chcui@nvidia.com>
    3 people authored Aug 27, 2024
    19668e5
  8. Added offloading support for LoRA adapters (NVIDIA#10237)

    Signed-off-by: Selvaraj Anandaraj <selvaraja@login-eos02.eos.clusters.nvidia.com>
    Co-authored-by: Selvaraj Anandaraj <selvaraja@login-eos02.eos.clusters.nvidia.com>
    Co-authored-by: Chen Cui <chcui@nvidia.com>
    3 people authored Aug 27, 2024
    c7c3eae
  9. Add Qwen2 to Nemo 2 (NVIDIA#10258)

    * add qwen2
    
    * typo
    
    * Apply isort and black reformatting
    
    Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>
    
    * qwen without pip install issue
    
    * Apply isort and black reformatting
    
    Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>
    
    * remove calculate vocab size divisible
    
    ---------
    
    Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>
    Co-authored-by: suiyoubi <suiyoubi@users.noreply.github.com>
    suiyoubi and suiyoubi authored Aug 27, 2024
    f53600a

Commits on Aug 28, 2024

  1. Lazy import tokenizers (NVIDIA#10213)

    * Move inflect to lazy import
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Use lazy imports for tokenizer libraries
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * sacremoses lazy import
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix cyclic import
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * import fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * move pangu
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Aug 28, 2024
    e68f981
  2. add rampup bs documentation (NVIDIA#9884) (NVIDIA#10289)

    * create documentation for rampup bs
    
    * fix format
    
    * fix format
    
    * fix config format
    
    * move config stage
    
    * add example
    
    * fix table
    
    * fix table
    
    * fix grammar
    
    * fix grammar
    
    ---------
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    dimapihtar authored Aug 28, 2024
    5ff7f22
  3. Add Starcoder to Nemo 2 (NVIDIA#10230)

    * Add sc1/sc2 to nemo-ux
    
    * Apply isort and black reformatting
    
    Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>
    
    * typo
    
    * Apply isort and black reformatting
    
    Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>
    
    * fix import
    
    * remove pip install dependency for sc
    
    * Apply isort and black reformatting
    
    Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>
    
    * remove pip install dependency for sc2
    
    * Apply isort and black reformatting
    
    Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>
    
    * typo
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com>
    Signed-off-by: Ao Tang <aot@nvidia.com>
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    Co-authored-by: suiyoubi <suiyoubi@users.noreply.github.com>
    Co-authored-by: Chen Cui <chcui@nvidia.com>
    Co-authored-by: cuichenx <cuichenx@users.noreply.github.com>
    4 people authored Aug 28, 2024
    4805fe9
  4. comment out ASR_dev_run_Speech_To_Text_HF_Finetuning until fixed (NVIDIA#10293)
    
    Signed-off-by: Pablo Garay <pagaray@nvidia.com>
    pablo-garay authored Aug 28, 2024
    2438fa9
  5. Adding a Garbage-collection callback for a synchronized garbage-collection across ALL processes (NVIDIA#10261)
    
    * Adding Garbage-collection callback for a synchronized gc across processes
    
    Signed-off-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com>
    
    * 1
    
    Signed-off-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: gautham-kollu <gautham-kollu@users.noreply.github.com>
    Signed-off-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com>
    
    * PR feedback
    
    Signed-off-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com>
    
    ---------
    
    Signed-off-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com>
    Signed-off-by: gautham-kollu <gautham-kollu@users.noreply.github.com>
    Co-authored-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com>
    Co-authored-by: gautham-kollu <gautham-kollu@users.noreply.github.com>
    3 people authored Aug 28, 2024
    5040546
  6. Do not overwrite wandb name in NeMo Logger (NVIDIA#10265)

    * Do not overwrite wandb name in NeMo Logger
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Do not overwrite tensorboard name
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Fix tests
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    ---------
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    hemildesai authored Aug 28, 2024
    1d2d507
  7. Bump Dockerfile.ci (2024-08-28) (NVIDIA#10278)

    * [🤠]: Howdy folks, let's bump `Dockerfile.ci` to 34e607e !
    
    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    
    * refactor: Deprecate `async_grad_allreduce`
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Aug 28, 2024
    5bbfa53
  8. Multimodal trtllm export and infer script (NVIDIA#10287)

    * salm export trtllm
    
    Signed-off-by: slyne deng <slyned@nvidia.com>
    
    * add export script
    
    Signed-off-by: slyne deng <slyned@nvidia.com>
    
    * fix style
    
    Signed-off-by: slyne deng <slyned@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: Slyne <Slyne@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: slyne deng <slyned@nvidia.com>
    Signed-off-by: Slyne <Slyne@users.noreply.github.com>
    Co-authored-by: slyne deng <slyned@nvidia.com>
    Co-authored-by: Slyne <Slyne@users.noreply.github.com>
    3 people authored Aug 28, 2024
    60ac8aa
  9. [TTS] Add config and modules for 22khz and 44khz audio codec (NVIDIA#10107)
    
    * [TTS] Add config and modules for 22khz and 44khz audio codec
    
    Signed-off-by: Ryan <rlangman@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: rlangman <rlangman@users.noreply.github.com>
    
    * [TTS] Add argument docstring to new modules
    
    Signed-off-by: Ryan <rlangman@nvidia.com>
    
    ---------
    
    Signed-off-by: Ryan <rlangman@nvidia.com>
    Signed-off-by: rlangman <rlangman@users.noreply.github.com>
    Co-authored-by: rlangman <rlangman@users.noreply.github.com>
    rlangman and rlangman authored Aug 28, 2024
    a860e6b
  10. Add example script to run NeMo 2.0 llama pretraining with NeMo-Run (NVIDIA#10226)
    
    * Add example script to run NeMo 2.0 llama pretraining with NeMo-Run
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Fixes
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Address PR comments
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    ---------
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    hemildesai authored Aug 28, 2024
    f45422a

Commits on Aug 29, 2024

  1. Add FSDP for NeMo 2.0 (NVIDIA#9748)

    * modify code structure and add strategy
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * correct doc url
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * extract common elements and add callback
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * add iomixin
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * update strategies
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * update callback
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * add training step to strategy
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * remove unused import
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * add iomixin to strategy & precision
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: artbataev <artbataev@users.noreply.github.com>
    
    * add val/test steps to strategy
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * add documentations
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * add default sharding for fsdp. add setup callback detection
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * extract checkpoint io logic
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * clean up unused imports
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * sync new megatron strategy changes
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * break down setup callback back into strategy
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * reorder stuff
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * fix data logic
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * minor fix
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * add dtensor ckpt conversion support
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * support hsdp
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * remove iomixin
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * fix import
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * refactor loss reduction
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * clean up
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * add unittest
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    
    * clean up
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    
    ---------
    
    Signed-off-by: jasonwan <jasonwan@nvidia.com>
    Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    Signed-off-by: artbataev <artbataev@users.noreply.github.com>
    Co-authored-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com>
    Co-authored-by: artbataev <artbataev@users.noreply.github.com>
    3 people authored Aug 29, 2024
    22f0bb0
  2. Export fp8 te nemo to trt-llm (NVIDIA#10096)

    * initial commit
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * PR draft
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * fixed scaling weights
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * fixed zarr loading, added flags, refactor
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
    
    * fix expert key mapping
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * refactor
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
    
    * fix: failed test was finishing with exit code 0
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * test commit -- rerun github checks
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * bugfix: naming
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * bugfix v2: naming
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * apply code review changes
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
    
    * fix TensorRTLLM build (fp8 still not supported)
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
    
    * undo refactor
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * bugfix: arguments to dist_convert
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
    Signed-off-by: Piotr Kamiński <67481570+Laplasjan107@users.noreply.github.com>
    Co-authored-by: Piotr Kaminski <pikaminski@nvidia.com>
    Co-authored-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
    3 people authored Aug 29, 2024
    9796b69
  3. Bugfix: loading scaling factors for pyt 24.07 (NVIDIA#10297)

    * bugfix: loading scaling factors
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * list instead of set
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    ---------
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    Co-authored-by: Piotr Kaminski <pikaminski@nvidia.com>
    Laplasjan107 and Piotr Kaminski authored Aug 29, 2024
    3ed93c1
  4. Sanity checks for unfinished checkpoints removal (NVIDIA#10228)

    * Added sanity checks
    
    Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: jbieniusiewi <jbieniusiewi@users.noreply.github.com>
    
    * Updated error msg
    
    Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com>
    
    * Added checks for NeMo 2.0
    
    Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com>
    
    * Updated NeMo 2.0 test
    
    Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: jbieniusiewi <jbieniusiewi@users.noreply.github.com>
    
    * fixed test_nemo_logger.py
    
    Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com>
    
    ---------
    
    Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com>
    Signed-off-by: jbieniusiewi <jbieniusiewi@users.noreply.github.com>
    Co-authored-by: jbieniusiewi <jbieniusiewi@users.noreply.github.com>
    jbieniusiewi and jbieniusiewi authored Aug 29, 2024
    006d65f
  5. allow disabling validation (NVIDIA#10273)

    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    maanug-nv authored Aug 29, 2024
    cdf61f9
  6. make torch_dist ckpt strategy as default (NVIDIA#9852) (NVIDIA#10291)

    copy of NVIDIA#9852
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    Co-authored-by: dimapihtar <dimapihtar@users.noreply.github.com>
    dimapihtar and dimapihtar authored Aug 29, 2024
    736a6fc
  7. TRT-LLM 0.12 + ModelOpt 0.17.0 updates (NVIDIA#10301)

    * Update trtllm-build options
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Pull QUANT_CFG_CHOICES into try/catch for HAVE_MODELOPT consistency
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Remove deprecated parallel group setup
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Remove deprecated size settings
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Use max_seq_len instead of max_output_len [part I]
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    ---------
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    janekl authored Aug 29, 2024
    ea0f69f
  8. add documentation for reset_lr feature (NVIDIA#9639) (NVIDIA#10290)

    * Change default parallel_save to False (NVIDIA#9633)
    
    * Unwrap ckpt_io for model opt (async save) (NVIDIA#9622) (NVIDIA#9634)
    
    * add reset_lr documentation
    
    * fix style
    
    * fix style
    
    * fix style
    
    * add image
    
    * fix typo
    
    * fix plot
    
    * fix plot
    
    * change plot size
    
    * fix style
    
    * move image
    
    * add reset_lr to intro page
    
    ---------
    
    Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com>
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    Signed-off-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com>
    Co-authored-by: mikolajblaz <mikolajblaz@users.noreply.github.com>
    dimapihtar and mikolajblaz authored Aug 29, 2024
    eff7ddd
  9. [NeMo UX] expose num_dataset_builder_threads argument (NVIDIA#10281)

    * expose num_dataset_builder_threads arg
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * update docstring
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    ---------
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    ashors1 authored Aug 29, 2024
    3ebe567
  10. Disable SP (NVIDIA#10282)

    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    akoumpa authored Aug 29, 2024
    d0128da
  11. ci: Selective triggering (NVIDIA#10195)

    * ci: Selective triggering
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * simplify pass-through
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * fix
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * fix
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * refactor tests that don't use template
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Aug 29, 2024
    81f18f6

Commits on Aug 30, 2024

  1. [🤠]: Howdy folks, let's bump Dockerfile.ci to 9ab31cb ! (NVIDIA#10311)

    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Aug 30, 2024
    4d5f1aa
  2. Log Gradient Norms (NVIDIA#10244)

    * override optimizer step to log
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    
    * import fix
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    
    * remove norm from output
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    
    * also log num zeros in grad
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    
    * bugfix
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    
    ---------
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    maanug-nv authored Aug 30, 2024
    b5d1d5f
  3. Add support for LoRA on vLLM (NVIDIA#10009)

    * Added basic support for adding LoRA checkpoints in HF format when running deploy_vllm_triton.py
    
    Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com>
    
    * Moved the conversion logic from the convert_nemo_to_canonical.py script to a reusable module, removed the tar unpacking, removed the dependencies on OmegaConf and NLPSaveRestoreConnector.
    
    Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com>
    
    * Implemented on-load conversion of Nemo format LoRA checkpoints into HF format for vLLM.
    
    Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com>
    
    * Added logger initialization, improved some messages.
    
    Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com>
    
    * Moved the LoRA converter script to nemo.export.utils.
    
    Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com>
    
    * Fixed the description of the query.py script.
    
    Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: apanteleev <apanteleev@users.noreply.github.com>
    
    * Fixed the missing file close.
    
    Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com>
    
    ---------
    
    Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com>
    Signed-off-by: apanteleev <apanteleev@users.noreply.github.com>
    Co-authored-by: apanteleev <apanteleev@users.noreply.github.com>
    Co-authored-by: Onur Yilmaz <35306097+oyilmaz-nvidia@users.noreply.github.com>
    Co-authored-by: Eric Harper <complex451@gmail.com>
    4 people authored Aug 30, 2024
    d886151
  4. Flexible passing args to TensorRTLLM in nemo_export.py (NVIDIA#10315)

    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    janekl authored Aug 30, 2024
    1ce9089
  5. add back HF Finetune script to CI (NVIDIA#10308)

    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    Co-authored-by: Nithin Rao Koluguri <nithinraok>
    nithinraok authored Aug 30, 2024
    e5f22a8
  6. Add Yi 1.5 34b Neva support (NVIDIA#10083)

    * Yi1.5 34b neva prompt template
    
    Signed-off-by: HuiyingLi <willwin.lee@gmail.com>
    
    * apply black and isort
    
    Signed-off-by: HuiyingLi <willwin.lee@gmail.com>
    
    * add tokenizer and inference configs in docstring
    
    Signed-off-by: HuiyingLi <willwin.lee@gmail.com>
    
    ---------
    
    Signed-off-by: HuiyingLi <willwin.lee@gmail.com>
    HuiyingLi authored Aug 30, 2024
    a777a44
  7. move to cpu only for log probs (NVIDIA#10316)

    * move to cpu only for log probs
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    Co-authored-by: Nithin Rao Koluguri <nithinraok>
    Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
    nithinraok and nithinraok authored Aug 30, 2024
    0ba9979
  8. [NeMo-UX] Don't create attention mask for GPTs (NVIDIA#10242)

    * remove attention mask as default
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * fix non TE case
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * fix
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
    Co-authored-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    3 people authored Aug 30, 2024
    b87e1e3

Commits on Aug 31, 2024

  1. Make get_optim_config iterable (NVIDIA#10318)

    * Make get_optim_config iterable
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Update error message
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    akoumpa authored Aug 31, 2024
    9a22005
  2. Fix llama3 pretraining NeMo 2.0 script (NVIDIA#10307)

    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    hemildesai authored Aug 31, 2024
    b698ae5

Commits on Sep 2, 2024

  1. Support TE-DPA For Stable Diffusion (NVIDIA#10288)

    * [SD] Add te-dpa support
    
    Signed-off-by: Wil Kong <alpha0422@gmail.com>
    
    * [SD] Add te-dpa support, resolve compatibility with TE-master
    
    Signed-off-by: Wil Kong <alpha0422@gmail.com>
    
    * [SD] Add te-dpa support, add check for attention configs.
    
    Signed-off-by: Wil Kong <alpha0422@gmail.com>
    
    * Fix bugs of flash-attn and dpa in SD.
    
    Signed-off-by: Wil Kong <alpha0422@gmail.com>
    
    * Fix the issue of DPA API change.
    
    Signed-off-by: Wil Kong <alpha0422@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: alpha0422 <alpha0422@users.noreply.github.com>
    Signed-off-by: Wil Kong <alpha0422@gmail.com>
    
    ---------
    
    Signed-off-by: Wil Kong <alpha0422@gmail.com>
    Signed-off-by: alpha0422 <alpha0422@users.noreply.github.com>
    Co-authored-by: Mengdi Wang <didow@nvidia.com>
    alpha0422 and Mengdi Wang authored Sep 2, 2024
    78357ae

Commits on Sep 3, 2024

  1. fix tokenizer restoration (NVIDIA#10336)

    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    akoumpa authored Sep 3, 2024
    8cd751b
  2. remove virtual pipeline parallel apex dependency (NVIDIA#10317)

    Signed-off-by: ashors1 <ashors@nvidia.com>
    ashors1 authored Sep 3, 2024
    9472fc3
  3. Add option to selectively load context in nemo.lightning.io (NVIDIA#10279)
    
    * Add option to selectively load context in nemo.lightning.io
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Update docstring
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Fixes
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    ---------
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    Co-authored-by: Marc Romeyn <mromeijn@nvidia.com>
    Co-authored-by: Abhishree Thittenamane <47577437+athitten@users.noreply.github.com>
    3 people authored Sep 3, 2024
    ac89593
  4. Add EP to mixtral-8x22b recipe (NVIDIA#10337)

    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    akoumpa authored Sep 3, 2024
    8eb1827
  5. Bugfix: export to trt-llm multi_block_mode flag (NVIDIA#10334)

    * bugfix
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
    
    * add quantisation flags to deploy_triton.py
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    
    ---------
    
    Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com>
    Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
    Co-authored-by: Piotr Kaminski <pikaminski@nvidia.com>
    Co-authored-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
    3 people authored Sep 3, 2024
    ab6aba3
  6. fix (NVIDIA#10339)

    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    yaoyu-33 authored Sep 3, 2024
    a1fd899

Commits on Sep 4, 2024

  1. Add comment to address a frequently asked question (NVIDIA#10321)

    * Add comment to address a frequently asked question
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * wording
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * add docstring to GPTSFTPackedDataset
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    ---------
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    cuichenx authored Sep 4, 2024
    0d2d7c4
  2. Fix async checkpointing in nemo.lightning (NVIDIA#10324)

    * Fix async checkpointing in nemo.lightning
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * PR feedback
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Add unit tests
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    Co-authored-by: hemildesai <hemildesai@users.noreply.github.com>
    hemildesai and hemildesai authored Sep 4, 2024
    dd02d02
  3. [Draft]Add Nemotron4 recipes and Long Context Recipe (NVIDIA#10262)

    * Add Nemotron recipes and Long Context Recipe
    
    Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
    
    * Add Nemotron4 recipes
    
    Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
    
    * Change long context recipe for llama3, mixtral, and nemotron. Add Expert Parallel Support
    
    Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
    
    * Revert Chainedoptimizer change
    
    Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
    
    * Update mixed_precision.py
    
    Signed-off-by: BoxiangW <45734921+BoxiangW@users.noreply.github.com>
    
    * Update mixtral_8x7b.py
    
    Signed-off-by: BoxiangW <45734921+BoxiangW@users.noreply.github.com>
    
    * Add option to not generate attention mask
    
    Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
    
    * Remove nemotron
    
    Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
    
    * Resolve mock conflict
    
    Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
    
    ---------
    
    Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
    Signed-off-by: BoxiangW <45734921+BoxiangW@users.noreply.github.com>
    Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    BoxiangW and akoumpa authored Sep 4, 2024
    32ba985
  4. [NeMo-UX] Adding copyright to collections.llm & lightning (NVIDIA#10345)

    * Adding copyright to collections.llm & lightning
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    Co-authored-by: marcromeyn <marcromeyn@users.noreply.github.com>
    marcromeyn and marcromeyn authored Sep 4, 2024
    8134f33
  5. added support for FC model in Diarization with ASR and timestamps (NVIDIA#10154)
    
    Signed-off-by: Kunal Dhawan <kunaldhawan97@gmail.com>
    Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
    KunalDhawan and nithinraok authored Sep 4, 2024
    73bec06
  6. Remove apply_query_key_layer_scaling for GPT models (NVIDIA#10349)

    * remove qk layer scaling
    
    * add sc2
    suiyoubi authored Sep 4, 2024
    d8efee9

Commits on Sep 5, 2024

  1. remove grad clipping from mixed_precision plugin (NVIDIA#10303)

    * remove grad clipping from mixed_precision plugin
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Raise an exception if user tries to use Trainer's clip_grad
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * Remove gradient_clip_val from recipes
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Sep 5, 2024
    7738b1d
  2. Add option to selectively restore model weights and optimizer states in AutoResume and MegatronStrategy (NVIDIA#10295)
    
    * Add option to selectively restore model weights and optimizer states in AutoResume and MegatronStrategy
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    * PR feedback
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Fix PEFT checkpointing
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    
    * address comments
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    
    * Refactor selective restore
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    * fix tokenizer issue for peft load
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    
    * fix peft optimizer states loading
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    
    * Fix fabric strategy
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * PR feedback
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Rename SelectiveRestoreConfig to RestoreConfig
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Update hf llama 3 paths
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Fixes
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Fix tests
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    ---------
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    Co-authored-by: hemildesai <hemildesai@users.noreply.github.com>
    Co-authored-by: Chen Cui <chcui@nvidia.com>
    Co-authored-by: cuichenx <cuichenx@users.noreply.github.com>
    4 people authored Sep 5, 2024
    19f904e
  3. alltoall (NVIDIA#10357)

    Signed-off-by: Malay Nagda <malayn@nvidia.com>
    malay-nagda authored Sep 5, 2024
    e6db2f3
  4. Fix links (NVIDIA#10359)

    * fix links
    
    Signed-off-by: eharper <eharper@nvidia.com>
    
    * fix links
    
    Signed-off-by: eharper <eharper@nvidia.com>
    
    ---------
    
    Signed-off-by: eharper <eharper@nvidia.com>
    ericharper authored Sep 5, 2024
    a567380
  5. Improve TE import guards (NVIDIA#10322)

    * improve TE import guards
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * small fixes
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * bug fixes
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    ---------
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    Co-authored-by: ashors1 <ashors1@users.noreply.github.com>
    ashors1 and ashors1 authored Sep 5, 2024
    a9746a6
  6. ci: Detect secrets (NVIDIA#10343)

    * ci: Add secrets detector
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * chore: Add baseline
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 5, 2024
    5bd2b89
  7. [🤠]: Howdy folks, let's bump Dockerfile.ci to 3396356 ! (NVIDIA#10353)

    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Sep 5, 2024
    fdf1979

Commits on Sep 6, 2024

  1. [NeMo-UX] Turn on mcore performance optimizations (NVIDIA#10209)

    * expose TP overlap
    
    Signed-off-by: Jieming Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * add tp overlap recipes
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * turn on pipeline parallel overlap
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * refactor
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * Update base.py
    
    Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
    
    * Update megatron_parallel.py
    
    Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
    
    * remove env var
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * add optimization config
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * fix typo
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * refactor into megatron parallel setup
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * refactor
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * fix config ordering, add wgrad deferral
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * cleanup
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * use config
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * clean
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * enable wgrad deferral
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * add grad bucket size
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * move everything into a callback
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * cleanup
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * fix imports
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * move userbuffer init
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * cleanup
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * fix VP
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * address comments
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * add gradient accum guard
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * Update base.py
    
    Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
    
    * address comments
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * address comments
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Jieming Zhang <jiemingz@nvidia.com>
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
    Co-authored-by: Jieming Zhang <jiemingz@nvidia.com>
    Co-authored-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    3 people authored Sep 6, 2024
    1d5de59
  2. [NeMo-UX] checkpointing improvements (NVIDIA#10241)

    * save model weights and artifacts to separate directories
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * add save_artifacts_on_train_end
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * do not save optimizer states in final checkpoint
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * WIP support for saving only last k optimizer states
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * minor cleanup
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Revert support for saving last k optimizer states. This will be addressed in a subsequent PR.
    
    * use storage_options to determine when to skip saving optimizer states
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * fix variable names, make checkpoint load work when optimizer states don't exist in the checkpoint
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * FSDP updates, provide option to save optimizer states on_train_end
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * simplify implementation, remove save_best_model option
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * update default value of ckpt_include_optimizer for fsdp
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * remove unused imports
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * remove unused import
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * cleanup
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * make storage_options optional again
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix failing tests
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * address some comments
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * use save_weights_only to determine whether to save optimizer states
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * add some comments
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix tests
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * fixes
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * remove unnecessary line
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    ---------
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    Co-authored-by: ashors1 <ashors1@users.noreply.github.com>
    ashors1 and ashors1 authored Sep 6, 2024
    34393c6
  3. [Nemo Unit Tests] Split CPU unit tests (NVIDIA#10365)

    * Split CPU unit tests
    
    * Split CPU unit tests
    
    * Fix:Run pytest in specific paths
    
    * Fix:Run pytest in specific paths
    
    * Fix:Run pytest in specific paths
    pablo-garay authored Sep 6, 2024
    ad5ef75
  4. ci: Fix checkout of secrets detector (NVIDIA#10381)

    * ci: Fix checkout of secrets detector
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * f
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 6, 2024
    95944ee
  5. only log consumed samples during training (NVIDIA#10371)

    Signed-off-by: ashors1 <ashors@nvidia.com>
    ashors1 authored Sep 6, 2024
    7ba0681

Commits on Sep 7, 2024

  1. 62c1dce
  2. [NeMo-UX] Checkpointing fixes (NVIDIA#10376)

    * remove save_best_model from default logger
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix broken checkpoint restore
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix fsdp
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * rename weights path to avoid confusion
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Revert "rename weights path to avoid confusion". We'll add this in a separate PR
    
    This reverts commit 72bae8b.
    
    ---------
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    ashors1 authored Sep 7, 2024
    9e372d3
  3. add auto configurator to NeMo (NVIDIA#10270)

    * add base configs
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add auto configurator functionality
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * add runner
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add end-to-end example for auto configurator
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add unit tests for auto configurator
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add GPT configs
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add GPT configs
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * switch to dataclass
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * switch to dataclass
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * fix dataclasses usage
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * remove unused imports
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * remove extra function
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix docstring style
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * take Config object as input for model
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * add nemotron support
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * remove search_config.py
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * move configs creation to Basic class
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * move to common basic class
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * rename main config
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * remove base configs for models
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: artbataev <artbataev@users.noreply.github.com>
    
    * change auto conf functionality
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * fix docstring
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * remove unused imports
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add changes
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * remove activations_checkpoint_num_layers
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * remove gbs from config
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix logs
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * fix performance calculation
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix end-to-end example
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * fix model config
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * minor changes
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * minor changes
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * fix unit tests
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * add README
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix README
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix README
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix readme
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix readme
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * remove extra arg
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * remove unused imports
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add nemo-run installation
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix unit tests
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix unit tests
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    ---------
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    Signed-off-by: artbataev <artbataev@users.noreply.github.com>
    Co-authored-by: dimapihtar <dimapihtar@users.noreply.github.com>
    Co-authored-by: artbataev <artbataev@users.noreply.github.com>
    3 people authored Sep 7, 2024
    cda2a63

Commits on Sep 8, 2024

  1. fix mixtraltopk (NVIDIA#10366)

    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Co-authored-by: Marc Romeyn <mromeijn@nvidia.com>
    akoumpa and marcromeyn authored Sep 8, 2024
    f666682
  2. ci: Fix release tag (NVIDIA#10367)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    e1f375e
  3. Akoumparouli/nemo ux tokenizer fix (NVIDIA#10351)

    * save tokenizer to disk
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Track Hf tokenizer assets
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * raise exception if dst file exists
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * minor
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * remove print
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add tokenizercontext
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Add TokenizerContext
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * restore tokenizer from separate dir
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * update artifact __init__.py
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * TokenizerContext connector
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * bugfix on_import_ckpt
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * rm code
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Drop tokenizercontext
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * drop tokenizer load from tokenizercontext
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * undo
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * undo
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Move to util function
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * use save_hf_tokenizer_assets
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * add tokenizer restoration in resume.py
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * bot fixes
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * rm
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * wrap tokenizer restoration in try/catch
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * load_artifacts
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * param fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * more fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * lazy import tensorboard
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * move code out of file context manager
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Allow skippable artifacts
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * rebase fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * checkpoint structure change update
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Sep 8, 2024
    a26ed2f
  4. Add option to resume from specific path in AutoResume (NVIDIA#10373)

    * Add option to resume from specific path in AutoResume
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Fix path
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    ---------
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    hemildesai authored Sep 8, 2024
    dd63de1
  5. ci: Cleanup of release-freeze automation (NVIDIA#10392)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    6f1c414
  6. ci: Toggle pre-release (NVIDIA#10394)

    * ci: Toggle pre-release
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * f
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    ab82b56
  7. ci: Toggle pre-release (NVIDIA#10395)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    bcf7e0f
  8. ci: Toggle pre-release (NVIDIA#10396)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    21cb949
  9. ci: Automate pre-release (NVIDIA#10397)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    30385aa
  10. Akoumparouli/nemo ux validate dataset asset accessibility (NVIDIA#10309)

    * Add validate_dataset_asset_accessibility
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Add CI tests for validate_dataset_asset_accessibility
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix for zipped lists
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Sep 8, 2024
    2404c4e
  11. [🤠]: Howdy folks, let's bump NeMo 2.1.0rc0 ! (NVIDIA#10399)

    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Co-authored-by: ko3n1g <16716991+ko3n1g@users.noreply.github.com>
    github-actions[bot] and ko3n1g authored Sep 8, 2024
    9921e6c
  12. ci: Update baseline (NVIDIA#10400)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    f6cd74b
  13. ci(chore): Minor change (NVIDIA#10401)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    94c5fd8
  14. ci: Swap merge/cherry-pick order (NVIDIA#10389)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    41502ff
  15. ci: Fix release tag (NVIDIA#10402)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    19382eb
  16. Ko3n1g/ci/fix release workflow 2 (NVIDIA#10403)

    * ci: Improve release workflow
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * ci: Fix cherry-picking
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    73a8ef8
  17. ci: Send Slack alert on failed cherry pick (NVIDIA#10404)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    a4f95f1
  18. ci: Allow concurrent docker system prune (NVIDIA#10405)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    0e5e5d5
  19. ci: Use PAT for cherry-picking (NVIDIA#10406)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    46e908e
  20. Alit/mamba ux cicd (NVIDIA#10370)

    * add mamba init
    
    * more ssm
    
    * add 370m
    
    * add hybrid
    
    * fix issue
    
    * integrate model and tokenizer config for ssm
    
    * add all mamba configs
    
    * modify state re pattern
    
    * revert gpt stuff
    
    * remove SSM class and training script
    
    * Apply isort and black reformatting
    
    Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com>
    
    * remove faulty export
    
    * add script to test
    
    * Apply isort and black reformatting
    
    Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com>
    
    * some recent fixes
    
    * Apply isort and black reformatting
    
    Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com>
    
    * test script tp/pp1
    
    * Apply isort and black reformatting
    
    Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com>
    
    * add cicd
    
    * include MLM mamba dist ckpt commit
    
    * add license head and address more comments
    
    * Apply isort and black reformatting
    
    Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com>
    
    * add guard
    
    * remove guard from TransformerConfig
    
    * update scripts
    
    * Apply isort and black reformatting
    
    Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com>
    Signed-off-by: Ali Taghibakhshi <71892896+JRD971000@users.noreply.github.com>
    Co-authored-by: Ali Taghibakhshi <ataghibakhsh@login-eos01.eos.clusters.nvidia.com>
    Co-authored-by: JRD971000 <JRD971000@users.noreply.github.com>
    Co-authored-by: oliver könig <okoenig@nvidia.com>
    4 people authored Sep 8, 2024
    9f9bf4d
  21. ci: Allow default token to write workflows (NVIDIA#10407)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    a95f3a2
  22. ci: More permissions for cherry-pick automation (NVIDIA#10409)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    4bf8101
  23. ci: Overhaul cherry-pick workflow (NVIDIA#10410)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    0a40662
  24. ci: Ignore failures on cherry-picking (NVIDIA#10411)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    0d0e724
  25. ci: Minor change (NVIDIA#10412)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    52c7f2a
  26. ci: Fix cherry-pick config (NVIDIA#10413)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    7d27792
  27. ci: Minor change (NVIDIA#10414)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    91863d2
  28. ci: Minor change (NVIDIA#10415)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    48fab9d
  29. ci: Remove dead code (NVIDIA#10416)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    573d910
  30. Ko3n1g/ci/test cherry picking 2 (NVIDIA#10417)

    * ci: Cherrypick continue on error
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * ci: Fix cherry pick branch
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    14c3d4a
  31. ci: Small test (NVIDIA#10419)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    aab78f0
  32. ci: Small fix (NVIDIA#10420)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 8, 2024
    b7ee0b8

Commits on Sep 9, 2024

  1. [NeMo-UX] Integrating CLI (NVIDIA#10300)

    * Adding nemo-run to requirements
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Updating nemo-run entrypoint inside setup.py
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Remove nemo-run from requirements until we have a pypi package
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Update entrypoint naming
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Setting up cli recipe for llama3-8b
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Move AutoTokenizer import inline for starcoder
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Move AutoTokenizer import inline for starcoder2
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Use target for factories inside llama3_8b
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Update other recipes
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Fix some bugs in the recipes
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Adding some examples
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Adding repl example
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Starting to add a notebook example as well
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Fix wrong imports
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * [pre-commit.ci] auto fixes from pre-commit.com hooks
    
    for more information, see https://pre-commit.ci
    
    * Apply isort and black reformatting
    
    Signed-off-by: pre-commit-ci[bot] <pre-commit-ci[bot]@users.noreply.github.com>
    
    * Fix wrong imports
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    
    * Fix typo + add script with default executor
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    
    * Add nemo-run to Dockerfile.ci
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Adding copyright to recipes
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    
    * Adding guides to recipes dir
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Adding hatchling to Dockerfile.ci
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Move install to different line
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * fix install
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Move llama3_pretraining to scripts for now
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Remove img folder & use images from release instead
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    
    * Updating default of num_nodes in all recipes
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    
    * Adding tests for all recipes
    
    Signed-off-by: Marc Romeijn <mromeijn@nvidia.com>
    
    * Adding docstrings
    
    Signed-off-by: Marc Romeijn <mromeijn@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    
    * Fix failing tests inside test_mixtral_8x7b_64k
    
    Signed-off-by: Marc Romeijn <mromeijn@nvidia.com>
    
    * Rename fabric to _fabric to avoid name collision with package fabric
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add rename comment
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    ---------
    
    Signed-off-by: Marc Romeyn <mromeijn@nvidia.com>
    Signed-off-by: pre-commit-ci[bot] <pre-commit-ci[bot]@users.noreply.github.com>
    Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com>
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    Signed-off-by: Marc Romeijn <mromeijn@nvidia.com>
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    Co-authored-by: pre-commit-ci[bot] <pre-commit-ci[bot]@users.noreply.github.com>
    Co-authored-by: marcromeyn <marcromeyn@users.noreply.github.com>
    Co-authored-by: Hemil Desai <hemild@nvidia.com>
    Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    7 people authored Sep 9, 2024
    d12fbbd
  2. [Nemo Unit Tests] Split GPU unit tests (NVIDIA#10380)

    * Split GPU unit tests
    
    * Make L0_Unit_Tests_GPU_Lightning optional since flaky
    
    * Add time for GPU_Core test
    
    * Add time for GPU_Audio test
    pablo-garay authored Sep 9, 2024
    fb39fad
  3. Support Energon as dataloader in NeVA (NVIDIA#10305)

    * energon support draft
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * removing yi, not in PR's scope
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * remove mp_spawn + add energon req
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * [pre-commit.ci] auto fixes from pre-commit.com hooks
    
    for more information, see https://pre-commit.ci
    
    * Apply isort and black reformatting
    
    Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com>
    
    * removing seq packing
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com>
    
    * fixing encode_sft
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * fix use_energon
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com>
    
    * adding docstrings + addressing comments + CodeQL fixes
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com>
    
    * addressing comments
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com>
    
    * add yi_34b
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com>
    
    * adding assertion for energon + peft, not supported due to dataloader being saved with dist_ckpt
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com>
    
    * energon usage warning
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: paul-gibbons <paul@gibbonspaul.com>
    Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com>
    Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
    Co-authored-by: paul-gibbons <paul-gibbons@users.noreply.github.com>
    Co-authored-by: Yu Yao <54727607+yaoyu-33@users.noreply.github.com>
    4 people authored Sep 9, 2024
    dc61f7a
  4. 24.07 perf numbers (NVIDIA#10253)

    Signed-off-by: Malay Nagda <malayn@nvidia.com>
    Co-authored-by: Sangkug Lym <slym@nvidia.com>
    malay-nagda and erhoo82 authored Sep 9, 2024
    8e3d65d
  5. ba7962e
  6. Neva update to NeMo 2.0 (NVIDIA#10292)

    * add some placeholders
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * few updates
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * update
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * fix logging
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * add neva mock dataset
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * update neva conversation
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * neva data update
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * fixes to conversation
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * data save
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Update paths and fix init
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * some fixes
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * fix combined embeddings logic
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * revert debug code
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Fix mock dataset for neva
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Fix dataset part and add llava
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Fix and updates
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Fix and updates
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Fix and updates
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Add checkpoint saving dev test script
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Fix and updates for neva pretraining
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * Fix and updates for neva finetuning
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * Fix minor issues in neva 2.0 upgrade
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Update init
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * update examples
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * Formatting
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * fix formatting
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Add license and fix formatting
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * Fix formatting
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Fix examples
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Fix neva api file
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Update tokens
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * Update arg apis
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * Update model and pretrain script
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * finetune and inference fixes
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * address comments
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * fix
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Fix dataset resuming
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * address comments
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * clean up
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * update loss mask class
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * fix
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * fix args
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * address comments
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    * remove examples
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * remove factory
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    ---------
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    Co-authored-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    yaoyu-33 and yaoyu-33 authored Sep 9, 2024
    176c54f
  7. e6f6a48
  8. [Nemo CICD] Make flaky test optional (NVIDIA#10438)

    * Make flaky test optional
    
    * Make flaky test optional
    pablo-garay authored Sep 9, 2024
    8f0d0c7

Commits on Sep 10, 2024

  1. [Nemo CICD] Make flaky test optional (NVIDIA#10442)

    * Make flaky test optional
    
    * Make flaky test optional
    
    * Make flaky test optional
    pablo-garay authored Sep 10, 2024
    4259169
  2. ci: Fix secrets detector on forks (NVIDIA#10426)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 10, 2024
    ae243d4
  3. [Nemo CICD] Make flaky test optional (NVIDIA#10446)

    * Make flaky test optional
    
    * Make flaky test optional
    
    * Make flaky test optional
    
    * Make flaky test optional
    
    ---------
    
    Signed-off-by: Pablo Garay <palenq@gmail.com>
    pablo-garay authored Sep 10, 2024
    c3e6a6e
  4. Expand pyproject.toml to include package metadata for uv (NVIDIA#10350)

    Also edits get_git_hash to not error when git is not installed in the
    local container.
    
    Signed-off-by: Peter St. John <pstjohn@nvidia.com>
    pstjohn authored Sep 10, 2024
    18d81b1
  5. 1be9cc1
  6. [Nemo CICD] Make flaky test optional (NVIDIA#10450)

    * Make flaky test optional
    
    * Make flaky test optional
    pablo-garay authored Sep 10, 2024
    766ded5
  7. [Nemo CICD] Make flaky test optional (NVIDIA#10452)

    * Make flaky test optional
    
    * Make flaky test optional
    
    * Make flaky test optional
    pablo-garay authored Sep 10, 2024
    3c4def6

Commits on Sep 11, 2024

  1. 79c4786
  2. 0df6610
  3. add parakeet-tdt_ctc-110m model (NVIDIA#10461)

    * add parakeet-tdt_ctc-110m model
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    Co-authored-by: Nithin Rao Koluguri <nithinraok>
    Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
    nithinraok and nithinraok authored Sep 11, 2024
    fd8c6a4
  4. Bump Dockerfile.ci (2024-09-09) (NVIDIA#10423)

    * [🤠]: Howdy folks, let's bump `Dockerfile.ci` to 8307fcd !
    
    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    
    * update TE import paths
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * Update parallelisms.rst
    
    fix sed typo.
    
    Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    
    * fix for mcore dist opt refactor: move overlap_grad_reduce/overlap_param_gather to ddp config
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * remove overlap_grad_reduce overlap_param_gather from autoconfig
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * subclass TransformerConfig because megatronmodule expects it to have fp8 attr
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * revert change; Use ModelParallelConfig & add fp8
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix, set NVTE_APPLY_QK_LAYER_SCALING=1
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    6 people authored Sep 11, 2024
    1163e1e
  5. MCORE interface for TP-only FP8 AMAX reduction (NVIDIA#10437)

    Signed-off-by: Sangkug Lym <slym@nvidia.com>
    Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    erhoo82 and akoumpa authored Sep 11, 2024
    46aa1ee
  6. Support MCORE Distributed Optimizer (NVIDIA#10363)

    * add the mcore interface for optim arg; overlap param AG with optimizer
    
    Signed-off-by: Sangkug Lym <slym@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: erhoo82 <erhoo82@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Sangkug Lym <slym@nvidia.com>
    Signed-off-by: erhoo82 <erhoo82@users.noreply.github.com>
    Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    Co-authored-by: erhoo82 <erhoo82@users.noreply.github.com>
    Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    3 people authored Sep 11, 2024
    2089c53

Commits on Sep 12, 2024

  1. 2c21e0a
  2. Fix nemo run entrypoints (NVIDIA#10464)

    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    hemildesai and pablo-garay authored Sep 12, 2024
    c611e53
  3. [Nemo CICD] Make flaky test optional (NVIDIA#10476)

    * Optional test needs optional field set true
    
    * make flaky test optional
    pablo-garay authored Sep 12, 2024
    70f4426
  4. add ci tests for Auto Configurator (NVIDIA#10390)

    * add ci tests for Auto Configurator
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix unit tests
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix ci tests
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix unit tests
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix ci tests
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix ci tests
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix unit tests
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    ---------
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    dimapihtar and pablo-garay authored Sep 12, 2024
    057041c

Commits on Sep 13, 2024

  1. [🤠]: Howdy folks, let's bump Dockerfile.ci to 76f9f48 ! (NVIDIA#10477)

    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Sep 13, 2024
    cb3df0b

Commits on Sep 15, 2024

  1. [NeMo-UX] Rename weights path during resume (NVIDIA#10391)

    * rename weights path to avoid confusion
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * use pathlib utils rather than os
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * update resume_from_path and context_path
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * address comment
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    ---------
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    ashors1 and pablo-garay authored Sep 15, 2024
    3a60491
  2. [NeMo-UX] Use custom BatchProgress class which does not restore states (NVIDIA#10383)

    * [WIP] fix batch sampler to match megatron dataloaders
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * make batchprogress configurable
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    Co-authored-by: ashors1 <ashors1@users.noreply.github.com>
    Co-authored-by: Shriya Rishab <69161273+ShriyaPalsamudram@users.noreply.github.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    4 people authored Sep 15, 2024
    b5798de

Commits on Sep 16, 2024

  1. Remove Apex dependency if not using norm (NVIDIA#10468)

    Signed-off-by: Chen Cui <chcui@nvidia.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    cuichenx and pablo-garay authored Sep 16, 2024
    9621be2
  2. Update adapter saving logic to be compatible with save_weights_only (NVIDIA#10466)

    * update adapter save logic to be compatible with `save_weights_only`
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    Co-authored-by: cuichenx <cuichenx@users.noreply.github.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    3 people authored Sep 16, 2024
    0f8a531
  3. Akoumparouli/nemo ux update param name (NVIDIA#10441)

    * NeMoLogger: update dir to log_dir
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * NeMoLogger: update calls
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Co-authored-by: Marc Romeyn <mromeijn@nvidia.com>
    akoumpa and marcromeyn authored Sep 16, 2024
    62deef0
  4. Draft: Expose MCore Cudagraph interface (NVIDIA#10121)

    * cuda graph modules
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * bug fixes
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * add mcore cuda graph interface
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * add mcore cuda graph interface
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * fix whitespace
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * change te import path
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * add cudagraph manager
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * separate external cudagraph flag
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * separate config flags
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * update mcore changes
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * add check
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
    Co-authored-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    5 people authored Sep 16, 2024
    99af1ce
  5. [NeMo-UX] Add token drop callback and optimize mixtral configs (NVIDIA#10361)

    * add token drop plugin
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * add checks
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * add expert parallel configs
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * amend comment
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * add comm overlap
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * fix rebase errors
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * fix typo
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * add test configs
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * fix
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    ---------
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
    Co-authored-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    5 people authored Sep 16, 2024
    cc494c9
  6. fix partial audio transcription order: (NVIDIA#10379)

    * fix partial audio transcription order:
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * update transcribe_speech.py
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * fix canary transcription
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    * for filepaths
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    * add override config option
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * remove unused imports
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * clean up
    
    Signed-off-by: stevehuang52 <heh@nvidia.com>
    
    * completely remove partial audio transcription
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    * update doc strings
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * remove unused imports
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * support for translate_speech.py
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * suggested changes from som
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    Signed-off-by: stevehuang52 <heh@nvidia.com>
    Co-authored-by: Nithin Rao Koluguri <nithinraok>
    Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
    Co-authored-by: stevehuang52 <heh@nvidia.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    4 people authored Sep 16, 2024
    a250726
  7. ci: Fix hyperlink to PR (NVIDIA#10494)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 16, 2024
    d419955
  8. 16568d7

Commits on Sep 17, 2024

  1. fix NeMoLogger log -> log_dir rename (NVIDIA#10498)

    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    akoumpa authored Sep 17, 2024
    8ff8804
  2. ci: Fix base branch of secrets detector (NVIDIA#10501)

    * ci: Fix base branch of secrets detector
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 17, 2024
    f6a905c
  3. Call reload_model_params only if there's no optimizer state (NVIDIA#10470)
    
    * Call reload_model_params only if there's no optimizer state
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add nemo-ux resume test
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Sep 17, 2024
    9f67409
  4. ci: Disable flaky secrets test (NVIDIA#10503)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 17, 2024
    cda4be3
  5. Add missing import guards for causal_conv1d and mamba_ssm dependencies (NVIDIA#10429)
    
    * Add causal_conv1d import guard
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Add mamba_ssm import guard
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: janekl <janekl@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    Signed-off-by: janekl <janekl@users.noreply.github.com>
    Co-authored-by: janekl <janekl@users.noreply.github.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    3 people authored Sep 17, 2024
    df3575a
  6. Update doc for fp8 trt-llm export (NVIDIA#10444)

    * Update doc for fp8 trt-llm export
    
    Signed-off-by: Piotr Kamiński <67481570+Laplasjan107@users.noreply.github.com>
    
    * Apply review suggestions
    
    Signed-off-by: Piotr Kamiński <67481570+Laplasjan107@users.noreply.github.com>
    
    * code review 
    
    Signed-off-by: Piotr Kamiński <67481570+Laplasjan107@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Piotr Kamiński <67481570+Laplasjan107@users.noreply.github.com>
    Laplasjan107 authored Sep 17, 2024
    308eaac
  7. [SD] TE-DPA: disable use of te-dpa in inference flow. (NVIDIA#10488)

    Co-authored-by: Mengdi Wang <didow@nvidia.com>
    alpha0422 and Mengdi Wang authored Sep 17, 2024
    16a1e0c
  8. Add py-modules to pyproject.toml (NVIDIA#10509)

    * Lower bound transformers to support nemotron
    
    Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>
    
    * Add py-modules
    
    Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>
    
    * Update py-modules
    
    Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>
    
    ---------
    
    Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>
    Co-authored-by: Dong Hyuk Chang <donghyukc@nvidia.com>
    thomasdhc and thomasdhc authored Sep 17, 2024
    da993db
  9. Add nemo2 conversion scripts for export (NVIDIA#10375)

    * Filter optimizer state in TRTLLM
    
    * Add convert script
    
    * Fix script
    
    * Add max positional embedding
    
    * Apply isort and black reformatting
    
    Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>
    
    * Add header comment
    
    ---------
    
    Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>
    Co-authored-by: meatybobby <meatybobby@users.noreply.github.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    3 people authored Sep 17, 2024
    a7d1896

Commits on Sep 18, 2024

  1. [🤠]: Howdy folks, let's bump Dockerfile.ci to 0bda578 ! (NVIDIA#10518)

    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Sep 18, 2024
    848bdfb
  2. Update modelopt to 0.17.0 (NVIDIA#10489)

    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    janekl authored Sep 18, 2024
    bb4b5c6
  3. add save_last_n_optim_states flag (NVIDIA#10098)

    * add save_last_n_optim_states flag
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * add feature to drop optimizer states from checkpoint
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * fix drop_optim_states logic
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * remove file
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * undo remove extra line
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix style
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add drop optimizer states support
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * change drop optim states logic
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * remove extra import
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * remove extra parameter
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix style
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add unit test for drop optim states feature
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add unit test for drop optim states feature
    
    Co-authored-by: oliver könig <okoenig@nvidia.com>
    Signed-off-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com>
    
    * move drop optim scripts to NeMoModelCheckpoint
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * move drop_optim_states functionality to NeMoModelCheckpoint
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * remove drop_optim_states functionality from NLPDDPStrategy
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * remove extra param
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * reset changes
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * remove extra line
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add safe save and remove ckpt when drop optim states
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * remove unused param
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add load state dict
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * fix drop optim strategy
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * return extra line
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * remove ckpt on a single node
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * change params naming
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * fix bug
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix bug
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * add file
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * revert changes
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * restore current ckpt state dict when dropping optim states
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * minor changes
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * minor changes
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    Signed-off-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com>
    Co-authored-by: dimapihtar <dimapihtar@users.noreply.github.com>
    Co-authored-by: oliver könig <okoenig@nvidia.com>
    3 people authored Sep 18, 2024
    0504c92
  4. fix asr finetune (NVIDIA#10508)

    * fix asr finetune
    
    Signed-off-by: stevehuang52 <heh@nvidia.com>
    
    * fix vad infer
    
    Signed-off-by: stevehuang52 <heh@nvidia.com>
    
    ---------
    
    Signed-off-by: stevehuang52 <heh@nvidia.com>
    stevehuang52 authored Sep 18, 2024
    bb721f8
  5. add a feature to drop checkpoint layers (NVIDIA#10200)

    * add skip_train support
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add drop_layers support
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add script example of how to trim the model
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add documentation for drop_layers feature
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add unit test for skip_train feature
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add unit test for skip_train feature
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix syntax error
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix skip_train test
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * add noop_forward_patch
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    * fix doc file
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix doc file
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix doc file
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix doc file
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix doc file
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * fix doc file
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    ---------
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    Signed-off-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com>
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    Co-authored-by: dimapihtar <dimapihtar@users.noreply.github.com>
    dimapihtar and dimapihtar authored Sep 18, 2024
    f4ef524
  6. move test (NVIDIA#10529)

    Signed-off-by: ashors1 <ashors@nvidia.com>
    ashors1 authored Sep 18, 2024
    cd861e2

Commits on Sep 19, 2024

  1. handle logging case where grad_norm is None (NVIDIA#10457)

    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    akoumpa authored Sep 19, 2024
    07c1c80
  2. Make nemo_run dependency optional (llm/__init__ ) (NVIDIA#10453)

    * Make llm/__init__ nemo_run dependency optional
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: Marc Romeyn <mromeijn@nvidia.com>
    3 people authored Sep 19, 2024
    05573d7
  3. move mamba installation (NVIDIA#10447)

    * move mamba installation
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * FIX
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * remove mounts
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add no-deps
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add --no-index
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * --no-binary
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * --no-binary
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * use git clone
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add --depth=1 to mcore git clone
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix typo
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Move mamba before nemo_run
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * revert megatron git clone
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    akoumpa and pablo-garay authored Sep 19, 2024
    8a244ff
  4. Update inference tests scripts and models (NVIDIA#10505)

    * Update infer_data_path.py
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Remove unused download_nemo_checkpoint function
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Change file permissions
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Bugfix for trt_llm_export_kwargs
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Update export test script and model metadata
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: janekl <janekl@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    Signed-off-by: janekl <janekl@users.noreply.github.com>
    Co-authored-by: janekl <janekl@users.noreply.github.com>
    Co-authored-by: Onur Yilmaz <35306097+oyilmaz-nvidia@users.noreply.github.com>
    3 people authored Sep 19, 2024
    28851be
  5. Adds Llama 3.1 405b configurations (NVIDIA#10472)

    * adds Llama 3.1 405b configurations
    
    Signed-off-by: Elnifio <elnifio0519@gmail.com>
    
    * refines comments
    
    Signed-off-by: Elnifio <elnifio0519@gmail.com>
    
    * removes untested parts
    
    Signed-off-by: Elnifio <elnifio0519@gmail.com>
    
    * removes associated unused libraries
    
    Signed-off-by: Elnifio <elnifio0519@gmail.com>
    
    * small patches
    
    Signed-off-by: Elnifio <elnifio0519@gmail.com>
    
    ---------
    
    Signed-off-by: Elnifio <elnifio0519@gmail.com>
    Co-authored-by: Shriya Rishab <69161273+ShriyaPalsamudram@users.noreply.github.com>
    Elnifio and ShriyaPalsamudram authored Sep 19, 2024
    b721f12
  6. FP8 plugin recipes (NVIDIA#10208)

    * create mixed precision plugin recipes
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    
    * cleanup
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    
    * fix annotation
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    
    * full definition instead of attach
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    
    * update docstrings
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    
    * update fp8
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    
    * refactor import
    
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    
    ---------
    
    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com>
    Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    Co-authored-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com>
    Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com>
    Co-authored-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
    4 people authored Sep 19, 2024
    3e251a0
  7. [nemo-ux] Added nemotron recipes and tests (NVIDIA#10432)

    * [NeMo-UX] Added nemotron recipes and tests
    
    Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com>
    
    * fixed issues with precision handling
    
    Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ahmadki <ahmadki@users.noreply.github.com>
    Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com>
    
    * missing LoRA config in finetuning
    
    Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com>
    
    * nemotron recipe imports
    
    Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com>
    
    * cleanup and linting
    
    Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com>
    
    * moved gradient clipping to OptimizerConfig
    
    Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com>
    
    * Added long context nemotron
    
    Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com>
    
    ---------
    
    Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com>
    Signed-off-by: ahmadki <ahmadki@users.noreply.github.com>
    Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    Co-authored-by: ahmadki <ahmadki@users.noreply.github.com>
    Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    3 people authored Sep 19, 2024
    259744e
  8. Pass mock to GPTDatasetConfig (NVIDIA#10435)

    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Co-authored-by: Pablo Garay <palenq@gmail.com>
    akoumpa and pablo-garay authored Sep 19, 2024
    3653bed
  9. added energon dataloader for neva training (NVIDIA#10451)

    * added energon dataloader for neva training
    
    * Apply isort and black reformatting
    
    Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com>
    
    * specify global batch size to support grad accumulation
    
    * adding neva pretrain example
    
    * Apply isort and black reformatting
    
    Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com>
    
    * change pretrain example to handle new ckpt reloading
    
    * fixed code quality warnings and unused imports
    
    Signed-off-by: ykarnati <ykarnati@nvidia.com>
    
    * minor changes for PR comments
    
    * Apply isort and black reformatting
    
    Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com>
    
    * refactor conversation template config
    
    * Apply isort and black reformatting
    
    Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com>
    
    * remove optional import
    
    ---------
    
    Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com>
    Signed-off-by: ykarnati <ykarnati@nvidia.com>
    Co-authored-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com>
    yashaswikarnati and yashaswikarnati authored Sep 19, 2024
    7354740
  10. Add unit tests for model configs in nemo.collections.llm (NVIDIA#10497)

    * Add unit tests for model configs in nemo.collections.llm
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Add nemotron tests
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Add gemma
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Add other models
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    * PR feedback
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    Co-authored-by: hemildesai <hemildesai@users.noreply.github.com>
    hemildesai and hemildesai authored Sep 19, 2024
    9e1ce6f
  11. nemo-ux: optim & model state restore test (NVIDIA#10325)

    * optim restore test
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add missing import
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Sep 19, 2024
    8d3e561
  12. Add copyright headers to nemo llm examples (NVIDIA#10543)

    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    hemildesai authored Sep 19, 2024
    45ff28f

Commits on Sep 20, 2024

  1. upgrade librosa version to fix librosa.display.specshow issue, matplotlib get_cmap (NVIDIA#10542) (NVIDIA#10553)
    
    Signed-off-by: Kunal Dhawan <kunaldhawan97@gmail.com>
    Co-authored-by: Kunal Dhawan <kunaldhawan97@gmail.com>
    github-actions[bot] and KunalDhawan authored Sep 20, 2024
    d2d2aa0
  2. replace unbiased with correction (NVIDIA#10555)

    * replace unbiased with correction
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    Co-authored-by: Nithin Rao Koluguri <nithinraok>
    Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
    nithinraok and nithinraok authored Sep 20, 2024
    44d2ae7

Commits on Sep 21, 2024

  1. Akoumparouli/nemo ux ckpt conv bugfix (NVIDIA#10558)

    * Call original _te._get_extra_te_kwargs and modify device
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add dtype utils
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Fix Llama ckpt conversion dtype
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * propagate ckpt precision changes
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * minor improvement
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * bot happy
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Sep 21, 2024
    bc10d7c
  2. add autoresume to nemo 2 test (NVIDIA#10556)

    Signed-off-by: ashors1 <ashors@nvidia.com>
    ashors1 authored Sep 21, 2024
    d2af2a4

Commits on Sep 22, 2024

  1. ci: Add original author as reviewer to cherry-pick (NVIDIA#10566)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 22, 2024
    cfc9a6c
  2. ci: Improve title of cherry-picked PR (NVIDIA#10568)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 22, 2024
    e7e55b2
  3. [🤠]: Howdy folks, let's bump Dockerfile.ci to c394f78 ! (NVIDIA#10562)

    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Sep 22, 2024
    0ee4d7e
  4. ci: Further improve cherry pick title (NVIDIA#10569)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 22, 2024
    ed23cc7
  5. [🤠]: Howdy folks, let's bump Dockerfile.ci to 811a26a ! (NVIDIA#10565)

    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Signed-off-by: oliver könig <okoenig@nvidia.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Sep 22, 2024
    f7f7d1a
  6. ci: Send link to release page (NVIDIA#10570)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 22, 2024
    cb5c2b7
  7. ci: Add label to cherry pick PR (NVIDIA#10574)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 22, 2024
    38c0e3d

Commits on Sep 23, 2024

  1. [🤠]: Howdy folks, let's bump Dockerfile.ci to 8e69382 ! (NVIDIA#10577)

    Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
    Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
    ko3n1g and pablo-garay authored Sep 23, 2024
    eb274ab
  2. Remove running validation after finetuning (NVIDIA#10560)

    * remove running validation after finetuning
    
    * Apply isort and black reformatting
    
    Signed-off-by: huvunvidia <huvunvidia@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: huvunvidia <huvunvidia@users.noreply.github.com>
    Co-authored-by: Huy Vu2 <huvu@login-eos01.eos.clusters.nvidia.com>
    Co-authored-by: huvunvidia <huvunvidia@users.noreply.github.com>
    3 people authored Sep 23, 2024
    110db0c
  3. bugfix (NVIDIA#10561)

    Signed-off-by: Maanu Grover <maanug@nvidia.com>
    maanug-nv authored Sep 23, 2024
    9ed0d6c
  4. remove exp dir (NVIDIA#10460)

    * remove exp dir
    
    * add back tests
    
    * Apply isort and black reformatting
    
    Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com>
    
    * add run_id to dir
    
    * fix dir->log_dir
    
    ---------
    
    Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com>
    Co-authored-by: JRD971000 <JRD971000@users.noreply.github.com>
    JRD971000 and JRD971000 authored Sep 23, 2024
    c6d1b7d
  5. ci: Send direct alert on failed cherry-pick (NVIDIA#10588)

    * ci: Send direct alert on failed cherry-pick
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * also to me
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * f
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 23, 2024
    6400bd5
  6. Add ConfigValidation plugin to nemo.lightning.run (NVIDIA#10541)

    * Add validation plugin to nemo.lightning.run
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    * Rename to ConfigValidationPlugin
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Add tests for each recipes with plugins
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    
    * Add nsys plugin test
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    * Add more tests
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    
    ---------
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    Signed-off-by: hemildesai <hemildesai@users.noreply.github.com>
    Co-authored-by: hemildesai <hemildesai@users.noreply.github.com>
    hemildesai and hemildesai authored Sep 23, 2024
    7439b13
  7. Fix pps issue on nemo export (NVIDIA#10544)

    * fix minor import bug
    
    Signed-off-by: Onur Yilmaz <oyilmaz@nvidia.com>
    
    * fix pps bug
    
    * remove config file
    
    Signed-off-by: Onur Yilmaz <oyilmaz@nvidia.com>
    
    ---------
    
    Signed-off-by: Onur Yilmaz <oyilmaz@nvidia.com>
    Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
    oyilmaz-nvidia and shanmugamr1992 authored Sep 23, 2024
    c02ea12

Commits on Sep 24, 2024

  1. fix type error in llm collection (NVIDIA#10552)

    * fix type bugs
    
    Signed-off-by: stevehuang52 <heh@nvidia.com>
    
    * Update mixin.py
    
    add type hint
    
    Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: stevehuang52 <stevehuang52@users.noreply.github.com>
    
    * Update mixin.py
    
    Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: stevehuang52 <stevehuang52@users.noreply.github.com>
    
    * Update mixin.py
    
    Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: stevehuang52 <heh@nvidia.com>
    Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com>
    Signed-off-by: stevehuang52 <stevehuang52@users.noreply.github.com>
    Co-authored-by: stevehuang52 <stevehuang52@users.noreply.github.com>
    stevehuang52 and stevehuang52 authored Sep 24, 2024
    53a10a7
  2. ci: Safer sequence escaping (NVIDIA#10595)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 24, 2024
    6023b80
  3. ci: Fix issues with version bump (NVIDIA#10467)

    * ci: Fix issues with version bump
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * fix
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * f
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * f
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 24, 2024
    c4e4157
  4. ci: Add missing test specs (NVIDIA#10597)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 24, 2024
    810d07f
  5. Extending modelopt spec for TEDotProductAttention (NVIDIA#10523)

    * Extend modelopt spec for TEDotProductAttention to support sliding window attention
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    * Simplify import guarding
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    
    ---------
    
    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    janekl authored Sep 24, 2024
    0fad1c1
  6. Update Multi_Task_Adapters.ipynb (NVIDIA#10600)

    Signed-off-by: Piotr Żelasko <petezor@gmail.com>
    pzelasko authored Sep 24, 2024
    849e7e0
  7. Change default for always_save_context to True (NVIDIA#10547)

    Signed-off-by: Abhishree <abhishreetm@gmail.com>
    athitten authored Sep 24, 2024
    f351f64
  8. Import guard for SimpleMultiModalDataModule (NVIDIA#10592)

    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    akoumpa authored Sep 24, 2024
    70bc06b
  9. add support for train_time_interval to consider hydra object (NVIDIA#10559)
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    Co-authored-by: Nithin Rao Koluguri <nithinraok>
    nithinraok authored Sep 24, 2024
    9d5a1aa
  10. Move update_config_with_dtype_overrides logging to debug (NVIDIA#10602)

    * Move update_config_with_dtype_overrides logging to debug
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * update comment
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    akoumpa authored Sep 24, 2024
    877144a

Commits on Sep 25, 2024

  1. ci: Wrap into quotes (NVIDIA#10616)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 25, 2024
    0ec10d2
  2. Romeyn/sampler (NVIDIA#10525)

    * Introducing MegatronStep to make things more extensible
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Improve megatron callbacks
    
    Signed-off-by: Marc Romeijn <mromeijn@nvidia.com>
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Some small fixes
    
    Signed-off-by: Marc Romeijn <mromeijn@nvidia.com>
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * remove debg code
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add forward_only to forward backward func
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add global-batch-sampler support to MegatronStep
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: Marc Romeijn <mromeijn@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: Marc Romeijn <mromeijn@nvidia.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: sichu <sichu@nvidia.com>
    4 people authored Sep 25, 2024
    e8304d6
  3. Add inference optimization blog post announcement to README (NVIDIA#10623)
    
    Signed-off-by: Piotr Żelasko <petezor@gmail.com>
    pzelasko authored Sep 25, 2024
    e35a659

Commits on Sep 26, 2024

  1. Fix mb_calculator import in lora tutorial (NVIDIA#10624)

    Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
    BoxiangW authored Sep 26, 2024
    dcc3a16
  2. Fix LoRA contiguous tensor (NVIDIA#10611)

    * contiguous
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * fix load
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * add test script
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: artbataev <artbataev@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Chen Cui <chcui@nvidia.com>
    Signed-off-by: cuichenx <cuichenx@users.noreply.github.com>
    Signed-off-by: artbataev <artbataev@users.noreply.github.com>
    Co-authored-by: cuichenx <cuichenx@users.noreply.github.com>
    Co-authored-by: artbataev <artbataev@users.noreply.github.com>
    3 people authored Sep 26, 2024
    51f47f1
  3. Fix Clip initializing issue in r2.0.0 (NVIDIA#10585)

    * update post process
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Fix wild card matching in nemo2
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com>
    Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    Co-authored-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
    yaoyu-33 and yaoyu-33 authored Sep 26, 2024
    016c1e4
  4. Adding T5 to NeMo 2.0 (NVIDIA#10263)

    * huvu/t5_nemo2.0 first commit from local
    
    * runable training
    
    * commit to save
    
    * update nemo/collections/llm/t5/data/pre_training.py, adding cicd test
    
    * updating codes
    
    * reset nemo/collections/nlp/parts/megatron_trainer_builder.py
    
    * reset megatron_lm_encoder_decoder_model.py, remove t5_release_test_config.sh
    
    * update init files
    
    * update Dockerfile.ci
    
    * fix wandb for cicd test
    
    * update training data path
    
    * remove uninstall TE
    
    * update .github/workflows/cicd-main.yml, disable fused/flashAttn
    
    * adjusting val_check_interval for action ci-cd tests
    
    * restore .github/workflows/cicd-main.yml
    
    * update
    
    * update nemologger args
    
    * just run pretraining in cicd test
    
    * update code llm strutures
    
    * adding tests/collections/llm/megatron_t5_pretraining.py
    
    * fix cicd
    
    * fix cicd
    
    * fix cicd
    
    * Apply isort and black reformatting
    
    Signed-off-by: huvunvidia <huvunvidia@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: huvunvidia <huvunvidia@users.noreply.github.com>
    Signed-off-by: Huy Vu <86480512+huvunvidia@users.noreply.github.com>
    Co-authored-by: Huy Vu2 <huvu@login-eos01.eos.clusters.nvidia.com>
    Co-authored-by: huvunvidia <huvunvidia@users.noreply.github.com>
    3 people authored Sep 26, 2024
    eee0137
  5. ci: Add CICD result feedback (NVIDIA#10629)

    * ci: Add CICD result feedback
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * f
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 26, 2024
    a98c5ed
  6. .nemo conversion bug fix (NVIDIA#10598)

    * bug fix
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * bug fix
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: dimapihtar <dpihtar@gmail.com>
    Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com>
    Co-authored-by: dimapihtar <dimapihtar@users.noreply.github.com>
    dimapihtar and dimapihtar authored Sep 26, 2024
    a6c2fef
  7. ci: Fix mention (NVIDIA#10635)

    * ci: Fix mention
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * f
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 26, 2024
    38e5e09
  8. Fix asr warnings (NVIDIA#10469)

    * check
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * 1
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * added to examples/asr
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * deprecates cuda.amp.autocast to replace with amp.autocast(**Args)
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    * check
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * 1
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * added to examples/asr
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * deprecates cuda.amp.autocast to replace with amp.autocast(**Args)
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    * tested on V100
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * Apply isort and black reformatting
    
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    
    * replace cuda for jit scripted modules
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * device type fix
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    * revert diar for CI
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    
    ---------
    
    Signed-off-by: Nithin Rao Koluguri <nithinraok>
    Signed-off-by: nithinraok <nithinraok@users.noreply.github.com>
    Co-authored-by: Nithin Rao Koluguri <nithinraok>
    Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
    nithinraok and nithinraok authored Sep 26, 2024
    4a9a226
  9. ci: Fix hyperlink for feedback (NVIDIA#10637)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 26, 2024
    ab4859b
  10. sdpa flag to false & sdpa_backend arg

    Signed-off-by: WoodieDudy <goshagks@gmail.com>
    WoodieDudy committed Sep 26, 2024
    3e31500
  11. Apply isort and black reformatting

    Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
    WoodieDudy committed Sep 26, 2024
    d604f8b
  12. change arg name

    Signed-off-by: WoodieDudy <goshagks@gmail.com>
    WoodieDudy committed Sep 26, 2024
    41acec1
  13. Apply isort and black reformatting

    Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
    WoodieDudy committed Sep 26, 2024
    e2aab5b
  14. Support LoRA in TensorRTMMExporter (NVIDIA#10347)

    * Support LoRA in TensorRTMMExporter
    
    * Apply isort and black reformatting
    
    Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>
    
    * Add lora uids
    
    * Fix mm lora
    
    * Apply isort and black reformatting
    
    Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>
    
    * Fix lora
    
    * Fix conf and deploy script
    
    * Apply isort and black reformatting
    
    Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: meatybobby <meatybobby@users.noreply.github.com>
    Co-authored-by: meatybobby <meatybobby@users.noreply.github.com>
    meatybobby and meatybobby authored Sep 26, 2024
    5e66cad

Commits on Sep 27, 2024

  1. Nemo ux HF import tests (NVIDIA#10274)

    * Add __all__ for mistral & mixtral
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Add model import test
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Add to cicd
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * skip mixtral test until use_safetnsors arg is resolved
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * syntax
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * typo
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * remove unused imports
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * remove streaming ckpt
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Update ckpt paths
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Sep 27, 2024
    5b88aaa
  2. chore(ci): Increase shm to 64gb (NVIDIA#10656)

    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Sep 27, 2024
    a725511
  3. Add lazy init for export (NVIDIA#10613)

    * Add lazy init for export
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Sep 27, 2024
    fdaf607
  4. Update modelopt layer spec for Mixtral (NVIDIA#10660)

    Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
    janekl authored Sep 27, 2024
    4f59502
  5. Update llm recipe README to add a note about handling multi-process jobs (NVIDIA#10649)
    
    Signed-off-by: Hemil Desai <hemild@nvidia.com>
    hemildesai authored Sep 27, 2024
    d51d8b9
  6. Support Canary parallel inference (NVIDIA#9517)

    * add Canary cats
    
    Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: karpnv <karpnv@users.noreply.github.com>
    
    * rm use_distributed_sampler=False
    
    Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: karpnv <karpnv@users.noreply.github.com>
    
    * rm use_distributed_sampler
    
    Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com>
    
    * update lhotse
    
    Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: karpnv <karpnv@users.noreply.github.com>
    
    * fix global_rank
    
    Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com>
    
    * OmegaConf.set_struct
    
    Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com>
    
    * review fix
    
    Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: karpnv <karpnv@users.noreply.github.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: pzelasko <pzelasko@users.noreply.github.com>
    
    * predict_step return
    
    Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com>
    
    ---------
    
    Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com>
    Signed-off-by: karpnv <karpnv@users.noreply.github.com>
    Signed-off-by: Piotr Żelasko <petezor@gmail.com>
    Signed-off-by: pzelasko <pzelasko@users.noreply.github.com>
    Co-authored-by: karpnv <karpnv@users.noreply.github.com>
    Co-authored-by: Piotr Żelasko <petezor@gmail.com>
    Co-authored-by: pzelasko <pzelasko@users.noreply.github.com>
    4 people authored Sep 27, 2024
    cbb1344
  7. adding resume pretraining to CICD (NVIDIA#10640)

    Co-authored-by: Huy Vu2 <huvu@login-eos01.eos.clusters.nvidia.com>
    huvunvidia and Huy Vu2 authored Sep 27, 2024
    23c7de1

Commits on Sep 30, 2024

  1. Require setuptools>=70 and update deprecated api (NVIDIA#10659)

    * Require setuptools>=70 and update deprecated api
    
    Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: thomasdhc <thomasdhc@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com>
    Signed-off-by: thomasdhc <thomasdhc@users.noreply.github.com>
    Co-authored-by: Dong Hyuk Chang <donghyukc@nvidia.com>
    Co-authored-by: thomasdhc <thomasdhc@users.noreply.github.com>
    3 people authored Sep 30, 2024
    fd78cc6
  2. Akoumparouli/fix get tokenizer list (NVIDIA#10596)

    * import fix
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * add get_tokenizer_list import test
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * cycle import fix attempt
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix2
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * fix3
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    * avoid import
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * remove relative imports
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    
    ---------
    
    Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
    Signed-off-by: akoumpa <akoumpa@users.noreply.github.com>
    Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
    akoumpa and akoumpa authored Sep 30, 2024
    9913441
  3. [NeMo-UX] Support save_last="link" (NVIDIA#10548)

    * provide support for save_last='link'
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix symlinks when top_k checkpoint not saved
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * support symlinks with async checkpointing
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * only unlink on rank 0
    
    Signed-off-by: Anna Shors <ashors@nvidia.com>
    
    * fix race condition
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * force linked checkpoint to correspond to last finalized checkpoint
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix last_model_path after restore
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * move symlink removal to strategy
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * remove unneeded lines
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * add some more documentation
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * address some comments
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix syntax
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * avoid overwriting _save_last_checkpoint
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix base call
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * small fix
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * add test for save_last=link
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * clean up test
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * use megatroncheckpointio in test
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * add async test and clean up
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix remaining merge conflicts
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * check number of saved checkpoints
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * remove unused import
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * run test on gpu only
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * fix a small bug and add a resume test
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    * Apply isort and black reformatting
    
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    
    * remove old comment
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    
    ---------
    
    Signed-off-by: ashors1 <ashors@nvidia.com>
    Signed-off-by: Anna Shors <ashors@nvidia.com>
    Signed-off-by: ashors1 <ashors1@users.noreply.github.com>
    Co-authored-by: ashors1 <ashors1@users.noreply.github.com>
    ashors1 and ashors1 authored Sep 30, 2024
    d664b74
  4. Update the downloading path (NVIDIA#10662)

    Signed-off-by: mingyuanm <mingyuanm@nvidia.com>
    Victor49152 authored Sep 30, 2024
    c0a05f6

Commits on Oct 1, 2024

  1. ci: Stability to CI/CD (NVIDIA#10694)

    * ci: Switch to `--runtime=nvidia`
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * fix success marker
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    * f
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    
    ---------
    
    Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
    ko3n1g authored Oct 1, 2024
    32503fd
  2. 7660730
  3. fix config args

    Signed-off-by: WoodieDudy <goshagks@gmail.com>
    WoodieDudy committed Oct 1, 2024
    86e60c3
  4. Apply isort and black reformatting

    Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
    WoodieDudy committed Oct 1, 2024
    f712628

Commits on Oct 3, 2024

  1. add condition on version

    Signed-off-by: WoodieDudy <goshagks@gmail.com>
    WoodieDudy committed Oct 3, 2024
    fd78849
  2. Apply isort and black reformatting

    Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
    WoodieDudy committed Oct 3, 2024
    1aec220

Commits on Oct 5, 2024

  1. update condition on version

    Signed-off-by: WoodieDudy <goshagks@gmail.com>
    WoodieDudy committed Oct 5, 2024
    e978045
  2. remove condition on torch version

    Signed-off-by: WoodieDudy <goshagks@gmail.com>
    WoodieDudy committed Oct 5, 2024
    18e30ed
  3. Apply isort and black reformatting

    Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
    WoodieDudy committed Oct 5, 2024
    c95dc01

Commits on Oct 7, 2024

  1. move code to init

    Signed-off-by: WoodieDudy <goshagks@gmail.com>
    WoodieDudy committed Oct 7, 2024
    ca21430
  2. Apply isort and black reformatting

    Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
    WoodieDudy committed Oct 7, 2024
    6741826

Commits on Oct 8, 2024

  1. refactor

    Signed-off-by: WoodieDudy <goshagks@gmail.com>
    WoodieDudy committed Oct 8, 2024
    5dee79f
  2. Apply isort and black reformatting

    Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
    WoodieDudy committed Oct 8, 2024
    de3835d
  3. refactor

    Signed-off-by: WoodieDudy <goshagks@gmail.com>
    WoodieDudy committed Oct 8, 2024
    8af1241