-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
use torch scaled_dot_product_attention #1
base: main
Are you sure you want to change the base?
Commits on Aug 22, 2024
-
force optimizer.param_groups to match mcore_optimizer.param_groups af…
…ter restoring from a checkpoint (NVIDIA#10225) Signed-off-by: ashors1 <ashors@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 78f57fe - Browse repository at this point
Copy the full SHA 78f57feView commit details -
Update TRTLLM 0.12 (NVIDIA#10215)
* Update TRTLLM 0.12 * Add model config * Change config * Change deploy script * Apply isort and black reformatting Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> * Remove parameter --------- Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> Co-authored-by: meatybobby <meatybobby@users.noreply.github.com> Co-authored-by: Onur Yilmaz <35306097+oyilmaz-nvidia@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 5269caf - Browse repository at this point
Copy the full SHA 5269cafView commit details -
Tutorial: audio codec inference (NVIDIA#10186)
Signed-off-by: Ante Jukić <ajukic@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 42c2910 - Browse repository at this point
Copy the full SHA 42c2910View commit details
Commits on Aug 23, 2024
-
Move trt imports in nemo.collections.llm inside respective functions (N…
…VIDIA#10234) Signed-off-by: Hemil Desai <hemild@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 753c70e - Browse repository at this point
Copy the full SHA 753c70eView commit details -
Add tests for LazyNeMoIterator and fix case with metadata_only=True a…
…nd offsets in manifest (NVIDIA#10198) * Add tests for LazyNeMoIterator and fix case with manifest_only=True and offsets in manifest Signed-off-by: Piotr Żelasko <petezor@gmail.com> * Address code review Signed-off-by: Piotr Żelasko <petezor@gmail.com> * fix tests Signed-off-by: Piotr Żelasko <petezor@gmail.com> * fix tests Signed-off-by: Piotr Żelasko <petezor@gmail.com> --------- Signed-off-by: Piotr Żelasko <petezor@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for d4d6a5b - Browse repository at this point
Copy the full SHA d4d6a5bView commit details -
[NeMo-UX] Fix a serialization bug that prevents users from moving che…
…ckpoints (NVIDIA#9939) * perfor serialization using relative paths to allow users to move checkpoints after they're saved Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * remove unused import Signed-off-by: ashors1 <ashors@nvidia.com> * fix artifact load Signed-off-by: ashors1 <ashors@nvidia.com> * fix path artifact Signed-off-by: ashors1 <ashors@nvidia.com> * remove unused import Signed-off-by: ashors1 <ashors@nvidia.com> --------- Signed-off-by: ashors1 <ashors@nvidia.com> Signed-off-by: ashors1 <ashors1@users.noreply.github.com> Co-authored-by: ashors1 <ashors1@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 1c90b5e - Browse repository at this point
Copy the full SHA 1c90b5eView commit details -
Add MemoryProfileCallback (NVIDIA#10166)
* Add MemoryProfileCallback Signed-off-by: Shriya Palsamudram <spalsamudram@nvidia.com> * Apply isort and black reformatting Signed-off-by: ShriyaPalsamudram <ShriyaPalsamudram@users.noreply.github.com> * Remove reference cycles, save snapshot on specific ranks Signed-off-by: Shriya Palsamudram <spalsamudram@nvidia.com> * Remove unnecessary imports Signed-off-by: Shriya Palsamudram <spalsamudram@nvidia.com> * Apply isort and black reformatting Signed-off-by: ShriyaPalsamudram <ShriyaPalsamudram@users.noreply.github.com> * Update docstring Signed-off-by: Shriya Palsamudram <spalsamudram@nvidia.com> --------- Signed-off-by: Shriya Palsamudram <spalsamudram@nvidia.com> Signed-off-by: ShriyaPalsamudram <ShriyaPalsamudram@users.noreply.github.com> Signed-off-by: Shriya Rishab <69161273+ShriyaPalsamudram@users.noreply.github.com> Co-authored-by: ShriyaPalsamudram <ShriyaPalsamudram@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 6d1be93 - Browse repository at this point
Copy the full SHA 6d1be93View commit details -
Lower bound transformers to support nemotron (NVIDIA#10240)
Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com> Co-authored-by: Dong Hyuk Chang <donghyukc@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for d415621 - Browse repository at this point
Copy the full SHA d415621View commit details
Commits on Aug 24, 2024
-
[Audio] SSL Pretraining framework for flow-matching model for audio p…
…rocessing (NVIDIA#10052) Flow matching generative model with SSL pretraining framework Signed-off-by: Pin-Jui Ku <pku@nvidia.com> Co-authored-by: Kuray107 <Kuray107@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 7cc99e9 - Browse repository at this point
Copy the full SHA 7cc99e9View commit details
Commits on Aug 26, 2024
-
Revert torchrun fix for model import (NVIDIA#10251)
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 8d9cfee - Browse repository at this point
Copy the full SHA 8d9cfeeView commit details -
[NeMo-UX[ Move nemotron imports inline (NVIDIA#10255)
* Move nemotron transformers + tokenizer imports inline to reduce number of required deps Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Apply isort and black reformatting Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> --------- Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> Co-authored-by: marcromeyn <marcromeyn@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 642c97a - Browse repository at this point
Copy the full SHA 642c97aView commit details -
Wrap CPU model init with megatron_lazy_init_context (NVIDIA#10219)
* Wrap CPU model init with megatron_lazy_init_context Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Cleanup checkpoint-dir if saving fails Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 8210e9c - Browse repository at this point
Copy the full SHA 8210e9cView commit details -
Configuration menu - View commit details
-
Copy full SHA for fad3414 - Browse repository at this point
Copy the full SHA fad3414View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6f6fc27 - Browse repository at this point
Copy the full SHA 6f6fc27View commit details -
Apply isort and black reformatting
Signed-off-by: titu1994 <titu1994@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 941c7f5 - Browse repository at this point
Copy the full SHA 941c7f5View commit details -
Bump
Dockerfile.ci
(2024-08-22) (NVIDIA#10227)* [🤠]: Howdy folks, let's bump `Dockerfile.ci` to 124bcff ! Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * fix bert flags Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: Oliver Koenig <okoenig@nvidia.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for ea8f49b - Browse repository at this point
Copy the full SHA ea8f49bView commit details -
salm export trtllm (NVIDIA#10245)
Signed-off-by: slyne deng <slyned@nvidia.com> Co-authored-by: slyne deng <slyned@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 69973f9 - Browse repository at this point
Copy the full SHA 69973f9View commit details
Commits on Aug 27, 2024
-
[🤠]: Howdy folks, let's bump
Dockerfile.ci
to ef85bc9 ! (NVIDIA#10250)Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 59a3e96 - Browse repository at this point
Copy the full SHA 59a3e96View commit details -
[🤠]: Howdy folks, let's bump
Dockerfile.ci
to 01ca03f ! (NVIDIA#10266)Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: oliver könig <okoenig@nvidia.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 49f13fb - Browse repository at this point
Copy the full SHA 49f13fbView commit details -
Load model in the target export precision by default in PTQ (NVIDIA#1…
…0267) * Load model in the target export precision by default Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Enable megatron_amp_O2=true to actually use half-precision Signed-off-by: Jan Lasek <jlasek@nvidia.com> --------- Signed-off-by: Jan Lasek <janek.lasek@gmail.com> Signed-off-by: Jan Lasek <jlasek@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 2f422dd - Browse repository at this point
Copy the full SHA 2f422ddView commit details -
Add WandbPlugin, NsysPlugin and PreemptionPlugin to nemo.lightning.ru…
…n.plugins (NVIDIA#10223) * Add WandbPlugin, NsysPlugin and PreemptionPlugin to nemo.lightning.run.plugins Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> * Remove duplicate Signed-off-by: Hemil Desai <hemild@nvidia.com> * Add entity to wandb logger Signed-off-by: Hemil Desai <hemild@nvidia.com> * Add documentation Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> * Add warning Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> * PR feedback Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> * Add comments Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> --------- Signed-off-by: Hemil Desai <hemild@nvidia.com> Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> Co-authored-by: hemildesai <hemildesai@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for fd75162 - Browse repository at this point
Copy the full SHA fd75162View commit details -
[NeMo-UX] Handle absolute logger directories in nemo_logger (NVIDIA#1…
…0259) * handle absolute and relative logger directories Signed-off-by: Anna Shors <ashors@nvidia.com> * merge lines Signed-off-by: ashors1 <ashors@nvidia.com> --------- Signed-off-by: Anna Shors <ashors@nvidia.com> Signed-off-by: ashors1 <ashors@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 38800cd - Browse repository at this point
Copy the full SHA 38800cdView commit details -
Add sdxl notebook (NVIDIA#10139)
* Add sdxl notebook Signed-off-by: mingyuanm <mingyuanm@nvidia.com> * Rename Signed-off-by: mingyuanm <mingyuanm@nvidia.com> * final Update SDXL notebook Signed-off-by: mingyuanm <mingyuanm@nvidia.com> --------- Signed-off-by: mingyuanm <mingyuanm@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 57aa305 - Browse repository at this point
Copy the full SHA 57aa305View commit details -
Add Llama31 Config (NVIDIA#10260)
* add llama31 config * Apply isort and black reformatting Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> * fix init method * typo * revert llama3-70b init method std --------- Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> Co-authored-by: suiyoubi <suiyoubi@users.noreply.github.com> Co-authored-by: Chen Cui <chcui@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 19668e5 - Browse repository at this point
Copy the full SHA 19668e5View commit details -
Added offloading support for LoRA adapters (NVIDIA#10237)
Signed-off-by: Selvaraj Anandaraj <selvaraja@login-eos02.eos.clusters.nvidia.com> Co-authored-by: Selvaraj Anandaraj <selvaraja@login-eos02.eos.clusters.nvidia.com> Co-authored-by: Chen Cui <chcui@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for c7c3eae - Browse repository at this point
Copy the full SHA c7c3eaeView commit details -
Add Qwen2 to Nemo 2 (NVIDIA#10258)
* add qwen2 * typo * Apply isort and black reformatting Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> * qwen without pip install issue * Apply isort and black reformatting Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> * remove calculate vocab size divisible --------- Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> Co-authored-by: suiyoubi <suiyoubi@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for f53600a - Browse repository at this point
Copy the full SHA f53600aView commit details
Commits on Aug 28, 2024
-
Lazy import tokenizers (NVIDIA#10213)
* Move inflect to lazy import Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Use lazy imports for tokenizer libraries Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * sacremoses lazy import Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix cyclic import Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * import fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * move pangu Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for e68f981 - Browse repository at this point
Copy the full SHA e68f981View commit details -
add rampup bs documentation (NVIDIA#9884) (NVIDIA#10289)
* create documentation for rampup bs * fix format * fix format * fix config format * move config stage * add example * fix table * fix table * fix grammar * fix grammar --------- Signed-off-by: dimapihtar <dpihtar@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 5ff7f22 - Browse repository at this point
Copy the full SHA 5ff7f22View commit details -
Add Starcoder to Nemo 2 (NVIDIA#10230)
* Add sc1/sc2 to nemo-ux * Apply isort and black reformatting Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> * typo * Apply isort and black reformatting Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> * fix import * remove pip install dependency for sc * Apply isort and black reformatting Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> * remove pip install dependency for sc2 * Apply isort and black reformatting Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> * typo Signed-off-by: Chen Cui <chcui@nvidia.com> * Apply isort and black reformatting Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> --------- Signed-off-by: suiyoubi <suiyoubi@users.noreply.github.com> Signed-off-by: Ao Tang <aot@nvidia.com> Signed-off-by: Chen Cui <chcui@nvidia.com> Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> Co-authored-by: suiyoubi <suiyoubi@users.noreply.github.com> Co-authored-by: Chen Cui <chcui@nvidia.com> Co-authored-by: cuichenx <cuichenx@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 4805fe9 - Browse repository at this point
Copy the full SHA 4805fe9View commit details -
comment out ASR_dev_run_Speech_To_Text_HF_Finetuning until fixed (NVI…
…DIA#10293) Signed-off-by: Pablo Garay <pagaray@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 2438fa9 - Browse repository at this point
Copy the full SHA 2438fa9View commit details -
Adding a Garbage-collection callback for a synchronized garbage-colle…
…ction across ALL processes (NVIDIA#10261) * Adding Garbage-collection callback for a synchrozned gc across processed Signed-off-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com> * 1 Signed-off-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com> * Apply isort and black reformatting Signed-off-by: gautham-kollu <gautham-kollu@users.noreply.github.com> Signed-off-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com> * PR feedback Signed-off-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com> --------- Signed-off-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com> Signed-off-by: gautham-kollu <gautham-kollu@users.noreply.github.com> Co-authored-by: Gautham Kollu <gkollu@login-eos01.eos.clusters.nvidia.com> Co-authored-by: gautham-kollu <gautham-kollu@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 5040546 - Browse repository at this point
Copy the full SHA 5040546View commit details -
Do not overwrite wandb name in NeMo Logger (NVIDIA#10265)
* Do not overwrite wandb name in NeMo Logger Signed-off-by: Hemil Desai <hemild@nvidia.com> * Do not overwrite tensorboard name Signed-off-by: Hemil Desai <hemild@nvidia.com> * Fix tests Signed-off-by: Hemil Desai <hemild@nvidia.com> --------- Signed-off-by: Hemil Desai <hemild@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 1d2d507 - Browse repository at this point
Copy the full SHA 1d2d507View commit details -
Bump
Dockerfile.ci
(2024-08-28) (NVIDIA#10278)* [🤠]: Howdy folks, let's bump `Dockerfile.ci` to 34e607e ! Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * refactor: Deprecate `async_grad_allreduce` Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: Oliver Koenig <okoenig@nvidia.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 5bbfa53 - Browse repository at this point
Copy the full SHA 5bbfa53View commit details -
Multimodal trtllm export and infer script (NVIDIA#10287)
* salm export trtllm Signed-off-by: slyne deng <slyned@nvidia.com> * add export script Signed-off-by: slyne deng <slyned@nvidia.com> * fix style Signed-off-by: slyne deng <slyned@nvidia.com> * Apply isort and black reformatting Signed-off-by: Slyne <Slyne@users.noreply.github.com> --------- Signed-off-by: slyne deng <slyned@nvidia.com> Signed-off-by: Slyne <Slyne@users.noreply.github.com> Co-authored-by: slyne deng <slyned@nvidia.com> Co-authored-by: Slyne <Slyne@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 60ac8aa - Browse repository at this point
Copy the full SHA 60ac8aaView commit details -
[TTS] Add config and modules for 22khz and 44khz audio codec (NVIDIA#…
…10107) * [TTS] Add config and modules for 22khz and 44khz audio codec Signed-off-by: Ryan <rlangman@nvidia.com> * Apply isort and black reformatting Signed-off-by: rlangman <rlangman@users.noreply.github.com> * [TTS] Add argument docstring to new modules Signed-off-by: Ryan <rlangman@nvidia.com> --------- Signed-off-by: Ryan <rlangman@nvidia.com> Signed-off-by: rlangman <rlangman@users.noreply.github.com> Co-authored-by: rlangman <rlangman@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for a860e6b - Browse repository at this point
Copy the full SHA a860e6bView commit details -
Add example script to run NeMo 2.0 llama pretraining with NeMo-Run (N…
…VIDIA#10226) * Add example script to run NeMo 2.0 llama pretraining with NeMo-Run Signed-off-by: Hemil Desai <hemild@nvidia.com> * Fixes Signed-off-by: Hemil Desai <hemild@nvidia.com> * Address PR comments Signed-off-by: Hemil Desai <hemild@nvidia.com> --------- Signed-off-by: Hemil Desai <hemild@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for f45422a - Browse repository at this point
Copy the full SHA f45422aView commit details
Commits on Aug 29, 2024
-
Add FSDP for NeMo 2.0 (NVIDIA#9748)
* modify code structure and add strategy Signed-off-by: jasonwan <jasonwan@nvidia.com> * correct doc url Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * oextract common elements and add callback Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * add iomixin Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * update strategies Signed-off-by: jasonwan <jasonwan@nvidia.com> * update callback Signed-off-by: jasonwan <jasonwan@nvidia.com> * add training step to strategy Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * remove unused import Signed-off-by: jasonwan <jasonwan@nvidia.com> * add iomixin to strategy & precision Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: artbataev <artbataev@users.noreply.github.com> * add val/test steps to strategy Signed-off-by: jasonwan <jasonwan@nvidia.com> * add documentations Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * add default sharding for fsdp. add setup callback detection Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * extract checkpoint io logic Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * clean up unused imports Signed-off-by: jasonwan <jasonwan@nvidia.com> * sync new megatron strategy changes Signed-off-by: jasonwan <jasonwan@nvidia.com> * break down setup callback back into strategy Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * reorder stuff Signed-off-by: jasonwan <jasonwan@nvidia.com> * fix data logic Signed-off-by: jasonwan <jasonwan@nvidia.com> * minor fix Signed-off-by: jasonwan <jasonwan@nvidia.com> * add dtensor ckpt conversion support Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * support hsdp Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * remove iomixin Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * fix import Signed-off-by: jasonwan <jasonwan@nvidia.com> * refactor loss reduction Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * clean up Signed-off-by: jasonwan <jasonwan@nvidia.com> * add unittest Signed-off-by: jasonwan <jasonwan@nvidia.com> * Apply isort and black reformatting Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> * clean up Signed-off-by: jasonwan <jasonwan@nvidia.com> --------- Signed-off-by: jasonwan <jasonwan@nvidia.com> Signed-off-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> Signed-off-by: artbataev <artbataev@users.noreply.github.com> Co-authored-by: blahBlahhhJ <blahBlahhhJ@users.noreply.github.com> Co-authored-by: artbataev <artbataev@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 22f0bb0 - Browse repository at this point
Copy the full SHA 22f0bb0View commit details -
Export fp8 te nemo to trt-llm (NVIDIA#10096)
* initial commit Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * PR draft Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * fixed scaling weights Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * Apply isort and black reformatting Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * Apply isort and black reformatting Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * fixed zarr loading, added flags, refactor Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com> * fix expert key mapping Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * refactor Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com> * fix: failed test was finishing with exit code 0 Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * test commit -- rerun github checks Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * bugfix: naming Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * bugfix v2: naming Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * apply code review changes Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com> * fix TensorRTLLM build (fp8 still not supported) Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com> * undo refactor Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * bugfix: arguments to dist_convert Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com> --------- Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com> Signed-off-by: Piotr Kamiński <67481570+Laplasjan107@users.noreply.github.com> Co-authored-by: Piotr Kaminski <pikaminski@nvidia.com> Co-authored-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 9796b69 - Browse repository at this point
Copy the full SHA 9796b69View commit details -
Bugfix: loading scaling factors for pyt 24.07 (NVIDIA#10297)
* bugfix: loading scaling factors Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * list instead of set Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> --------- Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> Co-authored-by: Piotr Kaminski <pikaminski@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 3ed93c1 - Browse repository at this point
Copy the full SHA 3ed93c1View commit details -
Sanity checks for unfinished checkpoints removal (NVIDIA#10228)
* Added sanity checks Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com> * Apply isort and black reformatting Signed-off-by: jbieniusiewi <jbieniusiewi@users.noreply.github.com> * Updated error msg Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com> * Added checks for NeMo 2.0 Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com> * Updated NeMo 2.0 test Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com> * Apply isort and black reformatting Signed-off-by: jbieniusiewi <jbieniusiewi@users.noreply.github.com> * fixed test_nemo_logger.py Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com> --------- Signed-off-by: Jacek Bieniusiewicz <jbieniusiewi@nvidia.com> Signed-off-by: jbieniusiewi <jbieniusiewi@users.noreply.github.com> Co-authored-by: jbieniusiewi <jbieniusiewi@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 006d65f - Browse repository at this point
Copy the full SHA 006d65fView commit details -
allow disabling validation (NVIDIA#10273)
Signed-off-by: Maanu Grover <maanug@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for cdf61f9 - Browse repository at this point
Copy the full SHA cdf61f9View commit details -
make torch_dist ckpt strategy as default (NVIDIA#9852) (NVIDIA#10291)
copy of NVIDIA#9852 Signed-off-by: dimapihtar <dpihtar@gmail.com> Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> Co-authored-by: dimapihtar <dimapihtar@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 736a6fc - Browse repository at this point
Copy the full SHA 736a6fcView commit details -
TRT-LLM 0.12 + ModelOpt 0.17.0 updates (NVIDIA#10301)
* Update trtllm-build options Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Pull QUANT_CFG_CHOICES into try/catch for HAVE_MODELOPT consistency Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Remove deprecated parallel group setup Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Remove deprecated size settings Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Use max_seq_len instead of max_output_len [part I] Signed-off-by: Jan Lasek <janek.lasek@gmail.com> --------- Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for ea0f69f - Browse repository at this point
Copy the full SHA ea0f69fView commit details -
add documentation for reset_lr feature (NVIDIA#9639) (NVIDIA#10290)
* Change default parallel_save to False (NVIDIA#9633) * Unwrap ckpt_io for model opt (async save) (NVIDIA#9622) (NVIDIA#9634) * add reset_lr documentation * fix style * fix style * fix style * add image * fix typo * fix plot * fix plot * change plot size * fix style * move image * add reset_lr to intro page --------- Signed-off-by: Mikołaj Błaż <mblaz@nvidia.com> Signed-off-by: dimapihtar <dpihtar@gmail.com> Signed-off-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com> Co-authored-by: mikolajblaz <mikolajblaz@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for eff7ddd - Browse repository at this point
Copy the full SHA eff7dddView commit details -
[NeMo UX] expose
num_dataset_builder_threads
argument (NVIDIA#10281)* expose num_dataset_builder_threads arg Signed-off-by: ashors1 <ashors@nvidia.com> * upate docstring Signed-off-by: ashors1 <ashors@nvidia.com> --------- Signed-off-by: ashors1 <ashors@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 3ebe567 - Browse repository at this point
Copy the full SHA 3ebe567View commit details -
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for d0128da - Browse repository at this point
Copy the full SHA d0128daView commit details -
ci: Selective triggering (NVIDIA#10195)
* ci: Selective triggering Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * simplify pass-through Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * fix Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * fix Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * refactor tests that dont use template Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 81f18f6 - Browse repository at this point
Copy the full SHA 81f18f6View commit details
Commits on Aug 30, 2024
-
[🤠]: Howdy folks, let's bump
Dockerfile.ci
to 9ab31cb ! (NVIDIA#10311)Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 4d5f1aa - Browse repository at this point
Copy the full SHA 4d5f1aaView commit details -
Log Gradient Norms (NVIDIA#10244)
* override optimizer step to log Signed-off-by: Maanu Grover <maanug@nvidia.com> * import fix Signed-off-by: Maanu Grover <maanug@nvidia.com> * remove norm from output Signed-off-by: Maanu Grover <maanug@nvidia.com> * also log num zeros in grad Signed-off-by: Maanu Grover <maanug@nvidia.com> * bugfix Signed-off-by: Maanu Grover <maanug@nvidia.com> --------- Signed-off-by: Maanu Grover <maanug@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for b5d1d5f - Browse repository at this point
Copy the full SHA b5d1d5fView commit details -
Add support for LoRA on vLLM (NVIDIA#10009)
* Added basic support for adding LoRA checkpoints in HF format when running deploy_vllm_triton.py Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com> * Moved the conversion logic from the convert_nemo_to_canonical.py script to a reusable module, removed the tar unpacking, removed the dependencies on OmegaConf and NLPSaveRestoreConnector. Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com> * Implemented on-load conversion of Nemo format LoRA checkpoints into HF format for vLLM. Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com> * Added logger initialization, improved some messages. Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com> * Moved the LoRA converter script to nemo.export.utils. Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com> * Fixed the description of the query.py script. Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com> * Apply isort and black reformatting Signed-off-by: apanteleev <apanteleev@users.noreply.github.com> * Fixed the missing file close. Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com> --------- Signed-off-by: Alexey Panteleev <alpanteleev@nvidia.com> Signed-off-by: apanteleev <apanteleev@users.noreply.github.com> Co-authored-by: apanteleev <apanteleev@users.noreply.github.com> Co-authored-by: Onur Yilmaz <35306097+oyilmaz-nvidia@users.noreply.github.com> Co-authored-by: Eric Harper <complex451@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for d886151 - Browse repository at this point
Copy the full SHA d886151View commit details -
Flexible passing args to TensorRTLLM in nemo_export.py (NVIDIA#10315)
Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 1ce9089 - Browse repository at this point
Copy the full SHA 1ce9089View commit details -
add back HF Finetune script to CI (NVIDIA#10308)
Signed-off-by: Nithin Rao Koluguri <nithinraok> Co-authored-by: Nithin Rao Koluguri <nithinraok>
Configuration menu - View commit details
-
Copy full SHA for e5f22a8 - Browse repository at this point
Copy the full SHA e5f22a8View commit details -
Add Yi 1.5 34b Neva support (NVIDIA#10083)
* Yi1.5 34b neva prompt template Signed-off-by: HuiyingLi <willwin.lee@gmail.com> * apply black and isort Signed-off-by: HuiyingLi <willwin.lee@gmail.com> * add tokenizer and inference configs in docstring Signed-off-by: HuiyingLi <willwin.lee@gmail.com> --------- Signed-off-by: HuiyingLi <willwin.lee@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for a777a44 - Browse repository at this point
Copy the full SHA a777a44View commit details -
move to cpu only for log probs (NVIDIA#10316)
* move to cpu only for log probs Signed-off-by: Nithin Rao Koluguri <nithinraok> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> --------- Signed-off-by: Nithin Rao Koluguri <nithinraok> Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> Co-authored-by: Nithin Rao Koluguri <nithinraok> Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 0ba9979 - Browse repository at this point
Copy the full SHA 0ba9979View commit details -
[NeMo-UX] Don't create attention mask for GPTs (NVIDIA#10242)
* remove attention mask as default Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * fix non TE case Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * fix Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> --------- Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com> Co-authored-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for b87e1e3 - Browse repository at this point
Copy the full SHA b87e1e3View commit details
Commits on Aug 31, 2024
-
Make get_optim_config iterable (NVIDIA#10318)
* Make get_optim_config iterable Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Update error message Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 9a22005 - Browse repository at this point
Copy the full SHA 9a22005View commit details -
Fix llama3 pretraining NeMo 2.0 script (NVIDIA#10307)
Signed-off-by: Hemil Desai <hemild@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for b698ae5 - Browse repository at this point
Copy the full SHA b698ae5View commit details
Commits on Sep 2, 2024
-
Support TE-DPA For Stable Diffusion (NVIDIA#10288)
* [SD] Add te-dpa support Signed-off-by: Wil Kong <alpha0422@gmail.com> * [SD] Add te-dpa support, resolve compatibility with TE-master Signed-off-by: Wil Kong <alpha0422@gmail.com> * [SD] Add te-dpa support, add check for attention configs. Signed-off-by: Wil Kong <alpha0422@gmail.com> * Fix bugs of flash-attn and dpa in SD. Signed-off-by: Wil Kong <alpha0422@gmail.com> * Fix the issue of DPA API change. Signed-off-by: Wil Kong <alpha0422@gmail.com> * Apply isort and black reformatting Signed-off-by: alpha0422 <alpha0422@users.noreply.github.com> Signed-off-by: Wil Kong <alpha0422@gmail.com> --------- Signed-off-by: Wil Kong <alpha0422@gmail.com> Signed-off-by: alpha0422 <alpha0422@users.noreply.github.com> Co-authored-by: Mengdi Wang <didow@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 78357ae - Browse repository at this point
Copy the full SHA 78357aeView commit details
Commits on Sep 3, 2024
-
fix tokenizer restoration (NVIDIA#10336)
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 8cd751b - Browse repository at this point
Copy the full SHA 8cd751bView commit details -
remove virtual pipeline parallel apex dependency (NVIDIA#10317)
Signed-off-by: ashors1 <ashors@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 9472fc3 - Browse repository at this point
Copy the full SHA 9472fc3View commit details -
Add option to selectively load context in nemo.lightning.io (NVIDIA#1…
…0279) * Add option to selectively load context in nemo.lightning.io Signed-off-by: Hemil Desai <hemild@nvidia.com> * Update docstring Signed-off-by: Hemil Desai <hemild@nvidia.com> * Fixes Signed-off-by: Hemil Desai <hemild@nvidia.com> --------- Signed-off-by: Hemil Desai <hemild@nvidia.com> Co-authored-by: Marc Romeyn <mromeijn@nvidia.com> Co-authored-by: Abhishree Thittenamane <47577437+athitten@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for ac89593 - Browse repository at this point
Copy the full SHA ac89593View commit details -
Add EP to mixtral-8x22b recipe (NVIDIA#10337)
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 8eb1827 - Browse repository at this point
Copy the full SHA 8eb1827View commit details -
Bugfix: export to trt-llm multi_block_mode flag (NVIDIA#10334)
* bugfix Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> * Apply isort and black reformatting Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com> * add quantisation flags to deploy_triton.py Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> --------- Signed-off-by: Piotr Kaminski <pikaminski@nvidia.com> Signed-off-by: Laplasjan107 <Laplasjan107@users.noreply.github.com> Co-authored-by: Piotr Kaminski <pikaminski@nvidia.com> Co-authored-by: Laplasjan107 <Laplasjan107@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for ab6aba3 - Browse repository at this point
Copy the full SHA ab6aba3View commit details -
Configuration menu - View commit details
-
Copy full SHA for a1fd899 - Browse repository at this point
Copy the full SHA a1fd899View commit details
Commits on Sep 4, 2024
-
Add comment to address a frequently asked question (NVIDIA#10321)
* Add comment to address a frequently asked question Signed-off-by: Chen Cui <chcui@nvidia.com> * wording Signed-off-by: Chen Cui <chcui@nvidia.com> * add docstring to GPTSFTPackedDataset Signed-off-by: Chen Cui <chcui@nvidia.com> --------- Signed-off-by: Chen Cui <chcui@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 0d2d7c4 - Browse repository at this point
Copy the full SHA 0d2d7c4View commit details -
Fix async checkpointing in nemo.lightning (NVIDIA#10324)
* Fix async checkpointing in nemo.lightning Signed-off-by: Hemil Desai <hemild@nvidia.com> * PR feedback Signed-off-by: Hemil Desai <hemild@nvidia.com> * Add unit tests Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> --------- Signed-off-by: Hemil Desai <hemild@nvidia.com> Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> Co-authored-by: hemildesai <hemildesai@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for dd02d02 - Browse repository at this point
Copy the full SHA dd02d02View commit details -
[Draft]Add Nemotron4 recipes and Long Context Recipe (NVIDIA#10262)
* Add Nemotron recipes and Long Context Recipe Signed-off-by: Boxiang Wang <boxiangw@nvidia.com> * Add Nemotron4 recipes Signed-off-by: Boxiang Wang <boxiangw@nvidia.com> * Change long context recipe for llama3 mixtral and nemotron. Add Expert Parallel Support Signed-off-by: Boxiang Wang <boxiangw@nvidia.com> * Revert Chainedoptimizer change Signed-off-by: Boxiang Wang <boxiangw@nvidia.com> * Update mixed_precision.py Signed-off-by: BoxiangW <45734921+BoxiangW@users.noreply.github.com> * Update mixtral_8x7b.py Signed-off-by: BoxiangW <45734921+BoxiangW@users.noreply.github.com> * Add option to not generate attention mask Signed-off-by: Boxiang Wang <boxiangw@nvidia.com> * Remove nemtron Signed-off-by: Boxiang Wang <boxiangw@nvidia.com> * Resolve mock conflict Signed-off-by: Boxiang Wang <boxiangw@nvidia.com> --------- Signed-off-by: Boxiang Wang <boxiangw@nvidia.com> Signed-off-by: BoxiangW <45734921+BoxiangW@users.noreply.github.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 32ba985 - Browse repository at this point
Copy the full SHA 32ba985View commit details -
[NeMo-UX] Adding copyright to collections.llm & lightning (NVIDIA#10345)
* Adding copyright to collections.llm & lightning Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Apply isort and black reformatting Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> --------- Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> Co-authored-by: marcromeyn <marcromeyn@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 8134f33 - Browse repository at this point
Copy the full SHA 8134f33View commit details -
added support for FC model in Diarization with ASR and timestamps (NV…
…IDIA#10154) Signed-off-by: Kunal Dhawan <kunaldhawan97@gmail.com> Co-authored-by: Nithin Rao <nithinrao.koluguri@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 73bec06 - Browse repository at this point
Copy the full SHA 73bec06View commit details -
Remove apply_query_key_layer_scaling for GPT models (NVIDIA#10349)
* remove qk layer scaling * add sc2
Configuration menu - View commit details
-
Copy full SHA for d8efee9 - Browse repository at this point
Copy the full SHA d8efee9View commit details
Commits on Sep 5, 2024
-
remove grad clipping from mixed_precision plugin (NVIDIA#10303)
* remove grad clipping from mixed_precision plugin Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Raise an expection if user tries to use Trainer's clip_grad Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * Remove gradient_clip_val from recipes Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 7738b1d - Browse repository at this point
Copy the full SHA 7738b1dView commit details -
Add option to selectively restore model weights and optimizer states …
…in AutoResume and MegatronStrategy (NVIDIA#10295) * Add option to selectively restore model weights and optimizer states in AutoResume and MegatronStrategy Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> * PR feedback Signed-off-by: Hemil Desai <hemild@nvidia.com> * Fix PEFT checkpointing Signed-off-by: Chen Cui <chcui@nvidia.com> * Apply isort and black reformatting Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> * address comments Signed-off-by: Chen Cui <chcui@nvidia.com> * Apply isort and black reformatting Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> * Refactor selective restore Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> * fix tokenizer issue for peft load Signed-off-by: Chen Cui <chcui@nvidia.com> * Apply isort and black reformatting Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> * fix peft optimizer states loading Signed-off-by: Chen Cui <chcui@nvidia.com> * Apply isort and black reformatting Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> * Fix fabric strategy Signed-off-by: Hemil Desai <hemild@nvidia.com> * PR feedback Signed-off-by: Hemil Desai <hemild@nvidia.com> * Rename SelectiveRestoreConfig to RestoreConfig Signed-off-by: Hemil Desai <hemild@nvidia.com> * Update hf llama 3 paths Signed-off-by: Hemil Desai <hemild@nvidia.com> * Fixes Signed-off-by: Hemil Desai <hemild@nvidia.com> * Fix tests Signed-off-by: Hemil Desai <hemild@nvidia.com> --------- Signed-off-by: Hemil Desai <hemild@nvidia.com> Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> Signed-off-by: Chen Cui <chcui@nvidia.com> Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> Co-authored-by: hemildesai <hemildesai@users.noreply.github.com> Co-authored-by: Chen Cui <chcui@nvidia.com> Co-authored-by: cuichenx <cuichenx@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 19f904e - Browse repository at this point
Copy the full SHA 19f904eView commit details -
Signed-off-by: Malay Nagda <malayn@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for e6db2f3 - Browse repository at this point
Copy the full SHA e6db2f3View commit details -
* fix links Signed-off-by: eharper <eharper@nvidia.com> * fix links Signed-off-by: eharper <eharper@nvidia.com> --------- Signed-off-by: eharper <eharper@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for a567380 - Browse repository at this point
Copy the full SHA a567380View commit details -
Improve TE import guards (NVIDIA#10322)
* improve TE import guards Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * small fixes Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * bug fixes Signed-off-by: ashors1 <ashors@nvidia.com> --------- Signed-off-by: ashors1 <ashors@nvidia.com> Signed-off-by: ashors1 <ashors1@users.noreply.github.com> Co-authored-by: ashors1 <ashors1@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for a9746a6 - Browse repository at this point
Copy the full SHA a9746a6View commit details -
ci: Detect secrets (NVIDIA#10343)
* ci: Add secrets detector Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * chore: Add baseline Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 5bd2b89 - Browse repository at this point
Copy the full SHA 5bd2b89View commit details -
[🤠]: Howdy folks, let's bump
Dockerfile.ci
to 3396356 ! (NVIDIA#10353)Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for fdf1979 - Browse repository at this point
Copy the full SHA fdf1979View commit details
Commits on Sep 6, 2024
-
[NeMo-UX] Turn on mcore performance optimizations (NVIDIA#10209)
* expose TP overlap Signed-off-by: Jieming Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * add tp overlap recipes Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * turn on pipeline parallel overlap Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * refactor Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * Update base.py Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com> * Update megatron_parallel.py Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com> * remove env var Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * add optimization config Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * fix typo Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * refactor into megatron parallel setup Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * refactor Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * fix config ordering, add wgrad deferral Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * cleanup Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * use config Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * clean Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * enable wgrad defferal Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * add grad bucket size Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * move everthing into a callback Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * cleanup Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * fix imports Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * move userbuffer init Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * cleanup Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * fix VP Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * address comments Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * add gradient accum guard Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * Update base.py Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com> * address comments Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * address comments Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> --------- Signed-off-by: Jieming Zhang <jiemingz@nvidia.com> Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> Signed-off-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com> Co-authored-by: Jieming Zhang <jiemingz@nvidia.com> Co-authored-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 1d5de59 - Browse repository at this point
Copy the full SHA 1d5de59View commit details -
[NeMo-UX] checkpointing improvements (NVIDIA#10241)
* save model weights and artifacts to separate directories Signed-off-by: ashors1 <ashors@nvidia.com> * add save_artifacts_on_train_end Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * do not save optimizer states in final checkpoint Signed-off-by: ashors1 <ashors@nvidia.com> * WIP support for saving only last k optimizer states Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * minor cleanup Signed-off-by: ashors1 <ashors@nvidia.com> * Revert support for saving last k optimizer states. This will be addressed in a subsequent PR. * use storage_options to determine when to skip saving optimizer states Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * fix variable names, make checkpoint load work when optimizer states don't exist in the checkpoint Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * FSDP updates, provide option to save optimizer states on_train_end Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * simplify implementation, remove save_best_model option Signed-off-by: ashors1 <ashors@nvidia.com> * update default value of ckpt_include_optimizer for fsdp Signed-off-by: ashors1 <ashors@nvidia.com> * remove unused imports Signed-off-by: ashors1 <ashors@nvidia.com> * remove unused import Signed-off-by: ashors1 <ashors@nvidia.com> * cleanup Signed-off-by: ashors1 <ashors@nvidia.com> * make storage_options optional again Signed-off-by: ashors1 <ashors@nvidia.com> * fix failing tests Signed-off-by: ashors1 <ashors@nvidia.com> * address some comments Signed-off-by: ashors1 <ashors@nvidia.com> * use save_weights_only to determine whether to save optimizer states Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * add some comments Signed-off-by: ashors1 <ashors@nvidia.com> * fix tests Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * fixes Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * remove unnecessary line Signed-off-by: ashors1 <ashors@nvidia.com> --------- Signed-off-by: ashors1 <ashors@nvidia.com> Signed-off-by: ashors1 <ashors1@users.noreply.github.com> Co-authored-by: ashors1 <ashors1@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 34393c6 - Browse repository at this point
Copy the full SHA 34393c6View commit details -
[Nemo Unit Tests] Split CPU unit tests (NVIDIA#10365)
* Split CPU unit tests * Split CPU unit tests * Fix:Run pytest in specific paths * Fix:Run pytest in specific paths * Fix:Run pytest in specific paths
Configuration menu - View commit details
-
Copy full SHA for ad5ef75 - Browse repository at this point
Copy the full SHA ad5ef75View commit details -
ci: Fix checkout of secrets detector (NVIDIA#10381)
* ci: Fix checkout of secrets detector Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * f Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 95944ee - Browse repository at this point
Copy the full SHA 95944eeView commit details -
only log consumed samples during training (NVIDIA#10371)
Signed-off-by: ashors1 <ashors@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 7ba0681 - Browse repository at this point
Copy the full SHA 7ba0681View commit details
Commits on Sep 7, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 62c1dce - Browse repository at this point
Copy the full SHA 62c1dceView commit details -
[NeMo-UX] Checkpointing fixes (NVIDIA#10376)
* remove save_best_model from default logger Signed-off-by: ashors1 <ashors@nvidia.com> * fix broken checkpoint restore Signed-off-by: ashors1 <ashors@nvidia.com> * fix fsdp Signed-off-by: ashors1 <ashors@nvidia.com> * rename weights path to avoid confusion Signed-off-by: ashors1 <ashors@nvidia.com> * Revert "rename weights path to avoid confusion". We'll add this in a separate PR This reverts commit 72bae8b. --------- Signed-off-by: ashors1 <ashors@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 9e372d3 - Browse repository at this point
Copy the full SHA 9e372d3View commit details -
add auto configurator to NeMo (NVIDIA#10270)
* add base configs Signed-off-by: dimapihtar <dpihtar@gmail.com> * add auto configurator functionality Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * add runner Signed-off-by: dimapihtar <dpihtar@gmail.com> * add end-to-end example for auto configurator Signed-off-by: dimapihtar <dpihtar@gmail.com> * add unit tests for auto configurator Signed-off-by: dimapihtar <dpihtar@gmail.com> * add GPT configs Signed-off-by: dimapihtar <dpihtar@gmail.com> * add GPT configs Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * switch to dataclass Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * switch to dataclass Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * fix dataclasses usage Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * remove unused imports Signed-off-by: dimapihtar <dpihtar@gmail.com> * remove extra function Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix docstring style Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * take Config object as input for model Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * add nemotron support Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * remove search_config.py Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * move configs creation to Basic class Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * move to common basic class Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * rename main config Signed-off-by: dimapihtar <dpihtar@gmail.com> * remove base configs for models Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * Apply isort and black reformatting Signed-off-by: artbataev <artbataev@users.noreply.github.com> * change auto conf functionality Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * fix docstring Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * remove unused imports Signed-off-by: dimapihtar <dpihtar@gmail.com> * add changes Signed-off-by: dimapihtar <dpihtar@gmail.com> * remove activations_checkpoint_num_layers Signed-off-by: dimapihtar <dpihtar@gmail.com> * remove gbs from config Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix logs Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * fix performance calculation Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix end-to-end example Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * fix model config Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * minor changes Signed-off-by: dimapihtar <dpihtar@gmail.com> * minor changes Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * fix unit tests Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * add README Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix README Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix README Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix readme Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix readme Signed-off-by: dimapihtar <dpihtar@gmail.com> * remove extra arg Signed-off-by: dimapihtar <dpihtar@gmail.com> * remove unused imports Signed-off-by: dimapihtar <dpihtar@gmail.com> * add nemo-run installation Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix unit tests Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix unit tests Signed-off-by: dimapihtar <dpihtar@gmail.com> --------- Signed-off-by: dimapihtar <dpihtar@gmail.com> Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> Signed-off-by: artbataev <artbataev@users.noreply.github.com> Co-authored-by: dimapihtar <dimapihtar@users.noreply.github.com> Co-authored-by: artbataev <artbataev@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for cda2a63 - Browse repository at this point
Copy the full SHA cda2a63View commit details
Commits on Sep 8, 2024
-
fix mixtraltopk (NVIDIA#10366)
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: Marc Romeyn <mromeijn@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for f666682 - Browse repository at this point
Copy the full SHA f666682View commit details -
ci: Fix release tag (NVIDIA#10367)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for e1f375e - Browse repository at this point
Copy the full SHA e1f375eView commit details -
Akoumparouli/nemo ux tokenizer fix (NVIDIA#10351)
* save tokenizer to disk Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Track Hf tokenizer assets Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * raise exception if dst file exists Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * minor Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * remove print Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add tokenizercontext Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Add TokenizerContext Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * restore tokenizer from separate dir Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * update artifact __init__.py Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * TokenizerContext connector Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * bugix on_import_ckpt Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * rm code Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Drop tokenizercontext Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * drop tokenizer load from tokenizercontext Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * undo Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * undo Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Move to util function Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * use save_hf_tokenizer_assets Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * add tokenizer restoration in resume.py Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * bot fixes Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * rm Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * wrap tokenizer restoration in try/catch Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * load_artifacts Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * param fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * more fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * lazy import tensorboard Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * move code out of file context manager Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Allow skippable artifacts Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * rebase fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * checkpoint structure change update Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for a26ed2f - Browse repository at this point
Copy the full SHA a26ed2fView commit details -
Add option to resume from specific path in AutoResume (NVIDIA#10373)
* Add option to resume from specific path in AutoResume Signed-off-by: Hemil Desai <hemild@nvidia.com> * Fix path Signed-off-by: Hemil Desai <hemild@nvidia.com> --------- Signed-off-by: Hemil Desai <hemild@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for dd63de1 - Browse repository at this point
Copy the full SHA dd63de1View commit details -
ci: Cleanup of release-freeze automation (NVIDIA#10392)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 6f1c414 - Browse repository at this point
Copy the full SHA 6f1c414View commit details -
ci: Toggle pre-release (NVIDIA#10394)
* ci: Toggle pre-release Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * f Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for ab82b56 - Browse repository at this point
Copy the full SHA ab82b56View commit details -
ci: Toggle pre-release (NVIDIA#10395)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for bcf7e0f - Browse repository at this point
Copy the full SHA bcf7e0fView commit details -
ci: Toggle pre-release (NVIDIA#10396)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 21cb949 - Browse repository at this point
Copy the full SHA 21cb949View commit details -
ci: Automate pre-release (NVIDIA#10397)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 30385aa - Browse repository at this point
Copy the full SHA 30385aaView commit details -
Akoumparouli/nemo ux validate dataset asset accessibility (NVIDIA#10309)
* Add validate_dataset_asset_accessibility Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Add CI tests for validate_dataset_asset_accessibility Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix for zipped lists Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 2404c4e - Browse repository at this point
Copy the full SHA 2404c4eView commit details -
[🤠]: Howdy folks, let's bump NeMo
2.1.0rc0
! (NVIDIA#10399)Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: ko3n1g <16716991+ko3n1g@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 9921e6c - Browse repository at this point
Copy the full SHA 9921e6cView commit details -
ci: Update baseline (NVIDIA#10400)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for f6cd74b - Browse repository at this point
Copy the full SHA f6cd74bView commit details -
ci(chore): Minor change (NVIDIA#10401)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 94c5fd8 - Browse repository at this point
Copy the full SHA 94c5fd8View commit details -
ci: Swap merge/cherry-pick order (NVIDIA#10389)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 41502ff - Browse repository at this point
Copy the full SHA 41502ffView commit details -
ci: Fix release tag (NVIDIA#10402)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 19382eb - Browse repository at this point
Copy the full SHA 19382ebView commit details -
Ko3n1g/ci/fix release workflow 2 (NVIDIA#10403)
* ci: Improve release workflow Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * ci: Fix cherry-picking Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 73a8ef8 - Browse repository at this point
Copy the full SHA 73a8ef8View commit details -
ci: Send Slack alert on failed cherry pick (NVIDIA#10404)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for a4f95f1 - Browse repository at this point
Copy the full SHA a4f95f1View commit details -
ci: Allow concurrent docker system prune (NVIDIA#10405)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 0e5e5d5 - Browse repository at this point
Copy the full SHA 0e5e5d5View commit details -
ci: Use PAT for cherry-picking (NVIDIA#10406)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 46e908e - Browse repository at this point
Copy the full SHA 46e908eView commit details -
Alit/mamba ux cicd (NVIDIA#10370)
* add mamba init * more ssm * add 370m * add hybrid * fix issue * integrate model and tokenizer config for ssm * add all mamba configs * modify state re pattern * revert gpt stuff * remove SSM class and training script * Apply isort and black reformatting Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com> * remove faulty export * add script to test * Apply isort and black reformatting Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com> * some recent fixes * Apply isort and black reformatting Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com> * test script tp/pp1 * Apply isort and black reformatting Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com> * add cicd * include MLM mamba dist ckpt commit * add license head and address more comments * Apply isort and black reformatting Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com> * add guard * remove guard from TransformerConfig * update scripts * Apply isort and black reformatting Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com> --------- Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com> Signed-off-by: Ali Taghibakhshi <71892896+JRD971000@users.noreply.github.com> Co-authored-by: Ali Taghibakhshi <ataghibakhsh@login-eos01.eos.clusters.nvidia.com> Co-authored-by: JRD971000 <JRD971000@users.noreply.github.com> Co-authored-by: oliver könig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 9f9bf4d - Browse repository at this point
Copy the full SHA 9f9bf4dView commit details -
ci: Allow default token to write workflows (NVIDIA#10407)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for a95f3a2 - Browse repository at this point
Copy the full SHA a95f3a2View commit details -
ci: More permissions for cherry-pick automation (NVIDIA#10409)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 4bf8101 - Browse repository at this point
Copy the full SHA 4bf8101View commit details -
ci: Overhaul cherry-pick workflow (NVIDIA#10410)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 0a40662 - Browse repository at this point
Copy the full SHA 0a40662View commit details -
ci: Ignore failures on cherry-picking (NVIDIA#10411)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 0d0e724 - Browse repository at this point
Copy the full SHA 0d0e724View commit details -
ci: Minor change (NVIDIA#10412)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 52c7f2a - Browse repository at this point
Copy the full SHA 52c7f2aView commit details -
ci: Fix cherry-pick config (NVIDIA#10413)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 7d27792 - Browse repository at this point
Copy the full SHA 7d27792View commit details -
ci: Minor change (NVIDIA#10414)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 91863d2 - Browse repository at this point
Copy the full SHA 91863d2View commit details -
ci: Minor change (NVIDIA#10415)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 48fab9d - Browse repository at this point
Copy the full SHA 48fab9dView commit details -
ci: Remove dead code (NVIDIA#10416)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 573d910 - Browse repository at this point
Copy the full SHA 573d910View commit details -
Ko3n1g/ci/test cherry picking 2 (NVIDIA#10417)
* ci: Cherrypick continue on error Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * ci: Fix cherry pick branch Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 14c3d4a - Browse repository at this point
Copy the full SHA 14c3d4aView commit details -
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for aab78f0 - Browse repository at this point
Copy the full SHA aab78f0View commit details -
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for b7ee0b8 - Browse repository at this point
Copy the full SHA b7ee0b8View commit details
Commits on Sep 9, 2024
-
[NeMo-UX] Integrating CLI (NVIDIA#10300)
* Adding nemo-run to requirements Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Updating nemo-run entrypoint inside setup.py Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Remove nemo-run from requirements until we have a pypi package Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Update entrypoint naming Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Setting up cli recipe for llama3-8b Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Move AutoTokenizer import inline for starcoder Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Move AutoTokenizer import inline for starcoder2 Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Use target for factories inside llama3_8b Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Update other recipes Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Fix some bugs in the recipes Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Adding some examples Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Adding repl example Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Starting to add a notebook example as well Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Fix wrong imports Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Apply isort and black reformatting Signed-off-by: pre-commit-ci[bot] <pre-commit-ci[bot]@users.noreply.github.com> * Fix wrong imports Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Apply isort and black reformatting Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> * Fix typo + add script with default executor Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Apply isort and black reformatting Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> * Add nemo-run to Dockerfile.ci Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Adding copyright to recipes Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Apply isort and black reformatting Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> * Adding guides to recipes dir Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Adding hatchling to Dockerfile.ci Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Move install to different line Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * fix install Signed-off-by: Hemil Desai <hemild@nvidia.com> * Move llama3_pretraining to scripts for now Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Remove img folder & use images from release instead Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Apply isort and black reformatting Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> * Updating default of num_nodes in all recipes Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> * Apply isort and black reformatting Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> * Adding tests for all recipes Signed-off-by: Marc Romeijn <mromeijn@nvidia.com> * ddAing docstrings Signed-off-by: Marc Romeijn <mromeijn@nvidia.com> * Apply isort and black reformatting Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> * Fix failing tests inside test_mixtral_8x7b_64k Signed-off-by: Marc Romeijn <mromeijn@nvidia.com> * Rename fabric to _fabric to avoid name collision with package fabric Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add rename comment Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> --------- Signed-off-by: Marc Romeyn <mromeijn@nvidia.com> Signed-off-by: pre-commit-ci[bot] <pre-commit-ci[bot]@users.noreply.github.com> Signed-off-by: marcromeyn <marcromeyn@users.noreply.github.com> Signed-off-by: Hemil Desai <hemild@nvidia.com> Signed-off-by: Marc Romeijn <mromeijn@nvidia.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: marcromeyn <marcromeyn@users.noreply.github.com> Co-authored-by: Hemil Desai <hemild@nvidia.com> Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for d12fbbd - Browse repository at this point
Copy the full SHA d12fbbdView commit details -
[Nemo Unit Tests] Split GPU unit tests (NVIDIA#10380)
* Split GPU unit tests * Make L0_Unit_Tests_GPU_Lightning optional since flaky * Add time for GPU_Core test * Add time for GPU_Audio test
Configuration menu - View commit details
-
Copy full SHA for fb39fad - Browse repository at this point
Copy the full SHA fb39fadView commit details -
Support Energon as dataloader in NeVA (NVIDIA#10305)
* energon support draft Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * removing yi, not in PRs scope Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * remove mp_spawn + add energon req Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * Apply isort and black reformatting Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com> * removing seq packing Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * Apply isort and black reformatting Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com> * fixing encode_sft Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * fix use_energon Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * Apply isort and black reformatting Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com> * adding docstrings + addressing comments + CodeQL fixes Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * Apply isort and black reformatting Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com> * addressing comments Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * Apply isort and black reformatting Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com> * add yi_34b Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * Apply isort and black reformatting Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com> * adding assertion for energon + peft, not supported due to dataloader being saved with dist_ckpt Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * Apply isort and black reformatting Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com> * energon usage warning Signed-off-by: paul-gibbons <paul@gibbonspaul.com> * Apply isort and black reformatting Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com> --------- Signed-off-by: paul-gibbons <paul@gibbonspaul.com> Signed-off-by: paul-gibbons <paul-gibbons@users.noreply.github.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: paul-gibbons <paul-gibbons@users.noreply.github.com> Co-authored-by: Yu Yao <54727607+yaoyu-33@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for dc61f7a - Browse repository at this point
Copy the full SHA dc61f7aView commit details -
24.07 perf numbers (NVIDIA#10253)
Signed-off-by: Malay Nagda <malayn@nvidia.com> Co-authored-by: Sangkug Lym <slym@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 8e3d65d - Browse repository at this point
Copy the full SHA 8e3d65dView commit details -
Configuration menu - View commit details
-
Copy full SHA for ba7962e - Browse repository at this point
Copy the full SHA ba7962eView commit details -
Neva update to NeMo 2.0 (NVIDIA#10292)
* add some placeholders Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * few updates Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * update Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * fix logging Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * add neva mock dataset Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * update neva conversation Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * neva data update Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * fixes to conversation Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * data save Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Update paths and fix init Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * some fixes Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * fix combined embeddings logic Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * revert debug code Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix mock dataset for neva Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix dataset part and add llava Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix and updates Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix and updates Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix and updates Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Add checkpoint saving dev test script Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix and updates for neva pretraining Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * Fix and updates for neva finetuning Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * Fix minor issues in neva 2.0 upgrade Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Update init Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * update examples Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * Formatting Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * fix formatting Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Add license and fix formatting Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * Fix formatting Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix examples Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix neva api file Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Update tokens Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * Update arg apis Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * Update model and pretrain script Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * finetune and inference fixes Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * address comments Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * fix Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix dataset resuming Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * address comments Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * clean up Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * update loss mask class Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * fix Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * fix args Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * address comments Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> * remove examples Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * remove factory Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> --------- Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> Co-authored-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 176c54f - Browse repository at this point
Copy the full SHA 176c54fView commit details -
Configuration menu - View commit details
-
Copy full SHA for e6f6a48 - Browse repository at this point
Copy the full SHA e6f6a48View commit details -
[Nemo CICD] Make flaky test optional (NVIDIA#10438)
* Make flaky test optional * Make flaky test optional
Configuration menu - View commit details
-
Copy full SHA for 8f0d0c7 - Browse repository at this point
Copy the full SHA 8f0d0c7View commit details
Commits on Sep 10, 2024
-
[Nemo CICD] Make flaky test optional (NVIDIA#10442)
* Make flaky test optional * Make flaky test optional * Make flaky test optional
Configuration menu - View commit details
-
Copy full SHA for 4259169 - Browse repository at this point
Copy the full SHA 4259169View commit details -
ci: Fix secrets detector on forks (NVIDIA#10426)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for ae243d4 - Browse repository at this point
Copy the full SHA ae243d4View commit details -
[Nemo CICD] Make flaky test optional (NVIDIA#10446)
* Make flaky test optional * Make flaky test optional * Make flaky test optional * Make flaky test optional --------- Signed-off-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for c3e6a6e - Browse repository at this point
Copy the full SHA c3e6a6eView commit details -
Expand pyproject.toml to include package metadata for uv (NVIDIA#10350)
Also edits get_git_hash to not error when git is not installed in the local container. Signed-off-by: Peter St. John <pstjohn@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 18d81b1 - Browse repository at this point
Copy the full SHA 18d81b1View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1be9cc1 - Browse repository at this point
Copy the full SHA 1be9cc1View commit details -
[Nemo CICD] Make flaky test optional (NVIDIA#10450)
* Make flaky test optional * Make flaky test optional
Configuration menu - View commit details
-
Copy full SHA for 766ded5 - Browse repository at this point
Copy the full SHA 766ded5View commit details -
[Nemo CICD] Make flaky test optional (NVIDIA#10452)
* Make flaky test optional * Make flaky test optional * Make flaky test optional
Configuration menu - View commit details
-
Copy full SHA for 3c4def6 - Browse repository at this point
Copy the full SHA 3c4def6View commit details
Commits on Sep 11, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 79c4786 - Browse repository at this point
Copy the full SHA 79c4786View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0df6610 - Browse repository at this point
Copy the full SHA 0df6610View commit details -
add parakeet-tdt_ctc-110m model (NVIDIA#10461)
* add parakeet-tdt_ctc-110m model Signed-off-by: Nithin Rao Koluguri <nithinraok> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> --------- Signed-off-by: Nithin Rao Koluguri <nithinraok> Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> Co-authored-by: Nithin Rao Koluguri <nithinraok> Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for fd8c6a4 - Browse repository at this point
Copy the full SHA fd8c6a4View commit details -
Bump
Dockerfile.ci
(2024-09-09) (NVIDIA#10423)* [🤠]: Howdy folks, let's bump `Dockerfile.ci` to 8307fcd ! Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> * update TE import paths Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * Update parallelisms.rst fix sed typo. Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> * fix for mcore dist opt refactor: move overlap_grad_reduce/overlap_param_gather to ddp config Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * remove overlap_grad_reduce overlap_param_gather from autoconfig Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * subclass TransformerConfig because megatronmodule expects it to have fp8 attr Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * revert change; Use ModelParallelConfig & add fp8 Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix, set NVTE_APPLY_QK_LAYER_SCALIN=1 Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com> Co-authored-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 1163e1e - Browse repository at this point
Copy the full SHA 1163e1eView commit details -
MCORE interface for TP-only FP8 AMAX reduction (NVIDIA#10437)
Signed-off-by: Sangkug Lym <slym@nvidia.com> Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 46aa1ee - Browse repository at this point
Copy the full SHA 46aa1eeView commit details -
Support MCORE Distributed Optimizer (NVIDIA#10363)
* add the mcore interface for optim arg; overlap param AG with optimizer Signed-off-by: Sangkug Lym <slym@nvidia.com> * Apply isort and black reformatting Signed-off-by: erhoo82 <erhoo82@users.noreply.github.com> --------- Signed-off-by: Sangkug Lym <slym@nvidia.com> Signed-off-by: erhoo82 <erhoo82@users.noreply.github.com> Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> Co-authored-by: erhoo82 <erhoo82@users.noreply.github.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 2089c53 - Browse repository at this point
Copy the full SHA 2089c53View commit details
Commits on Sep 12, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 2c21e0a - Browse repository at this point
Copy the full SHA 2c21e0aView commit details -
Fix nemo run entrypoints (NVIDIA#10464)
Signed-off-by: Hemil Desai <hemild@nvidia.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for c611e53 - Browse repository at this point
Copy the full SHA c611e53View commit details -
[Nemo CICD] Make flaky test optional (NVIDIA#10476)
* Optional test needs optional field set true * make flaky test optional
Configuration menu - View commit details
-
Copy full SHA for 70f4426 - Browse repository at this point
Copy the full SHA 70f4426View commit details -
add ci tests for Auto Configurator (NVIDIA#10390)
* add ci tests for Auto Configurator Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix unit tests Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix ci tests Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix unit tests Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix ci tests Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix ci tests Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix unit tests Signed-off-by: dimapihtar <dpihtar@gmail.com> --------- Signed-off-by: dimapihtar <dpihtar@gmail.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 057041c - Browse repository at this point
Copy the full SHA 057041cView commit details
Commits on Sep 13, 2024
-
[🤠]: Howdy folks, let's bump
Dockerfile.ci
to 76f9f48 ! (NVIDIA#10477)Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for cb3df0b - Browse repository at this point
Copy the full SHA cb3df0bView commit details
Commits on Sep 15, 2024
-
[NeMo-UX] Rename weights path during resume (NVIDIA#10391)
* rename weights path to avoid confusion Signed-off-by: ashors1 <ashors@nvidia.com> * use pathlib utils rather than os Signed-off-by: ashors1 <ashors@nvidia.com> * update resume_from_path and context_path Signed-off-by: ashors1 <ashors@nvidia.com> * address comment Signed-off-by: ashors1 <ashors@nvidia.com> --------- Signed-off-by: ashors1 <ashors@nvidia.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 3a60491 - Browse repository at this point
Copy the full SHA 3a60491View commit details -
[NeMo-UX] Use custom
BatchProgress
class which does not restore sta……tes (NVIDIA#10383) * [WIP] fix batch sampler to match megatron dataloaders Signed-off-by: ashors1 <ashors@nvidia.com> * make batchprogress configurable Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> --------- Signed-off-by: ashors1 <ashors@nvidia.com> Signed-off-by: ashors1 <ashors1@users.noreply.github.com> Co-authored-by: ashors1 <ashors1@users.noreply.github.com> Co-authored-by: Shriya Rishab <69161273+ShriyaPalsamudram@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for b5798de - Browse repository at this point
Copy the full SHA b5798deView commit details
Commits on Sep 16, 2024
-
Remove Apex dependency if not using norm (NVIDIA#10468)
Signed-off-by: Chen Cui <chcui@nvidia.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 9621be2 - Browse repository at this point
Copy the full SHA 9621be2View commit details -
Update adapter saving logic to be compatible with
save_weights_only
(……NVIDIA#10466) * update adapter save logic to be compatible with `save_weights_only` Signed-off-by: Chen Cui <chcui@nvidia.com> * Apply isort and black reformatting Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> --------- Signed-off-by: Chen Cui <chcui@nvidia.com> Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> Co-authored-by: cuichenx <cuichenx@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 0f8a531 - Browse repository at this point
Copy the full SHA 0f8a531View commit details -
Akoumparouli/nemo ux update param name (NVIDIA#10441)
* NeMoLogger: update dir to log_dir Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * NeMologger: update calls Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: Marc Romeyn <mromeijn@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 62deef0 - Browse repository at this point
Copy the full SHA 62deef0View commit details -
Draft: Expose MCore Cudagraph interface (NVIDIA#10121)
* cuda graph modules Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * bug fixes Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * add mcore cuda graph interface Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * add mcore cuda graph interface Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * fix whitespace Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * change te import path Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * add cudagraph manager Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * separate external cudagraph flag Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * separate config flags Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * update mcore changes Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * add check Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> --------- Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com> Co-authored-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 99af1ce - Browse repository at this point
Copy the full SHA 99af1ceView commit details -
[NeMo-UX] Add token drop callback and optimize mixtral configs (NVIDI…
…A#10361) * add token drop plugin Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * add checks Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * add expert parallel configs Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * amend comment Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * add comm overlap Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * fix rebase errors Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * fix typo Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * add test configs Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * fix Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> --------- Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com> Co-authored-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for cc494c9 - Browse repository at this point
Copy the full SHA cc494c9View commit details -
fix partial audio transcription order: (NVIDIA#10379)
* fix partial audio transcription order: Signed-off-by: Nithin Rao Koluguri <nithinraok> * update transcribe_speech.py Signed-off-by: Nithin Rao Koluguri <nithinraok> * fix canary transcription Signed-off-by: Nithin Rao Koluguri <nithinraok> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> * for filepaths Signed-off-by: Nithin Rao Koluguri <nithinraok> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> * add override config option Signed-off-by: Nithin Rao Koluguri <nithinraok> * remove unused imports Signed-off-by: Nithin Rao Koluguri <nithinraok> * clean up Signed-off-by: stevehuang52 <heh@nvidia.com> * completely remove partial audio transcription Signed-off-by: Nithin Rao Koluguri <nithinraok> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> * update doc strings Signed-off-by: Nithin Rao Koluguri <nithinraok> * remove unused imports Signed-off-by: Nithin Rao Koluguri <nithinraok> * support for translate_speech.py Signed-off-by: Nithin Rao Koluguri <nithinraok> * suggested changes from som Signed-off-by: Nithin Rao Koluguri <nithinraok> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> --------- Signed-off-by: Nithin Rao Koluguri <nithinraok> Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> Signed-off-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Nithin Rao Koluguri <nithinraok> Co-authored-by: nithinraok <nithinraok@users.noreply.github.com> Co-authored-by: stevehuang52 <heh@nvidia.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for a250726 - Browse repository at this point
Copy the full SHA a250726View commit details -
ci: Fix hyperlink to PR (NVIDIA#10494)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for d419955 - Browse repository at this point
Copy the full SHA d419955View commit details -
Configuration menu - View commit details
-
Copy full SHA for 16568d7 - Browse repository at this point
Copy the full SHA 16568d7View commit details
Commits on Sep 17, 2024
-
fix NeMoLogger log -> log_dir rename (NVIDIA#10498)
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 8ff8804 - Browse repository at this point
Copy the full SHA 8ff8804View commit details -
ci: Fix base branch of secrets detector (NVIDIA#10501)
* ci: Fix base branch of secrets detector Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for f6a905c - Browse repository at this point
Copy the full SHA f6a905cView commit details -
Call reload_model_params only if there's no optimizer state (NVIDIA#1…
…0470) * Call reload_model_params only if there's no optimizer state Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add nemo-ux resume test Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 9f67409 - Browse repository at this point
Copy the full SHA 9f67409View commit details -
ci: Disable flaky secrets test (NVIDIA#10503)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for cda4be3 - Browse repository at this point
Copy the full SHA cda4be3View commit details -
Add missing import guards for causal_conv1d and mamba_ssm dependencies (
NVIDIA#10429) * Add causal_conv1d import guard Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Add mamba_ssm import guard Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Apply isort and black reformatting Signed-off-by: janekl <janekl@users.noreply.github.com> --------- Signed-off-by: Jan Lasek <janek.lasek@gmail.com> Signed-off-by: janekl <janekl@users.noreply.github.com> Co-authored-by: janekl <janekl@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for df3575a - Browse repository at this point
Copy the full SHA df3575aView commit details -
Update doc for fp8 trt-llm export (NVIDIA#10444)
* Update doc for fp8 trt-llm export Signed-off-by: Piotr Kamiński <67481570+Laplasjan107@users.noreply.github.com> * Apply review suggestions Signed-off-by: Piotr Kamiński <67481570+Laplasjan107@users.noreply.github.com> * code review Signed-off-by: Piotr Kamiński <67481570+Laplasjan107@users.noreply.github.com> --------- Signed-off-by: Piotr Kamiński <67481570+Laplasjan107@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 308eaac - Browse repository at this point
Copy the full SHA 308eaacView commit details -
[SD] TE-DPA: disbale use te-dpa in inference flow. (NVIDIA#10488)
Co-authored-by: Mengdi Wang <didow@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 16a1e0c - Browse repository at this point
Copy the full SHA 16a1e0cView commit details -
Add py-modules to pyproject.toml (NVIDIA#10509)
* Lower bound transformers to support nemotron Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com> * Add py-modules Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com> * Update py-modules Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com> --------- Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com> Co-authored-by: Dong Hyuk Chang <donghyukc@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for da993db - Browse repository at this point
Copy the full SHA da993dbView commit details -
Add nemo2 conversion scripts for export (NVIDIA#10375)
* Filter optimizer state in TRTLLM * Add convert script * Fix script * Add max positional embedding * Apply isort and black reformatting Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> * Add header comment --------- Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> Co-authored-by: meatybobby <meatybobby@users.noreply.github.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for a7d1896 - Browse repository at this point
Copy the full SHA a7d1896View commit details
Commits on Sep 18, 2024
-
[🤠]: Howdy folks, let's bump
Dockerfile.ci
to 0bda578 ! (NVIDIA#10518)Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 848bdfb - Browse repository at this point
Copy the full SHA 848bdfbView commit details -
Update modelopt to 0.17.0 (NVIDIA#10489)
Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for bb4b5c6 - Browse repository at this point
Copy the full SHA bb4b5c6View commit details -
add save_last_n_optim_states flag (NVIDIA#10098)
* add save_last_n_optim_states flag Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * add feature to drop optimizer states from checkpoint Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * fix drop_optim_states logic Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * remove file Signed-off-by: dimapihtar <dpihtar@gmail.com> * undo remove extra line Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix style Signed-off-by: dimapihtar <dpihtar@gmail.com> * add drop optimizer states support Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * change drop optim states logic Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * remove extra import Signed-off-by: dimapihtar <dpihtar@gmail.com> * remove extra parameter Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix style Signed-off-by: dimapihtar <dpihtar@gmail.com> * add unit test for drop optim states feature Signed-off-by: dimapihtar <dpihtar@gmail.com> * add unit test for drop optim states feature Co-authored-by: oliver könig <okoenig@nvidia.com> Signed-off-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com> * move drop optim scripts to NeMoModelCheckpoint Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * move drop_optim_states functionality to NeMoModelCheckpoint Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * remove drop_optim_states functionality from NLPDDPStrategy Signed-off-by: dimapihtar <dpihtar@gmail.com> * remove extra param Signed-off-by: dimapihtar <dpihtar@gmail.com> * reset changes Signed-off-by: dimapihtar <dpihtar@gmail.com> * remove extra line Signed-off-by: dimapihtar <dpihtar@gmail.com> * add safe save and remove ckpt when drop optim states Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * remove unused param Signed-off-by: dimapihtar <dpihtar@gmail.com> * add load state dict Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * fix drop optim strategy Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * return extra line Signed-off-by: dimapihtar <dpihtar@gmail.com> * remove ckpt on a single node Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * change params naming Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * fix bug Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix bug Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * add file Signed-off-by: dimapihtar <dpihtar@gmail.com> * revert changes Signed-off-by: dimapihtar <dpihtar@gmail.com> * restore current ckpt state dict when dropping optim states Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * minor changes Signed-off-by: dimapihtar <dpihtar@gmail.com> * minor changes Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> --------- Signed-off-by: dimapihtar <dpihtar@gmail.com> Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> Signed-off-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com> Co-authored-by: dimapihtar <dimapihtar@users.noreply.github.com> Co-authored-by: oliver könig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 0504c92 - Browse repository at this point
Copy the full SHA 0504c92View commit details -
fix asr finetune (NVIDIA#10508)
* fix asr finetune Signed-off-by: stevehuang52 <heh@nvidia.com> * fix vad infer Signed-off-by: stevehuang52 <heh@nvidia.com> --------- Signed-off-by: stevehuang52 <heh@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for bb721f8 - Browse repository at this point
Copy the full SHA bb721f8View commit details -
add a feature to drop checkpoint layers (NVIDIA#10200)
* add skip_train support Signed-off-by: dimapihtar <dpihtar@gmail.com> * add drop_layers support Signed-off-by: dimapihtar <dpihtar@gmail.com> * add script example of how to trim the model Signed-off-by: dimapihtar <dpihtar@gmail.com> * add documentation for drop_layers feature Signed-off-by: dimapihtar <dpihtar@gmail.com> * add unit test for skip_train feature Signed-off-by: dimapihtar <dpihtar@gmail.com> * add unit test for skip_train feature Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix syntax error Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix skip_train test Signed-off-by: dimapihtar <dpihtar@gmail.com> * add noop_forward_patch Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> * fix doc file Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix doc file Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix doc file Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix doc file Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix doc file Signed-off-by: dimapihtar <dpihtar@gmail.com> * fix doc file Signed-off-by: dimapihtar <dpihtar@gmail.com> --------- Signed-off-by: dimapihtar <dpihtar@gmail.com> Signed-off-by: Dmytro Pykhtar <37850217+dimapihtar@users.noreply.github.com> Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> Co-authored-by: dimapihtar <dimapihtar@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for f4ef524 - Browse repository at this point
Copy the full SHA f4ef524View commit details -
Configuration menu - View commit details
-
Copy full SHA for cd861e2 - Browse repository at this point
Copy the full SHA cd861e2View commit details
Commits on Sep 19, 2024
-
handle logging case where grad_norm is None (NVIDIA#10457)
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 07c1c80 - Browse repository at this point
Copy the full SHA 07c1c80View commit details -
Make nemo_run dependency optional (llm/__init__ ) (NVIDIA#10453)
* Make llm/__init__ nemo_run dependency optional Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: Marc Romeyn <mromeijn@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 05573d7 - Browse repository at this point
Copy the full SHA 05573d7View commit details -
move mamba installation (NVIDIA#10447)
* move mamba installation Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * FIX Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * remove mounts Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add no-deps Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add --no-index Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * --no-binary Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * --no-binary Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * use git clone Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add --depth=1 to mcore git clone Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix typo Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Move mamba before nemo_run Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * revert megatron git clone Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 8a244ff - Browse repository at this point
Copy the full SHA 8a244ffView commit details -
Update inference tests scripts and models (NVIDIA#10505)
* Update infer_data_path.py Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Remove unused download_nemo_checkpoint function Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Change file permissions Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Bugfix for trt_llm_export_kwargs Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Update export test script and model metadata Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Apply isort and black reformatting Signed-off-by: janekl <janekl@users.noreply.github.com> --------- Signed-off-by: Jan Lasek <janek.lasek@gmail.com> Signed-off-by: janekl <janekl@users.noreply.github.com> Co-authored-by: janekl <janekl@users.noreply.github.com> Co-authored-by: Onur Yilmaz <35306097+oyilmaz-nvidia@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 28851be - Browse repository at this point
Copy the full SHA 28851beView commit details -
Adds Llama 3.1 405b configurations (NVIDIA#10472)
* adds Llama 3.1 405b configurations Signed-off-by: Elnifio <elnifio0519@gmail.com> * refines comments Signed-off-by: Elnifio <elnifio0519@gmail.com> * removes untested parts Signed-off-by: Elnifio <elnifio0519@gmail.com> * removes associated unused libraries Signed-off-by: Elnifio <elnifio0519@gmail.com> * small patches Signed-off-by: Elnifio <elnifio0519@gmail.com> --------- Signed-off-by: Elnifio <elnifio0519@gmail.com> Co-authored-by: Shriya Rishab <69161273+ShriyaPalsamudram@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for b721f12 - Browse repository at this point
Copy the full SHA b721f12View commit details -
FP8 plugin recipes (NVIDIA#10208)
* create mixed precision plugin recipes Signed-off-by: Maanu Grover <maanug@nvidia.com> * cleanup Signed-off-by: Maanu Grover <maanug@nvidia.com> * fix annotation Signed-off-by: Maanu Grover <maanug@nvidia.com> * full definition instead of attach Signed-off-by: Maanu Grover <maanug@nvidia.com> * update docstrings Signed-off-by: Maanu Grover <maanug@nvidia.com> * update fp8 Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> * Apply isort and black reformatting Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> * refactor import Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> --------- Signed-off-by: Maanu Grover <maanug@nvidia.com> Signed-off-by: Jimmy Zhang <jiemingz@nvidia.com> Signed-off-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com> Co-authored-by: JimmyZhang12 <67203904+JimmyZhang12@users.noreply.github.com> Co-authored-by: Jimmy Zhang <jiemingz@nvidia.com> Co-authored-by: JimmyZhang12 <JimmyZhang12@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 3e251a0 - Browse repository at this point
Copy the full SHA 3e251a0View commit details -
[nemo-ux] Added nemotron recipes and tests (NVIDIA#10432)
* [NeMo-UX] Added nemotron recipes and tests Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com> * fixed issues with precision handling Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com> * Apply isort and black reformatting Signed-off-by: ahmadki <ahmadki@users.noreply.github.com> Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com> * missing LoRA config in finetuning Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com> * nemotron recipe imports Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com> * cleanup and linting Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com> * moved gradient clipping to OptimizerConfig, Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com> * Added long context nemotron Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com> --------- Signed-off-by: Ahmad Kiswani <kiswani.ahmad@gmail.com> Signed-off-by: ahmadki <ahmadki@users.noreply.github.com> Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> Co-authored-by: ahmadki <ahmadki@users.noreply.github.com> Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 259744e - Browse repository at this point
Copy the full SHA 259744eView commit details -
Pass mock to GPTDatasetConfig (NVIDIA#10435)
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Co-authored-by: Pablo Garay <palenq@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 3653bed - Browse repository at this point
Copy the full SHA 3653bedView commit details -
added energon dataloader for neva training (NVIDIA#10451)
* added energon dataloader for neva training * Apply isort and black reformatting Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com> * specify global batch size to support grad accumulation * adding neva pretrain example * Apply isort and black reformatting Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com> * change pretraine example to handle new ckpt reloading * fixed code quality warnings and unused imports Signed-off-by: ykarnati <ykarnati@nvidia.com> * minor changes for PR comments * Apply isort and black reformatting Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com> * refactor conversation template config * Apply isort and black reformatting Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com> * remove optional import --------- Signed-off-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com> Signed-off-by: ykarnati <ykarnati@nvidia.com> Co-authored-by: yashaswikarnati <yashaswikarnati@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 7354740 - Browse repository at this point
Copy the full SHA 7354740View commit details -
Add unit tests for model configs in nemo.collections.llm (NVIDIA#10497)
* Add unit tests for model configs in nemo.collections.llm Signed-off-by: Hemil Desai <hemild@nvidia.com> * Add nemotron tests Signed-off-by: Hemil Desai <hemild@nvidia.com> * Add gemma Signed-off-by: Hemil Desai <hemild@nvidia.com> * Add other models Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> * PR feedback Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> --------- Signed-off-by: Hemil Desai <hemild@nvidia.com> Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> Co-authored-by: hemildesai <hemildesai@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 9e1ce6f - Browse repository at this point
Copy the full SHA 9e1ce6fView commit details -
nemo-ux: optim & model state restore test (NVIDIA#10325)
* optim restore test Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add missing import Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 8d3e561 - Browse repository at this point
Copy the full SHA 8d3e561View commit details -
Add copyright headers to nemo llm examples (NVIDIA#10543)
Signed-off-by: Hemil Desai <hemild@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 45ff28f - Browse repository at this point
Copy the full SHA 45ff28fView commit details
Commits on Sep 20, 2024
-
upgrade librosa version to fix librosa.display.specshow issue, matplo…
…tlib get_cmap (NVIDIA#10542) (NVIDIA#10553) Signed-off-by: Kunal Dhawan <kunaldhawan97@gmail.com> Co-authored-by: Kunal Dhawan <kunaldhawan97@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for d2d2aa0 - Browse repository at this point
Copy the full SHA d2d2aa0View commit details -
replace unbiased with correction (NVIDIA#10555)
* replace unbiased with correction Signed-off-by: Nithin Rao Koluguri <nithinraok> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> --------- Signed-off-by: Nithin Rao Koluguri <nithinraok> Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> Co-authored-by: Nithin Rao Koluguri <nithinraok> Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 44d2ae7 - Browse repository at this point
Copy the full SHA 44d2ae7View commit details
Commits on Sep 21, 2024
-
Akoumparouli/nemo ux ckpt conv bugfix (NVIDIA#10558)
* Call original _te._get_extra_te_kwargs and modify device Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add dtype utils Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Fix Llama ckpt conversion dtype Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * propage ckpt precision changes Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * minor improvement Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * bot happy Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for bc10d7c - Browse repository at this point
Copy the full SHA bc10d7cView commit details -
add autoresume to nemo 2 test (NVIDIA#10556)
Signed-off-by: ashors1 <ashors@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for d2af2a4 - Browse repository at this point
Copy the full SHA d2af2a4View commit details
Commits on Sep 22, 2024
-
ci: Add original author as reviewer to cherry-pick (NVIDIA#10566)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for cfc9a6c - Browse repository at this point
Copy the full SHA cfc9a6cView commit details -
ci: Improve title of cherry-picked PR (NVIDIA#10568)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for e7e55b2 - Browse repository at this point
Copy the full SHA e7e55b2View commit details -
[🤠]: Howdy folks, let's bump
Dockerfile.ci
to c394f78 ! (NVIDIA#10562)Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 0ee4d7e - Browse repository at this point
Copy the full SHA 0ee4d7eView commit details -
ci: Further improve cherry pick title (NVIDIA#10569)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for ed23cc7 - Browse repository at this point
Copy the full SHA ed23cc7View commit details -
[🤠]: Howdy folks, let's bump
Dockerfile.ci
to 811a26a ! (NVIDIA#10565)Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Signed-off-by: oliver könig <okoenig@nvidia.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for f7f7d1a - Browse repository at this point
Copy the full SHA f7f7d1aView commit details -
ci: Send link to release page (NVIDIA#10570)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for cb5c2b7 - Browse repository at this point
Copy the full SHA cb5c2b7View commit details -
ci: Add label to cherry pick PR (NVIDIA#10574)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 38c0e3d - Browse repository at this point
Copy the full SHA 38c0e3dView commit details
Commits on Sep 23, 2024
-
[🤠]: Howdy folks, let's bump
Dockerfile.ci
to 8e69382 ! (NVIDIA#10577)Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: pablo-garay <7166088+pablo-garay@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for eb274ab - Browse repository at this point
Copy the full SHA eb274abView commit details -
Remove running validating after finetuning (NVIDIA#10560)
* remove running validating after finetuning * Apply isort and black reformatting Signed-off-by: huvunvidia <huvunvidia@users.noreply.github.com> --------- Signed-off-by: huvunvidia <huvunvidia@users.noreply.github.com> Co-authored-by: Huy Vu2 <huvu@login-eos01.eos.clusters.nvidia.com> Co-authored-by: huvunvidia <huvunvidia@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 110db0c - Browse repository at this point
Copy the full SHA 110db0cView commit details -
Signed-off-by: Maanu Grover <maanug@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 9ed0d6c - Browse repository at this point
Copy the full SHA 9ed0d6cView commit details -
* remove exp dir * add back tests * Apply isort and black reformatting Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com> * add run_id to dir * fix dir->log_dir --------- Signed-off-by: JRD971000 <JRD971000@users.noreply.github.com> Co-authored-by: JRD971000 <JRD971000@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for c6d1b7d - Browse repository at this point
Copy the full SHA c6d1b7dView commit details -
ci: Send direct alert on failed cherry-pick (NVIDIA#10588)
* ci: Send direct alert on failed cherry-pick Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * also to me Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * f Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 6400bd5 - Browse repository at this point
Copy the full SHA 6400bd5View commit details -
Add ConfigValidation plugin to nemo.lightning.run (NVIDIA#10541)
* Add validation plugin to nemo.lightning.run Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> * Rename to ConfigValidationPlugin Signed-off-by: Hemil Desai <hemild@nvidia.com> * Add tests for each recipes with plugins Signed-off-by: Hemil Desai <hemild@nvidia.com> * Apply isort and black reformatting Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> * Add nsys plugin test Signed-off-by: Hemil Desai <hemild@nvidia.com> * Add more tests Signed-off-by: Hemil Desai <hemild@nvidia.com> --------- Signed-off-by: Hemil Desai <hemild@nvidia.com> Signed-off-by: hemildesai <hemildesai@users.noreply.github.com> Co-authored-by: hemildesai <hemildesai@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 7439b13 - Browse repository at this point
Copy the full SHA 7439b13View commit details -
Fix pps issue on nemo export (NVIDIA#10544)
* fix minor import bug Signed-off-by: Onur Yilmaz <oyilmaz@nvidia.com> * fix pps bug * remove config file Signed-off-by: Onur Yilmaz <oyilmaz@nvidia.com> --------- Signed-off-by: Onur Yilmaz <oyilmaz@nvidia.com> Co-authored-by: Shanmugam Ramasamy <111910568+shanmugamr1992@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for c02ea12 - Browse repository at this point
Copy the full SHA c02ea12View commit details
Commits on Sep 24, 2024
-
fix type error in llm collection (NVIDIA#10552)
* fix type bugs Signed-off-by: stevehuang52 <heh@nvidia.com> * Update mixin.py add type hint Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com> * Apply isort and black reformatting Signed-off-by: stevehuang52 <stevehuang52@users.noreply.github.com> * Update mixin.py Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com> * Apply isort and black reformatting Signed-off-by: stevehuang52 <stevehuang52@users.noreply.github.com> * Update mixin.py Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com> --------- Signed-off-by: stevehuang52 <heh@nvidia.com> Signed-off-by: He Huang (Steve) <105218074+stevehuang52@users.noreply.github.com> Signed-off-by: stevehuang52 <stevehuang52@users.noreply.github.com> Co-authored-by: stevehuang52 <stevehuang52@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 53a10a7 - Browse repository at this point
Copy the full SHA 53a10a7View commit details -
ci: Safer sequence escaping (NVIDIA#10595)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 6023b80 - Browse repository at this point
Copy the full SHA 6023b80View commit details -
ci: Fix issues with version bump (NVIDIA#10467)
* ci: Fix issues with version bump Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * fix Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * f Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * f Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for c4e4157 - Browse repository at this point
Copy the full SHA c4e4157View commit details -
ci: Add missing test specs (NVIDIA#10597)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 810d07f - Browse repository at this point
Copy the full SHA 810d07fView commit details -
Extending modelopt spec for TEDotProductAttention (NVIDIA#10523)
* Extend modelopt spec for TEDotProductAttention to support sliding window attention Signed-off-by: Jan Lasek <janek.lasek@gmail.com> * Simplify import guarding Signed-off-by: Jan Lasek <janek.lasek@gmail.com> --------- Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 0fad1c1 - Browse repository at this point
Copy the full SHA 0fad1c1View commit details -
Update Multi_Task_Adapters.ipynb (NVIDIA#10600)
Signed-off-by: Piotr Żelasko <petezor@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 849e7e0 - Browse repository at this point
Copy the full SHA 849e7e0View commit details -
Change default for always_save_context to True (NVIDIA#10547)
Signed-off-by: Abhishree <abhishreetm@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for f351f64 - Browse repository at this point
Copy the full SHA f351f64View commit details -
Import guard for SimpleMultiModalDataModule (NVIDIA#10592)
Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 70bc06b - Browse repository at this point
Copy the full SHA 70bc06bView commit details -
add support for train_time_interval to consider hydra object (NVIDIA#…
…10559) Signed-off-by: Nithin Rao Koluguri <nithinraok> Co-authored-by: Nithin Rao Koluguri <nithinraok>
Configuration menu - View commit details
-
Copy full SHA for 9d5a1aa - Browse repository at this point
Copy the full SHA 9d5a1aaView commit details -
Move update_config_with_dtype_overrides logging to debug (NVIDIA#10602)
* Move update_config_with_dtype_overrides logging to debug Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * update comment Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 877144a - Browse repository at this point
Copy the full SHA 877144aView commit details
Commits on Sep 25, 2024
-
ci: Wrap into quotes (NVIDIA#10616)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 0ec10d2 - Browse repository at this point
Copy the full SHA 0ec10d2View commit details -
* Introducing MegatronStep to make things more extensible Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Improve megatron callbacks Signed-off-by: Marc Romeijn <mromeijn@nvidia.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Some small fixes Signed-off-by: Marc Romeijn <mromeijn@nvidia.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * remove debg code Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add forward_only to forward backward func Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add global-batch-sampler support to MegatronStep Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: Marc Romeijn <mromeijn@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: Marc Romeijn <mromeijn@nvidia.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: sichu <sichu@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for e8304d6 - Browse repository at this point
Copy the full SHA e8304d6View commit details -
Add inference optimization blog post announcement to README (NVIDIA#1…
…0623) Signed-off-by: Piotr Żelasko <petezor@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for e35a659 - Browse repository at this point
Copy the full SHA e35a659View commit details
Commits on Sep 26, 2024
-
Fix mb_calculator import in lora tutorial (NVIDIA#10624)
Signed-off-by: Boxiang Wang <boxiangw@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for dcc3a16 - Browse repository at this point
Copy the full SHA dcc3a16View commit details -
Fix LoRA contiguous tensor (NVIDIA#10611)
* contiguous Signed-off-by: Chen Cui <chcui@nvidia.com> * fix load Signed-off-by: Chen Cui <chcui@nvidia.com> * add test script Signed-off-by: Chen Cui <chcui@nvidia.com> * Apply isort and black reformatting Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> * Apply isort and black reformatting Signed-off-by: artbataev <artbataev@users.noreply.github.com> --------- Signed-off-by: Chen Cui <chcui@nvidia.com> Signed-off-by: cuichenx <cuichenx@users.noreply.github.com> Signed-off-by: artbataev <artbataev@users.noreply.github.com> Co-authored-by: cuichenx <cuichenx@users.noreply.github.com> Co-authored-by: artbataev <artbataev@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 51f47f1 - Browse repository at this point
Copy the full SHA 51f47f1View commit details -
Fix Clip initializing issue in r2.0.0 (NVIDIA#10585)
* update post process Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Fix wild card matching in nemo2 Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> * Apply isort and black reformatting Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> --------- Signed-off-by: yaoyu-33 <yaoyu.094@gmail.com> Signed-off-by: yaoyu-33 <yaoyu-33@users.noreply.github.com> Co-authored-by: yaoyu-33 <yaoyu-33@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 016c1e4 - Browse repository at this point
Copy the full SHA 016c1e4View commit details -
Adding T5 to NeMo 2.0 (NVIDIA#10263)
* huvu/t5_nemo2.0 first commit from local * runable training * commit to save * update nemo/collections/llm/t5/data/pre_training.py, adding cicd test * updating codes * reset nemo/collections/nlp/parts/megatron_trainer_builder.py * reset megatron_lm_encoder_decoder_model.py, remove t5_release_test_config.sh * update init files * update Dockerfile.ci * fix wandb for cicd test * update training data path * remove uninstall TE * update .github/workflows/cicd-main.yml, disable fused/flashAttn * adjusting val_check_interval for action ci-cd tests * restore .github/workflows/cicd-main.yml * update * update nemologger args * just run pretraining in cicd test * update code llm strutures * adding tests/collections/llm/megatron_t5_pretraining.py * fix cicd * fix cicd * fix cicd * Apply isort and black reformatting Signed-off-by: huvunvidia <huvunvidia@users.noreply.github.com> --------- Signed-off-by: huvunvidia <huvunvidia@users.noreply.github.com> Signed-off-by: Huy Vu <86480512+huvunvidia@users.noreply.github.com> Co-authored-by: Huy Vu2 <huvu@login-eos01.eos.clusters.nvidia.com> Co-authored-by: huvunvidia <huvunvidia@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for eee0137 - Browse repository at this point
Copy the full SHA eee0137View commit details -
ci: Add CICD result feedback (NVIDIA#10629)
* ci: Add CICD result feedback Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * f Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for a98c5ed - Browse repository at this point
Copy the full SHA a98c5edView commit details -
.nemo conversion bug fix (NVIDIA#10598)
* bug fix Signed-off-by: dimapihtar <dpihtar@gmail.com> * bug fix Signed-off-by: dimapihtar <dpihtar@gmail.com> * Apply isort and black reformatting Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> --------- Signed-off-by: dimapihtar <dpihtar@gmail.com> Signed-off-by: dimapihtar <dimapihtar@users.noreply.github.com> Co-authored-by: dimapihtar <dimapihtar@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for a6c2fef - Browse repository at this point
Copy the full SHA a6c2fefView commit details -
ci: Fix mention (NVIDIA#10635)
* ci: Fix mention Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * f Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 38e5e09 - Browse repository at this point
Copy the full SHA 38e5e09View commit details -
Fix asr warnings (NVIDIA#10469)
* check Signed-off-by: Nithin Rao Koluguri <nithinraok> * 1 Signed-off-by: Nithin Rao Koluguri <nithinraok> * added to examples/asr Signed-off-by: Nithin Rao Koluguri <nithinraok> * deprecates cuda.amp.autocast to replace with amp.autocast(**Args) Signed-off-by: Nithin Rao Koluguri <nithinraok> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> * check Signed-off-by: Nithin Rao Koluguri <nithinraok> * 1 Signed-off-by: Nithin Rao Koluguri <nithinraok> * added to examples/asr Signed-off-by: Nithin Rao Koluguri <nithinraok> * deprecates cuda.amp.autocast to replace with amp.autocast(**Args) Signed-off-by: Nithin Rao Koluguri <nithinraok> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> * tested on V100 Signed-off-by: Nithin Rao Koluguri <nithinraok> * Apply isort and black reformatting Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> * replace cuda for jit scripted modules Signed-off-by: Nithin Rao Koluguri <nithinraok> * device type fix Signed-off-by: Nithin Rao Koluguri <nithinraok> * revert diar for CI Signed-off-by: Nithin Rao Koluguri <nithinraok> --------- Signed-off-by: Nithin Rao Koluguri <nithinraok> Signed-off-by: nithinraok <nithinraok@users.noreply.github.com> Co-authored-by: Nithin Rao Koluguri <nithinraok> Co-authored-by: nithinraok <nithinraok@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 4a9a226 - Browse repository at this point
Copy the full SHA 4a9a226View commit details -
ci: Fix hyperlink for feedback (NVIDIA#10637)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for ab4859b - Browse repository at this point
Copy the full SHA ab4859bView commit details -
sdpa flag to false & sdpa_backend arg
Signed-off-by: WoodieDudy <goshagks@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 3e31500 - Browse repository at this point
Copy the full SHA 3e31500View commit details -
Apply isort and black reformatting
Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for d604f8b - Browse repository at this point
Copy the full SHA d604f8bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 41acec1 - Browse repository at this point
Copy the full SHA 41acec1View commit details -
Apply isort and black reformatting
Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for e2aab5b - Browse repository at this point
Copy the full SHA e2aab5bView commit details -
Support LoRA in TensorRTMMExporter (NVIDIA#10347)
* Support LoRA in TensorRTMMExporter * Apply isort and black reformatting Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> * Add lora uids * Fix mm lora * Apply isort and black reformatting Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> * Fix lora * Fix conf and deploy script * Apply isort and black reformatting Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> --------- Signed-off-by: meatybobby <meatybobby@users.noreply.github.com> Co-authored-by: meatybobby <meatybobby@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 5e66cad - Browse repository at this point
Copy the full SHA 5e66cadView commit details
Commits on Sep 27, 2024
-
Nemo ux HF import tests (NVIDIA#10274)
* Add __all__ for mistral & mixtral Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Add model import test Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Add to cicd Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * skip mixtral test until use_safetnsors arg is resolved Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * syntax Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * typo Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * remove unused imports Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * remove streaming ckpt Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Update ckpt paths Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Signed-off-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 5b88aaa - Browse repository at this point
Copy the full SHA 5b88aaaView commit details -
chore(ci): Increase shm to 64gb (NVIDIA#10656)
Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for a725511 - Browse repository at this point
Copy the full SHA a725511View commit details -
Add lazy init for export (NVIDIA#10613)
* Add lazy init for export Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for fdaf607 - Browse repository at this point
Copy the full SHA fdaf607View commit details -
Update modelopt layer spec for Mixtral (NVIDIA#10660)
Signed-off-by: Jan Lasek <janek.lasek@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 4f59502 - Browse repository at this point
Copy the full SHA 4f59502View commit details -
Update llm recipe README to add a note about handling multi-process j…
…obs (NVIDIA#10649) Signed-off-by: Hemil Desai <hemild@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for d51d8b9 - Browse repository at this point
Copy the full SHA d51d8b9View commit details -
Support Canary parallel inference (NVIDIA#9517)
* add Canary cats Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com> * Apply isort and black reformatting Signed-off-by: karpnv <karpnv@users.noreply.github.com> * rm use_distributed_sampler=False Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com> * Apply isort and black reformatting Signed-off-by: karpnv <karpnv@users.noreply.github.com> * rm use_distributed_sampler Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com> * update lhotse Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com> * Apply isort and black reformatting Signed-off-by: karpnv <karpnv@users.noreply.github.com> * fix global_rank Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com> * OmegaConf.set_struct Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com> * review fix Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com> * Apply isort and black reformatting Signed-off-by: karpnv <karpnv@users.noreply.github.com> * Apply isort and black reformatting Signed-off-by: pzelasko <pzelasko@users.noreply.github.com> * predict_step return Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com> --------- Signed-off-by: Nikolay Karpov <nkarpov@nvidia.com> Signed-off-by: karpnv <karpnv@users.noreply.github.com> Signed-off-by: Piotr Żelasko <petezor@gmail.com> Signed-off-by: pzelasko <pzelasko@users.noreply.github.com> Co-authored-by: karpnv <karpnv@users.noreply.github.com> Co-authored-by: Piotr Żelasko <petezor@gmail.com> Co-authored-by: pzelasko <pzelasko@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for cbb1344 - Browse repository at this point
Copy the full SHA cbb1344View commit details -
adding resume pretraining to CICD (NVIDIA#10640)
Co-authored-by: Huy Vu2 <huvu@login-eos01.eos.clusters.nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 23c7de1 - Browse repository at this point
Copy the full SHA 23c7de1View commit details
Commits on Sep 30, 2024
-
Require setuptools>=70 and update deprecated api (NVIDIA#10659)
* Require setuptools>=70 and update deprecated api Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com> * Apply isort and black reformatting Signed-off-by: thomasdhc <thomasdhc@users.noreply.github.com> --------- Signed-off-by: Dong Hyuk Chang <donghyukc@nvidia.com> Signed-off-by: thomasdhc <thomasdhc@users.noreply.github.com> Co-authored-by: Dong Hyuk Chang <donghyukc@nvidia.com> Co-authored-by: thomasdhc <thomasdhc@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for fd78cc6 - Browse repository at this point
Copy the full SHA fd78cc6View commit details -
Akoumparouli/fix get tokenizer list (NVIDIA#10596)
* import fix Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * add get_tokenizer_list import test Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * cycle import fix attempt Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix2 Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * fix3 Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> * avoid import Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * remove relative imports Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> * Apply isort and black reformatting Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> --------- Signed-off-by: Alexandros Koumparoulis <akoumparouli@nvidia.com> Signed-off-by: akoumpa <akoumpa@users.noreply.github.com> Co-authored-by: akoumpa <akoumpa@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 9913441 - Browse repository at this point
Copy the full SHA 9913441View commit details -
[NeMo-UX] Support
save_last="link"
(NVIDIA#10548)* provide support for save_last='link' Signed-off-by: ashors1 <ashors@nvidia.com> * fix symlinks when top_k checkpoint not saved Signed-off-by: ashors1 <ashors@nvidia.com> * support symlinks with async checkpointing Signed-off-by: ashors1 <ashors@nvidia.com> * only unlink on rank 0 Signed-off-by: Anna Shors <ashors@nvidia.com> * fix race condition Signed-off-by: ashors1 <ashors@nvidia.com> * force linked checkpoint to correspond to last finalized checkpoint Signed-off-by: ashors1 <ashors@nvidia.com> * fix last_model_path after restore Signed-off-by: ashors1 <ashors@nvidia.com> * move symlink removal to strategy Signed-off-by: ashors1 <ashors@nvidia.com> * remove unneeded lines Signed-off-by: ashors1 <ashors@nvidia.com> * add some more documentation Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * address some comments Signed-off-by: ashors1 <ashors@nvidia.com> * fix syntax Signed-off-by: ashors1 <ashors@nvidia.com> * avoid overwriting _save_last_checkpoint Signed-off-by: ashors1 <ashors@nvidia.com> * fix base call Signed-off-by: ashors1 <ashors@nvidia.com> * small fix Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * add test for save_last=link Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * clean up test Signed-off-by: ashors1 <ashors@nvidia.com> * use megatroncheckpointio in test Signed-off-by: ashors1 <ashors@nvidia.com> * add async test and clean up Signed-off-by: ashors1 <ashors@nvidia.com> * fix remaining merge conflicts Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * check number of saved checkpoints Signed-off-by: ashors1 <ashors@nvidia.com> * remove unused import Signed-off-by: ashors1 <ashors@nvidia.com> * run test on gpu only Signed-off-by: ashors1 <ashors@nvidia.com> * fix a small bug and add a resume test Signed-off-by: ashors1 <ashors@nvidia.com> * Apply isort and black reformatting Signed-off-by: ashors1 <ashors1@users.noreply.github.com> * remove old comment Signed-off-by: ashors1 <ashors@nvidia.com> --------- Signed-off-by: ashors1 <ashors@nvidia.com> Signed-off-by: Anna Shors <ashors@nvidia.com> Signed-off-by: ashors1 <ashors1@users.noreply.github.com> Co-authored-by: ashors1 <ashors1@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for d664b74 - Browse repository at this point
Copy the full SHA d664b74View commit details -
Update the downloading path (NVIDIA#10662)
Signed-off-by: mingyuanm <mingyuanm@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for c0a05f6 - Browse repository at this point
Copy the full SHA c0a05f6View commit details
Commits on Oct 1, 2024
-
ci: Stability to CI/CD (NVIDIA#10694)
* ci: Switch to `--runtime=nvidia` Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * fix success marker Signed-off-by: Oliver Koenig <okoenig@nvidia.com> * f Signed-off-by: Oliver Koenig <okoenig@nvidia.com> --------- Signed-off-by: Oliver Koenig <okoenig@nvidia.com>
Configuration menu - View commit details
-
Copy full SHA for 32503fd - Browse repository at this point
Copy the full SHA 32503fdView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7660730 - Browse repository at this point
Copy the full SHA 7660730View commit details -
Configuration menu - View commit details
-
Copy full SHA for 86e60c3 - Browse repository at this point
Copy the full SHA 86e60c3View commit details -
Apply isort and black reformatting
Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for f712628 - Browse repository at this point
Copy the full SHA f712628View commit details
Commits on Oct 3, 2024
-
Signed-off-by: WoodieDudy <goshagks@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for fd78849 - Browse repository at this point
Copy the full SHA fd78849View commit details -
Apply isort and black reformatting
Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 1aec220 - Browse repository at this point
Copy the full SHA 1aec220View commit details
Commits on Oct 5, 2024
-
Signed-off-by: WoodieDudy <goshagks@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for e978045 - Browse repository at this point
Copy the full SHA e978045View commit details -
remove condition on torch version
Signed-off-by: WoodieDudy <goshagks@gmail.com>
Configuration menu - View commit details
-
Copy full SHA for 18e30ed - Browse repository at this point
Copy the full SHA 18e30edView commit details -
Apply isort and black reformatting
Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for c95dc01 - Browse repository at this point
Copy the full SHA c95dc01View commit details
Commits on Oct 7, 2024
-
Configuration menu - View commit details
-
Copy full SHA for ca21430 - Browse repository at this point
Copy the full SHA ca21430View commit details -
Apply isort and black reformatting
Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for 6741826 - Browse repository at this point
Copy the full SHA 6741826View commit details
Commits on Oct 8, 2024
-
Configuration menu - View commit details
-
Copy full SHA for 5dee79f - Browse repository at this point
Copy the full SHA 5dee79fView commit details -
Apply isort and black reformatting
Signed-off-by: WoodieDudy <WoodieDudy@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for de3835d - Browse repository at this point
Copy the full SHA de3835dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8af1241 - Browse repository at this point
Copy the full SHA 8af1241View commit details