Pull requests: microsoft/DeepSpeed

Pull requests list

Explicitly use the linalg.vector_norm call in comm/
#6960 opened Jan 17, 2025 by loadams
Pin numpy version
#6953 opened Jan 15, 2025 by BLOrange-AMD
Set dataloader shuffle=true
#6950 opened Jan 14, 2025 by loadams Draft
1 task
Update torch.norm to torch.linalg.norm and torch.linalg.vector_norm
#6931 opened Jan 8, 2025 by loadams
3 of 4 tasks
Autotp training
#6922 opened Jan 2, 2025 by inkcherry
fix: RuntimeError for UCP large DP
#6918 opened Dec 29, 2024 by saforem2
Tecorigin sdaa accelerator
#6903 opened Dec 23, 2024 by siqi654321
Use ds-specific module id to avoid conflicts
#6847 opened Dec 10, 2024 by tjruwase
Update MII tests to support transformers latest
#6686 opened Oct 29, 2024 by loadams
modify_load_save_model
#6626 opened Oct 15, 2024 by ssklzx
Improve consistency of zero_grad
#6554 opened Sep 18, 2024 by tohtana Draft
[NaN check] Add NaN check to support bfloat16.
#5879 opened Aug 8, 2024 by ys950902
Fix circular import in ds_transformer.py
#5804 opened Jul 28, 2024 by sznmelvin
Hybrid Offloading for ZeRO3
#5625 opened Jun 7, 2024 by tohtana Draft