-
MosaicML
- San Francisco
- @abhi_venigalla
-
composer Public
Forked from mosaicml/composerComposing methods for ML training efficiency
Python Apache License 2.0 UpdatedApr 29, 2024 -
transformers Public archive
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
-
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedDec 27, 2023 -
aws-neuron-samples Public
Forked from aws-neuron/aws-neuron-samplesExample code for AWS Neuron SDK developers building inference and training applications
Jupyter Notebook Other UpdatedNov 8, 2023 -
llm-foundry Public
Forked from mosaicml/llm-foundryLLM training code for MosaicML foundation models
Python Apache License 2.0 UpdatedNov 3, 2023 -
tutel Public
Forked from microsoft/TutelTutel MoE: An Optimized Mixture-of-Experts Implementation
Python MIT License UpdatedJan 13, 2023 -
mosaicml-benchmarks Public
Forked from stanford-crfm/mosaicml-benchmarksFast and flexible reference benchmarks
Python Apache License 2.0 UpdatedDec 14, 2022 -
mistral Public
Forked from stanford-crfm/mistralMistral: A strong, northwesterly wind: Framework for transparent and accessible large-scale language model training, built with Hugging Face 🤗 Transformers.
Python Apache License 2.0 UpdatedNov 4, 2022 -
Megatron-LM Public
Forked from ngoyal2707/Megatron-LMOngoing research training transformer language models at scale, including: BERT & GPT-2
Python Other UpdatedNov 1, 2022 -
DeepSpeed Public
Forked from microsoft/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python MIT License UpdatedMay 24, 2022 -
datasets Public
Forked from huggingface/datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Python Apache License 2.0 UpdatedFeb 17, 2022