LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
An open-source knowledgeable large language model framework.
Best practice for training LLaMA models in Megatron-LM
Guide: Finetune GPT2-XL (1.5 billion parameters) and GPT-Neo (2.7B) on a single GPU with Hugging Face Transformers using DeepSpeed
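Single-GPU finetuning of models this size typically relies on a DeepSpeed ZeRO configuration with optimizer offload. A minimal sketch of such a config, built as a Python dict (illustrative values only, not taken from the guide above):

```python
import json

# Hypothetical DeepSpeed config for memory-constrained single-GPU finetuning:
# ZeRO stage 2 partitions optimizer states and gradients, and offload_optimizer
# moves optimizer states to CPU RAM to fit a large model on one GPU.
ds_config = {
    "train_micro_batch_size_per_gpu": 1,
    "gradient_accumulation_steps": 8,
    "fp16": {"enabled": True},
    "zero_optimization": {
        "stage": 2,
        "offload_optimizer": {"device": "cpu"},
    },
}

# The dict would normally be written to a JSON file and passed to the
# deepspeed launcher or to Hugging Face Trainer via the `deepspeed` argument.
config_json = json.dumps(ds_config, indent=2)
```

Exact batch sizes and the choice of ZeRO stage depend on the model and GPU memory; this is only a starting point.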
Collaborative Training of Large Language Models in an Efficient Way
Best practices & guides on how to write distributed PyTorch training code
LLaMA 2 finetuning with DeepSpeed and LoRA
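The LoRA technique these repositories apply freezes the pretrained weight matrix W and learns only a low-rank update scaled by alpha/r. A toy pure-Python sketch of the idea (hypothetical example, not code from any repo listed here):

```python
# Minimal LoRA-style linear layer: y = x @ (W + (alpha/r) * A @ B),
# where only the small matrices A (d_in x r) and B (r x d_out) are trained.
def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

class LoRALinear:
    def __init__(self, W, A, B, alpha=1.0):
        self.W, self.A, self.B = W, A, B
        self.r = len(B)                  # rank = number of rows of B
        self.scale = alpha / self.r

    def forward(self, x):
        delta = matmul(self.A, self.B)   # low-rank update, d_in x d_out
        W_eff = [[w + self.scale * d for w, d in zip(rw, rd)]
                 for rw, rd in zip(self.W, delta)]
        return matmul(x, W_eff)
```

In real finetuning the frozen base weights live on the GPU in the original model, and a library such as peft injects the A/B pairs into attention projections; DeepSpeed then only needs optimizer state for the small adapter parameters.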
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
DeepSpeed tutorials, annotated examples, and study notes (efficient training of large models)
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
Simple and efficient RevNet library for PyTorch with XLA and DeepSpeed support and parameter offload
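RevNet-style layers save memory because each block's inputs can be recomputed from its outputs, so activations need not be stored for backprop. A minimal sketch of one reversible coupling block (toy illustration, not this library's API):

```python
# Reversible residual block: split the activation into (x1, x2), then
#   y1 = x1 + f(x2)
#   y2 = x2 + g(y1)
# The inverse recovers (x1, x2) exactly from (y1, y2), so intermediate
# activations can be discarded and rebuilt during the backward pass.
def rev_forward(x1, x2, f, g):
    y1 = x1 + f(x2)
    y2 = x2 + g(y1)
    return y1, y2

def rev_inverse(y1, y2, f, g):
    x2 = y2 - g(y1)
    x1 = y1 - f(x2)
    return x1, x2
```

Here f and g stand in for arbitrary sub-networks; the same algebra holds for tensors.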
Train LLMs (BLOOM, LLaMA, Baichuan2-7B, ChatGLM3-6B) with DeepSpeed pipeline mode. Faster than ZeRO/ZeRO++/FSDP.
llm-inference is a platform for publishing and managing LLM inference, providing a wide range of out-of-the-box features for model deployment, such as a UI, RESTful API, auto-scaling, computing resource management, monitoring, and more.
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca
A toy large model for recommender systems based on LLaMA2/SASRec/Meta's generative recommenders. Also includes notes and experiments on the official implementation of Meta's generative recommenders.