Skip to content
@OpenRLHF

OpenRLHF

Open-sourced Reinforcment Learning from Human Feedback

Popular repositories Loading

  1. OpenRLHF OpenRLHF Public

    An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

    Python 3.2k 301

  2. OpenRLHF-Docs OpenRLHF-Docs Public

    2 1

Repositories

Showing 2 of 2 repositories
  • OpenRLHF-Docs Public
    OpenRLHF/OpenRLHF-Docs’s past year of commit activity
    2 1 0 0 Updated Dec 19, 2024
  • OpenRLHF Public

    An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

    OpenRLHF/OpenRLHF’s past year of commit activity
    Python 3,231 Apache-2.0 301 62 6 Updated Dec 19, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Python

Most used topics

Loading…