trl

The overall aim of this project is to create a term rewriting system that could be useful in everyday programming, and to represent data in a way that roughly corresponds to the definition of a term in formal logic. Terms should be familiar to any programmer because they are essentially constants, variables, and function symbols.

syntax-tree term-rewriting trl term-database

Updated Dec 16, 2020
C#

rasyosef / phi-1_5-instruct

Notebooks for creating an instruction-following version of Microsoft's Phi 1.5 LLM using Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO)

transformers pytorch trl llm supervised-finetuning direct-preference-optimization

Updated Aug 17, 2024

SofiaKhutsieva / LLM_experiments

Experiments with LLMs (inference, RAG, fine-tuning)

mistral peft rag trl llm langchain llamacpp

Updated Mar 23, 2024
Jupyter Notebook

trl

Here are 12 public repositories matching this topic...

jasonvanf / llama-trl

argilla-io / notus

sugarandgugu / Simple-Trl-Training

RobinSmits / Dutch-LLMs

ssbuild / llm_rlhf

rasyosef / phi-2-sft-and-dpo

SharathHebbar / sft_mathgpt2

SharathHebbar / dpo_chatgpt2

pberlandier / irl-to-bal

WCoetser / Trl.TermDataRepresentation

rasyosef / phi-1_5-instruct

SofiaKhutsieva / LLM_experiments
