KTO loss #410

vulkomilev · 2024-11-27T20:00:35Z

Summary

This is the kto loss implemented by references from other projects

Details

I am not sure about the correctness (because this is my first PR) of the final results so I expect a lot of comments

Testing Done

I have done the basic testing inspired from cpo

pramodith · 2024-11-29T19:01:57Z

test/chunked_loss/test_cpo_loss.py

@@ -126,7 +126,7 @@ def test_correctness(
        input1, weight1, target, bias1, alpha=alpha
    )
    loss2 = LigerFusedLinearCPOFunction.apply(
-        input2, weight2, target, bias2, ignore_index, beta, alpha, True
+        input2, weight2, target, bias2, ignore_index, beta, alpha, False


Why are we changing the test case for an unrelated alignment algo?

Sorry my bad.

pramodith

Hey, I think this code needs to be refactored to make things a bit cleaner and easier to understand. Could you also write out the equations for KTO in the description to the PR so that its easier for a reviewer to understand?

pramodith · 2024-11-29T19:04:00Z

src/liger_kernel/chunked_loss/fused_linear_preference_kto.py

+from torch.nn import functional as F
+
+
+class LigerFusedLinearKTOPreferenceBase(torch.autograd.Function):


Why is this class needed, can't you reuse https://github.com/linkedin/Liger-Kernel/blob/main/src/liger_kernel/chunked_loss/fused_linear_preference.py?

I am getting this error
E RuntimeError: CUDA error: device-side assert triggered E CUDA kernel errors might be asynchronously reported at some other API call, so the stacktrace below might be incorrect. E For debugging consider passing CUDA_LAUNCH_BLOCKING=1 E Compile with TORCH_USE_CUDA_DSA` to enable device-side assertions.

src/liger_kernel/chunked_loss/fused_linear_preference.py:210: RuntimeError
---------------------------------------------------------------------------------------------------------------- Captured stdout call -----------------------------------------------------------------------------------------------------------------

---------------------------------------------------------------------------------------------------------------- Captured stderr call -----------------------------------------------------------------------------------------------------------------
NoneType: None
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [6,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [7,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [12,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [83,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [32,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [43,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [54,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [59,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
../aten/src/ATen/native/cuda/ScatterGatherKernel.cu:144: operator(): block: [0,0,0], thread: [62,0,0] Assertion idx_dim >= 0 && idx_dim < index_size && "index out of bounds" failed.
=============================================================================================================== short test summary info ===============================================================================================================
FAILED test/chunked_loss/test_kto_loss.py::test_correctness[-100-0.1-1.0-False-1.0-dtype0-0.005-0.005-3-47-31-123] - RuntimeError: CUDA error: device-side assert triggered
================================================================================================================== 1 failed in 1.86s ============================================`

I will do the equations and the formatting. Also I need two arguments 'reference_chosen_logps' and 'reference_rejected_logps' to my custom loss function.

I'm a bit confused do you still need this file? The base classes abstract preference_loss_fn does accept those two arguments, you can set beta=0 if it's not needed.

In case you need a completely new function signature, my advice would be to add a new overloaded function in the existing base class.

vulkomilev · 2024-12-04T22:30:21Z

Okay code formatted and comment about the source of the loss added

pramodith

@vulkomilev can you please make sure that all unsued code is deleted and can you also confirm if

make checkstyle
make test works?

It'd be great if you can add the equations of KTO in the PRs description similar to #386

vulkomilev · 2024-12-06T22:11:01Z

make checkstyle and make test works now.The commented code was removed and I have added the formula in kto_loss.py but I am not sure about the formmating

pramodith · 2024-12-11T22:26:11Z

@ByronHsu @shivam15s could either of you please take over reviewing this PR, have to switch my focus to other stuff.

vulkomilev added 3 commits November 21, 2024 00:07

working on tests

ef59f91

test are working but I have problem with assertions

b053b0c

basic test working

2461a33

vulkomilev mentioned this pull request Nov 28, 2024

[RFC] Liger FlexChunkLoss: Alignment and Distillation loss #371

Open

12 tasks

pramodith reviewed Nov 29, 2024

View reviewed changes

pramodith requested changes Nov 29, 2024

View reviewed changes

vulkomilev added 2 commits December 2, 2024 21:57

returned to fused loss

5deb7f9

fromated code and addded source for kto

cf0c3eb

pramodith requested changes Dec 5, 2024

View reviewed changes

checkstyles tests and formula done

440241c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KTO loss #410

KTO loss #410

vulkomilev commented Nov 27, 2024

pramodith Nov 29, 2024

vulkomilev Nov 30, 2024

pramodith left a comment

pramodith Nov 29, 2024

vulkomilev Dec 1, 2024

vulkomilev Dec 1, 2024

pramodith Dec 5, 2024

vulkomilev commented Dec 4, 2024

pramodith left a comment

vulkomilev commented Dec 6, 2024

pramodith commented Dec 11, 2024

		from torch.nn import functional as F


		class LigerFusedLinearKTOPreferenceBase(torch.autograd.Function):

KTO loss #410

Are you sure you want to change the base?

KTO loss #410

Conversation

vulkomilev commented Nov 27, 2024

Summary

Details

Testing Done

pramodith Nov 29, 2024

Choose a reason for hiding this comment

vulkomilev Nov 30, 2024

Choose a reason for hiding this comment

pramodith left a comment

Choose a reason for hiding this comment

pramodith Nov 29, 2024

Choose a reason for hiding this comment

vulkomilev Dec 1, 2024

Choose a reason for hiding this comment

vulkomilev Dec 1, 2024

Choose a reason for hiding this comment

pramodith Dec 5, 2024

Choose a reason for hiding this comment

vulkomilev commented Dec 4, 2024

pramodith left a comment

Choose a reason for hiding this comment

vulkomilev commented Dec 6, 2024

pramodith commented Dec 11, 2024