Shifting labels to enable learning next token prediction #10

LiorZ · 2023-04-10T08:57:09Z

Dear lucidrains,
Many thanks for your amazing contribution to the community.
In the following pull request I modified the code so that the labels to the right during training before calculating the loss to enable next-token prediction (otherwise, the loss is calculated for the token, which leads to learning the trivial, current token)

L

…prediction

lucidrains · 2023-10-12T15:43:18Z

@LiorZ hey Lior

it should be shifted already https://github.com/lucidrains/perceiver-ar-pytorch/blob/main/perceiver_ar_pytorch/autoregressive_wrapper.py#L83

Shifting labels one place to the right to enable learning next token …

d90bd8f

…prediction

LiorZ changed the title ~~Shifting labels one place to the right to enable learning next token prediction~~ Shifting labels to enable learning next token prediction Apr 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shifting labels to enable learning next token prediction #10

Shifting labels to enable learning next token prediction #10

LiorZ commented Apr 10, 2023

lucidrains commented Oct 12, 2023

Shifting labels to enable learning next token prediction #10

Are you sure you want to change the base?

Shifting labels to enable learning next token prediction #10

Conversation

LiorZ commented Apr 10, 2023

lucidrains commented Oct 12, 2023