Releases: fkodom/yet-another-retnet

0.5.1

15 Nov 14:50
34cc30d

What's Changed

  • Bug fix: decay mask for bf16, bf32 by @fkodom in #25
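
For context, the decay mask is built from powers of a per-head decay gamma, and those powers can lose precision or underflow when the mask is constructed directly in a reduced-precision dtype. Below is a hypothetical single-head sketch of the usual mitigation (build the mask in float32, cast at the end), not necessarily the exact change made in #25:

    import torch

    def decay_mask(seq_len: int, gamma: float, dtype: torch.dtype) -> torch.Tensor:
        # D[n, m] = gamma ** (n - m) for n >= m, else 0 (causal decay mask).
        # Compute in float32 so small powers of gamma keep some resolution,
        # then cast to the working dtype (e.g. torch.bfloat16) at the end.
        idx = torch.arange(seq_len, dtype=torch.float32)
        distance = idx.unsqueeze(1) - idx.unsqueeze(0)  # n - m
        powers = gamma ** distance.clamp(min=0.0)
        mask = torch.where(distance >= 0, powers, torch.zeros_like(powers))
        return mask.to(dtype)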

Full Changelog: 0.5.0...0.5.1

0.5.0

11 Nov 15:09
588bf7b

Significant efficiency improvements to the chunkwise formulation, thanks to @leor-c 🎉

What's Changed

  • A more efficient computation of the state in the chunkwise formulation by @leor-c in #22
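
For background, the chunkwise formulation carries a recurrent cross-chunk state per retention head. Below is a single-head sketch of the standard state recurrence from the RetNet paper, S_i = gamma^B * S_{i-1} + K_i^T (V_i * gamma^(B-1-j)); it illustrates what the state is, not the specific optimization merged in #22:

    import torch
    from torch import Tensor

    def chunkwise_state_update(prev_state: Tensor, keys: Tensor, values: Tensor, gamma: float) -> Tensor:
        # prev_state: (d_k, d_v) recurrent state carried over from the previous chunk.
        # keys: (B, d_k), values: (B, d_v) for the current chunk of length B.
        # gamma: per-head decay in (0, 1).
        chunk_size = keys.shape[0]
        # Earlier positions in the chunk have decayed more by the end of the chunk.
        within_decay = gamma ** torch.arange(
            chunk_size - 1, -1, -1, dtype=keys.dtype, device=keys.device
        )
        return (gamma ** chunk_size) * prev_state + keys.T @ (values * within_decay[:, None])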

Full Changelog: 0.4.2...0.5.0

0.4.2

09 Nov 14:14
7d9c1a7

What's Changed

  • Slightly more efficient / cleaner implementation of the chunkwise relative pos. enc. by @leor-c in #21

New Contributors

  • @leor-c made their first contribution in #21

Full Changelog: 0.4.1...0.4.2

0.4.1

09 Nov 13:46
3cf9797

What's Changed

Full Changelog: 0.4.0...0.4.1

0.4.0

28 Sep 15:40
c0c4327

What's Changed

  • Fixed an issue where the dimensions of the group norm were incorrect by @draguve in #11

New Contributors

  • @draguve made their first contribution in #11

Full Changelog: 0.3.1...0.4.0

0.3.1

15 Aug 13:35
505dff7

What's Changed

New Contributors

Full Changelog: 0.3.0...0.3.1

0.3.0

11 Aug 17:16
ee3979c

More streamlined support for training

  • Example training script
  • RetNet.forward is no longer just a wrapper around RetNet.forward_parallel. It now accepts inputs and labels Tensors and returns a loss value (a training-step sketch follows this list):
    from einops import rearrange
    from torch import Tensor, nn

    class RetNet(nn.Module):
        ...
        def forward(self, inputs: Tensor, labels: Tensor) -> Tensor:
            # Run the parallel (training-time) formulation, then compute
            # token-level cross-entropy against the flattened labels.
            pred = self.forward_parallel(inputs)
            criterion = nn.CrossEntropyLoss()
            return criterion(rearrange(pred, "b n c -> (b n) c"), labels.flatten())
  • Include an example TorchData datapipe -- the top 100 Project Gutenberg books
  • Example of streaming text generation with a trained RetNet
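
A minimal training-step sketch using the new forward signature. The import path follows the package layout, but the constructor arguments (num_tokens, d_model, nhead, num_layers) and hyperparameter values are illustrative assumptions — check the actual RetNet signature:

    import torch
    from yet_another_retnet.retnet import RetNet

    # Hypothetical hyperparameters -- check the real constructor signature.
    retnet = RetNet(num_tokens=10_000, d_model=256, nhead=4, num_layers=2)
    optimizer = torch.optim.AdamW(retnet.parameters(), lr=3e-4)

    inputs = torch.randint(0, 10_000, (8, 128))  # (batch, seq_len) token ids
    labels = torch.randint(0, 10_000, (8, 128))  # typically inputs shifted by one

    loss = retnet(inputs, labels)  # forward now returns the cross-entropy loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()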

0.2.0

10 Aug 16:57
3da9e6c

What's Changed

Full Changelog: 0.1.3...0.2.0

0.1.3

10 Aug 14:22

Set default layer_norm_eps=1e-6, as updated in the official implementation:
microsoft/torchscale@2c29de0
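
For reference, layer_norm_eps is the epsilon added to the variance inside LayerNorm for numerical stability. A standalone PyTorch illustration of the value (not the library's internal code):

    import torch.nn as nn

    # eps is added to the variance before the square root; 1e-6 matches the
    # updated default in microsoft/torchscale.
    norm = nn.LayerNorm(512, eps=1e-6)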

0.1.2

08 Aug 02:45

Remove extra complex conjugation from the relative position embedding.
Reference: microsoft/torchscale#49
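
For context, the relative position embedding rotates queries by e^(+i*n*theta) and keys by e^(-i*m*theta) so that their product depends only on n - m; the conjugation should therefore appear exactly once. A hypothetical single-head sketch of that idea (illustrative, not the library's actual code):

    import torch
    from torch import Tensor

    def rotate_queries_and_keys(q: Tensor, k: Tensor, theta: Tensor) -> tuple[Tensor, Tensor]:
        # q, k: (seq_len, dim // 2) complex-valued projections.
        # theta: (dim // 2,) rotation frequencies.
        positions = torch.arange(q.shape[0], device=q.device)
        angles = positions[:, None] * theta[None, :]
        rotation = torch.polar(torch.ones_like(angles), angles)  # e^(i * n * theta)
        # Conjugate the keys once here; the downstream q @ k.T product uses a plain
        # transpose, so the scores pick up the relative factor e^(i * (n - m) * theta).
        return q * rotation, k * rotation.conj()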