Releases: fkodom/yet-another-retnet
0.5.1
0.5.0
0.4.2
0.4.1
0.4.0
0.3.1
What's Changed
- bug fix: change F.relu to F.silu by @Dongyeongkim in #7
New Contributors
- @Dongyeongkim made their first contribution in #7
Full Changelog: 0.3.0...0.3.1
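For context on the activation change in #7: ReLU clamps every negative input to zero, while SiLU (`x * sigmoid(x)`) lets small negative values through. The sketch below is dependency-free and uses plain floats. The helper names `relu` and `silu` are illustrative stand-ins for `F.relu` and `F.silu`, not code from this repository.

```python
import math


def relu(x: float) -> float:
    """ReLU: max(0, x)."""
    return max(0.0, x)


def silu(x: float) -> float:
    """SiLU (also called swish): x * sigmoid(x), written as x / (1 + e^-x)."""
    return x / (1.0 + math.exp(-x))


# Compare the two activations on a few sample inputs.
for x in (-2.0, -0.5, 0.0, 0.5, 2.0):
    print(f"x={x:+.1f}  relu={relu(x):.4f}  silu={silu(x):.4f}")
```

For negative inputs `relu` returns exactly zero and `silu` returns a small negative number, so the two are not interchangeable in a model that was specified with one of them.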
0.3.0
More streamlined support for training
- example training script
- `RetNet.forward` is no longer just a wrapper for `RetNet.forward_parallel`. It accepts `inputs`, `labels` Tensors, and returns a loss value.

      class RetNet:
          ...
          def forward(self, inputs: Tensor, labels: Tensor) -> Tensor:
              pred = self.forward_parallel(inputs)
              criterion = nn.CrossEntropyLoss()
              return criterion(rearrange(pred, "b n c -> (b n) c"), labels.flatten())
- include example TorchData datapipe -- top 100 Project Gutenberg books
- example streaming text generation with trained RetNet
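To show what the loss in the new `RetNet.forward` computes, here is a rough, dependency-free sketch of the same two steps: flatten `(batch, seq_len, num_classes)` predictions to `(batch * seq_len, num_classes)`, flatten the labels to match, then average the per-token cross entropy. The `cross_entropy` helper and the toy values are assumptions for illustration. It mirrors the default mean reduction of `nn.CrossEntropyLoss` but is not the library's implementation.

```python
import math
from typing import List


def cross_entropy(logits: List[List[float]], targets: List[int]) -> float:
    """Mean negative log-likelihood over rows of unnormalized logits."""
    total = 0.0
    for row, target in zip(logits, targets):
        m = max(row)  # subtract the max for a numerically stable log-sum-exp
        log_z = m + math.log(sum(math.exp(v - m) for v in row))
        total += log_z - row[target]
    return total / len(targets)


# Toy predictions with shape (batch=2, seq_len=2, num_classes=3).
pred = [
    [[2.0, 0.0, 0.0], [0.0, 2.0, 0.0]],
    [[0.0, 0.0, 2.0], [1.0, 1.0, 1.0]],
]
# Toy labels with shape (batch=2, seq_len=2).
labels = [[0, 1], [2, 0]]

# Equivalent of rearrange(pred, "b n c -> (b n) c").
flat_pred = [token for seq in pred for token in seq]
# Equivalent of labels.flatten().
flat_labels = [t for seq in labels for t in seq]

loss = cross_entropy(flat_pred, flat_labels)
print(f"loss = {loss:.4f}")
```

Flattening treats every token position as an independent classification example, which is why the labels must have the same `(batch, seq_len)` layout as the first two dimensions of the predictions.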
0.2.0
0.1.3
Set default `layer_norm_eps=1e-6`, as updated in the official implementation: microsoft/torchscale@2c29de0
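As a reminder of what this parameter controls, the sketch below applies layer normalization to a plain list of floats. The epsilon is added to the variance before the square root so that constant inputs do not cause a division by zero. The `layer_norm` function is a simplified stand-in, without the learnable scale and bias, and is not this repository's code. For comparison, the default in PyTorch's own `nn.LayerNorm` is `1e-5`.

```python
import math
from typing import List, Sequence


def layer_norm(x: Sequence[float], eps: float = 1e-6) -> List[float]:
    """Normalize x to zero mean and (approximately) unit variance."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)  # biased variance
    inv_std = 1.0 / math.sqrt(var + eps)  # eps keeps this finite when var == 0
    return [(v - mean) * inv_std for v in x]


print(layer_norm([1.0, 2.0, 3.0]))
print(layer_norm([3.0, 3.0, 3.0]))  # constant input: all zeros, no ZeroDivisionError
```

A smaller epsilon changes the output only when the variance is close to zero, so the setting matters most for near-constant activations.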
0.1.2
Remove extra complex conjugation from the relative position embedding.
Reference: microsoft/torchscale#49
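A small numeric sketch of why a stray conjugation matters for a rotation-based relative position embedding: multiplying a rotated query by the conjugate of a rotated key gives a phase that depends only on the offset `n - m`, while conjugating the key a second time cancels the first and makes the phase depend on `n + m`. The names `rotate`, `score`, `score_extra_conj` and the value of `THETA` are assumptions made for illustration. This is not the repository's code and may not match the exact form of the conjugation that was removed.

```python
import cmath

THETA = 0.3  # illustrative rotation frequency (assumed value)


def rotate(position: int, theta: float = THETA) -> complex:
    """Unit complex number e^{i * position * theta}."""
    return cmath.exp(1j * position * theta)


def score(n: int, m: int) -> complex:
    """Query at n times conjugated key at m: phase is (n - m) * theta."""
    return rotate(n) * rotate(m).conjugate()


def score_extra_conj(n: int, m: int) -> complex:
    """Key conjugated twice, which cancels out: phase is (n + m) * theta."""
    return rotate(n) * rotate(m).conjugate().conjugate()


# Two position pairs with the same offset (2) but different absolute positions.
print(abs(score(5, 3) - score(12, 10)))                        # effectively zero
print(abs(score_extra_conj(5, 3) - score_extra_conj(12, 10)))  # clearly nonzero
```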