Adding nn.Embedding layer. #406

Merged 4 commits into coreylowman:main on Jan 30, 2023
Conversation

@Narsil (Contributor) commented Jan 26, 2023

An attempt to create an nn.Embedding layer.

However, I am not able to finish the gradients part.

    {
        type Output = Tensor<Rank2<SEQ, DIM>, f32, D>;
        fn forward(&self, input: Tensor<Rank1<SEQ>, usize, D>) -> Self::Output {
            self.weight.retaped().gather(input)
        }
    }
@Narsil (Author) commented:

This is the particular line that does not seem to be working.

In other layers I found

    self.weight.retaped::<T>() // T is the tape of the input

which seems to be the trick. However, it seems to me that the weight could contain the tape, while the input cannot (since it only indexes into the weight tensor).

I'm out of ideas to make this work. Could you provide any help, @coreylowman?
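
To make the type situation concrete, here is a rough sketch (my illustration, not code from this PR; method names such as sample_normal and trace follow the dfdx API of this era and may differ on current main):

    use dfdx::prelude::*;

    fn main() {
        let dev = Cpu::default();
        // The f32 weight can carry a gradient tape in its type: trace()
        // swaps the final tape type parameter from NoneTape to OwnedTape.
        let weight: Tensor<Rank2<4, 3>, f32, _> = dev.sample_normal();
        let _weight = weight.trace();
        // The usize indices have no gradient to record, so they stay tape-free:
        let _input: Tensor<Rank1<2>, usize, _> = dev.tensor([1usize, 3]);
    }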

@Narsil (Author) commented:

I figured out a way by making GatherTo generic over the Tape too. I'm not too sure about the modifications, though.

Comment on lines 54 to 55:

    fn forward(&self, input: Tensor<Rank2<BATCH, SEQ>, usize, D, T>) -> Self::Output {
        self.weight.retaped::<T>().gather(input)
    }
@coreylowman (Owner) commented:

One trick we could do here is with SplitTape and PutTape:

Suggested change:

    -fn forward(&self, input: Tensor<Rank2<BATCH, SEQ>, usize, D, T>) -> Self::Output {
    -    self.weight.retaped::<T>().gather(input)
    +fn forward(&self, input: Tensor<Rank2<BATCH, SEQ>, usize, D, T>) -> Self::Output {
    +    let (input, tape) = input.split_tape();
    +    self.weight.clone().put_tape(tape).gather(input)

I think this should avoid the need to change select/gather?
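
Spelled out as a self-contained sketch (not the exact merged code; the struct and names here are illustrative stand-ins, the trait bounds are approximate, and the Tape trait's parameters have changed across dfdx versions):

    use dfdx::prelude::*;

    // Hypothetical stand-in for the real layer, just to show the tape flow.
    struct EmbeddingSketch<const VOCAB: usize, const DIM: usize, D: Device<f32>> {
        weight: Tensor<Rank2<VOCAB, DIM>, f32, D>,
    }

    impl<const V: usize, const M: usize, D: Device<f32>> EmbeddingSketch<V, M, D> {
        fn forward<const B: usize, const S: usize, T: Tape<f32, D>>(
            &self,
            input: Tensor<Rank2<B, S>, usize, D, T>,
        ) -> Tensor<Rank3<B, S, M>, f32, D, T> {
            // Peel the tape off the integer indices (they carry no gradient)...
            let (input, tape) = input.split_tape();
            // ...and attach it to a clone of the weight, so gather records its
            // backward op against the weight.
            self.weight.clone().put_tape(tape).gather(input)
        }
    }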

@Narsil (Author) commented:

It works!
Still somewhat mystical how the tape thing works :)

@coreylowman (Owner) commented:

Nice! Hmm, I wonder what we could do to make it easier to understand intuitively... That is a big difference from PyTorch, and while most use cases shouldn't need to touch tapes directly, it would help for understanding the internals.
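
One possible mental model, as a sketch (method names such as trace and sample_normal follow the dfdx API of this era): the tape is a value that exactly one tensor owns at a time; every op moves it from input to output while appending a backward fn, and backward() finally consumes it.

    use dfdx::prelude::*;

    fn main() {
        let dev = Cpu::default();
        let w: Tensor<Rank2<2, 3>, f32, _> = dev.sample_normal();
        // trace() attaches an OwnedTape; from here on, each op moves the
        // tape from its input to its output and records a backward fn.
        let t = w.clone().trace();
        let loss = t.square().sum();
        // backward() consumes the tape and runs the recorded ops in reverse,
        // producing a Gradients map keyed by tensor id.
        let _grads = loss.backward();
    }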

Comment on lines 54 to 55:

    fn forward(&self, input: Tensor<Rank2<BATCH, SEQ>, usize, D, T>) -> Self::Output {
        self.weight.retaped::<T>().gather(input)
    }
@coreylowman (Owner) commented:

Also, do you still need the retaped even with the modifications to gather?

@Narsil (Author) commented:

Not with the manual split_tape.
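
For reference, at the call site the merged layer reduces to a gather of rows from the weight matrix. A hypothetical usage sketch (shapes and names invented for illustration; gather and array per the dfdx API of this era):

    use dfdx::prelude::*;

    fn main() {
        let dev = Cpu::default();
        // A 4-token vocabulary embedded into 3 dimensions:
        let weight: Tensor<Rank2<4, 3>, f32, _> = dev.sample_normal();
        let ids: Tensor<Rank1<2>, usize, _> = dev.tensor([1usize, 3]);
        // gather picks rows 1 and 3 of the weight: (SEQ, DIM) = (2, 3).
        let embedded: Tensor<Rank2<2, 3>, f32, _> = weight.clone().gather(ids);
        println!("{:?}", embedded.array());
    }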

@Narsil changed the title from "[WIP] Adding nn.Embedding layer." to "Adding nn.Embedding layer." on Jan 26, 2023
@coreylowman (Owner) left a review:

Looks great, thanks for the contribution!

@coreylowman merged commit d4bc18e into coreylowman:main on Jan 30, 2023
@Narsil deleted the add_embedding branch on Jan 31, 2023
@coreylowman mentioned this pull request on Feb 7, 2023