At around 53:10 of the lecture, Andrej does a matrix multiplication with tensors of size (T, T) and (B, T, C). More precisely: (8, 8) @ (4, 8, 2).
Now, even after looking over PyTorch docs on broadcasting semantics, I'm surprised to see that this works - but sure enough, running the code produces an output of (4, 8, 2).
Can anyone explain how this broadcast works?
```
// align trailing dimensions
8, 8
4, 8, 2
// pad missing dimensions with 1
1, 8, 8
4, 8, 2
// duplicate size-1 dimensions until the shapes match
4, 8, 8
4, 8, 2
// now what???
```
I think the same reasoning also applies to images. Tensors usually have shape [Batch, Channels, Height, Width] (NCHW), and you can treat the color channels as a group of different images, i.e., another batching dimension.
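For instance (a rough sketch; the mask tensor here is purely illustrative), an (H, W) tensor broadcasts elementwise across a whole NCHW batch:

```python
import torch

images = torch.rand(4, 3, 32, 32)  # NCHW batch of images
mask = torch.rand(32, 32)          # a single (H, W) plane

# For elementwise ops, (32, 32) is padded to (1, 1, 32, 32) and then
# broadcast over both the batch and channel dimensions.
out = images * mask
print(out.shape)  # torch.Size([4, 3, 32, 32])
```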
It's using the usual broadcasting rules. If you mean whether it follows the matrix multiplication rule of having one dimension in common, then yes: the dimension in common must be the second from the right. torch.matmul treats every dimension except the last two as a batch dimension, so (8, 8) is viewed as (1, 8, 8), broadcast to (4, 8, 8), and then each (8, 8) slice is matrix-multiplied with the matching (8, 2) slice, producing (4, 8, 2).
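A quick way to verify this (a minimal sketch; the names wei and x just echo the lecture's variables):

```python
import torch

B, T, C = 4, 8, 2
wei = torch.rand(T, T)   # (8, 8)
x = torch.rand(B, T, C)  # (4, 8, 2)

# wei is viewed as (1, 8, 8), broadcast to (4, 8, 8), and the last
# two dims follow ordinary matrix multiplication: (8, 8) @ (8, 2).
out = wei @ x
print(out.shape)  # torch.Size([4, 8, 2])

# Equivalent explicit loop over the batch dimension:
manual = torch.stack([wei @ x[b] for b in range(B)])
print(torch.allclose(out, manual))  # True
```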