Make MultiHeadAttention
to work with all attention operators
#336
Loading
MultiHeadAttention
to work with all attention operators
#336