-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add benchmark for github.com/awni/transducer #13
Comments
Sure, I will add that. One thing to note: It has a specific constraint or requirement: the jonit network consists of only an adder. It is not possible to add nn.Linear or any activation layers to it. Also, the authors don't provide any WERs for the models trained with it. |
@csukuangfj I see, interesting. The input logit is (B, T, V) shape in their implementation, it looks like a CTC logit shape. I wonder how he made use of the prediction network output? |
Actually, what |
How does the simple rnnt performs compared with a normal joiner with activations and linear transform? Is it much worse? |
I don't have such results available. You can either ask awni or do some experiments by yourself. |
github.com/awni/transducer implementation seems use much less memory and run 20x faster than torchaudio RNNT. Is that possible to add this to the benchmarking? Thanks.
The text was updated successfully, but these errors were encountered: