transducer #21

Open
BestSonny opened this issue Feb 27, 2018 · 5 comments

Comments

@BestSonny

@awni In the inference code, are you still using ground-truth labels during the testing phase? This confuses me, since we do not have ground truth when applying the model to unseen data. Or do you just feed a dummy input (such as a batch_size x 1 tensor of all-zero labels) in the testing phase?

Also, will you maintain this excellent project in the future?

Thank you very much.

@awni
Owner

awni commented Feb 27, 2018

Yes, great point. In the committed code, the ground-truth labels are being used for the transducer at inference time. I have some code that does this properly using the inferred output, but I haven't had a chance to clean it up and commit it yet. That's why I haven't posted "transducer" results in the README yet.

awni closed this as completed Feb 27, 2018

@awni
Owner

awni commented Feb 27, 2018

And yes, I will continue to support this project, though any collaborators would be very welcome!

@BestSonny
Author

@awni Have you made any progress on cleaning up the decoder?

@yjchun

yjchun commented Mar 10, 2018

Just for reference, here is my inference code:
https://gist.github.com/yjchun/aae0b86f89ffb46a54c28e7747124ee4

It may not be useful as is, because it relies on my model class's internal logic.
The problem is that inference is awfully slow: it takes 3 seconds for 10 seconds of audio with 400 hidden units. 161 chars = at least 161 prediction + joint network calls.
I had to read Graves's paper inside and out for a few days to write this piece of code.

And thank you @awni for sharing your transducer loss implementation which was very helpful.
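
For readers following the thread, here is a minimal sketch of greedy transducer decoding that feeds the model's own previous output back into the prediction network instead of ground-truth labels. It is not the code from the gist or from this repository: `ToyTransducer`, `predict`, `joint`, `greedy_decode`, and `max_symbols_per_frame` are hypothetical names, the weights are random, and blank is assumed to be index 0. The call counters illustrate the cost mentioned above: each emitted symbol needs one prediction-network call and at least one joint-network call.

```python
import math
import random

BLANK = 0  # assumption: index 0 is the blank symbol


class ToyTransducer:
    """Hypothetical stand-in for a trained model; weights are random."""

    def __init__(self, vocab_size=5, hidden=4, seed=0):
        rng = random.Random(seed)
        self.hidden = hidden
        self.embed = [[rng.uniform(-1, 1) for _ in range(hidden)]
                      for _ in range(vocab_size)]
        self.out = [[rng.uniform(-1, 1) for _ in range(hidden)]
                    for _ in range(vocab_size)]
        self.predict_calls = 0
        self.joint_calls = 0

    def predict(self, prev_label, state):
        # Prediction network: toy recurrent update from the previous label.
        self.predict_calls += 1
        return [math.tanh(0.5 * s + e)
                for s, e in zip(state, self.embed[prev_label])]

    def joint(self, enc_t, pred_state):
        # Joint network: combine one encoder frame with the prediction state
        # and return unnormalized scores over the vocabulary.
        self.joint_calls += 1
        h = [math.tanh(a + b) for a, b in zip(enc_t, pred_state)]
        return [sum(w * x for w, x in zip(row, h)) for row in self.out]


def greedy_decode(model, encoder_out, max_symbols_per_frame=3):
    """Greedy decoding with no ground-truth labels involved."""
    # Prime the prediction network with blank as a start-of-sequence token.
    pred_state = model.predict(BLANK, [0.0] * model.hidden)
    hyp = []
    for enc_t in encoder_out:
        # Emit labels on this frame until blank wins or the cap is reached.
        for _ in range(max_symbols_per_frame):
            scores = model.joint(enc_t, pred_state)
            k = max(range(len(scores)), key=scores.__getitem__)
            if k == BLANK:
                break  # blank: move on to the next encoder frame
            hyp.append(k)
            # Feed the inferred label back in, not a reference label.
            pred_state = model.predict(k, pred_state)
    return hyp


rng = random.Random(1)
encoder_out = [[rng.uniform(-1, 1) for _ in range(4)] for _ in range(6)]
model = ToyTransducer()
hyp = greedy_decode(model, encoder_out)
print(hyp, model.predict_calls, model.joint_calls)
```

In this sketch the prediction network runs once per emitted label (plus once for the start token), so the number of sequential calls grows with the output length. The per-frame cap is an illustrative safeguard against looping on a single frame, not something taken from the thread.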

@Duum

Duum commented May 24, 2018

Here is my implementation of a greedy decoder.
You may need to make some changes for your LSTM decoder network.
