Add English-Spanish translation problem #1626
Thanks for your pull request. It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA). 📝 Please visit https://cla.developers.google.com/ to sign. Once you've signed (or fixed any issues), please reply here.
Done CLI
CLAs look good, thanks!
Great, thanks, looks good!
Did you manage to get good results from the models? Many thanks for contributing!
PiperOrigin-RevId: 258838381
Hello! Thanks for adding the en-es module and making it available! I ran into an issue when I tried to download the data using the 't2t-datagen' command.
It works when I use the English-to-German module, for example.
Part of the error message I get clearly shows that the Spanish module is not among the available ones at the moment (no enes module appears where it should, given that the list seems to be ordered alphabetically).
I have tried installing through 'pip install tensor2tensor' as well as 'pip install git+https://github.com/tensorflow/tensor2tensor.git', to check whether the latest changes had simply not yet made it into the official release.
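The alphabetical check described above can be sketched in a few lines. This is only an illustration, not part of tensor2tensor: the problem names are placeholder examples standing in for the list printed in the error message, and `find_in_sorted` is a hypothetical helper.

```python
import bisect


def find_in_sorted(names, target):
    """Return (found, index) for target in an alphabetically sorted list.

    index is the position target occupies, or would occupy if it were present.
    """
    i = bisect.bisect_left(names, target)
    found = i < len(names) and names[i] == target
    return found, i


# Placeholder excerpt of a sorted problem list (assumed names, for illustration).
available = sorted([
    "translate_encs_wmt32k",
    "translate_ende_wmt32k",
    "translate_enfr_wmt32k",
    "translate_enzh_wmt32k",
])

# Assumed name for the English-Spanish problem.
found, slot = find_in_sorted(available, "translate_enes_wmt32k")

if not found:
    before = available[slot - 1] if slot > 0 else None
    after = available[slot] if slot < len(available) else None
    print(f"not registered; expected between {before} and {after}")
```

If the name is missing from the slot where it would sort, the installed package most likely predates the change that registers the problem.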
Hi!
I was able to successfully add and use an English-Spanish translation problem, using Common Crawl, EuroParl v7, UN Corpus and ParaCrawl as my datasets.
I trained both an EN-ES and an ES-EN Transformer model (using the transformer_big_single_gpu hyperparameter set) with this new problem specification.