Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for the new 450 language translation models from Google T5X "madlad" - apparently Apache-2 #4316

Closed
cmp-nct opened this issue Dec 4, 2023 · 8 comments
Labels
enhancement New feature or request stale

Comments

@cmp-nct
Copy link
Contributor

cmp-nct commented Dec 4, 2023

Example: https://huggingface.co/jbochi/madlad400-3b-mt/tree/main
In Googles own space: https://huggingface.co/google/madlad400-10b-mt

The guy converted the format of the 3 smallest models (3b,7b,10b) to HF transformers. Given the severe lack in non english output a good translation model would be a gift.
I just tried the CPU demo of the 3B, it produced quite good output, if that gets better with 7B+ it would be a real solution for a huge amount of people.
It could be added as a 2nd stage into llama.cpp

Though the architecture is "T5ForConditionalGeneration" which isn't supported.

So far there was no urgent reason to add those T5 models, they did not stick out as special but the idea to output text in every single language worldwide .. that would be remarkable

@cmp-nct cmp-nct added the enhancement New feature or request label Dec 4, 2023
@cmp-nct cmp-nct closed this as completed Dec 4, 2023
@JettScythe
Copy link

@cmp-nct can you please link to the PR where this was completed?

@cmp-nct
Copy link
Contributor Author

cmp-nct commented Jan 5, 2024

@cmp-nct can you please link to the PR where this was completed?

I’ve closed it as not important. Given the many things we need done and the almost zero interest for translation here.
The model from Google was ok but it was quite flawed when testing it in depth.

@vasicvuk
Copy link

This would be great if supported +1. @cmp-nct maybe you can reopen the issue so the support for this models is planned

@sorasoras
Copy link

That's interesting to me at least.

@easp
Copy link

easp commented Feb 29, 2024

Perhaps relevant: #5763

@MathiasSchindler
Copy link

@cmp-nct can you please link to the PR where this was completed?

I’ve closed it as not important. Given the many things we need done and the almost zero interest for translation here. The model from Google was ok but it was quite flawed when testing it in depth.

madlad400 is remarkable in the sense that it has the most permissive license in comparison to meta's nllb200 or seamless communication. The translation quality varies a lot between the 419 listed languages, with some language combinations producing a very decent translation quality. Some language pairs seem to be heavily influenced by the datasets from the European Parliament (with sometimes cute and hilarius translation mistakes coming from the European Parliament debates). I have heard from other users that some languages in the Indic family and Chinese family should not be used at the moment.

I have a huge interest if someone could support it.

@github-actions github-actions bot added the stale label Apr 6, 2024
Copy link
Contributor

This issue was closed because it has been inactive for 14 days since being marked as stale.

@Nikola-Milovic
Copy link

Surprising lack of interest for translational models, any reasons why?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request stale
Projects
None yet
Development

No branches or pull requests

7 participants