-
Notifications
You must be signed in to change notification settings - Fork 816
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Proposal to integrate into 🤗 Hub #555
Proposal to integrate into 🤗 Hub #555
Conversation
@patrickvonplaten Thank you so much, this is a really great and useful feature :D. I have learned a lot from the huggingface transformers repo and as you can see, our repo has the same structure as the transformers repo then it would easily to integrated with HuggingFace_hub. I'm on a vacation and will be back in a few days. :D. |
Fix AutoProcessor
@patrickvonplaten Merged :D. Can you tell me what is a next steps ? |
Hey @dathudeptrai, Awesome to see that the PR is merged 🥳 In a next step, I think we can create an organization on the hub, here: https://huggingface.co/organizations/new (maybe called Also cc @julien-c , @osanseviero |
@patrickvonplaten I just added |
@dathudeptrai thank you for creating the org! That's awesome. There are some additional steps in our side. The two main things missing, I think, are:
@dathudeptrai something that could be interesting is to implement a |
@dathudeptrai by looking at the examples and familiarizing myself with the library, I was wondering if you would have an idea of the example code snippet that will be shown to the users. From what I see, there are two open questions:
Thank you for the library! I've been playing with it and it's awesome! |
Yes, almost TTS model now is 2 stages (text2mel and mel2wav). We can combine into one end2end model for the inference stage :D.
Unlike transformers for NLP where the input is almost the same, the text2mel's inputs are varied, they can add more input such as speaker_ids (for multi-speakers), language_ids (for multi-lingual), speaker_embeddings (for voice clone), style embedding (for emotional TTS) and some inputs to adjust speed, f0, energy ... But generally, we only need 2 inputs (input_ids and speaker_ids) :D. |
Hi @dathudeptrai. We got some exciting news! Last week our team worked on open-sourcing the code for adding code snippets as well as running the inference API for other libraries. This is in the huggingface_hub repo. This PR adds the code snippet as we discussed :) your users will already benefit from being able to search for all TensorFlowTTS models. |
@osanseviero Awesome! :D. I'm uploading all our models to https://huggingface.co/tensorspeech, will add a model card soon :D |
Awesome! I'm looking forward to see this :) As a tip, you can use different tags |
Does the tflite model available for ESPNET model https://huggingface.co/julien-c/kan-bayashi-jsut_tts_train_tacotron2_ja? |
Hi TensorSpeech team! I hereby propose an integration with the HuggingFace model hub 🤗
This integration would allow you to freely download/upload models from/to the Hugging Face Hub: https://huggingface.co/.
Your users could then directly download model weights, etc within Python without having to manually downloads weights.
Taking your fastspeech_2_inference.ipynb example the following diff would show the code could change to be able to directly download weights from the model hub.
As an example, I uploaded a fastspeech model to this repo of the HF hub:
I uploaded some weights exemplary to the hub here: https://huggingface.co/patrickvonplaten/tf_tts_fast_speech_2.
If you'd like to add this feature to your library we would obviously change the organization name from
patrickvonplaten
totensorspeech
.You can try it out by running the following code:
Besides freely storing your model weights, we also provide git version control and download statistics for your models :-) We can also provide you with a hosted inference API where users could try out your models directly on the website.
We've already integrated with a couple of other libraries - you can check them out here:
Sorry for the missing tests in the PR - I just did the minimal changes to showcase you how the integration with the HF hub could look like :-) I'd also be more than happy to add you guys to a Slack channel where we could discuss further.
Cheers,
Patrick & Hugging Face team
Also cc @julien-c