You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello! I'm delighted to come across this remarkable project, and thanks for sharing it as an open-source project. Currently, my focus lies on fine-tuning the freevc-s model using pretrained checkpoints as the foundation, specifically on a Hindi dataset. While I've achieved impressive results in seen-to-seen and unseen-to-seen tasks, with a remarkable 95% match, I'm eager to enhance the performance in the seen-to-unseen task. Presently, I'm encountering a moderate 60% match when working with the reference speaker for unseen-to-unseen and seen-to-unseen tasks. I would greatly appreciate any insights or suggestions you have to improve these results further.
The text was updated successfully, but these errors were encountered:
Hello! I'm delighted to come across this remarkable project, and thanks for sharing it as an open-source project. Currently, my focus lies on fine-tuning the freevc-s model using pretrained checkpoints as the foundation, specifically on a Hindi dataset. While I've achieved impressive results in seen-to-seen and unseen-to-seen tasks, with a remarkable 95% match, I'm eager to enhance the performance in the seen-to-unseen task. Presently, I'm encountering a moderate 60% match when working with the reference speaker for unseen-to-unseen and seen-to-unseen tasks. I would greatly appreciate any insights or suggestions you have to improve these results further.
Hi @MuruganR96 , I want to do what you did and fine-tune FreeVC on a non-English dataset. Your results of 95% match on seen-to-seen would be perfect for my use case. Can you please provide guidance or share your code?
Hello! I'm delighted to come across this remarkable project, and thanks for sharing it as an open-source project. Currently, my focus lies on fine-tuning the freevc-s model using pretrained checkpoints as the foundation, specifically on a Hindi dataset. While I've achieved impressive results in seen-to-seen and unseen-to-seen tasks, with a remarkable 95% match, I'm eager to enhance the performance in the seen-to-unseen task. Presently, I'm encountering a moderate 60% match when working with the reference speaker for unseen-to-unseen and seen-to-unseen tasks. I would greatly appreciate any insights or suggestions you have to improve these results further.
The text was updated successfully, but these errors were encountered: