You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi!
I was wondering how we could generate a longer audio sequence with the model trained to generate 10 second audio clips? Isnt the Unet architecture fixed and hence the output dimensions the same? the diffusers pipeline seems to do be changing the Unet dimensions,but then dont we need to train it again?
Thank you for your patience,
Pranav
The text was updated successfully, but these errors were encountered:
Hi!
I was wondering how we could generate a longer audio sequence with the model trained to generate 10 second audio clips? Isnt the Unet architecture fixed and hence the output dimensions the same? the diffusers pipeline seems to do be changing the Unet dimensions,but then dont we need to train it again?
Thank you for your patience,
Pranav
The text was updated successfully, but these errors were encountered: