Reconstructions sound very similar even with different training files #31
-
Hi all! I'm trying since a little while to get meaningful output training the model and using the reconstruction script. The timbres of the output files generated by reconstruction.py all sound quite similar (kind of metallic), even with checkpoints generated from different types of sounds. I originally thought this made sense as my first checkpoints were generated from drum machine hi-hat sounds. But then I got very similar results reconstructing the same audio with checkpoints from a synthesizer recording with a different timbre. attaching the dropbox folder here with inputs and ouputs... Maybe this phenomenon sounds familiar to somebody? The one thing I can guess is that I didn't provide enough training source material in minutes of audio, will be trying that now.. https://www.dropbox.com/scl/fo/t5m3lpc3gryhiqycxact4/h?dl=0&rlkey=3vmpyewf1nrqk3e8dqjojxptl |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
Hi ! I think you are still at stage 1 of training, you should wait until stage 2 to start having nicely sounding sounds (see the article for more info) ! Another problem could be your dataset size, I've read somewhere that it's about 15mn long which is far from enough ! Something like ~2hours would make a better fit :) |
Beta Was this translation helpful? Give feedback.
Hi ! I think you are still at stage 1 of training, you should wait until stage 2 to start having nicely sounding sounds (see the article for more info) ! Another problem could be your dataset size, I've read somewhere that it's about 15mn long which is far from enough ! Something like ~2hours would make a better fit :)