Hey, thanks for this cool diffusion repo, it's really interesting.
I am playing around with the model and trying to use it to generate other, non-audio distributions. My input is a random tensor of shape (1, 18, 512). However, when I train the model and then run inference, two problems appear: the training loss doesn't converge, and the outputs don't match the original distribution.
The training code is as follows:
```python
for epoch in range(epochs):
    epoch_loss = 0.0
    for i, x in enumerate(train_loader):
        optimizer.zero_grad()
        loss = diffusion(x)  # the diffusion wrapper returns the training loss
        loss.backward()
        optimizer.step()
        # scheduler.step()
        epoch_loss += loss.detach().cpu().item()
```
Any suggestions for how I can get the loss to converge?
My optimizer is RMSprop and the learning rate is 1e-4.
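For context, a minimal, self-contained sketch of my setup (here `torch.nn.Linear` is just a stand-in for the actual diffusion model, and the random tensor mimics my (1, 18, 512) input):

```python
import torch

# Stand-in for the diffusion model: any nn.Module whose forward pass
# can be turned into a scalar loss works with the training loop above.
model = torch.nn.Linear(512, 512)
optimizer = torch.optim.RMSprop(model.parameters(), lr=1e-4)

# One sanity-check training step on random data shaped like my input.
x = torch.randn(1, 18, 512)
optimizer.zero_grad()
loss = (model(x) - x).pow(2).mean()  # placeholder MSE loss
loss.backward()
optimizer.step()
```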